(A) nonlinear voice conversion method using gaussian mixture model가우시안 혼합 모델을 이용한 비선형 화자변환

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 407
  • Download : 0
Voice conversion is the technique for modifying the speech signal of a source speaker so that it sounds as if it had been uttered by another target speaker. There have been many methods being proposed for voice conversion, among which the linear transformation methods using the Gaussian Mixture Models (GMM) have been shown to outperform the others. In this thesis, we identify the problem of the GMM-based linear transformation methods, the over-smoothing effect of the converted speech, and propose a new GMM-based nonlinear transformation method using Radial Basis Function networks. Our system is implemented in the context of the Harmonic plus Noise Model to achieve high quality modification of speech. Our experiments show that our system succeeds in converting speech and our nonlinear method outperforms linear transformation methods for large number of mixture components.
Advisors
Oh, Yung-Hwanresearcher오영환researcher
Description
한국과학기술원 : 전산학전공,
Publisher
한국과학기술원
Issue Date
2006
Identifier
255584/325007  / 020044317
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학전공, 2006.2, [ vi, 39 p. ]

Keywords

speech coding; voice morphing; Voice conversion; nonlinear conversion; 가우시안 혼합 모델; 비선형 변환; 음성 코딩; 화자변환; Gaussian mixture model

URI
http://hdl.handle.net/10203/34707
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=255584&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0