Speech synthesis and speaker modification based on two-band speech model2대역 음성모델에 기반한 음성합성 및 화자변환

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 685
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorOh, Yung-Hwan-
dc.contributor.advisor오영환-
dc.contributor.authorKim, Eun-Kyoung-
dc.contributor.author김은경-
dc.date.accessioned2011-12-13T05:20:21Z-
dc.date.available2011-12-13T05:20:21Z-
dc.date.issued2003-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=181183&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/32836-
dc.description학위논문(박사) - 한국과학기술원 : 전산학전공, 2003.2, [ ix, 82 p. ]-
dc.description.abstractSpeech analysis/synthesis is a technique for analyzing speech signal, converting it to suitable parameters, modifying and resynthesizing speech signal from them, and it is essential for high-quality speech synthesis, speech coding, and seaker modification.For a speech analysis/synthesis, a lot of speech models based on a speech production mechanism have been proposed, and they represent speech signal by several meaningful sets of model parameters. Two-band speech model that is a simplified form of a harmonic/stochastic (H/S) model assumes that voiced and unvoiced characteristics can be mixed in one speech frame and their regions are divided into two bands by the time-varying frequency. The voiced region (periodic part) that has strong periodic characteristics is generally modeled by a sum of sinusoids, whereas the unvoiced region (random part) that does not have periodic characteristics is modeled by a linear filtered signal of white Gaussian noises. The frequency dividing periodic part and random part is called as band-splitting frequency. Since an accurate separation of two parts is a key part of the two-band speech model, it is very important to determine the reasonable band-splitting frequency for the high-quality synthesized speech. In this thesis, a new score function for splitting periodic and random parts of two-band speech model is proposed and the algorithm determining the band-splitting frequency by choosing the value that maximizes the function is described. At first, the combined subband periodicity score (CSPS) function defined as a sum of a periodicity score of lower band spectrum and an non-periodicity score of upper band spectrum for an arbitrary frequency is computed by an autocorrelation function. Furthermore, a recurrence relation is derived for reducing the computational complexity of the CSPS function and a tracking technique for guaranteeing the continuity between neighboring frames is proposed. Experimental results have shown that the pr...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectspeech analysis-
dc.subjectspeech coding-
dc.subjectvoice conversion-
dc.subjectspeech synthesis-
dc.subjectspeech model-
dc.subject음성모델-
dc.subject음성분석-
dc.subject음성부호화-
dc.subject음성변환-
dc.subject음성합성-
dc.titleSpeech synthesis and speaker modification based on two-band speech model-
dc.title.alternative2대역 음성모델에 기반한 음성합성 및 화자변환-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN181183/325007-
dc.description.department한국과학기술원 : 전산학전공, -
dc.identifier.uid000975062-
dc.contributor.localauthorOh, Yung-Hwan-
dc.contributor.localauthor오영환-
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0