DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Oh, Yung-Hwan | - |
dc.contributor.advisor | 오영환 | - |
dc.contributor.author | Kim, Eun-Kyoung | - |
dc.contributor.author | 김은경 | - |
dc.date.accessioned | 2011-12-13T05:20:21Z | - |
dc.date.available | 2011-12-13T05:20:21Z | - |
dc.date.issued | 2003 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=181183&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/32836 | - |
dc.description | 학위논문(박사) - 한국과학기술원 : 전산학전공, 2003.2, [ ix, 82 p. ] | - |
dc.description.abstract | Speech analysis/synthesis is a technique for analyzing speech signal, converting it to suitable parameters, modifying and resynthesizing speech signal from them, and it is essential for high-quality speech synthesis, speech coding, and seaker modification.For a speech analysis/synthesis, a lot of speech models based on a speech production mechanism have been proposed, and they represent speech signal by several meaningful sets of model parameters. Two-band speech model that is a simplified form of a harmonic/stochastic (H/S) model assumes that voiced and unvoiced characteristics can be mixed in one speech frame and their regions are divided into two bands by the time-varying frequency. The voiced region (periodic part) that has strong periodic characteristics is generally modeled by a sum of sinusoids, whereas the unvoiced region (random part) that does not have periodic characteristics is modeled by a linear filtered signal of white Gaussian noises. The frequency dividing periodic part and random part is called as band-splitting frequency. Since an accurate separation of two parts is a key part of the two-band speech model, it is very important to determine the reasonable band-splitting frequency for the high-quality synthesized speech. In this thesis, a new score function for splitting periodic and random parts of two-band speech model is proposed and the algorithm determining the band-splitting frequency by choosing the value that maximizes the function is described. At first, the combined subband periodicity score (CSPS) function defined as a sum of a periodicity score of lower band spectrum and an non-periodicity score of upper band spectrum for an arbitrary frequency is computed by an autocorrelation function. Furthermore, a recurrence relation is derived for reducing the computational complexity of the CSPS function and a tracking technique for guaranteeing the continuity between neighboring frames is proposed. Experimental results have shown that the pr... | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | speech analysis | - |
dc.subject | speech coding | - |
dc.subject | voice conversion | - |
dc.subject | speech synthesis | - |
dc.subject | speech model | - |
dc.subject | 음성모델 | - |
dc.subject | 음성분석 | - |
dc.subject | 음성부호화 | - |
dc.subject | 음성변환 | - |
dc.subject | 음성합성 | - |
dc.title | Speech synthesis and speaker modification based on two-band speech model | - |
dc.title.alternative | 2대역 음성모델에 기반한 음성합성 및 화자변환 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 181183/325007 | - |
dc.description.department | 한국과학기술원 : 전산학전공, | - |
dc.identifier.uid | 000975062 | - |
dc.contributor.localauthor | Oh, Yung-Hwan | - |
dc.contributor.localauthor | 오영환 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.