DSpace at KOASAS: Speech synthesis and speaker modification based on two-band speech model

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Theses_Ph.D.(박사논문)

Speech synthesis and speaker modification based on two-band speech model2대역 음성모델에 기반한 음성합성 및 화자변환

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 685
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Oh, Yung-Hwan	-
dc.contributor.advisor	오영환	-
dc.contributor.author	Kim, Eun-Kyoung	-
dc.contributor.author	김은경	-
dc.date.accessioned	2011-12-13T05:20:21Z	-
dc.date.available	2011-12-13T05:20:21Z	-
dc.date.issued	2003	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=181183&flag=dissertation	-
dc.identifier.uri	http://hdl.handle.net/10203/32836	-
dc.description	학위논문(박사) - 한국과학기술원 : 전산학전공, 2003.2, [ ix, 82 p. ]	-
dc.description.abstract	Speech analysis/synthesis is a technique for analyzing speech signal, converting it to suitable parameters, modifying and resynthesizing speech signal from them, and it is essential for high-quality speech synthesis, speech coding, and seaker modification.For a speech analysis/synthesis, a lot of speech models based on a speech production mechanism have been proposed, and they represent speech signal by several meaningful sets of model parameters. Two-band speech model that is a simplified form of a harmonic/stochastic (H/S) model assumes that voiced and unvoiced characteristics can be mixed in one speech frame and their regions are divided into two bands by the time-varying frequency. The voiced region (periodic part) that has strong periodic characteristics is generally modeled by a sum of sinusoids, whereas the unvoiced region (random part) that does not have periodic characteristics is modeled by a linear filtered signal of white Gaussian noises. The frequency dividing periodic part and random part is called as band-splitting frequency. Since an accurate separation of two parts is a key part of the two-band speech model, it is very important to determine the reasonable band-splitting frequency for the high-quality synthesized speech. In this thesis, a new score function for splitting periodic and random parts of two-band speech model is proposed and the algorithm determining the band-splitting frequency by choosing the value that maximizes the function is described. At first, the combined subband periodicity score (CSPS) function defined as a sum of a periodicity score of lower band spectrum and an non-periodicity score of upper band spectrum for an arbitrary frequency is computed by an autocorrelation function. Furthermore, a recurrence relation is derived for reducing the computational complexity of the CSPS function and a tracking technique for guaranteeing the continuity between neighboring frames is proposed. Experimental results have shown that the pr...	eng
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.subject	speech analysis	-
dc.subject	speech coding	-
dc.subject	voice conversion	-
dc.subject	speech synthesis	-
dc.subject	speech model	-
dc.subject	음성모델	-
dc.subject	음성분석	-
dc.subject	음성부호화	-
dc.subject	음성변환	-
dc.subject	음성합성	-
dc.title	Speech synthesis and speaker modification based on two-band speech model	-
dc.title.alternative	2대역 음성모델에 기반한 음성합성 및 화자변환	-
dc.type	Thesis(Ph.D)	-
dc.identifier.CNRN	181183/325007	-
dc.description.department	한국과학기술원 : 전산학전공,	-
dc.identifier.uid	000975062	-
dc.contributor.localauthor	Oh, Yung-Hwan	-
dc.contributor.localauthor	오영환	-

Appears in Collection: CS-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Speech synthesis and speaker modification based on two-band speech model2대역 음성모델에 기반한 음성합성 및 화자변환

KOASAS

Communities & Collections