Emotional singing voice synthesis by changing duration, vibrato and timbre

DC Field: Value
dc.contributor.advisor: Yoo, Chang-Dong
dc.contributor.advisor: 유창동
dc.contributor.author: Park, Youn-Sung
dc.contributor.author: 박윤성
dc.date.accessioned: 2011-12-28T02:18:45Z
dc.date.available: 2011-12-28T02:18:45Z
dc.date.issued: 2010
dc.identifier.uri: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=455132&flag=dissertation
dc.identifier.uri: http://hdl.handle.net/10203/54248
dc.description: Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : Interdisciplinary Program of Robotics, 2010.08, [vi, 33 p.]
dc.description.abstract: In this thesis, a novel emotional singing voice synthesis system is considered. There have been various approaches to expressing emotion between humans and machines or robots, such as varying a robot's facial expression, actions and synthesized speech. Although singing is known to be an effective way of expressing emotion, there has been no research on using singing to express emotion. To synthesize a singing voice with emotion, a statistical parametric synthesis system is used. The system uses a singing database, composed of various melodies sung neutrally with a restricted set of words, together with hidden semi-Markov models (HSMMs) of notes ranging from G3 to E5, to construct statistical information. The procedure of the statistical parametric synthesis system consists of two main parts: training and synthesis. In the training part, spectrum and excitation parameters are extracted from the singing database, and the statistical information of the spectrum and excitation parameters is constructed for each note. Three steps are taken in the synthesis part: (1) pitch and duration are determined according to the notes indicated by the musical score; (2) features are sampled from the appropriate HSMMs, with the duration set to its maximum-probability value; (3) the singing voice is synthesized by the mel-log spectrum approximation (MLSA) filter, using the sampled features as the filter parameters. The emotion of a synthesized song is controlled by varying the duration, the vibrato parameters and the timbre according to Thayer's mood model, which defines emotions along tension and energy axes. A perception test is performed to evaluate the synthesized songs. The results show that the algorithm can control the expressed emotion of a singing voice given a neutral singing database.
dc.language: eng
dc.publisher: 한국과학기술원 (KAIST)
dc.subject: Vibrato model
dc.subject: Emotion expression
dc.subject: Statistical singing voice synthesis
dc.subject: Timbre conversion filter
dc.subject: 음색 변조 필터 (timbre conversion filter)
dc.subject: 비브라토 모델 (vibrato model)
dc.subject: 감정 표현 (emotion expression)
dc.subject: 통계학적 노래합성 (statistical singing voice synthesis)
dc.title: Emotional singing voice synthesis by changing duration, vibrato and timbre
dc.title.alternative: 음 길이, 비브라토 그리고 음색의 변화를 이용한 감정 노래 합성
dc.type: Thesis (Master)
dc.identifier.CNRN: 455132/325007
dc.description.department: Korea Advanced Institute of Science and Technology (KAIST) : Interdisciplinary Program of Robotics
dc.identifier.uid: 020084053
dc.contributor.localauthor: Yoo, Chang-Dong
dc.contributor.localauthor: 유창동
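
The abstract above outlines a three-step HSMM-based synthesis procedure and an emotion control that varies duration, vibrato and timbre along Thayer's tension/energy axes. Below is a minimal, self-contained Python sketch of that flow, not the thesis code: NoteModel, emotion_params, the frame shift, the sample rate and every numeric mapping are hypothetical placeholders, the HSMM sampling is faked with Gaussians, and a plain sinusoid stands in for the MLSA filter.

import numpy as np

SR = 16000          # assumed sample rate (not stated in the abstract)
FRAME_S = 0.005     # assumed 5 ms frame shift

class NoteModel:
    """Hypothetical stand-in for one trained per-note HSMM (G3..E5).
    The real models hold state-duration and output distributions over
    spectrum and excitation features; here both are faked with Gaussians."""
    def __init__(self, f0_hz):
        self.f0_hz = f0_hz

    def sample_features(self, n_frames, rng):
        # Synthesis step 2: sample features with the duration already fixed
        f0 = np.full(n_frames, self.f0_hz)
        mcep = rng.normal(0.0, 0.1, size=(n_frames, 25))  # fake mel-cepstrum
        return f0, mcep

def note_to_hz(midi):
    return 440.0 * 2.0 ** ((midi - 69) / 12.0)

def emotion_params(tension, energy):
    """Map a point on Thayer's tension/energy plane to control values.
    The numeric mapping is invented for illustration; the thesis derives
    its own settings for duration, vibrato and timbre."""
    return {
        "dur_scale": 1.0 / (1.0 + 0.3 * energy),   # more energy -> shorter notes
        "vib_depth_cents": 30.0 + 40.0 * energy,   # deeper vibrato
        "vib_rate_hz": 5.5 + 1.0 * tension,        # slightly faster vibrato
        "brightness": 0.5 * tension,               # timbre tilt
    }

def apply_vibrato(f0, depth_cents, rate_hz):
    # Sinusoidal pitch modulation in cents around the note's base f0
    t = np.arange(len(f0)) * FRAME_S
    return f0 * 2.0 ** (depth_cents * np.sin(2 * np.pi * rate_hz * t) / 1200.0)

def synthesize_note(midi, beats, tempo_bpm, emo, rng):
    # Step 1: pitch and duration from the score, duration scaled by emotion
    dur_s = beats * 60.0 / tempo_bpm * emo["dur_scale"]
    n_frames = max(1, int(round(dur_s / FRAME_S)))

    # Step 2: sample spectrum/excitation features from the note's "HSMM"
    f0, mcep = NoteModel(note_to_hz(midi)).sample_features(n_frames, rng)
    f0 = apply_vibrato(f0, emo["vib_depth_cents"], emo["vib_rate_hz"])
    # Timbre control: tilt the (fake) spectral envelope toward brightness;
    # in the thesis a timbre conversion filter plays this role
    mcep += emo["brightness"] * np.linspace(-0.1, 0.1, mcep.shape[1])

    # Step 3: the thesis drives an MLSA filter with f0 + mcep; as a crude
    # stand-in, render a sinusoid at the vibrato-modulated f0 (mcep unused)
    f0_per_sample = np.repeat(f0, int(FRAME_S * SR))
    phase = 2.0 * np.pi * np.cumsum(f0_per_sample) / SR
    return 0.3 * np.sin(phase)

rng = np.random.default_rng(0)
emo = emotion_params(tension=0.2, energy=0.9)    # e.g. a high-energy corner
score = [(67, 1.0), (69, 1.0), (71, 2.0)]        # (MIDI note, beats): G4-A4-B4
audio = np.concatenate([synthesize_note(m, b, 120.0, emo, rng) for m, b in score])

The sketch only shows where each control enters the pipeline (duration scaling at step 1, vibrato depth/rate on the sampled f0, a timbre tilt on the spectral envelope); the real system would pass the sampled spectrum and excitation features to the MLSA filter rather than a sinusoid renderer.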
Appears in Collection
RE-Theses_Master (석사논문, Master's theses)
Files in This Item
There are no files associated with this item.
