Tree-based modeling of prosody for Korean TTS systems한국어 TTS 시스템을 위한 운율의 트리 기반 모델링

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 723
  • Download : 0
To help listeners understand speech, speakers use prosody, which concerns the suprasegmental aspects of spoken language and carries information that is not readily expressed in the literal meaning of the words nor in their syntactic relations. As text-to-speech systems are being incorporated into more and more various applications like e-mail reader and language education system, human users```` desire for a higher quality system is increasing. However, while the current technology makes it possible to obtain a system whose intelligibility is quite high, the lack of natural prosody is the major source of barriers to meeting the users```` expectation. Therefore, it is the one with the greatest need for improvement. The goal in this thesis is to develop a computational model of Korean prosody that improves the naturalness of synthetic speech. We model four prosodic components which are phrasing, loudness, duration, and speech intonation. Then a prosody generation model is incorporated into our Korean text-to-speech system. Our work on prosody modeling can be described by the following theoretical and experimental contribution. First, we suggest a novel Korean prosody structure from an engineering viewpoint to get a more tractable computational model of Korean prosody. Although various theories on Korean prosody structure have been devised, they are rather complicated and difficult to be embodied in a prosody generation module. Thus, we modify the conventional theories and show the appropriateness of the proposed structure in developing a prosody generator. Second, by taking a tree-based framework for prosody modeling, we scientifically discover the linguistic information saliently affecting Korean prosody and draw up rules for the syntax-to-prosody relationship. Since the tree-based framework gives the high comprehensibility in the prediction process, we are able to identify the underlying rules that control prosody by interpreting the trees. Third, we apply boo...
Advisors
Oh, Yung-Hwanresearcher오영환researcher
Description
한국과학기술원 : 전산학전공,
Publisher
한국과학기술원
Issue Date
2000
Identifier
157675/325007 / 000955269
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전산학전공, 2000.2, [ xiv, 143 p. ]

Keywords

tree-based modeling; prosody; Korean TTS system; CART; 결정 회귀 트리; 트리 기반 모델링; 운율; 한국어 TTS 시스템

URI
http://hdl.handle.net/10203/33160
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=157675&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0