DSpace at KOASAS: Tree-based modeling of prosodic phrasing and segmental duration for Korean TTS systems


Tree-based modeling of prosodic phrasing and segmental duration for Korean TTS systems


Authors: Lee, S / Oh, Yung-Hwan

This study describes tree-based modeling of prosodic phrasing, pause duration between phrases, and segmental duration for Korean TTS systems. We collected 400 sentences from various genres and built a corresponding speech corpus uttered by a professional female announcer. The phonemic and prosodic boundaries were manually marked on the recorded speech, and morphological analysis, grapheme-to-phoneme conversion, and syntactic analysis were performed on the text. A decision tree and regression trees were trained on 240 sentences (approximately 20 min of speech) and tested on 160 sentences (approximately 13 min). Features for modeling prosody are proposed, and their effectiveness is measured by interpreting the resulting trees. On the test set, the misclassification rate of the decision tree was 14.46%, and the RMSEs of the regression trees that predict pause duration and segmental duration were 132 ms and 22 ms, respectively. To understand how our approach performs at run time in a TTS system, we trained and tested trees with the output of our text analyzer. The misclassification rate and the RMSE were 18.49% and 134 ms, respectively, for prosodic phrasing and pause duration on the test set. (C) 1999 Elsevier Science B.V. All rights reserved.
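The two evaluation measures quoted in the abstract, misclassification rate for the phrasing decision tree and RMSE for the duration regression trees, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the break labels ("none", "minor", "major"), and the toy values are all hypothetical and serve only to show how such figures are typically computed.

```python
import math


def misclassification_rate(true_labels, predicted_labels):
    """Fraction of items whose predicted class differs from the reference class."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")
    if not true_labels:
        raise ValueError("label sequences must be non-empty")
    errors = sum(1 for t, p in zip(true_labels, predicted_labels) if t != p)
    return errors / len(true_labels)


def rmse(true_values, predicted_values):
    """Root mean squared error between reference and predicted values."""
    if len(true_values) != len(predicted_values):
        raise ValueError("value sequences must have the same length")
    if not true_values:
        raise ValueError("value sequences must be non-empty")
    squared = [(t - p) ** 2 for t, p in zip(true_values, predicted_values)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical toy data, not taken from the paper's corpus.
# Break labels stand in for whatever phrase-boundary classes a system uses.
ref_breaks = ["none", "minor", "major", "none", "minor"]
hyp_breaks = ["none", "minor", "minor", "none", "major"]

# Hypothetical pause durations in milliseconds.
ref_pause_ms = [0.0, 120.0, 400.0, 250.0]
hyp_pause_ms = [30.0, 80.0, 400.0, 250.0]

phrasing_error = misclassification_rate(ref_breaks, hyp_breaks)  # 2 of 5 wrong
pause_rmse_ms = rmse(ref_pause_ms, hyp_pause_ms)

print(f"misclassification rate: {phrasing_error:.2%}")
print(f"pause RMSE: {pause_rmse_ms:.1f} ms")
```

Because RMSE is expressed in the same unit as the predicted quantity, the abstract's 132 ms (pause duration) and 22 ms (segmental duration) can be read directly as typical error magnitudes.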

Publisher: ELSEVIER SCIENCE BV

Issue Date: 1999-08

Language: English

Article Type: Article

Keywords: SPEECH; TEXT

Citation: SPEECH COMMUNICATION, v.28, no.4, pp.283 - 300

ISSN: 0167-6393

URI: http://hdl.handle.net/10203/77323

Appears in Collection: CS-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.
