Label Embedding for Chinese Grapheme-to-Phoneme Conversion

Choi, Eunbi / Kim, Hwa-Yeon / Kim, Jong-Hwan / Kim, Jae-Min

Chinese grapheme-to-phoneme (G2P) conversion plays a significant role in text-to-speech systems by generating the pronunciations of Chinese input characters. The main challenge in Chinese G2P conversion is polyphone disambiguation, which requires selecting the appropriate pronunciation among several candidates. In polyphone disambiguation, calculating probabilities over the entire set of pronunciations is unnecessary, since each Chinese character has only a few (mostly two or three) candidate pronunciations. In this study, we introduce a label embedding approach that matches the character embedding with the closest label embedding among the possible candidates. Specifically, negative sampling and triplet loss are applied to maximize the difference between the correct embedding and the other candidate embeddings. Experimental results show that the label embedding approach improves polyphone disambiguation accuracy by 4.50% and 1.74% on two datasets compared to the one-hot label classification approach. Moreover, the bidirectional long short-term memory model with the label embedding approach outperforms the previous state-of-the-art model, BERT, in polyphone disambiguation. Lastly, we discuss the effect of contextual information in character embeddings on the G2P conversion task.
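
The idea in the abstract, choosing the closest label embedding among a character's few candidate pronunciations and training with a triplet loss against a sampled negative candidate, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The Euclidean distance, the margin value, the uniform sampling of one negative, the function names, and the toy two-dimensional embeddings for the polyphone 行 are all assumptions made for illustration; the abstract does not specify them.

```python
import math
import random


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def predict(char_vec, candidates, label_emb):
    """Return the candidate pronunciation whose label embedding is closest."""
    return min(candidates, key=lambda p: euclidean(char_vec, label_emb[p]))


def triplet_loss(char_vec, correct, candidates, label_emb, margin=1.0, rng=random):
    """Hinge triplet loss with one negative sampled from the other candidates."""
    negatives = [p for p in candidates if p != correct]
    negative = rng.choice(negatives)  # negative sampling (uniform, assumed)
    d_pos = euclidean(char_vec, label_emb[correct])
    d_neg = euclidean(char_vec, label_emb[negative])
    return max(0.0, d_pos - d_neg + margin)


# Hypothetical toy embeddings for two readings of the polyphone 行.
label_emb = {"xing2": [1.0, 0.0], "hang2": [0.0, 1.0]}
candidates = ["xing2", "hang2"]
char_vec = [0.9, 0.2]  # stand-in for a contextual character embedding

prediction = predict(char_vec, candidates, label_emb)
loss = triplet_loss(char_vec, "xing2", candidates, label_emb, margin=1.0)
print(prediction, round(loss, 4))
```

Because only the candidates of the given character are compared, no score is computed over the full pronunciation inventory, which is the point the abstract makes about one-hot classification being unnecessary here.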

Publisher: ISCA

Issue Date: 2021-08

Language: English

Citation: 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021, pp. 3196-3200

ISSN: 2308-457X

DOI: 10.21437/interspeech.2021-885

URI: http://hdl.handle.net/10203/312359

Appears in Collection: RIMS Conference Papers

Files in This Item: There are no files associated with this item.
