DSpace at KOASAS: Utterance verfifcation using phone-level log-likelihood ratio patterns in word spotting systems

DSpace at KOASAS

College of Engineering(공과대학)KAIST-ICC School of Engineering-Theses_Master(공학부 석사논문)

Utterance verfifcation using phone-level log-likelihood ratio patterns in word spotting systems

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 948
Download : 0

Export

Kim, Chong-Hyon / 김정현

This thesis proposes an improved method to verify an utterance that results from a word spotting system. A baseline word spotting system is implemented. The word spotting task in this thesis is to detect keywords from phone conversational database and according to the detected keywords, categorize speech data. To meet the systems specific goal and by analysis of target phone conversational speech, we build a multi-speaker dependent word spotting system. The system is based on HMMs and garbage models are used to model non-keyword intervals. These systems performance strongly rely on garbage models modeling non-keyword intervals. Even with accurate modeling of keyword and non-keyword intervals, these systems result in low performance. In order to improve performance of these systems, we use a two-pass structure which consists of a word spotting system and an utterance verification system. Using utterance verification for word spotting, the conventional LRT based method which uses simple mean of PLLRs to obtain confidence measures for each word has problems due to inaccurate keyword boundary information in recognition results and unclear pronunciation of words in continuous speech. So, in this thesis, we propose a method to use pattern of PLLRs in each keyword. This pattern information is used to give different weights to each phone in the process of generating confidence measures for each keyword. This proposed method uses word specific information resulting in more discrimination between in-vocabulary and out-of-vocabulary words. We also introduce another similar conventional method which uses PLLR distribution information for comparison with the proposed method. Experiments are performed on speech data which consists of 500 phone conversations between customers and call center operators. Experimental results for utterance verification shows that, using proposed method, we could achieve performance improvement of 11.8% compared to a baseline LRT based meth...

Advisors: Kim, Hoi-Rin researcher; 김회린 researcher

Description: 한국정보통신대학교 : 공학부,

Publisher: 한국정보통신대학교

Issue Date: 2009

Identifier: 393080/225023 / 020074245

Language: eng

Description: 학위논문(석사) - 한국정보통신대학교 : 공학부, 2009.2, [ viii, 36 p. ]

Keywords: 발화검증; 핵심어 인식; Word spotting; Utterance verification

URI: http://hdl.handle.net/10203/55059

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=393080&flag=dissertation

Appears in Collection: School of Engineering-Theses_Master(공학부 석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Utterance verfifcation using phone-level log-likelihood ratio patterns in word spotting systems

KOASAS

Communities & Collections