DSpace at KOASAS: (A) study on speaker adaptation for a large vocabulary speech recognition system

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

(A) study on speaker adaptation for a large vocabulary speech recognition system대용량 단어 음성인식 시스템을 위한 화자적응에 관한 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 427
Download : 0

Export

Koo, Myoung-Wan / 구명완

The main ovjective of this dessertation is the development of a speaker adaptive speech recognition system which can yield the acceptable recognition rate even for speakers who have not provided enough speech to train the recognition system. This system consists of the baseline system and the speaker adaptation system which is made up of two stages:codebook adaptation and HMM parameter adaptation. First, we presented a speaker-dependent system based on HMM. This system has been used the baseline system for speaker adaptation. Second, wo proposed a modified Viterbi scoring algorithm to imorove the discriminability of phonetically similar words. The proposed algorithm weights the Viterbi scores of state which are considered to be perceptually important. When the candidate words were so similar that the phonetical difference between the top 1 and top 2 candidates was one phoneme, the modified Viterbi algorthm reduced the recognition error rate by about 19\% as compared to the conventional method. Third, we proposed a codebook adaptation scheme using a neurallyinspired LVQ whith highly descriminat ability. By the proposed scheme, the codebook was generated to have the descriminant feature rather than the minimum distortion for adaptation speech. From the adaptation speech. From the adaptation experiment, we found that the adaptation with LVQ codebook resulted in higher destortion error than that with conventional codebook but the recognition rate was better, and that LVQ2 codebook, in which K-means each codebook was used to initialize, yielded the best recognition rate. Fourth, we presented a modified corrective training algorithm as a method to improve the performance of HMM parameter adaptation. The observation probability parameters of HMM are re-estimated by this algorithm after performing the spectral mapping algorithm. From the experiment, we found that the performance of the speaker adaptation system was improved after adopting the modified CT algorithm, and...

Advisors: Un, Chong-Kwan researcher; 은종관 researcher

Description: 한국과학기술원 : 전기 및 전자공학과,

Publisher: 한국과학기술원

Issue Date: 1991

Identifier: 61728/325007 / 000835021

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 1991.8, [ xi, 125 p. ]

URI: http://hdl.handle.net/10203/36160

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=61728&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

(A) study on speaker adaptation for a large vocabulary speech recognition system대용량 단어 음성인식 시스템을 위한 화자적응에 관한 연구

KOASAS

Communities & Collections