(A) study on speaker adaptation for a large vocabulary speech recognition system = 대용량 단어 음성인식 시스템을 위한 화자적응에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 192
  • Download : 0
The main ovjective of this dessertation is the development of a speaker adaptive speech recognition system which can yield the acceptable recognition rate even for speakers who have not provided enough speech to train the recognition system. This system consists of the baseline system and the speaker adaptation system which is made up of two stages:codebook adaptation and HMM parameter adaptation. First, we presented a speaker-dependent system based on HMM. This system has been used the baseline system for speaker adaptation. Second, wo proposed a modified Viterbi scoring algorithm to imorove the discriminability of phonetically similar words. The proposed algorithm weights the Viterbi scores of state which are considered to be perceptually important. When the candidate words were so similar that the phonetical difference between the top 1 and top 2 candidates was one phoneme, the modified Viterbi algorthm reduced the recognition error rate by about 19\% as compared to the conventional method. Third, we proposed a codebook adaptation scheme using a neurallyinspired LVQ whith highly descriminat ability. By the proposed scheme, the codebook was generated to have the descriminant feature rather than the minimum distortion for adaptation speech. From the adaptation speech. From the adaptation experiment, we found that the adaptation with LVQ codebook resulted in higher destortion error than that with conventional codebook but the recognition rate was better, and that LVQ2 codebook, in which K-means each codebook was used to initialize, yielded the best recognition rate. Fourth, we presented a modified corrective training algorithm as a method to improve the performance of HMM parameter adaptation. The observation probability parameters of HMM are re-estimated by this algorithm after performing the spectral mapping algorithm. From the experiment, we found that the performance of the speaker adaptation system was improved after adopting the modified CT algorithm, and...
Advisors
Un, Chong-Kwanresearcher은종관researcher
Description
한국과학기술원 : 전기 및 전자공학과,
Publisher
한국과학기술원
Issue Date
1991
Identifier
61728/325007 / 000835021
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 1991.8, [ xi, 125 p. ]

URI
http://hdl.handle.net/10203/36160
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=61728&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0