(A) study on the use of statistical information in speech recognition based on HMMHMM을 이용한 음성인식에서 통계적 정보의 이용에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 385
  • Download : 0
The need for a speaker-independent large vocabulary speech recognition system has been grown due to its large application area. Although the phoneme-level HMM has been widely used as an efficient algorithm for large vocabulary, the performance of the phoneme-level HMM-based recognition system has to be improved more for practical use. For this reason, a new HMM parameter estimation algorithms are proposed to improve the recognition accuracy, and an efficient pre-classification algorithm is proposed to reduce recognition time. In order to estabilish a benchmark performance performance, a phoneme-level HMM-based recognition system is first implemented as a baseline system. And then, the performance of the baseline system is improved. In this work, we focus mainly on the continuous HMMs and the semi-Continuous HMMs to improve the recognition accuracy for isolated Korean words when only insufficient training data are available. First, in order to model temporal changes in spectra, we propose a modified HMM with nonparametric state duration probability and state duration-dependent observation probability to model state transitions and to have accurately temporal structures and timing informations. Our modeling assumption is essentially based on the fact that the temporal changes and the acoustic effects of timing differences in the spectra characterize the time-varying vocal tract, and consequently play an important role in human perception. To model transitions and state durations, and to consider the temporal structures more accurately, we use not only the transition probability, but also a set of state-duration probability combined with state duration-dependent observation probability. Second, the HMM-based speech recognition system uses a training algorithm, which adjusts parameters to obtain an approximation to the maximum-likehoodestimates(MLE) of HMM parameters. The MLE training algorithm does not attempt to maximize the recognition rate on the training data,...
Advisors
Un, Chong-Kwanresearcher은종관researcher
Description
한국과학기술원 : 전기 및 전자공학과,
Publisher
한국과학기술원
Issue Date
1996
Identifier
106514/325007 / 000855145
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 1996.2, [ vi, 120 p. ]

URI
http://hdl.handle.net/10203/36325
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=106514&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0