DSpace at KOASAS: (A) study on the use of statistical information in speech recognition based on HMM

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

(A) study on the use of statistical information in speech recognition based on HMMHMM을 이용한 음성인식에서 통계적 정보의 이용에 관한 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 385
Download : 0

Export

Park, Yong-Kyu / 박용규

The need for a speaker-independent large vocabulary speech recognition system has been grown due to its large application area. Although the phoneme-level HMM has been widely used as an efficient algorithm for large vocabulary, the performance of the phoneme-level HMM-based recognition system has to be improved more for practical use. For this reason, a new HMM parameter estimation algorithms are proposed to improve the recognition accuracy, and an efficient pre-classification algorithm is proposed to reduce recognition time. In order to estabilish a benchmark performance performance, a phoneme-level HMM-based recognition system is first implemented as a baseline system. And then, the performance of the baseline system is improved. In this work, we focus mainly on the continuous HMMs and the semi-Continuous HMMs to improve the recognition accuracy for isolated Korean words when only insufficient training data are available. First, in order to model temporal changes in spectra, we propose a modified HMM with nonparametric state duration probability and state duration-dependent observation probability to model state transitions and to have accurately temporal structures and timing informations. Our modeling assumption is essentially based on the fact that the temporal changes and the acoustic effects of timing differences in the spectra characterize the time-varying vocal tract, and consequently play an important role in human perception. To model transitions and state durations, and to consider the temporal structures more accurately, we use not only the transition probability, but also a set of state-duration probability combined with state duration-dependent observation probability. Second, the HMM-based speech recognition system uses a training algorithm, which adjusts parameters to obtain an approximation to the maximum-likehoodestimates(MLE) of HMM parameters. The MLE training algorithm does not attempt to maximize the recognition rate on the training data,...

Advisors: Un, Chong-Kwan researcher; 은종관 researcher

Description: 한국과학기술원 : 전기 및 전자공학과,

Publisher: 한국과학기술원

Issue Date: 1996

Identifier: 106514/325007 / 000855145

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 1996.2, [ vi, 120 p. ]

URI: http://hdl.handle.net/10203/36325

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=106514&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

(A) study on the use of statistical information in speech recognition based on HMMHMM을 이용한 음성인식에서 통계적 정보의 이용에 관한 연구

KOASAS

Communities & Collections