DSpace at KOASAS: Discriminative approaches for speech recognition based on continuous density HMM

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

Discriminative approaches for speech recognition based on continuous density HMM연속 밀도 HMM에 근거한 음성 인식에서의 분별적인 접근 방법

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 346
Download : 0

Export

Chung, Yong-Joo / 정용주

Hidden Markov Model (HMM) has become increasingly popular for speech recognition. Although it is true that HMM is good at modeling the stationary and sequential characteristics of speech signal, it has some drawbacks. One of the most frequently criticized aspects of HMM is its weak discrimination ability between competing classes. In this dissertation work, we present various methods to improve discrimination based on continuous density HMM. To evaluate the performance of the proposed methods, we use two sets of speech materials. One is speech for speaker-independent continuous speech recognition and the other is that for speaker-independent isolated word recognition. First, a discriminative modeling algorithm based on continuous density HMM has been studied. The proposed algorithm assigns different numbers of mixtures to each state of HMM by considering the acoustical variabilities. The variabilities are measured by the change of the entropy information when the number of mixtures is increased. In determining the number of mixtures, a competitive method which takes into account the information of different classes is employed. To obtain a more reliable segmentation information, the use of a training algorithm alternating the increment of the number of mixtures and the segmental k-means training is proposed. The proposed algorithm reduces the error rate considerably compared with a conventional HMM with a fixed number of mixtures in all states. Second, a new approach of using multilayer perceptrons (MLPs) in combination with HMMs is proposed. The MLP outputs are used as the state-dependent weightings of HMM likelihoods. MLP is trained for phoneme classification using the segmentation information which is obtained from the Viterbi alignment of HMM. Two independent MLPs for different parameter sets are trained with inputs of multiple context frames. The phoneme classification rate is considerably enhanced when their outputs are multiplied together. And, a relatio...

Advisors: Un, Chong-Kwan researcher; 은종관 researcher

Description: 한국과학기술원 : 전기및전자공학과,

Publisher: 한국과학기술원

Issue Date: 1995

Identifier: 101743/325007 / 000885449

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기및전자공학과, 1995.8, [ vi, 131 p. ]

Keywords: HMM; Speech Recognition; Discriminative Approach; 분별적 방법; HMM; 음성인식

URI: http://hdl.handle.net/10203/36300

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=101743&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Discriminative approaches for speech recognition based on continuous density HMM연속 밀도 HMM에 근거한 음성 인식에서의 분별적인 접근 방법

KOASAS

Communities & Collections