DSpace at KOASAS: Model based approach for robust speech recognition in noisy environments

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

Model based approach for robust speech recognition in noisy environments잡음환경에서의 음성인식을 위한 모델에 기반한 접근방식

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 359
Download : 0

Export

Kim, Do-Yeong / 김도영

Presently, the problem of noise robustness is one of the most important issues in speech recognition. In this dissertation work, we devote to solve noise robustness problems in speech recognition based on speech feature vector transform(data transform) and model parameter compensation (distribution transform). First, we presented novel data transformation algorithm which estimates clean speech feature vector from corrupted one. Nonlinear contamination procedure of speech signal in noisy environment was approximated to linear function based on Taylor series expansion. Additive noise was modeled as a Gaussian distribution and spectral tilt was assumed fixed unknown vector. In this case, additive noise mean and variance, and spectral tilt are called by environmental variables those are estimated iteratively in maximum likelihood sense. Different from previous method, we incorporated variance of additive noise into re-estimation procedure with which we had more rigorous solution for environmental variables. We called this algorithm model-based linear approximation(MLA) method. Although the MLA methods was originally devised to compensate speech feature vector without a priori knowledge about noisy environment, we could easily combine the MLA methods with a priori knowledge by Bayesian estimation method. Also, the MLA method was extended to multiple noise condition. Each noise source was assumed to have an independent Gaussian distribution, and mean and variance of each noise source were considered as environmental variables. Experimental results showed that performance of MLA is comparable to that of stereo-data-based data transform algorithm. It is worthy of note that stereo-data-based data transform algorithm resulted in poor performance when there is insufficient adaptation data, while MLA does not need any adaptation data. Comparison with other on-line algorithm was also conducted and it was observed that MLA outperformed other methods especially at low SNR co...

Advisors: Lee, Soo-Young researcher; 이수영 researcher

Description: 한국과학기술원 : 전기및전자공학과,

Publisher: 한국과학기술원

Issue Date: 1998

Identifier: 143475/325007 / 000935034

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기및전자공학과, 1998.8, [ vii, 113 p. ]

Keywords: Taylor series; Noise robust; Speech recognition; Lombard speech; 롬바드음성; 테일러시리즈; 잡음강인; 음성인식

URI: http://hdl.handle.net/10203/36455

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=143475&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Model based approach for robust speech recognition in noisy environments잡음환경에서의 음성인식을 위한 모델에 기반한 접근방식

KOASAS

Communities & Collections