Model-based approach for robust speech recognition in noisy environments

Noise robustness is currently one of the most important issues in speech recognition. This dissertation addresses noise robustness in speech recognition through speech feature vector transformation (data transform) and model parameter compensation (distribution transform). First, we presented a novel data transformation algorithm that estimates the clean speech feature vector from a corrupted one. The nonlinear contamination of the speech signal in a noisy environment was approximated by a linear function using a Taylor series expansion. Additive noise was modeled as a Gaussian distribution, and spectral tilt was assumed to be a fixed, unknown vector. The additive noise mean and variance and the spectral tilt are called environmental variables, and they were estimated iteratively in the maximum likelihood sense. Unlike previous methods, we incorporated the variance of the additive noise into the re-estimation procedure, which gave a more rigorous solution for the environmental variables. We called this algorithm the model-based linear approximation (MLA) method. Although the MLA method was originally devised to compensate speech feature vectors without a priori knowledge of the noisy environment, it could easily be combined with a priori knowledge through Bayesian estimation. The MLA method was also extended to multiple noise conditions: each noise source was assumed to have an independent Gaussian distribution, and the mean and variance of each noise source were treated as environmental variables. Experimental results showed that the performance of MLA is comparable to that of a stereo-data-based data transform algorithm. It is worth noting that the stereo-data-based data transform algorithm performed poorly when adaptation data was insufficient, whereas MLA does not need any adaptation data. A comparison with other on-line algorithms was also conducted, and MLA was observed to outperform the other methods, especially at low SNR co...
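The linearization step described in the abstract can be illustrated with a small sketch. It assumes the log-spectral environment model commonly used in Taylor-series noise compensation, y = x + q + log(1 + exp(n - x - q)), applied to a single scalar channel; the dissertation's exact feature domain and formulation are not given in this record and may differ. The function names `corrupt`, `linearize`, and `noisy_gaussian` are hypothetical, and the sketch covers only the first-order approximation, not the iterative maximum-likelihood re-estimation of the environmental variables.

```python
import math


def corrupt(x, n, q):
    """Noisy log-spectral value for clean speech x, additive noise n, tilt q.

    Assumed model (not taken from the thesis): y = x + q + log(1 + exp(n - x - q)).
    """
    return x + q + math.log1p(math.exp(n - x - q))


def linearize(x0, n0, q):
    """First-order Taylor expansion of corrupt() around (x0, n0), with q fixed.

    Returns (y0, a, b) such that y ~= y0 + a * (x - x0) + b * (n - n0).
    """
    y0 = corrupt(x0, n0, q)
    # dy/dn is a logistic function of the local noise-to-speech ratio.
    b = 1.0 / (1.0 + math.exp(-(n0 - x0 - q)))
    a = 1.0 - b  # dy/dx
    return y0, a, b


def noisy_gaussian(mu_x, var_x, mu_n, var_n, q):
    """Approximate mean and variance of y for independent Gaussian x and n.

    Uses the linearization at the means, so the noise variance enters
    the result through the squared slope b.
    """
    y0, a, b = linearize(mu_x, mu_n, q)
    return y0, a * a * var_x + b * b * var_n


if __name__ == "__main__":
    mean, var = noisy_gaussian(mu_x=2.0, var_x=1.0, mu_n=1.0, var_n=0.5, q=0.5)
    print(mean, var)
```

Under this assumed model the two slopes always sum to one, so at high SNR the noisy distribution stays close to the clean one, while at low SNR it is dominated by the noise mean and variance.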
Advisors
Lee, Soo-Young (이수영)
Description
Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical and Electronic Engineering,
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
1998
Identifier
143475/325007 / 000935034
Language
eng
Description

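Thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical and Electronic Engineering, 1998.8, [ vii, 113 p. ]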

Keywords

Taylor series; Noise robust; Speech recognition; Lombard speech

URI
http://hdl.handle.net/10203/36455
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=143475&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.