Acoustic feature compensation by class-based histogram equalization for robust speech recognition강인한 음성인식을 위한 클래스 기반 히스토그램 등화 기법에 의한 음향특징 보상 방법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 464
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Hoi-Rin-
dc.contributor.advisor김회린-
dc.contributor.authorSuh, Young-Joo-
dc.contributor.author서영주-
dc.date.accessioned2011-12-28T02:44:11Z-
dc.date.available2011-12-28T02:44:11Z-
dc.date.issued2006-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392695&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/54570-
dc.description학위논문(박사) - 한국정보통신대학교 : 공학부, 2006.8, [ viii, 124 p. ]-
dc.description.abstractIn this dissertation, we propose class-based histogram equalization (CHEQ) methods to compensate noisy acoustic features for robust speech recognition. The proposed methods aim at not only compensating for the acoustic mismatch between the training and test speech recognition environments, but also reducing the two fundamental limitations of conventional histogram equalization (HEQ). In contrast to conventional HEQ which uses global reference and test cumulative distribution functions (CDFs), the proposed methods utilize a number of class-specific reference and test CDFs, classify noisy test features into their corresponding classes, and equalize the features by using their class-specific reference and test distributions. According to the methods for utilizing class information, the proposed methods are classified into two forms: hard-CHEQ based on the vector quantization and soft-CHEQ based on the Gaussian mixture model (GMM). A class-tying technique is incorporated into both of the CHEQ methods to improve classification accuracy as well as to circumvent the sparse-data problem entailed in the class-based approach. Finally, CHEQ in combination with a minimum mean-square error log-spectral amplitude (MMSE-LSA) estimator is used for further performance improvement. Experiments on the Aurora-2 database confirmed the effectiveness of the CHEQ methods. The hard and soft-CHEQ methods produce overall average error reductions of 60.13% and 61.19% over the mel-frequency cepstral coefficient (MFCC)-based baseline features and those of 17.50% and 19.68% over conventional HEQ. In addition, the hard and soft-CHEQ methods with the class-tying technique provide additional improvement of 2.55% and 2.78% compared to the untied CHEQ methods, which result in overall average error reductions of 61.15% and 62.27% over MFCC-based baseline features and those of 19.61% and 21.92% over conventional HEQ, respectively. A combination of MMSE-LSA with CHEQ yields marginal performance impr...eng
dc.languageeng-
dc.publisher한국정보통신대학교-
dc.subjectFeature compensation-
dc.subjectClass-based histogram equalization-
dc.subjectAcoustic mismatch-
dc.subjectRobust speech recognition-
dc.subject강인한 음성인식-
dc.subject특징 보상-
dc.subject클래스 기반 히스토그램 등화-
dc.subject음향 불일치-
dc.titleAcoustic feature compensation by class-based histogram equalization for robust speech recognition-
dc.title.alternative강인한 음성인식을 위한 클래스 기반 히스토그램 등화 기법에 의한 음향특징 보상 방법-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN392695/225023-
dc.description.department한국정보통신대학교 : 공학부,-
dc.identifier.uid020025901-
dc.contributor.localauthorSuh, Young-Joo-
dc.contributor.localauthor서영주-
Appears in Collection
School of Engineering-Theses_Ph.D(공학부 박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0