On robust feature representation for speech recognition in adverse environments / 잡음 환경에서의 음성 인식을 위한 강인한 특징 표현

DC Field | Value | Language
dc.contributor.advisor | Lee, Soo-Young | -
dc.contributor.advisor | 이수영 | -
dc.contributor.author | Jung, Ho-Young | -
dc.contributor.author | 정호영 | -
dc.date.accessioned | 2011-12-14 | -
dc.date.available | 2011-12-14 | -
dc.date.issued | 1999 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=156186&flag=dissertation | -
dc.identifier.uri | http://hdl.handle.net/10203/35806 | -
dc.description | Doctoral thesis (Ph.D.) - Korea Advanced Institute of Science and Technology : Department of Electrical and Electronic Engineering, 1999.8, [ xi, 121 p. ] | -
dc.description.abstract | The problem of noise robustness is one of the most important issues in commercializing speech recognition systems. This dissertation details the development of a robust feature representation for speech recognition in adverse environments. The basic aim is to remove slowly varying noise and speaker-specific components by filtering the feature parameter sequence. While conventional high-pass approaches use a band-pass or a high-pass filter in the feature parameter domain, the proposed methods introduce the decorrelation principle to suppress noise components and to satisfy the observation-independence assumption of the hidden Markov model (HMM). This decorrelation principle is implemented as a temporal filter to provide an alternative to conventional filtering methods. First, according to the decorrelation principle, a novel filter design method for high-pass approaches was proposed. This decorrelation technique derived a well-structured high-pass filter, and Wiener filtering was added to suppress the artifacts introduced by an overlapped frame analysis. Thus, the resulting filter was implemented as a band-pass filter, which attenuates low modulation frequencies. The proposed frame decorrelation processing (FDP) effectively de-emphasized noise components, and confirmed the effect of high-pass approaches with a theoretical justification. In order to perform the FDP, the power spectrum of the feature sequence was first estimated, and the error bounds due to the feature analysis were extracted. Then, the FDP provided a band-pass filter using the obtained power spectrum and error bounds. The experimental results indicated that the FDP outperformed other methods for noisy speech recognition. Note that each HMM requires a sufficient number of states: since high-pass approaches attenuate stationary regions, this may be critical in a stationary-based recognizer. Compared to the delta feature with only transitional information, the FDP included both instantaneous and ... | eng
dc.language | eng | -
dc.publisher | Korea Advanced Institute of Science and Technology (한국과학기술원) | -
dc.subject | Frame decorrelation processing | -
dc.subject | Hidden Markov model | -
dc.subject | Noise-robust feature representation | -
dc.subject | Speech recognition | -
dc.subject | On-line blind channel normalization | -
dc.subject | 온라인 블라인드 채널 정규화 (On-line blind channel normalization) | -
dc.subject | 프레임 decorrelation 과정 (Frame decorrelation processing) | -
dc.subject | 히든 마르코프 모델 (Hidden Markov model) | -
dc.subject | 잡음에 강인한 특징 표현 (Noise-robust feature representation) | -
dc.subject | 음성 인식 (Speech recognition) | -
dc.title | On robust feature representation for speech recognition in adverse environments | -
dc.title.alternative | 잡음 환경에서의 음성 인식을 위한 강인한 특징 표현 (Robust feature representation for speech recognition in noisy environments) | -
dc.type | Thesis(Ph.D) | -
dc.identifier.CNRN | 156186/325007 | -
dc.description.department | Korea Advanced Institute of Science and Technology : Department of Electrical and Electronic Engineering | -
dc.identifier.uid | 000955361 | -
dc.contributor.localauthor | Lee, Soo-Young | -
dc.contributor.localauthor | 이수영 | -
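
The abstract describes decorrelation implemented as a temporal filter over the feature parameter sequence. The following is a minimal sketch of that general idea only, not the thesis's FDP: it assumes a single feature trajectory, uses a low-order linear-prediction (Yule-Walker) whitening filter as a stand-in for the decorrelating filter, and omits the Wiener stage and error bounds entirely. All function names, the synthetic AR(1) trajectory, and the filter order are hypothetical choices made for illustration.

```python
import numpy as np


def prediction_error_filter(x, order=2):
    """FIR taps [1, -a1, ..., -ap] that approximately whiten x (Yule-Walker fit)."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    n = len(x)
    # Biased autocorrelation estimates r[0..order].
    r = np.array([np.dot(x[: n - k], x[k:]) / n for k in range(order + 1)])
    # Toeplitz normal equations R a = r[1:].
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1:])
    return np.concatenate(([1.0], -a))


def decorrelate(x, h):
    """Apply the FIR filter along time, keeping only fully overlapped outputs."""
    return np.convolve(np.asarray(x, dtype=float), h, mode="valid")


def lag1_correlation(x):
    """Normalized correlation between consecutive frames of a trajectory."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    return float(np.dot(x[:-1], x[1:]) / np.dot(x, x))


def make_trajectory(n=2000, coeff=0.9, offset=5.0, seed=0):
    """Hypothetical test signal: a strongly correlated AR(1) trajectory plus a
    constant offset standing in for a slowly varying channel component."""
    rng = np.random.default_rng(seed)
    e = rng.standard_normal(n)
    x = np.empty(n)
    x[0] = e[0]
    for t in range(1, n):
        x[t] = coeff * x[t - 1] + e[t]
    return x + offset


if __name__ == "__main__":
    x = make_trajectory()
    h = prediction_error_filter(x, order=2)
    y = decorrelate(x, h)
    print("lag-1 correlation before:", lag1_correlation(x))
    print("lag-1 correlation after: ", lag1_correlation(y))
    print("filter gain at zero modulation frequency:", abs(h.sum()))
```

In this sketch the whitening filter for a trajectory dominated by slow variation has a gain well below one at zero modulation frequency, which is one way to see why a decorrelating filter behaves like a high-pass filter on such data.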
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.
