Speech feature extraction in adverse condition by functional modeling of hearing
(Korean title: 청각 작용의 기능적 모형화에 의한 낯선 환경에서의 음성 특징 추출)

This dissertation presents several attempts to obtain invariant feature vectors for a consistent speech recognition system. Consistent speech recognition means that the system's recognition accuracy is high whether or not noise is present. A major barrier to consistent speech recognition is the mismatch between training and testing conditions. The mismatch occurs when the type or level of noise differs from that in the training phase, and it produces discrepancies among feature vectors for the same phonetic unit. This mismatch in feature vectors in turn degrades speech recognition performance. We address the mismatch problem by constructing consistent speech feature extraction methods. Three novel feature extraction methods are proposed in order to achieve consistent speech recognition. They are based on the mechanisms of human hearing and speech production. First, we propose a consistent feature extraction algorithm that employs a sub-pitch-based speech analysis method. Sub-pitch-based speech analysis derives from the speech production mechanism, in particular the glottal waveform of voiced sounds. The algorithm is also motivated by the human hearing mechanism, in which pitch information is used to segregate concurrent vowels. The proposed algorithm has advantages in extracting efficient spectral information from women's voices and in producing consistent feature vectors even when environmental mismatch occurs. These advantages arise because the sub-pitch-based analysis operates on a short segment that is shorter than the pitch period and has relatively high energy within that period. This short segment reduces the tendency to include pitch harmonics in the female spectrum, as well as the spectral information of environmental noise. Next, we propose a pitch-based speaker normalization method that uses classical perceptual knowledge to normalize an individual speaker's spect...
Advisors
Lee, Yong-Hoon (이용훈); Lee, Hwang-Soo (이황수)
Description
Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical and Electronic Engineering
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
2000
Identifier
157613/325007 / 000885271
Language
eng
Description

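Doctoral dissertation (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical and Electronic Engineering, 2000.2, [ v, 109 p. ]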

Keywords

hearing; consistent feature; feature extraction; noise robustness

URI
http://hdl.handle.net/10203/35819
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=157613&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.
