DSpace at KOASAS: Speech feature extraction in adverse condition by functional modeling of hearing

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Ph.D.(박사논문)

Speech feature extraction in adverse condition by functional modeling of hearing청각 작용의 기능적 모형화에 의한 낯선 환경에서의 음성 특징 추출

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 334
Download : 0

Export

Oh, Kwang-Cheol / 오광철

Several attempts have been made for invariant feature vectors for a consistent speech recognition system in this dissertation work. The consistent speech recognition means that the recognition accuracy of the system is high whether noise presents or not. A major barrier to the consistent speech recognition is the mismatch between training and testing conditions. The mismatch occurs when the noise type or its amount is different from that in the training phase and it produces discord among feature vectors for the same phonetic unit. In addition, the mismatch in feature vectors causes the degradation of the speech recognition performance. We devoted ourselves to solve the mismatch problem by constructing consistent speech feature extraction method. Three novel feature extraction methods are proposed in order to achieve consistent speech recognition. They are based on human hearing and production mechanism. In the first place, we propose a consistent feature extraction algorithm which employs a sub-pitch-based speech analysis method. The sub-pitch-based speech analysis arises from speech production mechanism, especially glottal waveform of voiced sound. The motive of this algorithm also effected by human hearing mechanism in which pitch information is used in segregation of concurrent vowels. The proposed feature extraction algorithm has advantages in extracting efficient spectral information for women````s voices and consistent feature vectors although environmental mismatch occurs. The reason for advantages is that the sub-pitch-based speech analysis method deals with a short duration, which is smaller than pitch period and has relatively high energy within the period. This short duration prevents the tendency of including pitch harmonics in female spectrum and the spectral information of environment noise. Next, we suggest a pitch-based speaker normalization method which utilizes classical perceptual knowledge in order to normalize individual speaker````s spect...

Advisors: Lee, Yong-Hoon researcher; Lee, Hwang-Soo researcher; 이용훈 researcher; 이황수 researcher

Description: 한국과학기술원 : 전기및전자공학과,

Publisher: 한국과학기술원

Issue Date: 2000

Identifier: 157613/325007 / 000885271

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전기및전자공학과, 2000.2, [ v, 109 p. ]

Keywords: hearing; consistent feature; 특징추출; 잡음에 대한 강인성; 청각; 일관된 특징; feature extraction; noise robust

URI: http://hdl.handle.net/10203/35819

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=157613&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Speech feature extraction in adverse condition by functional modeling of hearing청각 작용의 기능적 모형화에 의한 낯선 환경에서의 음성 특징 추출

KOASAS

Communities & Collections