DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Oh, Yung-Hwan | - |
dc.contributor.advisor | 오영환 | - |
dc.contributor.author | Chi, Sang-Mun | - |
dc.contributor.author | 지상문 | - |
dc.date.accessioned | 2011-12-13T05:24:36Z | - |
dc.date.available | 2011-12-13T05:24:36Z | - |
dc.date.issued | 1998 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=143502&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/33115 | - |
dc.description | 학위논문(박사) - 한국과학기술원 : 전산학과, 1998.8, [ 3, 93 p. ] | - |
dc.description.abstract | Although speech recognition systems in artificially constrained conditions have already reached high levels of performance, they degrade dramatically when they are applied in the real world, particularly in noisy environments. In noisy environments human speech productions are influenced by noises (Lombard effect) and speech signals are contaminated in ways which affect the performance of speech recognition systems. This thesis describes a Lombard effect compensation and noise suppression method of improving speech recognition performance in noisy environments. First of all, to become familiar with the characteristics of speech affected by the Lombard effect, several features of Lombard speech were investigated. We found that vocal intensity was varied by the Lombard effect, and that the Lombard effect had a statistically significant influence on energy below 500Hz low-band spectral tilt, and the center of gravity determining the spectral structure of speech. Since speech production variations due to the Lombard effect depends on the intensity of the Lombard effect, we formulated a quantitative measure of the Lombard effect level so as to model it more explicitly. Statistical tests were used to discover those features that represent the Lombard effect. A speech degradation model is proposed in order to characterize the distortions of speech in noise and under the Lombard effect. Variations in formant location, formant bandwidth, pitch, spectral tilt, and energy in each frequency band under the Lombard effect are represented by frequency warping and amplitude scaling of each frequency band. Another Lombard effect, the variation of vocal intensity is modeled by a multiplication term depending on the energy of the input speech. Noise contamination is represented by an additive term in the frequency domain. The distortions of noisy Lombard speech are then canceled out according to the speech degradation model. First, spectral subtraction is used to suppress the... | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Noise suppression | - |
dc.subject | Speech recongnition | - |
dc.subject | Lombard effect compensation | - |
dc.subject | 롬바드 효과의 보정 | - |
dc.subject | 잡음 제거 | - |
dc.subject | 음성인식 | - |
dc.title | Robust speech recognition under noisy environments using Lombard effect compensation and dynamic characteristic | - |
dc.title.alternative | 롬바드 효과의 보정과 동적특성을 이용한 잡음환경에 강인한 음성인식 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 143502/325007 | - |
dc.description.department | 한국과학기술원 : 전산학과, | - |
dc.identifier.uid | 000935359 | - |
dc.contributor.localauthor | Oh, Yung-Hwan | - |
dc.contributor.localauthor | 오영환 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.