DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Hahn, Min-Soo | - |
dc.contributor.advisor | 한민수 | - |
dc.contributor.author | Park, Tae-Sun | - |
dc.contributor.author | 박태선 | - |
dc.date.accessioned | 2011-12-30 | - |
dc.date.available | 2011-12-30 | - |
dc.date.issued | 2004 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392344&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/55260 | - |
dc.description | 학위논문(석사) - 한국정보통신대학교 : 공학부, 2004, [ vii, 48 p. ] | - |
dc.description.abstract | The two major factors affecting speaker identification performance are the degradations introduced by noisy communication channels and mismatch between the training and the testing data properties. During the last several years, Gaussian Mixture Models (GMMs) have become very popular in speaker identification systems and have proven to perform very well for clean wideband speech. However, in noisy environments or for noisy band-limited telephone speech, the performance degrades considerably. It is also well known that speaker’s voice always changes over time because of the varying factors such as verbal usage, vocal tract, mood, and health. In this paper, to cope with the mismatches, we proposed the use of prosodic features such as the mean pitch value in voiced intervals while the weighted filter bank analysis (WFBA) is adopted to increase the discriminating capability of mel frequency cepstral coefficients (MFCCs) for speaker identification. In addition, this thesis includes an exhaustive study on several environments and their combinations in order to produce the most robust speaker identification results. The DWFBA method shows 2.77%~4.65% error reduction rate, added pitch information utilization method produces 21.62%~45.39% error reduction rate and combined DWFBA and pitch information utilizing method produces 31.35%~45.39% error reduction rate comparing to the baseline Gaussian Mixture Model. | eng |
dc.language | eng | - |
dc.publisher | 한국정보통신대학교 | - |
dc.subject | GMM | - |
dc.subject | Weighted filter bank analysis | - |
dc.title | GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis | - |
dc.title.alternative | 피치 정보 및 DWFBA를 이용한 GMM 기반의 화자 식별 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 392344/225023 | - |
dc.description.department | 한국정보통신대학교 : 공학부, | - |
dc.identifier.uid | 020024049 | - |
dc.contributor.localauthor | Hahn, Min-Soo | - |
dc.contributor.localauthor | 한민수 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.