DSpace at KOASAS: GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis

DSpace at KOASAS

College of Engineering(공과대학)KAIST-ICC School of Engineering-Theses_Master(공학부 석사논문)

GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis피치 정보 및 DWFBA를 이용한 GMM 기반의 화자 식별

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 533
Download : 0

Export

Park, Tae-Sun / 박태선

The two major factors affecting speaker identification performance are the degradations introduced by noisy communication channels and mismatch between the training and the testing data properties. During the last several years, Gaussian Mixture Models (GMMs) have become very popular in speaker identification systems and have proven to perform very well for clean wideband speech. However, in noisy environments or for noisy band-limited telephone speech, the performance degrades considerably. It is also well known that speaker’s voice always changes over time because of the varying factors such as verbal usage, vocal tract, mood, and health. In this paper, to cope with the mismatches, we proposed the use of prosodic features such as the mean pitch value in voiced intervals while the weighted filter bank analysis (WFBA) is adopted to increase the discriminating capability of mel frequency cepstral coefficients (MFCCs) for speaker identification. In addition, this thesis includes an exhaustive study on several environments and their combinations in order to produce the most robust speaker identification results. The DWFBA method shows 2.77%~4.65% error reduction rate, added pitch information utilization method produces 21.62%~45.39% error reduction rate and combined DWFBA and pitch information utilizing method produces 31.35%~45.39% error reduction rate comparing to the baseline Gaussian Mixture Model.

Advisors: Hahn, Min-Soo researcher; 한민수 researcher

Description: 한국정보통신대학교 : 공학부,

Publisher: 한국정보통신대학교

Issue Date: 2004

Identifier: 392344/225023 / 020024049

Language: eng

Description: 학위논문(석사) - 한국정보통신대학교 : 공학부, 2004, [ vii, 48 p. ]

Keywords: GMM; Weighted filter bank analysis

URI: http://hdl.handle.net/10203/55260

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392344&flag=dissertation

Appears in Collection: School of Engineering-Theses_Master(공학부 석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis피치 정보 및 DWFBA를 이용한 GMM 기반의 화자 식별

KOASAS

Communities & Collections