DSpace at KOASAS: GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis

DSpace at KOASAS

College of Engineering(공과대학)KAIST-ICC School of Engineering-Theses_Master(공학부 석사논문)

GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis피치 정보 및 DWFBA를 이용한 GMM 기반의 화자 식별

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 532
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Hahn, Min-Soo	-
dc.contributor.advisor	한민수	-
dc.contributor.author	Park, Tae-Sun	-
dc.contributor.author	박태선	-
dc.date.accessioned	2011-12-30	-
dc.date.available	2011-12-30	-
dc.date.issued	2004	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392344&flag=dissertation	-
dc.identifier.uri	http://hdl.handle.net/10203/55260	-
dc.description	학위논문(석사) - 한국정보통신대학교 : 공학부, 2004, [ vii, 48 p. ]	-
dc.description.abstract	The two major factors affecting speaker identification performance are the degradations introduced by noisy communication channels and mismatch between the training and the testing data properties. During the last several years, Gaussian Mixture Models (GMMs) have become very popular in speaker identification systems and have proven to perform very well for clean wideband speech. However, in noisy environments or for noisy band-limited telephone speech, the performance degrades considerably. It is also well known that speaker’s voice always changes over time because of the varying factors such as verbal usage, vocal tract, mood, and health. In this paper, to cope with the mismatches, we proposed the use of prosodic features such as the mean pitch value in voiced intervals while the weighted filter bank analysis (WFBA) is adopted to increase the discriminating capability of mel frequency cepstral coefficients (MFCCs) for speaker identification. In addition, this thesis includes an exhaustive study on several environments and their combinations in order to produce the most robust speaker identification results. The DWFBA method shows 2.77%~4.65% error reduction rate, added pitch information utilization method produces 21.62%~45.39% error reduction rate and combined DWFBA and pitch information utilizing method produces 31.35%~45.39% error reduction rate comparing to the baseline Gaussian Mixture Model.	eng
dc.language	eng	-
dc.publisher	한국정보통신대학교	-
dc.subject	GMM	-
dc.subject	Weighted filter bank analysis	-
dc.title	GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis	-
dc.title.alternative	피치 정보 및 DWFBA를 이용한 GMM 기반의 화자 식별	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	392344/225023	-
dc.description.department	한국정보통신대학교 : 공학부,	-
dc.identifier.uid	020024049	-
dc.contributor.localauthor	Hahn, Min-Soo	-
dc.contributor.localauthor	한민수	-

Appears in Collection: School of Engineering-Theses_Master(공학부 석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

GMM based speaker identificaition utilizing pitch information and weighted filter bank analysis피치 정보 및 DWFBA를 이용한 GMM 기반의 화자 식별

KOASAS

Communities & Collections