DSpace at KOASAS: i-vector와 PLDA를 사용한 텍스트 독립 화자식별

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

i-vector와 PLDA를 사용한 텍스트 독립 화자식별Text-Independent Speaker Identification using i-vector and PLDA

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 2051
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	김회린	-
dc.contributor.advisor	Kim, Hoirin	-
dc.contributor.author	김윤회	-
dc.date.accessioned	2018-06-20T06:21:26Z	-
dc.date.available	2018-06-20T06:21:26Z	-
dc.date.issued	2017	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=675367&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/243261	-
dc.description	학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2017.2,[v, 49 p. :]	-
dc.description.abstract	Gaussian mixture model (GMM)-universal background model (UBM)으로부터 maximum a posteriori (MAP) 방법으로 적응된 화자의 GMM은 음성 특징 벡터들의 복잡한 분포를 확률적으로 잘 묘사하여 화자식별에 널리 쓰인다. 하지만 GMM-UBM의 복잡성에 비하여 주어진 음성 특징 벡터들의 수가 부족할 경우 화자의 GMM이 충분히 훈련되지 못해 화자식별력 저하를 야기한다. 본 학위논문에서는 GMM mean supervector 공간에서 factor analysis를 통해 얻은 작은 차원의 i-vector를 화자식별에 도입하여 화자식별 성능을 개선하고자 한다. 또한 i-vector는 화자 변동성뿐만 아니라 세션 변동성도 포함하기 때문에 같은 화자의 i-vector라도 세션에 따라 다른 값을 보여 준다. 본 학위논문에서는 세션 보상을 위해 probabilistic linear discriminant analysis (PLDA)를 사용한다. 전화환경에서 i-vector의 방향만을 고려하는 코사인 유사도 값으로 화자식별을 수행한 결과 MAP 방법으로 적응된 GMM을 사용한 화자식별에 비해 46.7%의 상대적 오류율 감소를 보여주었다. 추가적으로 PLDA 기반의 식별을 수행한 결과 코사인 유사도 기반의 식별에 비해 35.1%의 상대적 오류율 감소를 보여주었다.	-
dc.language	kor	-
dc.publisher	한국과학기술원	-
dc.subject	GMM	-
dc.subject	i-vector	-
dc.subject	화자식별	-
dc.subject	factor analysis	-
dc.subject	PLDA	-
dc.subject	speaker identification	-
dc.title	i-vector와 PLDA를 사용한 텍스트 독립 화자식별	-
dc.title.alternative	Text-Independent Speaker Identification using i-vector and PLDA	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :전기및전자공학부,	-
dc.contributor.alternativeauthor	Kim, Yoon-Hoe	-

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

i-vector와 PLDA를 사용한 텍스트 독립 화자식별Text-Independent Speaker Identification using i-vector and PLDA

KOASAS

Communities & Collections