DSpace at KOASAS: Synthetic speech detection using self-supervised learning and variational information bottleneck

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

Synthetic speech detection using self-supervised learning and variational information bottleneck자기지도 학습과 변분 정보 병목을 활용한 합성음성 탐지에 관한 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 228
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Kim, Hoirin	-
dc.contributor.advisor	김회린	-
dc.contributor.author	Eom, Youngsik	-
dc.date.accessioned	2023-06-26T19:33:54Z	-
dc.date.available	2023-06-26T19:33:54Z	-
dc.date.issued	2023	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1032903&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/309880	-
dc.description	학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2023.2,[iv, 38 p. :]	-
dc.description.abstract	Synthetic speech detection is a task that decides whether an input speech is from human or synthetic speech system such as a text-to-speech (TTS) or voice conversion (VC). Recent advances in sophisticated synthetic speech generated from TTS or VC systems cause threats to the existing automatic speaker verification (ASV) systems. Also, there is a possibility for abuse of human-like synthetic speech in crimes such as impersonation or fake news. Since such synthetic speech is generated from the diverse algorithms, generalization ability with using limited training data is indispensable for a robust synthetic speech detection system. In this thesis, we propose a self-supervised learning scheme based on the wav2vec 2.0 pretrained model with variational information bottleneck (VIB) for the synthetic speech detection task to improve the generalization ability. Evaluation on the ASVspoof 2019 logical access (LA) database shows that our method improves the performance of distinguishing unseen synthetic speech and genuine speech, outperforming current state-of-the-art synthetic speech detection systems. Furthermore, we show that the proposed system improves performance in low-resource and cross-dataset settings of the synthetic speech detection task, demonstrating that our system is also robust in terms of data size and data distribution. Finally, we also propose a Korean synthetic speech detection system using self-supervised learning by constructing corresponding dataset.	-
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.subject	Synthetic speech detection▼aVariational information bottleneck▼aSelf-supervised learning	-
dc.subject	합성음성 탐지▼a변분 정보 병목▼a자기지도 학습	-
dc.title	Synthetic speech detection using self-supervised learning and variational information bottleneck	-
dc.title.alternative	자기지도 학습과 변분 정보 병목을 활용한 합성음성 탐지에 관한 연구	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :전기및전자공학부,	-
dc.contributor.alternativeauthor	엄영식	-

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Synthetic speech detection using self-supervised learning and variational information bottleneck자기지도 학습과 변분 정보 병목을 활용한 합성음성 탐지에 관한 연구

KOASAS

Communities & Collections