Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition

Cited 7 times in Web of Science · Cited 0 times in Scopus
  • Hit : 251
  • Download : 0
DC Field / Value / Language
dc.contributor.author: GEONMIN, KIM [ko]
dc.contributor.author: Lee, Hwaran [ko]
dc.contributor.author: Kim, Bo-Kyeong [ko]
dc.contributor.author: Oh, Sang-Hoon [ko]
dc.contributor.author: Lee, Soo-Young [ko]
dc.date.accessioned: 2019-01-22T08:30:07Z
dc.date.available: 2019-01-22T08:30:07Z
dc.date.created: 2018-12-26
dc.date.issued: 2019-01
dc.identifier.citation: IEEE SIGNAL PROCESSING LETTERS, v.26, no.1, pp.159 - 163
dc.identifier.issn: 1070-9908
dc.identifier.uri: http://hdl.handle.net/10203/248985
dc.description.abstract: Many speech enhancement methods try to learn the relationship between noisy and clean speech, obtained using an acoustic room simulator. We point out several limitations of enhancement methods that rely on clean-speech targets; the goal of this letter is to propose an alternative learning algorithm, called acoustic and adversarial supervision (AAS). AAS trains the enhanced output both to maximize the likelihood of the transcription under a pre-trained acoustic model and to have the general characteristics of clean speech, which improves generalization to unseen noisy speech. We employ connectionist temporal classification and an unpaired conditional boundary equilibrium generative adversarial network as the loss functions of AAS. AAS is tested on two datasets with additive noise, without and with reverberation: Librispeech + DEMAND and CHiME-4. By visualizing the enhanced speech under different loss combinations, we demonstrate the role of each supervision. AAS achieves a lower word error rate than other state-of-the-art methods that use the clean-speech target on both datasets.
dc.language: English
dc.publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.title: Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition
dc.type: Article
dc.identifier.wosid: 000452619700002
dc.identifier.scopusid: 2-s2.0-85056343657
dc.type.rims: ART
dc.citation.volume: 26
dc.citation.issue: 1
dc.citation.beginningpage: 159
dc.citation.endingpage: 163
dc.citation.publicationname: IEEE SIGNAL PROCESSING LETTERS
dc.identifier.doi: 10.1109/LSP.2018.2880285
dc.contributor.localauthor: Lee, Soo-Young
dc.contributor.nonIdAuthor: Oh, Sang-Hoon
dc.description.isOpenAccess: N
dc.type.journalArticle: Article
dc.subject.keywordAuthor: Speech enhancement
dc.subject.keywordAuthor: room simulator
dc.subject.keywordAuthor: connectionist temporal classification
dc.subject.keywordAuthor: generative adversarial network
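The abstract describes a two-term training objective: an acoustic-supervision term (CTC likelihood of the transcription under a fixed, pre-trained acoustic model) combined with an adversarial term (an unpaired conditional BEGAN discriminator loss). A minimal sketch of how such terms are typically combined follows; the function name, the scalar loss inputs, and the weighting scheme are illustrative assumptions, not details taken from the paper.

```python
# Hypothetical sketch of a combined acoustic + adversarial objective.
# The actual AAS losses are computed over CTC outputs and a BEGAN
# discriminator; here both terms are stand-in scalars.

def aas_loss(ctc_neg_log_likelihood: float,
             adversarial_loss: float,
             adv_weight: float = 0.1) -> float:
    """Total loss = acoustic supervision + weighted adversarial supervision.

    adv_weight balances matching the transcription (acoustic term)
    against producing speech with clean-speech characteristics
    (adversarial term); its value here is an assumed placeholder.
    """
    return ctc_neg_log_likelihood + adv_weight * adversarial_loss

# Example: CTC NLL of 2.0 plus adversarial loss 5.0 at weight 0.1
total = aas_loss(2.0, 5.0, adv_weight=0.1)  # → 2.5
```

In practice both terms would be differentiable tensors so the enhancer can be updated by backpropagating through the frozen acoustic model and the discriminator.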
Appears in Collection
EE-Journal Papers (journal papers)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
