Deep learning-based method for multiple sound source localization with high resolution and accuracy

Cited 34 times in Web of Science; cited 0 times in Scopus
  • Hits: 66
  • Downloads: 0
DC Field | Value | Language
dc.contributor.author | Lee, Soo Young | ko
dc.contributor.author | Chang, Jiho | ko
dc.contributor.author | Lee, Seungchul | ko
dc.date.accessioned | 2023-09-13T03:00:45Z | -
dc.date.available | 2023-09-13T03:00:45Z | -
dc.date.created | 2023-09-13 | -
dc.date.issued | 2021-12 | -
dc.identifier.citation | MECHANICAL SYSTEMS AND SIGNAL PROCESSING, v.161 | -
dc.identifier.issn | 0888-3270 | -
dc.identifier.uri | http://hdl.handle.net/10203/312544 | -
dc.description.abstract | Deep learning-based methods are attracting interest in sound source localization, showing promising results compared to conventional model-based approaches. These methods have mainly developed along two lines, grid-based and grid-free, but each carries inherent limitations: the sound sources must be assumed to lie on grid points, or the number of sources must be fixed in advance when the neural network architecture is designed. Breaking away from these limitations, we propose a deep learning approach that achieves multiple sound source localization with high resolution and accuracy, whether or not the sources lie on grid points. We first propose a target function that produces spatial source distribution maps representing the positions and strengths of multiple sources, even when the sources are placed off the grid points. With the proposed source map, multiple sound source localization becomes an image-to-image, pixel-level prediction task, and we design a fully convolutional neural network (FCN) with an encoder-decoder structure to estimate the positions and strengths of multiple sources precisely. Using a dataset generated with one to three monopole sources on a 2.68 x 2.68 m square plane, measured by a spiral array of 60 microphones at 1, 2, and 10 kHz, we evaluate the proposed model both quantitatively and qualitatively and show that it achieves highly precise localization regardless of frequency and the number of sources. We further verify that the model produces high-resolution source distribution maps from which the positions and strengths of the sources are accurately predicted. Finally, we compare the proposed model with several deconvolution methods and find that it significantly outperforms these model-based methods. | -
dc.language | English | -
dc.publisher | ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD | -
dc.title | Deep learning-based method for multiple sound source localization with high resolution and accuracy | -
dc.type | Article | -
dc.identifier.wosid | 000670074400008 | -
dc.identifier.scopusid | 2-s2.0-85105691197 | -
dc.type.rims | ART | -
dc.citation.volume | 161 | -
dc.citation.publicationname | MECHANICAL SYSTEMS AND SIGNAL PROCESSING | -
dc.identifier.doi | 10.1016/j.ymssp.2021.107959 | -
dc.contributor.localauthor | Lee, Seungchul | -
dc.contributor.nonIdAuthor | Lee, Soo Young | -
dc.contributor.nonIdAuthor | Chang, Jiho | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Sound source localization | -
dc.subject.keywordAuthor | High resolution source map | -
dc.subject.keywordAuthor | Deep learning | -
dc.subject.keywordAuthor | Convolutional neural network | -
dc.subject.keywordAuthor | Fully convolutional neural network | -
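
The abstract above rests on two ingredients: a target function that rasterizes off-grid source positions and strengths into a spatial source distribution map, and an encoder-decoder fully convolutional network (FCN) that predicts such maps as an image-to-image, pixel-level regression. The sketch below illustrates that idea only; it is not the authors' implementation. The Gaussian-shaped target kernel, the 64-pixel grid, the random stand-in input feature, and the layer widths are all assumptions made here for concreteness.

```python
# Minimal sketch (not the paper's code): build a Gaussian-blob target map for
# off-grid monopole sources and a toy encoder-decoder FCN that predicts it.
# Grid size, kernel width, input features, and layer widths are assumptions.
import numpy as np
import torch
import torch.nn as nn


def target_source_map(positions, strengths, plane=2.68, n_pix=64, sigma=0.05):
    """Rasterize off-grid (x, y) source positions and strengths into a map.

    positions : list of (x, y) in metres on a `plane` x `plane` square
    strengths : list of source strengths (same length as positions)
    Each source becomes a Gaussian blob whose peak equals its strength, so the
    map encodes both location and amplitude even between grid points.
    """
    axis = np.linspace(0.0, plane, n_pix)
    xx, yy = np.meshgrid(axis, axis)
    out = np.zeros((n_pix, n_pix), dtype=np.float32)
    for (sx, sy), amp in zip(positions, strengths):
        out += amp * np.exp(-((xx - sx) ** 2 + (yy - sy) ** 2) / (2.0 * sigma ** 2))
    return out


class EncoderDecoderFCN(nn.Module):
    """Toy encoder-decoder FCN: input feature map -> source distribution map."""

    def __init__(self, in_ch=1):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(in_ch, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 1, 4, stride=2, padding=1), nn.ReLU(),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))


if __name__ == "__main__":
    # One hypothetical off-grid source at (1.23 m, 0.77 m) with unit strength.
    y = torch.from_numpy(target_source_map([(1.23, 0.77)], [1.0]))[None, None]
    model = EncoderDecoderFCN()
    x = torch.rand_like(y)  # stand-in for a beamforming/array feature map
    loss = nn.functional.mse_loss(model(x), y)  # pixel-level regression objective
    print(y.shape, float(loss))
```

Framing the task this way is what removes the usual constraints: because the network regresses a continuous map rather than per-grid-point labels or a fixed-length list of coordinates, neither an on-grid assumption nor a predefined number of sources is baked into the architecture.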
Appears in Collection
ME-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.