Deep Learning-Enabled High-Resolution and Fast Sound Source Localization in Spherical Microphone Array System

Cited 12 time in webofscience Cited 0 time in scopus
  • Hit : 56
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLee, Soo Youngko
dc.contributor.authorChang, Jihoko
dc.contributor.authorLee, Seungchulko
dc.date.accessioned2023-09-13T01:01:01Z-
dc.date.available2023-09-13T01:01:01Z-
dc.date.created2023-09-13-
dc.date.created2023-09-13-
dc.date.created2023-09-13-
dc.date.issued2022-
dc.identifier.citationIEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, v.71-
dc.identifier.issn0018-9456-
dc.identifier.urihttp://hdl.handle.net/10203/312510-
dc.description.abstractWhile sound source localization (SSL) using a spherical microphone array system can be applied to obtain visual beam patterns of source distribution maps in a range of omnidirectional acoustic applications, the present challenges of the spherical measurement system on the valid frequency ranges and the spatial distortion as well as the grid-related limitations of data-driven SSL approaches raise the need to develop an appropriate method. Imbued by these challenges, this study proposes a deep learning (DL) approach to achieve the high-resolution performance of localizing multiple sound sources tailored for omnidirectional acoustic applications. First, we present a spherical target map representation that can panoramically pinpoint the position and strength information of multiple sound sources without any grid-related constraints. Then, a dual-branched spherical convolutional autoencoder is proposed to obtain high-resolution localization results from the conventional spherical beamforming maps while incorporating frequency-variant and distortion-invariant strategies to address the inherent challenges. We quantitatively and qualitatively assess our proposed method's localization capability for multiple sound sources and validate that the proposed method can achieve far more precise and computationally efficient results than the existing approaches. By extension, we newly present the experimental setup that can create omnidirectional acoustic scenarios for the multiple SSL. By evaluating our proposed method in this experimental setup, we demonstrate the effectiveness and applicability of the proposed method with the experimental data. Our study delivers the proposed approach's potential of being utilized in various SSL applications.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleDeep Learning-Enabled High-Resolution and Fast Sound Source Localization in Spherical Microphone Array System-
dc.typeArticle-
dc.identifier.wosid000783542600002-
dc.identifier.scopusid2-s2.0-85127019471-
dc.type.rimsART-
dc.citation.volume71-
dc.citation.publicationnameIEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT-
dc.identifier.doi10.1109/TIM.2022.3161693-
dc.contributor.localauthorLee, Seungchul-
dc.contributor.nonIdAuthorLee, Soo Young-
dc.contributor.nonIdAuthorChang, Jiho-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorLocation awareness-
dc.subject.keywordAuthorArray signal processing-
dc.subject.keywordAuthorMicrophone arrays-
dc.subject.keywordAuthorMicrowave integrated circuits-
dc.subject.keywordAuthorSpatial resolution-
dc.subject.keywordAuthorAcoustic distortion-
dc.subject.keywordAuthorGovernment-
dc.subject.keywordAuthorAcoustic beamforming-
dc.subject.keywordAuthordeep learning (DL)-
dc.subject.keywordAuthormultiple sound source localization (SSL)-
dc.subject.keywordAuthorreal-time acoustic measurement-
dc.subject.keywordAuthorspherical microphone array (SMA) system-
dc.subject.keywordPlusACOUSTIC SOURCE IDENTIFICATION-
dc.subject.keywordPlusSPEAKER LOCALIZATION-
dc.subject.keywordPlusDECONVOLUTION-
Appears in Collection
ME-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 12 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0