DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement

Lee, Dongheon / Choi, Jung-Woo

In this study, we propose a dense frequency-time attentive network (DeFT-AN) for multichannel speech enhancement. DeFT-AN is a mask estimation network that predicts a complex spectral masking pattern for suppressing the noise and reverberation embedded in the short-time Fourier transform (STFT) of an input signal. The proposed mask estimation network incorporates three different types of blocks for aggregating information in the spatial, spectral, and temporal dimensions. It utilizes a spectral transformer with a modified feed-forward network and a temporal conformer with sequential dilated convolutions. The use of dense blocks and transformers dedicated to the three different characteristics of audio signals enables more comprehensive enhancement in noisy and reverberant environments. The remarkable performance of DeFT-AN over state-of-the-art multichannel models is demonstrated based on two popular noisy and reverberant datasets in terms of various metrics for speech quality and intelligibility.
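
To make the idea of a complex spectral mask concrete, the following is a minimal NumPy sketch of how such a mask is applied to an STFT by element-wise complex multiplication. It is not the DeFT-AN network and is not taken from the paper: the function name `apply_complex_mask`, the toy array shapes, and the oracle mask computed from a known clean signal are all assumptions made for illustration. In a real system the mask would be predicted by the network rather than derived from the clean signal.

```python
import numpy as np


def apply_complex_mask(stft, mask):
    """Apply a complex mask to an STFT by element-wise multiplication.

    Hypothetical helper for illustration only. Both arrays are
    (frequency bins x time frames) and must have the same shape.
    """
    stft = np.asarray(stft, dtype=np.complex128)
    mask = np.asarray(mask, dtype=np.complex128)
    if stft.shape != mask.shape:
        raise ValueError("stft and mask must have the same shape")
    # A complex mask scales the magnitude and rotates the phase of each bin.
    return mask * stft


# Toy data (assumed values): 3 frequency bins x 2 time frames.
rng = np.random.default_rng(0)
clean = rng.standard_normal((3, 2)) + 1j * rng.standard_normal((3, 2))
noise = 0.5 * (rng.standard_normal((3, 2)) + 1j * rng.standard_normal((3, 2)))
noisy = clean + noise

# Oracle mask: only computable when the clean STFT is known.
# A mask estimation network would try to approximate something like this.
oracle_mask = clean / noisy
enhanced = apply_complex_mask(noisy, oracle_mask)
```

Because the mask is complex, it can correct phase as well as magnitude, which a real-valued magnitude mask cannot do. With the oracle mask above, `enhanced` recovers `clean` up to floating-point error.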

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Issue Date: 2023

Language: English

Article Type: Article

Citation: IEEE SIGNAL PROCESSING LETTERS, v.30, pp.155 - 159

ISSN: 1070-9908

DOI: 10.1109/LSP.2023.3244428

URI: http://hdl.handle.net/10203/305794

Appears in Collection: EE-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.
