Temporal Feedback Convolutional Recurrent Neural Networks for Speech Command Recognition

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 99
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Taejunko
dc.contributor.authorNam, Juhanko
dc.date.accessioned2022-12-07T12:00:27Z-
dc.date.available2022-12-07T12:00:27Z-
dc.date.created2022-12-02-
dc.date.created2022-12-02-
dc.date.created2022-12-02-
dc.date.issued2022-11-08-
dc.identifier.citation14th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2022, pp.437 - 441-
dc.identifier.issn2309-9402-
dc.identifier.urihttp://hdl.handle.net/10203/302018-
dc.description.abstractEnd-to-end learning models using raw waveforms as input have shown superior performances in many audio recognition tasks. However, most model architectures are based on convolutional neural networks (CNN) which were mainly developed for visual recognition tasks. In this paper, we propose an extension of squeeze-and-excitation networks (SENets) which adds temporal feedback control from the top-layer features to channel-wise feature activations in lower layers using a recurrent module. This is analogous to the adaptive gain control mechanism of outer hair-cell in the human auditory system. We apply the proposed model to speech command recognition and show that it slightly outperforms the SENets and other CNN-based models. We also investigate the details of the performance improvement by conducting failure analysis and visualizing the channel-wise feature scaling induced by the temporal feedback.-
dc.languageEnglish-
dc.publisherAsia-Pacific Signal and Information Processing Association (APSIPA)-
dc.titleTemporal Feedback Convolutional Recurrent Neural Networks for Speech Command Recognition-
dc.typeConference-
dc.identifier.wosid000922154500070-
dc.identifier.scopusid2-s2.0-85146264009-
dc.type.rimsCONF-
dc.citation.beginningpage437-
dc.citation.endingpage441-
dc.citation.publicationname14th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2022-
dc.identifier.conferencecountryTH-
dc.identifier.conferencelocationChiang Mai-
dc.identifier.doi10.23919/APSIPAASC55919.2022.9979907-
dc.contributor.localauthorNam, Juhan-
dc.contributor.nonIdAuthorKim, Taejun-
Appears in Collection
GCT-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0