Combining Multi-Scale Features Using Sample-level Deep Convolutional Neural Networks for Weakly Supervised Sound Event Detection

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 492
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLee, Jongpilko
dc.contributor.authorPark, Jiyoungko
dc.contributor.authorKum, Sangeunko
dc.contributor.authorJeong, Younghoko
dc.contributor.authorNam, Juhanko
dc.date.accessioned2018-02-21T04:13:42Z-
dc.date.available2018-02-21T04:13:42Z-
dc.date.created2017-12-19-
dc.date.issued2017-11-17-
dc.identifier.citationProceedings of the 2nd Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)-
dc.identifier.urihttp://hdl.handle.net/10203/239538-
dc.description.abstractThis paper describes our method submitted to large-scale weakly supervised sound event detection for smart cars in the DCASE Challenge 2017. It is based on two deep neural network methods suggested for music auto-tagging. One is training sample-level Deep Convolutional Neural Networks (DCNN) using raw waveforms as a feature extractor. The other is aggregating features on multi-scaled models of the DCNNs and making final predictions from them. With this approach, we achieved the best results, 47.3% in F-score on subtask A (audio tagging) and 0.75 in error rate on subtask B (sound event detection) in the evaluation. These results show that the waveform-based models can be comparable to spectrogram-based models when compared to other DCASE Task 4 submissions. Finally, we visualize hierarchically learned filters from the challenge dataset in each layer of the waveform-based model to explain how they discriminate the events.-
dc.languageEnglish-
dc.publisherIEEE-
dc.titleCombining Multi-Scale Features Using Sample-level Deep Convolutional Neural Networks for Weakly Supervised Sound Event Detection-
dc.typeConference-
dc.type.rimsCONF-
dc.citation.publicationnameProceedings of the 2nd Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)-
dc.identifier.conferencecountryGE-
dc.identifier.conferencelocationMunich-
dc.contributor.localauthorNam, Juhan-
dc.contributor.nonIdAuthorPark, Jiyoung-
dc.contributor.nonIdAuthorJeong, Youngho-
Appears in Collection
GCT-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0