DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, Taejun | ko |
dc.contributor.author | Lee, Jongpil | ko |
dc.contributor.author | Nam, Juhan | ko |
dc.date.accessioned | 2018-12-20T02:17:17Z | - |
dc.date.available | 2018-12-20T02:17:17Z | - |
dc.date.created | 2018-12-05 | - |
dc.date.created | 2018-12-05 | - |
dc.date.created | 2018-12-05 | - |
dc.date.issued | 2018-04-18 | - |
dc.identifier.citation | IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.366 - 370 | - |
dc.identifier.uri | http://hdl.handle.net/10203/247510 | - |
dc.description.abstract | Recent work has shown that the end-to-end approach using convolutional neural network (CNN) is effective in various types of machine learning tasks. For audio signals, the approach takes raw waveforms as input using an 1-D convolution layer. In this paper, we improve the 1-D CNN architecture for music auto-tagging by adopting building blocks from state-of-the-art image classification models, ResNets and SENets, and adding multi-level feature aggregation to it. We compare different combinations of the modules in building CNN architectures. The results show that they achieve significant improvements over previous state-of-the-art models on the MagnaTagATune dataset and comparable results on Million Song Dataset. Furthermore, we analyze and visualize our model to show how the 1-D CNN operates. | - |
dc.language | English | - |
dc.publisher | IEEE | - |
dc.title | Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms | - |
dc.type | Conference | - |
dc.identifier.wosid | 000446384600073 | - |
dc.identifier.scopusid | 2-s2.0-85054289741 | - |
dc.type.rims | CONF | - |
dc.citation.beginningpage | 366 | - |
dc.citation.endingpage | 370 | - |
dc.citation.publicationname | IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | - |
dc.identifier.conferencecountry | CN | - |
dc.identifier.conferencelocation | Calgary Telus Convention Center, Alberta | - |
dc.identifier.doi | 10.1109/ICASSP.2018.8462046 | - |
dc.contributor.localauthor | Nam, Juhan | - |
dc.contributor.nonIdAuthor | Kim, Taejun | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.