Raw Waveform-based Audio Classification Using Sample-level CNN Architectures

Music, speech, and acoustic scene sounds are often handled separately in the audio domain because of their different signal characteristics. However, as the image domain advances rapidly through versatile image classification models, it is worth studying similarly extensible classification models in the audio domain. In this study, we approach this problem using two types of sample-level deep convolutional neural networks that take raw waveforms as input and use filters with small granularity. One is a basic model that consists of convolution and pooling layers. The other is an improved model that additionally has residual connections, squeeze-and-excitation modules, and multi-level concatenation. We show that the sample-level models reach state-of-the-art performance levels for the three different categories of sound. We also visualize the filters along the layers and compare the characteristics of the learned filters.
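The abstract describes sample-level models built from small (size-3) filters applied directly to raw waveforms, with a squeeze-and-excitation gate in the improved variant. Below is a minimal NumPy sketch of one such forward pass, assuming a strided input layer, one conv+pool stage, and an SE module; the layer widths, input length, SE reduction ratio, and random weights are illustrative choices, not the paper's exact configuration, and the residual connections and multi-level concatenation are omitted for brevity.

```python
import numpy as np

def conv1d_relu(x, w, stride=1):
    """x: (c_in, length), w: (c_out, c_in, k) -> ReLU of a valid 1-D convolution."""
    c_out, c_in, k = w.shape
    out_len = (x.shape[1] - k) // stride + 1
    out = np.empty((c_out, out_len))
    for i in range(out_len):
        seg = x[:, i * stride : i * stride + k]
        out[:, i] = np.tensordot(w, seg, axes=([1, 2], [0, 1]))
    return np.maximum(out, 0.0)

def max_pool(x, size=3):
    """Non-overlapping max pooling over the time axis."""
    usable = x.shape[1] // size * size
    return x[:, :usable].reshape(x.shape[0], -1, size).max(axis=2)

def se_module(x, w1, w2):
    """Squeeze-and-excitation: global average over time, bottleneck MLP,
    then per-channel sigmoid gates rescaling the feature map."""
    z = x.mean(axis=1)                                  # squeeze: (c,)
    s = 1.0 / (1.0 + np.exp(-(w2 @ np.maximum(w1 @ z, 0.0))))
    return x * s[:, None]                               # excite: rescale channels

rng = np.random.default_rng(0)
wave = rng.standard_normal((1, 729))       # mono raw waveform, 3^6 samples (illustrative)

w_in = rng.standard_normal((16, 1, 3)) * 0.1   # strided sample-level input layer
w_mid = rng.standard_normal((32, 16, 3)) * 0.1
w1 = rng.standard_normal((4, 32)) * 0.1        # SE bottleneck (reduction ratio 8)
w2 = rng.standard_normal((32, 4)) * 0.1

h = conv1d_relu(wave, w_in, stride=3)          # (16, 243): filter size 3, stride 3
h = max_pool(conv1d_relu(h, w_mid), size=3)    # (32, 80): conv + pooling stage
h = se_module(h, w1, w2)                       # channel-wise gating
embedding = h.max(axis=1)                      # (32,) clip-level feature
```

In a full model this conv+pool+SE stage would be stacked until the time axis collapses, with the final feature vector fed to a classifier head.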
Publisher
Neural Information Processing Systems (NIPS)
Issue Date
2017-12-08
Language
English
Citation

Machine Learning for Audio Signal Processing Workshop, Neural Information Processing Systems (NIPS)

URI
http://hdl.handle.net/10203/238218
Appears in Collection
GCT-Conference Papers
