Prediction-segmentation tasks for self-supervision of anomaly detection networks under noisy conditions

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 339
  • Download : 0
For detecting anomalies from sounds generated by electronic devices, self-supervised learning of deep neural networks (DNNs) has been popularly employed. In self-supervised learning, a DNN model is trained over normal data to solve some pretext tasks, and test data giving reduced task performance are regarded as anomalies. Popular choices for the pretext task are the reconstruction and the classification tasks, where a model is trained to predict masked parts of the spectrogram and to classify the internal classes of normal data, respectively. However, the reconstruction task is hard to distinguish anomalies from noises in noisy conditions, and the classification task often fails to learn meaningful features when the diversity across internal classes is too small or too evident. We propose the combination of prediction and segmentation tasks to overcome these limitations. For the proposed tasks, two different machine sounds are mixed with a constant ratio, and a model is trained to predict the both the mixed spectrogram of future time and mixing ratio based on the present and past sound mixture. We train a WaveNet-based model using dual tasks simultaneously, which shows remarkable performance improvements over the conventional models and achieves state-of-the-art performance in the DCASE 2020 Task 2 dataset.
Publisher
International Congress and Exposition on Noise Control Engineering
Issue Date
2023-08-21
Language
English
Citation

52nd International Congress and Exposition on Noise Control Engineering, Inter-Noise 2023

URI
http://hdl.handle.net/10203/312272
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0