Melody extraction and detection through LSTM-RNN with harmonic sum loss

Cited 15 time in webofscience Cited 14 time in scopus
  • Hit : 134
  • Download : 0
This paper proposes a long short-term memory recurrent neural network (LSTM-RNN) for extracting melody and simultaneously detecting regions of melody from polyphonic audio using the proposed harmonic sum loss. The previous state-of-the-art algorithms have not been based on machine learning techniques and certainly not on deep architectures. The harmonics structure in melody is incorporated in the loss function to attain robustness against both octave mismatch and interference from background music. Experimental results show that the performance of the proposed method is better than or comparable to other state-of-the-art algorithms.
Publisher
IEEE Signal Processing Society
Issue Date
2017-05
Language
English
Citation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.2766 - 2770

ISSN
1520-6149
DOI
10.1109/ICASSP.2017.7952660
URI
http://hdl.handle.net/10203/276343
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 15 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0