DSpace at KOASAS: Disentangled Multidimensional Metric Learning for Music Similarity

DSpace at KOASAS

College of Liberal Arts and Convergence Science(인문사회융합과학대학)Graduate School of Culture Technology(문화기술대학원)GCT-Conference Papers(학술회의논문)

Disentangled Multidimensional Metric Learning for Music Similarity

Cited 11 time in

Cited 6 time in

Hit : 357
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	LEE, Jongpil	ko
dc.contributor.author	Bryan, Nicholas J.	ko
dc.contributor.author	Salamon, Justin	ko
dc.contributor.author	Jin, Zeyu	ko
dc.contributor.author	Nam, Juhan	ko
dc.date.accessioned	2020-06-11T01:20:41Z	-
dc.date.available	2020-06-11T01:20:41Z	-
dc.date.created	2020-06-09	-
dc.date.created	2020-06-09	-
dc.date.created	2020-06-09	-
dc.date.issued	2020-05-05	-
dc.identifier.citation	2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020, pp.6 - 10	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	http://hdl.handle.net/10203/274609	-
dc.description.abstract	Music similarity search is useful for a variety of creative tasks such as replacing one music recording with another recording with a similar "feel", a common task in video editing. For this task, it is typically necessary to define a similarity metric to compare one recording to another. Music similarity, however, is hard to define and depends on multiple simultaneous notions of similarity (i.e. genre, mood, instrument, tempo). While prior work ignore this issue, we embrace this idea and introduce the concept of multidimensional similarity and unify both global and specialized similarity metrics into a single, semantically disentangled multidimensional similarity metric. To do so, we adapt a variant of deep metric learning called conditional similarity networks to the audio domain and extend it using track-based information to control the specificity of our model. We evaluate our method and show that our single, multidimensional model outperforms both specialized similarity spaces and alternative baselines. We also run a user-study and show that our approach is favored by human annotators as well.	-
dc.language	English	-
dc.publisher	IEEE	-
dc.title	Disentangled Multidimensional Metric Learning for Music Similarity	-
dc.type	Conference	-
dc.identifier.wosid	000615970400002	-
dc.identifier.scopusid	2-s2.0-85089220679	-
dc.type.rims	CONF	-
dc.citation.beginningpage	6	-
dc.citation.endingpage	10	-
dc.citation.publicationname	2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020	-
dc.identifier.conferencecountry	SP	-
dc.identifier.conferencelocation	Barcelona	-
dc.identifier.doi	10.1109/ICASSP40776.2020.9053442	-
dc.contributor.localauthor	Nam, Juhan	-
dc.contributor.nonIdAuthor	Bryan, Nicholas J.	-
dc.contributor.nonIdAuthor	Salamon, Justin	-
dc.contributor.nonIdAuthor	Jin, Zeyu	-

Appears in Collection: GCT-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 11 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Disentangled Multidimensional Metric Learning for Music Similarity

This item is cited by other documents in WoS

KOASAS

Communities & Collections