DSpace at KOASAS: Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice

DSpace at KOASAS

College of Liberal Arts and Convergence Science(인문사회융합과학대학)Graduate School of Culture Technology(문화기술대학원)GCT-Conference Papers(학술회의논문)

Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice

Cited 0 time in webofscience

Cited 0 time in

Hit : 441
Download : 0

Export

Lee, Kyungyun / Nam, Juhan researcher

Previous approaches in singer identification have used one of monophonic vocal tracks or mixed tracks containing multiple instruments, leaving a semantic gap between these two domains of audio. In this paper, we present a system to learn a joint embedding space of monophonic and mixed tracks for singing voice. We use a metric learning method, which ensures that tracks from both domains of the same singer are mapped closer to each other than those of different singers. We train the system on a large synthetic dataset generated by music mashup to reflect real-world music recordings. Our approach opens up new possibilities for cross-domain tasks, e.g., given a monophonic track of a singer as a query, retrieving mixed tracks sung by the same singer from the database. Also, it requires no additional vocal enhancement steps such as source separation. We show the effectiveness of our system for singer identification and query-by-singer in both the in-domain and cross-domain tasks.

Publisher: International Society for Music Information Retrieval Conference (ISMIR)

Issue Date: 2019-11-04

Language: English

Citation: The 20th International Society for Music Information Retrieval Conference (ISMIR), pp.295 - 302

URI: http://hdl.handle.net/10203/269878

Appears in Collection: GCT-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice

KOASAS

Communities & Collections