DSpace at KOASAS: Class-Aware Contrastive Learning for Improving Performance of Sound Source Localization Model in Videos


Class-Aware Contrastive Learning for Improving Performance of Sound Source Localization Model in Videos (비디오 내 음원 위치 추정 모델의 성능 향상을 위한 클래스 인지 대조 학습 기법 제안)


DC Field	Value	Language
dc.contributor.author	선주형	ko
dc.contributor.author	김재윤	ko
dc.contributor.author	김주영	ko
dc.contributor.author	이영주	ko
dc.contributor.author	한혜경	ko
dc.contributor.author	윤성의	ko
dc.date.accessioned	2023-12-01T10:00:11Z	-
dc.date.available	2023-12-01T10:00:11Z	-
dc.date.created	2023-12-01	-
dc.date.issued	2023-11	-
dc.identifier.citation	KIISE Transactions on Computing Practices (정보과학회 컴퓨팅의 실제 논문지), v.29, no.11, pp.518 - 524	-
dc.identifier.issn	2383-6318	-
dc.identifier.uri	http://hdl.handle.net/10203/315632	-
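dc.description.abstract	Training neural network models to localize sound sources in video is an important area of image-audio multi-modal research. Recent work proposes supervising sound source localization models with contrastive learning, an approach that assumes different videos depict objects of different classes. In practice, however, typical training datasets contain multiple videos of the same object. In existing training procedures, such videos can appear together in a training batch and give the model incorrect supervision. To correct this problem, this paper proposes an accurate contrastive learning method in which the sound source localization model predicts the object classes in the videos in advance and rearranges the data accordingly. The proposed method improves the performance of existing sound source localization models without additional labels. Performance evaluation experiments from the sound source localization literature support this claim.	-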
dc.language	Korean	-
dc.publisher	Korean Institute of Information Scientists and Engineers (한국정보과학회)	-
dc.title	비디오 내 음원 위치 추정 모델의 성능 향상을 위한 클래스 인지 대조 학습 기법 제안	-
dc.title.alternative	Class-Aware Contrastive Learning for Improving Performance of Sound Source Localization Model in Videos	-
dc.type	Article	-
dc.type.rims	ART	-
dc.citation.volume	29	-
dc.citation.issue	11	-
dc.citation.beginningpage	518	-
dc.citation.endingpage	524	-
dc.citation.publicationname	KIISE Transactions on Computing Practices (정보과학회 컴퓨팅의 실제 논문지)	-
dc.identifier.kciid	ART003015248	-
dc.contributor.localauthor	윤성의	-
dc.contributor.nonIdAuthor	선주형	-
dc.contributor.nonIdAuthor	김재윤	-
dc.contributor.nonIdAuthor	김주영	-
dc.contributor.nonIdAuthor	이영주	-
dc.contributor.nonIdAuthor	한혜경	-
dc.description.isOpenAccess	N	-
dc.subject.keywordAuthor	심층 학습	-
dc.subject.keywordAuthor	멀티 모달 학습	-
dc.subject.keywordAuthor	음원 위치 추정	-
dc.subject.keywordAuthor	대조 학습	-
dc.subject.keywordAuthor	deep learning	-
dc.subject.keywordAuthor	multi-modal learning	-
dc.subject.keywordAuthor	sound source localization	-
dc.subject.keywordAuthor	contrastive learning	-
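
The abstract describes predicting object classes in advance and rearranging the data so that videos of the same class do not supervise each other incorrectly within a batch. This record does not include the paper's actual procedure, so the following is only a rough sketch of one possible reading: assuming integer pseudo-labels have already been predicted for each video, it groups sample indices into batches that hold at most one video per predicted class. The function name `class_disjoint_batches` and its parameters are hypothetical and do not come from the paper.

```python
from collections import defaultdict, deque


def class_disjoint_batches(pseudo_labels, batch_size):
    """Group sample indices so each batch has at most one sample per predicted class.

    pseudo_labels: list of predicted class ids, one per video (assumed given).
    batch_size: maximum number of samples per batch.
    Illustrative sketch only; not the procedure from the paper.
    """
    # Bucket sample indices by their predicted class.
    buckets = defaultdict(deque)
    for idx, label in enumerate(pseudo_labels):
        buckets[label].append(idx)

    batches = []
    while any(buckets.values()):
        # Visit classes with the most remaining samples first so that
        # large classes are spread across as many batches as possible.
        remaining = [c for c in buckets if buckets[c]]
        remaining.sort(key=lambda c: -len(buckets[c]))
        # Take one sample from each of the first `batch_size` classes.
        batch = [buckets[c].popleft() for c in remaining[:batch_size]]
        batches.append(batch)
    return batches


if __name__ == "__main__":
    labels = [0, 0, 1, 2, 1, 0]  # hypothetical predicted classes
    print(class_disjoint_batches(labels, batch_size=2))
```

With batches built this way, every other video in a batch has a different predicted class, so treating it as a negative pair is consistent with the contrastive learning assumption stated in the abstract. Batches can be smaller than `batch_size` when fewer distinct classes remain.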

Appears in Collection: CS-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.


Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
