Self-Supervised Visual Representation Learning via Residual Momentum

DC Field | Value | Language
dc.contributor.author | Pham, Trung Xuan | ko
dc.contributor.author | Niu, Axi | ko
dc.contributor.author | Zhang, Kang | ko
dc.contributor.author | Jin, Tee Joshua Tian | ko
dc.contributor.author | Hong, Ji Woo | ko
dc.contributor.author | Yoo, Chang-Dong | ko
dc.date.accessioned | 2023-11-27T02:01:54Z | -
dc.date.available | 2023-11-27T02:01:54Z | -
dc.date.created | 2023-11-25 | -
dc.date.issued | 2023 | -
dc.identifier.citation | IEEE ACCESS, v.11, pp.116706 - 116720 | -
dc.identifier.issn | 2169-3536 | -
dc.identifier.uri | http://hdl.handle.net/10203/315212 | -
dc.description.abstract | Self-supervised learning (SSL) has emerged as a promising approach for learning representations from unlabeled data. Among the many SSL methods proposed in recent years, momentum-based contrastive frameworks such as MoCo-v3 have shown remarkable success. However, in these frameworks a significant representation gap exists between the online encoder (student) and the momentum encoder (teacher), limiting performance on downstream tasks. We identify this gap as a bottleneck often overlooked in existing frameworks and propose 'residual momentum', which explicitly reduces the gap during training to encourage the student to learn representations closer to the teacher's. We also show that knowledge distillation (KD), a related technique that reduces the distribution gap with a cross-entropy-based loss in supervised learning, is ineffective in the SSL context, and demonstrate that the intra-representation gap measured by cosine similarity is crucial for EMA-based SSL methods. Extensive experiments on different benchmark datasets and architectures demonstrate the superiority of our method over state-of-the-art contrastive learning baselines. Specifically, our method outperforms MoCo-v3 by 0.7% top-1 on ImageNet, by 2.82% on CIFAR-100, and by 1.8% AP and 3.0% AP75 on VOC detection with COCO pre-training; it also improves DenseCL by 0.5% AP (800 epochs) and 0.6% AP75 (1600 epochs). Our work highlights the importance of reducing the teacher-student intra-gap in momentum-based contrastive learning frameworks and provides a practical solution for improving the quality of learned representations. | -
dc.language | English | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.title | Self-Supervised Visual Representation Learning via Residual Momentum | -
dc.type | Article | -
dc.identifier.wosid | 001121769800001 | -
dc.identifier.scopusid | 2-s2.0-85174832711 | -
dc.type.rims | ART | -
dc.citation.volume | 11 | -
dc.citation.beginningpage | 116706 | -
dc.citation.endingpage | 116720 | -
dc.citation.publicationname | IEEE ACCESS | -
dc.identifier.doi | 10.1109/access.2023.3325842 | -
dc.contributor.localauthor | Yoo, Chang-Dong | -
dc.contributor.nonIdAuthor | Pham, Trung Xuan | -
dc.contributor.nonIdAuthor | Niu, Axi | -
dc.contributor.nonIdAuthor | Jin, Tee Joshua Tian | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Contrastive learning | -
dc.subject.keywordAuthor | residual momentum | -
dc.subject.keywordAuthor | representation learning | -
dc.subject.keywordAuthor | self-supervised learning | -
dc.subject.keywordAuthor | knowledge distillation | -
dc.subject.keywordAuthor | teacher-student gap | -
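
The abstract above describes the core idea: on top of a MoCo-v3-style contrastive objective, add a "residual momentum" term that explicitly shrinks the cosine-similarity gap between the online (student) encoder and its EMA momentum (teacher) encoder. The following PyTorch sketch reconstructs that idea from the abstract alone; it is not the authors' implementation, and the toy encoder, the loss weight gap_weight, and helpers such as update_teacher and residual_momentum_loss are illustrative assumptions.

```python
# Minimal sketch of a MoCo-v3-style step with an added teacher-student gap term,
# assembled from the abstract's description (not the authors' released code).
import copy
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyEncoder(nn.Module):
    """Stand-in backbone + projection head; any encoder could be used here."""
    def __init__(self, dim_in=32, dim_out=16):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim_in, 64), nn.ReLU(), nn.Linear(64, dim_out))
    def forward(self, x):
        return self.net(x)

student = ToyEncoder()
teacher = copy.deepcopy(student)          # momentum (EMA) encoder
for p in teacher.parameters():
    p.requires_grad_(False)

@torch.no_grad()
def update_teacher(m=0.99):
    """EMA update of the teacher from the student, as in momentum-based methods."""
    for ps, pt in zip(student.parameters(), teacher.parameters()):
        pt.mul_(m).add_(ps, alpha=1.0 - m)

def info_nce(q, k, temperature=0.2):
    """Standard InfoNCE loss: positives are matching indices within the batch."""
    q, k = F.normalize(q, dim=1), F.normalize(k, dim=1)
    logits = q @ k.t() / temperature
    labels = torch.arange(q.size(0))
    return F.cross_entropy(logits, labels)

def residual_momentum_loss(q, k):
    """Gap term: pull the student's representation toward the teacher's on the
    same view, using 1 - cosine similarity as in the abstract's description."""
    return 1.0 - F.cosine_similarity(q, k.detach(), dim=1).mean()

# One illustrative training step on random "two-view" data.
opt = torch.optim.SGD(student.parameters(), lr=0.05)
view1, view2 = torch.randn(8, 32), torch.randn(8, 32)

q1, q2 = student(view1), student(view2)
with torch.no_grad():
    k1, k2 = teacher(view1), teacher(view2)

gap_weight = 1.0  # assumed hyperparameter; the paper would tune this per setting
loss = info_nce(q1, k2) + info_nce(q2, k1) \
       + gap_weight * 0.5 * (residual_momentum_loss(q1, k1) + residual_momentum_loss(q2, k2))

opt.zero_grad()
loss.backward()
opt.step()
update_teacher()
```

In a realistic setup the encoders would be full backbones with projection and prediction heads, and the gap weight would be tuned per benchmark; this sketch only shows where such a gap-reduction term would enter the training step.
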
Appears in Collection
EE-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.
