DSpace at KOASAS: CATs plus plus : Boosting Cost Aggregation With Convolutions and Transformers

CATs plus plus : Boosting Cost Aggregation With Convolutions and Transformers

DC Field	Value	Language
dc.contributor.author	Cho, Seokju	ko
dc.contributor.author	Hong, Sunghwan	ko
dc.contributor.author	Kim, Seungryong	ko
dc.date.accessioned	2024-08-16T02:00:08Z	-
dc.date.available	2024-08-16T02:00:08Z	-
dc.date.created	2024-08-16	-
dc.date.issued	2023-06	-
dc.identifier.citation	IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, v.45, no.6, pp.7174 - 7194	-
dc.identifier.issn	0162-8828	-
dc.identifier.uri	http://hdl.handle.net/10203/322308	-
dc.description.abstract	Cost aggregation is a process in image matching tasks that aims to disambiguate noisy matching scores. Existing approaches generally tackle this with hand-crafted or CNN-based methods, which either lack robustness to severe deformations or inherit the limitations of CNNs, which fail to discriminate incorrect matches due to limited receptive fields and inadaptability. In this paper, we introduce Cost Aggregation with Transformers (CATs) to tackle this by exploring global consensus among the initial correlation maps with the help of some architectural designs that allow us to benefit from the global receptive fields of the self-attention mechanism. To this end, we include appearance affinity modeling, which helps to disambiguate the noisy initial correlation maps. Furthermore, we introduce some techniques, including multi-level aggregation to exploit rich semantics prevalent at different feature levels and swapping self-attention to obtain reciprocal matching scores to act as a regularization. Although CATs can attain competitive performance, it may face some limitations, namely high computational costs, which may restrict its applicability to limited resolutions and hurt performance. To overcome this, we propose CATs++, an extension of CATs. Concretely, we introduce early convolutions prior to cost aggregation with a transformer to control the number of tokens and inject some convolutional inductive bias, then propose a novel transformer architecture for both efficient and effective cost aggregation, which results in an apparent performance boost and cost reduction. With the reduced costs, we are able to compose our network with a hierarchical structure to process higher-resolution inputs. We show that the proposed method, with these components integrated, outperforms the previous state-of-the-art methods by large margins.	-
dc.language	English	-
dc.publisher	IEEE COMPUTER SOC	-
dc.title	CATs plus plus : Boosting Cost Aggregation With Convolutions and Transformers	-
dc.type	Article	-
dc.identifier.wosid	000982475600039	-
dc.identifier.scopusid	2-s2.0-85141643700	-
dc.type.rims	ART	-
dc.citation.volume	45	-
dc.citation.issue	6	-
dc.citation.beginningpage	7174	-
dc.citation.endingpage	7194	-
dc.citation.publicationname	IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE	-
dc.identifier.doi	10.1109/TPAMI.2022.3218727	-
dc.contributor.localauthor	Kim, Seungryong	-
dc.contributor.nonIdAuthor	Cho, Seokju	-
dc.contributor.nonIdAuthor	Hong, Sunghwan	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Costs	-
dc.subject.keywordAuthor	Transformers	-
dc.subject.keywordAuthor	Correlation	-
dc.subject.keywordAuthor	Semantics	-
dc.subject.keywordAuthor	Feature extraction	-
dc.subject.keywordAuthor	Task analysis	-
dc.subject.keywordAuthor	Computer architecture	-
dc.subject.keywordAuthor	Cost aggregation	-
dc.subject.keywordAuthor	efficient transformer	-
dc.subject.keywordAuthor	semantic visual correspondence	-

Appears in Collection: AI-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

