DSpace at KOASAS: Counterfactual Mix-Up for Visual Question Answering

Counterfactual Mix-Up for Visual Question Answering

DC Field	Value	Language
dc.contributor.author	Cho, Jae Won	ko
dc.contributor.author	Kim, Dong-Jin	ko
dc.contributor.author	Jung, Yunjae	ko
dc.contributor.author	Kweon, In So	ko
dc.date.accessioned	2023-10-04T09:00:29Z	-
dc.date.available	2023-10-04T09:00:29Z	-
dc.date.created	2023-10-04	-
dc.date.issued	2023	-
dc.identifier.citation	IEEE ACCESS, v.11, pp.95201 - 95212	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	http://hdl.handle.net/10203/312984	-
dc.description.abstract	Counterfactuals have been shown to be a powerful tool for alleviating unimodal bias in Visual Question Answering. However, existing counterfactual methods tend to generate samples that lack diversity or require auxiliary models to synthesize additional data. To address this, we propose a simpler and more diverse counterfactual sample synthesis method called Counterfactual Mix-Up (CoMiU), which generates counterfactual image features and questions through batch-wise swapping at the local object and word level. This method efficiently generates more abundant and diverse counterfactual samples, which help improve the robustness of Visual Question Answering models. Moreover, building on these diverse counterfactual samples, we introduce two more robust and stable contrastive loss functions, namely Batch-Contrastive loss and Answer-Contrastive loss. We test our method on various challenging Visual Question Answering robustness testing setups to show its advantages over current state-of-the-art methods.	-
dc.language	English	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	Counterfactual Mix-Up for Visual Question Answering	-
dc.type	Article	-
dc.identifier.wosid	001064488700001	-
dc.identifier.scopusid	2-s2.0-85167799118	-
dc.type.rims	ART	-
dc.citation.volume	11	-
dc.citation.beginningpage	95201	-
dc.citation.endingpage	95212	-
dc.citation.publicationname	IEEE ACCESS	-
dc.identifier.doi	10.1109/ACCESS.2023.3303891	-
dc.contributor.localauthor	Kweon, In So	-
dc.contributor.nonIdAuthor	Kim, Dong-Jin	-
dc.contributor.nonIdAuthor	Jung, Yunjae	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Computer vision	-
dc.subject.keywordAuthor	counterfactuals	-
dc.subject.keywordAuthor	visual question answering	-
dc.subject.keywordAuthor	unimodal bias	-

Appears in Collection: EE-Journal Papers (Journal Papers)

Files in This Item: There are no files associated with this item.

This item is cited by 1 other document in WoS.


Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
