Counterfactual Mix-Up for Visual Question Answering

DC Field | Value | Language
dc.contributor.author | Cho, Jae Won | ko
dc.contributor.author | Kim, Dong-Jin | ko
dc.contributor.author | Jung, Yunjae | ko
dc.contributor.author | Kweon, In So | ko
dc.date.accessioned | 2023-10-04T09:00:29Z | -
dc.date.available | 2023-10-04T09:00:29Z | -
dc.date.created | 2023-10-04 | -
dc.date.issued | 2023 | -
dc.identifier.citation | IEEE ACCESS, v.11, pp.95201 - 95212 | -
dc.identifier.issn | 2169-3536 | -
dc.identifier.uri | http://hdl.handle.net/10203/312984 | -
dc.description.abstract | Counterfactuals have been shown to be a powerful means of alleviating the unimodal bias of Visual Question Answering models. However, existing counterfactual methods tend to generate samples that lack diversity or require auxiliary models to synthesize additional data. We therefore propose a simpler and more diverse counterfactual sample synthesis method called Counterfactual Mix-Up (CoMiU), which generates counterfactual image features and questions through batch-wise swapping at the local object and word level. This method efficiently generates more abundant and diverse counterfactual samples, which help improve the robustness of Visual Question Answering models. Moreover, building on these diverse counterfactual samples, we introduce two more robust and stable contrastive loss functions, namely Batch-Contrastive loss and Answer-Contrastive loss. We test our method on various challenging Visual Question Answering robustness testing setups to show its advantages over current state-of-the-art methods. | -
dc.language | English | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.title | Counterfactual Mix-Up for Visual Question Answering | -
dc.type | Article | -
dc.identifier.wosid | 001064488700001 | -
dc.identifier.scopusid | 2-s2.0-85167799118 | -
dc.type.rims | ART | -
dc.citation.volume | 11 | -
dc.citation.beginningpage | 95201 | -
dc.citation.endingpage | 95212 | -
dc.citation.publicationname | IEEE ACCESS | -
dc.identifier.doi | 10.1109/ACCESS.2023.3303891 | -
dc.contributor.localauthor | Kweon, In So | -
dc.contributor.nonIdAuthor | Kim, Dong-Jin | -
dc.contributor.nonIdAuthor | Jung, Yunjae | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Computer vision | -
dc.subject.keywordAuthor | counterfactuals | -
dc.subject.keywordAuthor | visual question answering | -
dc.subject.keywordAuthor | unimodal bias | -
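
The abstract describes batch-wise swapping of local units (object features or words) between samples. The following is only a rough sketch of that general idea, not the authors' implementation: the function name `swap_across_batch`, the `swap_ratio` parameter, the random choice of positions, and the requirement that all samples have equal length are assumptions made for illustration. The record does not state how CoMiU selects which objects or words to swap.

```python
import random


def swap_across_batch(batch, swap_ratio=0.5, seed=0):
    """Hypothetical helper: mix local units between samples of one batch.

    batch: list of samples, each a list of the same length. A sample can
    hold word tokens or per-object feature vectors.
    Returns new samples; the input batch is left unchanged.
    """
    rng = random.Random(seed)
    size = len(batch)
    if size < 2:
        raise ValueError("need at least two samples to swap between")

    # A non-zero shift pairs every sample with a different donor sample.
    shift = rng.randrange(1, size)

    mixed = []
    for i, sample in enumerate(batch):
        donor = batch[(i + shift) % size]
        # Assumption: positions are chosen uniformly at random.
        n_swap = min(len(sample), max(1, int(len(sample) * swap_ratio)))
        positions = rng.sample(range(len(sample)), n_swap)
        new_sample = list(sample)
        for p in positions:
            new_sample[p] = donor[p]
        mixed.append(new_sample)
    return mixed


if __name__ == "__main__":
    questions = [
        ["what", "color", "is", "the", "cat"],
        ["how", "many", "dogs", "are", "there"],
    ]
    for original, counterfactual in zip(questions, swap_across_batch(questions)):
        print(original, "->", counterfactual)
```

Because the donor is always another sample from the same batch, a sketch like this needs no auxiliary generative model, which is the property the abstract emphasizes. The same function can be applied to lists of object feature vectors instead of words.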
Appears in Collection
EE-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.