Improving adversarial transferability with worst-case aware attacks적대적 공격의 전이가능성 개선을 위한 최악 인지 공격 기법 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 215
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorHong, Seunghoon-
dc.contributor.advisor홍승훈-
dc.contributor.authorMyung, Sunghyun-
dc.date.accessioned2023-06-26T19:31:47Z-
dc.date.available2023-06-26T19:31:47Z-
dc.date.issued2023-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1032953&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/309590-
dc.description학위논문(석사) - 한국과학기술원 : 전산학부, 2023.2,[iii, 30 p. :]-
dc.description.abstractGenerating adversarial examples with high transferability is key to practical black-box attack scenarios, where the attacker has limited or no information about the target models. While previous works mainly deal with input transformation or optimization process to reduce overfitting on a surrogate model and enhance adversarial transferability, we find that well-designed model manipulation can provide complementary gain to existing methods. We propose Worst-case Aware Attack (WAA), a simple effective method that provides access to a virtual ensemble of models to mitigate overfitting on a specific model during the adversarial example generation process. Specifically, WAA formulates bi-level optimizations to seek adversarial examples that are robust against the worst-case models, which are created by adding per-example weight perturbation to the source model towards the direction of weakening the adversarial sample in question. Unlike other model manipulation methods, WAA does not require multiple surrogate models or architecture-specific knowledge. Experimental results on ImageNet demonstrate that WAA can be incorporated with a variety of existing methods to consistently improve transferability in different settings, including naturally trained models, adversarially trained models, and adversarial defenses.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectBlack-box Adversarial Attack▼aAdversarial Transferability▼aOverfitting-
dc.subject블랙박스 적대적 공격▼a적대적 전이가능성▼a과적합-
dc.titleImproving adversarial transferability with worst-case aware attacks-
dc.title.alternative적대적 공격의 전이가능성 개선을 위한 최악 인지 공격 기법 연구-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전산학부,-
dc.contributor.alternativeauthor명성현-
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0