Improving adversarial transferability with worst-case aware attacks적대적 공격의 전이가능성 개선을 위한 최악 인지 공격 기법 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 132
  • Download : 0
Generating adversarial examples with high transferability is key to practical black-box attack scenarios, where the attacker has limited or no information about the target models. While previous works mainly deal with input transformation or optimization process to reduce overfitting on a surrogate model and enhance adversarial transferability, we find that well-designed model manipulation can provide complementary gain to existing methods. We propose Worst-case Aware Attack (WAA), a simple effective method that provides access to a virtual ensemble of models to mitigate overfitting on a specific model during the adversarial example generation process. Specifically, WAA formulates bi-level optimizations to seek adversarial examples that are robust against the worst-case models, which are created by adding per-example weight perturbation to the source model towards the direction of weakening the adversarial sample in question. Unlike other model manipulation methods, WAA does not require multiple surrogate models or architecture-specific knowledge. Experimental results on ImageNet demonstrate that WAA can be incorporated with a variety of existing methods to consistently improve transferability in different settings, including naturally trained models, adversarially trained models, and adversarial defenses.
Advisors
Hong, Seunghoonresearcher홍승훈researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2023.2,[iii, 30 p. :]

Keywords

Black-box Adversarial Attack▼aAdversarial Transferability▼aOverfitting; 블랙박스 적대적 공격▼a적대적 전이가능성▼a과적합

URI
http://hdl.handle.net/10203/309590
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1032953&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0