Back to Search Start Over

Empirical Perturbation Analysis of Two Adversarial Attacks: Black Box versus White Box

Authors :
FRANCK LEPREVOST
Raluca Chitic
Ali Osman Topal
University of Luxembourg: High Performance Computing - ULHPC [research center]
Source :
Applied Sciences; Volume 12; Issue 14; Pages: 7339
Publication Year :
2022
Publisher :
MDPI AG, 2022.

Abstract

Through the addition of humanly imperceptible noise to an image classified as belonging to a category ca, targeted adversarial attacks can lead convolutional neural networks (CNNs) to classify a modified image as belonging to any predefined target class ct≠ca. To achieve a better understanding of the inner workings of adversarial attacks, this study analyzes the adversarial images created by two completely opposite attacks against 10 ImageNet-trained CNNs. A total of 2×437 adversarial images are created by EAtarget,C, a black-box evolutionary algorithm (EA), and by the basic iterative method (BIM), a white-box, gradient-based attack. We inspect and compare these two sets of adversarial images from different perspectives: the behavior of CNNs at smaller image regions, the image noise frequency, the adversarial image transferability, the image texture change, and penultimate CNN layer activations. We find that texture change is a side effect rather than a means for the attacks and that ct-relevant features only build up significantly from image regions of size 56×56 onwards. In the penultimate CNN layers, both attacks increase the activation of units that are positively related to ct and units that are negatively related to ca. In contrast to EAtarget,C’s white noise nature, BIM predominantly introduces low-frequency noise. BIM affects the original ca features more than EAtarget,C, thus producing slightly more transferable adversarial images. However, the transferability with both attacks is low, since the attacks’ ct-related information is specific to the output layers of the targeted CNN. We find that the adversarial images are actually more transferable at regions with sizes of 56×56 than at full scale.

Details

ISSN :
20763417
Volume :
12
Database :
OpenAIRE
Journal :
Applied Sciences
Accession number :
edsair.doi.dedup.....a638d3756f07e422ecb1fd9ab17d7267
Full Text :
https://doi.org/10.3390/app12147339