Abstract

Translate

The fast improvement of deep learning methods resulted in breakthroughs in image classification, however, these models are sensitive to adversarial perturbations, which can cause serious problems. Adversarial attacks try to change the model output by adding noise to the input, in our research we propose a combined defense method against it. Two defense approaches have been evolved in the literature, one robustizes the attacked model for higher accuracy, and the other approach detects the adversarial examples. Only very few papers discuss both approaches, thus our aim was to combine them to obtain a more robust model and to examine the combination, in particular the filtering capability of the detector. Our contribution was that the filtering based on the decision of the detector is able to enhance the accuracy, which was theoretically proved. Besides that, we developed a novel defense method called 2N labeling, where we extended the idea of the NULL labeling method. While the NULL labeling suggests only one new class for the adversarial examples, the 2N labeling method suggests twice as much. The novelty of our idea is that a new extended class is assigned to each original class, as the adversarial version of it, thus it assists the detector and robust classifier as well. The 2N labeling method was compared to competitor methods on two test datasets. The results presented that our method surpassed the others, and it can operate with a constant classification performance regardless of the presence or amplitude of adversarial attacks.

Details

Title

2N labeling defense method against adversarial attacks by filtering and extended class label set

Author

Szűcs, Gábor¹

; Kiss, Richárd¹

¹ Budapest University of Technology and Economics, Department of Telecommunications and Media Informatics, Budapest, Hungary (GRID:grid.6759.d) (ISNI:0000 0001 2180 0451)

Pages

16717-16740

Publication year

2023

Publication date

May 2023

Publisher

Springer Nature B.V.

ISSN

13807501

e-ISSN

15737721

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1007/s11042-022-14021-5

ProQuest document ID

2801404285

© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

2N labeling defense method against adversarial attacks by filtering and extended class label set

Jump to:

Abstract

Details

Suggested sources