An active learning framework for adversarial training of deep neural networks

Susmita Ghosh; Abhiroop Chatterjee; Lance Fiondella

doi:10.1007/s00521-024-10851-6


Journal article

Peer reviewed


Neural computing & applications, Vol.37(9), pp.6849-6876

03/2025

DOI: https://doi.org/10.1007/s00521-024-10851-6

Artificial Intelligence

Computational Biology/Bioinformatics

Computational Science and Engineering

Data Mining and Knowledge Discovery

Image Processing and Computer Vision

Original Article

Probability and Statistics in Computer Science

Computer Science

Abstract

This article introduces “Targeted Adversarial Resilience Learning (TARL)”, a novel approach to bolstering the robustness of Deep Neural Network (DNN) models against adversarial attacks. An initial evaluation of a baseline DNN model reveals a significant decline in accuracy when the model is subjected to adversarial examples generated by attacks such as FGSM, PGD, Carlini-Wagner, and DeepFool. To address this vulnerability, the article proposes an active learning framework in which the model iteratively identifies and learns from the most uncertain and misclassified instances. The key components of this approach are estimating an uncertainty score for the predicted class of each input sample, selecting challenging samples based on this score, labeling these samples and adding them to the training set, and then retraining the model on the expanded training set. The iterative active learning process, governed by parameters such as the number of iterations and the batch size, demonstrates the potential to systematically enhance the resilience of DNNs against adversarial threats. The proposed methodology has been evaluated on several popular datasets such as the SARS-CoV-2 CT scan, MNIST, CIFAR-10, and Caltech-101, and shown to be effective. Experiments show that the learning framework improves adversarial accuracy from 17.4% to 98.71% on the SARS-CoV-2 dataset, from 8.4% to 99.89% on MNIST, from 1.6% to 78.84% on CIFAR-10, and from 12% to 92.92% on Caltech-101. Further, comparative analysis with several state-of-the-art methods suggests that the proposed framework offers a superior defense against various attack methods and is a promising defensive mechanism for deep neural networks.
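The loop the abstract describes (score uncertainty, select challenging samples, label them, add them to the training set, retrain) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes predictive entropy as the uncertainty score, a toy nearest-centroid classifier standing in for the DNN, and a pool whose labels are already known standing in for the labeling step. The names `predictive_entropy`, `CentroidClassifier`, and `active_learning_loop` are hypothetical, and generating the adversarial examples that would fill the pool is left out.

```python
import numpy as np


def predictive_entropy(probs):
    """Entropy of each row of class probabilities; higher means more uncertain.

    Assumption: entropy is only one possible uncertainty score; the abstract
    does not say which score the article uses.
    """
    p = np.clip(probs, 1e-12, 1.0)
    return -(p * np.log(p)).sum(axis=1)


class CentroidClassifier:
    """Toy stand-in for the DNN: softmax over negative squared distances
    to per-class centroids. Hypothetical, for illustration only."""

    def fit(self, x, y):
        self.classes_ = np.unique(y)
        self.centroids_ = np.stack(
            [x[y == c].mean(axis=0) for c in self.classes_]
        )
        return self

    def predict_proba(self, x):
        dist = ((x[:, None, :] - self.centroids_[None, :, :]) ** 2).sum(axis=2)
        logits = -dist
        logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
        e = np.exp(logits)
        return e / e.sum(axis=1, keepdims=True)


def active_learning_loop(model, x_train, y_train, x_pool, y_pool,
                         n_iterations, batch_size):
    """Repeatedly move the most uncertain pool samples into the training set.

    x_pool would hold adversarial examples in the setting of the article;
    y_pool plays the role of the labeling step.
    """
    model.fit(x_train, y_train)
    for _ in range(n_iterations):
        if len(x_pool) == 0:
            break
        scores = predictive_entropy(model.predict_proba(x_pool))
        picked = np.argsort(scores)[-batch_size:]  # highest-entropy samples
        x_train = np.concatenate([x_train, x_pool[picked]])
        y_train = np.concatenate([y_train, y_pool[picked]])
        keep = np.ones(len(x_pool), dtype=bool)
        keep[picked] = False
        x_pool, y_pool = x_pool[keep], y_pool[keep]
        model.fit(x_train, y_train)  # retrain on the expanded training set
    return model, x_train, y_train
```

The two parameters the abstract names, the number of iterations and the batch size, appear here as `n_iterations` and `batch_size`; together they bound how many pool samples are ever added to the training set.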


Details

Title: An active learning framework for adversarial training of deep neural networks
Creators: Susmita Ghosh - Jadavpur University
Abhiroop Chatterjee - Jadavpur University
Lance Fiondella - Department of Electrical and Computer Engineering, University of Massachusetts
Publication Details: Neural computing & applications, Vol.37(9), pp.6849-6876
Publisher: Springer London
Number of pages: 28
Language: English
Grant note: ISI/TIH/2022/55 / IDEAS - Institute of Data Engineering, Analytics, and Science Foundation, The Technology Innovation Hub at the Indian Statistical Institute, Kolkata
Academic Unit: Department of Electrical and Computer Engineering
Resource Type: Journal article
DOI: https://doi.org/10.1007/s00521-024-10851-6
Record Identifier: 9914432398501301
