Guiding the retraining of convolutional neural networks against adversarial inputs

Bibliographic Details
Main Authors: Durán, Francisco; Martínez-Fernández, Silverio; Felderer, Michael; Franch, Xavier
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2023
Subjects: Artificial Intelligence
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10495969/
https://www.ncbi.nlm.nih.gov/pubmed/37705636
http://dx.doi.org/10.7717/peerj-cs.1454
_version_ 1785105007640576000
author Durán, Francisco
Martínez-Fernández, Silverio
Felderer, Michael
Franch, Xavier
author_facet Durán, Francisco
Martínez-Fernández, Silverio
Felderer, Michael
Franch, Xavier
author_sort Durán, Francisco
collection PubMed
description BACKGROUND: When using deep learning models, one of the most critical vulnerabilities is their exposure to adversarial inputs, which can cause wrong decisions (e.g., incorrect classification of an image) with minor perturbations. To address this vulnerability, it becomes necessary to retrain the affected model against adversarial inputs as part of the software testing process. To make this process energy efficient, data scientists need support in selecting the best guidance metrics for reducing the number of adversarial inputs to create and use during testing, as well as optimal dataset configurations. AIM: We examined six guidance metrics for retraining deep learning models, specifically with convolutional neural network architectures, and three retraining configurations. Our goal is to improve the robustness of convolutional neural networks against adversarial inputs with regard to accuracy, resource utilization, and execution time, from the point of view of a data scientist in the context of image classification. METHOD: We conducted an empirical study using five datasets for image classification. We explore: (a) the accuracy, resource utilization, and execution time of retraining convolutional neural networks guided by six different metrics (neuron coverage, likelihood-based surprise adequacy, distance-based surprise adequacy, DeepGini, softmax entropy, and random), and (b) the accuracy and resource utilization of retraining convolutional neural networks with three different configurations (one-step adversarial retraining, adversarial retraining, and adversarial fine-tuning). RESULTS: We find that adversarial retraining from the original model weights, with inputs ordered by uncertainty metrics, gives the best model with respect to accuracy, resource utilization, and execution time. CONCLUSIONS: Although more studies are necessary, we recommend that data scientists use the above configuration and metrics to deal with the vulnerability of deep learning models to adversarial inputs, since they can improve their models against adversarial inputs without using many inputs and without creating numerous adversarial inputs. We also show that dataset size has an important impact on the results.
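Of the six guidance metrics named in the abstract, the two uncertainty metrics can be computed from a model's softmax outputs alone. Below is a minimal, illustrative Python sketch (not the authors' code; the function names and example probabilities are assumptions) of how DeepGini and softmax entropy rank inputs most-uncertain-first to select which ones to use for retraining:

```python
# Illustrative sketch of two of the six guidance metrics: DeepGini and
# softmax entropy. Both score an input's uncertainty from the model's
# softmax output; inputs are then ranked most-uncertain-first.
import numpy as np

def deepgini(probs: np.ndarray) -> np.ndarray:
    """DeepGini impurity per input: 1 - sum_c p_c^2 (higher = more uncertain)."""
    return 1.0 - np.sum(probs ** 2, axis=1)

def softmax_entropy(probs: np.ndarray) -> np.ndarray:
    """Shannon entropy of the softmax output (higher = more uncertain)."""
    eps = 1e-12  # guard against log(0)
    return -np.sum(probs * np.log(probs + eps), axis=1)

def rank_by(scores: np.ndarray) -> np.ndarray:
    """Indices of inputs sorted from most to least uncertain."""
    return np.argsort(-scores)

# Example: softmax outputs of a 10-class CNN on three inputs.
probs = np.array([
    [0.96, 0.01, 0.01, 0.005, 0.005, 0.0, 0.0, 0.0, 0.005, 0.005],  # confident
    [0.50, 0.30, 0.10, 0.05, 0.05, 0.0, 0.0, 0.0, 0.0, 0.0],        # unsure
    [0.10] * 10,                                                    # maximally unsure
])
print(rank_by(deepgini(probs)))         # -> [2 1 0]
print(rank_by(softmax_entropy(probs)))  # -> [2 1 0]
```

The surprise adequacy metrics (likelihood-based and distance-based) additionally require activation traces over the training set, so they are omitted from this sketch.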
format Online
Article
Text
id pubmed-10495969
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-10495969 2023-09-13 Guiding the retraining of convolutional neural networks against adversarial inputs Durán, Francisco Martínez-Fernández, Silverio Felderer, Michael Franch, Xavier PeerJ Comput Sci Artificial Intelligence BACKGROUND: When using deep learning models, one of the most critical vulnerabilities is their exposure to adversarial inputs, which can cause wrong decisions (e.g., incorrect classification of an image) with minor perturbations. To address this vulnerability, it becomes necessary to retrain the affected model against adversarial inputs as part of the software testing process. To make this process energy efficient, data scientists need support in selecting the best guidance metrics for reducing the number of adversarial inputs to create and use during testing, as well as optimal dataset configurations. AIM: We examined six guidance metrics for retraining deep learning models, specifically with convolutional neural network architectures, and three retraining configurations. Our goal is to improve the robustness of convolutional neural networks against adversarial inputs with regard to accuracy, resource utilization, and execution time, from the point of view of a data scientist in the context of image classification. METHOD: We conducted an empirical study using five datasets for image classification. We explore: (a) the accuracy, resource utilization, and execution time of retraining convolutional neural networks guided by six different metrics (neuron coverage, likelihood-based surprise adequacy, distance-based surprise adequacy, DeepGini, softmax entropy, and random), and (b) the accuracy and resource utilization of retraining convolutional neural networks with three different configurations (one-step adversarial retraining, adversarial retraining, and adversarial fine-tuning). RESULTS: We find that adversarial retraining from the original model weights, with inputs ordered by uncertainty metrics, gives the best model with respect to accuracy, resource utilization, and execution time. CONCLUSIONS: Although more studies are necessary, we recommend that data scientists use the above configuration and metrics to deal with the vulnerability of deep learning models to adversarial inputs, since they can improve their models against adversarial inputs without using many inputs and without creating numerous adversarial inputs. We also show that dataset size has an important impact on the results. PeerJ Inc. 2023-08-08 /pmc/articles/PMC10495969/ /pubmed/37705636 http://dx.doi.org/10.7717/peerj-cs.1454 Text en © 2023 Durán et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
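The reported result distinguishes adversarial retraining (restarting from the original model weights) from adversarial fine-tuning (continuing from the current weights). Below is a minimal PyTorch sketch of that recommended configuration, using FGSM as an illustrative adversarial-input generator; the helper names, hyperparameters, and the assumption that pixels lie in [0, 1] are ours, not the paper's implementation:

```python
# Minimal sketch of adversarial retraining from the original model
# weights, with adversarial examples added only for batches a guidance
# metric selected. FGSM is an illustrative stand-in attack.
import torch
import torch.nn.functional as F

def fgsm(model, x, y, eps=0.03):
    """One-step FGSM adversarial examples for a batch (x, y)."""
    x_adv = x.clone().detach().requires_grad_(True)
    F.cross_entropy(model(x_adv), y).backward()
    return (x_adv + eps * x_adv.grad.sign()).clamp(0, 1).detach()

def adversarial_retrain(model, original_state, loader, selected, epochs=5):
    """Retrain from the original weights on clean batches plus adversarial
    counterparts of the batches chosen by a guidance metric."""
    model.load_state_dict(original_state)  # restart, not fine-tune
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    model.train()
    for _ in range(epochs):
        for i, (x, y) in enumerate(loader):
            if i in selected:  # batch indices ranked by e.g. DeepGini
                x = torch.cat([x, fgsm(model, x, y)])
                y = torch.cat([y, y])
            opt.zero_grad()  # clears gradients left over from fgsm()
            F.cross_entropy(model(x), y).backward()
            opt.step()
    return model
```

Restarting from the original weights is what separates this configuration from adversarial fine-tuning, which keeps training from the current weights; per the abstract, the restart variant gave the best trade-off between accuracy, resource utilization, and execution time.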
spellingShingle Artificial Intelligence
Durán, Francisco
Martínez-Fernández, Silverio
Felderer, Michael
Franch, Xavier
Guiding the retraining of convolutional neural networks against adversarial inputs
title Guiding the retraining of convolutional neural networks against adversarial inputs
title_full Guiding the retraining of convolutional neural networks against adversarial inputs
title_fullStr Guiding the retraining of convolutional neural networks against adversarial inputs
title_full_unstemmed Guiding the retraining of convolutional neural networks against adversarial inputs
title_short Guiding the retraining of convolutional neural networks against adversarial inputs
title_sort guiding the retraining of convolutional neural networks against adversarial inputs
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10495969/
https://www.ncbi.nlm.nih.gov/pubmed/37705636
http://dx.doi.org/10.7717/peerj-cs.1454
work_keys_str_mv AT duranfrancisco guidingtheretrainingofconvolutionalneuralnetworksagainstadversarialinputs
AT martinezfernandezsilverio guidingtheretrainingofconvolutionalneuralnetworksagainstadversarialinputs
AT felderermichael guidingtheretrainingofconvolutionalneuralnetworksagainstadversarialinputs
AT franchxavier guidingtheretrainingofconvolutionalneuralnetworksagainstadversarialinputs