Cargando…

Designing Trojan Detectors in Neural Networks Using Interactive Simulations

This paper addresses the problem of designing trojan detectors in neural networks (NNs) using interactive simulations. Trojans in NNs are defined as triggers in inputs that cause misclassification of such inputs into a class (or classes) unintended by the design of a NN-based model. The goal of our...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bajcsy, Peter, Schaub, Nicholas J., Majurski, Michael
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356191/ https://www.ncbi.nlm.nih.gov/pubmed/34386268 http://dx.doi.org/10.3390/app11041865

_version_	1783736898515107840
author	Bajcsy, Peter Schaub, Nicholas J. Majurski, Michael
author_facet	Bajcsy, Peter Schaub, Nicholas J. Majurski, Michael
author_sort	Bajcsy, Peter
collection	PubMed
description	This paper addresses the problem of designing trojan detectors in neural networks (NNs) using interactive simulations. Trojans in NNs are defined as triggers in inputs that cause misclassification of such inputs into a class (or classes) unintended by the design of a NN-based model. The goal of our work is to understand encodings of a variety of trojan types in fully connected layers of neural networks. Our approach is (1) to simulate nine types of trojan embeddings into dot patterns, (2) to devise measurements of NN states, and (3) to design trojan detectors in NN-based classification models. The interactive simulations are built on top of TensorFlow Playground with in-memory storage of data and NN coefficients. The simulations provide analytical, visualization, and output operations performed on training datasets and NN architectures. The measurements of a NN include (a) model inefficiency using modified Kullback-Liebler (KL) divergence from uniformly distributed states and (b) model sensitivity to variables related to data and NNs. Using the KL divergence measurements at each NN layer and per each predicted class label, a trojan detector is devised to discriminate NN models with or without trojans. To document robustness of such a trojan detector with respect to NN architectures, dataset perturbations, and trojan types, several properties of the KL divergence measurement are presented. For the general use, the web-based simulations is deployed via GitHub pages at https://github.com/usnistgov/nn-calculator.
format	Online Article Text
id	pubmed-8356191
institution	National Center for Biotechnology Information
language	English
publishDate	2021
record_format	MEDLINE/PubMed
spelling	pubmed-83561912021-08-11 Designing Trojan Detectors in Neural Networks Using Interactive Simulations Bajcsy, Peter Schaub, Nicholas J. Majurski, Michael Appl Sci (Basel) Article This paper addresses the problem of designing trojan detectors in neural networks (NNs) using interactive simulations. Trojans in NNs are defined as triggers in inputs that cause misclassification of such inputs into a class (or classes) unintended by the design of a NN-based model. The goal of our work is to understand encodings of a variety of trojan types in fully connected layers of neural networks. Our approach is (1) to simulate nine types of trojan embeddings into dot patterns, (2) to devise measurements of NN states, and (3) to design trojan detectors in NN-based classification models. The interactive simulations are built on top of TensorFlow Playground with in-memory storage of data and NN coefficients. The simulations provide analytical, visualization, and output operations performed on training datasets and NN architectures. The measurements of a NN include (a) model inefficiency using modified Kullback-Liebler (KL) divergence from uniformly distributed states and (b) model sensitivity to variables related to data and NNs. Using the KL divergence measurements at each NN layer and per each predicted class label, a trojan detector is devised to discriminate NN models with or without trojans. To document robustness of such a trojan detector with respect to NN architectures, dataset perturbations, and trojan types, several properties of the KL divergence measurement are presented. For the general use, the web-based simulations is deployed via GitHub pages at https://github.com/usnistgov/nn-calculator. 2021 /pmc/articles/PMC8356191/ /pubmed/34386268 http://dx.doi.org/10.3390/app11041865 Text en https://creativecommons.org/licenses/by/4.0/Submitted to Appl. Sci. for possible open access publication under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Bajcsy, Peter Schaub, Nicholas J. Majurski, Michael Designing Trojan Detectors in Neural Networks Using Interactive Simulations
title	Designing Trojan Detectors in Neural Networks Using Interactive Simulations
title_full	Designing Trojan Detectors in Neural Networks Using Interactive Simulations
title_fullStr	Designing Trojan Detectors in Neural Networks Using Interactive Simulations
title_full_unstemmed	Designing Trojan Detectors in Neural Networks Using Interactive Simulations
title_short	Designing Trojan Detectors in Neural Networks Using Interactive Simulations
title_sort	designing trojan detectors in neural networks using interactive simulations
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356191/ https://www.ncbi.nlm.nih.gov/pubmed/34386268 http://dx.doi.org/10.3390/app11041865
work_keys_str_mv	AT bajcsypeter designingtrojandetectorsinneuralnetworksusinginteractivesimulations AT schaubnicholasj designingtrojandetectorsinneuralnetworksusinginteractivesimulations AT majurskimichael designingtrojandetectorsinneuralnetworksusinginteractivesimulations

Designing Trojan Detectors in Neural Networks Using Interactive Simulations

Ejemplares similares