Cargando…

FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification

With the popularity of Android and its open source, the Android platform has become an attractive target for hackers, and the detection and classification of malware has become a research hotspot. Existing malware classification methods rely on complex manual operation or large-volume high-quality t...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Changnan, Yin, Kanglong, Xia, Chunhe, Huang, Weidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9317647/
https://www.ncbi.nlm.nih.gov/pubmed/35885142
http://dx.doi.org/10.3390/e24070919
_version_ 1784755108039360512
author Jiang, Changnan
Yin, Kanglong
Xia, Chunhe
Huang, Weidong
author_facet Jiang, Changnan
Yin, Kanglong
Xia, Chunhe
Huang, Weidong
author_sort Jiang, Changnan
collection PubMed
description With the popularity of Android and its open source, the Android platform has become an attractive target for hackers, and the detection and classification of malware has become a research hotspot. Existing malware classification methods rely on complex manual operation or large-volume high-quality training data. However, malware data collected by security providers contains user privacy information, such as user identity and behavior habit information. The increasing concern for user privacy poses a challenge to the current malware classification scheme. Based on this problem, we propose a new android malware classification scheme based on Federated learning, named FedHGCDroid, which classifies malware on Android clients in a privacy-protected manner. Firstly, we use a convolutional neural network and graph neural network to design a novel multi-dimensional malware classification model HGCDroid, which can effectively extract malicious behavior features to classify the malware accurately. Secondly, we introduce an FL framework to enable distributed Android clients to collaboratively train a comprehensive Android malware classification model in a privacy-preserving way. Finally, to adapt to the non-IID distribution of malware on Android clients, we propose a contribution degree-based adaptive classifier training mechanism FedAdapt to improve the adaptability of the malware classifier based on Federated learning. Comprehensive experimental studies on the Androzoo dataset (under different non-IID data settings) show that the FedHGCDroid achieves more adaptability and higher accuracy than the other state-of-the-art methods.
format Online
Article
Text
id pubmed-9317647
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93176472022-07-27 FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification Jiang, Changnan Yin, Kanglong Xia, Chunhe Huang, Weidong Entropy (Basel) Article With the popularity of Android and its open source, the Android platform has become an attractive target for hackers, and the detection and classification of malware has become a research hotspot. Existing malware classification methods rely on complex manual operation or large-volume high-quality training data. However, malware data collected by security providers contains user privacy information, such as user identity and behavior habit information. The increasing concern for user privacy poses a challenge to the current malware classification scheme. Based on this problem, we propose a new android malware classification scheme based on Federated learning, named FedHGCDroid, which classifies malware on Android clients in a privacy-protected manner. Firstly, we use a convolutional neural network and graph neural network to design a novel multi-dimensional malware classification model HGCDroid, which can effectively extract malicious behavior features to classify the malware accurately. Secondly, we introduce an FL framework to enable distributed Android clients to collaboratively train a comprehensive Android malware classification model in a privacy-preserving way. Finally, to adapt to the non-IID distribution of malware on Android clients, we propose a contribution degree-based adaptive classifier training mechanism FedAdapt to improve the adaptability of the malware classifier based on Federated learning. Comprehensive experimental studies on the Androzoo dataset (under different non-IID data settings) show that the FedHGCDroid achieves more adaptability and higher accuracy than the other state-of-the-art methods. MDPI 2022-07-01 /pmc/articles/PMC9317647/ /pubmed/35885142 http://dx.doi.org/10.3390/e24070919 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jiang, Changnan
Yin, Kanglong
Xia, Chunhe
Huang, Weidong
FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification
title FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification
title_full FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification
title_fullStr FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification
title_full_unstemmed FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification
title_short FedHGCDroid: An Adaptive Multi-Dimensional Federated Learning for Privacy-Preserving Android Malware Classification
title_sort fedhgcdroid: an adaptive multi-dimensional federated learning for privacy-preserving android malware classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9317647/
https://www.ncbi.nlm.nih.gov/pubmed/35885142
http://dx.doi.org/10.3390/e24070919
work_keys_str_mv AT jiangchangnan fedhgcdroidanadaptivemultidimensionalfederatedlearningforprivacypreservingandroidmalwareclassification
AT yinkanglong fedhgcdroidanadaptivemultidimensionalfederatedlearningforprivacypreservingandroidmalwareclassification
AT xiachunhe fedhgcdroidanadaptivemultidimensionalfederatedlearningforprivacypreservingandroidmalwareclassification
AT huangweidong fedhgcdroidanadaptivemultidimensionalfederatedlearningforprivacypreservingandroidmalwareclassification