Cargando…

A Hybrid Analysis-Based Approach to Android Malware Family Classification

With the popularity of Android, malware detection and family classification have also become a research focus. Many excellent methods have been proposed by previous authors, but static and dynamic analyses inevitably require complex processes. A hybrid analysis method for detecting Android malware a...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Chao, Luktarhan, Nurbol, Lu, Bei, Zhang, Wenhui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8394521/
https://www.ncbi.nlm.nih.gov/pubmed/34441149
http://dx.doi.org/10.3390/e23081009
_version_ 1783743967537397760
author Ding, Chao
Luktarhan, Nurbol
Lu, Bei
Zhang, Wenhui
author_facet Ding, Chao
Luktarhan, Nurbol
Lu, Bei
Zhang, Wenhui
author_sort Ding, Chao
collection PubMed
description With the popularity of Android, malware detection and family classification have also become a research focus. Many excellent methods have been proposed by previous authors, but static and dynamic analyses inevitably require complex processes. A hybrid analysis method for detecting Android malware and classifying malware families is presented in this paper, and is partially optimized for multiple-feature data. For static analysis, we use permissions and intent as static features and use three feature selection methods to form a subset of three candidate features. Compared with various models, including k-nearest neighbors and random forest, random forest is the best, with a detection rate of 95.04%, while the chi-square test is the best feature selection method. After using feature selection to explore the critical static features contained in this dataset, we analyzed a subset of important features to gain more insight into the malware. In a dynamic analysis based on network traffic, unlike those that focus on a one-way flow of traffic and work on HTTP protocols and transport layer protocols, we focused on sessions and retained protocol layers. The Res7LSTM model is then used to further classify the malicious and partially benign samples detected in the static detection. The experimental results show that our approach can not only work with fewer static features and guarantee sufficient accuracy, but also improve the detection rate of Android malware family classification from 71.48% in previous work to 99% when cutting the traffic in terms of the sessions and protocols of all layers.
format Online
Article
Text
id pubmed-8394521
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83945212021-08-28 A Hybrid Analysis-Based Approach to Android Malware Family Classification Ding, Chao Luktarhan, Nurbol Lu, Bei Zhang, Wenhui Entropy (Basel) Article With the popularity of Android, malware detection and family classification have also become a research focus. Many excellent methods have been proposed by previous authors, but static and dynamic analyses inevitably require complex processes. A hybrid analysis method for detecting Android malware and classifying malware families is presented in this paper, and is partially optimized for multiple-feature data. For static analysis, we use permissions and intent as static features and use three feature selection methods to form a subset of three candidate features. Compared with various models, including k-nearest neighbors and random forest, random forest is the best, with a detection rate of 95.04%, while the chi-square test is the best feature selection method. After using feature selection to explore the critical static features contained in this dataset, we analyzed a subset of important features to gain more insight into the malware. In a dynamic analysis based on network traffic, unlike those that focus on a one-way flow of traffic and work on HTTP protocols and transport layer protocols, we focused on sessions and retained protocol layers. The Res7LSTM model is then used to further classify the malicious and partially benign samples detected in the static detection. The experimental results show that our approach can not only work with fewer static features and guarantee sufficient accuracy, but also improve the detection rate of Android malware family classification from 71.48% in previous work to 99% when cutting the traffic in terms of the sessions and protocols of all layers. MDPI 2021-08-03 /pmc/articles/PMC8394521/ /pubmed/34441149 http://dx.doi.org/10.3390/e23081009 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ding, Chao
Luktarhan, Nurbol
Lu, Bei
Zhang, Wenhui
A Hybrid Analysis-Based Approach to Android Malware Family Classification
title A Hybrid Analysis-Based Approach to Android Malware Family Classification
title_full A Hybrid Analysis-Based Approach to Android Malware Family Classification
title_fullStr A Hybrid Analysis-Based Approach to Android Malware Family Classification
title_full_unstemmed A Hybrid Analysis-Based Approach to Android Malware Family Classification
title_short A Hybrid Analysis-Based Approach to Android Malware Family Classification
title_sort hybrid analysis-based approach to android malware family classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8394521/
https://www.ncbi.nlm.nih.gov/pubmed/34441149
http://dx.doi.org/10.3390/e23081009
work_keys_str_mv AT dingchao ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification
AT luktarhannurbol ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification
AT lubei ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification
AT zhangwenhui ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification
AT dingchao hybridanalysisbasedapproachtoandroidmalwarefamilyclassification
AT luktarhannurbol hybridanalysisbasedapproachtoandroidmalwarefamilyclassification
AT lubei hybridanalysisbasedapproachtoandroidmalwarefamilyclassification
AT zhangwenhui hybridanalysisbasedapproachtoandroidmalwarefamilyclassification