Cargando…
A Hybrid Analysis-Based Approach to Android Malware Family Classification
With the popularity of Android, malware detection and family classification have also become a research focus. Many excellent methods have been proposed by previous authors, but static and dynamic analyses inevitably require complex processes. A hybrid analysis method for detecting Android malware a...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8394521/ https://www.ncbi.nlm.nih.gov/pubmed/34441149 http://dx.doi.org/10.3390/e23081009 |
_version_ | 1783743967537397760 |
---|---|
author | Ding, Chao Luktarhan, Nurbol Lu, Bei Zhang, Wenhui |
author_facet | Ding, Chao Luktarhan, Nurbol Lu, Bei Zhang, Wenhui |
author_sort | Ding, Chao |
collection | PubMed |
description | With the popularity of Android, malware detection and family classification have also become a research focus. Many excellent methods have been proposed by previous authors, but static and dynamic analyses inevitably require complex processes. A hybrid analysis method for detecting Android malware and classifying malware families is presented in this paper, and is partially optimized for multiple-feature data. For static analysis, we use permissions and intent as static features and use three feature selection methods to form a subset of three candidate features. Compared with various models, including k-nearest neighbors and random forest, random forest is the best, with a detection rate of 95.04%, while the chi-square test is the best feature selection method. After using feature selection to explore the critical static features contained in this dataset, we analyzed a subset of important features to gain more insight into the malware. In a dynamic analysis based on network traffic, unlike those that focus on a one-way flow of traffic and work on HTTP protocols and transport layer protocols, we focused on sessions and retained protocol layers. The Res7LSTM model is then used to further classify the malicious and partially benign samples detected in the static detection. The experimental results show that our approach can not only work with fewer static features and guarantee sufficient accuracy, but also improve the detection rate of Android malware family classification from 71.48% in previous work to 99% when cutting the traffic in terms of the sessions and protocols of all layers. |
format | Online Article Text |
id | pubmed-8394521 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83945212021-08-28 A Hybrid Analysis-Based Approach to Android Malware Family Classification Ding, Chao Luktarhan, Nurbol Lu, Bei Zhang, Wenhui Entropy (Basel) Article With the popularity of Android, malware detection and family classification have also become a research focus. Many excellent methods have been proposed by previous authors, but static and dynamic analyses inevitably require complex processes. A hybrid analysis method for detecting Android malware and classifying malware families is presented in this paper, and is partially optimized for multiple-feature data. For static analysis, we use permissions and intent as static features and use three feature selection methods to form a subset of three candidate features. Compared with various models, including k-nearest neighbors and random forest, random forest is the best, with a detection rate of 95.04%, while the chi-square test is the best feature selection method. After using feature selection to explore the critical static features contained in this dataset, we analyzed a subset of important features to gain more insight into the malware. In a dynamic analysis based on network traffic, unlike those that focus on a one-way flow of traffic and work on HTTP protocols and transport layer protocols, we focused on sessions and retained protocol layers. The Res7LSTM model is then used to further classify the malicious and partially benign samples detected in the static detection. The experimental results show that our approach can not only work with fewer static features and guarantee sufficient accuracy, but also improve the detection rate of Android malware family classification from 71.48% in previous work to 99% when cutting the traffic in terms of the sessions and protocols of all layers. MDPI 2021-08-03 /pmc/articles/PMC8394521/ /pubmed/34441149 http://dx.doi.org/10.3390/e23081009 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ding, Chao Luktarhan, Nurbol Lu, Bei Zhang, Wenhui A Hybrid Analysis-Based Approach to Android Malware Family Classification |
title | A Hybrid Analysis-Based Approach to Android Malware Family Classification |
title_full | A Hybrid Analysis-Based Approach to Android Malware Family Classification |
title_fullStr | A Hybrid Analysis-Based Approach to Android Malware Family Classification |
title_full_unstemmed | A Hybrid Analysis-Based Approach to Android Malware Family Classification |
title_short | A Hybrid Analysis-Based Approach to Android Malware Family Classification |
title_sort | hybrid analysis-based approach to android malware family classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8394521/ https://www.ncbi.nlm.nih.gov/pubmed/34441149 http://dx.doi.org/10.3390/e23081009 |
work_keys_str_mv | AT dingchao ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification AT luktarhannurbol ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification AT lubei ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification AT zhangwenhui ahybridanalysisbasedapproachtoandroidmalwarefamilyclassification AT dingchao hybridanalysisbasedapproachtoandroidmalwarefamilyclassification AT luktarhannurbol hybridanalysisbasedapproachtoandroidmalwarefamilyclassification AT lubei hybridanalysisbasedapproachtoandroidmalwarefamilyclassification AT zhangwenhui hybridanalysisbasedapproachtoandroidmalwarefamilyclassification |