Cargando…

SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks

Insecure applications (apps) are increasingly used to steal users’ location information for illegal purposes, which has aroused great concern in recent years. Although the existing methods, i.e., static and dynamic taint analysis, have shown great merit for identifying such apps, which mainly rely o...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Guangwu, Zhang, Bin, Xiao, Xi, Zhang, Weizhe, Liao, Long, Zhou, Ying, Yan, Xia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623917/
https://www.ncbi.nlm.nih.gov/pubmed/34828187
http://dx.doi.org/10.3390/e23111489
_version_ 1784606047037554688
author Hu, Guangwu
Zhang, Bin
Xiao, Xi
Zhang, Weizhe
Liao, Long
Zhou, Ying
Yan, Xia
author_facet Hu, Guangwu
Zhang, Bin
Xiao, Xi
Zhang, Weizhe
Liao, Long
Zhou, Ying
Yan, Xia
author_sort Hu, Guangwu
collection PubMed
description Insecure applications (apps) are increasingly used to steal users’ location information for illegal purposes, which has aroused great concern in recent years. Although the existing methods, i.e., static and dynamic taint analysis, have shown great merit for identifying such apps, which mainly rely on statically analyzing source code or dynamically monitoring the location data flow, identification accuracy is still under research, since the analysis results contain a certain false positive or true negative rate. In order to improve the accuracy and reduce the misjudging rate in the process of vetting suspicious apps, this paper proposes SAMLDroid, a combined method of static code analysis and machine learning for identifying Android apps with location privacy leakage, which can effectively improve the identification rate compared with existing methods. SAMLDroid first uses static analysis to scrutinize source code to investigate apps with location acquiring intentions. Then it exploits a well-trained classifier and integrates an app’s multiple features to dynamically analyze the pattern and deliver the final verdict about the app’s property. Finally, it is proved by conducting experiments, that the accuracy rate of SAMLDroid is up to 98.4%, which is nearly 20% higher than Apparecium.
format Online
Article
Text
id pubmed-8623917
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86239172021-11-27 SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks Hu, Guangwu Zhang, Bin Xiao, Xi Zhang, Weizhe Liao, Long Zhou, Ying Yan, Xia Entropy (Basel) Article Insecure applications (apps) are increasingly used to steal users’ location information for illegal purposes, which has aroused great concern in recent years. Although the existing methods, i.e., static and dynamic taint analysis, have shown great merit for identifying such apps, which mainly rely on statically analyzing source code or dynamically monitoring the location data flow, identification accuracy is still under research, since the analysis results contain a certain false positive or true negative rate. In order to improve the accuracy and reduce the misjudging rate in the process of vetting suspicious apps, this paper proposes SAMLDroid, a combined method of static code analysis and machine learning for identifying Android apps with location privacy leakage, which can effectively improve the identification rate compared with existing methods. SAMLDroid first uses static analysis to scrutinize source code to investigate apps with location acquiring intentions. Then it exploits a well-trained classifier and integrates an app’s multiple features to dynamically analyze the pattern and deliver the final verdict about the app’s property. Finally, it is proved by conducting experiments, that the accuracy rate of SAMLDroid is up to 98.4%, which is nearly 20% higher than Apparecium. MDPI 2021-11-10 /pmc/articles/PMC8623917/ /pubmed/34828187 http://dx.doi.org/10.3390/e23111489 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hu, Guangwu
Zhang, Bin
Xiao, Xi
Zhang, Weizhe
Liao, Long
Zhou, Ying
Yan, Xia
SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks
title SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks
title_full SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks
title_fullStr SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks
title_full_unstemmed SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks
title_short SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks
title_sort samldroid: a static taint analysis and machine learning combined high-accuracy method for identifying android apps with location privacy leakage risks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623917/
https://www.ncbi.nlm.nih.gov/pubmed/34828187
http://dx.doi.org/10.3390/e23111489
work_keys_str_mv AT huguangwu samldroidastatictaintanalysisandmachinelearningcombinedhighaccuracymethodforidentifyingandroidappswithlocationprivacyleakagerisks
AT zhangbin samldroidastatictaintanalysisandmachinelearningcombinedhighaccuracymethodforidentifyingandroidappswithlocationprivacyleakagerisks
AT xiaoxi samldroidastatictaintanalysisandmachinelearningcombinedhighaccuracymethodforidentifyingandroidappswithlocationprivacyleakagerisks
AT zhangweizhe samldroidastatictaintanalysisandmachinelearningcombinedhighaccuracymethodforidentifyingandroidappswithlocationprivacyleakagerisks
AT liaolong samldroidastatictaintanalysisandmachinelearningcombinedhighaccuracymethodforidentifyingandroidappswithlocationprivacyleakagerisks
AT zhouying samldroidastatictaintanalysisandmachinelearningcombinedhighaccuracymethodforidentifyingandroidappswithlocationprivacyleakagerisks
AT yanxia samldroidastatictaintanalysisandmachinelearningcombinedhighaccuracymethodforidentifyingandroidappswithlocationprivacyleakagerisks