Cargando…
Code Action Network for Binary Function Scope Identification
Function identification is a preliminary step in binary analysis for many applications from malware detection, common vulnerability detection and binary instrumentation to name a few. In this paper, we propose the Code Action Network (CAN) whose key idea is to encode the task of function scope ident...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206166/ http://dx.doi.org/10.1007/978-3-030-47426-3_55 |
_version_ | 1783530360457396224 |
---|---|
author | Nguyen, Van Le, Trung Le, Tue Nguyen, Khanh Vel, Olivier de Montague, Paul Grundy, John Phung, Dinh |
author_facet | Nguyen, Van Le, Trung Le, Tue Nguyen, Khanh Vel, Olivier de Montague, Paul Grundy, John Phung, Dinh |
author_sort | Nguyen, Van |
collection | PubMed |
description | Function identification is a preliminary step in binary analysis for many applications from malware detection, common vulnerability detection and binary instrumentation to name a few. In this paper, we propose the Code Action Network (CAN) whose key idea is to encode the task of function scope identification to a sequence of three action states NI (i.e., next inclusion), NE (i.e., next exclusion), and FE (i.e., function end) to efficiently and effectively tackle function scope identification, the hardest and most crucial task in function identification. A bidirectional Recurrent Neural Network is trained to match binary programs with their sequence of action states. To work out function scopes in a binary, this binary is first fed to a trained CAN to output its sequence of action states which can be further decoded to know the function scopes in the binary. We undertake extensive experiments to compare our proposed method with other state-of-the-art baselines. Experimental results demonstrate that our proposed method outperforms the state-of-the-art baselines in terms of predictive performance on real-world datasets which include binaries from well-known libraries. |
format | Online Article Text |
id | pubmed-7206166 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72061662020-05-08 Code Action Network for Binary Function Scope Identification Nguyen, Van Le, Trung Le, Tue Nguyen, Khanh Vel, Olivier de Montague, Paul Grundy, John Phung, Dinh Advances in Knowledge Discovery and Data Mining Article Function identification is a preliminary step in binary analysis for many applications from malware detection, common vulnerability detection and binary instrumentation to name a few. In this paper, we propose the Code Action Network (CAN) whose key idea is to encode the task of function scope identification to a sequence of three action states NI (i.e., next inclusion), NE (i.e., next exclusion), and FE (i.e., function end) to efficiently and effectively tackle function scope identification, the hardest and most crucial task in function identification. A bidirectional Recurrent Neural Network is trained to match binary programs with their sequence of action states. To work out function scopes in a binary, this binary is first fed to a trained CAN to output its sequence of action states which can be further decoded to know the function scopes in the binary. We undertake extensive experiments to compare our proposed method with other state-of-the-art baselines. Experimental results demonstrate that our proposed method outperforms the state-of-the-art baselines in terms of predictive performance on real-world datasets which include binaries from well-known libraries. 2020-04-17 /pmc/articles/PMC7206166/ http://dx.doi.org/10.1007/978-3-030-47426-3_55 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Nguyen, Van Le, Trung Le, Tue Nguyen, Khanh Vel, Olivier de Montague, Paul Grundy, John Phung, Dinh Code Action Network for Binary Function Scope Identification |
title | Code Action Network for Binary Function Scope Identification |
title_full | Code Action Network for Binary Function Scope Identification |
title_fullStr | Code Action Network for Binary Function Scope Identification |
title_full_unstemmed | Code Action Network for Binary Function Scope Identification |
title_short | Code Action Network for Binary Function Scope Identification |
title_sort | code action network for binary function scope identification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206166/ http://dx.doi.org/10.1007/978-3-030-47426-3_55 |
work_keys_str_mv | AT nguyenvan codeactionnetworkforbinaryfunctionscopeidentification AT letrung codeactionnetworkforbinaryfunctionscopeidentification AT letue codeactionnetworkforbinaryfunctionscopeidentification AT nguyenkhanh codeactionnetworkforbinaryfunctionscopeidentification AT velolivierde codeactionnetworkforbinaryfunctionscopeidentification AT montaguepaul codeactionnetworkforbinaryfunctionscopeidentification AT grundyjohn codeactionnetworkforbinaryfunctionscopeidentification AT phungdinh codeactionnetworkforbinaryfunctionscopeidentification |