Cargando…

Code Action Network for Binary Function Scope Identification

Function identification is a preliminary step in binary analysis for many applications from malware detection, common vulnerability detection and binary instrumentation to name a few. In this paper, we propose the Code Action Network (CAN) whose key idea is to encode the task of function scope ident...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Van, Le, Trung, Le, Tue, Nguyen, Khanh, Vel, Olivier de, Montague, Paul, Grundy, John, Phung, Dinh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206166/
http://dx.doi.org/10.1007/978-3-030-47426-3_55
_version_ 1783530360457396224
author Nguyen, Van
Le, Trung
Le, Tue
Nguyen, Khanh
Vel, Olivier de
Montague, Paul
Grundy, John
Phung, Dinh
author_facet Nguyen, Van
Le, Trung
Le, Tue
Nguyen, Khanh
Vel, Olivier de
Montague, Paul
Grundy, John
Phung, Dinh
author_sort Nguyen, Van
collection PubMed
description Function identification is a preliminary step in binary analysis for many applications from malware detection, common vulnerability detection and binary instrumentation to name a few. In this paper, we propose the Code Action Network (CAN) whose key idea is to encode the task of function scope identification to a sequence of three action states NI (i.e., next inclusion), NE (i.e., next exclusion), and FE (i.e., function end) to efficiently and effectively tackle function scope identification, the hardest and most crucial task in function identification. A bidirectional Recurrent Neural Network is trained to match binary programs with their sequence of action states. To work out function scopes in a binary, this binary is first fed to a trained CAN to output its sequence of action states which can be further decoded to know the function scopes in the binary. We undertake extensive experiments to compare our proposed method with other state-of-the-art baselines. Experimental results demonstrate that our proposed method outperforms the state-of-the-art baselines in terms of predictive performance on real-world datasets which include binaries from well-known libraries.
format Online
Article
Text
id pubmed-7206166
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72061662020-05-08 Code Action Network for Binary Function Scope Identification Nguyen, Van Le, Trung Le, Tue Nguyen, Khanh Vel, Olivier de Montague, Paul Grundy, John Phung, Dinh Advances in Knowledge Discovery and Data Mining Article Function identification is a preliminary step in binary analysis for many applications from malware detection, common vulnerability detection and binary instrumentation to name a few. In this paper, we propose the Code Action Network (CAN) whose key idea is to encode the task of function scope identification to a sequence of three action states NI (i.e., next inclusion), NE (i.e., next exclusion), and FE (i.e., function end) to efficiently and effectively tackle function scope identification, the hardest and most crucial task in function identification. A bidirectional Recurrent Neural Network is trained to match binary programs with their sequence of action states. To work out function scopes in a binary, this binary is first fed to a trained CAN to output its sequence of action states which can be further decoded to know the function scopes in the binary. We undertake extensive experiments to compare our proposed method with other state-of-the-art baselines. Experimental results demonstrate that our proposed method outperforms the state-of-the-art baselines in terms of predictive performance on real-world datasets which include binaries from well-known libraries. 2020-04-17 /pmc/articles/PMC7206166/ http://dx.doi.org/10.1007/978-3-030-47426-3_55 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Nguyen, Van
Le, Trung
Le, Tue
Nguyen, Khanh
Vel, Olivier de
Montague, Paul
Grundy, John
Phung, Dinh
Code Action Network for Binary Function Scope Identification
title Code Action Network for Binary Function Scope Identification
title_full Code Action Network for Binary Function Scope Identification
title_fullStr Code Action Network for Binary Function Scope Identification
title_full_unstemmed Code Action Network for Binary Function Scope Identification
title_short Code Action Network for Binary Function Scope Identification
title_sort code action network for binary function scope identification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206166/
http://dx.doi.org/10.1007/978-3-030-47426-3_55
work_keys_str_mv AT nguyenvan codeactionnetworkforbinaryfunctionscopeidentification
AT letrung codeactionnetworkforbinaryfunctionscopeidentification
AT letue codeactionnetworkforbinaryfunctionscopeidentification
AT nguyenkhanh codeactionnetworkforbinaryfunctionscopeidentification
AT velolivierde codeactionnetworkforbinaryfunctionscopeidentification
AT montaguepaul codeactionnetworkforbinaryfunctionscopeidentification
AT grundyjohn codeactionnetworkforbinaryfunctionscopeidentification
AT phungdinh codeactionnetworkforbinaryfunctionscopeidentification