Cargando…

BinBench: a benchmark for x64 portable operating system interface binary function representations

In this article we propose the first multi-task benchmark for evaluating the performances of machine learning models that work on low level assembly functions. While the use of multi-task benchmark is a standard in the natural language processing (NLP) field, such practice is unknown in the field of...

Descripción completa

Detalles Bibliográficos
Autores principales:	Console, Francesca, D’Aquanno, Giuseppe, Di Luna, Giuseppe Antonio, Querzoni, Leonardo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2023
Materias:	Data Mining and Machine Learning
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280411/ https://www.ncbi.nlm.nih.gov/pubmed/37346713 http://dx.doi.org/10.7717/peerj-cs.1286

_version_	1785060788381155328
author	Console, Francesca D’Aquanno, Giuseppe Di Luna, Giuseppe Antonio Querzoni, Leonardo
author_facet	Console, Francesca D’Aquanno, Giuseppe Di Luna, Giuseppe Antonio Querzoni, Leonardo
author_sort	Console, Francesca
collection	PubMed
description	In this article we propose the first multi-task benchmark for evaluating the performances of machine learning models that work on low level assembly functions. While the use of multi-task benchmark is a standard in the natural language processing (NLP) field, such practice is unknown in the field of assembly language processing. However, in the latest years there has been a strong push in the use of deep neural networks architectures borrowed from NLP to solve problems on assembly code. A first advantage of having a standard benchmark is the one of making different works comparable without effort of reproducing third part solutions. The second advantage is the one of being able to test the generality of a machine learning model on several tasks. For these reasons, we propose BinBench, a benchmark for binary function models. The benchmark includes various binary analysis tasks, as well as a dataset of binary functions on which tasks should be solved. The dataset is publicly available and it has been evaluated using baseline models.
format	Online Article Text
id	pubmed-10280411
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-102804112023-06-21 BinBench: a benchmark for x64 portable operating system interface binary function representations Console, Francesca D’Aquanno, Giuseppe Di Luna, Giuseppe Antonio Querzoni, Leonardo PeerJ Comput Sci Data Mining and Machine Learning In this article we propose the first multi-task benchmark for evaluating the performances of machine learning models that work on low level assembly functions. While the use of multi-task benchmark is a standard in the natural language processing (NLP) field, such practice is unknown in the field of assembly language processing. However, in the latest years there has been a strong push in the use of deep neural networks architectures borrowed from NLP to solve problems on assembly code. A first advantage of having a standard benchmark is the one of making different works comparable without effort of reproducing third part solutions. The second advantage is the one of being able to test the generality of a machine learning model on several tasks. For these reasons, we propose BinBench, a benchmark for binary function models. The benchmark includes various binary analysis tasks, as well as a dataset of binary functions on which tasks should be solved. The dataset is publicly available and it has been evaluated using baseline models. PeerJ Inc. 2023-06-01 /pmc/articles/PMC10280411/ /pubmed/37346713 http://dx.doi.org/10.7717/peerj-cs.1286 Text en © 2023 Console et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Data Mining and Machine Learning Console, Francesca D’Aquanno, Giuseppe Di Luna, Giuseppe Antonio Querzoni, Leonardo BinBench: a benchmark for x64 portable operating system interface binary function representations
title	BinBench: a benchmark for x64 portable operating system interface binary function representations
title_full	BinBench: a benchmark for x64 portable operating system interface binary function representations
title_fullStr	BinBench: a benchmark for x64 portable operating system interface binary function representations
title_full_unstemmed	BinBench: a benchmark for x64 portable operating system interface binary function representations
title_short	BinBench: a benchmark for x64 portable operating system interface binary function representations
title_sort	binbench: a benchmark for x64 portable operating system interface binary function representations
topic	Data Mining and Machine Learning
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280411/ https://www.ncbi.nlm.nih.gov/pubmed/37346713 http://dx.doi.org/10.7717/peerj-cs.1286
work_keys_str_mv	AT consolefrancesca binbenchabenchmarkforx64portableoperatingsysteminterfacebinaryfunctionrepresentations AT daquannogiuseppe binbenchabenchmarkforx64portableoperatingsysteminterfacebinaryfunctionrepresentations AT dilunagiuseppeantonio binbenchabenchmarkforx64portableoperatingsysteminterfacebinaryfunctionrepresentations AT querzonileonardo binbenchabenchmarkforx64portableoperatingsysteminterfacebinaryfunctionrepresentations

BinBench: a benchmark for x64 portable operating system interface binary function representations

Ejemplares similares