Cargando…

Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction

Software security is a very important aspect for software development organizations who wish to provide high-quality and dependable software to their consumers. A crucial part of software security is the early detection of software vulnerabilities. Vulnerability prediction is a mechanism that facili...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kalouptsoglou, Ilias, Siavvas, Miltiadis, Kehagias, Dionysios, Chatzigeorgiou, Alexandros, Ampatzoglou, Apostolos
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9140602/ https://www.ncbi.nlm.nih.gov/pubmed/35626536 http://dx.doi.org/10.3390/e24050651

_version_	1784715137822752768
author	Kalouptsoglou, Ilias Siavvas, Miltiadis Kehagias, Dionysios Chatzigeorgiou, Alexandros Ampatzoglou, Apostolos
author_facet	Kalouptsoglou, Ilias Siavvas, Miltiadis Kehagias, Dionysios Chatzigeorgiou, Alexandros Ampatzoglou, Apostolos
author_sort	Kalouptsoglou, Ilias
collection	PubMed
description	Software security is a very important aspect for software development organizations who wish to provide high-quality and dependable software to their consumers. A crucial part of software security is the early detection of software vulnerabilities. Vulnerability prediction is a mechanism that facilitates the identification (and, in turn, the mitigation) of vulnerabilities early enough during the software development cycle. The scientific community has recently focused a lot of attention on developing Deep Learning models using text mining techniques for predicting the existence of vulnerabilities in software components. However, there are also studies that examine whether the utilization of statically extracted software metrics can lead to adequate Vulnerability Prediction Models. In this paper, both software metrics- and text mining-based Vulnerability Prediction Models are constructed and compared. A combination of software metrics and text tokens using deep-learning models is examined as well in order to investigate if a combined model can lead to more accurate vulnerability prediction. For the purposes of the present study, a vulnerability dataset containing vulnerabilities from real-world software products is utilized and extended. The results of our analysis indicate that text mining-based models outperform software metrics-based models with respect to their F(2)-score, whereas enriching the text mining-based models with software metrics was not found to provide any added value to their predictive performance.
format	Online Article Text
id	pubmed-9140602
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-91406022022-05-28 Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction Kalouptsoglou, Ilias Siavvas, Miltiadis Kehagias, Dionysios Chatzigeorgiou, Alexandros Ampatzoglou, Apostolos Entropy (Basel) Article Software security is a very important aspect for software development organizations who wish to provide high-quality and dependable software to their consumers. A crucial part of software security is the early detection of software vulnerabilities. Vulnerability prediction is a mechanism that facilitates the identification (and, in turn, the mitigation) of vulnerabilities early enough during the software development cycle. The scientific community has recently focused a lot of attention on developing Deep Learning models using text mining techniques for predicting the existence of vulnerabilities in software components. However, there are also studies that examine whether the utilization of statically extracted software metrics can lead to adequate Vulnerability Prediction Models. In this paper, both software metrics- and text mining-based Vulnerability Prediction Models are constructed and compared. A combination of software metrics and text tokens using deep-learning models is examined as well in order to investigate if a combined model can lead to more accurate vulnerability prediction. For the purposes of the present study, a vulnerability dataset containing vulnerabilities from real-world software products is utilized and extended. The results of our analysis indicate that text mining-based models outperform software metrics-based models with respect to their F(2)-score, whereas enriching the text mining-based models with software metrics was not found to provide any added value to their predictive performance. MDPI 2022-05-05 /pmc/articles/PMC9140602/ /pubmed/35626536 http://dx.doi.org/10.3390/e24050651 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Kalouptsoglou, Ilias Siavvas, Miltiadis Kehagias, Dionysios Chatzigeorgiou, Alexandros Ampatzoglou, Apostolos Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction
title	Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction
title_full	Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction
title_fullStr	Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction
title_full_unstemmed	Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction
title_short	Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction
title_sort	examining the capacity of text mining and software metrics in vulnerability prediction
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9140602/ https://www.ncbi.nlm.nih.gov/pubmed/35626536 http://dx.doi.org/10.3390/e24050651
work_keys_str_mv	AT kalouptsoglouilias examiningthecapacityoftextminingandsoftwaremetricsinvulnerabilityprediction AT siavvasmiltiadis examiningthecapacityoftextminingandsoftwaremetricsinvulnerabilityprediction AT kehagiasdionysios examiningthecapacityoftextminingandsoftwaremetricsinvulnerabilityprediction AT chatzigeorgioualexandros examiningthecapacityoftextminingandsoftwaremetricsinvulnerabilityprediction AT ampatzoglouapostolos examiningthecapacityoftextminingandsoftwaremetricsinvulnerabilityprediction

Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction

Ejemplares similares