Cargando…

Merging Datasets of CyberSecurity Incidents for Fun and Insight

Providing an adequate assessment of their cyber-security posture requires companies and organisations to collect information about threats from a wide range of sources. One of such sources is history, intended as the knowledge about past cyber-security incidents, their size, type of attacks, industr...

Descripción completa

Detalles Bibliográficos
Autores principales: Abbiati, Giovanni, Ranise, Silvio, Schizzerotto, Antonio, Siena, Alberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931890/
https://www.ncbi.nlm.nih.gov/pubmed/33693409
http://dx.doi.org/10.3389/fdata.2020.521132
_version_ 1783660376113545216
author Abbiati, Giovanni
Ranise, Silvio
Schizzerotto, Antonio
Siena, Alberto
author_facet Abbiati, Giovanni
Ranise, Silvio
Schizzerotto, Antonio
Siena, Alberto
author_sort Abbiati, Giovanni
collection PubMed
description Providing an adequate assessment of their cyber-security posture requires companies and organisations to collect information about threats from a wide range of sources. One of such sources is history, intended as the knowledge about past cyber-security incidents, their size, type of attacks, industry sector and so on. Ideally, having a large enough dataset of past security incidents, it would be possible to analyze it with automated tools and draw conclusions that may help in preventing future incidents. Unfortunately, it seems that there are only a few publicly available datasets of this kind that are of good quality. The paper reports our initial efforts in collecting all publicly available security incidents datasets, and building a single, large dataset that can be used to draw statistically significant observations. In order to argue about its statistical quality, we analyze the resulting combined dataset against the original ones. Additionally, we perform an analysis of the combined dataset and compare our results with the existing literature. Finally, we present our findings, discuss the limitations of the proposed approach, and point out interesting research directions.
format Online
Article
Text
id pubmed-7931890
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79318902021-03-09 Merging Datasets of CyberSecurity Incidents for Fun and Insight Abbiati, Giovanni Ranise, Silvio Schizzerotto, Antonio Siena, Alberto Front Big Data Big Data Providing an adequate assessment of their cyber-security posture requires companies and organisations to collect information about threats from a wide range of sources. One of such sources is history, intended as the knowledge about past cyber-security incidents, their size, type of attacks, industry sector and so on. Ideally, having a large enough dataset of past security incidents, it would be possible to analyze it with automated tools and draw conclusions that may help in preventing future incidents. Unfortunately, it seems that there are only a few publicly available datasets of this kind that are of good quality. The paper reports our initial efforts in collecting all publicly available security incidents datasets, and building a single, large dataset that can be used to draw statistically significant observations. In order to argue about its statistical quality, we analyze the resulting combined dataset against the original ones. Additionally, we perform an analysis of the combined dataset and compare our results with the existing literature. Finally, we present our findings, discuss the limitations of the proposed approach, and point out interesting research directions. Frontiers Media S.A. 2021-01-26 /pmc/articles/PMC7931890/ /pubmed/33693409 http://dx.doi.org/10.3389/fdata.2020.521132 Text en Copyright © 2021 Abbiati, Ranise, Schizzerotto and Siena. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Abbiati, Giovanni
Ranise, Silvio
Schizzerotto, Antonio
Siena, Alberto
Merging Datasets of CyberSecurity Incidents for Fun and Insight
title Merging Datasets of CyberSecurity Incidents for Fun and Insight
title_full Merging Datasets of CyberSecurity Incidents for Fun and Insight
title_fullStr Merging Datasets of CyberSecurity Incidents for Fun and Insight
title_full_unstemmed Merging Datasets of CyberSecurity Incidents for Fun and Insight
title_short Merging Datasets of CyberSecurity Incidents for Fun and Insight
title_sort merging datasets of cybersecurity incidents for fun and insight
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931890/
https://www.ncbi.nlm.nih.gov/pubmed/33693409
http://dx.doi.org/10.3389/fdata.2020.521132
work_keys_str_mv AT abbiatigiovanni mergingdatasetsofcybersecurityincidentsforfunandinsight
AT ranisesilvio mergingdatasetsofcybersecurityincidentsforfunandinsight
AT schizzerottoantonio mergingdatasetsofcybersecurityincidentsforfunandinsight
AT sienaalberto mergingdatasetsofcybersecurityincidentsforfunandinsight