Cargando…

Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling

Cybersecurity threats continue to increase and are impacting almost all aspects of modern life. Being aware of how vulnerabilities and their exploits are changing gives helpful insights into combating new threats. Applying dynamic topic modeling to a time-stamped cybersecurity document collection sh...

Descripción completa

Detalles Bibliográficos
Autores principales: Sleeman, Jennifer, Finin, Tim, Halem, Milton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275653/
https://www.ncbi.nlm.nih.gov/pubmed/34268490
http://dx.doi.org/10.3389/fdata.2021.601529
_version_ 1783721762878390272
author Sleeman, Jennifer
Finin, Tim
Halem, Milton
author_facet Sleeman, Jennifer
Finin, Tim
Halem, Milton
author_sort Sleeman, Jennifer
collection PubMed
description Cybersecurity threats continue to increase and are impacting almost all aspects of modern life. Being aware of how vulnerabilities and their exploits are changing gives helpful insights into combating new threats. Applying dynamic topic modeling to a time-stamped cybersecurity document collection shows how the significance and details of concepts found in them are evolving. We correlate two different temporal corpora, one with reports about specific exploits and the other with research-oriented papers on cybersecurity vulnerabilities and threats. We represent the documents, concepts, and dynamic topic modeling data in a semantic knowledge graph to support integration, inference, and discovery. A critical insight into discovering knowledge through topic modeling is seeding the knowledge graph with domain concepts to guide the modeling process. We use Wikipedia concepts to provide a basis for performing concept phrase extraction and show how using those phrases improves the quality of the topic models. Researchers can query the resulting knowledge graph to reveal important relations and trends. This work is novel because it uses topics as a bridge to relate documents across corpora over time.
format Online
Article
Text
id pubmed-8275653
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82756532021-07-14 Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling Sleeman, Jennifer Finin, Tim Halem, Milton Front Big Data Big Data Cybersecurity threats continue to increase and are impacting almost all aspects of modern life. Being aware of how vulnerabilities and their exploits are changing gives helpful insights into combating new threats. Applying dynamic topic modeling to a time-stamped cybersecurity document collection shows how the significance and details of concepts found in them are evolving. We correlate two different temporal corpora, one with reports about specific exploits and the other with research-oriented papers on cybersecurity vulnerabilities and threats. We represent the documents, concepts, and dynamic topic modeling data in a semantic knowledge graph to support integration, inference, and discovery. A critical insight into discovering knowledge through topic modeling is seeding the knowledge graph with domain concepts to guide the modeling process. We use Wikipedia concepts to provide a basis for performing concept phrase extraction and show how using those phrases improves the quality of the topic models. Researchers can query the resulting knowledge graph to reveal important relations and trends. This work is novel because it uses topics as a bridge to relate documents across corpora over time. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8275653/ /pubmed/34268490 http://dx.doi.org/10.3389/fdata.2021.601529 Text en Copyright © 2021 Sleeman, Finin and Halem. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Sleeman, Jennifer
Finin, Tim
Halem, Milton
Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling
title Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling
title_full Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling
title_fullStr Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling
title_full_unstemmed Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling
title_short Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling
title_sort understanding cybersecurity threat trends through dynamic topic modeling
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275653/
https://www.ncbi.nlm.nih.gov/pubmed/34268490
http://dx.doi.org/10.3389/fdata.2021.601529
work_keys_str_mv AT sleemanjennifer understandingcybersecuritythreattrendsthroughdynamictopicmodeling
AT finintim understandingcybersecuritythreattrendsthroughdynamictopicmodeling
AT halemmilton understandingcybersecuritythreattrendsthroughdynamictopicmodeling