Cargando…
Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling
Cybersecurity threats continue to increase and are impacting almost all aspects of modern life. Being aware of how vulnerabilities and their exploits are changing gives helpful insights into combating new threats. Applying dynamic topic modeling to a time-stamped cybersecurity document collection sh...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275653/ https://www.ncbi.nlm.nih.gov/pubmed/34268490 http://dx.doi.org/10.3389/fdata.2021.601529 |
_version_ | 1783721762878390272 |
---|---|
author | Sleeman, Jennifer Finin, Tim Halem, Milton |
author_facet | Sleeman, Jennifer Finin, Tim Halem, Milton |
author_sort | Sleeman, Jennifer |
collection | PubMed |
description | Cybersecurity threats continue to increase and are impacting almost all aspects of modern life. Being aware of how vulnerabilities and their exploits are changing gives helpful insights into combating new threats. Applying dynamic topic modeling to a time-stamped cybersecurity document collection shows how the significance and details of concepts found in them are evolving. We correlate two different temporal corpora, one with reports about specific exploits and the other with research-oriented papers on cybersecurity vulnerabilities and threats. We represent the documents, concepts, and dynamic topic modeling data in a semantic knowledge graph to support integration, inference, and discovery. A critical insight into discovering knowledge through topic modeling is seeding the knowledge graph with domain concepts to guide the modeling process. We use Wikipedia concepts to provide a basis for performing concept phrase extraction and show how using those phrases improves the quality of the topic models. Researchers can query the resulting knowledge graph to reveal important relations and trends. This work is novel because it uses topics as a bridge to relate documents across corpora over time. |
format | Online Article Text |
id | pubmed-8275653 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82756532021-07-14 Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling Sleeman, Jennifer Finin, Tim Halem, Milton Front Big Data Big Data Cybersecurity threats continue to increase and are impacting almost all aspects of modern life. Being aware of how vulnerabilities and their exploits are changing gives helpful insights into combating new threats. Applying dynamic topic modeling to a time-stamped cybersecurity document collection shows how the significance and details of concepts found in them are evolving. We correlate two different temporal corpora, one with reports about specific exploits and the other with research-oriented papers on cybersecurity vulnerabilities and threats. We represent the documents, concepts, and dynamic topic modeling data in a semantic knowledge graph to support integration, inference, and discovery. A critical insight into discovering knowledge through topic modeling is seeding the knowledge graph with domain concepts to guide the modeling process. We use Wikipedia concepts to provide a basis for performing concept phrase extraction and show how using those phrases improves the quality of the topic models. Researchers can query the resulting knowledge graph to reveal important relations and trends. This work is novel because it uses topics as a bridge to relate documents across corpora over time. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8275653/ /pubmed/34268490 http://dx.doi.org/10.3389/fdata.2021.601529 Text en Copyright © 2021 Sleeman, Finin and Halem. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Sleeman, Jennifer Finin, Tim Halem, Milton Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling |
title | Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling |
title_full | Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling |
title_fullStr | Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling |
title_full_unstemmed | Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling |
title_short | Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling |
title_sort | understanding cybersecurity threat trends through dynamic topic modeling |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275653/ https://www.ncbi.nlm.nih.gov/pubmed/34268490 http://dx.doi.org/10.3389/fdata.2021.601529 |
work_keys_str_mv | AT sleemanjennifer understandingcybersecuritythreattrendsthroughdynamictopicmodeling AT finintim understandingcybersecuritythreattrendsthroughdynamictopicmodeling AT halemmilton understandingcybersecuritythreattrendsthroughdynamictopicmodeling |