Cargando…

Hierarchical lifelong topic modeling using rules extracted from network communities

Topic models extract latent concepts from texts in the form of topics. Lifelong topic models extend topic models by learning topics continuously based on accumulated knowledge from the past which is updated continuously as new information becomes available. Hierarchical topic modeling extends topic...

Descripción completa

Detalles Bibliográficos
Autores principales: Khan, Muhammad Taimoor, Azam, Nouman, Khalid, Shehzad, Aziz, Furqan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8893656/
https://www.ncbi.nlm.nih.gov/pubmed/35239700
http://dx.doi.org/10.1371/journal.pone.0264481
_version_ 1784662456385142784
author Khan, Muhammad Taimoor
Azam, Nouman
Khalid, Shehzad
Aziz, Furqan
author_facet Khan, Muhammad Taimoor
Azam, Nouman
Khalid, Shehzad
Aziz, Furqan
author_sort Khan, Muhammad Taimoor
collection PubMed
description Topic models extract latent concepts from texts in the form of topics. Lifelong topic models extend topic models by learning topics continuously based on accumulated knowledge from the past which is updated continuously as new information becomes available. Hierarchical topic modeling extends topic modeling by extracting topics and organizing them into a hierarchical structure. In this study, we combine the two and introduce hierarchical lifelong topic models. Hierarchical lifelong topic models not only allow to examine the topics at different levels of granularity but also allows to continuously adjust the granularity of the topics as more information becomes available. A fundamental issue in hierarchical lifelong topic modeling is the extraction of rules that are used to preserve the hierarchical structural information among the rules and will continuously update based on new information. To address this issue, we introduce a network communities based rule mining approach for hierarchical lifelong topic models (NHLTM). The proposed approach extracts hierarchical structural information among the rules by representing textual documents as graphs and analyzing the underlying communities in the graph. Experimental results indicate improvement of the hierarchical topic structures in terms of topic coherence that increases from general to specific topics.
format Online
Article
Text
id pubmed-8893656
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-88936562022-03-04 Hierarchical lifelong topic modeling using rules extracted from network communities Khan, Muhammad Taimoor Azam, Nouman Khalid, Shehzad Aziz, Furqan PLoS One Research Article Topic models extract latent concepts from texts in the form of topics. Lifelong topic models extend topic models by learning topics continuously based on accumulated knowledge from the past which is updated continuously as new information becomes available. Hierarchical topic modeling extends topic modeling by extracting topics and organizing them into a hierarchical structure. In this study, we combine the two and introduce hierarchical lifelong topic models. Hierarchical lifelong topic models not only allow to examine the topics at different levels of granularity but also allows to continuously adjust the granularity of the topics as more information becomes available. A fundamental issue in hierarchical lifelong topic modeling is the extraction of rules that are used to preserve the hierarchical structural information among the rules and will continuously update based on new information. To address this issue, we introduce a network communities based rule mining approach for hierarchical lifelong topic models (NHLTM). The proposed approach extracts hierarchical structural information among the rules by representing textual documents as graphs and analyzing the underlying communities in the graph. Experimental results indicate improvement of the hierarchical topic structures in terms of topic coherence that increases from general to specific topics. Public Library of Science 2022-03-03 /pmc/articles/PMC8893656/ /pubmed/35239700 http://dx.doi.org/10.1371/journal.pone.0264481 Text en © 2022 Khan et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Khan, Muhammad Taimoor
Azam, Nouman
Khalid, Shehzad
Aziz, Furqan
Hierarchical lifelong topic modeling using rules extracted from network communities
title Hierarchical lifelong topic modeling using rules extracted from network communities
title_full Hierarchical lifelong topic modeling using rules extracted from network communities
title_fullStr Hierarchical lifelong topic modeling using rules extracted from network communities
title_full_unstemmed Hierarchical lifelong topic modeling using rules extracted from network communities
title_short Hierarchical lifelong topic modeling using rules extracted from network communities
title_sort hierarchical lifelong topic modeling using rules extracted from network communities
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8893656/
https://www.ncbi.nlm.nih.gov/pubmed/35239700
http://dx.doi.org/10.1371/journal.pone.0264481
work_keys_str_mv AT khanmuhammadtaimoor hierarchicallifelongtopicmodelingusingrulesextractedfromnetworkcommunities
AT azamnouman hierarchicallifelongtopicmodelingusingrulesextractedfromnetworkcommunities
AT khalidshehzad hierarchicallifelongtopicmodelingusingrulesextractedfromnetworkcommunities
AT azizfurqan hierarchicallifelongtopicmodelingusingrulesextractedfromnetworkcommunities