Cargando…

Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling

The article presents a discriminative approach to complement the unsupervised probabilistic nature of topic modelling. The framework transforms the probabilities of the topics per document into class-dependent deep learning models that extract highly discriminatory features suitable for classificati...

Descripción completa

Detalles Bibliográficos
Autores principales:	Al Moubayed, Noura, McGough, Stephen, Awwad Shiekh Hasan, Bashar
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2020
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924555/ https://www.ncbi.nlm.nih.gov/pubmed/33816904 http://dx.doi.org/10.7717/peerj-cs.252

_version_	1783659113565126656
author	Al Moubayed, Noura McGough, Stephen Awwad Shiekh Hasan, Bashar
author_facet	Al Moubayed, Noura McGough, Stephen Awwad Shiekh Hasan, Bashar
author_sort	Al Moubayed, Noura
collection	PubMed
description	The article presents a discriminative approach to complement the unsupervised probabilistic nature of topic modelling. The framework transforms the probabilities of the topics per document into class-dependent deep learning models that extract highly discriminatory features suitable for classification. The framework is then used for sentiment analysis with minimum feature engineering. The approach transforms the sentiment analysis problem from the word/document domain to the topics domain making it more robust to noise and incorporating complex contextual information that are not represented otherwise. A stacked denoising autoencoder (SDA) is then used to model the complex relationship among the topics per sentiment with minimum assumptions. To achieve this, a distinct topic model and SDA per sentiment polarity is built with an additional decision layer for classification. The framework is tested on a comprehensive collection of benchmark datasets that vary in sample size, class bias and classification task. A significant improvement to the state of the art is achieved without the need for a sentiment lexica or over-engineered features. A further analysis is carried out to explain the observed improvement in accuracy.
format	Online Article Text
id	pubmed-7924555
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-79245552021-04-02 Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling Al Moubayed, Noura McGough, Stephen Awwad Shiekh Hasan, Bashar PeerJ Comput Sci Artificial Intelligence The article presents a discriminative approach to complement the unsupervised probabilistic nature of topic modelling. The framework transforms the probabilities of the topics per document into class-dependent deep learning models that extract highly discriminatory features suitable for classification. The framework is then used for sentiment analysis with minimum feature engineering. The approach transforms the sentiment analysis problem from the word/document domain to the topics domain making it more robust to noise and incorporating complex contextual information that are not represented otherwise. A stacked denoising autoencoder (SDA) is then used to model the complex relationship among the topics per sentiment with minimum assumptions. To achieve this, a distinct topic model and SDA per sentiment polarity is built with an additional decision layer for classification. The framework is tested on a comprehensive collection of benchmark datasets that vary in sample size, class bias and classification task. A significant improvement to the state of the art is achieved without the need for a sentiment lexica or over-engineered features. A further analysis is carried out to explain the observed improvement in accuracy. PeerJ Inc. 2020-01-27 /pmc/articles/PMC7924555/ /pubmed/33816904 http://dx.doi.org/10.7717/peerj-cs.252 Text en © 2020 Al Moubayed et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Artificial Intelligence Al Moubayed, Noura McGough, Stephen Awwad Shiekh Hasan, Bashar Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling
title	Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling
title_full	Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling
title_fullStr	Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling
title_full_unstemmed	Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling
title_short	Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling
title_sort	beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924555/ https://www.ncbi.nlm.nih.gov/pubmed/33816904 http://dx.doi.org/10.7717/peerj-cs.252
work_keys_str_mv	AT almoubayednoura beyondthetopicshowdeeplearningcanimprovethediscriminabilityofprobabilistictopicmodelling AT mcgoughstephen beyondthetopicshowdeeplearningcanimprovethediscriminabilityofprobabilistictopicmodelling AT awwadshiekhhasanbashar beyondthetopicshowdeeplearningcanimprovethediscriminabilityofprobabilistictopicmodelling

Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling

Ejemplares similares