Cargando…

Analysis of Twitter data with the Bayesian fused graphical lasso

We propose a method to simplify textual Twitter data into understandable networks of terms that can signify important events and their possible changes over time. The method allows for common characteristics of the networks across time periods and each period can comprise multiple unknown sub-networ...

Descripción completa

Detalles Bibliográficos
Autores principales: Aflakparast, Mehran, de Gunst, Mathisca, van Wieringen, Wessel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7384635/
https://www.ncbi.nlm.nih.gov/pubmed/32716924
http://dx.doi.org/10.1371/journal.pone.0235596
_version_ 1783563638432333824
author Aflakparast, Mehran
de Gunst, Mathisca
van Wieringen, Wessel
author_facet Aflakparast, Mehran
de Gunst, Mathisca
van Wieringen, Wessel
author_sort Aflakparast, Mehran
collection PubMed
description We propose a method to simplify textual Twitter data into understandable networks of terms that can signify important events and their possible changes over time. The method allows for common characteristics of the networks across time periods and each period can comprise multiple unknown sub-networks. The networks are described by Gaussian graphical models and their parameter values are estimated through a Bayesian approach with a fused lasso-type prior on the precision matrices of the underlying mixtures of the sub-models. A flexible data allocation scheme is at the heart of an MCMC algorithm to recover mean and covariance parameters of the mixture components. Several implementations of the outlined estimation procedure are studied and compared based on simulated data. The procedure with the highest predictive power is used for mining tweets regarding the 2009 Iranian presidential election.
format Online
Article
Text
id pubmed-7384635
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73846352020-08-05 Analysis of Twitter data with the Bayesian fused graphical lasso Aflakparast, Mehran de Gunst, Mathisca van Wieringen, Wessel PLoS One Research Article We propose a method to simplify textual Twitter data into understandable networks of terms that can signify important events and their possible changes over time. The method allows for common characteristics of the networks across time periods and each period can comprise multiple unknown sub-networks. The networks are described by Gaussian graphical models and their parameter values are estimated through a Bayesian approach with a fused lasso-type prior on the precision matrices of the underlying mixtures of the sub-models. A flexible data allocation scheme is at the heart of an MCMC algorithm to recover mean and covariance parameters of the mixture components. Several implementations of the outlined estimation procedure are studied and compared based on simulated data. The procedure with the highest predictive power is used for mining tweets regarding the 2009 Iranian presidential election. Public Library of Science 2020-07-27 /pmc/articles/PMC7384635/ /pubmed/32716924 http://dx.doi.org/10.1371/journal.pone.0235596 Text en © 2020 Aflakparast et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Aflakparast, Mehran
de Gunst, Mathisca
van Wieringen, Wessel
Analysis of Twitter data with the Bayesian fused graphical lasso
title Analysis of Twitter data with the Bayesian fused graphical lasso
title_full Analysis of Twitter data with the Bayesian fused graphical lasso
title_fullStr Analysis of Twitter data with the Bayesian fused graphical lasso
title_full_unstemmed Analysis of Twitter data with the Bayesian fused graphical lasso
title_short Analysis of Twitter data with the Bayesian fused graphical lasso
title_sort analysis of twitter data with the bayesian fused graphical lasso
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7384635/
https://www.ncbi.nlm.nih.gov/pubmed/32716924
http://dx.doi.org/10.1371/journal.pone.0235596
work_keys_str_mv AT aflakparastmehran analysisoftwitterdatawiththebayesianfusedgraphicallasso
AT degunstmathisca analysisoftwitterdatawiththebayesianfusedgraphicallasso
AT vanwieringenwessel analysisoftwitterdatawiththebayesianfusedgraphicallasso