Link prediction on Twitter

With over 300 million active users, Twitter is among the largest online news and social networking services in existence today. Open access to information on Twitter makes it a valuable source of data for research on social interactions, sentiment analysis, content diffusion, link prediction, and th...

Bibliographic Details
Main Authors:	Martinčić-Ipšić, Sanda, Močibob, Edvin, Perc, Matjaž
Format:	Online Article Text
Language:	English
Published:	Public Library of Science 2017
Subjects:	Research Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5515441/ https://www.ncbi.nlm.nih.gov/pubmed/28719651 http://dx.doi.org/10.1371/journal.pone.0181079

_version_	1783250993493835776
author	Martinčić-Ipšić, Sanda Močibob, Edvin Perc, Matjaž
author_facet	Martinčić-Ipšić, Sanda Močibob, Edvin Perc, Matjaž
author_sort	Martinčić-Ipšić, Sanda
collection	PubMed
description	With over 300 million active users, Twitter is among the largest online news and social networking services in existence today. Open access to information on Twitter makes it a valuable source of data for research on social interactions, sentiment analysis, content diffusion, link prediction, and the dynamics behind human collective behaviour in general. Here we use Twitter data to construct co-occurrence language networks based on hashtags and based on all the words in tweets, and we use these networks to study link prediction by means of different methods and evaluation metrics. In addition to using five known methods, we propose two effective weighted similarity measures, and we compare the obtained outcomes in dependence on the selected semantic context of topics on Twitter. We find that hashtag networks yield to a large degree equal results as all-word networks, thus supporting the claim that hashtags alone robustly capture the semantic context of tweets, and as such are useful and suitable for studying the content and categorization. We also introduce ranking diagrams as an efficient tool for the comparison of the performance of different link prediction algorithms across multiple datasets. Our research indicates that successful link prediction algorithms work well in correctly foretelling highly probable links even if the information about a network structure is incomplete, and they do so even if the semantic context is rationalized to hashtags.
format	Online Article Text
id	pubmed-5515441
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-5515441 2017-08-07 Link prediction on Twitter Martinčić-Ipšić, Sanda Močibob, Edvin Perc, Matjaž PLoS One Research Article Public Library of Science 2017-07-18 /pmc/articles/PMC5515441/ /pubmed/28719651 http://dx.doi.org/10.1371/journal.pone.0181079 Text en © 2017 Martinčić-Ipšić et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Martinčić-Ipšić, Sanda Močibob, Edvin Perc, Matjaž Link prediction on Twitter
title	Link prediction on Twitter
title_full	Link prediction on Twitter
title_fullStr	Link prediction on Twitter
title_full_unstemmed	Link prediction on Twitter
title_short	Link prediction on Twitter
title_sort	link prediction on twitter
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5515441/ https://www.ncbi.nlm.nih.gov/pubmed/28719651 http://dx.doi.org/10.1371/journal.pone.0181079
work_keys_str_mv	AT martincicipsicsanda linkpredictionontwitter AT mocibobedvin linkpredictionontwitter AT percmatjaz linkpredictionontwitter
