Cargando…

Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources

A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize relevant entities in them, and extract time-varying networks. The nodes...

Descripción completa

Detalles Bibliográficos
Autores principales: Popović, Marko, Štefančić, Hrvoje, Sluban, Borut, Kralj Novak, Petra, Grčar, Miha, Mozetič, Igor, Puliga, Michelangelo, Zlatić, Vinko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4254290/
https://www.ncbi.nlm.nih.gov/pubmed/25470498
http://dx.doi.org/10.1371/journal.pone.0099515
_version_ 1782347331227615232
author Popović, Marko
Štefančić, Hrvoje
Sluban, Borut
Kralj Novak, Petra
Grčar, Miha
Mozetič, Igor
Puliga, Michelangelo
Zlatić, Vinko
author_facet Popović, Marko
Štefančić, Hrvoje
Sluban, Borut
Kralj Novak, Petra
Grčar, Miha
Mozetič, Igor
Puliga, Michelangelo
Zlatić, Vinko
author_sort Popović, Marko
collection PubMed
description A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize relevant entities in them, and extract time-varying networks. The nodes of the network are the entities, and the links are their co-occurrences. We present a method to estimate the significance of co-occurrences, and a benchmark model against which their robustness is evaluated. The approach is applied to a large set of financial news, collected over a period of two years. The entities we consider are 50 countries which issue sovereign bonds, and which are insured by Credit Default Swaps (CDS) in turn. We compare the country co-occurrence networks to the CDS networks constructed from the correlations between the CDS. The results show relatively small, but significant overlap between the networks extracted from the news and those from the CDS correlations.
format Online
Article
Text
id pubmed-4254290
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42542902014-12-11 Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources Popović, Marko Štefančić, Hrvoje Sluban, Borut Kralj Novak, Petra Grčar, Miha Mozetič, Igor Puliga, Michelangelo Zlatić, Vinko PLoS One Research Article A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize relevant entities in them, and extract time-varying networks. The nodes of the network are the entities, and the links are their co-occurrences. We present a method to estimate the significance of co-occurrences, and a benchmark model against which their robustness is evaluated. The approach is applied to a large set of financial news, collected over a period of two years. The entities we consider are 50 countries which issue sovereign bonds, and which are insured by Credit Default Swaps (CDS) in turn. We compare the country co-occurrence networks to the CDS networks constructed from the correlations between the CDS. The results show relatively small, but significant overlap between the networks extracted from the news and those from the CDS correlations. Public Library of Science 2014-12-03 /pmc/articles/PMC4254290/ /pubmed/25470498 http://dx.doi.org/10.1371/journal.pone.0099515 Text en © 2014 Popović et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Popović, Marko
Štefančić, Hrvoje
Sluban, Borut
Kralj Novak, Petra
Grčar, Miha
Mozetič, Igor
Puliga, Michelangelo
Zlatić, Vinko
Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
title Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
title_full Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
title_fullStr Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
title_full_unstemmed Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
title_short Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
title_sort extraction of temporal networks from term co-occurrences in online textual sources
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4254290/
https://www.ncbi.nlm.nih.gov/pubmed/25470498
http://dx.doi.org/10.1371/journal.pone.0099515
work_keys_str_mv AT popovicmarko extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources
AT stefancichrvoje extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources
AT slubanborut extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources
AT kraljnovakpetra extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources
AT grcarmiha extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources
AT mozeticigor extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources
AT puligamichelangelo extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources
AT zlaticvinko extractionoftemporalnetworksfromtermcooccurrencesinonlinetextualsources