Cargando…

Discovering Implicit Entity Relation with the Gene-Citation-Gene Network

In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connect...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Min, Han, Nam-Gi, Kim, Yong-Hwan, Ding, Ying, Chambers, Tamy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3866152/
https://www.ncbi.nlm.nih.gov/pubmed/24358368
http://dx.doi.org/10.1371/journal.pone.0084639
_version_ 1782296118663577600
author Song, Min
Han, Nam-Gi
Kim, Yong-Hwan
Ding, Ying
Chambers, Tamy
author_facet Song, Min
Han, Nam-Gi
Kim, Yong-Hwan
Ding, Ying
Chambers, Tamy
author_sort Song, Min
collection PubMed
description In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner.
format Online
Article
Text
id pubmed-3866152
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38661522013-12-19 Discovering Implicit Entity Relation with the Gene-Citation-Gene Network Song, Min Han, Nam-Gi Kim, Yong-Hwan Ding, Ying Chambers, Tamy PLoS One Research Article In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner. Public Library of Science 2013-12-17 /pmc/articles/PMC3866152/ /pubmed/24358368 http://dx.doi.org/10.1371/journal.pone.0084639 Text en © 2013 Song et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Song, Min
Han, Nam-Gi
Kim, Yong-Hwan
Ding, Ying
Chambers, Tamy
Discovering Implicit Entity Relation with the Gene-Citation-Gene Network
title Discovering Implicit Entity Relation with the Gene-Citation-Gene Network
title_full Discovering Implicit Entity Relation with the Gene-Citation-Gene Network
title_fullStr Discovering Implicit Entity Relation with the Gene-Citation-Gene Network
title_full_unstemmed Discovering Implicit Entity Relation with the Gene-Citation-Gene Network
title_short Discovering Implicit Entity Relation with the Gene-Citation-Gene Network
title_sort discovering implicit entity relation with the gene-citation-gene network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3866152/
https://www.ncbi.nlm.nih.gov/pubmed/24358368
http://dx.doi.org/10.1371/journal.pone.0084639
work_keys_str_mv AT songmin discoveringimplicitentityrelationwiththegenecitationgenenetwork
AT hannamgi discoveringimplicitentityrelationwiththegenecitationgenenetwork
AT kimyonghwan discoveringimplicitentityrelationwiththegenecitationgenenetwork
AT dingying discoveringimplicitentityrelationwiththegenecitationgenenetwork
AT chamberstamy discoveringimplicitentityrelationwiththegenecitationgenenetwork