Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature

Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Fingerprints were used to evaluate the association betw...

Bibliographic Details
Main Authors: Chen, Guocai, Zhao, Jieyi, Cohen, Trevor, Tao, Cui, Sun, Jingchun, Xu, Hua, Bernstam, Elmer V., Lawson, Andrew, Zeng, Jia, Johnson, Amber M., Holla, Vijaykumar, Bailey, Ann M., Lara-Guerra, Humberto, Litzenburger, Beate, Meric-Bernstam, Funda, Jim Zheng, W.
Format: Online Article Text
Language: English
Published: Oxford University Press 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4390608/
https://www.ncbi.nlm.nih.gov/pubmed/25858285
http://dx.doi.org/10.1093/database/bav034
author Chen, Guocai
Zhao, Jieyi
Cohen, Trevor
Tao, Cui
Sun, Jingchun
Xu, Hua
Bernstam, Elmer V.
Lawson, Andrew
Zeng, Jia
Johnson, Amber M.
Holla, Vijaykumar
Bailey, Ann M.
Lara-Guerra, Humberto
Litzenburger, Beate
Meric-Bernstam, Funda
Jim Zheng, W.
collection PubMed
description Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Fingerprints were used to evaluate the association between genes and biomedical literature to disambiguate gene names. We obtained 93.6% precision for the test gene set and 80.4% for the area under a receiver-operating characteristics curve for gene and article association. The core algorithm was implemented using a graphics processing unit-based MapReduce framework to handle big data and to improve performance. We conclude that Ontology Fingerprints can help disambiguate gene names mentioned in text and analyse the association between genes and articles. Database URL: http://www.ontologyfingerprint.org
format Online
Article
Text
id pubmed-4390608
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-4390608 2015-04-09 Database (Oxford) Original Article Oxford University Press 2015-04-08 Text en © The Author(s) 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature
topic Original Article