Cargando…

Annotating the human genome with Disease Ontology

BACKGROUND: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. RESULTS: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the Gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Osborne, John D, Flatow, Jared, Holko, Michelle, Lin, Simon M, Kibbe, Warren A, Zhu, Lihua (Julie), Danila, Maria I, Feng, Gang, Chisholm, Rex L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2709267/
https://www.ncbi.nlm.nih.gov/pubmed/19594883
http://dx.doi.org/10.1186/1471-2164-10-S1-S6
_version_ 1782169287172030464
author Osborne, John D
Flatow, Jared
Holko, Michelle
Lin, Simon M
Kibbe, Warren A
Zhu, Lihua (Julie)
Danila, Maria I
Feng, Gang
Chisholm, Rex L
author_facet Osborne, John D
Flatow, Jared
Holko, Michelle
Lin, Simon M
Kibbe, Warren A
Zhu, Lihua (Julie)
Danila, Maria I
Feng, Gang
Chisholm, Rex L
author_sort Osborne, John D
collection PubMed
description BACKGROUND: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. RESULTS: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. CONCLUSION: The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome.
format Text
id pubmed-2709267
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27092672009-07-14 Annotating the human genome with Disease Ontology Osborne, John D Flatow, Jared Holko, Michelle Lin, Simon M Kibbe, Warren A Zhu, Lihua (Julie) Danila, Maria I Feng, Gang Chisholm, Rex L BMC Genomics Research BACKGROUND: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. RESULTS: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. CONCLUSION: The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome. BioMed Central 2009-07-07 /pmc/articles/PMC2709267/ /pubmed/19594883 http://dx.doi.org/10.1186/1471-2164-10-S1-S6 Text en Copyright © 2009 Osborne et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Osborne, John D
Flatow, Jared
Holko, Michelle
Lin, Simon M
Kibbe, Warren A
Zhu, Lihua (Julie)
Danila, Maria I
Feng, Gang
Chisholm, Rex L
Annotating the human genome with Disease Ontology
title Annotating the human genome with Disease Ontology
title_full Annotating the human genome with Disease Ontology
title_fullStr Annotating the human genome with Disease Ontology
title_full_unstemmed Annotating the human genome with Disease Ontology
title_short Annotating the human genome with Disease Ontology
title_sort annotating the human genome with disease ontology
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2709267/
https://www.ncbi.nlm.nih.gov/pubmed/19594883
http://dx.doi.org/10.1186/1471-2164-10-S1-S6
work_keys_str_mv AT osbornejohnd annotatingthehumangenomewithdiseaseontology
AT flatowjared annotatingthehumangenomewithdiseaseontology
AT holkomichelle annotatingthehumangenomewithdiseaseontology
AT linsimonm annotatingthehumangenomewithdiseaseontology
AT kibbewarrena annotatingthehumangenomewithdiseaseontology
AT zhulihuajulie annotatingthehumangenomewithdiseaseontology
AT danilamariai annotatingthehumangenomewithdiseaseontology
AT fenggang annotatingthehumangenomewithdiseaseontology
AT chisholmrexl annotatingthehumangenomewithdiseaseontology