Cargando…

Genotator: A disease-agnostic tool for genetic annotation of disease

BACKGROUND: Disease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources di...

Descripción completa

Detalles Bibliográficos
Autores principales: Wall, Dennis P, Pivovarov, Rimma, Tong, Mark, Jung, Jae-Yoon, Fusaro, Vincent A, DeLuca, Todd F, Tonellato, Peter J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2990725/
https://www.ncbi.nlm.nih.gov/pubmed/21034472
http://dx.doi.org/10.1186/1755-8794-3-50
_version_ 1782192503629283328
author Wall, Dennis P
Pivovarov, Rimma
Tong, Mark
Jung, Jae-Yoon
Fusaro, Vincent A
DeLuca, Todd F
Tonellato, Peter J
author_facet Wall, Dennis P
Pivovarov, Rimma
Tong, Mark
Jung, Jae-Yoon
Fusaro, Vincent A
DeLuca, Todd F
Tonellato, Peter J
author_sort Wall, Dennis P
collection PubMed
description BACKGROUND: Disease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources differ dramatically in their disease coverage and genetic depth. With few exceptions, researchers must manually search a variety of sites to assemble a complete set of genetic evidence for a particular disease of interest, a process that is both time-consuming and error-prone. METHODS: We designed a real-time aggregation tool that provides both comprehensive coverage and reliable gene-to-disease rankings for any disease. Our tool, called Genotator, automatically integrates data from 11 externally accessible clinical genetics resources and uses these data in a straightforward formula to rank genes in order of disease relevance. We tested the accuracy of coverage of Genotator in three separate diseases for which there exist specialty curated databases, Autism Spectrum Disorder, Parkinson's Disease, and Alzheimer Disease. Genotator is freely available at http://genotator.hms.harvard.edu. RESULTS: Genotator demonstrated that most of the 11 selected databases contain unique information about the genetic composition of disease, with 2514 genes found in only one of the 11 databases. These findings confirm that the integration of these databases provides a more complete picture than would be possible from any one database alone. Genotator successfully identified at least 75% of the top ranked genes for all three of our use cases, including a 90% concordance with the top 40 ranked candidates for Alzheimer Disease. CONCLUSIONS: As a meta-query engine, Genotator provides high coverage of both historical genetic research as well as recent advances in the genetic understanding of specific diseases. As such, Genotator provides a real-time aggregation of ranked data that remains current with the pace of research in the disease fields. Genotator's algorithm appropriately transforms query terms to match the input requirements of each targeted databases and accurately resolves named synonyms to ensure full coverage of the genetic results with official nomenclature. Genotator generates an excel-style output that is consistent across disease queries and readily importable to other applications.
format Text
id pubmed-2990725
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29907252010-12-13 Genotator: A disease-agnostic tool for genetic annotation of disease Wall, Dennis P Pivovarov, Rimma Tong, Mark Jung, Jae-Yoon Fusaro, Vincent A DeLuca, Todd F Tonellato, Peter J BMC Med Genomics Software BACKGROUND: Disease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources differ dramatically in their disease coverage and genetic depth. With few exceptions, researchers must manually search a variety of sites to assemble a complete set of genetic evidence for a particular disease of interest, a process that is both time-consuming and error-prone. METHODS: We designed a real-time aggregation tool that provides both comprehensive coverage and reliable gene-to-disease rankings for any disease. Our tool, called Genotator, automatically integrates data from 11 externally accessible clinical genetics resources and uses these data in a straightforward formula to rank genes in order of disease relevance. We tested the accuracy of coverage of Genotator in three separate diseases for which there exist specialty curated databases, Autism Spectrum Disorder, Parkinson's Disease, and Alzheimer Disease. Genotator is freely available at http://genotator.hms.harvard.edu. RESULTS: Genotator demonstrated that most of the 11 selected databases contain unique information about the genetic composition of disease, with 2514 genes found in only one of the 11 databases. These findings confirm that the integration of these databases provides a more complete picture than would be possible from any one database alone. Genotator successfully identified at least 75% of the top ranked genes for all three of our use cases, including a 90% concordance with the top 40 ranked candidates for Alzheimer Disease. CONCLUSIONS: As a meta-query engine, Genotator provides high coverage of both historical genetic research as well as recent advances in the genetic understanding of specific diseases. As such, Genotator provides a real-time aggregation of ranked data that remains current with the pace of research in the disease fields. Genotator's algorithm appropriately transforms query terms to match the input requirements of each targeted databases and accurately resolves named synonyms to ensure full coverage of the genetic results with official nomenclature. Genotator generates an excel-style output that is consistent across disease queries and readily importable to other applications. BioMed Central 2010-10-29 /pmc/articles/PMC2990725/ /pubmed/21034472 http://dx.doi.org/10.1186/1755-8794-3-50 Text en Copyright ©2010 Wall et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Wall, Dennis P
Pivovarov, Rimma
Tong, Mark
Jung, Jae-Yoon
Fusaro, Vincent A
DeLuca, Todd F
Tonellato, Peter J
Genotator: A disease-agnostic tool for genetic annotation of disease
title Genotator: A disease-agnostic tool for genetic annotation of disease
title_full Genotator: A disease-agnostic tool for genetic annotation of disease
title_fullStr Genotator: A disease-agnostic tool for genetic annotation of disease
title_full_unstemmed Genotator: A disease-agnostic tool for genetic annotation of disease
title_short Genotator: A disease-agnostic tool for genetic annotation of disease
title_sort genotator: a disease-agnostic tool for genetic annotation of disease
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2990725/
https://www.ncbi.nlm.nih.gov/pubmed/21034472
http://dx.doi.org/10.1186/1755-8794-3-50
work_keys_str_mv AT walldennisp genotatoradiseaseagnostictoolforgeneticannotationofdisease
AT pivovarovrimma genotatoradiseaseagnostictoolforgeneticannotationofdisease
AT tongmark genotatoradiseaseagnostictoolforgeneticannotationofdisease
AT jungjaeyoon genotatoradiseaseagnostictoolforgeneticannotationofdisease
AT fusarovincenta genotatoradiseaseagnostictoolforgeneticannotationofdisease
AT delucatoddf genotatoradiseaseagnostictoolforgeneticannotationofdisease
AT tonellatopeterj genotatoradiseaseagnostictoolforgeneticannotationofdisease