Cargando…
Genotator: A disease-agnostic tool for genetic annotation of disease
BACKGROUND: Disease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources di...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2990725/ https://www.ncbi.nlm.nih.gov/pubmed/21034472 http://dx.doi.org/10.1186/1755-8794-3-50 |
_version_ | 1782192503629283328 |
---|---|
author | Wall, Dennis P Pivovarov, Rimma Tong, Mark Jung, Jae-Yoon Fusaro, Vincent A DeLuca, Todd F Tonellato, Peter J |
author_facet | Wall, Dennis P Pivovarov, Rimma Tong, Mark Jung, Jae-Yoon Fusaro, Vincent A DeLuca, Todd F Tonellato, Peter J |
author_sort | Wall, Dennis P |
collection | PubMed |
description | BACKGROUND: Disease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources differ dramatically in their disease coverage and genetic depth. With few exceptions, researchers must manually search a variety of sites to assemble a complete set of genetic evidence for a particular disease of interest, a process that is both time-consuming and error-prone. METHODS: We designed a real-time aggregation tool that provides both comprehensive coverage and reliable gene-to-disease rankings for any disease. Our tool, called Genotator, automatically integrates data from 11 externally accessible clinical genetics resources and uses these data in a straightforward formula to rank genes in order of disease relevance. We tested the accuracy of coverage of Genotator in three separate diseases for which there exist specialty curated databases, Autism Spectrum Disorder, Parkinson's Disease, and Alzheimer Disease. Genotator is freely available at http://genotator.hms.harvard.edu. RESULTS: Genotator demonstrated that most of the 11 selected databases contain unique information about the genetic composition of disease, with 2514 genes found in only one of the 11 databases. These findings confirm that the integration of these databases provides a more complete picture than would be possible from any one database alone. Genotator successfully identified at least 75% of the top ranked genes for all three of our use cases, including a 90% concordance with the top 40 ranked candidates for Alzheimer Disease. CONCLUSIONS: As a meta-query engine, Genotator provides high coverage of both historical genetic research as well as recent advances in the genetic understanding of specific diseases. As such, Genotator provides a real-time aggregation of ranked data that remains current with the pace of research in the disease fields. Genotator's algorithm appropriately transforms query terms to match the input requirements of each targeted databases and accurately resolves named synonyms to ensure full coverage of the genetic results with official nomenclature. Genotator generates an excel-style output that is consistent across disease queries and readily importable to other applications. |
format | Text |
id | pubmed-2990725 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-29907252010-12-13 Genotator: A disease-agnostic tool for genetic annotation of disease Wall, Dennis P Pivovarov, Rimma Tong, Mark Jung, Jae-Yoon Fusaro, Vincent A DeLuca, Todd F Tonellato, Peter J BMC Med Genomics Software BACKGROUND: Disease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources differ dramatically in their disease coverage and genetic depth. With few exceptions, researchers must manually search a variety of sites to assemble a complete set of genetic evidence for a particular disease of interest, a process that is both time-consuming and error-prone. METHODS: We designed a real-time aggregation tool that provides both comprehensive coverage and reliable gene-to-disease rankings for any disease. Our tool, called Genotator, automatically integrates data from 11 externally accessible clinical genetics resources and uses these data in a straightforward formula to rank genes in order of disease relevance. We tested the accuracy of coverage of Genotator in three separate diseases for which there exist specialty curated databases, Autism Spectrum Disorder, Parkinson's Disease, and Alzheimer Disease. Genotator is freely available at http://genotator.hms.harvard.edu. RESULTS: Genotator demonstrated that most of the 11 selected databases contain unique information about the genetic composition of disease, with 2514 genes found in only one of the 11 databases. These findings confirm that the integration of these databases provides a more complete picture than would be possible from any one database alone. Genotator successfully identified at least 75% of the top ranked genes for all three of our use cases, including a 90% concordance with the top 40 ranked candidates for Alzheimer Disease. CONCLUSIONS: As a meta-query engine, Genotator provides high coverage of both historical genetic research as well as recent advances in the genetic understanding of specific diseases. As such, Genotator provides a real-time aggregation of ranked data that remains current with the pace of research in the disease fields. Genotator's algorithm appropriately transforms query terms to match the input requirements of each targeted databases and accurately resolves named synonyms to ensure full coverage of the genetic results with official nomenclature. Genotator generates an excel-style output that is consistent across disease queries and readily importable to other applications. BioMed Central 2010-10-29 /pmc/articles/PMC2990725/ /pubmed/21034472 http://dx.doi.org/10.1186/1755-8794-3-50 Text en Copyright ©2010 Wall et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Wall, Dennis P Pivovarov, Rimma Tong, Mark Jung, Jae-Yoon Fusaro, Vincent A DeLuca, Todd F Tonellato, Peter J Genotator: A disease-agnostic tool for genetic annotation of disease |
title | Genotator: A disease-agnostic tool for genetic annotation of disease |
title_full | Genotator: A disease-agnostic tool for genetic annotation of disease |
title_fullStr | Genotator: A disease-agnostic tool for genetic annotation of disease |
title_full_unstemmed | Genotator: A disease-agnostic tool for genetic annotation of disease |
title_short | Genotator: A disease-agnostic tool for genetic annotation of disease |
title_sort | genotator: a disease-agnostic tool for genetic annotation of disease |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2990725/ https://www.ncbi.nlm.nih.gov/pubmed/21034472 http://dx.doi.org/10.1186/1755-8794-3-50 |
work_keys_str_mv | AT walldennisp genotatoradiseaseagnostictoolforgeneticannotationofdisease AT pivovarovrimma genotatoradiseaseagnostictoolforgeneticannotationofdisease AT tongmark genotatoradiseaseagnostictoolforgeneticannotationofdisease AT jungjaeyoon genotatoradiseaseagnostictoolforgeneticannotationofdisease AT fusarovincenta genotatoradiseaseagnostictoolforgeneticannotationofdisease AT delucatoddf genotatoradiseaseagnostictoolforgeneticannotationofdisease AT tonellatopeterj genotatoradiseaseagnostictoolforgeneticannotationofdisease |