Cargando…

Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations

Summary: Accurate annotations of genomic variants are necessary to achieve full-genome clinical interpretations that are scientifically sound and medically relevant. Many disease associations, especially those reported before the completion of the HGP, are limited in applicability because of potenti...

Descripción completa

Detalles Bibliográficos
Autores principales: Tong, Mark Y., Cassa, Christopher A., Kohane, Isaac S.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051330/
https://www.ncbi.nlm.nih.gov/pubmed/21258063
http://dx.doi.org/10.1093/bioinformatics/btr029
_version_ 1782199472078454784
author Tong, Mark Y.
Cassa, Christopher A.
Kohane, Isaac S.
author_facet Tong, Mark Y.
Cassa, Christopher A.
Kohane, Isaac S.
author_sort Tong, Mark Y.
collection PubMed
description Summary: Accurate annotations of genomic variants are necessary to achieve full-genome clinical interpretations that are scientifically sound and medically relevant. Many disease associations, especially those reported before the completion of the HGP, are limited in applicability because of potential inconsistencies with our current standards for genomic coordinates, nomenclature and gene structure. In an effort to validate and link variants from the medical genetics literature to an unambiguous reference for each variant, we developed a software pipeline and reviewed 68 641 single amino acid mutations from Online Mendelian Inheritance in Man (OMIM), Human Gene Mutation Database (HGMD) and dbSNP. The frequency of unresolved mutation annotations varied widely among the databases, ranging from 4 to 23%. A taxonomy of primary causes for unresolved mutations was produced. Availability: This program is freely available from the web site (http://safegene.hms.harvard.edu/aa2nt/). Contact: mt153@hms.harvard.edu; mark_tong2009@yahoo.com Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-3051330
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30513302011-03-10 Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations Tong, Mark Y. Cassa, Christopher A. Kohane, Isaac S. Bioinformatics Applications Note Summary: Accurate annotations of genomic variants are necessary to achieve full-genome clinical interpretations that are scientifically sound and medically relevant. Many disease associations, especially those reported before the completion of the HGP, are limited in applicability because of potential inconsistencies with our current standards for genomic coordinates, nomenclature and gene structure. In an effort to validate and link variants from the medical genetics literature to an unambiguous reference for each variant, we developed a software pipeline and reviewed 68 641 single amino acid mutations from Online Mendelian Inheritance in Man (OMIM), Human Gene Mutation Database (HGMD) and dbSNP. The frequency of unresolved mutation annotations varied widely among the databases, ranging from 4 to 23%. A taxonomy of primary causes for unresolved mutations was produced. Availability: This program is freely available from the web site (http://safegene.hms.harvard.edu/aa2nt/). Contact: mt153@hms.harvard.edu; mark_tong2009@yahoo.com Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-03-15 2011-01-22 /pmc/articles/PMC3051330/ /pubmed/21258063 http://dx.doi.org/10.1093/bioinformatics/btr029 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Tong, Mark Y.
Cassa, Christopher A.
Kohane, Isaac S.
Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
title Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
title_full Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
title_fullStr Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
title_full_unstemmed Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
title_short Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
title_sort automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051330/
https://www.ncbi.nlm.nih.gov/pubmed/21258063
http://dx.doi.org/10.1093/bioinformatics/btr029
work_keys_str_mv AT tongmarky automatedvalidationofgeneticvariantsfromlargedatabasesensuringthatvariantreferencesrefertothesamegenomiclocations
AT cassachristophera automatedvalidationofgeneticvariantsfromlargedatabasesensuringthatvariantreferencesrefertothesamegenomiclocations
AT kohaneisaacs automatedvalidationofgeneticvariantsfromlargedatabasesensuringthatvariantreferencesrefertothesamegenomiclocations