Cargando…
Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations
Summary: Accurate annotations of genomic variants are necessary to achieve full-genome clinical interpretations that are scientifically sound and medically relevant. Many disease associations, especially those reported before the completion of the HGP, are limited in applicability because of potenti...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051330/ https://www.ncbi.nlm.nih.gov/pubmed/21258063 http://dx.doi.org/10.1093/bioinformatics/btr029 |
_version_ | 1782199472078454784 |
---|---|
author | Tong, Mark Y. Cassa, Christopher A. Kohane, Isaac S. |
author_facet | Tong, Mark Y. Cassa, Christopher A. Kohane, Isaac S. |
author_sort | Tong, Mark Y. |
collection | PubMed |
description | Summary: Accurate annotations of genomic variants are necessary to achieve full-genome clinical interpretations that are scientifically sound and medically relevant. Many disease associations, especially those reported before the completion of the HGP, are limited in applicability because of potential inconsistencies with our current standards for genomic coordinates, nomenclature and gene structure. In an effort to validate and link variants from the medical genetics literature to an unambiguous reference for each variant, we developed a software pipeline and reviewed 68 641 single amino acid mutations from Online Mendelian Inheritance in Man (OMIM), Human Gene Mutation Database (HGMD) and dbSNP. The frequency of unresolved mutation annotations varied widely among the databases, ranging from 4 to 23%. A taxonomy of primary causes for unresolved mutations was produced. Availability: This program is freely available from the web site (http://safegene.hms.harvard.edu/aa2nt/). Contact: mt153@hms.harvard.edu; mark_tong2009@yahoo.com Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-3051330 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30513302011-03-10 Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations Tong, Mark Y. Cassa, Christopher A. Kohane, Isaac S. Bioinformatics Applications Note Summary: Accurate annotations of genomic variants are necessary to achieve full-genome clinical interpretations that are scientifically sound and medically relevant. Many disease associations, especially those reported before the completion of the HGP, are limited in applicability because of potential inconsistencies with our current standards for genomic coordinates, nomenclature and gene structure. In an effort to validate and link variants from the medical genetics literature to an unambiguous reference for each variant, we developed a software pipeline and reviewed 68 641 single amino acid mutations from Online Mendelian Inheritance in Man (OMIM), Human Gene Mutation Database (HGMD) and dbSNP. The frequency of unresolved mutation annotations varied widely among the databases, ranging from 4 to 23%. A taxonomy of primary causes for unresolved mutations was produced. Availability: This program is freely available from the web site (http://safegene.hms.harvard.edu/aa2nt/). Contact: mt153@hms.harvard.edu; mark_tong2009@yahoo.com Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-03-15 2011-01-22 /pmc/articles/PMC3051330/ /pubmed/21258063 http://dx.doi.org/10.1093/bioinformatics/btr029 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Tong, Mark Y. Cassa, Christopher A. Kohane, Isaac S. Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations |
title | Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations |
title_full | Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations |
title_fullStr | Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations |
title_full_unstemmed | Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations |
title_short | Automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations |
title_sort | automated validation of genetic variants from large databases: ensuring that variant references refer to the same genomic locations |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051330/ https://www.ncbi.nlm.nih.gov/pubmed/21258063 http://dx.doi.org/10.1093/bioinformatics/btr029 |
work_keys_str_mv | AT tongmarky automatedvalidationofgeneticvariantsfromlargedatabasesensuringthatvariantreferencesrefertothesamegenomiclocations AT cassachristophera automatedvalidationofgeneticvariantsfromlargedatabasesensuringthatvariantreferencesrefertothesamegenomiclocations AT kohaneisaacs automatedvalidationofgeneticvariantsfromlargedatabasesensuringthatvariantreferencesrefertothesamegenomiclocations |