Cargando…
BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
BACKGROUND: High-throughput genetic testing is increasingly applied in clinics. Next-Generation Sequencing (NGS) data analysis however still remains a great challenge. The interpretation of pathogenicity of single variants or combinations of variants is crucial to provide accurate diagnostic informa...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4506604/ https://www.ncbi.nlm.nih.gov/pubmed/26191084 http://dx.doi.org/10.1186/s13073-015-0190-y |
_version_ | 1782381718578135040 |
---|---|
author | Mueller, Sabine C. Backes, Christina Kalinina, Olga V. Meder, Benjamin Stöckel, Daniel Lenhof, Hans-Peter Meese, Eckart Keller, Andreas |
author_facet | Mueller, Sabine C. Backes, Christina Kalinina, Olga V. Meder, Benjamin Stöckel, Daniel Lenhof, Hans-Peter Meese, Eckart Keller, Andreas |
author_sort | Mueller, Sabine C. |
collection | PubMed |
description | BACKGROUND: High-throughput genetic testing is increasingly applied in clinics. Next-Generation Sequencing (NGS) data analysis however still remains a great challenge. The interpretation of pathogenicity of single variants or combinations of variants is crucial to provide accurate diagnostic information or guide therapies. METHODS: To facilitate the interpretation of variants and the selection of candidate non-synonymous polymorphisms (nsSNPs) for further clinical studies, we developed BALL-SNP. Starting from genetic variants in variant call format (VCF) files or tabular input, our tool, first, visualizes the three-dimensional (3D) structure of the respective proteins from the Protein Data Bank (PDB) and highlights mutated residues, automatically. Second, a hierarchical bottom up clustering on the nsSNPs within the 3D structure is performed to identify nsSNPs, which are close to each other. The modular and flexible implementation allows for straightforward integration of different databases for pathogenic and benign variants, but also enables the integration of pathogenicity prediction tools. The collected background information of all variants is presented below the 3D structure in an easily interpretable table format. RESULTS: First, we integrated different data resources into BALL-SNP, including databases containing information on genetic variants such as ClinVar or HUMSAVAR; third party tools that predict stability or pathogenicity in silico such as I-Mutant2.0; and additional information derived from the 3D structure such as a prediction of binding pockets. We then explored the applicability of BALL-SNP on the example of patients suffering from cardiomyopathies. Here, the analysis highlighted accumulation of variations in the genes JUP, VCL, and SMYD2. CONCLUSION: Software solutions for analyzing high-throughput genomics data are important to support diagnosis and therapy selection. Our tool BALL-SNP, which is freely available at http://www.ccb.uni-saarland.de/BALL-SNP, combines genetic information with an easily interpretable and interactive, graphical representation of amino acid changes in proteins. Thereby relevant information from databases and computational tools is presented. Beyond this, proximity to functional sites or accumulations of mutations with a potential collective effect can be discovered. |
format | Online Article Text |
id | pubmed-4506604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45066042015-07-19 BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms Mueller, Sabine C. Backes, Christina Kalinina, Olga V. Meder, Benjamin Stöckel, Daniel Lenhof, Hans-Peter Meese, Eckart Keller, Andreas Genome Med Research BACKGROUND: High-throughput genetic testing is increasingly applied in clinics. Next-Generation Sequencing (NGS) data analysis however still remains a great challenge. The interpretation of pathogenicity of single variants or combinations of variants is crucial to provide accurate diagnostic information or guide therapies. METHODS: To facilitate the interpretation of variants and the selection of candidate non-synonymous polymorphisms (nsSNPs) for further clinical studies, we developed BALL-SNP. Starting from genetic variants in variant call format (VCF) files or tabular input, our tool, first, visualizes the three-dimensional (3D) structure of the respective proteins from the Protein Data Bank (PDB) and highlights mutated residues, automatically. Second, a hierarchical bottom up clustering on the nsSNPs within the 3D structure is performed to identify nsSNPs, which are close to each other. The modular and flexible implementation allows for straightforward integration of different databases for pathogenic and benign variants, but also enables the integration of pathogenicity prediction tools. The collected background information of all variants is presented below the 3D structure in an easily interpretable table format. RESULTS: First, we integrated different data resources into BALL-SNP, including databases containing information on genetic variants such as ClinVar or HUMSAVAR; third party tools that predict stability or pathogenicity in silico such as I-Mutant2.0; and additional information derived from the 3D structure such as a prediction of binding pockets. We then explored the applicability of BALL-SNP on the example of patients suffering from cardiomyopathies. Here, the analysis highlighted accumulation of variations in the genes JUP, VCL, and SMYD2. CONCLUSION: Software solutions for analyzing high-throughput genomics data are important to support diagnosis and therapy selection. Our tool BALL-SNP, which is freely available at http://www.ccb.uni-saarland.de/BALL-SNP, combines genetic information with an easily interpretable and interactive, graphical representation of amino acid changes in proteins. Thereby relevant information from databases and computational tools is presented. Beyond this, proximity to functional sites or accumulations of mutations with a potential collective effect can be discovered. BioMed Central 2015-07-01 /pmc/articles/PMC4506604/ /pubmed/26191084 http://dx.doi.org/10.1186/s13073-015-0190-y Text en © Mueller et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Mueller, Sabine C. Backes, Christina Kalinina, Olga V. Meder, Benjamin Stöckel, Daniel Lenhof, Hans-Peter Meese, Eckart Keller, Andreas BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms |
title | BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms |
title_full | BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms |
title_fullStr | BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms |
title_full_unstemmed | BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms |
title_short | BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms |
title_sort | ball-snp: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4506604/ https://www.ncbi.nlm.nih.gov/pubmed/26191084 http://dx.doi.org/10.1186/s13073-015-0190-y |
work_keys_str_mv | AT muellersabinec ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms AT backeschristina ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms AT kalininaolgav ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms AT mederbenjamin ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms AT stockeldaniel ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms AT lenhofhanspeter ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms AT meeseeckart ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms AT kellerandreas ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms |