Cargando…

BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms

BACKGROUND: High-throughput genetic testing is increasingly applied in clinics. Next-Generation Sequencing (NGS) data analysis however still remains a great challenge. The interpretation of pathogenicity of single variants or combinations of variants is crucial to provide accurate diagnostic informa...

Descripción completa

Detalles Bibliográficos
Autores principales: Mueller, Sabine C., Backes, Christina, Kalinina, Olga V., Meder, Benjamin, Stöckel, Daniel, Lenhof, Hans-Peter, Meese, Eckart, Keller, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4506604/
https://www.ncbi.nlm.nih.gov/pubmed/26191084
http://dx.doi.org/10.1186/s13073-015-0190-y
_version_ 1782381718578135040
author Mueller, Sabine C.
Backes, Christina
Kalinina, Olga V.
Meder, Benjamin
Stöckel, Daniel
Lenhof, Hans-Peter
Meese, Eckart
Keller, Andreas
author_facet Mueller, Sabine C.
Backes, Christina
Kalinina, Olga V.
Meder, Benjamin
Stöckel, Daniel
Lenhof, Hans-Peter
Meese, Eckart
Keller, Andreas
author_sort Mueller, Sabine C.
collection PubMed
description BACKGROUND: High-throughput genetic testing is increasingly applied in clinics. Next-Generation Sequencing (NGS) data analysis however still remains a great challenge. The interpretation of pathogenicity of single variants or combinations of variants is crucial to provide accurate diagnostic information or guide therapies. METHODS: To facilitate the interpretation of variants and the selection of candidate non-synonymous polymorphisms (nsSNPs) for further clinical studies, we developed BALL-SNP. Starting from genetic variants in variant call format (VCF) files or tabular input, our tool, first, visualizes the three-dimensional (3D) structure of the respective proteins from the Protein Data Bank (PDB) and highlights mutated residues, automatically. Second, a hierarchical bottom up clustering on the nsSNPs within the 3D structure is performed to identify nsSNPs, which are close to each other. The modular and flexible implementation allows for straightforward integration of different databases for pathogenic and benign variants, but also enables the integration of pathogenicity prediction tools. The collected background information of all variants is presented below the 3D structure in an easily interpretable table format. RESULTS: First, we integrated different data resources into BALL-SNP, including databases containing information on genetic variants such as ClinVar or HUMSAVAR; third party tools that predict stability or pathogenicity in silico such as I-Mutant2.0; and additional information derived from the 3D structure such as a prediction of binding pockets. We then explored the applicability of BALL-SNP on the example of patients suffering from cardiomyopathies. Here, the analysis highlighted accumulation of variations in the genes JUP, VCL, and SMYD2. CONCLUSION: Software solutions for analyzing high-throughput genomics data are important to support diagnosis and therapy selection. Our tool BALL-SNP, which is freely available at http://www.ccb.uni-saarland.de/BALL-SNP, combines genetic information with an easily interpretable and interactive, graphical representation of amino acid changes in proteins. Thereby relevant information from databases and computational tools is presented. Beyond this, proximity to functional sites or accumulations of mutations with a potential collective effect can be discovered.
format Online
Article
Text
id pubmed-4506604
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45066042015-07-19 BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms Mueller, Sabine C. Backes, Christina Kalinina, Olga V. Meder, Benjamin Stöckel, Daniel Lenhof, Hans-Peter Meese, Eckart Keller, Andreas Genome Med Research BACKGROUND: High-throughput genetic testing is increasingly applied in clinics. Next-Generation Sequencing (NGS) data analysis however still remains a great challenge. The interpretation of pathogenicity of single variants or combinations of variants is crucial to provide accurate diagnostic information or guide therapies. METHODS: To facilitate the interpretation of variants and the selection of candidate non-synonymous polymorphisms (nsSNPs) for further clinical studies, we developed BALL-SNP. Starting from genetic variants in variant call format (VCF) files or tabular input, our tool, first, visualizes the three-dimensional (3D) structure of the respective proteins from the Protein Data Bank (PDB) and highlights mutated residues, automatically. Second, a hierarchical bottom up clustering on the nsSNPs within the 3D structure is performed to identify nsSNPs, which are close to each other. The modular and flexible implementation allows for straightforward integration of different databases for pathogenic and benign variants, but also enables the integration of pathogenicity prediction tools. The collected background information of all variants is presented below the 3D structure in an easily interpretable table format. RESULTS: First, we integrated different data resources into BALL-SNP, including databases containing information on genetic variants such as ClinVar or HUMSAVAR; third party tools that predict stability or pathogenicity in silico such as I-Mutant2.0; and additional information derived from the 3D structure such as a prediction of binding pockets. We then explored the applicability of BALL-SNP on the example of patients suffering from cardiomyopathies. Here, the analysis highlighted accumulation of variations in the genes JUP, VCL, and SMYD2. CONCLUSION: Software solutions for analyzing high-throughput genomics data are important to support diagnosis and therapy selection. Our tool BALL-SNP, which is freely available at http://www.ccb.uni-saarland.de/BALL-SNP, combines genetic information with an easily interpretable and interactive, graphical representation of amino acid changes in proteins. Thereby relevant information from databases and computational tools is presented. Beyond this, proximity to functional sites or accumulations of mutations with a potential collective effect can be discovered. BioMed Central 2015-07-01 /pmc/articles/PMC4506604/ /pubmed/26191084 http://dx.doi.org/10.1186/s13073-015-0190-y Text en © Mueller et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Mueller, Sabine C.
Backes, Christina
Kalinina, Olga V.
Meder, Benjamin
Stöckel, Daniel
Lenhof, Hans-Peter
Meese, Eckart
Keller, Andreas
BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
title BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
title_full BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
title_fullStr BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
title_full_unstemmed BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
title_short BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
title_sort ball-snp: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4506604/
https://www.ncbi.nlm.nih.gov/pubmed/26191084
http://dx.doi.org/10.1186/s13073-015-0190-y
work_keys_str_mv AT muellersabinec ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms
AT backeschristina ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms
AT kalininaolgav ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms
AT mederbenjamin ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms
AT stockeldaniel ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms
AT lenhofhanspeter ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms
AT meeseeckart ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms
AT kellerandreas ballsnpcombininggeneticandstructuralinformationtoidentifycandidatenonsynonymoussinglenucleotidepolymorphisms