
Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes

Next-generation sequencing enables simultaneous analysis of hundreds of human genomes associated with a particular phenotype, for example, a disease. These genomes naturally contain a lot of sequence variation that ranges from single-nucleotide variants (SNVs) to large-scale structural rearrangement...

Bibliographic Details
Main authors: Gress, A, Ramensky, V, Kalinina, O V
Format: Online Article Text
Language: English
Published: Nature Publishing Group 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5623905/
https://www.ncbi.nlm.nih.gov/pubmed/28945216
http://dx.doi.org/10.1038/oncsis.2017.79
author Gress, A
Ramensky, V
Kalinina, O V
collection PubMed
description Next-generation sequencing enables simultaneous analysis of hundreds of human genomes associated with a particular phenotype, for example, a disease. These genomes naturally contain extensive sequence variation, ranging from single-nucleotide variants (SNVs) to large-scale structural rearrangements. To establish a functional connection between genotype and disease-associated phenotypes, one needs to distinguish disease drivers from neutral passenger variants. Functional annotation based on experimental assays is feasible only for a limited number of candidate mutations, so alternative computational tools are needed. One possible approach to annotating mutations functionally is to consider their spatial location relative to functionally relevant sites in three-dimensional (3D) structures of the harboring proteins. This approach is impeded by the lack of available protein 3D structures. Complementing experimentally resolved structures with reliable computational models is an attractive alternative. We developed a structure-based approach to characterizing comprehensive sets of non-synonymous single-nucleotide variants (nsSNVs) of three kinds: those associated with cancer, those associated with non-cancer diseases, and those that are putatively functionally neutral. We searched experimentally resolved protein 3D structures for potential homology-modeling templates for the proteins harboring the corresponding mutations. We found such templates for all proteins with disease-associated nsSNVs, and for 51% and 66% of proteins carrying common polymorphisms and annotated benign variants, respectively. Many mutations caused by nsSNVs can be found in protein–protein, protein–nucleic acid or protein–ligand complexes. Correction for the number of available templates per protein reveals that protein–protein interaction interfaces are enriched neither in cancer nsSNVs nor in nsSNVs associated with non-cancer diseases. Whereas cancer-associated mutations are enriched in DNA-binding proteins, they are rarely located directly in DNA-interacting interfaces. In contrast, mutations associated with non-cancer diseases are generally rare in DNA-binding proteins, but are enriched in the DNA-interacting interfaces of these proteins. All disease-associated nsSNVs are overrepresented in ligand-binding pockets, and nsSNVs associated with non-cancer diseases are additionally enriched in the protein core, where they probably affect overall protein stability.
format Online
Article
Text
id pubmed-5623905
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-5623905 2017-10-12 Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes Gress, A Ramensky, V Kalinina, O V Oncogenesis Original Article Nature Publishing Group 2017-09 2017-09-25 /pmc/articles/PMC5623905/ /pubmed/28945216 http://dx.doi.org/10.1038/oncsis.2017.79 Text en Copyright © 2017 The Author(s) http://creativecommons.org/licenses/by/4.0/ Oncogenesis is an open-access journal published by Nature Publishing Group. This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
title Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes
topic Original Article