Cargando…
Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7668189/ https://www.ncbi.nlm.nih.gov/pubmed/33106425 http://dx.doi.org/10.1073/pnas.2002660117 |
_version_ | 1783610443782160384 |
---|---|
author | Iqbal, Sumaiya Pérez-Palma, Eduardo Jespersen, Jakob B. May, Patrick Hoksza, David Heyne, Henrike O. Ahmed, Shehab S. Rifat, Zaara T. Rahman, M. Sohel Lage, Kasper Palotie, Aarno Cottrell, Jeffrey R. Wagner, Florence F. Daly, Mark J. Campbell, Arthur J. Lal, Dennis |
author_facet | Iqbal, Sumaiya Pérez-Palma, Eduardo Jespersen, Jakob B. May, Patrick Hoksza, David Heyne, Henrike O. Ahmed, Shehab S. Rifat, Zaara T. Rahman, M. Sohel Lage, Kasper Palotie, Aarno Cottrell, Jeffrey R. Wagner, Florence F. Daly, Mark J. Campbell, Arthur J. Lal, Dennis |
author_sort | Iqbal, Sumaiya |
collection | PubMed |
description | Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here we characterize the three-dimensional (3D) amino acid positions affected in pathogenic and population variants from 1,330 disease-associated genes using over 14,000 experimentally solved human protein structures. By measuring the statistical burden of variations (i.e., point mutations) from all genes on 40 3D protein features, accounting for the structural, chemical, and functional context of the variations’ positions, we identify features that are generally associated with pathogenic and population missense variants. We then perform the same amino acid-level analysis individually for 24 protein functional classes, which reveals unique characteristics of the positions of the altered amino acids: We observe up to 46% divergence of the class-specific features from the general characteristics obtained by the analysis on all genes, which is consistent with the structural diversity of essential regions across different protein classes. We demonstrate that the function-specific 3D features of the variants match the readouts of mutagenesis experiments for BRCA1 and PTEN, and positively correlate with an independent set of clinically interpreted pathogenic and benign missense variants. Finally, we make our results available through a web server to foster accessibility and downstream research. Our findings represent a crucial step toward translational genetics, from highlighting the impact of mutations on protein structure to rationalizing the variants’ pathogenicity in terms of the perturbed molecular mechanisms. |
format | Online Article Text |
id | pubmed-7668189 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-76681892020-11-27 Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants Iqbal, Sumaiya Pérez-Palma, Eduardo Jespersen, Jakob B. May, Patrick Hoksza, David Heyne, Henrike O. Ahmed, Shehab S. Rifat, Zaara T. Rahman, M. Sohel Lage, Kasper Palotie, Aarno Cottrell, Jeffrey R. Wagner, Florence F. Daly, Mark J. Campbell, Arthur J. Lal, Dennis Proc Natl Acad Sci U S A Biological Sciences Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here we characterize the three-dimensional (3D) amino acid positions affected in pathogenic and population variants from 1,330 disease-associated genes using over 14,000 experimentally solved human protein structures. By measuring the statistical burden of variations (i.e., point mutations) from all genes on 40 3D protein features, accounting for the structural, chemical, and functional context of the variations’ positions, we identify features that are generally associated with pathogenic and population missense variants. We then perform the same amino acid-level analysis individually for 24 protein functional classes, which reveals unique characteristics of the positions of the altered amino acids: We observe up to 46% divergence of the class-specific features from the general characteristics obtained by the analysis on all genes, which is consistent with the structural diversity of essential regions across different protein classes. We demonstrate that the function-specific 3D features of the variants match the readouts of mutagenesis experiments for BRCA1 and PTEN, and positively correlate with an independent set of clinically interpreted pathogenic and benign missense variants. Finally, we make our results available through a web server to foster accessibility and downstream research. Our findings represent a crucial step toward translational genetics, from highlighting the impact of mutations on protein structure to rationalizing the variants’ pathogenicity in terms of the perturbed molecular mechanisms. National Academy of Sciences 2020-11-10 2020-10-26 /pmc/articles/PMC7668189/ /pubmed/33106425 http://dx.doi.org/10.1073/pnas.2002660117 Text en Copyright © 2020 the Author(s). Published by PNAS. http://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (http://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Biological Sciences Iqbal, Sumaiya Pérez-Palma, Eduardo Jespersen, Jakob B. May, Patrick Hoksza, David Heyne, Henrike O. Ahmed, Shehab S. Rifat, Zaara T. Rahman, M. Sohel Lage, Kasper Palotie, Aarno Cottrell, Jeffrey R. Wagner, Florence F. Daly, Mark J. Campbell, Arthur J. Lal, Dennis Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants |
title | Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants |
title_full | Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants |
title_fullStr | Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants |
title_full_unstemmed | Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants |
title_short | Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants |
title_sort | comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7668189/ https://www.ncbi.nlm.nih.gov/pubmed/33106425 http://dx.doi.org/10.1073/pnas.2002660117 |
work_keys_str_mv | AT iqbalsumaiya comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT perezpalmaeduardo comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT jespersenjakobb comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT maypatrick comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT hokszadavid comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT heynehenrikeo comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT ahmedshehabs comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT rifatzaarat comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT rahmanmsohel comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT lagekasper comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT palotieaarno comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT cottrelljeffreyr comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT wagnerflorencef comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT dalymarkj comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT campbellarthurj comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants AT laldennis comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants |