Cargando…

Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants

Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here...

Descripción completa

Detalles Bibliográficos
Autores principales: Iqbal, Sumaiya, Pérez-Palma, Eduardo, Jespersen, Jakob B., May, Patrick, Hoksza, David, Heyne, Henrike O., Ahmed, Shehab S., Rifat, Zaara T., Rahman, M. Sohel, Lage, Kasper, Palotie, Aarno, Cottrell, Jeffrey R., Wagner, Florence F., Daly, Mark J., Campbell, Arthur J., Lal, Dennis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7668189/
https://www.ncbi.nlm.nih.gov/pubmed/33106425
http://dx.doi.org/10.1073/pnas.2002660117
_version_ 1783610443782160384
author Iqbal, Sumaiya
Pérez-Palma, Eduardo
Jespersen, Jakob B.
May, Patrick
Hoksza, David
Heyne, Henrike O.
Ahmed, Shehab S.
Rifat, Zaara T.
Rahman, M. Sohel
Lage, Kasper
Palotie, Aarno
Cottrell, Jeffrey R.
Wagner, Florence F.
Daly, Mark J.
Campbell, Arthur J.
Lal, Dennis
author_facet Iqbal, Sumaiya
Pérez-Palma, Eduardo
Jespersen, Jakob B.
May, Patrick
Hoksza, David
Heyne, Henrike O.
Ahmed, Shehab S.
Rifat, Zaara T.
Rahman, M. Sohel
Lage, Kasper
Palotie, Aarno
Cottrell, Jeffrey R.
Wagner, Florence F.
Daly, Mark J.
Campbell, Arthur J.
Lal, Dennis
author_sort Iqbal, Sumaiya
collection PubMed
description Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here we characterize the three-dimensional (3D) amino acid positions affected in pathogenic and population variants from 1,330 disease-associated genes using over 14,000 experimentally solved human protein structures. By measuring the statistical burden of variations (i.e., point mutations) from all genes on 40 3D protein features, accounting for the structural, chemical, and functional context of the variations’ positions, we identify features that are generally associated with pathogenic and population missense variants. We then perform the same amino acid-level analysis individually for 24 protein functional classes, which reveals unique characteristics of the positions of the altered amino acids: We observe up to 46% divergence of the class-specific features from the general characteristics obtained by the analysis on all genes, which is consistent with the structural diversity of essential regions across different protein classes. We demonstrate that the function-specific 3D features of the variants match the readouts of mutagenesis experiments for BRCA1 and PTEN, and positively correlate with an independent set of clinically interpreted pathogenic and benign missense variants. Finally, we make our results available through a web server to foster accessibility and downstream research. Our findings represent a crucial step toward translational genetics, from highlighting the impact of mutations on protein structure to rationalizing the variants’ pathogenicity in terms of the perturbed molecular mechanisms.
format Online
Article
Text
id pubmed-7668189
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-76681892020-11-27 Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants Iqbal, Sumaiya Pérez-Palma, Eduardo Jespersen, Jakob B. May, Patrick Hoksza, David Heyne, Henrike O. Ahmed, Shehab S. Rifat, Zaara T. Rahman, M. Sohel Lage, Kasper Palotie, Aarno Cottrell, Jeffrey R. Wagner, Florence F. Daly, Mark J. Campbell, Arthur J. Lal, Dennis Proc Natl Acad Sci U S A Biological Sciences Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here we characterize the three-dimensional (3D) amino acid positions affected in pathogenic and population variants from 1,330 disease-associated genes using over 14,000 experimentally solved human protein structures. By measuring the statistical burden of variations (i.e., point mutations) from all genes on 40 3D protein features, accounting for the structural, chemical, and functional context of the variations’ positions, we identify features that are generally associated with pathogenic and population missense variants. We then perform the same amino acid-level analysis individually for 24 protein functional classes, which reveals unique characteristics of the positions of the altered amino acids: We observe up to 46% divergence of the class-specific features from the general characteristics obtained by the analysis on all genes, which is consistent with the structural diversity of essential regions across different protein classes. We demonstrate that the function-specific 3D features of the variants match the readouts of mutagenesis experiments for BRCA1 and PTEN, and positively correlate with an independent set of clinically interpreted pathogenic and benign missense variants. Finally, we make our results available through a web server to foster accessibility and downstream research. Our findings represent a crucial step toward translational genetics, from highlighting the impact of mutations on protein structure to rationalizing the variants’ pathogenicity in terms of the perturbed molecular mechanisms. National Academy of Sciences 2020-11-10 2020-10-26 /pmc/articles/PMC7668189/ /pubmed/33106425 http://dx.doi.org/10.1073/pnas.2002660117 Text en Copyright © 2020 the Author(s). Published by PNAS. http://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (http://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Iqbal, Sumaiya
Pérez-Palma, Eduardo
Jespersen, Jakob B.
May, Patrick
Hoksza, David
Heyne, Henrike O.
Ahmed, Shehab S.
Rifat, Zaara T.
Rahman, M. Sohel
Lage, Kasper
Palotie, Aarno
Cottrell, Jeffrey R.
Wagner, Florence F.
Daly, Mark J.
Campbell, Arthur J.
Lal, Dennis
Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
title Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
title_full Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
title_fullStr Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
title_full_unstemmed Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
title_short Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
title_sort comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7668189/
https://www.ncbi.nlm.nih.gov/pubmed/33106425
http://dx.doi.org/10.1073/pnas.2002660117
work_keys_str_mv AT iqbalsumaiya comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT perezpalmaeduardo comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT jespersenjakobb comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT maypatrick comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT hokszadavid comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT heynehenrikeo comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT ahmedshehabs comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT rifatzaarat comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT rahmanmsohel comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT lagekasper comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT palotieaarno comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT cottrelljeffreyr comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT wagnerflorencef comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT dalymarkj comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT campbellarthurj comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants
AT laldennis comprehensivecharacterizationofaminoacidpositionsinproteinstructuresrevealsmoleculareffectofmissensevariants