Cargando…

Evolutionary and functional lessons from human-specific amino acid substitution matrices

Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA...

Descripción completa

Detalles Bibliográficos
Autores principales: Shauli, Tair, Brandes, Nadav, Linial, Michal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445205/
https://www.ncbi.nlm.nih.gov/pubmed/34541526
http://dx.doi.org/10.1093/nargab/lqab079
_version_ 1784568602503938048
author Shauli, Tair
Brandes, Nadav
Linial, Michal
author_facet Shauli, Tair
Brandes, Nadav
Linial, Michal
author_sort Shauli, Tair
collection PubMed
description Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA) substitution matrices that are based on genetic variations in the modern human population. We analyzed the frequencies of >4.8M single nucleotide variants (SNVs) at codon and AA resolution and compiled human-centric substitution matrices that are fundamentally different from classic cross-species matrices (e.g. BLOSUM, PAM). Our matrices are asymmetric, with some AA replacements showing significant directional preference. Moreover, these AA matrices are only partly predicted by nucleotide substitution rates. We further test the utility of our matrices in exposing functional signals of experimentally-validated protein annotations. A significant reduction in AA transition frequencies was observed across nine post-translational modification (PTM) types and four ion-binding sites. Our results propose a purifying selection signal in the human proteome across a diverse set of functional protein annotations and provide an empirical baseline for interpreting human genetic variation in coding regions.
format Online
Article
Text
id pubmed-8445205
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84452052021-09-17 Evolutionary and functional lessons from human-specific amino acid substitution matrices Shauli, Tair Brandes, Nadav Linial, Michal NAR Genom Bioinform Standard Article Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA) substitution matrices that are based on genetic variations in the modern human population. We analyzed the frequencies of >4.8M single nucleotide variants (SNVs) at codon and AA resolution and compiled human-centric substitution matrices that are fundamentally different from classic cross-species matrices (e.g. BLOSUM, PAM). Our matrices are asymmetric, with some AA replacements showing significant directional preference. Moreover, these AA matrices are only partly predicted by nucleotide substitution rates. We further test the utility of our matrices in exposing functional signals of experimentally-validated protein annotations. A significant reduction in AA transition frequencies was observed across nine post-translational modification (PTM) types and four ion-binding sites. Our results propose a purifying selection signal in the human proteome across a diverse set of functional protein annotations and provide an empirical baseline for interpreting human genetic variation in coding regions. Oxford University Press 2021-09-16 /pmc/articles/PMC8445205/ /pubmed/34541526 http://dx.doi.org/10.1093/nargab/lqab079 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Shauli, Tair
Brandes, Nadav
Linial, Michal
Evolutionary and functional lessons from human-specific amino acid substitution matrices
title Evolutionary and functional lessons from human-specific amino acid substitution matrices
title_full Evolutionary and functional lessons from human-specific amino acid substitution matrices
title_fullStr Evolutionary and functional lessons from human-specific amino acid substitution matrices
title_full_unstemmed Evolutionary and functional lessons from human-specific amino acid substitution matrices
title_short Evolutionary and functional lessons from human-specific amino acid substitution matrices
title_sort evolutionary and functional lessons from human-specific amino acid substitution matrices
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445205/
https://www.ncbi.nlm.nih.gov/pubmed/34541526
http://dx.doi.org/10.1093/nargab/lqab079
work_keys_str_mv AT shaulitair evolutionaryandfunctionallessonsfromhumanspecificaminoacidsubstitutionmatrices
AT brandesnadav evolutionaryandfunctionallessonsfromhumanspecificaminoacidsubstitutionmatrices
AT linialmichal evolutionaryandfunctionallessonsfromhumanspecificaminoacidsubstitutionmatrices