Cargando…
Evolutionary and functional lessons from human-specific amino acid substitution matrices
Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445205/ https://www.ncbi.nlm.nih.gov/pubmed/34541526 http://dx.doi.org/10.1093/nargab/lqab079 |
_version_ | 1784568602503938048 |
---|---|
author | Shauli, Tair Brandes, Nadav Linial, Michal |
author_facet | Shauli, Tair Brandes, Nadav Linial, Michal |
author_sort | Shauli, Tair |
collection | PubMed |
description | Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA) substitution matrices that are based on genetic variations in the modern human population. We analyzed the frequencies of >4.8M single nucleotide variants (SNVs) at codon and AA resolution and compiled human-centric substitution matrices that are fundamentally different from classic cross-species matrices (e.g. BLOSUM, PAM). Our matrices are asymmetric, with some AA replacements showing significant directional preference. Moreover, these AA matrices are only partly predicted by nucleotide substitution rates. We further test the utility of our matrices in exposing functional signals of experimentally-validated protein annotations. A significant reduction in AA transition frequencies was observed across nine post-translational modification (PTM) types and four ion-binding sites. Our results propose a purifying selection signal in the human proteome across a diverse set of functional protein annotations and provide an empirical baseline for interpreting human genetic variation in coding regions. |
format | Online Article Text |
id | pubmed-8445205 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-84452052021-09-17 Evolutionary and functional lessons from human-specific amino acid substitution matrices Shauli, Tair Brandes, Nadav Linial, Michal NAR Genom Bioinform Standard Article Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA) substitution matrices that are based on genetic variations in the modern human population. We analyzed the frequencies of >4.8M single nucleotide variants (SNVs) at codon and AA resolution and compiled human-centric substitution matrices that are fundamentally different from classic cross-species matrices (e.g. BLOSUM, PAM). Our matrices are asymmetric, with some AA replacements showing significant directional preference. Moreover, these AA matrices are only partly predicted by nucleotide substitution rates. We further test the utility of our matrices in exposing functional signals of experimentally-validated protein annotations. A significant reduction in AA transition frequencies was observed across nine post-translational modification (PTM) types and four ion-binding sites. Our results propose a purifying selection signal in the human proteome across a diverse set of functional protein annotations and provide an empirical baseline for interpreting human genetic variation in coding regions. Oxford University Press 2021-09-16 /pmc/articles/PMC8445205/ /pubmed/34541526 http://dx.doi.org/10.1093/nargab/lqab079 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Shauli, Tair Brandes, Nadav Linial, Michal Evolutionary and functional lessons from human-specific amino acid substitution matrices |
title | Evolutionary and functional lessons from human-specific amino acid substitution matrices |
title_full | Evolutionary and functional lessons from human-specific amino acid substitution matrices |
title_fullStr | Evolutionary and functional lessons from human-specific amino acid substitution matrices |
title_full_unstemmed | Evolutionary and functional lessons from human-specific amino acid substitution matrices |
title_short | Evolutionary and functional lessons from human-specific amino acid substitution matrices |
title_sort | evolutionary and functional lessons from human-specific amino acid substitution matrices |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445205/ https://www.ncbi.nlm.nih.gov/pubmed/34541526 http://dx.doi.org/10.1093/nargab/lqab079 |
work_keys_str_mv | AT shaulitair evolutionaryandfunctionallessonsfromhumanspecificaminoacidsubstitutionmatrices AT brandesnadav evolutionaryandfunctionallessonsfromhumanspecificaminoacidsubstitutionmatrices AT linialmichal evolutionaryandfunctionallessonsfromhumanspecificaminoacidsubstitutionmatrices |