Cargando…

Ankyrin repeats in context with human population variation

Ankyrin protein repeats bind to a wide range of substrates and are one of the most common protein motifs in nature. Here, we collate a high-quality alignment of 7,407 ankyrin repeats and examine for the first time, the distribution of human population variants from large-scale sequencing of healthy...

Descripción completa

Detalles Bibliográficos
Autores principales: Utgés, Javier S., Tsenkov, Maxim I., Dietrich, Noah J. M., MacGowan, Stuart A., Barton, Geoffrey J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8415598/
https://www.ncbi.nlm.nih.gov/pubmed/34428215
http://dx.doi.org/10.1371/journal.pcbi.1009335
_version_ 1783747998476402688
author Utgés, Javier S.
Tsenkov, Maxim I.
Dietrich, Noah J. M.
MacGowan, Stuart A.
Barton, Geoffrey J.
author_facet Utgés, Javier S.
Tsenkov, Maxim I.
Dietrich, Noah J. M.
MacGowan, Stuart A.
Barton, Geoffrey J.
author_sort Utgés, Javier S.
collection PubMed
description Ankyrin protein repeats bind to a wide range of substrates and are one of the most common protein motifs in nature. Here, we collate a high-quality alignment of 7,407 ankyrin repeats and examine for the first time, the distribution of human population variants from large-scale sequencing of healthy individuals across this family. Population variants are not randomly distributed across the genome but are constrained by gene essentiality and function. Accordingly, we interpret the population variants in context with evolutionary constraint and structural features including secondary structure, accessibility and protein-protein interactions across 383 three-dimensional structures of ankyrin repeats. We find five positions that are highly conserved across homologues and also depleted in missense variants within the human population. These positions are significantly enriched in intra-domain contacts and so likely to be key for repeat packing. In contrast, a group of evolutionarily divergent positions are found to be depleted in missense variants in human and significantly enriched in protein-protein interactions. Our analysis also suggests the domain has three, not two surfaces, each with different patterns of enrichment in protein-substrate interactions and missense variants. Our findings will be of interest to those studying or engineering ankyrin-repeat containing proteins as well as those interpreting the significance of disease variants.
format Online
Article
Text
id pubmed-8415598
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-84155982021-09-04 Ankyrin repeats in context with human population variation Utgés, Javier S. Tsenkov, Maxim I. Dietrich, Noah J. M. MacGowan, Stuart A. Barton, Geoffrey J. PLoS Comput Biol Research Article Ankyrin protein repeats bind to a wide range of substrates and are one of the most common protein motifs in nature. Here, we collate a high-quality alignment of 7,407 ankyrin repeats and examine for the first time, the distribution of human population variants from large-scale sequencing of healthy individuals across this family. Population variants are not randomly distributed across the genome but are constrained by gene essentiality and function. Accordingly, we interpret the population variants in context with evolutionary constraint and structural features including secondary structure, accessibility and protein-protein interactions across 383 three-dimensional structures of ankyrin repeats. We find five positions that are highly conserved across homologues and also depleted in missense variants within the human population. These positions are significantly enriched in intra-domain contacts and so likely to be key for repeat packing. In contrast, a group of evolutionarily divergent positions are found to be depleted in missense variants in human and significantly enriched in protein-protein interactions. Our analysis also suggests the domain has three, not two surfaces, each with different patterns of enrichment in protein-substrate interactions and missense variants. Our findings will be of interest to those studying or engineering ankyrin-repeat containing proteins as well as those interpreting the significance of disease variants. Public Library of Science 2021-08-24 /pmc/articles/PMC8415598/ /pubmed/34428215 http://dx.doi.org/10.1371/journal.pcbi.1009335 Text en © 2021 Utgés et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Utgés, Javier S.
Tsenkov, Maxim I.
Dietrich, Noah J. M.
MacGowan, Stuart A.
Barton, Geoffrey J.
Ankyrin repeats in context with human population variation
title Ankyrin repeats in context with human population variation
title_full Ankyrin repeats in context with human population variation
title_fullStr Ankyrin repeats in context with human population variation
title_full_unstemmed Ankyrin repeats in context with human population variation
title_short Ankyrin repeats in context with human population variation
title_sort ankyrin repeats in context with human population variation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8415598/
https://www.ncbi.nlm.nih.gov/pubmed/34428215
http://dx.doi.org/10.1371/journal.pcbi.1009335
work_keys_str_mv AT utgesjaviers ankyrinrepeatsincontextwithhumanpopulationvariation
AT tsenkovmaximi ankyrinrepeatsincontextwithhumanpopulationvariation
AT dietrichnoahjm ankyrinrepeatsincontextwithhumanpopulationvariation
AT macgowanstuarta ankyrinrepeatsincontextwithhumanpopulationvariation
AT bartongeoffreyj ankyrinrepeatsincontextwithhumanpopulationvariation