
Tandem-repeat protein domains across the tree of life

Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20–40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and as...


Bibliographic Details
Main authors: Jernigan, Kristin K., Bordenstein, Seth R.
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304861/
https://www.ncbi.nlm.nih.gov/pubmed/25653910
http://dx.doi.org/10.7717/peerj.732
_version_ 1782354171316404224
author Jernigan, Kristin K.
Bordenstein, Seth R.
author_facet Jernigan, Kristin K.
Bordenstein, Seth R.
author_sort Jernigan, Kristin K.
collection PubMed
description Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20–40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR- and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated with virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem-repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species.
format Online
Article
Text
id pubmed-4304861
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-4304861 2015-02-04 Tandem-repeat protein domains across the tree of life Jernigan, Kristin K. Bordenstein, Seth R. PeerJ Evolutionary Studies PeerJ Inc.
2015-01-13 /pmc/articles/PMC4304861/ /pubmed/25653910 http://dx.doi.org/10.7717/peerj.732 Text en © 2015 Jernigan and Bordenstein http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Evolutionary Studies
Jernigan, Kristin K.
Bordenstein, Seth R.
Tandem-repeat protein domains across the tree of life
title Tandem-repeat protein domains across the tree of life
title_full Tandem-repeat protein domains across the tree of life
title_fullStr Tandem-repeat protein domains across the tree of life
title_full_unstemmed Tandem-repeat protein domains across the tree of life
title_short Tandem-repeat protein domains across the tree of life
title_sort tandem-repeat protein domains across the tree of life
topic Evolutionary Studies
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4304861/
https://www.ncbi.nlm.nih.gov/pubmed/25653910
http://dx.doi.org/10.7717/peerj.732
work_keys_str_mv AT jernigankristink tandemrepeatproteindomainsacrossthetreeoflife
AT bordensteinsethr tandemrepeatproteindomainsacrossthetreeoflife