Cargando…

Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions

The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey ess...

Descripción completa

Detalles Bibliográficos
Autores principales: Parikesit, Arli A., Stadler, Peter F., Prohaska, Sonja J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3927604/
https://www.ncbi.nlm.nih.gov/pubmed/24710298
http://dx.doi.org/10.3390/genes2040912
_version_ 1782304151137419264
author Parikesit, Arli A.
Stadler, Peter F.
Prohaska, Sonja J.
author_facet Parikesit, Arli A.
Stadler, Peter F.
Prohaska, Sonja J.
author_sort Parikesit, Arli A.
collection PubMed
description The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey essentially the same information and at the same time are much more readily accessible, although protein domain models trained for one phylogenetic group frequently fail on distantly related sequences. Pooling related domain models based on their GO-annotation in combination with de novo gene prediction methods provides estimates that seem to be less affected by phylogenetic biases. We show here for 18 diverse representatives from all eukaryotic kingdoms that a pooled analysis of the tendencies for co-occurrence or avoidance of protein domains is indeed feasible. This type of analysis can reveal general large-scale patterns in the domain co-occurrence and helps to identify lineage-specific variations in the evolution of protein domains. Somewhat surprisingly, we do not find strong ubiquitous patterns governing the evolutionary behavior of specific functional classes. Instead, there are strong variations between the major groups of Eukaryotes, pointing at systematic differences in their evolutionary constraints.
format Online
Article
Text
id pubmed-3927604
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-39276042014-03-26 Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions Parikesit, Arli A. Stadler, Peter F. Prohaska, Sonja J. Genes (Basel) Article The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey essentially the same information and at the same time are much more readily accessible, although protein domain models trained for one phylogenetic group frequently fail on distantly related sequences. Pooling related domain models based on their GO-annotation in combination with de novo gene prediction methods provides estimates that seem to be less affected by phylogenetic biases. We show here for 18 diverse representatives from all eukaryotic kingdoms that a pooled analysis of the tendencies for co-occurrence or avoidance of protein domains is indeed feasible. This type of analysis can reveal general large-scale patterns in the domain co-occurrence and helps to identify lineage-specific variations in the evolution of protein domains. Somewhat surprisingly, we do not find strong ubiquitous patterns governing the evolutionary behavior of specific functional classes. Instead, there are strong variations between the major groups of Eukaryotes, pointing at systematic differences in their evolutionary constraints. MDPI 2011-11-09 /pmc/articles/PMC3927604/ /pubmed/24710298 http://dx.doi.org/10.3390/genes2040912 Text en © 2011 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/.)
spellingShingle Article
Parikesit, Arli A.
Stadler, Peter F.
Prohaska, Sonja J.
Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions
title Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions
title_full Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions
title_fullStr Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions
title_full_unstemmed Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions
title_short Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions
title_sort evolution and quantitative comparison of genome-wide protein domain distributions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3927604/
https://www.ncbi.nlm.nih.gov/pubmed/24710298
http://dx.doi.org/10.3390/genes2040912
work_keys_str_mv AT parikesitarlia evolutionandquantitativecomparisonofgenomewideproteindomaindistributions
AT stadlerpeterf evolutionandquantitativecomparisonofgenomewideproteindomaindistributions
AT prohaskasonjaj evolutionandquantitativecomparisonofgenomewideproteindomaindistributions