Cargando…
Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions
The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey ess...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3927604/ https://www.ncbi.nlm.nih.gov/pubmed/24710298 http://dx.doi.org/10.3390/genes2040912 |
_version_ | 1782304151137419264 |
---|---|
author | Parikesit, Arli A. Stadler, Peter F. Prohaska, Sonja J. |
author_facet | Parikesit, Arli A. Stadler, Peter F. Prohaska, Sonja J. |
author_sort | Parikesit, Arli A. |
collection | PubMed |
description | The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey essentially the same information and at the same time are much more readily accessible, although protein domain models trained for one phylogenetic group frequently fail on distantly related sequences. Pooling related domain models based on their GO-annotation in combination with de novo gene prediction methods provides estimates that seem to be less affected by phylogenetic biases. We show here for 18 diverse representatives from all eukaryotic kingdoms that a pooled analysis of the tendencies for co-occurrence or avoidance of protein domains is indeed feasible. This type of analysis can reveal general large-scale patterns in the domain co-occurrence and helps to identify lineage-specific variations in the evolution of protein domains. Somewhat surprisingly, we do not find strong ubiquitous patterns governing the evolutionary behavior of specific functional classes. Instead, there are strong variations between the major groups of Eukaryotes, pointing at systematic differences in their evolutionary constraints. |
format | Online Article Text |
id | pubmed-3927604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-39276042014-03-26 Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions Parikesit, Arli A. Stadler, Peter F. Prohaska, Sonja J. Genes (Basel) Article The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey essentially the same information and at the same time are much more readily accessible, although protein domain models trained for one phylogenetic group frequently fail on distantly related sequences. Pooling related domain models based on their GO-annotation in combination with de novo gene prediction methods provides estimates that seem to be less affected by phylogenetic biases. We show here for 18 diverse representatives from all eukaryotic kingdoms that a pooled analysis of the tendencies for co-occurrence or avoidance of protein domains is indeed feasible. This type of analysis can reveal general large-scale patterns in the domain co-occurrence and helps to identify lineage-specific variations in the evolution of protein domains. Somewhat surprisingly, we do not find strong ubiquitous patterns governing the evolutionary behavior of specific functional classes. Instead, there are strong variations between the major groups of Eukaryotes, pointing at systematic differences in their evolutionary constraints. MDPI 2011-11-09 /pmc/articles/PMC3927604/ /pubmed/24710298 http://dx.doi.org/10.3390/genes2040912 Text en © 2011 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/.) |
spellingShingle | Article Parikesit, Arli A. Stadler, Peter F. Prohaska, Sonja J. Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions |
title | Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions |
title_full | Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions |
title_fullStr | Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions |
title_full_unstemmed | Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions |
title_short | Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions |
title_sort | evolution and quantitative comparison of genome-wide protein domain distributions |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3927604/ https://www.ncbi.nlm.nih.gov/pubmed/24710298 http://dx.doi.org/10.3390/genes2040912 |
work_keys_str_mv | AT parikesitarlia evolutionandquantitativecomparisonofgenomewideproteindomaindistributions AT stadlerpeterf evolutionandquantitativecomparisonofgenomewideproteindomaindistributions AT prohaskasonjaj evolutionandquantitativecomparisonofgenomewideproteindomaindistributions |