Cargando…

Protein Family Expansions and Biological Complexity

During the course of evolution, new proteins are produced very largely as the result of gene duplication, divergence and, in many cases, combination. This means that proteins or protein domains belong to families or, in cases where their relationships can only be recognised on the basis of structure...

Descripción completa

Detalles Bibliográficos
Autores principales: Vogel, Christine, Chothia, Cyrus
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1464810/
https://www.ncbi.nlm.nih.gov/pubmed/16733546
http://dx.doi.org/10.1371/journal.pcbi.0020048
_version_ 1782127552165314560
author Vogel, Christine
Chothia, Cyrus
author_facet Vogel, Christine
Chothia, Cyrus
author_sort Vogel, Christine
collection PubMed
description During the course of evolution, new proteins are produced very largely as the result of gene duplication, divergence and, in many cases, combination. This means that proteins or protein domains belong to families or, in cases where their relationships can only be recognised on the basis of structure, superfamilies whose members descended from a common ancestor. The size of superfamilies can vary greatly. Also, during the course of evolution organisms of increasing complexity have arisen. In this paper we determine the identity of those superfamilies whose relative sizes in different organisms are highly correlated to the complexity of the organisms. As a measure of the complexity of 38 uni- and multicellular eukaryotes we took the number of different cell types of which they are composed. Of 1,219 superfamilies, there are 194 whose sizes in the 38 organisms are strongly correlated with the number of cell types in the organisms. We give outline descriptions of these superfamilies. Half are involved in extracellular processes or regulation and smaller proportions in other types of activity. Half of all superfamilies have no significant correlation with complexity. We also determined whether the expansions of large superfamilies correlate with each other. We found three large clusters of correlated expansions: one involves expansions in both vertebrates and plants, one just in vertebrates, and one just in plants. Our work identifies important protein families and provides one explanation of the discrepancy between the total number of genes and the apparent physiological complexity of eukaryotic organisms.
format Text
id pubmed-1464810
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-14648102006-05-26 Protein Family Expansions and Biological Complexity Vogel, Christine Chothia, Cyrus PLoS Comput Biol Research Article During the course of evolution, new proteins are produced very largely as the result of gene duplication, divergence and, in many cases, combination. This means that proteins or protein domains belong to families or, in cases where their relationships can only be recognised on the basis of structure, superfamilies whose members descended from a common ancestor. The size of superfamilies can vary greatly. Also, during the course of evolution organisms of increasing complexity have arisen. In this paper we determine the identity of those superfamilies whose relative sizes in different organisms are highly correlated to the complexity of the organisms. As a measure of the complexity of 38 uni- and multicellular eukaryotes we took the number of different cell types of which they are composed. Of 1,219 superfamilies, there are 194 whose sizes in the 38 organisms are strongly correlated with the number of cell types in the organisms. We give outline descriptions of these superfamilies. Half are involved in extracellular processes or regulation and smaller proportions in other types of activity. Half of all superfamilies have no significant correlation with complexity. We also determined whether the expansions of large superfamilies correlate with each other. We found three large clusters of correlated expansions: one involves expansions in both vertebrates and plants, one just in vertebrates, and one just in plants. Our work identifies important protein families and provides one explanation of the discrepancy between the total number of genes and the apparent physiological complexity of eukaryotic organisms. Public Library of Science 2006-05 2006-05-26 /pmc/articles/PMC1464810/ /pubmed/16733546 http://dx.doi.org/10.1371/journal.pcbi.0020048 Text en © 2006 Vogel and Chothia. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Vogel, Christine
Chothia, Cyrus
Protein Family Expansions and Biological Complexity
title Protein Family Expansions and Biological Complexity
title_full Protein Family Expansions and Biological Complexity
title_fullStr Protein Family Expansions and Biological Complexity
title_full_unstemmed Protein Family Expansions and Biological Complexity
title_short Protein Family Expansions and Biological Complexity
title_sort protein family expansions and biological complexity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1464810/
https://www.ncbi.nlm.nih.gov/pubmed/16733546
http://dx.doi.org/10.1371/journal.pcbi.0020048
work_keys_str_mv AT vogelchristine proteinfamilyexpansionsandbiologicalcomplexity
AT chothiacyrus proteinfamilyexpansionsandbiologicalcomplexity