Cargando…

The relationship between proteome size, structural disorder and organism complexity

BACKGROUND: Sequencing the genomes of the first few eukaryotes created the impression that gene number shows no correlation with organism complexity, often referred to as the G-value paradox. Several attempts have previously been made to resolve this paradox, citing multifunctionality of proteins, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Schad, Eva, Tompa, Peter, Hegyi, Hedi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3334615/
https://www.ncbi.nlm.nih.gov/pubmed/22182830
http://dx.doi.org/10.1186/gb-2011-12-12-r120
_version_ 1782230652579479552
author Schad, Eva
Tompa, Peter
Hegyi, Hedi
author_facet Schad, Eva
Tompa, Peter
Hegyi, Hedi
author_sort Schad, Eva
collection PubMed
description BACKGROUND: Sequencing the genomes of the first few eukaryotes created the impression that gene number shows no correlation with organism complexity, often referred to as the G-value paradox. Several attempts have previously been made to resolve this paradox, citing multifunctionality of proteins, alternative splicing, microRNAs or non-coding DNA. As intrinsic protein disorder has been linked with complex responses to environmental stimuli and communication between cells, an additional possibility is that structural disorder may effectively increase the complexity of species. RESULTS: We revisited the G-value paradox by analyzing many new proteomes whose complexity measured with their number of distinct cell types is known. We found that complexity and proteome size measured by the total number of amino acids correlate significantly and have a power function relationship. We systematically analyzed numerous other features in relation to complexity in several organisms and tissues and found: the fraction of protein structural disorder increases significantly between prokaryotes and eukaryotes but does not further increase over the course of evolution; the number of predicted binding sites in disordered regions in a proteome increases with complexity; the fraction of protein disorder, predicted binding sites, alternative splicing and protein-protein interactions all increase with the complexity of human tissues. CONCLUSIONS: We conclude that complexity is a multi-parametric trait, determined by interaction potential, alternative splicing capacity, tissue-specific protein disorder and, above all, proteome size. The G-value paradox is only apparent when plants are grouped with metazoans, as they have a different relationship between complexity and proteome size.
format Online
Article
Text
id pubmed-3334615
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33346152012-04-25 The relationship between proteome size, structural disorder and organism complexity Schad, Eva Tompa, Peter Hegyi, Hedi Genome Biol Research BACKGROUND: Sequencing the genomes of the first few eukaryotes created the impression that gene number shows no correlation with organism complexity, often referred to as the G-value paradox. Several attempts have previously been made to resolve this paradox, citing multifunctionality of proteins, alternative splicing, microRNAs or non-coding DNA. As intrinsic protein disorder has been linked with complex responses to environmental stimuli and communication between cells, an additional possibility is that structural disorder may effectively increase the complexity of species. RESULTS: We revisited the G-value paradox by analyzing many new proteomes whose complexity measured with their number of distinct cell types is known. We found that complexity and proteome size measured by the total number of amino acids correlate significantly and have a power function relationship. We systematically analyzed numerous other features in relation to complexity in several organisms and tissues and found: the fraction of protein structural disorder increases significantly between prokaryotes and eukaryotes but does not further increase over the course of evolution; the number of predicted binding sites in disordered regions in a proteome increases with complexity; the fraction of protein disorder, predicted binding sites, alternative splicing and protein-protein interactions all increase with the complexity of human tissues. CONCLUSIONS: We conclude that complexity is a multi-parametric trait, determined by interaction potential, alternative splicing capacity, tissue-specific protein disorder and, above all, proteome size. The G-value paradox is only apparent when plants are grouped with metazoans, as they have a different relationship between complexity and proteome size. BioMed Central 2011 2011-12-19 /pmc/articles/PMC3334615/ /pubmed/22182830 http://dx.doi.org/10.1186/gb-2011-12-12-r120 Text en Copyright ©2011 Schad et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Schad, Eva
Tompa, Peter
Hegyi, Hedi
The relationship between proteome size, structural disorder and organism complexity
title The relationship between proteome size, structural disorder and organism complexity
title_full The relationship between proteome size, structural disorder and organism complexity
title_fullStr The relationship between proteome size, structural disorder and organism complexity
title_full_unstemmed The relationship between proteome size, structural disorder and organism complexity
title_short The relationship between proteome size, structural disorder and organism complexity
title_sort relationship between proteome size, structural disorder and organism complexity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3334615/
https://www.ncbi.nlm.nih.gov/pubmed/22182830
http://dx.doi.org/10.1186/gb-2011-12-12-r120
work_keys_str_mv AT schadeva therelationshipbetweenproteomesizestructuraldisorderandorganismcomplexity
AT tompapeter therelationshipbetweenproteomesizestructuraldisorderandorganismcomplexity
AT hegyihedi therelationshipbetweenproteomesizestructuraldisorderandorganismcomplexity
AT schadeva relationshipbetweenproteomesizestructuraldisorderandorganismcomplexity
AT tompapeter relationshipbetweenproteomesizestructuraldisorderandorganismcomplexity
AT hegyihedi relationshipbetweenproteomesizestructuraldisorderandorganismcomplexity