Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity

We consider the problem of biological complexity via a projection of the protein-coding genes of complex organisms onto the functional space of the proteome. The latter can be defined as the set of all functions performed by the proteins of an organism. Alternative splicing (AS) allows an organism to generate...

Bibliographic Details
Main authors: Kanapin, Alexander A, Mulder, Nicola, Kuznetsov, Vladimir A
Format: Text
Language: English
Published: BioMed Central 2010
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2822532/
https://www.ncbi.nlm.nih.gov/pubmed/20158875
http://dx.doi.org/10.1186/1471-2164-11-S1-S4
author Kanapin, Alexander A
Mulder, Nicola
Kuznetsov, Vladimir A
collection PubMed
description We consider the problem of biological complexity via a projection of the protein-coding genes of complex organisms onto the functional space of the proteome. The latter can be defined as the set of all functions performed by the proteins of an organism. Alternative splicing (AS) allows an organism to generate diverse mature RNA transcripts from a single pre-mRNA, and thus it could be one of the key mechanisms for increasing the functional complexity of the organism's proteome and a driving force of biological evolution. Consequently, the projection of transcription units (TU) and alternative splice-variant (SV) forms onto the proteome functional space could generate new types of relational networks (e.g. SV-protein function networks, SFN) and lead to the discovery of novel evolutionarily conserved functional modules. Such networks might provide new, reliable characteristics of organism complexity and a better understanding of the evolutionary integration and plasticity of the interconnections among genome, transcriptome, and proteome functions.
RESULTS: We use the InterPro and UniProt databases to attribute descriptive features (keywords) to protein sequences. The UniProt database includes a controlled, curated vocabulary of specific descriptors, or keywords. Keywords are assigned to a protein sequence via conserved domains or via similarity with annotated sequences. We then treat each unique combination of keywords as a protein functional label (FL) that characterizes the biological functions of the given protein, and we construct contingency tables and graphs that project the transcription units (TU) and alternative splice variants (SV) onto all FLs of the proteome of a given organism. We constructed SFNs for organisms with different evolutionary histories and levels of complexity, and performed a detailed statistical parameterization of the networks.
CONCLUSIONS: Applying the algorithm to organisms with different evolutionary histories and levels of biological complexity (nematode, fruit fly, vertebrates) reveals that the parameters describing an SFN correlate with the complexity of the organism. Using statistical analysis of the links of the functional networks, we propose new features of the evolution of protein function acquisition. We reveal a group of genes and corresponding functions that could be attributed to an early, conserved part of the cellular machinery essential for cell viability and survival. We identify and characterize functional switches in the polyform group of TUs in different organisms. A comparison of mouse and human SFNs demonstrates a role for alternative splicing as a necessary source of evolution towards more complex organisms. The entire set of FLs across many organisms could be used as a draft catalogue of the functional space of the proteome world.
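
The labelling and projection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy records, the keyword strings, the function names, and the choice to skip splice variants without keywords are all assumptions made for illustration. It only shows the idea of treating each unique keyword combination as a functional label (FL) and counting how transcription units (TU) and splice variants (SV) map onto FLs.

```python
from collections import defaultdict

# Hypothetical toy input: (transcription unit, splice variant, keywords).
# Real keywords would come from UniProt/InterPro annotation.
RECORDS = [
    ("TU1", "SV1a", {"Kinase", "ATP-binding"}),
    ("TU1", "SV1b", {"Kinase"}),
    ("TU2", "SV2a", {"ATP-binding", "Kinase"}),
    ("TU3", "SV3a", set()),
]


def functional_label(keywords):
    """Treat the unique combination of keywords as one functional label (FL)."""
    return frozenset(keywords)


def project(records):
    """Return SV->FL and TU->set-of-FL mappings.

    Assumption: variants with no keywords are skipped.
    """
    sv_to_fl = {}
    tu_to_fls = defaultdict(set)
    for tu, sv, keywords in records:
        if not keywords:
            continue
        fl = functional_label(keywords)
        sv_to_fl[sv] = fl
        tu_to_fls[tu].add(fl)
    return sv_to_fl, dict(tu_to_fls)


def contingency(tu_to_fls):
    """Count, for each FL, how many distinct TUs map to it."""
    counts = defaultdict(int)
    for fls in tu_to_fls.values():
        for fl in fls:
            counts[fl] += 1
    return dict(counts)


sv_to_fl, tu_to_fls = project(RECORDS)
fl_counts = contingency(tu_to_fls)
```

Using a `frozenset` makes the label independent of keyword order, so SV1a and SV2a in the toy data receive the same FL. The TU-to-FL mapping can be read as the edge list of a bipartite graph, which is one plausible way to represent an SFN.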
format Text
id pubmed-2822532
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2822532 2010-02-17 Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity Kanapin, Alexander A Mulder, Nicola Kuznetsov, Vladimir A BMC Genomics Research BioMed Central 2010-02-10 /pmc/articles/PMC2822532/ /pubmed/20158875 http://dx.doi.org/10.1186/1471-2164-11-S1-S4 Text en Copyright ©2010 Kanapin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity
topic Research