Cargando…

Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis

BACKGROUND: Combining multiple evidence-types from different information sources has the potential to reveal new relationships in biological systems. The integrated information can be represented as a relationship network, and clustering the network can suggest possible functional modules. The value...

Descripción completa

Detalles Bibliográficos
Autores principales: Lysenko, Artem, Defoin-Platel, Michael, Hassani-Pak, Keywan, Taubert, Jan, Hodgman, Charlie, Rawlings, Christopher J, Saqi, Mansoor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3118170/
https://www.ncbi.nlm.nih.gov/pubmed/21612636
http://dx.doi.org/10.1186/1471-2105-12-203
_version_ 1782206428976513024
author Lysenko, Artem
Defoin-Platel, Michael
Hassani-Pak, Keywan
Taubert, Jan
Hodgman, Charlie
Rawlings, Christopher J
Saqi, Mansoor
author_facet Lysenko, Artem
Defoin-Platel, Michael
Hassani-Pak, Keywan
Taubert, Jan
Hodgman, Charlie
Rawlings, Christopher J
Saqi, Mansoor
author_sort Lysenko, Artem
collection PubMed
description BACKGROUND: Combining multiple evidence-types from different information sources has the potential to reveal new relationships in biological systems. The integrated information can be represented as a relationship network, and clustering the network can suggest possible functional modules. The value of such modules for gaining insight into the underlying biological processes depends on their functional coherence. The challenges that we wish to address are to define and quantify the functional coherence of modules in relationship networks, so that they can be used to infer function of as yet unannotated proteins, to discover previously unknown roles of proteins in diseases as well as for better understanding of the regulation and interrelationship between different elements of complex biological systems. RESULTS: We have defined the functional coherence of modules with respect to the Gene Ontology (GO) by considering two complementary aspects: (i) the fragmentation of the GO functional categories into the different modules and (ii) the most representative functions of the modules. We have proposed a set of metrics to evaluate these two aspects and demonstrated their utility in Arabidopsis thaliana. We selected 2355 proteins for which experimentally established protein-protein interaction (PPI) data were available. From these we have constructed five relationship networks, four based on single types of data: PPI, co-expression, co-occurrence of protein names in scientific literature abstracts and sequence similarity and a fifth one combining these four evidence types. The ability of these networks to suggest biologically meaningful grouping of proteins was explored by applying Markov clustering and then by measuring the functional coherence of the clusters. CONCLUSIONS: Relationship networks integrating multiple evidence-types are biologically informative and allow more proteins to be assigned to a putative functional module. Using additional evidence types concentrates the functional annotations in a smaller number of modules without unduly compromising their consistency. These results indicate that integration of more data sources improves the ability to uncover functional association between proteins, both by allowing more proteins to be linked and producing a network where modular structure more closely reflects the hierarchy in the gene ontology.
format Online
Article
Text
id pubmed-3118170
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31181702011-06-19 Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis Lysenko, Artem Defoin-Platel, Michael Hassani-Pak, Keywan Taubert, Jan Hodgman, Charlie Rawlings, Christopher J Saqi, Mansoor BMC Bioinformatics Research Article BACKGROUND: Combining multiple evidence-types from different information sources has the potential to reveal new relationships in biological systems. The integrated information can be represented as a relationship network, and clustering the network can suggest possible functional modules. The value of such modules for gaining insight into the underlying biological processes depends on their functional coherence. The challenges that we wish to address are to define and quantify the functional coherence of modules in relationship networks, so that they can be used to infer function of as yet unannotated proteins, to discover previously unknown roles of proteins in diseases as well as for better understanding of the regulation and interrelationship between different elements of complex biological systems. RESULTS: We have defined the functional coherence of modules with respect to the Gene Ontology (GO) by considering two complementary aspects: (i) the fragmentation of the GO functional categories into the different modules and (ii) the most representative functions of the modules. We have proposed a set of metrics to evaluate these two aspects and demonstrated their utility in Arabidopsis thaliana. We selected 2355 proteins for which experimentally established protein-protein interaction (PPI) data were available. From these we have constructed five relationship networks, four based on single types of data: PPI, co-expression, co-occurrence of protein names in scientific literature abstracts and sequence similarity and a fifth one combining these four evidence types. The ability of these networks to suggest biologically meaningful grouping of proteins was explored by applying Markov clustering and then by measuring the functional coherence of the clusters. CONCLUSIONS: Relationship networks integrating multiple evidence-types are biologically informative and allow more proteins to be assigned to a putative functional module. Using additional evidence types concentrates the functional annotations in a smaller number of modules without unduly compromising their consistency. These results indicate that integration of more data sources improves the ability to uncover functional association between proteins, both by allowing more proteins to be linked and producing a network where modular structure more closely reflects the hierarchy in the gene ontology. BioMed Central 2011-05-25 /pmc/articles/PMC3118170/ /pubmed/21612636 http://dx.doi.org/10.1186/1471-2105-12-203 Text en Copyright ©2011 Lysenko et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lysenko, Artem
Defoin-Platel, Michael
Hassani-Pak, Keywan
Taubert, Jan
Hodgman, Charlie
Rawlings, Christopher J
Saqi, Mansoor
Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis
title Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis
title_full Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis
title_fullStr Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis
title_full_unstemmed Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis
title_short Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis
title_sort assessing the functional coherence of modules found in multiple-evidence networks from arabidopsis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3118170/
https://www.ncbi.nlm.nih.gov/pubmed/21612636
http://dx.doi.org/10.1186/1471-2105-12-203
work_keys_str_mv AT lysenkoartem assessingthefunctionalcoherenceofmodulesfoundinmultipleevidencenetworksfromarabidopsis
AT defoinplatelmichael assessingthefunctionalcoherenceofmodulesfoundinmultipleevidencenetworksfromarabidopsis
AT hassanipakkeywan assessingthefunctionalcoherenceofmodulesfoundinmultipleevidencenetworksfromarabidopsis
AT taubertjan assessingthefunctionalcoherenceofmodulesfoundinmultipleevidencenetworksfromarabidopsis
AT hodgmancharlie assessingthefunctionalcoherenceofmodulesfoundinmultipleevidencenetworksfromarabidopsis
AT rawlingschristopherj assessingthefunctionalcoherenceofmodulesfoundinmultipleevidencenetworksfromarabidopsis
AT saqimansoor assessingthefunctionalcoherenceofmodulesfoundinmultipleevidencenetworksfromarabidopsis