Cargando…

Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction

BACKGROUND: Protein-protein interaction (PPI) networks (interactomes) of most organisms, except for some model organisms, are largely unknown. Experimental methods including high-throughput techniques are highly resource intensive. Therefore, computational discovery of PPIs can accelerate biological...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ananthasubramanian, Seshan, Metri, Rahul, Khetan, Ankur, Gupta, Aman, Handen, Adam, Chandra, Nagasuma, Ganapathiraju, Madhavi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3353838/ https://www.ncbi.nlm.nih.gov/pubmed/22587966 http://dx.doi.org/10.1186/2042-5783-2-4

_version_	1782233100221153280
author	Ananthasubramanian, Seshan Metri, Rahul Khetan, Ankur Gupta, Aman Handen, Adam Chandra, Nagasuma Ganapathiraju, Madhavi
author_facet	Ananthasubramanian, Seshan Metri, Rahul Khetan, Ankur Gupta, Aman Handen, Adam Chandra, Nagasuma Ganapathiraju, Madhavi
author_sort	Ananthasubramanian, Seshan
collection	PubMed
description	BACKGROUND: Protein-protein interaction (PPI) networks (interactomes) of most organisms, except for some model organisms, are largely unknown. Experimental methods including high-throughput techniques are highly resource intensive. Therefore, computational discovery of PPIs can accelerate biological discovery by presenting "most-promising" pairs of proteins that are likely to interact. For many bacteria, genome sequence, and thereby genomic context of proteomes, is readily available; additionally, for some of these proteomes, localization and functional annotations are also available, but interactomes are not available. We present here a method for rapid development of computational system to predict interactome of bacterial proteomes. While other studies have presented methods to transfer interologs across species, here, we propose transfer of computational models to benefit from cross-species annotations, thereby predicting many more novel interactions even in the absence of interologs. Mycobacterium tuberculosis (Mtb) and Clostridium difficile (CD) have been used to demonstrate the work. RESULTS: We developed a random forest classifier over features derived from Gene Ontology annotations and genetic context scores provided by STRING database for predicting Mtb and CD interactions independently. The Mtb classifier gave a precision of 94% and a recall of 23% on a held out test set. The Mtb model was then run on all the 8 million protein pairs of the Mtb proteome, resulting in 708 new interactions (at 94% expected precision) or 1,595 new interactions at 80% expected precision. The CD classifier gave a precision of 90% and a recall of 16% on a held out test set. The CD model was run on all the 8 million protein pairs of the CD proteome, resulting in 143 new interactions (at 90% expected precision) or 580 new interactions (at 80% expected precision). We also compared the overlap of predictions of our method with STRING database interactions for CD and Mtb and also with interactions identified recently by a bacterial 2-hybrid system for Mtb. To demonstrate the utility of transfer of computational models, we made use of the developed Mtb model and used it to predict CD protein-pairs. The cross species model thus developed yielded a precision of 88% at a recall of 8%. To demonstrate transfer of features from other organisms in the absence of feature-based and interaction-based information, we transferred missing feature values from Mtb orthologs into the CD data. In transferring this data from orthologs (not interologs), we showed that a large number of interactions can be predicted. CONCLUSIONS: Rapid discovery of (partial) bacterial interactome can be made by using existing set of GO and STRING features associated with the organisms. We can make use of cross-species interactome development, when there are not even sufficient known interactions to develop a computational prediction system. Computational model of well-studied organism(s) can be employed to make the initial interactome prediction for the target organism. We have also demonstrated successfully, that annotations can be transferred from orthologs in well-studied organisms enabling accurate predictions for organisms with no annotations. These approaches can serve as building blocks to address the challenges associated with feature coverage, missing interactions towards rapid interactome discovery for bacterial organisms. AVAILABILITY: The predictions for all Mtb and CD proteins are made available at: http://severus.dbmi.pitt.edu/TB and http://severus.dbmi.pitt.edu/CD respectively for browsing as well as for download.
format	Online Article Text
id	pubmed-3353838
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-33538382012-05-17 Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction Ananthasubramanian, Seshan Metri, Rahul Khetan, Ankur Gupta, Aman Handen, Adam Chandra, Nagasuma Ganapathiraju, Madhavi Microb Inform Exp Research BACKGROUND: Protein-protein interaction (PPI) networks (interactomes) of most organisms, except for some model organisms, are largely unknown. Experimental methods including high-throughput techniques are highly resource intensive. Therefore, computational discovery of PPIs can accelerate biological discovery by presenting "most-promising" pairs of proteins that are likely to interact. For many bacteria, genome sequence, and thereby genomic context of proteomes, is readily available; additionally, for some of these proteomes, localization and functional annotations are also available, but interactomes are not available. We present here a method for rapid development of computational system to predict interactome of bacterial proteomes. While other studies have presented methods to transfer interologs across species, here, we propose transfer of computational models to benefit from cross-species annotations, thereby predicting many more novel interactions even in the absence of interologs. Mycobacterium tuberculosis (Mtb) and Clostridium difficile (CD) have been used to demonstrate the work. RESULTS: We developed a random forest classifier over features derived from Gene Ontology annotations and genetic context scores provided by STRING database for predicting Mtb and CD interactions independently. The Mtb classifier gave a precision of 94% and a recall of 23% on a held out test set. The Mtb model was then run on all the 8 million protein pairs of the Mtb proteome, resulting in 708 new interactions (at 94% expected precision) or 1,595 new interactions at 80% expected precision. The CD classifier gave a precision of 90% and a recall of 16% on a held out test set. The CD model was run on all the 8 million protein pairs of the CD proteome, resulting in 143 new interactions (at 90% expected precision) or 580 new interactions (at 80% expected precision). We also compared the overlap of predictions of our method with STRING database interactions for CD and Mtb and also with interactions identified recently by a bacterial 2-hybrid system for Mtb. To demonstrate the utility of transfer of computational models, we made use of the developed Mtb model and used it to predict CD protein-pairs. The cross species model thus developed yielded a precision of 88% at a recall of 8%. To demonstrate transfer of features from other organisms in the absence of feature-based and interaction-based information, we transferred missing feature values from Mtb orthologs into the CD data. In transferring this data from orthologs (not interologs), we showed that a large number of interactions can be predicted. CONCLUSIONS: Rapid discovery of (partial) bacterial interactome can be made by using existing set of GO and STRING features associated with the organisms. We can make use of cross-species interactome development, when there are not even sufficient known interactions to develop a computational prediction system. Computational model of well-studied organism(s) can be employed to make the initial interactome prediction for the target organism. We have also demonstrated successfully, that annotations can be transferred from orthologs in well-studied organisms enabling accurate predictions for organisms with no annotations. These approaches can serve as building blocks to address the challenges associated with feature coverage, missing interactions towards rapid interactome discovery for bacterial organisms. AVAILABILITY: The predictions for all Mtb and CD proteins are made available at: http://severus.dbmi.pitt.edu/TB and http://severus.dbmi.pitt.edu/CD respectively for browsing as well as for download. BioMed Central 2012-03-21 /pmc/articles/PMC3353838/ /pubmed/22587966 http://dx.doi.org/10.1186/2042-5783-2-4 Text en Copyright ©2012 Ananthasubramanian et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Ananthasubramanian, Seshan Metri, Rahul Khetan, Ankur Gupta, Aman Handen, Adam Chandra, Nagasuma Ganapathiraju, Madhavi Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction
title	Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction
title_full	Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction
title_fullStr	Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction
title_full_unstemmed	Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction
title_short	Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction
title_sort	mycobacterium tuberculosis and clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3353838/ https://www.ncbi.nlm.nih.gov/pubmed/22587966 http://dx.doi.org/10.1186/2042-5783-2-4
work_keys_str_mv	AT ananthasubramanianseshan mycobacteriumtuberculosisandclostridiumdifficilleinteractomesdemonstrationofrapiddevelopmentofcomputationalsystemforbacterialinteractomeprediction AT metrirahul mycobacteriumtuberculosisandclostridiumdifficilleinteractomesdemonstrationofrapiddevelopmentofcomputationalsystemforbacterialinteractomeprediction AT khetanankur mycobacteriumtuberculosisandclostridiumdifficilleinteractomesdemonstrationofrapiddevelopmentofcomputationalsystemforbacterialinteractomeprediction AT guptaaman mycobacteriumtuberculosisandclostridiumdifficilleinteractomesdemonstrationofrapiddevelopmentofcomputationalsystemforbacterialinteractomeprediction AT handenadam mycobacteriumtuberculosisandclostridiumdifficilleinteractomesdemonstrationofrapiddevelopmentofcomputationalsystemforbacterialinteractomeprediction AT chandranagasuma mycobacteriumtuberculosisandclostridiumdifficilleinteractomesdemonstrationofrapiddevelopmentofcomputationalsystemforbacterialinteractomeprediction AT ganapathirajumadhavi mycobacteriumtuberculosisandclostridiumdifficilleinteractomesdemonstrationofrapiddevelopmentofcomputationalsystemforbacterialinteractomeprediction

Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction

Ejemplares similares