Cargando…

Efficient Prediction of Co-Complexed Proteins Based on Coevolution

The prediction of the network of protein-protein interactions (PPI) of an organism is crucial for the understanding of biological processes and for the development of new drugs. Machine learning methods have been successfully applied to the prediction of PPI in yeast by the integration of multiple d...

Descripción completa

Detalles Bibliográficos
Autores principales: de Vienne, Damien M., Azé, Jérôme
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494725/
https://www.ncbi.nlm.nih.gov/pubmed/23152796
http://dx.doi.org/10.1371/journal.pone.0048728
_version_ 1782249439054790656
author de Vienne, Damien M.
Azé, Jérôme
author_facet de Vienne, Damien M.
Azé, Jérôme
author_sort de Vienne, Damien M.
collection PubMed
description The prediction of the network of protein-protein interactions (PPI) of an organism is crucial for the understanding of biological processes and for the development of new drugs. Machine learning methods have been successfully applied to the prediction of PPI in yeast by the integration of multiple direct and indirect biological data sources. However, experimental data are not available for most organisms. We propose here an ensemble machine learning approach for the prediction of PPI that depends solely on features independent from experimental data. We developed new estimators of the coevolution between proteins and combined them in an ensemble learning procedure. We applied this method to a dataset of known co-complexed proteins in Escherichia coli and compared it to previously published methods. We show that our method allows prediction of PPI with an unprecedented precision of 95.5% for the first 200 sorted pairs of proteins compared to 28.5% on the same dataset with the previous best method. A close inspection of the best predicted pairs allowed us to detect new or recently discovered interactions between chemotactic components, the flagellar apparatus and RNA polymerase complexes in E. coli.
format Online
Article
Text
id pubmed-3494725
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34947252012-11-14 Efficient Prediction of Co-Complexed Proteins Based on Coevolution de Vienne, Damien M. Azé, Jérôme PLoS One Research Article The prediction of the network of protein-protein interactions (PPI) of an organism is crucial for the understanding of biological processes and for the development of new drugs. Machine learning methods have been successfully applied to the prediction of PPI in yeast by the integration of multiple direct and indirect biological data sources. However, experimental data are not available for most organisms. We propose here an ensemble machine learning approach for the prediction of PPI that depends solely on features independent from experimental data. We developed new estimators of the coevolution between proteins and combined them in an ensemble learning procedure. We applied this method to a dataset of known co-complexed proteins in Escherichia coli and compared it to previously published methods. We show that our method allows prediction of PPI with an unprecedented precision of 95.5% for the first 200 sorted pairs of proteins compared to 28.5% on the same dataset with the previous best method. A close inspection of the best predicted pairs allowed us to detect new or recently discovered interactions between chemotactic components, the flagellar apparatus and RNA polymerase complexes in E. coli. Public Library of Science 2012-11-09 /pmc/articles/PMC3494725/ /pubmed/23152796 http://dx.doi.org/10.1371/journal.pone.0048728 Text en © 2012 de Vienne, Azé http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
de Vienne, Damien M.
Azé, Jérôme
Efficient Prediction of Co-Complexed Proteins Based on Coevolution
title Efficient Prediction of Co-Complexed Proteins Based on Coevolution
title_full Efficient Prediction of Co-Complexed Proteins Based on Coevolution
title_fullStr Efficient Prediction of Co-Complexed Proteins Based on Coevolution
title_full_unstemmed Efficient Prediction of Co-Complexed Proteins Based on Coevolution
title_short Efficient Prediction of Co-Complexed Proteins Based on Coevolution
title_sort efficient prediction of co-complexed proteins based on coevolution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494725/
https://www.ncbi.nlm.nih.gov/pubmed/23152796
http://dx.doi.org/10.1371/journal.pone.0048728
work_keys_str_mv AT deviennedamienm efficientpredictionofcocomplexedproteinsbasedoncoevolution
AT azejerome efficientpredictionofcocomplexedproteinsbasedoncoevolution