Cargando…
Efficient Prediction of Co-Complexed Proteins Based on Coevolution
The prediction of the network of protein-protein interactions (PPI) of an organism is crucial for the understanding of biological processes and for the development of new drugs. Machine learning methods have been successfully applied to the prediction of PPI in yeast by the integration of multiple d...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494725/ https://www.ncbi.nlm.nih.gov/pubmed/23152796 http://dx.doi.org/10.1371/journal.pone.0048728 |
_version_ | 1782249439054790656 |
---|---|
author | de Vienne, Damien M. Azé, Jérôme |
author_facet | de Vienne, Damien M. Azé, Jérôme |
author_sort | de Vienne, Damien M. |
collection | PubMed |
description | The prediction of the network of protein-protein interactions (PPI) of an organism is crucial for the understanding of biological processes and for the development of new drugs. Machine learning methods have been successfully applied to the prediction of PPI in yeast by the integration of multiple direct and indirect biological data sources. However, experimental data are not available for most organisms. We propose here an ensemble machine learning approach for the prediction of PPI that depends solely on features independent from experimental data. We developed new estimators of the coevolution between proteins and combined them in an ensemble learning procedure. We applied this method to a dataset of known co-complexed proteins in Escherichia coli and compared it to previously published methods. We show that our method allows prediction of PPI with an unprecedented precision of 95.5% for the first 200 sorted pairs of proteins compared to 28.5% on the same dataset with the previous best method. A close inspection of the best predicted pairs allowed us to detect new or recently discovered interactions between chemotactic components, the flagellar apparatus and RNA polymerase complexes in E. coli. |
format | Online Article Text |
id | pubmed-3494725 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-34947252012-11-14 Efficient Prediction of Co-Complexed Proteins Based on Coevolution de Vienne, Damien M. Azé, Jérôme PLoS One Research Article The prediction of the network of protein-protein interactions (PPI) of an organism is crucial for the understanding of biological processes and for the development of new drugs. Machine learning methods have been successfully applied to the prediction of PPI in yeast by the integration of multiple direct and indirect biological data sources. However, experimental data are not available for most organisms. We propose here an ensemble machine learning approach for the prediction of PPI that depends solely on features independent from experimental data. We developed new estimators of the coevolution between proteins and combined them in an ensemble learning procedure. We applied this method to a dataset of known co-complexed proteins in Escherichia coli and compared it to previously published methods. We show that our method allows prediction of PPI with an unprecedented precision of 95.5% for the first 200 sorted pairs of proteins compared to 28.5% on the same dataset with the previous best method. A close inspection of the best predicted pairs allowed us to detect new or recently discovered interactions between chemotactic components, the flagellar apparatus and RNA polymerase complexes in E. coli. Public Library of Science 2012-11-09 /pmc/articles/PMC3494725/ /pubmed/23152796 http://dx.doi.org/10.1371/journal.pone.0048728 Text en © 2012 de Vienne, Azé http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article de Vienne, Damien M. Azé, Jérôme Efficient Prediction of Co-Complexed Proteins Based on Coevolution |
title | Efficient Prediction of Co-Complexed Proteins Based on Coevolution |
title_full | Efficient Prediction of Co-Complexed Proteins Based on Coevolution |
title_fullStr | Efficient Prediction of Co-Complexed Proteins Based on Coevolution |
title_full_unstemmed | Efficient Prediction of Co-Complexed Proteins Based on Coevolution |
title_short | Efficient Prediction of Co-Complexed Proteins Based on Coevolution |
title_sort | efficient prediction of co-complexed proteins based on coevolution |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494725/ https://www.ncbi.nlm.nih.gov/pubmed/23152796 http://dx.doi.org/10.1371/journal.pone.0048728 |
work_keys_str_mv | AT deviennedamienm efficientpredictionofcocomplexedproteinsbasedoncoevolution AT azejerome efficientpredictionofcocomplexedproteinsbasedoncoevolution |