Cargando…

Computational prediction of inter-species relationships through omics data analysis and machine learning

BACKGROUND: Antibiotic resistance and its rapid dissemination around the world threaten the efficacy of currently-used medical treatments and call for novel, innovative approaches to manage multi-drug resistant infections. Phage therapy, i.e., the use of viruses (phages) to specifically infect and k...

Descripción completa

Detalles Bibliográficos
Autores principales: Leite, Diogo Manuel Carvalho, Brochet, Xavier, Resch, Grégory, Que, Yok-Ai, Neves, Aitana, Peña-Reyes, Carlos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245486/
https://www.ncbi.nlm.nih.gov/pubmed/30453987
http://dx.doi.org/10.1186/s12859-018-2388-7
_version_ 1783372251736834048
author Leite, Diogo Manuel Carvalho
Brochet, Xavier
Resch, Grégory
Que, Yok-Ai
Neves, Aitana
Peña-Reyes, Carlos
author_facet Leite, Diogo Manuel Carvalho
Brochet, Xavier
Resch, Grégory
Que, Yok-Ai
Neves, Aitana
Peña-Reyes, Carlos
author_sort Leite, Diogo Manuel Carvalho
collection PubMed
description BACKGROUND: Antibiotic resistance and its rapid dissemination around the world threaten the efficacy of currently-used medical treatments and call for novel, innovative approaches to manage multi-drug resistant infections. Phage therapy, i.e., the use of viruses (phages) to specifically infect and kill bacteria during their life cycle, is one of the most promising alternatives to antibiotics. It is based on the correct matching between a target pathogenic bacteria and the therapeutic phage. Nevertheless, correctly matching them is a major challenge. Currently, there is no systematic method to efficiently predict whether phage-bacterium interactions exist and these pairs must be empirically tested in laboratory. Herein, we present our approach for developing a computational model able to predict whether a given phage-bacterium pair can interact based on their genome. RESULTS: Based on public data from GenBank and phagesDB.org, we collected more than a thousand positive phage-bacterium interactions with their complete genomes. In addition, we generated putative negative (i.e., non-interacting) pairs. We extracted, from the collected genomes, a set of informative features based on the distribution of predictive protein-protein interactions and on their primary structure (e.g. amino-acid frequency, molecular weight and chemical composition of each protein). With these features, we generated multiple candidate datasets to train our algorithms. On this base, we built predictive models exhibiting predictive performance of around 90% in terms of F1-score, sensitivity, specificity, and accuracy, obtained on the test set with 10-fold cross-validation. CONCLUSION: These promising results reinforce the hypothesis that machine learning techniques may produce highly-predictive models accelerating the search of interacting phage-bacteria pairs.
format Online
Article
Text
id pubmed-6245486
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62454862018-11-26 Computational prediction of inter-species relationships through omics data analysis and machine learning Leite, Diogo Manuel Carvalho Brochet, Xavier Resch, Grégory Que, Yok-Ai Neves, Aitana Peña-Reyes, Carlos BMC Bioinformatics Research BACKGROUND: Antibiotic resistance and its rapid dissemination around the world threaten the efficacy of currently-used medical treatments and call for novel, innovative approaches to manage multi-drug resistant infections. Phage therapy, i.e., the use of viruses (phages) to specifically infect and kill bacteria during their life cycle, is one of the most promising alternatives to antibiotics. It is based on the correct matching between a target pathogenic bacteria and the therapeutic phage. Nevertheless, correctly matching them is a major challenge. Currently, there is no systematic method to efficiently predict whether phage-bacterium interactions exist and these pairs must be empirically tested in laboratory. Herein, we present our approach for developing a computational model able to predict whether a given phage-bacterium pair can interact based on their genome. RESULTS: Based on public data from GenBank and phagesDB.org, we collected more than a thousand positive phage-bacterium interactions with their complete genomes. In addition, we generated putative negative (i.e., non-interacting) pairs. We extracted, from the collected genomes, a set of informative features based on the distribution of predictive protein-protein interactions and on their primary structure (e.g. amino-acid frequency, molecular weight and chemical composition of each protein). With these features, we generated multiple candidate datasets to train our algorithms. On this base, we built predictive models exhibiting predictive performance of around 90% in terms of F1-score, sensitivity, specificity, and accuracy, obtained on the test set with 10-fold cross-validation. CONCLUSION: These promising results reinforce the hypothesis that machine learning techniques may produce highly-predictive models accelerating the search of interacting phage-bacteria pairs. BioMed Central 2018-11-20 /pmc/articles/PMC6245486/ /pubmed/30453987 http://dx.doi.org/10.1186/s12859-018-2388-7 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Leite, Diogo Manuel Carvalho
Brochet, Xavier
Resch, Grégory
Que, Yok-Ai
Neves, Aitana
Peña-Reyes, Carlos
Computational prediction of inter-species relationships through omics data analysis and machine learning
title Computational prediction of inter-species relationships through omics data analysis and machine learning
title_full Computational prediction of inter-species relationships through omics data analysis and machine learning
title_fullStr Computational prediction of inter-species relationships through omics data analysis and machine learning
title_full_unstemmed Computational prediction of inter-species relationships through omics data analysis and machine learning
title_short Computational prediction of inter-species relationships through omics data analysis and machine learning
title_sort computational prediction of inter-species relationships through omics data analysis and machine learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245486/
https://www.ncbi.nlm.nih.gov/pubmed/30453987
http://dx.doi.org/10.1186/s12859-018-2388-7
work_keys_str_mv AT leitediogomanuelcarvalho computationalpredictionofinterspeciesrelationshipsthroughomicsdataanalysisandmachinelearning
AT brochetxavier computationalpredictionofinterspeciesrelationshipsthroughomicsdataanalysisandmachinelearning
AT reschgregory computationalpredictionofinterspeciesrelationshipsthroughomicsdataanalysisandmachinelearning
AT queyokai computationalpredictionofinterspeciesrelationshipsthroughomicsdataanalysisandmachinelearning
AT nevesaitana computationalpredictionofinterspeciesrelationshipsthroughomicsdataanalysisandmachinelearning
AT penareyescarlos computationalpredictionofinterspeciesrelationshipsthroughomicsdataanalysisandmachinelearning