Cargando…

Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins

Nowadays, bacteriophages are increasingly considered as an alternative treatment for a variety of bacterial infections in cases where classical antibiotics have become ineffective. However, characterizing the host specificity of phages remains a labor- and time-intensive process. In order to allevia...

Descripción completa

Detalles Bibliográficos
Autores principales: Boeckaerts, Dimitri, Stock, Michiel, Criel, Bjorn, Gerstmans, Hans, De Baets, Bernard, Briers, Yves
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7809048/
https://www.ncbi.nlm.nih.gov/pubmed/33446856
http://dx.doi.org/10.1038/s41598-021-81063-4
_version_ 1783637035115872256
author Boeckaerts, Dimitri
Stock, Michiel
Criel, Bjorn
Gerstmans, Hans
De Baets, Bernard
Briers, Yves
author_facet Boeckaerts, Dimitri
Stock, Michiel
Criel, Bjorn
Gerstmans, Hans
De Baets, Bernard
Briers, Yves
author_sort Boeckaerts, Dimitri
collection PubMed
description Nowadays, bacteriophages are increasingly considered as an alternative treatment for a variety of bacterial infections in cases where classical antibiotics have become ineffective. However, characterizing the host specificity of phages remains a labor- and time-intensive process. In order to alleviate this burden, we have developed a new machine-learning-based pipeline to predict bacteriophage hosts based on annotated receptor-binding protein (RBP) sequence data. We focus on predicting bacterial hosts from the ESKAPE group, Escherichia coli, Salmonella enterica and Clostridium difficile. We compare the performance of our predictive model with that of the widely used Basic Local Alignment Search Tool (BLAST). Our best-performing predictive model reaches Precision-Recall Area Under the Curve (PR-AUC) scores between 73.6 and 93.8% for different levels of sequence similarity in the collected data. Our model reaches a performance comparable to that of BLASTp when sequence similarity in the data is high and starts outperforming BLASTp when sequence similarity drops below 75%. Therefore, our machine learning methods can be especially useful in settings in which sequence similarity to other known sequences is low. Predicting the hosts of novel metagenomic RBP sequences could extend our toolbox to tune the host spectrum of phages or phage tail-like bacteriocins by swapping RBPs.
format Online
Article
Text
id pubmed-7809048
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-78090482021-01-15 Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins Boeckaerts, Dimitri Stock, Michiel Criel, Bjorn Gerstmans, Hans De Baets, Bernard Briers, Yves Sci Rep Article Nowadays, bacteriophages are increasingly considered as an alternative treatment for a variety of bacterial infections in cases where classical antibiotics have become ineffective. However, characterizing the host specificity of phages remains a labor- and time-intensive process. In order to alleviate this burden, we have developed a new machine-learning-based pipeline to predict bacteriophage hosts based on annotated receptor-binding protein (RBP) sequence data. We focus on predicting bacterial hosts from the ESKAPE group, Escherichia coli, Salmonella enterica and Clostridium difficile. We compare the performance of our predictive model with that of the widely used Basic Local Alignment Search Tool (BLAST). Our best-performing predictive model reaches Precision-Recall Area Under the Curve (PR-AUC) scores between 73.6 and 93.8% for different levels of sequence similarity in the collected data. Our model reaches a performance comparable to that of BLASTp when sequence similarity in the data is high and starts outperforming BLASTp when sequence similarity drops below 75%. Therefore, our machine learning methods can be especially useful in settings in which sequence similarity to other known sequences is low. Predicting the hosts of novel metagenomic RBP sequences could extend our toolbox to tune the host spectrum of phages or phage tail-like bacteriocins by swapping RBPs. Nature Publishing Group UK 2021-01-14 /pmc/articles/PMC7809048/ /pubmed/33446856 http://dx.doi.org/10.1038/s41598-021-81063-4 Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Boeckaerts, Dimitri
Stock, Michiel
Criel, Bjorn
Gerstmans, Hans
De Baets, Bernard
Briers, Yves
Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins
title Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins
title_full Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins
title_fullStr Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins
title_full_unstemmed Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins
title_short Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins
title_sort predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7809048/
https://www.ncbi.nlm.nih.gov/pubmed/33446856
http://dx.doi.org/10.1038/s41598-021-81063-4
work_keys_str_mv AT boeckaertsdimitri predictingbacteriophagehostsbasedonsequencesofannotatedreceptorbindingproteins
AT stockmichiel predictingbacteriophagehostsbasedonsequencesofannotatedreceptorbindingproteins
AT crielbjorn predictingbacteriophagehostsbasedonsequencesofannotatedreceptorbindingproteins
AT gerstmanshans predictingbacteriophagehostsbasedonsequencesofannotatedreceptorbindingproteins
AT debaetsbernard predictingbacteriophagehostsbasedonsequencesofannotatedreceptorbindingproteins
AT briersyves predictingbacteriophagehostsbasedonsequencesofannotatedreceptorbindingproteins