Cargando…

BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains

Bacteriophages are broadly classified into two distinct lifestyles: temperate and virulent. Temperate phages are capable of a latent phase of infection within a host cell (lysogenic cycle), whereas virulent phages directly replicate and lyse host cells upon infection (lytic cycle). Accurate lifestyl...

Descripción completa

Detalles Bibliográficos
Autores principales: Hockenberry, Adam J., Wilke, Claus O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8106911/
https://www.ncbi.nlm.nih.gov/pubmed/33996289
http://dx.doi.org/10.7717/peerj.11396
_version_ 1783689857065811968
author Hockenberry, Adam J.
Wilke, Claus O.
author_facet Hockenberry, Adam J.
Wilke, Claus O.
author_sort Hockenberry, Adam J.
collection PubMed
description Bacteriophages are broadly classified into two distinct lifestyles: temperate and virulent. Temperate phages are capable of a latent phase of infection within a host cell (lysogenic cycle), whereas virulent phages directly replicate and lyse host cells upon infection (lytic cycle). Accurate lifestyle identification is critical for determining the role of individual phage species within ecosystems and their effect on host evolution. Here, we present BACPHLIP, a BACterioPHage LIfestyle Predictor. BACPHLIP detects the presence of a set of conserved protein domains within an input genome and uses this data to predict lifestyle via a Random Forest classifier that was trained on a dataset of 634 phage genomes. On an independent test set of 423 phages, BACPHLIP has an accuracy of 98% greatly exceeding that of the previously existing tools (79%). BACPHLIP is freely available on GitHub (https://github.com/adamhockenberry/bacphlip) and the code used to build and test the classifier is provided in a separate repository (https://github.com/adamhockenberry/bacphlip-model-dev) for users wishing to interrogate and re-train the underlying classification model.
format Online
Article
Text
id pubmed-8106911
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-81069112021-05-13 BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains Hockenberry, Adam J. Wilke, Claus O. PeerJ Bioinformatics Bacteriophages are broadly classified into two distinct lifestyles: temperate and virulent. Temperate phages are capable of a latent phase of infection within a host cell (lysogenic cycle), whereas virulent phages directly replicate and lyse host cells upon infection (lytic cycle). Accurate lifestyle identification is critical for determining the role of individual phage species within ecosystems and their effect on host evolution. Here, we present BACPHLIP, a BACterioPHage LIfestyle Predictor. BACPHLIP detects the presence of a set of conserved protein domains within an input genome and uses this data to predict lifestyle via a Random Forest classifier that was trained on a dataset of 634 phage genomes. On an independent test set of 423 phages, BACPHLIP has an accuracy of 98% greatly exceeding that of the previously existing tools (79%). BACPHLIP is freely available on GitHub (https://github.com/adamhockenberry/bacphlip) and the code used to build and test the classifier is provided in a separate repository (https://github.com/adamhockenberry/bacphlip-model-dev) for users wishing to interrogate and re-train the underlying classification model. PeerJ Inc. 2021-05-06 /pmc/articles/PMC8106911/ /pubmed/33996289 http://dx.doi.org/10.7717/peerj.11396 Text en ©2021 Hockenberry and Wilke https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Hockenberry, Adam J.
Wilke, Claus O.
BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains
title BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains
title_full BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains
title_fullStr BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains
title_full_unstemmed BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains
title_short BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains
title_sort bacphlip: predicting bacteriophage lifestyle from conserved protein domains
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8106911/
https://www.ncbi.nlm.nih.gov/pubmed/33996289
http://dx.doi.org/10.7717/peerj.11396
work_keys_str_mv AT hockenberryadamj bacphlippredictingbacteriophagelifestylefromconservedproteindomains
AT wilkeclauso bacphlippredictingbacteriophagelifestylefromconservedproteindomains