Cargando…
BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains
Bacteriophages are broadly classified into two distinct lifestyles: temperate and virulent. Temperate phages are capable of a latent phase of infection within a host cell (lysogenic cycle), whereas virulent phages directly replicate and lyse host cells upon infection (lytic cycle). Accurate lifestyl...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8106911/ https://www.ncbi.nlm.nih.gov/pubmed/33996289 http://dx.doi.org/10.7717/peerj.11396 |
_version_ | 1783689857065811968 |
---|---|
author | Hockenberry, Adam J. Wilke, Claus O. |
author_facet | Hockenberry, Adam J. Wilke, Claus O. |
author_sort | Hockenberry, Adam J. |
collection | PubMed |
description | Bacteriophages are broadly classified into two distinct lifestyles: temperate and virulent. Temperate phages are capable of a latent phase of infection within a host cell (lysogenic cycle), whereas virulent phages directly replicate and lyse host cells upon infection (lytic cycle). Accurate lifestyle identification is critical for determining the role of individual phage species within ecosystems and their effect on host evolution. Here, we present BACPHLIP, a BACterioPHage LIfestyle Predictor. BACPHLIP detects the presence of a set of conserved protein domains within an input genome and uses this data to predict lifestyle via a Random Forest classifier that was trained on a dataset of 634 phage genomes. On an independent test set of 423 phages, BACPHLIP has an accuracy of 98% greatly exceeding that of the previously existing tools (79%). BACPHLIP is freely available on GitHub (https://github.com/adamhockenberry/bacphlip) and the code used to build and test the classifier is provided in a separate repository (https://github.com/adamhockenberry/bacphlip-model-dev) for users wishing to interrogate and re-train the underlying classification model. |
format | Online Article Text |
id | pubmed-8106911 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81069112021-05-13 BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains Hockenberry, Adam J. Wilke, Claus O. PeerJ Bioinformatics Bacteriophages are broadly classified into two distinct lifestyles: temperate and virulent. Temperate phages are capable of a latent phase of infection within a host cell (lysogenic cycle), whereas virulent phages directly replicate and lyse host cells upon infection (lytic cycle). Accurate lifestyle identification is critical for determining the role of individual phage species within ecosystems and their effect on host evolution. Here, we present BACPHLIP, a BACterioPHage LIfestyle Predictor. BACPHLIP detects the presence of a set of conserved protein domains within an input genome and uses this data to predict lifestyle via a Random Forest classifier that was trained on a dataset of 634 phage genomes. On an independent test set of 423 phages, BACPHLIP has an accuracy of 98% greatly exceeding that of the previously existing tools (79%). BACPHLIP is freely available on GitHub (https://github.com/adamhockenberry/bacphlip) and the code used to build and test the classifier is provided in a separate repository (https://github.com/adamhockenberry/bacphlip-model-dev) for users wishing to interrogate and re-train the underlying classification model. PeerJ Inc. 2021-05-06 /pmc/articles/PMC8106911/ /pubmed/33996289 http://dx.doi.org/10.7717/peerj.11396 Text en ©2021 Hockenberry and Wilke https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Hockenberry, Adam J. Wilke, Claus O. BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains |
title | BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains |
title_full | BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains |
title_fullStr | BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains |
title_full_unstemmed | BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains |
title_short | BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains |
title_sort | bacphlip: predicting bacteriophage lifestyle from conserved protein domains |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8106911/ https://www.ncbi.nlm.nih.gov/pubmed/33996289 http://dx.doi.org/10.7717/peerj.11396 |
work_keys_str_mv | AT hockenberryadamj bacphlippredictingbacteriophagelifestylefromconservedproteindomains AT wilkeclauso bacphlippredictingbacteriophagelifestylefromconservedproteindomains |