Cargando…

Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter

Thousands of new phages have recently been discovered thanks to viral metagenomics. These phages are extremely diverse and their genome sequences often do not resemble any known phages. To appreciate their ecological impact, it is important to determine their bacterial hosts. CRISPR spacers can be u...

Descripción completa

Detalles Bibliográficos
Autores principales: Dion, Moïra B, Plante, Pier-Luc, Zufferey, Edwige, Shah, Shiraz A, Corbeil, Jacques, Moineau, Sylvain
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8034630/
https://www.ncbi.nlm.nih.gov/pubmed/33677572
http://dx.doi.org/10.1093/nar/gkab133
_version_ 1783676570678853632
author Dion, Moïra B
Plante, Pier-Luc
Zufferey, Edwige
Shah, Shiraz A
Corbeil, Jacques
Moineau, Sylvain
author_facet Dion, Moïra B
Plante, Pier-Luc
Zufferey, Edwige
Shah, Shiraz A
Corbeil, Jacques
Moineau, Sylvain
author_sort Dion, Moïra B
collection PubMed
description Thousands of new phages have recently been discovered thanks to viral metagenomics. These phages are extremely diverse and their genome sequences often do not resemble any known phages. To appreciate their ecological impact, it is important to determine their bacterial hosts. CRISPR spacers can be used to predict hosts of unknown phages, as spacers represent biological records of past phage–bacteria interactions. However, no guidelines have been established to standardize host prediction based on CRISPR spacers. Additionally, there are no tools that use spacers to perform host predictions on large viral datasets. Here, we developed a set of tools that includes all the necessary steps for predicting the hosts of uncharacterized phages. We created a database of >11 million spacers and a program to execute host predictions on large viral datasets. Our host prediction approach uses biological criteria inspired by how CRISPR–Cas naturally work as adaptive immune systems, which make the results easy to interpret. We evaluated the performance using 9484 phages with known hosts and obtained a recall of 49% and a precision of 69%. We also found that this host prediction method yielded higher performance for phages that infect gut-associated bacteria, suggesting it is well suited for gut-virome characterization.
format Online
Article
Text
id pubmed-8034630
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80346302021-04-14 Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter Dion, Moïra B Plante, Pier-Luc Zufferey, Edwige Shah, Shiraz A Corbeil, Jacques Moineau, Sylvain Nucleic Acids Res Computational Biology Thousands of new phages have recently been discovered thanks to viral metagenomics. These phages are extremely diverse and their genome sequences often do not resemble any known phages. To appreciate their ecological impact, it is important to determine their bacterial hosts. CRISPR spacers can be used to predict hosts of unknown phages, as spacers represent biological records of past phage–bacteria interactions. However, no guidelines have been established to standardize host prediction based on CRISPR spacers. Additionally, there are no tools that use spacers to perform host predictions on large viral datasets. Here, we developed a set of tools that includes all the necessary steps for predicting the hosts of uncharacterized phages. We created a database of >11 million spacers and a program to execute host predictions on large viral datasets. Our host prediction approach uses biological criteria inspired by how CRISPR–Cas naturally work as adaptive immune systems, which make the results easy to interpret. We evaluated the performance using 9484 phages with known hosts and obtained a recall of 49% and a precision of 69%. We also found that this host prediction method yielded higher performance for phages that infect gut-associated bacteria, suggesting it is well suited for gut-virome characterization. Oxford University Press 2021-03-02 /pmc/articles/PMC8034630/ /pubmed/33677572 http://dx.doi.org/10.1093/nar/gkab133 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Dion, Moïra B
Plante, Pier-Luc
Zufferey, Edwige
Shah, Shiraz A
Corbeil, Jacques
Moineau, Sylvain
Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter
title Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter
title_full Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter
title_fullStr Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter
title_full_unstemmed Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter
title_short Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter
title_sort streamlining crispr spacer-based bacterial host predictions to decipher the viral dark matter
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8034630/
https://www.ncbi.nlm.nih.gov/pubmed/33677572
http://dx.doi.org/10.1093/nar/gkab133
work_keys_str_mv AT dionmoirab streamliningcrisprspacerbasedbacterialhostpredictionstodeciphertheviraldarkmatter
AT plantepierluc streamliningcrisprspacerbasedbacterialhostpredictionstodeciphertheviraldarkmatter
AT zuffereyedwige streamliningcrisprspacerbasedbacterialhostpredictionstodeciphertheviraldarkmatter
AT shahshiraza streamliningcrisprspacerbasedbacterialhostpredictionstodeciphertheviraldarkmatter
AT corbeiljacques streamliningcrisprspacerbasedbacterialhostpredictionstodeciphertheviraldarkmatter
AT moineausylvain streamliningcrisprspacerbasedbacterialhostpredictionstodeciphertheviraldarkmatter