Cargando…

A network-based integrated framework for predicting virus–prokaryote interactions

Metagenomic sequencing has greatly enhanced the discovery of viral genomic sequences; however, it remains challenging to identify the host(s) of these new viruses. We developed VirHostMatcher-Net, a flexible, network-based, Markov random field framework for predicting virus–prokaryote interactions u...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Weili, Ren, Jie, Tang, Kujin, Dart, Emily, Ignacio-Espinoza, Julio Cesar, Fuhrman, Jed A, Braun, Jonathan, Sun, Fengzhu, Ahlgren, Nathan A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324143/
https://www.ncbi.nlm.nih.gov/pubmed/32626849
http://dx.doi.org/10.1093/nargab/lqaa044
_version_ 1783551889514692608
author Wang, Weili
Ren, Jie
Tang, Kujin
Dart, Emily
Ignacio-Espinoza, Julio Cesar
Fuhrman, Jed A
Braun, Jonathan
Sun, Fengzhu
Ahlgren, Nathan A
author_facet Wang, Weili
Ren, Jie
Tang, Kujin
Dart, Emily
Ignacio-Espinoza, Julio Cesar
Fuhrman, Jed A
Braun, Jonathan
Sun, Fengzhu
Ahlgren, Nathan A
author_sort Wang, Weili
collection PubMed
description Metagenomic sequencing has greatly enhanced the discovery of viral genomic sequences; however, it remains challenging to identify the host(s) of these new viruses. We developed VirHostMatcher-Net, a flexible, network-based, Markov random field framework for predicting virus–prokaryote interactions using multiple, integrated features: CRISPR sequences and alignment-free similarity measures ([Formula: see text] and WIsH). Evaluation of this method on a benchmark set of 1462 known virus–prokaryote pairs yielded host prediction accuracy of 59% and 86% at the genus and phylum levels, representing 16–27% and 6–10% improvement, respectively, over previous single-feature prediction approaches. We applied our host prediction tool to crAssphage, a human gut phage, and two metagenomic virus datasets: marine viruses and viral contigs recovered from globally distributed, diverse habitats. Host predictions were frequently consistent with those of previous studies, but more importantly, this new tool made many more confident predictions than previous tools, up to nearly 3-fold more (n > 27 000), greatly expanding the diversity of known virus–host interactions.
format Online
Article
Text
id pubmed-7324143
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73241432020-07-02 A network-based integrated framework for predicting virus–prokaryote interactions Wang, Weili Ren, Jie Tang, Kujin Dart, Emily Ignacio-Espinoza, Julio Cesar Fuhrman, Jed A Braun, Jonathan Sun, Fengzhu Ahlgren, Nathan A NAR Genom Bioinform Methods Article Metagenomic sequencing has greatly enhanced the discovery of viral genomic sequences; however, it remains challenging to identify the host(s) of these new viruses. We developed VirHostMatcher-Net, a flexible, network-based, Markov random field framework for predicting virus–prokaryote interactions using multiple, integrated features: CRISPR sequences and alignment-free similarity measures ([Formula: see text] and WIsH). Evaluation of this method on a benchmark set of 1462 known virus–prokaryote pairs yielded host prediction accuracy of 59% and 86% at the genus and phylum levels, representing 16–27% and 6–10% improvement, respectively, over previous single-feature prediction approaches. We applied our host prediction tool to crAssphage, a human gut phage, and two metagenomic virus datasets: marine viruses and viral contigs recovered from globally distributed, diverse habitats. Host predictions were frequently consistent with those of previous studies, but more importantly, this new tool made many more confident predictions than previous tools, up to nearly 3-fold more (n > 27 000), greatly expanding the diversity of known virus–host interactions. Oxford University Press 2020-06-23 /pmc/articles/PMC7324143/ /pubmed/32626849 http://dx.doi.org/10.1093/nargab/lqaa044 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Article
Wang, Weili
Ren, Jie
Tang, Kujin
Dart, Emily
Ignacio-Espinoza, Julio Cesar
Fuhrman, Jed A
Braun, Jonathan
Sun, Fengzhu
Ahlgren, Nathan A
A network-based integrated framework for predicting virus–prokaryote interactions
title A network-based integrated framework for predicting virus–prokaryote interactions
title_full A network-based integrated framework for predicting virus–prokaryote interactions
title_fullStr A network-based integrated framework for predicting virus–prokaryote interactions
title_full_unstemmed A network-based integrated framework for predicting virus–prokaryote interactions
title_short A network-based integrated framework for predicting virus–prokaryote interactions
title_sort network-based integrated framework for predicting virus–prokaryote interactions
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324143/
https://www.ncbi.nlm.nih.gov/pubmed/32626849
http://dx.doi.org/10.1093/nargab/lqaa044
work_keys_str_mv AT wangweili anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT renjie anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT tangkujin anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT dartemily anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT ignacioespinozajuliocesar anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT fuhrmanjeda anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT braunjonathan anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT sunfengzhu anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT ahlgrennathana anetworkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT wangweili networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT renjie networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT tangkujin networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT dartemily networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT ignacioespinozajuliocesar networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT fuhrmanjeda networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT braunjonathan networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT sunfengzhu networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions
AT ahlgrennathana networkbasedintegratedframeworkforpredictingvirusprokaryoteinteractions