Cargando…

DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach

BACKGROUND: Prokaryotic viruses referred to as phages can be divided into virulent and temperate phages. Distinguishing virulent and temperate phage–derived sequences in metavirome data is important for elucidating their different roles in interactions with bacterial hosts and regulation of microbia...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Shufang, Fang, Zhencheng, Tan, Jie, Li, Mo, Wang, Chunhui, Guo, Qian, Xu, Congmin, Jiang, Xiaoqing, Zhu, Huaiqiu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8427542/
https://www.ncbi.nlm.nih.gov/pubmed/34498685
http://dx.doi.org/10.1093/gigascience/giab056
_version_ 1783750194998804480
author Wu, Shufang
Fang, Zhencheng
Tan, Jie
Li, Mo
Wang, Chunhui
Guo, Qian
Xu, Congmin
Jiang, Xiaoqing
Zhu, Huaiqiu
author_facet Wu, Shufang
Fang, Zhencheng
Tan, Jie
Li, Mo
Wang, Chunhui
Guo, Qian
Xu, Congmin
Jiang, Xiaoqing
Zhu, Huaiqiu
author_sort Wu, Shufang
collection PubMed
description BACKGROUND: Prokaryotic viruses referred to as phages can be divided into virulent and temperate phages. Distinguishing virulent and temperate phage–derived sequences in metavirome data is important for elucidating their different roles in interactions with bacterial hosts and regulation of microbial communities. However, there is no experimental or computational approach to effectively classify their sequences in culture-independent metavirome. We present a new computational method, DeePhage, which can directly and rapidly judge each read or contig as a virulent or temperate phage–derived fragment. FINDINGS: DeePhage uses a “one-hot” encoding form to represent DNA sequences in detail. Sequence signatures are detected via a convolutional neural network to obtain valuable local features. The accuracy of DeePhage on 5-fold cross-validation reaches as high as 89%, nearly 10% and 30% higher than that of 2 similar tools, PhagePred and PHACTS. On real metavirome, DeePhage correctly predicts the highest proportion of contigs when using BLAST as annotation, without apparent preferences. Besides, DeePhage reduces running time vs PhagePred and PHACTS by 245 and 810 times, respectively, under the same computational configuration. By direct detection of the temperate viral fragments from metagenome and metavirome, we furthermore propose a new strategy to explore phage transformations in the microbial community. The ability to detect such transformations provides us a new insight into the potential treatment for human disease. CONCLUSIONS: DeePhage is a novel tool developed to rapidly and efficiently identify 2 kinds of phage fragments especially for metagenomics analysis. DeePhage is freely available via http://cqb.pku.edu.cn/ZhuLab/DeePhage or https://github.com/shufangwu/DeePhage.
format Online
Article
Text
id pubmed-8427542
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84275422021-09-09 DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach Wu, Shufang Fang, Zhencheng Tan, Jie Li, Mo Wang, Chunhui Guo, Qian Xu, Congmin Jiang, Xiaoqing Zhu, Huaiqiu Gigascience Research BACKGROUND: Prokaryotic viruses referred to as phages can be divided into virulent and temperate phages. Distinguishing virulent and temperate phage–derived sequences in metavirome data is important for elucidating their different roles in interactions with bacterial hosts and regulation of microbial communities. However, there is no experimental or computational approach to effectively classify their sequences in culture-independent metavirome. We present a new computational method, DeePhage, which can directly and rapidly judge each read or contig as a virulent or temperate phage–derived fragment. FINDINGS: DeePhage uses a “one-hot” encoding form to represent DNA sequences in detail. Sequence signatures are detected via a convolutional neural network to obtain valuable local features. The accuracy of DeePhage on 5-fold cross-validation reaches as high as 89%, nearly 10% and 30% higher than that of 2 similar tools, PhagePred and PHACTS. On real metavirome, DeePhage correctly predicts the highest proportion of contigs when using BLAST as annotation, without apparent preferences. Besides, DeePhage reduces running time vs PhagePred and PHACTS by 245 and 810 times, respectively, under the same computational configuration. By direct detection of the temperate viral fragments from metagenome and metavirome, we furthermore propose a new strategy to explore phage transformations in the microbial community. The ability to detect such transformations provides us a new insight into the potential treatment for human disease. CONCLUSIONS: DeePhage is a novel tool developed to rapidly and efficiently identify 2 kinds of phage fragments especially for metagenomics analysis. DeePhage is freely available via http://cqb.pku.edu.cn/ZhuLab/DeePhage or https://github.com/shufangwu/DeePhage. Oxford University Press 2021-09-08 /pmc/articles/PMC8427542/ /pubmed/34498685 http://dx.doi.org/10.1093/gigascience/giab056 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Wu, Shufang
Fang, Zhencheng
Tan, Jie
Li, Mo
Wang, Chunhui
Guo, Qian
Xu, Congmin
Jiang, Xiaoqing
Zhu, Huaiqiu
DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach
title DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach
title_full DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach
title_fullStr DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach
title_full_unstemmed DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach
title_short DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach
title_sort deephage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8427542/
https://www.ncbi.nlm.nih.gov/pubmed/34498685
http://dx.doi.org/10.1093/gigascience/giab056
work_keys_str_mv AT wushufang deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT fangzhencheng deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT tanjie deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT limo deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT wangchunhui deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT guoqian deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT xucongmin deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT jiangxiaoqing deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach
AT zhuhuaiqiu deephagedistinguishingvirulentandtemperatephagederivedsequencesinmetaviromedatawithadeeplearningapproach