Cargando…

Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic

The SARS-CoV-2 pandemic has raised concerns in the identification of the hosts of the virus since the early stages of the outbreak. To address this problem, we proposed a deep learning method, DeepHoF, based on extracting viral genomic features automatically, to predict the host likelihood scores on...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Qian, Li, Mo, Wang, Chunhui, Guo, Jinyuan, Jiang, Xiaoqing, Tan, Jie, Wu, Shufang, Wang, Peihong, Xiao, Tingting, Zhou, Man, Fang, Zhencheng, Xiao, Yonghong, Zhu, Huaiqiu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8408148/
https://www.ncbi.nlm.nih.gov/pubmed/34465838
http://dx.doi.org/10.1038/s41598-021-96903-6
_version_ 1783746766783381504
author Guo, Qian
Li, Mo
Wang, Chunhui
Guo, Jinyuan
Jiang, Xiaoqing
Tan, Jie
Wu, Shufang
Wang, Peihong
Xiao, Tingting
Zhou, Man
Fang, Zhencheng
Xiao, Yonghong
Zhu, Huaiqiu
author_facet Guo, Qian
Li, Mo
Wang, Chunhui
Guo, Jinyuan
Jiang, Xiaoqing
Tan, Jie
Wu, Shufang
Wang, Peihong
Xiao, Tingting
Zhou, Man
Fang, Zhencheng
Xiao, Yonghong
Zhu, Huaiqiu
author_sort Guo, Qian
collection PubMed
description The SARS-CoV-2 pandemic has raised concerns in the identification of the hosts of the virus since the early stages of the outbreak. To address this problem, we proposed a deep learning method, DeepHoF, based on extracting viral genomic features automatically, to predict the host likelihood scores on five host types, including plant, germ, invertebrate, non-human vertebrate and human, for novel viruses. DeepHoF made up for the lack of an accurate tool, reaching a satisfactory AUC of 0.975 in the five-classification, and could make a reliable prediction for the novel viruses without close neighbors in phylogeny. Additionally, to fill the gap in the efficient inference of host species for SARS-CoV-2 using existing tools, we conducted a deep analysis on the host likelihood profile calculated by DeepHoF. Using the isolates sequenced in the earliest stage of the COVID-19 pandemic, we inferred that minks, bats, dogs and cats were potential hosts of SARS-CoV-2, while minks might be one of the most noteworthy hosts. Several genes of SARS-CoV-2 demonstrated their significance in determining the host range. Furthermore, a large-scale genome analysis, based on DeepHoF’s computation for the later pandemic in 2020, disclosed the uniformity of host range among SARS-CoV-2 samples and the strong association of SARS-CoV-2 between humans and minks.
format Online
Article
Text
id pubmed-8408148
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84081482021-09-01 Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic Guo, Qian Li, Mo Wang, Chunhui Guo, Jinyuan Jiang, Xiaoqing Tan, Jie Wu, Shufang Wang, Peihong Xiao, Tingting Zhou, Man Fang, Zhencheng Xiao, Yonghong Zhu, Huaiqiu Sci Rep Article The SARS-CoV-2 pandemic has raised concerns in the identification of the hosts of the virus since the early stages of the outbreak. To address this problem, we proposed a deep learning method, DeepHoF, based on extracting viral genomic features automatically, to predict the host likelihood scores on five host types, including plant, germ, invertebrate, non-human vertebrate and human, for novel viruses. DeepHoF made up for the lack of an accurate tool, reaching a satisfactory AUC of 0.975 in the five-classification, and could make a reliable prediction for the novel viruses without close neighbors in phylogeny. Additionally, to fill the gap in the efficient inference of host species for SARS-CoV-2 using existing tools, we conducted a deep analysis on the host likelihood profile calculated by DeepHoF. Using the isolates sequenced in the earliest stage of the COVID-19 pandemic, we inferred that minks, bats, dogs and cats were potential hosts of SARS-CoV-2, while minks might be one of the most noteworthy hosts. Several genes of SARS-CoV-2 demonstrated their significance in determining the host range. Furthermore, a large-scale genome analysis, based on DeepHoF’s computation for the later pandemic in 2020, disclosed the uniformity of host range among SARS-CoV-2 samples and the strong association of SARS-CoV-2 between humans and minks. Nature Publishing Group UK 2021-08-31 /pmc/articles/PMC8408148/ /pubmed/34465838 http://dx.doi.org/10.1038/s41598-021-96903-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Guo, Qian
Li, Mo
Wang, Chunhui
Guo, Jinyuan
Jiang, Xiaoqing
Tan, Jie
Wu, Shufang
Wang, Peihong
Xiao, Tingting
Zhou, Man
Fang, Zhencheng
Xiao, Yonghong
Zhu, Huaiqiu
Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic
title Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic
title_full Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic
title_fullStr Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic
title_full_unstemmed Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic
title_short Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic
title_sort predicting hosts based on early sars-cov-2 samples and analyzing the 2020 pandemic
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8408148/
https://www.ncbi.nlm.nih.gov/pubmed/34465838
http://dx.doi.org/10.1038/s41598-021-96903-6
work_keys_str_mv AT guoqian predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT limo predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT wangchunhui predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT guojinyuan predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT jiangxiaoqing predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT tanjie predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT wushufang predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT wangpeihong predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT xiaotingting predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT zhouman predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT fangzhencheng predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT xiaoyonghong predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic
AT zhuhuaiqiu predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic