Cargando…
Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic
The SARS-CoV-2 pandemic has raised concerns in the identification of the hosts of the virus since the early stages of the outbreak. To address this problem, we proposed a deep learning method, DeepHoF, based on extracting viral genomic features automatically, to predict the host likelihood scores on...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8408148/ https://www.ncbi.nlm.nih.gov/pubmed/34465838 http://dx.doi.org/10.1038/s41598-021-96903-6 |
_version_ | 1783746766783381504 |
---|---|
author | Guo, Qian Li, Mo Wang, Chunhui Guo, Jinyuan Jiang, Xiaoqing Tan, Jie Wu, Shufang Wang, Peihong Xiao, Tingting Zhou, Man Fang, Zhencheng Xiao, Yonghong Zhu, Huaiqiu |
author_facet | Guo, Qian Li, Mo Wang, Chunhui Guo, Jinyuan Jiang, Xiaoqing Tan, Jie Wu, Shufang Wang, Peihong Xiao, Tingting Zhou, Man Fang, Zhencheng Xiao, Yonghong Zhu, Huaiqiu |
author_sort | Guo, Qian |
collection | PubMed |
description | The SARS-CoV-2 pandemic has raised concerns in the identification of the hosts of the virus since the early stages of the outbreak. To address this problem, we proposed a deep learning method, DeepHoF, based on extracting viral genomic features automatically, to predict the host likelihood scores on five host types, including plant, germ, invertebrate, non-human vertebrate and human, for novel viruses. DeepHoF made up for the lack of an accurate tool, reaching a satisfactory AUC of 0.975 in the five-classification, and could make a reliable prediction for the novel viruses without close neighbors in phylogeny. Additionally, to fill the gap in the efficient inference of host species for SARS-CoV-2 using existing tools, we conducted a deep analysis on the host likelihood profile calculated by DeepHoF. Using the isolates sequenced in the earliest stage of the COVID-19 pandemic, we inferred that minks, bats, dogs and cats were potential hosts of SARS-CoV-2, while minks might be one of the most noteworthy hosts. Several genes of SARS-CoV-2 demonstrated their significance in determining the host range. Furthermore, a large-scale genome analysis, based on DeepHoF’s computation for the later pandemic in 2020, disclosed the uniformity of host range among SARS-CoV-2 samples and the strong association of SARS-CoV-2 between humans and minks. |
format | Online Article Text |
id | pubmed-8408148 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-84081482021-09-01 Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic Guo, Qian Li, Mo Wang, Chunhui Guo, Jinyuan Jiang, Xiaoqing Tan, Jie Wu, Shufang Wang, Peihong Xiao, Tingting Zhou, Man Fang, Zhencheng Xiao, Yonghong Zhu, Huaiqiu Sci Rep Article The SARS-CoV-2 pandemic has raised concerns in the identification of the hosts of the virus since the early stages of the outbreak. To address this problem, we proposed a deep learning method, DeepHoF, based on extracting viral genomic features automatically, to predict the host likelihood scores on five host types, including plant, germ, invertebrate, non-human vertebrate and human, for novel viruses. DeepHoF made up for the lack of an accurate tool, reaching a satisfactory AUC of 0.975 in the five-classification, and could make a reliable prediction for the novel viruses without close neighbors in phylogeny. Additionally, to fill the gap in the efficient inference of host species for SARS-CoV-2 using existing tools, we conducted a deep analysis on the host likelihood profile calculated by DeepHoF. Using the isolates sequenced in the earliest stage of the COVID-19 pandemic, we inferred that minks, bats, dogs and cats were potential hosts of SARS-CoV-2, while minks might be one of the most noteworthy hosts. Several genes of SARS-CoV-2 demonstrated their significance in determining the host range. Furthermore, a large-scale genome analysis, based on DeepHoF’s computation for the later pandemic in 2020, disclosed the uniformity of host range among SARS-CoV-2 samples and the strong association of SARS-CoV-2 between humans and minks. Nature Publishing Group UK 2021-08-31 /pmc/articles/PMC8408148/ /pubmed/34465838 http://dx.doi.org/10.1038/s41598-021-96903-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Guo, Qian Li, Mo Wang, Chunhui Guo, Jinyuan Jiang, Xiaoqing Tan, Jie Wu, Shufang Wang, Peihong Xiao, Tingting Zhou, Man Fang, Zhencheng Xiao, Yonghong Zhu, Huaiqiu Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic |
title | Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic |
title_full | Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic |
title_fullStr | Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic |
title_full_unstemmed | Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic |
title_short | Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic |
title_sort | predicting hosts based on early sars-cov-2 samples and analyzing the 2020 pandemic |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8408148/ https://www.ncbi.nlm.nih.gov/pubmed/34465838 http://dx.doi.org/10.1038/s41598-021-96903-6 |
work_keys_str_mv | AT guoqian predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT limo predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT wangchunhui predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT guojinyuan predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT jiangxiaoqing predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT tanjie predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT wushufang predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT wangpeihong predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT xiaotingting predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT zhouman predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT fangzhencheng predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT xiaoyonghong predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic AT zhuhuaiqiu predictinghostsbasedonearlysarscov2samplesandanalyzingthe2020pandemic |