Cargando…

Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021

INTRODUCTION: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a recently emergent coronavirus of natural origin and caused the coronavirus disease (COVID-19) pandemic. The study of its natural origin and host range is of particular importance for source tracing, monitoring of this vi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Yue, Hu, Yu, Xia, Binbin, Du, Pei, Wu, Lili, Liang, Mifang, Chen, Qian, Yan, Huan, Gao, George F., Wang, Qihui, Wang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8598542/
https://www.ncbi.nlm.nih.gov/pubmed/34804629
http://dx.doi.org/10.46234/ccdcw2021.235
_version_ 1784600825892438016
author Ma, Yue
Hu, Yu
Xia, Binbin
Du, Pei
Wu, Lili
Liang, Mifang
Chen, Qian
Yan, Huan
Gao, George F.
Wang, Qihui
Wang, Jun
author_facet Ma, Yue
Hu, Yu
Xia, Binbin
Du, Pei
Wu, Lili
Liang, Mifang
Chen, Qian
Yan, Huan
Gao, George F.
Wang, Qihui
Wang, Jun
author_sort Ma, Yue
collection PubMed
description INTRODUCTION: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a recently emergent coronavirus of natural origin and caused the coronavirus disease (COVID-19) pandemic. The study of its natural origin and host range is of particular importance for source tracing, monitoring of this virus, and prevention of recurrent infections. One major approach is to test the binding ability of the viral receptor gene ACE2 from various hosts to SARS-CoV-2 spike protein, but it is time-consuming and labor-intensive to cover a large collection of species. METHODS: In this paper, we applied state-of-the-art machine learning approaches and created a pipeline reaching >87% accuracy in predicting binding between different ACE2 and SARS-CoV-2 spike. RESULTS: We further validated our prediction pipeline using 2 independent test sets involving >50 bat species and achieved >78% accuracy. A large-scale screening of 204 mammal species revealed 144 species (or 61%) were susceptible to SARS-CoV-2 infections, highlighting the importance of intensive monitoring and studies in mammalian species. DISCUSSION: In short, our study employed machine learning models to create an important tool for predicting potential hosts of SARS-CoV-2 and achieved the highest precision to our knowledge in experimental validation. This study also predicted that a wide range of mammals were capable of being infected by SARS-CoV-2.
format Online
Article
Text
id pubmed-8598542
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention
record_format MEDLINE/PubMed
spelling pubmed-85985422021-11-19 Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021 Ma, Yue Hu, Yu Xia, Binbin Du, Pei Wu, Lili Liang, Mifang Chen, Qian Yan, Huan Gao, George F. Wang, Qihui Wang, Jun China CDC Wkly Methods and Applications INTRODUCTION: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a recently emergent coronavirus of natural origin and caused the coronavirus disease (COVID-19) pandemic. The study of its natural origin and host range is of particular importance for source tracing, monitoring of this virus, and prevention of recurrent infections. One major approach is to test the binding ability of the viral receptor gene ACE2 from various hosts to SARS-CoV-2 spike protein, but it is time-consuming and labor-intensive to cover a large collection of species. METHODS: In this paper, we applied state-of-the-art machine learning approaches and created a pipeline reaching >87% accuracy in predicting binding between different ACE2 and SARS-CoV-2 spike. RESULTS: We further validated our prediction pipeline using 2 independent test sets involving >50 bat species and achieved >78% accuracy. A large-scale screening of 204 mammal species revealed 144 species (or 61%) were susceptible to SARS-CoV-2 infections, highlighting the importance of intensive monitoring and studies in mammalian species. DISCUSSION: In short, our study employed machine learning models to create an important tool for predicting potential hosts of SARS-CoV-2 and achieved the highest precision to our knowledge in experimental validation. This study also predicted that a wide range of mammals were capable of being infected by SARS-CoV-2. Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention 2021-11-12 /pmc/articles/PMC8598542/ /pubmed/34804629 http://dx.doi.org/10.46234/ccdcw2021.235 Text en Copyright and License information: Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention 2021 https://creativecommons.org/licenses/by-nc-sa/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/ (https://creativecommons.org/licenses/by-nc-sa/4.0/)
spellingShingle Methods and Applications
Ma, Yue
Hu, Yu
Xia, Binbin
Du, Pei
Wu, Lili
Liang, Mifang
Chen, Qian
Yan, Huan
Gao, George F.
Wang, Qihui
Wang, Jun
Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021
title Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021
title_full Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021
title_fullStr Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021
title_full_unstemmed Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021
title_short Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species — Worldwide, 2021
title_sort machine learning approach effectively predicts binding between sars-cov-2 spike and ace2 across mammalian species — worldwide, 2021
topic Methods and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8598542/
https://www.ncbi.nlm.nih.gov/pubmed/34804629
http://dx.doi.org/10.46234/ccdcw2021.235
work_keys_str_mv AT mayue machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT huyu machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT xiabinbin machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT dupei machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT wulili machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT liangmifang machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT chenqian machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT yanhuan machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT gaogeorgef machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT wangqihui machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021
AT wangjun machinelearningapproacheffectivelypredictsbindingbetweensarscov2spikeandace2acrossmammalianspeciesworldwide2021