Cargando…

Using machine learning to detect coronaviruses potentially infectious to humans

Establishing the host range for novel viruses remains a challenge. Here, we address the challenge of identifying non-human animal coronaviruses that may infect humans by creating an artificial neural network model that learns from spike protein sequences of alpha and beta coronaviruses and their bin...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonzalez-Isunza, Georgina, Jawaid, M. Zaki, Liu, Pengyu, Cox, Daniel L., Vazquez, Mariel, Arsuaga, Javier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10248971/
https://www.ncbi.nlm.nih.gov/pubmed/37291260
http://dx.doi.org/10.1038/s41598-023-35861-7
_version_ 1785055463335788544
author Gonzalez-Isunza, Georgina
Jawaid, M. Zaki
Liu, Pengyu
Cox, Daniel L.
Vazquez, Mariel
Arsuaga, Javier
author_facet Gonzalez-Isunza, Georgina
Jawaid, M. Zaki
Liu, Pengyu
Cox, Daniel L.
Vazquez, Mariel
Arsuaga, Javier
author_sort Gonzalez-Isunza, Georgina
collection PubMed
description Establishing the host range for novel viruses remains a challenge. Here, we address the challenge of identifying non-human animal coronaviruses that may infect humans by creating an artificial neural network model that learns from spike protein sequences of alpha and beta coronaviruses and their binding annotation to their host receptor. The proposed method produces a human-Binding Potential (h-BiP) score that distinguishes, with high accuracy, the binding potential among coronaviruses. Three viruses, previously unknown to bind human receptors, were identified: Bat coronavirus BtCoV/133/2005 and Pipistrellus abramus bat coronavirus HKU5-related (both MERS related viruses), and Rhinolophus affinis coronavirus isolate LYRa3 (a SARS related virus). We further analyze the binding properties of BtCoV/133/2005 and LYRa3 using molecular dynamics. To test whether this model can be used for surveillance of novel coronaviruses, we re-trained the model on a set that excludes SARS-CoV-2 and all viral sequences released after the SARS-CoV-2 was published. The results predict the binding of SARS-CoV-2 with a human receptor, indicating that machine learning methods are an excellent tool for the prediction of host expansion events.
format Online
Article
Text
id pubmed-10248971
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-102489712023-06-10 Using machine learning to detect coronaviruses potentially infectious to humans Gonzalez-Isunza, Georgina Jawaid, M. Zaki Liu, Pengyu Cox, Daniel L. Vazquez, Mariel Arsuaga, Javier Sci Rep Article Establishing the host range for novel viruses remains a challenge. Here, we address the challenge of identifying non-human animal coronaviruses that may infect humans by creating an artificial neural network model that learns from spike protein sequences of alpha and beta coronaviruses and their binding annotation to their host receptor. The proposed method produces a human-Binding Potential (h-BiP) score that distinguishes, with high accuracy, the binding potential among coronaviruses. Three viruses, previously unknown to bind human receptors, were identified: Bat coronavirus BtCoV/133/2005 and Pipistrellus abramus bat coronavirus HKU5-related (both MERS related viruses), and Rhinolophus affinis coronavirus isolate LYRa3 (a SARS related virus). We further analyze the binding properties of BtCoV/133/2005 and LYRa3 using molecular dynamics. To test whether this model can be used for surveillance of novel coronaviruses, we re-trained the model on a set that excludes SARS-CoV-2 and all viral sequences released after the SARS-CoV-2 was published. The results predict the binding of SARS-CoV-2 with a human receptor, indicating that machine learning methods are an excellent tool for the prediction of host expansion events. Nature Publishing Group UK 2023-06-08 /pmc/articles/PMC10248971/ /pubmed/37291260 http://dx.doi.org/10.1038/s41598-023-35861-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Gonzalez-Isunza, Georgina
Jawaid, M. Zaki
Liu, Pengyu
Cox, Daniel L.
Vazquez, Mariel
Arsuaga, Javier
Using machine learning to detect coronaviruses potentially infectious to humans
title Using machine learning to detect coronaviruses potentially infectious to humans
title_full Using machine learning to detect coronaviruses potentially infectious to humans
title_fullStr Using machine learning to detect coronaviruses potentially infectious to humans
title_full_unstemmed Using machine learning to detect coronaviruses potentially infectious to humans
title_short Using machine learning to detect coronaviruses potentially infectious to humans
title_sort using machine learning to detect coronaviruses potentially infectious to humans
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10248971/
https://www.ncbi.nlm.nih.gov/pubmed/37291260
http://dx.doi.org/10.1038/s41598-023-35861-7
work_keys_str_mv AT gonzalezisunzageorgina usingmachinelearningtodetectcoronavirusespotentiallyinfectioustohumans
AT jawaidmzaki usingmachinelearningtodetectcoronavirusespotentiallyinfectioustohumans
AT liupengyu usingmachinelearningtodetectcoronavirusespotentiallyinfectioustohumans
AT coxdaniell usingmachinelearningtodetectcoronavirusespotentiallyinfectioustohumans
AT vazquezmariel usingmachinelearningtodetectcoronavirusespotentiallyinfectioustohumans
AT arsuagajavier usingmachinelearningtodetectcoronavirusespotentiallyinfectioustohumans