Cargando…

Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm

OBJECTIVES: The objective of this study is the implementation of an automatic procedure to weekly detect new SARS-CoV-2 variants and non-neutral variants (variants of concern (VOC) and variants of interest (VOI)). METHODS: We downloaded spike protein primary sequences from the public resource GISAID...

Descripción completa

Detalles Bibliográficos
Autores principales: Nicora, Giovanna, Salemi, Marco, Marini, Simone, Bellazzi, Riccardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9742845/
https://www.ncbi.nlm.nih.gov/pubmed/36593658
http://dx.doi.org/10.1136/bmjhci-2022-100643
_version_ 1784848607022678016
author Nicora, Giovanna
Salemi, Marco
Marini, Simone
Bellazzi, Riccardo
author_facet Nicora, Giovanna
Salemi, Marco
Marini, Simone
Bellazzi, Riccardo
author_sort Nicora, Giovanna
collection PubMed
description OBJECTIVES: The objective of this study is the implementation of an automatic procedure to weekly detect new SARS-CoV-2 variants and non-neutral variants (variants of concern (VOC) and variants of interest (VOI)). METHODS: We downloaded spike protein primary sequences from the public resource GISAID and we represented each sequence as k-mer counts. For each week since 1 July 2020, we evaluate if each sequence represents an anomaly based on a One Class support vector machine (SVM) classification algorithm trained on neutral protein sequences collected from February to June 2020. RESULTS: We assess the ability of the One Class classifier to detect known VOC and VOI, such as Alpha, Delta or Omicron, ahead of their official classification by health authorities. In median, the classifier predicts a non-neutral variant as outlier 10 weeks before the official date of designation as VOC/VOI. DISCUSSION: The identification of non-neutral variants during a pandemic usually relies on indicators available during time, such as changing population size of a variant. Automatic variant surveillance systems based on protein sequences can enhance the fast identification of variants of potential concern. CONCLUSION: Machine learning, and in particular One Class SVM classification, can support the detection of potentially VOC/VOI variants during an evolving pandemics.
format Online
Article
Text
id pubmed-9742845
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-97428452022-12-13 Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm Nicora, Giovanna Salemi, Marco Marini, Simone Bellazzi, Riccardo BMJ Health Care Inform Original Research OBJECTIVES: The objective of this study is the implementation of an automatic procedure to weekly detect new SARS-CoV-2 variants and non-neutral variants (variants of concern (VOC) and variants of interest (VOI)). METHODS: We downloaded spike protein primary sequences from the public resource GISAID and we represented each sequence as k-mer counts. For each week since 1 July 2020, we evaluate if each sequence represents an anomaly based on a One Class support vector machine (SVM) classification algorithm trained on neutral protein sequences collected from February to June 2020. RESULTS: We assess the ability of the One Class classifier to detect known VOC and VOI, such as Alpha, Delta or Omicron, ahead of their official classification by health authorities. In median, the classifier predicts a non-neutral variant as outlier 10 weeks before the official date of designation as VOC/VOI. DISCUSSION: The identification of non-neutral variants during a pandemic usually relies on indicators available during time, such as changing population size of a variant. Automatic variant surveillance systems based on protein sequences can enhance the fast identification of variants of potential concern. CONCLUSION: Machine learning, and in particular One Class SVM classification, can support the detection of potentially VOC/VOI variants during an evolving pandemics. BMJ Publishing Group 2022-12-09 /pmc/articles/PMC9742845/ /pubmed/36593658 http://dx.doi.org/10.1136/bmjhci-2022-100643 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Original Research
Nicora, Giovanna
Salemi, Marco
Marini, Simone
Bellazzi, Riccardo
Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm
title Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm
title_full Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm
title_fullStr Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm
title_full_unstemmed Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm
title_short Predicting emerging SARS-CoV-2 variants of concern through a One Class dynamic anomaly detection algorithm
title_sort predicting emerging sars-cov-2 variants of concern through a one class dynamic anomaly detection algorithm
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9742845/
https://www.ncbi.nlm.nih.gov/pubmed/36593658
http://dx.doi.org/10.1136/bmjhci-2022-100643
work_keys_str_mv AT nicoragiovanna predictingemergingsarscov2variantsofconcernthroughaoneclassdynamicanomalydetectionalgorithm
AT salemimarco predictingemergingsarscov2variantsofconcernthroughaoneclassdynamicanomalydetectionalgorithm
AT marinisimone predictingemergingsarscov2variantsofconcernthroughaoneclassdynamicanomalydetectionalgorithm
AT bellazziriccardo predictingemergingsarscov2variantsofconcernthroughaoneclassdynamicanomalydetectionalgorithm