Cargando…

Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance

BACKGROUND: Precise public health and clinical interventions for the COVID-19 pandemic has spurred a global rush on SARS-CoV-2 variant tracking, but current approaches to variant tracking are challenged by the flood of viral genome sequences leading to a loss of timeliness, accuracy, and reliability...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Qiang, Qiu, Huining, Bible, Paul W., Huang, Yong, Zheng, Fangfang, Gu, Jing, Sun, Jian, Hao, Yuantao, Liu, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9901361/
https://www.ncbi.nlm.nih.gov/pubmed/36755900
http://dx.doi.org/10.3389/fpubh.2023.1015969
_version_ 1784883014930530304
author Huang, Qiang
Qiu, Huining
Bible, Paul W.
Huang, Yong
Zheng, Fangfang
Gu, Jing
Sun, Jian
Hao, Yuantao
Liu, Yu
author_facet Huang, Qiang
Qiu, Huining
Bible, Paul W.
Huang, Yong
Zheng, Fangfang
Gu, Jing
Sun, Jian
Hao, Yuantao
Liu, Yu
author_sort Huang, Qiang
collection PubMed
description BACKGROUND: Precise public health and clinical interventions for the COVID-19 pandemic has spurred a global rush on SARS-CoV-2 variant tracking, but current approaches to variant tracking are challenged by the flood of viral genome sequences leading to a loss of timeliness, accuracy, and reliability. Here, we devised a new co-mutation network framework, aiming to tackle these difficulties in variant surveillance. METHODS: To avoid simultaneous input and modeling of the whole large-scale data, we dynamically investigate the nucleotide covarying pattern of weekly sequences. The community detection algorithm is applied to a co-occurring genomic alteration network constructed from mutation corpora of weekly collected data. Co-mutation communities are identified, extracted, and characterized as variant markers. They contribute to the creation and weekly updates of a community-based variant dictionary tree representing SARS-CoV-2 evolution, where highly similar ones between weeks have been merged to represent the same variants. Emerging communities imply the presence of novel viral variants or new branches of existing variants. This process was benchmarked with worldwide GISAID data and validated using national level data from six COVID-19 hotspot countries. RESULTS: A total of 235 co-mutation communities were identified after a 120 weeks' investigation of worldwide sequence data, from March 2020 to mid-June 2022. The dictionary tree progressively developed from these communities perfectly recorded the time course of SARS-CoV-2 branching, coinciding with GISAID clades. The time-varying prevalence of these communities in the viral population showed a good match with the emergence and circulation of the variants they represented. All these benchmark results not only exhibited the methodology features but also demonstrated high efficiency in detection of the pandemic variants. When it was applied to regional variant surveillance, our method displayed significantly earlier identification of feature communities of major WHO-named SARS-CoV-2 variants in contrast with Pangolin's monitoring. CONCLUSION: An efficient genomic surveillance framework built from weekly co-mutation networks and a dynamic community-based variant dictionary tree enables early detection and continuous investigation of SARS-CoV-2 variants overcoming genomic data flood, aiding in the response to the COVID-19 pandemic.
format Online
Article
Text
id pubmed-9901361
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99013612023-02-07 Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance Huang, Qiang Qiu, Huining Bible, Paul W. Huang, Yong Zheng, Fangfang Gu, Jing Sun, Jian Hao, Yuantao Liu, Yu Front Public Health Public Health BACKGROUND: Precise public health and clinical interventions for the COVID-19 pandemic has spurred a global rush on SARS-CoV-2 variant tracking, but current approaches to variant tracking are challenged by the flood of viral genome sequences leading to a loss of timeliness, accuracy, and reliability. Here, we devised a new co-mutation network framework, aiming to tackle these difficulties in variant surveillance. METHODS: To avoid simultaneous input and modeling of the whole large-scale data, we dynamically investigate the nucleotide covarying pattern of weekly sequences. The community detection algorithm is applied to a co-occurring genomic alteration network constructed from mutation corpora of weekly collected data. Co-mutation communities are identified, extracted, and characterized as variant markers. They contribute to the creation and weekly updates of a community-based variant dictionary tree representing SARS-CoV-2 evolution, where highly similar ones between weeks have been merged to represent the same variants. Emerging communities imply the presence of novel viral variants or new branches of existing variants. This process was benchmarked with worldwide GISAID data and validated using national level data from six COVID-19 hotspot countries. RESULTS: A total of 235 co-mutation communities were identified after a 120 weeks' investigation of worldwide sequence data, from March 2020 to mid-June 2022. The dictionary tree progressively developed from these communities perfectly recorded the time course of SARS-CoV-2 branching, coinciding with GISAID clades. The time-varying prevalence of these communities in the viral population showed a good match with the emergence and circulation of the variants they represented. All these benchmark results not only exhibited the methodology features but also demonstrated high efficiency in detection of the pandemic variants. When it was applied to regional variant surveillance, our method displayed significantly earlier identification of feature communities of major WHO-named SARS-CoV-2 variants in contrast with Pangolin's monitoring. CONCLUSION: An efficient genomic surveillance framework built from weekly co-mutation networks and a dynamic community-based variant dictionary tree enables early detection and continuous investigation of SARS-CoV-2 variants overcoming genomic data flood, aiding in the response to the COVID-19 pandemic. Frontiers Media S.A. 2023-01-23 /pmc/articles/PMC9901361/ /pubmed/36755900 http://dx.doi.org/10.3389/fpubh.2023.1015969 Text en Copyright © 2023 Huang, Qiu, Bible, Huang, Zheng, Gu, Sun, Hao and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Huang, Qiang
Qiu, Huining
Bible, Paul W.
Huang, Yong
Zheng, Fangfang
Gu, Jing
Sun, Jian
Hao, Yuantao
Liu, Yu
Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance
title Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance
title_full Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance
title_fullStr Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance
title_full_unstemmed Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance
title_short Early detection of SARS-CoV-2 variants through dynamic co-mutation network surveillance
title_sort early detection of sars-cov-2 variants through dynamic co-mutation network surveillance
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9901361/
https://www.ncbi.nlm.nih.gov/pubmed/36755900
http://dx.doi.org/10.3389/fpubh.2023.1015969
work_keys_str_mv AT huangqiang earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT qiuhuining earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT biblepaulw earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT huangyong earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT zhengfangfang earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT gujing earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT sunjian earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT haoyuantao earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance
AT liuyu earlydetectionofsarscov2variantsthroughdynamiccomutationnetworksurveillance