Cargando…
Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations
IMPORTANCE: Earlier detection of emerging novel SARS-COV-2 variants is important for public health surveillance of potential viral threats and for earlier prevention research. Artificial intelligence may facilitate early detection of SARS-CoV2 emerging novel variants based on variant-specific mutati...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Medical Association
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945077/ https://www.ncbi.nlm.nih.gov/pubmed/36809468 http://dx.doi.org/10.1001/jamanetworkopen.2023.0191 |
_version_ | 1784892060735635456 |
---|---|
author | Zhao, Lue Ping Cohen, Seth Zhao, Michael Madeleine, Margaret Payne, Thomas H. Lybrand, Terry P. Geraghty, Daniel E. Jerome, Keith R. Corey, Lawrence |
author_facet | Zhao, Lue Ping Cohen, Seth Zhao, Michael Madeleine, Margaret Payne, Thomas H. Lybrand, Terry P. Geraghty, Daniel E. Jerome, Keith R. Corey, Lawrence |
author_sort | Zhao, Lue Ping |
collection | PubMed |
description | IMPORTANCE: Earlier detection of emerging novel SARS-COV-2 variants is important for public health surveillance of potential viral threats and for earlier prevention research. Artificial intelligence may facilitate early detection of SARS-CoV2 emerging novel variants based on variant-specific mutation haplotypes and, in turn, be associated with enhanced implementation of risk-stratified public health prevention strategies. OBJECTIVE: To develop a haplotype-based artificial intelligence (HAI) model for identifying novel variants, including mixture variants (MVs) of known variants and new variants with novel mutations. DESIGN, SETTING, AND PARTICIPANTS: This cross-sectional study used serially observed viral genomic sequences globally (prior to March 14, 2022) to train and validate the HAI model and used it to identify variants arising from a prospective set of viruses from March 15 to May 18, 2022. MAIN OUTCOMES AND MEASURES: Viral sequences, collection dates, and locations were subjected to statistical learning analysis to estimate variant-specific core mutations and haplotype frequencies, which were then used to construct an HAI model to identify novel variants. RESULTS: Through training on more than 5 million viral sequences, an HAI model was built, and its identification performance was validated on an independent validation set of more than 5 million viruses. Its identification performance was assessed on a prospective set of 344 901 viruses. In addition to achieving an accuracy of 92.8% (95% CI within 0.1%), the HAI model identified 4 Omicron MVs (Omicron-Alpha, Omicron-Delta, Omicron-Epsilon, and Omicron-Zeta), 2 Delta MVs (Delta-Kappa and Delta-Zeta), and 1 Alpha-Epsilon MV, among which Omicron-Epsilon MVs were most frequent (609/657 MVs [92.7%]). Furthermore, the HAI model found that 1699 Omicron viruses had unidentifiable variants given that these variants acquired novel mutations. Lastly, 524 variant-unassigned and variant-unidentifiable viruses carried 16 novel mutations, 8 of which were increasing in prevalence percentages as of May 2022. CONCLUSIONS AND RELEVANCE: In this cross-sectional study, an HAI model found SARS-COV-2 viruses with MV or novel mutations in the global population, which may require closer examination and monitoring. These results suggest that HAI may complement phylogenic variant assignment, providing additional insights into emerging novel variants in the population. |
format | Online Article Text |
id | pubmed-9945077 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Medical Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-99450772023-02-23 Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations Zhao, Lue Ping Cohen, Seth Zhao, Michael Madeleine, Margaret Payne, Thomas H. Lybrand, Terry P. Geraghty, Daniel E. Jerome, Keith R. Corey, Lawrence JAMA Netw Open Original Investigation IMPORTANCE: Earlier detection of emerging novel SARS-COV-2 variants is important for public health surveillance of potential viral threats and for earlier prevention research. Artificial intelligence may facilitate early detection of SARS-CoV2 emerging novel variants based on variant-specific mutation haplotypes and, in turn, be associated with enhanced implementation of risk-stratified public health prevention strategies. OBJECTIVE: To develop a haplotype-based artificial intelligence (HAI) model for identifying novel variants, including mixture variants (MVs) of known variants and new variants with novel mutations. DESIGN, SETTING, AND PARTICIPANTS: This cross-sectional study used serially observed viral genomic sequences globally (prior to March 14, 2022) to train and validate the HAI model and used it to identify variants arising from a prospective set of viruses from March 15 to May 18, 2022. MAIN OUTCOMES AND MEASURES: Viral sequences, collection dates, and locations were subjected to statistical learning analysis to estimate variant-specific core mutations and haplotype frequencies, which were then used to construct an HAI model to identify novel variants. RESULTS: Through training on more than 5 million viral sequences, an HAI model was built, and its identification performance was validated on an independent validation set of more than 5 million viruses. Its identification performance was assessed on a prospective set of 344 901 viruses. In addition to achieving an accuracy of 92.8% (95% CI within 0.1%), the HAI model identified 4 Omicron MVs (Omicron-Alpha, Omicron-Delta, Omicron-Epsilon, and Omicron-Zeta), 2 Delta MVs (Delta-Kappa and Delta-Zeta), and 1 Alpha-Epsilon MV, among which Omicron-Epsilon MVs were most frequent (609/657 MVs [92.7%]). Furthermore, the HAI model found that 1699 Omicron viruses had unidentifiable variants given that these variants acquired novel mutations. Lastly, 524 variant-unassigned and variant-unidentifiable viruses carried 16 novel mutations, 8 of which were increasing in prevalence percentages as of May 2022. CONCLUSIONS AND RELEVANCE: In this cross-sectional study, an HAI model found SARS-COV-2 viruses with MV or novel mutations in the global population, which may require closer examination and monitoring. These results suggest that HAI may complement phylogenic variant assignment, providing additional insights into emerging novel variants in the population. American Medical Association 2023-02-21 /pmc/articles/PMC9945077/ /pubmed/36809468 http://dx.doi.org/10.1001/jamanetworkopen.2023.0191 Text en Copyright 2023 Zhao LP et al. JAMA Network Open. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the CC-BY License. |
spellingShingle | Original Investigation Zhao, Lue Ping Cohen, Seth Zhao, Michael Madeleine, Margaret Payne, Thomas H. Lybrand, Terry P. Geraghty, Daniel E. Jerome, Keith R. Corey, Lawrence Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations |
title | Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations |
title_full | Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations |
title_fullStr | Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations |
title_full_unstemmed | Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations |
title_short | Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations |
title_sort | using haplotype-based artificial intelligence to evaluate sars-cov-2 novel variants and mutations |
topic | Original Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945077/ https://www.ncbi.nlm.nih.gov/pubmed/36809468 http://dx.doi.org/10.1001/jamanetworkopen.2023.0191 |
work_keys_str_mv | AT zhaolueping usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT cohenseth usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT zhaomichael usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT madeleinemargaret usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT paynethomash usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT lybrandterryp usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT geraghtydaniele usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT jeromekeithr usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations AT coreylawrence usinghaplotypebasedartificialintelligencetoevaluatesarscov2novelvariantsandmutations |