Cargando…

Sierra SARS-CoV-2 sequence and antiviral resistance analysis program

INTRODUCTION: Although most laboratories are capable of employing established protocols to perform full-genome SARS-CoV-2 sequencing, many are unable to assess sequence quality, select appropriate mutation-detection thresholds, or report on the potential clinical significance of mutations in the tar...

Descripción completa

Detalles Bibliográficos
Autores principales: Tzou, Philip L., Tao, Kaiming, Sahoo, Malaya K., Kosakovsky Pond, Sergei L., Pinsky, Benjamin A., Shafer, Robert W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Author(s). Published by Elsevier B.V. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9595491/
https://www.ncbi.nlm.nih.gov/pubmed/36334368
http://dx.doi.org/10.1016/j.jcv.2022.105323
_version_ 1784815663180677120
author Tzou, Philip L.
Tao, Kaiming
Sahoo, Malaya K.
Kosakovsky Pond, Sergei L.
Pinsky, Benjamin A.
Shafer, Robert W.
author_facet Tzou, Philip L.
Tao, Kaiming
Sahoo, Malaya K.
Kosakovsky Pond, Sergei L.
Pinsky, Benjamin A.
Shafer, Robert W.
author_sort Tzou, Philip L.
collection PubMed
description INTRODUCTION: Although most laboratories are capable of employing established protocols to perform full-genome SARS-CoV-2 sequencing, many are unable to assess sequence quality, select appropriate mutation-detection thresholds, or report on the potential clinical significance of mutations in the targets of antiviral therapy METHODS: We describe the technical aspects and benchmark the performance of Sierra SARS-CoV-2, a program designed to perform these functions on user-submitted FASTQ and FASTA sequence files and lists of Spike mutations. Sierra SARS-CoV-2 indicates which sequences contain an unexpectedly large number of unusual mutations and which mutations are associated with reduced susceptibility to clinical stage mAbs, the RdRP inhibitor remdesivir, or the Mpro inhibitor nirmatrelvir RESULTS: To assess the performance of Sierra SARS-CoV-2 on FASTQ files, we applied it to 600 representative FASTQ sequences and compared the results to the COVID-19 EDGE program. To assess its performance on FASTA files, we applied it to nearly one million representative FASTA sequences and compared the results to the GISAID mutation annotation. To assess its performance on mutations lists, we applied it to 13,578 distinct Spike RBD mutation patterns and showed that exactly or partially matching annotations were available for 88% of patterns CONCLUSION: Sierra SARS-CoV-2 leverages previously published data to improve the quality control of submitted viral genomic data and to provide functional annotation on the impact of mutations in the targets of antiviral SARS-CoV-2 therapy. The program can be found at https://covdb.stanford.edu/sierra/sars2/ and its source code at https://github.com/hivdb/sierra-sars2.
format Online
Article
Text
id pubmed-9595491
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Author(s). Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-95954912022-10-25 Sierra SARS-CoV-2 sequence and antiviral resistance analysis program Tzou, Philip L. Tao, Kaiming Sahoo, Malaya K. Kosakovsky Pond, Sergei L. Pinsky, Benjamin A. Shafer, Robert W. J Clin Virol Article INTRODUCTION: Although most laboratories are capable of employing established protocols to perform full-genome SARS-CoV-2 sequencing, many are unable to assess sequence quality, select appropriate mutation-detection thresholds, or report on the potential clinical significance of mutations in the targets of antiviral therapy METHODS: We describe the technical aspects and benchmark the performance of Sierra SARS-CoV-2, a program designed to perform these functions on user-submitted FASTQ and FASTA sequence files and lists of Spike mutations. Sierra SARS-CoV-2 indicates which sequences contain an unexpectedly large number of unusual mutations and which mutations are associated with reduced susceptibility to clinical stage mAbs, the RdRP inhibitor remdesivir, or the Mpro inhibitor nirmatrelvir RESULTS: To assess the performance of Sierra SARS-CoV-2 on FASTQ files, we applied it to 600 representative FASTQ sequences and compared the results to the COVID-19 EDGE program. To assess its performance on FASTA files, we applied it to nearly one million representative FASTA sequences and compared the results to the GISAID mutation annotation. To assess its performance on mutations lists, we applied it to 13,578 distinct Spike RBD mutation patterns and showed that exactly or partially matching annotations were available for 88% of patterns CONCLUSION: Sierra SARS-CoV-2 leverages previously published data to improve the quality control of submitted viral genomic data and to provide functional annotation on the impact of mutations in the targets of antiviral SARS-CoV-2 therapy. The program can be found at https://covdb.stanford.edu/sierra/sars2/ and its source code at https://github.com/hivdb/sierra-sars2. The Author(s). Published by Elsevier B.V. 2022-12 2022-10-22 /pmc/articles/PMC9595491/ /pubmed/36334368 http://dx.doi.org/10.1016/j.jcv.2022.105323 Text en © 2022 The Author(s). Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Tzou, Philip L.
Tao, Kaiming
Sahoo, Malaya K.
Kosakovsky Pond, Sergei L.
Pinsky, Benjamin A.
Shafer, Robert W.
Sierra SARS-CoV-2 sequence and antiviral resistance analysis program
title Sierra SARS-CoV-2 sequence and antiviral resistance analysis program
title_full Sierra SARS-CoV-2 sequence and antiviral resistance analysis program
title_fullStr Sierra SARS-CoV-2 sequence and antiviral resistance analysis program
title_full_unstemmed Sierra SARS-CoV-2 sequence and antiviral resistance analysis program
title_short Sierra SARS-CoV-2 sequence and antiviral resistance analysis program
title_sort sierra sars-cov-2 sequence and antiviral resistance analysis program
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9595491/
https://www.ncbi.nlm.nih.gov/pubmed/36334368
http://dx.doi.org/10.1016/j.jcv.2022.105323
work_keys_str_mv AT tzouphilipl sierrasarscov2sequenceandantiviralresistanceanalysisprogram
AT taokaiming sierrasarscov2sequenceandantiviralresistanceanalysisprogram
AT sahoomalayak sierrasarscov2sequenceandantiviralresistanceanalysisprogram
AT kosakovskypondsergeil sierrasarscov2sequenceandantiviralresistanceanalysisprogram
AT pinskybenjamina sierrasarscov2sequenceandantiviralresistanceanalysisprogram
AT shaferrobertw sierrasarscov2sequenceandantiviralresistanceanalysisprogram