Cargando…
Sierra SARS-CoV-2 sequence and antiviral resistance analysis program
INTRODUCTION: Although most laboratories are capable of employing established protocols to perform full-genome SARS-CoV-2 sequencing, many are unable to assess sequence quality, select appropriate mutation-detection thresholds, or report on the potential clinical significance of mutations in the tar...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Author(s). Published by Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9595491/ https://www.ncbi.nlm.nih.gov/pubmed/36334368 http://dx.doi.org/10.1016/j.jcv.2022.105323 |
_version_ | 1784815663180677120 |
---|---|
author | Tzou, Philip L. Tao, Kaiming Sahoo, Malaya K. Kosakovsky Pond, Sergei L. Pinsky, Benjamin A. Shafer, Robert W. |
author_facet | Tzou, Philip L. Tao, Kaiming Sahoo, Malaya K. Kosakovsky Pond, Sergei L. Pinsky, Benjamin A. Shafer, Robert W. |
author_sort | Tzou, Philip L. |
collection | PubMed |
description | INTRODUCTION: Although most laboratories are capable of employing established protocols to perform full-genome SARS-CoV-2 sequencing, many are unable to assess sequence quality, select appropriate mutation-detection thresholds, or report on the potential clinical significance of mutations in the targets of antiviral therapy METHODS: We describe the technical aspects and benchmark the performance of Sierra SARS-CoV-2, a program designed to perform these functions on user-submitted FASTQ and FASTA sequence files and lists of Spike mutations. Sierra SARS-CoV-2 indicates which sequences contain an unexpectedly large number of unusual mutations and which mutations are associated with reduced susceptibility to clinical stage mAbs, the RdRP inhibitor remdesivir, or the Mpro inhibitor nirmatrelvir RESULTS: To assess the performance of Sierra SARS-CoV-2 on FASTQ files, we applied it to 600 representative FASTQ sequences and compared the results to the COVID-19 EDGE program. To assess its performance on FASTA files, we applied it to nearly one million representative FASTA sequences and compared the results to the GISAID mutation annotation. To assess its performance on mutations lists, we applied it to 13,578 distinct Spike RBD mutation patterns and showed that exactly or partially matching annotations were available for 88% of patterns CONCLUSION: Sierra SARS-CoV-2 leverages previously published data to improve the quality control of submitted viral genomic data and to provide functional annotation on the impact of mutations in the targets of antiviral SARS-CoV-2 therapy. The program can be found at https://covdb.stanford.edu/sierra/sars2/ and its source code at https://github.com/hivdb/sierra-sars2. |
format | Online Article Text |
id | pubmed-9595491 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Author(s). Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95954912022-10-25 Sierra SARS-CoV-2 sequence and antiviral resistance analysis program Tzou, Philip L. Tao, Kaiming Sahoo, Malaya K. Kosakovsky Pond, Sergei L. Pinsky, Benjamin A. Shafer, Robert W. J Clin Virol Article INTRODUCTION: Although most laboratories are capable of employing established protocols to perform full-genome SARS-CoV-2 sequencing, many are unable to assess sequence quality, select appropriate mutation-detection thresholds, or report on the potential clinical significance of mutations in the targets of antiviral therapy METHODS: We describe the technical aspects and benchmark the performance of Sierra SARS-CoV-2, a program designed to perform these functions on user-submitted FASTQ and FASTA sequence files and lists of Spike mutations. Sierra SARS-CoV-2 indicates which sequences contain an unexpectedly large number of unusual mutations and which mutations are associated with reduced susceptibility to clinical stage mAbs, the RdRP inhibitor remdesivir, or the Mpro inhibitor nirmatrelvir RESULTS: To assess the performance of Sierra SARS-CoV-2 on FASTQ files, we applied it to 600 representative FASTQ sequences and compared the results to the COVID-19 EDGE program. To assess its performance on FASTA files, we applied it to nearly one million representative FASTA sequences and compared the results to the GISAID mutation annotation. To assess its performance on mutations lists, we applied it to 13,578 distinct Spike RBD mutation patterns and showed that exactly or partially matching annotations were available for 88% of patterns CONCLUSION: Sierra SARS-CoV-2 leverages previously published data to improve the quality control of submitted viral genomic data and to provide functional annotation on the impact of mutations in the targets of antiviral SARS-CoV-2 therapy. The program can be found at https://covdb.stanford.edu/sierra/sars2/ and its source code at https://github.com/hivdb/sierra-sars2. The Author(s). Published by Elsevier B.V. 2022-12 2022-10-22 /pmc/articles/PMC9595491/ /pubmed/36334368 http://dx.doi.org/10.1016/j.jcv.2022.105323 Text en © 2022 The Author(s). Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Tzou, Philip L. Tao, Kaiming Sahoo, Malaya K. Kosakovsky Pond, Sergei L. Pinsky, Benjamin A. Shafer, Robert W. Sierra SARS-CoV-2 sequence and antiviral resistance analysis program |
title | Sierra SARS-CoV-2 sequence and antiviral resistance analysis program |
title_full | Sierra SARS-CoV-2 sequence and antiviral resistance analysis program |
title_fullStr | Sierra SARS-CoV-2 sequence and antiviral resistance analysis program |
title_full_unstemmed | Sierra SARS-CoV-2 sequence and antiviral resistance analysis program |
title_short | Sierra SARS-CoV-2 sequence and antiviral resistance analysis program |
title_sort | sierra sars-cov-2 sequence and antiviral resistance analysis program |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9595491/ https://www.ncbi.nlm.nih.gov/pubmed/36334368 http://dx.doi.org/10.1016/j.jcv.2022.105323 |
work_keys_str_mv | AT tzouphilipl sierrasarscov2sequenceandantiviralresistanceanalysisprogram AT taokaiming sierrasarscov2sequenceandantiviralresistanceanalysisprogram AT sahoomalayak sierrasarscov2sequenceandantiviralresistanceanalysisprogram AT kosakovskypondsergeil sierrasarscov2sequenceandantiviralresistanceanalysisprogram AT pinskybenjamina sierrasarscov2sequenceandantiviralresistanceanalysisprogram AT shaferrobertw sierrasarscov2sequenceandantiviralresistanceanalysisprogram |