Cargando…

Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens

Tiled amplicon sequencing has served as an essential tool for tracking the spread and evolution of pathogens. Over 2 million complete SARS-CoV-2 genomes are now publicly available, most sequenced and assembled via tiled amplicon sequencing. While computational tools for tiled amplicon design exist,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Michael X., Lou, Esther G., Sapoval, Nicolae, Kim, Eddie, Kalvapalle, Prashant, Kille, Bryce, Elworth, R. A. Leo, Liu, Yunxi, Fu, Yilei, Stadler, Lauren B., Treangen, Todd J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9948974/
https://www.ncbi.nlm.nih.gov/pubmed/36824759
http://dx.doi.org/10.1101/2023.02.11.528155
_version_ 1784892886857285632
author Wang, Michael X.
Lou, Esther G.
Sapoval, Nicolae
Kim, Eddie
Kalvapalle, Prashant
Kille, Bryce
Elworth, R. A. Leo
Liu, Yunxi
Fu, Yilei
Stadler, Lauren B.
Treangen, Todd J.
author_facet Wang, Michael X.
Lou, Esther G.
Sapoval, Nicolae
Kim, Eddie
Kalvapalle, Prashant
Kille, Bryce
Elworth, R. A. Leo
Liu, Yunxi
Fu, Yilei
Stadler, Lauren B.
Treangen, Todd J.
author_sort Wang, Michael X.
collection PubMed
description Tiled amplicon sequencing has served as an essential tool for tracking the spread and evolution of pathogens. Over 2 million complete SARS-CoV-2 genomes are now publicly available, most sequenced and assembled via tiled amplicon sequencing. While computational tools for tiled amplicon design exist, they require downstream manual optimization both computationally and experimentally, which is slow and costly. Here we present Olivar, a first step towards a fully automated, variant-aware design of tiled amplicons for pathogen genomes. Olivar converts each nucleotide of the target genome into a numeric risk score, capturing undesired sequence features that should be avoided. In a direct comparison with PrimalScheme, we show that Olivar has fewer SNPs overlapping with primers and predicted PCR byproducts. We also compared Olivar head-to-head with ARTIC v4.1, the most widely used primer set for SARS-CoV-2 sequencing, and show Olivar yields similar read mapping rates (~90%) and better coverage to the manually designed ARTIC v4.1 amplicons. We also evaluated Olivar on real wastewater samples and found that Olivar had up to 3-fold higher mapping rates while retaining similar coverage. In summary, Olivar automates and accelerates the generation of tiled amplicons, even in situations of high mutation frequency and/or density. Olivar is available as a web application at https://olivar.rice.edu. Olivar can also be installed locally as a command line tool with Bioconda. Source code, installation guide and usage are available at https://github.com/treangenlab/Olivar.
format Online
Article
Text
id pubmed-9948974
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-99489742023-02-24 Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens Wang, Michael X. Lou, Esther G. Sapoval, Nicolae Kim, Eddie Kalvapalle, Prashant Kille, Bryce Elworth, R. A. Leo Liu, Yunxi Fu, Yilei Stadler, Lauren B. Treangen, Todd J. bioRxiv Article Tiled amplicon sequencing has served as an essential tool for tracking the spread and evolution of pathogens. Over 2 million complete SARS-CoV-2 genomes are now publicly available, most sequenced and assembled via tiled amplicon sequencing. While computational tools for tiled amplicon design exist, they require downstream manual optimization both computationally and experimentally, which is slow and costly. Here we present Olivar, a first step towards a fully automated, variant-aware design of tiled amplicons for pathogen genomes. Olivar converts each nucleotide of the target genome into a numeric risk score, capturing undesired sequence features that should be avoided. In a direct comparison with PrimalScheme, we show that Olivar has fewer SNPs overlapping with primers and predicted PCR byproducts. We also compared Olivar head-to-head with ARTIC v4.1, the most widely used primer set for SARS-CoV-2 sequencing, and show Olivar yields similar read mapping rates (~90%) and better coverage to the manually designed ARTIC v4.1 amplicons. We also evaluated Olivar on real wastewater samples and found that Olivar had up to 3-fold higher mapping rates while retaining similar coverage. In summary, Olivar automates and accelerates the generation of tiled amplicons, even in situations of high mutation frequency and/or density. Olivar is available as a web application at https://olivar.rice.edu. Olivar can also be installed locally as a command line tool with Bioconda. Source code, installation guide and usage are available at https://github.com/treangenlab/Olivar. Cold Spring Harbor Laboratory 2023-09-30 /pmc/articles/PMC9948974/ /pubmed/36824759 http://dx.doi.org/10.1101/2023.02.11.528155 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Wang, Michael X.
Lou, Esther G.
Sapoval, Nicolae
Kim, Eddie
Kalvapalle, Prashant
Kille, Bryce
Elworth, R. A. Leo
Liu, Yunxi
Fu, Yilei
Stadler, Lauren B.
Treangen, Todd J.
Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens
title Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens
title_full Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens
title_fullStr Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens
title_full_unstemmed Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens
title_short Olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens
title_sort olivar: automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9948974/
https://www.ncbi.nlm.nih.gov/pubmed/36824759
http://dx.doi.org/10.1101/2023.02.11.528155
work_keys_str_mv AT wangmichaelx olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT louestherg olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT sapovalnicolae olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT kimeddie olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT kalvapalleprashant olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT killebryce olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT elworthraleo olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT liuyunxi olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT fuyilei olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT stadlerlaurenb olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens
AT treangentoddj olivarautomatedvariantawareprimerdesignformultiplextiledampliconsequencingofpathogens