Cargando…

Interpreting and de-noising genetically engineered barcodes in a DNA virus

The concept of a nucleic acid barcode applied to pathogen genomes is easy to grasp and the many possible uses are straightforward. But implementation may not be easy, especially when growing through multiple generations or assaying the pathogen long-term. The potential problems include: the barcode...

Descripción completa

Detalles Bibliográficos
Autores principales: Blois, Sylvain, Goetz, Benjamin M., Bull, James J., Sullivan, Christopher S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9725130/
https://www.ncbi.nlm.nih.gov/pubmed/36413582
http://dx.doi.org/10.1371/journal.pcbi.1010131
_version_ 1784844512181354496
author Blois, Sylvain
Goetz, Benjamin M.
Bull, James J.
Sullivan, Christopher S.
author_facet Blois, Sylvain
Goetz, Benjamin M.
Bull, James J.
Sullivan, Christopher S.
author_sort Blois, Sylvain
collection PubMed
description The concept of a nucleic acid barcode applied to pathogen genomes is easy to grasp and the many possible uses are straightforward. But implementation may not be easy, especially when growing through multiple generations or assaying the pathogen long-term. The potential problems include: the barcode might alter fitness, the barcode may accumulate mutations, and construction of the marked pathogens may result in unintended barcodes that are not as designed. Here, we generate approximately 5,000 randomized barcodes in the genome of the prototypic small DNA virus murine polyomavirus. We describe the challenges faced with interpreting the barcode sequences obtained from the library. Our Illumina NextSeq sequencing recalled much greater variation in barcode sequencing reads than the expected 5,000 barcodes–necessarily stemming from the Illumina library processing and sequencing error. Using data from defined control virus genomes cloned into plasmid backbones we develop a vetted post-sequencing method to cluster the erroneous reads around the true virus genome barcodes. These findings may foreshadow problems with randomized barcodes in other microbial systems and provide a useful approach for future work utilizing nucleic acid barcoded pathogens.
format Online
Article
Text
id pubmed-9725130
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-97251302022-12-07 Interpreting and de-noising genetically engineered barcodes in a DNA virus Blois, Sylvain Goetz, Benjamin M. Bull, James J. Sullivan, Christopher S. PLoS Comput Biol Research Article The concept of a nucleic acid barcode applied to pathogen genomes is easy to grasp and the many possible uses are straightforward. But implementation may not be easy, especially when growing through multiple generations or assaying the pathogen long-term. The potential problems include: the barcode might alter fitness, the barcode may accumulate mutations, and construction of the marked pathogens may result in unintended barcodes that are not as designed. Here, we generate approximately 5,000 randomized barcodes in the genome of the prototypic small DNA virus murine polyomavirus. We describe the challenges faced with interpreting the barcode sequences obtained from the library. Our Illumina NextSeq sequencing recalled much greater variation in barcode sequencing reads than the expected 5,000 barcodes–necessarily stemming from the Illumina library processing and sequencing error. Using data from defined control virus genomes cloned into plasmid backbones we develop a vetted post-sequencing method to cluster the erroneous reads around the true virus genome barcodes. These findings may foreshadow problems with randomized barcodes in other microbial systems and provide a useful approach for future work utilizing nucleic acid barcoded pathogens. Public Library of Science 2022-11-22 /pmc/articles/PMC9725130/ /pubmed/36413582 http://dx.doi.org/10.1371/journal.pcbi.1010131 Text en © 2022 Blois et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Blois, Sylvain
Goetz, Benjamin M.
Bull, James J.
Sullivan, Christopher S.
Interpreting and de-noising genetically engineered barcodes in a DNA virus
title Interpreting and de-noising genetically engineered barcodes in a DNA virus
title_full Interpreting and de-noising genetically engineered barcodes in a DNA virus
title_fullStr Interpreting and de-noising genetically engineered barcodes in a DNA virus
title_full_unstemmed Interpreting and de-noising genetically engineered barcodes in a DNA virus
title_short Interpreting and de-noising genetically engineered barcodes in a DNA virus
title_sort interpreting and de-noising genetically engineered barcodes in a dna virus
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9725130/
https://www.ncbi.nlm.nih.gov/pubmed/36413582
http://dx.doi.org/10.1371/journal.pcbi.1010131
work_keys_str_mv AT bloissylvain interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus
AT goetzbenjaminm interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus
AT bulljamesj interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus
AT sullivanchristophers interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus