Cargando…
Interpreting and de-noising genetically engineered barcodes in a DNA virus
The concept of a nucleic acid barcode applied to pathogen genomes is easy to grasp and the many possible uses are straightforward. But implementation may not be easy, especially when growing through multiple generations or assaying the pathogen long-term. The potential problems include: the barcode...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9725130/ https://www.ncbi.nlm.nih.gov/pubmed/36413582 http://dx.doi.org/10.1371/journal.pcbi.1010131 |
_version_ | 1784844512181354496 |
---|---|
author | Blois, Sylvain Goetz, Benjamin M. Bull, James J. Sullivan, Christopher S. |
author_facet | Blois, Sylvain Goetz, Benjamin M. Bull, James J. Sullivan, Christopher S. |
author_sort | Blois, Sylvain |
collection | PubMed |
description | The concept of a nucleic acid barcode applied to pathogen genomes is easy to grasp and the many possible uses are straightforward. But implementation may not be easy, especially when growing through multiple generations or assaying the pathogen long-term. The potential problems include: the barcode might alter fitness, the barcode may accumulate mutations, and construction of the marked pathogens may result in unintended barcodes that are not as designed. Here, we generate approximately 5,000 randomized barcodes in the genome of the prototypic small DNA virus murine polyomavirus. We describe the challenges faced with interpreting the barcode sequences obtained from the library. Our Illumina NextSeq sequencing recalled much greater variation in barcode sequencing reads than the expected 5,000 barcodes–necessarily stemming from the Illumina library processing and sequencing error. Using data from defined control virus genomes cloned into plasmid backbones we develop a vetted post-sequencing method to cluster the erroneous reads around the true virus genome barcodes. These findings may foreshadow problems with randomized barcodes in other microbial systems and provide a useful approach for future work utilizing nucleic acid barcoded pathogens. |
format | Online Article Text |
id | pubmed-9725130 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-97251302022-12-07 Interpreting and de-noising genetically engineered barcodes in a DNA virus Blois, Sylvain Goetz, Benjamin M. Bull, James J. Sullivan, Christopher S. PLoS Comput Biol Research Article The concept of a nucleic acid barcode applied to pathogen genomes is easy to grasp and the many possible uses are straightforward. But implementation may not be easy, especially when growing through multiple generations or assaying the pathogen long-term. The potential problems include: the barcode might alter fitness, the barcode may accumulate mutations, and construction of the marked pathogens may result in unintended barcodes that are not as designed. Here, we generate approximately 5,000 randomized barcodes in the genome of the prototypic small DNA virus murine polyomavirus. We describe the challenges faced with interpreting the barcode sequences obtained from the library. Our Illumina NextSeq sequencing recalled much greater variation in barcode sequencing reads than the expected 5,000 barcodes–necessarily stemming from the Illumina library processing and sequencing error. Using data from defined control virus genomes cloned into plasmid backbones we develop a vetted post-sequencing method to cluster the erroneous reads around the true virus genome barcodes. These findings may foreshadow problems with randomized barcodes in other microbial systems and provide a useful approach for future work utilizing nucleic acid barcoded pathogens. Public Library of Science 2022-11-22 /pmc/articles/PMC9725130/ /pubmed/36413582 http://dx.doi.org/10.1371/journal.pcbi.1010131 Text en © 2022 Blois et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Blois, Sylvain Goetz, Benjamin M. Bull, James J. Sullivan, Christopher S. Interpreting and de-noising genetically engineered barcodes in a DNA virus |
title | Interpreting and de-noising genetically engineered barcodes in a DNA virus |
title_full | Interpreting and de-noising genetically engineered barcodes in a DNA virus |
title_fullStr | Interpreting and de-noising genetically engineered barcodes in a DNA virus |
title_full_unstemmed | Interpreting and de-noising genetically engineered barcodes in a DNA virus |
title_short | Interpreting and de-noising genetically engineered barcodes in a DNA virus |
title_sort | interpreting and de-noising genetically engineered barcodes in a dna virus |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9725130/ https://www.ncbi.nlm.nih.gov/pubmed/36413582 http://dx.doi.org/10.1371/journal.pcbi.1010131 |
work_keys_str_mv | AT bloissylvain interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus AT goetzbenjaminm interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus AT bulljamesj interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus AT sullivanchristophers interpretinganddenoisinggeneticallyengineeredbarcodesinadnavirus |