Cargando…

Best Practices in Designing, Sequencing, and Identifying Random DNA Barcodes

Random DNA barcodes are a versatile tool for tracking cell lineages, with applications ranging from development to cancer to evolution. Here, we review and critically evaluate barcode designs as well as methods of barcode sequencing and initial processing of barcode data. We first demonstrate how va...

Descripción completa

Detalles Bibliográficos
Autores principales: Johnson, Milo S., Venkataram, Sandeep, Kryazhimskiy, Sergey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10276077/
https://www.ncbi.nlm.nih.gov/pubmed/36651964
http://dx.doi.org/10.1007/s00239-022-10083-z
Descripción
Sumario:Random DNA barcodes are a versatile tool for tracking cell lineages, with applications ranging from development to cancer to evolution. Here, we review and critically evaluate barcode designs as well as methods of barcode sequencing and initial processing of barcode data. We first demonstrate how various barcode design decisions affect data quality and propose a new design that balances all considerations that we are currently aware of. We then discuss various options for the preparation of barcode sequencing libraries, including inline indices and Unique Molecular Identifiers (UMIs). Finally, we test the performance of several established and new bioinformatic pipelines for the extraction of barcodes from raw sequencing reads and for error correction. We find that both alignment and regular expression-based approaches work well for barcode extraction, and that error-correction pipelines designed specifically for barcode data are superior to generic ones. Overall, this review will help researchers to approach their barcoding experiments in a deliberate and systematic way. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00239-022-10083-z.