Cargando…

Flexible parsing and preprocessing of technical sequences with splitcode

Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment...

Descripción completa

Detalles Bibliográficos
Autores principales: Sullivan, Delaney K., Pachter, Lior
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055216/
https://www.ncbi.nlm.nih.gov/pubmed/36993532
http://dx.doi.org/10.1101/2023.03.20.533521
_version_ 1785015839900041216
author Sullivan, Delaney K.
Pachter, Lior
author_facet Sullivan, Delaney K.
Pachter, Lior
author_sort Sullivan, Delaney K.
collection PubMed
description Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment, they must be processed and analyzed. We present a tool called splitcode, that enables flexible and efficient preprocessing, parsing, and manipulation of sequencing reads. The splitcode program is free, open source, and available for download at http://github.com/pachterlab/splitcode. This versatile tool will facilitate simple, reproducible preprocessing of reads from libraries constructed for a large array of single-cell and bulk sequencing assays.
format Online
Article
Text
id pubmed-10055216
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100552162023-03-30 Flexible parsing and preprocessing of technical sequences with splitcode Sullivan, Delaney K. Pachter, Lior bioRxiv Article Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment, they must be processed and analyzed. We present a tool called splitcode, that enables flexible and efficient preprocessing, parsing, and manipulation of sequencing reads. The splitcode program is free, open source, and available for download at http://github.com/pachterlab/splitcode. This versatile tool will facilitate simple, reproducible preprocessing of reads from libraries constructed for a large array of single-cell and bulk sequencing assays. Cold Spring Harbor Laboratory 2023-03-31 /pmc/articles/PMC10055216/ /pubmed/36993532 http://dx.doi.org/10.1101/2023.03.20.533521 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Sullivan, Delaney K.
Pachter, Lior
Flexible parsing and preprocessing of technical sequences with splitcode
title Flexible parsing and preprocessing of technical sequences with splitcode
title_full Flexible parsing and preprocessing of technical sequences with splitcode
title_fullStr Flexible parsing and preprocessing of technical sequences with splitcode
title_full_unstemmed Flexible parsing and preprocessing of technical sequences with splitcode
title_short Flexible parsing and preprocessing of technical sequences with splitcode
title_sort flexible parsing and preprocessing of technical sequences with splitcode
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055216/
https://www.ncbi.nlm.nih.gov/pubmed/36993532
http://dx.doi.org/10.1101/2023.03.20.533521
work_keys_str_mv AT sullivandelaneyk flexibleparsingandpreprocessingoftechnicalsequenceswithsplitcode
AT pachterlior flexibleparsingandpreprocessingoftechnicalsequenceswithsplitcode