Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language

Next-generation sequencing technologies create large, multiplexed DNA sequences that require preprocessing before any further analysis. Part of this preprocessing includes demultiplexing and trimming sequences. Although there are many existing tools that can handle these preprocessing steps, they ca...

Bibliographic Details
Main author: Liu, Daniel
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2019
Subjects: Bioinformatics
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6589082/
https://www.ncbi.nlm.nih.gov/pubmed/31249738
http://dx.doi.org/10.7717/peerj.7170
_version_ 1783429339119878144
author Liu, Daniel
author_facet Liu, Daniel
author_sort Liu, Daniel
collection PubMed
description Next-generation sequencing technologies create large, multiplexed DNA sequences that require preprocessing before any further analysis. Part of this preprocessing includes demultiplexing and trimming sequences. Although there are many existing tools that can handle these preprocessing steps, they cannot be easily extended to new sequence schematics when new pipelines are developed. We present Fuzzysplit, a tool that relies on a simple declarative language to describe the schematics of sequences, which makes it incredibly adaptable to different use cases. In this paper, we explain the matching algorithms behind Fuzzysplit and we provide a preliminary comparison of its performance with other well-established tools. Overall, we find that its matching accuracy is comparable to previous tools.
format Online
Article
Text
id pubmed-6589082
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-65890822019-06-27 Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language Liu, Daniel PeerJ Bioinformatics Next-generation sequencing technologies create large, multiplexed DNA sequences that require preprocessing before any further analysis. Part of this preprocessing includes demultiplexing and trimming sequences. Although there are many existing tools that can handle these preprocessing steps, they cannot be easily extended to new sequence schematics when new pipelines are developed. We present Fuzzysplit, a tool that relies on a simple declarative language to describe the schematics of sequences, which makes it incredibly adaptable to different use cases. In this paper, we explain the matching algorithms behind Fuzzysplit and we provide a preliminary comparison of its performance with other well-established tools. Overall, we find that its matching accuracy is comparable to previous tools. PeerJ Inc. 2019-06-19 /pmc/articles/PMC6589082/ /pubmed/31249738 http://dx.doi.org/10.7717/peerj.7170 Text en © 2019 Liu http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Liu, Daniel
Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language
title Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language
title_full Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language
title_fullStr Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language
title_full_unstemmed Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language
title_short Fuzzysplit: demultiplexing and trimming sequenced DNA with a declarative language
title_sort fuzzysplit: demultiplexing and trimming sequenced dna with a declarative language
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6589082/
https://www.ncbi.nlm.nih.gov/pubmed/31249738
http://dx.doi.org/10.7717/peerj.7170
work_keys_str_mv AT liudaniel fuzzysplitdemultiplexingandtrimmingsequenceddnawithadeclarativelanguage