Cargando…

AdapterRemoval v2: rapid adapter trimming, identification, and read merging

BACKGROUND: As high-throughput sequencing platforms produce longer and longer reads, sequences generated from short inserts, such as those obtained from fossil and degraded material, are increasingly expected to contain adapter sequences. Efficient adapter trimming algorithms are also needed to proc...

Descripción completa

Detalles Bibliográficos
Autores principales: Schubert, Mikkel, Lindgreen, Stinus, Orlando, Ludovic
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751634/
https://www.ncbi.nlm.nih.gov/pubmed/26868221
http://dx.doi.org/10.1186/s13104-016-1900-2
_version_ 1782415619394633728
author Schubert, Mikkel
Lindgreen, Stinus
Orlando, Ludovic
author_facet Schubert, Mikkel
Lindgreen, Stinus
Orlando, Ludovic
author_sort Schubert, Mikkel
collection PubMed
description BACKGROUND: As high-throughput sequencing platforms produce longer and longer reads, sequences generated from short inserts, such as those obtained from fossil and degraded material, are increasingly expected to contain adapter sequences. Efficient adapter trimming algorithms are also needed to process the growing amount of data generated per sequencing run. FINDINGS: We introduce AdapterRemoval v2, a major revision of AdapterRemoval v1, which introduces (i) striking improvements in throughput, through the use of single instruction, multiple data (SIMD; SSE1 and SSE2) instructions and multi-threading support, (ii) the ability to handle datasets containing reads or read-pairs with different adapters or adapter pairs, (iii) simultaneous demultiplexing and adapter trimming, (iv) the ability to reconstruct adapter sequences from paired-end reads for poorly documented data sets, and (v) native gzip and bzip2 support. CONCLUSIONS: We show that AdapterRemoval v2 compares favorably with existing tools, while offering superior throughput to most alternatives examined here, both for single and multi-threaded operations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-016-1900-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4751634
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47516342016-02-13 AdapterRemoval v2: rapid adapter trimming, identification, and read merging Schubert, Mikkel Lindgreen, Stinus Orlando, Ludovic BMC Res Notes Technical Note BACKGROUND: As high-throughput sequencing platforms produce longer and longer reads, sequences generated from short inserts, such as those obtained from fossil and degraded material, are increasingly expected to contain adapter sequences. Efficient adapter trimming algorithms are also needed to process the growing amount of data generated per sequencing run. FINDINGS: We introduce AdapterRemoval v2, a major revision of AdapterRemoval v1, which introduces (i) striking improvements in throughput, through the use of single instruction, multiple data (SIMD; SSE1 and SSE2) instructions and multi-threading support, (ii) the ability to handle datasets containing reads or read-pairs with different adapters or adapter pairs, (iii) simultaneous demultiplexing and adapter trimming, (iv) the ability to reconstruct adapter sequences from paired-end reads for poorly documented data sets, and (v) native gzip and bzip2 support. CONCLUSIONS: We show that AdapterRemoval v2 compares favorably with existing tools, while offering superior throughput to most alternatives examined here, both for single and multi-threaded operations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-016-1900-2) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-12 /pmc/articles/PMC4751634/ /pubmed/26868221 http://dx.doi.org/10.1186/s13104-016-1900-2 Text en © Schubert et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Note
Schubert, Mikkel
Lindgreen, Stinus
Orlando, Ludovic
AdapterRemoval v2: rapid adapter trimming, identification, and read merging
title AdapterRemoval v2: rapid adapter trimming, identification, and read merging
title_full AdapterRemoval v2: rapid adapter trimming, identification, and read merging
title_fullStr AdapterRemoval v2: rapid adapter trimming, identification, and read merging
title_full_unstemmed AdapterRemoval v2: rapid adapter trimming, identification, and read merging
title_short AdapterRemoval v2: rapid adapter trimming, identification, and read merging
title_sort adapterremoval v2: rapid adapter trimming, identification, and read merging
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751634/
https://www.ncbi.nlm.nih.gov/pubmed/26868221
http://dx.doi.org/10.1186/s13104-016-1900-2
work_keys_str_mv AT schubertmikkel adapterremovalv2rapidadaptertrimmingidentificationandreadmerging
AT lindgreenstinus adapterremovalv2rapidadaptertrimmingidentificationandreadmerging
AT orlandoludovic adapterremovalv2rapidadaptertrimmingidentificationandreadmerging