AdapterRemoval v2: rapid adapter trimming, identification, and read merging
BACKGROUND: As high-throughput sequencing platforms produce longer and longer reads, sequences generated from short inserts, such as those obtained from fossil and degraded material, are increasingly expected to contain adapter sequences. Efficient adapter trimming algorithms are also needed to proc...
Main authors: | Schubert, Mikkel; Lindgreen, Stinus; Orlando, Ludovic |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2016 |
Subjects: | Technical Note |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751634/ https://www.ncbi.nlm.nih.gov/pubmed/26868221 http://dx.doi.org/10.1186/s13104-016-1900-2 |
_version_ | 1782415619394633728 |
---|---|
author | Schubert, Mikkel Lindgreen, Stinus Orlando, Ludovic |
author_facet | Schubert, Mikkel Lindgreen, Stinus Orlando, Ludovic |
author_sort | Schubert, Mikkel |
collection | PubMed |
description | BACKGROUND: As high-throughput sequencing platforms produce longer and longer reads, sequences generated from short inserts, such as those obtained from fossil and degraded material, are increasingly expected to contain adapter sequences. Efficient adapter trimming algorithms are also needed to process the growing amount of data generated per sequencing run. FINDINGS: We introduce AdapterRemoval v2, a major revision of AdapterRemoval v1, which introduces (i) striking improvements in throughput, through the use of single instruction, multiple data (SIMD; SSE1 and SSE2) instructions and multi-threading support, (ii) the ability to handle datasets containing reads or read-pairs with different adapters or adapter pairs, (iii) simultaneous demultiplexing and adapter trimming, (iv) the ability to reconstruct adapter sequences from paired-end reads for poorly documented data sets, and (v) native gzip and bzip2 support. CONCLUSIONS: We show that AdapterRemoval v2 compares favorably with existing tools, while offering superior throughput to most alternatives examined here, both for single and multi-threaded operations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-016-1900-2) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4751634 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-47516342016-02-13 AdapterRemoval v2: rapid adapter trimming, identification, and read merging Schubert, Mikkel Lindgreen, Stinus Orlando, Ludovic BMC Res Notes Technical Note BACKGROUND: As high-throughput sequencing platforms produce longer and longer reads, sequences generated from short inserts, such as those obtained from fossil and degraded material, are increasingly expected to contain adapter sequences. Efficient adapter trimming algorithms are also needed to process the growing amount of data generated per sequencing run. FINDINGS: We introduce AdapterRemoval v2, a major revision of AdapterRemoval v1, which introduces (i) striking improvements in throughput, through the use of single instruction, multiple data (SIMD; SSE1 and SSE2) instructions and multi-threading support, (ii) the ability to handle datasets containing reads or read-pairs with different adapters or adapter pairs, (iii) simultaneous demultiplexing and adapter trimming, (iv) the ability to reconstruct adapter sequences from paired-end reads for poorly documented data sets, and (v) native gzip and bzip2 support. CONCLUSIONS: We show that AdapterRemoval v2 compares favorably with existing tools, while offering superior throughput to most alternatives examined here, both for single and multi-threaded operations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-016-1900-2) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-12 /pmc/articles/PMC4751634/ /pubmed/26868221 http://dx.doi.org/10.1186/s13104-016-1900-2 Text en © Schubert et al. 
2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Note Schubert, Mikkel Lindgreen, Stinus Orlando, Ludovic AdapterRemoval v2: rapid adapter trimming, identification, and read merging |
title | AdapterRemoval v2: rapid adapter trimming, identification, and read merging |
title_full | AdapterRemoval v2: rapid adapter trimming, identification, and read merging |
title_fullStr | AdapterRemoval v2: rapid adapter trimming, identification, and read merging |
title_full_unstemmed | AdapterRemoval v2: rapid adapter trimming, identification, and read merging |
title_short | AdapterRemoval v2: rapid adapter trimming, identification, and read merging |
title_sort | adapterremoval v2: rapid adapter trimming, identification, and read merging |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751634/ https://www.ncbi.nlm.nih.gov/pubmed/26868221 http://dx.doi.org/10.1186/s13104-016-1900-2 |
work_keys_str_mv | AT schubertmikkel adapterremovalv2rapidadaptertrimmingidentificationandreadmerging AT lindgreenstinus adapterremovalv2rapidadaptertrimmingidentificationandreadmerging AT orlandoludovic adapterremovalv2rapidadaptertrimmingidentificationandreadmerging |