AdapterRemoval: easy cleaning of next-generation sequencing reads

BACKGROUND: With the advent of next-generation sequencing there is an increased demand for tools to pre-process and handle the vast amounts of data generated. One recurring problem is adapter contamination in the reads, i.e. the partial or complete sequencing of adapter sequences. These adapter sequ...

Bibliographic Details
Main author: Lindgreen, Stinus
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3532080/
https://www.ncbi.nlm.nih.gov/pubmed/22748135
http://dx.doi.org/10.1186/1756-0500-5-337
_version_ 1782254244174233600
author Lindgreen, Stinus
author_facet Lindgreen, Stinus
author_sort Lindgreen, Stinus
collection PubMed
description BACKGROUND: With the advent of next-generation sequencing there is an increased demand for tools to pre-process and handle the vast amounts of data generated. One recurring problem is adapter contamination in the reads, i.e. the partial or complete sequencing of adapter sequences. These adapter sequences have to be removed as they can hinder correct mapping of the reads and influence SNP calling and other downstream analyses. FINDINGS: We present a tool called AdapterRemoval which is able to pre-process both single and paired-end data. The program locates and removes adapter residues from the reads, it is able to combine paired reads if they overlap, and it can optionally trim low-quality nucleotides. Furthermore, it can look for adapter sequence in both the 5’ and 3’ ends of the reads. This is a flexible tool that can be tuned to accommodate different experimental settings and sequencing platforms producing FASTQ files. AdapterRemoval is shown to be good at trimming adapters from both single-end and paired-end data. CONCLUSIONS: AdapterRemoval is a comprehensive tool for analyzing next-generation sequencing data. It exhibits good performance both in terms of sensitivity and specificity. AdapterRemoval has already been used in various large projects and it is possible to extend it further to accommodate application-specific biases in the data.
format Online
Article
Text
id pubmed-3532080
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35320802013-01-03 AdapterRemoval: easy cleaning of next-generation sequencing reads Lindgreen, Stinus BMC Res Notes Technical Note BACKGROUND: With the advent of next-generation sequencing there is an increased demand for tools to pre-process and handle the vast amounts of data generated. One recurring problem is adapter contamination in the reads, i.e. the partial or complete sequencing of adapter sequences. These adapter sequences have to be removed as they can hinder correct mapping of the reads and influence SNP calling and other downstream analyses. FINDINGS: We present a tool called AdapterRemoval which is able to pre-process both single and paired-end data. The program locates and removes adapter residues from the reads, it is able to combine paired reads if they overlap, and it can optionally trim low-quality nucleotides. Furthermore, it can look for adapter sequence in both the 5’ and 3’ ends of the reads. This is a flexible tool that can be tuned to accommodate different experimental settings and sequencing platforms producing FASTQ files. AdapterRemoval is shown to be good at trimming adapters from both single-end and paired-end data. CONCLUSIONS: AdapterRemoval is a comprehensive tool for analyzing next-generation sequencing data. It exhibits good performance both in terms of sensitivity and specificity. AdapterRemoval has already been used in various large projects and it is possible to extend it further to accommodate application-specific biases in the data. BioMed Central 2012-07-02 /pmc/articles/PMC3532080/ /pubmed/22748135 http://dx.doi.org/10.1186/1756-0500-5-337 Text en Copyright ©2012 Lindgreen; licensee BioMed Central Ltd. 
http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Lindgreen, Stinus
AdapterRemoval: easy cleaning of next-generation sequencing reads
title AdapterRemoval: easy cleaning of next-generation sequencing reads
title_full AdapterRemoval: easy cleaning of next-generation sequencing reads
title_fullStr AdapterRemoval: easy cleaning of next-generation sequencing reads
title_full_unstemmed AdapterRemoval: easy cleaning of next-generation sequencing reads
title_short AdapterRemoval: easy cleaning of next-generation sequencing reads
title_sort adapterremoval: easy cleaning of next-generation sequencing reads
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3532080/
https://www.ncbi.nlm.nih.gov/pubmed/22748135
http://dx.doi.org/10.1186/1756-0500-5-337
work_keys_str_mv AT lindgreenstinus adapterremovaleasycleaningofnextgenerationsequencingreads