Cargando…

ConDeTri - A Content Dependent Read Trimmer for Illumina Data

During the last few years, DNA and RNA sequencing have started to play an increasingly important role in biological and medical applications, especially due to the greater amount of sequencing data yielded from the new sequencing machines and the enormous decrease in sequencing costs. Particularly,...

Descripción completa

Detalles Bibliográficos
Autores principales: Smeds, Linnéa, Künstner, Axel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3198461/
https://www.ncbi.nlm.nih.gov/pubmed/22039460
http://dx.doi.org/10.1371/journal.pone.0026314
_version_ 1782214430169235456
author Smeds, Linnéa
Künstner, Axel
author_facet Smeds, Linnéa
Künstner, Axel
author_sort Smeds, Linnéa
collection PubMed
description During the last few years, DNA and RNA sequencing have started to play an increasingly important role in biological and medical applications, especially due to the greater amount of sequencing data yielded from the new sequencing machines and the enormous decrease in sequencing costs. Particularly, Illumina/Solexa sequencing has had an increasing impact on gathering data from model and non-model organisms. However, accurate and easy to use tools for quality filtering have not yet been established. We present ConDeTri, a method for content dependent read trimming for next generation sequencing data using quality scores of each individual base. The main focus of the method is to remove sequencing errors from reads so that sequencing reads can be standardized. Another aspect of the method is to incorporate read trimming in next-generation sequencing data processing and analysis pipelines. It can process single-end and paired-end sequence data of arbitrary length and it is independent from sequencing coverage and user interaction. ConDeTri is able to trim and remove reads with low quality scores to save computational time and memory usage during de novo assemblies. Low coverage or large genome sequencing projects will especially gain from trimming reads. The method can easily be incorporated into preprocessing and analysis pipelines for Illumina data. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://code.google.com/p/condetri.
format Online
Article
Text
id pubmed-3198461
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31984612011-10-28 ConDeTri - A Content Dependent Read Trimmer for Illumina Data Smeds, Linnéa Künstner, Axel PLoS One Research Article During the last few years, DNA and RNA sequencing have started to play an increasingly important role in biological and medical applications, especially due to the greater amount of sequencing data yielded from the new sequencing machines and the enormous decrease in sequencing costs. Particularly, Illumina/Solexa sequencing has had an increasing impact on gathering data from model and non-model organisms. However, accurate and easy to use tools for quality filtering have not yet been established. We present ConDeTri, a method for content dependent read trimming for next generation sequencing data using quality scores of each individual base. The main focus of the method is to remove sequencing errors from reads so that sequencing reads can be standardized. Another aspect of the method is to incorporate read trimming in next-generation sequencing data processing and analysis pipelines. It can process single-end and paired-end sequence data of arbitrary length and it is independent from sequencing coverage and user interaction. ConDeTri is able to trim and remove reads with low quality scores to save computational time and memory usage during de novo assemblies. Low coverage or large genome sequencing projects will especially gain from trimming reads. The method can easily be incorporated into preprocessing and analysis pipelines for Illumina data. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://code.google.com/p/condetri. Public Library of Science 2011-10-19 /pmc/articles/PMC3198461/ /pubmed/22039460 http://dx.doi.org/10.1371/journal.pone.0026314 Text en Smeds, Künstner. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Smeds, Linnéa
Künstner, Axel
ConDeTri - A Content Dependent Read Trimmer for Illumina Data
title ConDeTri - A Content Dependent Read Trimmer for Illumina Data
title_full ConDeTri - A Content Dependent Read Trimmer for Illumina Data
title_fullStr ConDeTri - A Content Dependent Read Trimmer for Illumina Data
title_full_unstemmed ConDeTri - A Content Dependent Read Trimmer for Illumina Data
title_short ConDeTri - A Content Dependent Read Trimmer for Illumina Data
title_sort condetri - a content dependent read trimmer for illumina data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3198461/
https://www.ncbi.nlm.nih.gov/pubmed/22039460
http://dx.doi.org/10.1371/journal.pone.0026314
work_keys_str_mv AT smedslinnea condetriacontentdependentreadtrimmerforilluminadata
AT kunstneraxel condetriacontentdependentreadtrimmerforilluminadata