ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing

BACKGROUND: With the advent of next-generation sequencing (NGS) technologies, full cDNA shotgun sequencing has become a major approach in the study of transcriptomes, and several different protocols in 454 sequencing have been invented. As each protocol uses its own short DNA tags or adapters attach...

Bibliographic Details
Main Authors: Tae, Hongseok, Ryu, Dongsung, Sureshchandra, Suhas, Choi, Jeong-Hyeon
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3630001/
https://www.ncbi.nlm.nih.gov/pubmed/23009593
http://dx.doi.org/10.1186/1471-2105-13-247
_version_ 1782266643335872512
author Tae, Hongseok
Ryu, Dongsung
Sureshchandra, Suhas
Choi, Jeong-Hyeon
author_facet Tae, Hongseok
Ryu, Dongsung
Sureshchandra, Suhas
Choi, Jeong-Hyeon
author_sort Tae, Hongseok
collection PubMed
description BACKGROUND: With the advent of next-generation sequencing (NGS) technologies, full cDNA shotgun sequencing has become a major approach in the study of transcriptomes, and several different protocols in 454 sequencing have been invented. As each protocol uses its own short DNA tags or adapters attached to the ends of cDNA fragments for labeling or sequencing, different contaminants may lead to mis-assembly and inaccurate sequence products. RESULTS: We have designed and implemented a new program for raw sequence cleaning in a graphical user interface and a batch script. The cleaning process consists of several modules including barcode trimming, sequencing adapter trimming, amplification primer trimming, poly-A tail trimming, vector screening and low quality region trimming. These modules can be combined based on various sequencing applications. CONCLUSIONS: ESTclean is a software package not only for cleaning cDNA sequences, but also for helping to develop sequencing protocols by providing summary tables and figures for sequencing quality control in a graphical user interface. It outperforms in cleaning read sequences from complicated sequencing protocols which use barcodes and multiple amplification primers.
format Online
Article
Text
id pubmed-3630001
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36300012013-04-19 ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing Tae, Hongseok Ryu, Dongsung Sureshchandra, Suhas Choi, Jeong-Hyeon BMC Bioinformatics Software BACKGROUND: With the advent of next-generation sequencing (NGS) technologies, full cDNA shotgun sequencing has become a major approach in the study of transcriptomes, and several different protocols in 454 sequencing have been invented. As each protocol uses its own short DNA tags or adapters attached to the ends of cDNA fragments for labeling or sequencing, different contaminants may lead to mis-assembly and inaccurate sequence products. RESULTS: We have designed and implemented a new program for raw sequence cleaning in a graphical user interface and a batch script. The cleaning process consists of several modules including barcode trimming, sequencing adapter trimming, amplification primer trimming, poly-A tail trimming, vector screening and low quality region trimming. These modules can be combined based on various sequencing applications. CONCLUSIONS: ESTclean is a software package not only for cleaning cDNA sequences, but also for helping to develop sequencing protocols by providing summary tables and figures for sequencing quality control in a graphical user interface. It outperforms in cleaning read sequences from complicated sequencing protocols which use barcodes and multiple amplification primers. BioMed Central 2012-09-26 /pmc/articles/PMC3630001/ /pubmed/23009593 http://dx.doi.org/10.1186/1471-2105-13-247 Text en Copyright © 2012 Tae et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Tae, Hongseok
Ryu, Dongsung
Sureshchandra, Suhas
Choi, Jeong-Hyeon
ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing
title ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing
title_full ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing
title_fullStr ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing
title_full_unstemmed ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing
title_short ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing
title_sort estclean: a cleaning tool for next-gen transcriptome shotgun sequencing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3630001/
https://www.ncbi.nlm.nih.gov/pubmed/23009593
http://dx.doi.org/10.1186/1471-2105-13-247
work_keys_str_mv AT taehongseok estcleanacleaningtoolfornextgentranscriptomeshotgunsequencing
AT ryudongsung estcleanacleaningtoolfornextgentranscriptomeshotgunsequencing
AT sureshchandrasuhas estcleanacleaningtoolfornextgentranscriptomeshotgunsequencing
AT choijeonghyeon estcleanacleaningtoolfornextgentranscriptomeshotgunsequencing
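
The abstract describes the cleaning process as a set of modules (barcode trimming, poly-A tail trimming, low quality region trimming, and others) that can be combined per sequencing application. The following is only a minimal Python sketch of that idea, not ESTclean's actual implementation: the function names, the exact-match barcode test, and the thresholds (`min_len=5`, `min_q=20`) are all assumptions chosen for illustration.

```python
from typing import Callable, List, Tuple

# A read is a pair: (bases, per-base quality scores). Hypothetical representation.
Read = Tuple[str, List[int]]


def trim_barcode(read: Read, barcode: str) -> Read:
    """Remove an exact-match barcode from the 5' end, if present."""
    seq, qual = read
    if barcode and seq.startswith(barcode):
        n = len(barcode)
        return seq[n:], qual[n:]
    return read


def trim_poly_a(read: Read, min_len: int = 5) -> Read:
    """Remove a trailing run of 'A' bases if it is at least min_len long."""
    seq, qual = read
    kept = len(seq.rstrip("A"))
    if len(seq) - kept >= min_len:
        return seq[:kept], qual[:kept]
    return read


def trim_low_quality(read: Read, min_q: int = 20) -> Read:
    """Remove 3' bases until the last remaining base has quality >= min_q."""
    seq, qual = read
    end = len(seq)
    while end > 0 and qual[end - 1] < min_q:
        end -= 1
    return seq[:end], qual[:end]


def clean(read: Read, steps: List[Callable[[Read], Read]]) -> Read:
    """Apply the selected cleaning modules in the given order."""
    for step in steps:
        read = step(read)
    return read


if __name__ == "__main__":
    # Barcode ACGT, insert GGCCTTGC, a 6-base poly-A tail, then 2 low-quality bases.
    raw = ("ACGTGGCCTTGCAAAAAATT", [30] * 18 + [5, 5])
    pipeline = [lambda r: trim_barcode(r, "ACGT"), trim_low_quality, trim_poly_a]
    print(clean(raw, pipeline))
```

Keeping each module as a function from read to read means the pipeline order is just a list, which mirrors the abstract's point that modules can be combined differently for different protocols. A real tool would also need approximate matching for adapters and primers, which this sketch omits.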