Cargando…

Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes

Background. The emergence of next-generation sequencing platform gives rise to a new generation of assembly algorithms. Compared with the Sanger sequencing data, the next-generation sequence data present shorter reads, higher coverage depth, and different error profiles. These features bring new cha...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Chien-Chih, Lin, Wen-Dar, Chang, Yu-Jung, Chen, Chuen-Liang, Ho, Jan-Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Scholarly Research Network 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417554/
https://www.ncbi.nlm.nih.gov/pubmed/25969752
http://dx.doi.org/10.5402/2012/816402
_version_ 1782369377115439104
author Chen, Chien-Chih
Lin, Wen-Dar
Chang, Yu-Jung
Chen, Chuen-Liang
Ho, Jan-Ming
author_facet Chen, Chien-Chih
Lin, Wen-Dar
Chang, Yu-Jung
Chen, Chuen-Liang
Ho, Jan-Ming
author_sort Chen, Chien-Chih
collection PubMed
description Background. The emergence of next-generation sequencing platform gives rise to a new generation of assembly algorithms. Compared with the Sanger sequencing data, the next-generation sequence data present shorter reads, higher coverage depth, and different error profiles. These features bring new challenging issues for de novo transcriptome assembly. Methodology. To explore the influence of these features on assembly algorithms, we studied the relationship between read overlap size, coverage depth, and error rate using simulated data. According to the relationship, we propose a de novo transcriptome assembly procedure, called Euler-mix, and demonstrate its performance on a real transcriptome dataset of mice. The simulation tool and evaluation tool are freely available as open source. Significance. Euler-mix is a straightforward pipeline; it focuses on dealing with the variation of coverage depth of short reads dataset. The experiment result showed that Euler-mix improves the performance of de novo transcriptome assembly.
format Online
Article
Text
id pubmed-4417554
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher International Scholarly Research Network
record_format MEDLINE/PubMed
spelling pubmed-44175542015-05-12 Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes Chen, Chien-Chih Lin, Wen-Dar Chang, Yu-Jung Chen, Chuen-Liang Ho, Jan-Ming ISRN Bioinform Research Article Background. The emergence of next-generation sequencing platform gives rise to a new generation of assembly algorithms. Compared with the Sanger sequencing data, the next-generation sequence data present shorter reads, higher coverage depth, and different error profiles. These features bring new challenging issues for de novo transcriptome assembly. Methodology. To explore the influence of these features on assembly algorithms, we studied the relationship between read overlap size, coverage depth, and error rate using simulated data. According to the relationship, we propose a de novo transcriptome assembly procedure, called Euler-mix, and demonstrate its performance on a real transcriptome dataset of mice. The simulation tool and evaluation tool are freely available as open source. Significance. Euler-mix is a straightforward pipeline; it focuses on dealing with the variation of coverage depth of short reads dataset. The experiment result showed that Euler-mix improves the performance of de novo transcriptome assembly. International Scholarly Research Network 2012-04-23 /pmc/articles/PMC4417554/ /pubmed/25969752 http://dx.doi.org/10.5402/2012/816402 Text en Copyright © 2012 Chien-Chih Chen et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Chien-Chih
Lin, Wen-Dar
Chang, Yu-Jung
Chen, Chuen-Liang
Ho, Jan-Ming
Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
title Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
title_full Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
title_fullStr Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
title_full_unstemmed Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
title_short Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
title_sort enhancing de novo transcriptome assembly by incorporating multiple overlap sizes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417554/
https://www.ncbi.nlm.nih.gov/pubmed/25969752
http://dx.doi.org/10.5402/2012/816402
work_keys_str_mv AT chenchienchih enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes
AT linwendar enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes
AT changyujung enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes
AT chenchuenliang enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes
AT hojanming enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes