Cargando…
Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
Background. The emergence of next-generation sequencing platform gives rise to a new generation of assembly algorithms. Compared with the Sanger sequencing data, the next-generation sequence data present shorter reads, higher coverage depth, and different error profiles. These features bring new cha...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
International Scholarly Research Network
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417554/ https://www.ncbi.nlm.nih.gov/pubmed/25969752 http://dx.doi.org/10.5402/2012/816402 |
_version_ | 1782369377115439104 |
---|---|
author | Chen, Chien-Chih Lin, Wen-Dar Chang, Yu-Jung Chen, Chuen-Liang Ho, Jan-Ming |
author_facet | Chen, Chien-Chih Lin, Wen-Dar Chang, Yu-Jung Chen, Chuen-Liang Ho, Jan-Ming |
author_sort | Chen, Chien-Chih |
collection | PubMed |
description | Background. The emergence of next-generation sequencing platform gives rise to a new generation of assembly algorithms. Compared with the Sanger sequencing data, the next-generation sequence data present shorter reads, higher coverage depth, and different error profiles. These features bring new challenging issues for de novo transcriptome assembly. Methodology. To explore the influence of these features on assembly algorithms, we studied the relationship between read overlap size, coverage depth, and error rate using simulated data. According to the relationship, we propose a de novo transcriptome assembly procedure, called Euler-mix, and demonstrate its performance on a real transcriptome dataset of mice. The simulation tool and evaluation tool are freely available as open source. Significance. Euler-mix is a straightforward pipeline; it focuses on dealing with the variation of coverage depth of short reads dataset. The experiment result showed that Euler-mix improves the performance of de novo transcriptome assembly. |
format | Online Article Text |
id | pubmed-4417554 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | International Scholarly Research Network |
record_format | MEDLINE/PubMed |
spelling | pubmed-44175542015-05-12 Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes Chen, Chien-Chih Lin, Wen-Dar Chang, Yu-Jung Chen, Chuen-Liang Ho, Jan-Ming ISRN Bioinform Research Article Background. The emergence of next-generation sequencing platform gives rise to a new generation of assembly algorithms. Compared with the Sanger sequencing data, the next-generation sequence data present shorter reads, higher coverage depth, and different error profiles. These features bring new challenging issues for de novo transcriptome assembly. Methodology. To explore the influence of these features on assembly algorithms, we studied the relationship between read overlap size, coverage depth, and error rate using simulated data. According to the relationship, we propose a de novo transcriptome assembly procedure, called Euler-mix, and demonstrate its performance on a real transcriptome dataset of mice. The simulation tool and evaluation tool are freely available as open source. Significance. Euler-mix is a straightforward pipeline; it focuses on dealing with the variation of coverage depth of short reads dataset. The experiment result showed that Euler-mix improves the performance of de novo transcriptome assembly. International Scholarly Research Network 2012-04-23 /pmc/articles/PMC4417554/ /pubmed/25969752 http://dx.doi.org/10.5402/2012/816402 Text en Copyright © 2012 Chien-Chih Chen et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chen, Chien-Chih Lin, Wen-Dar Chang, Yu-Jung Chen, Chuen-Liang Ho, Jan-Ming Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes |
title | Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes |
title_full | Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes |
title_fullStr | Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes |
title_full_unstemmed | Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes |
title_short | Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes |
title_sort | enhancing de novo transcriptome assembly by incorporating multiple overlap sizes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417554/ https://www.ncbi.nlm.nih.gov/pubmed/25969752 http://dx.doi.org/10.5402/2012/816402 |
work_keys_str_mv | AT chenchienchih enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes AT linwendar enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes AT changyujung enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes AT chenchuenliang enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes AT hojanming enhancingdenovotranscriptomeassemblybyincorporatingmultipleoverlapsizes |