A glance at quality score: implication for de novo transcriptome reconstruction of Illumina reads

Downstream analyses of short-reads from next-generation sequencing platforms are often preceded by a pre-processing step that removes uncalled and wrongly called bases. Standard approaches rely on their associated base quality scores to retain the read or a portion of it when the score is above a pr...

Bibliographic Details
Main authors: Mbandi, Stanley Kimbung; Hesse, Uljana; Rees, D. Jasper G.; Christoffels, Alan
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2014
Subjects: Genetics
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3921913/
https://www.ncbi.nlm.nih.gov/pubmed/24575122
http://dx.doi.org/10.3389/fgene.2014.00017
author Mbandi, Stanley Kimbung
Hesse, Uljana
Rees, D. Jasper G.
Christoffels, Alan
collection PubMed
description Downstream analyses of short reads from next-generation sequencing platforms are often preceded by a pre-processing step that removes uncalled and wrongly called bases. Standard approaches rely on the associated base quality scores, retaining a read or a portion of it when the score is above a predefined threshold. Without a reference, it is difficult to differentiate sequencing error from biological variation using quality scores alone. The effects of quality-score-based trimming have not been systematically studied in de novo transcriptome assembly. Using RNA-Seq data produced from Illumina, we teased out the effects of quality-score-based filtering or trimming on de novo transcriptome reconstruction. We showed that assemblies produced from reads subjected to different quality score thresholds contain truncated and missing transfrags when compared to those from untrimmed reads. Our data support the view that de novo assembly of untrimmed data is challenging for de Bruijn graph assemblers. However, our results indicate that comparing the assemblies from untrimmed and trimmed read subsets can suggest appropriate filtering parameters and enable selection of the optimum de novo transcriptome assembly in non-model organisms.
format Online
Article
Text
id pubmed-3921913
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-3921913 2014-02-26 A glance at quality score: implication for de novo transcriptome reconstruction of Illumina reads Mbandi, Stanley Kimbung Hesse, Uljana Rees, D. Jasper G. Christoffels, Alan Front Genet Genetics Frontiers Media S.A. 2014-02-12 /pmc/articles/PMC3921913/ /pubmed/24575122 http://dx.doi.org/10.3389/fgene.2014.00017 Text en Copyright © 2014 Mbandi, Hesse, Rees and Christoffels. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).
The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title A glance at quality score: implication for de novo transcriptome reconstruction of Illumina reads
topic Genetics