Cargando…

Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data

BACKGROUND: Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model...

Descripción completa

Detalles Bibliográficos
Autores principales:	Duan, Jialei, Xia, Chuan, Zhao, Guangyao, Jia, Jizeng, Kong, Xiuying
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3485621/ https://www.ncbi.nlm.nih.gov/pubmed/22891638 http://dx.doi.org/10.1186/1471-2164-13-392

_version_	1782248327435255808
author	Duan, Jialei Xia, Chuan Zhao, Guangyao Jia, Jizeng Kong, Xiuying
author_facet	Duan, Jialei Xia, Chuan Zhao, Guangyao Jia, Jizeng Kong, Xiuying
author_sort	Duan, Jialei
collection	PubMed
description	BACKGROUND: Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments. RESULTS: In this study, we used 212.6 million pair-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community. CONCLUSIONS: It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly.
format	Online Article Text
id	pubmed-3485621
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-34856212012-11-02 Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data Duan, Jialei Xia, Chuan Zhao, Guangyao Jia, Jizeng Kong, Xiuying BMC Genomics Research Article BACKGROUND: Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments. RESULTS: In this study, we used 212.6 million pair-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community. CONCLUSIONS: It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly. BioMed Central 2012-08-14 /pmc/articles/PMC3485621/ /pubmed/22891638 http://dx.doi.org/10.1186/1471-2164-13-392 Text en Copyright ©2012 Duan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Duan, Jialei Xia, Chuan Zhao, Guangyao Jia, Jizeng Kong, Xiuying Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
title	Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
title_full	Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
title_fullStr	Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
title_full_unstemmed	Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
title_short	Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
title_sort	optimizing de novo common wheat transcriptome assembly using short-read rna-seq data
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3485621/ https://www.ncbi.nlm.nih.gov/pubmed/22891638 http://dx.doi.org/10.1186/1471-2164-13-392
work_keys_str_mv	AT duanjialei optimizingdenovocommonwheattranscriptomeassemblyusingshortreadrnaseqdata AT xiachuan optimizingdenovocommonwheattranscriptomeassemblyusingshortreadrnaseqdata AT zhaoguangyao optimizingdenovocommonwheattranscriptomeassemblyusingshortreadrnaseqdata AT jiajizeng optimizingdenovocommonwheattranscriptomeassemblyusingshortreadrnaseqdata AT kongxiuying optimizingdenovocommonwheattranscriptomeassemblyusingshortreadrnaseqdata

Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data

Ejemplares similares