Cargando…

Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing

BACKGROUND: The garden pea, Pisum sativum, is among the best-investigated legume plants and of significant agro-commercial relevance. Pisum sativum has a large and complex genome and accordingly few comprehensive genomic resources exist. RESULTS: We analyzed the pea transcriptome at the highest poss...

Descripción completa

Detalles Bibliográficos
Autores principales:	Franssen, Susanne U, Shrestha, Roshan P, Bräutigam, Andrea, Bornberg-Bauer, Erich, Weber, Andreas PM
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224338/ https://www.ncbi.nlm.nih.gov/pubmed/21569327 http://dx.doi.org/10.1186/1471-2164-12-227

_version_	1782217363284819968
author	Franssen, Susanne U Shrestha, Roshan P Bräutigam, Andrea Bornberg-Bauer, Erich Weber, Andreas PM
author_facet	Franssen, Susanne U Shrestha, Roshan P Bräutigam, Andrea Bornberg-Bauer, Erich Weber, Andreas PM
author_sort	Franssen, Susanne U
collection	PubMed
description	BACKGROUND: The garden pea, Pisum sativum, is among the best-investigated legume plants and of significant agro-commercial relevance. Pisum sativum has a large and complex genome and accordingly few comprehensive genomic resources exist. RESULTS: We analyzed the pea transcriptome at the highest possible amount of accuracy by current technology. We used next generation sequencing with the Roche/454 platform and evaluated and compared a variety of approaches, including diverse tissue libraries, normalization, alternative sequencing technologies, saturation estimation and diverse assembly strategies. We generated libraries from flowers, leaves, cotyledons, epi- and hypocotyl, and etiolated and light treated etiolated seedlings, comprising a total of 450 megabases. Libraries were assembled into 324,428 unigenes in a first pass assembly. A second pass assembly reduced the amount to 81,449 unigenes but caused a significant number of chimeras. Analyses of the assemblies identified the assembly step as a major possibility for improvement. By recording frequencies of Arabidopsis orthologs hit by randomly drawn reads and fitting parameters of the saturation curve we concluded that sequencing was exhaustive. For leaf libraries we found normalization allows partial recovery of expression strength aside the desired effect of increased coverage. Based on theoretical and biological considerations we concluded that the sequence reads in the database tagged the vast majority of transcripts in the aerial tissues. A pathway representation analysis showed the merits of sampling multiple aerial tissues to increase the number of tagged genes. All results have been made available as a fully annotated database in fasta format. CONCLUSIONS: We conclude that the approach taken resulted in a high quality - dataset which serves well as a first comprehensive reference set for the model legume pea. We suggest future deep sequencing transcriptome projects of species lacking a genomics backbone will need to concentrate mainly on resolving the issues of redundancy and paralogy during transcriptome assembly.
format	Online Article Text
id	pubmed-3224338
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-32243382011-11-27 Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing Franssen, Susanne U Shrestha, Roshan P Bräutigam, Andrea Bornberg-Bauer, Erich Weber, Andreas PM BMC Genomics Research Article BACKGROUND: The garden pea, Pisum sativum, is among the best-investigated legume plants and of significant agro-commercial relevance. Pisum sativum has a large and complex genome and accordingly few comprehensive genomic resources exist. RESULTS: We analyzed the pea transcriptome at the highest possible amount of accuracy by current technology. We used next generation sequencing with the Roche/454 platform and evaluated and compared a variety of approaches, including diverse tissue libraries, normalization, alternative sequencing technologies, saturation estimation and diverse assembly strategies. We generated libraries from flowers, leaves, cotyledons, epi- and hypocotyl, and etiolated and light treated etiolated seedlings, comprising a total of 450 megabases. Libraries were assembled into 324,428 unigenes in a first pass assembly. A second pass assembly reduced the amount to 81,449 unigenes but caused a significant number of chimeras. Analyses of the assemblies identified the assembly step as a major possibility for improvement. By recording frequencies of Arabidopsis orthologs hit by randomly drawn reads and fitting parameters of the saturation curve we concluded that sequencing was exhaustive. For leaf libraries we found normalization allows partial recovery of expression strength aside the desired effect of increased coverage. Based on theoretical and biological considerations we concluded that the sequence reads in the database tagged the vast majority of transcripts in the aerial tissues. A pathway representation analysis showed the merits of sampling multiple aerial tissues to increase the number of tagged genes. All results have been made available as a fully annotated database in fasta format. CONCLUSIONS: We conclude that the approach taken resulted in a high quality - dataset which serves well as a first comprehensive reference set for the model legume pea. We suggest future deep sequencing transcriptome projects of species lacking a genomics backbone will need to concentrate mainly on resolving the issues of redundancy and paralogy during transcriptome assembly. BioMed Central 2011-05-11 /pmc/articles/PMC3224338/ /pubmed/21569327 http://dx.doi.org/10.1186/1471-2164-12-227 Text en Copyright ©2011 Franssen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Franssen, Susanne U Shrestha, Roshan P Bräutigam, Andrea Bornberg-Bauer, Erich Weber, Andreas PM Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing
title	Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing
title_full	Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing
title_fullStr	Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing
title_full_unstemmed	Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing
title_short	Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing
title_sort	comprehensive transcriptome analysis of the highly complex pisum sativum genome using next generation sequencing
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224338/ https://www.ncbi.nlm.nih.gov/pubmed/21569327 http://dx.doi.org/10.1186/1471-2164-12-227
work_keys_str_mv	AT franssensusanneu comprehensivetranscriptomeanalysisofthehighlycomplexpisumsativumgenomeusingnextgenerationsequencing AT shrestharoshanp comprehensivetranscriptomeanalysisofthehighlycomplexpisumsativumgenomeusingnextgenerationsequencing AT brautigamandrea comprehensivetranscriptomeanalysisofthehighlycomplexpisumsativumgenomeusingnextgenerationsequencing AT bornbergbauererich comprehensivetranscriptomeanalysisofthehighlycomplexpisumsativumgenomeusingnextgenerationsequencing AT weberandreaspm comprehensivetranscriptomeanalysisofthehighlycomplexpisumsativumgenomeusingnextgenerationsequencing

Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing

Ejemplares similares