Cargando…

A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)

BACKGROUND: Members of the pine family (Pinaceae), especially species of spruce (Picea spp.) and pine (Pinus spp.), dominate many of the world's temperate and boreal forests. These conifer forests are of critical importance for global ecosystem stability and biodiversity. They also provide the...

Descripción completa

Detalles Bibliográficos
Autores principales: Ralph, Steven G, Chun, Hye Jung E, Kolosova, Natalia, Cooper, Dawn, Oddy, Claire, Ritland, Carol E, Kirkpatrick, Robert, Moore, Richard, Barber, Sarah, Holt, Robert A, Jones, Steven JM, Marra, Marco A, Douglas, Carl J, Ritland, Kermit, Bohlmann, Jörg
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2579922/
https://www.ncbi.nlm.nih.gov/pubmed/18854048
http://dx.doi.org/10.1186/1471-2164-9-484
_version_ 1782160594444484608
author Ralph, Steven G
Chun, Hye Jung E
Kolosova, Natalia
Cooper, Dawn
Oddy, Claire
Ritland, Carol E
Kirkpatrick, Robert
Moore, Richard
Barber, Sarah
Holt, Robert A
Jones, Steven JM
Marra, Marco A
Douglas, Carl J
Ritland, Kermit
Bohlmann, Jörg
author_facet Ralph, Steven G
Chun, Hye Jung E
Kolosova, Natalia
Cooper, Dawn
Oddy, Claire
Ritland, Carol E
Kirkpatrick, Robert
Moore, Richard
Barber, Sarah
Holt, Robert A
Jones, Steven JM
Marra, Marco A
Douglas, Carl J
Ritland, Kermit
Bohlmann, Jörg
author_sort Ralph, Steven G
collection PubMed
description BACKGROUND: Members of the pine family (Pinaceae), especially species of spruce (Picea spp.) and pine (Pinus spp.), dominate many of the world's temperate and boreal forests. These conifer forests are of critical importance for global ecosystem stability and biodiversity. They also provide the majority of the world's wood and fiber supply and serve as a renewable resource for other industrial biomaterials. In contrast to angiosperms, functional and comparative genomics research on conifers, or other gymnosperms, is limited by the lack of a relevant reference genome sequence. Sequence-finished full-length (FL)cDNAs and large collections of expressed sequence tags (ESTs) are essential for gene discovery, functional genomics, and for future efforts of conifer genome annotation. RESULTS: As part of a conifer genomics program to characterize defense against insects and adaptation to local environments, and to discover genes for the production of biomaterials, we developed 20 standard, normalized or full-length enriched cDNA libraries from Sitka spruce (P. sitchensis), white spruce (P. glauca), and interior spruce (P. glauca-engelmannii complex). We sequenced and analyzed 206,875 3'- or 5'-end ESTs from these libraries, and developed a resource of 6,464 high-quality sequence-finished FLcDNAs from Sitka spruce. Clustering and assembly of 147,146 3'-end ESTs resulted in 19,941 contigs and 26,804 singletons, representing 46,745 putative unique transcripts (PUTs). The 6,464 FLcDNAs were all obtained from a single Sitka spruce genotype and represent 5,718 PUTs. CONCLUSION: This paper provides detailed annotation and quality assessment of a large EST and FLcDNA resource for spruce. The 6,464 Sitka spruce FLcDNAs represent the third largest sequence-verified FLcDNA resource for any plant species, behind only rice (Oryza sativa) and Arabidopsis (Arabidopsis thaliana), and the only substantial FLcDNA resource for a gymnosperm. Our emphasis on capturing FLcDNAs and ESTs from cDNA libraries representing herbivore-, wound- or elicitor-treated induced spruce tissues, along with incorporating normalization to capture rare transcripts, resulted in a rich resource for functional genomics and proteomics studies. Sequence comparisons against five plant genomes and the non-redundant GenBank protein database revealed that a substantial number of spruce transcripts have no obvious similarity to known angiosperm gene sequences. Opportunities for future applications of the sequence and clone resources for comparative and functional genomics are discussed.
format Text
id pubmed-2579922
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25799222008-11-06 A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis) Ralph, Steven G Chun, Hye Jung E Kolosova, Natalia Cooper, Dawn Oddy, Claire Ritland, Carol E Kirkpatrick, Robert Moore, Richard Barber, Sarah Holt, Robert A Jones, Steven JM Marra, Marco A Douglas, Carl J Ritland, Kermit Bohlmann, Jörg BMC Genomics Research Article BACKGROUND: Members of the pine family (Pinaceae), especially species of spruce (Picea spp.) and pine (Pinus spp.), dominate many of the world's temperate and boreal forests. These conifer forests are of critical importance for global ecosystem stability and biodiversity. They also provide the majority of the world's wood and fiber supply and serve as a renewable resource for other industrial biomaterials. In contrast to angiosperms, functional and comparative genomics research on conifers, or other gymnosperms, is limited by the lack of a relevant reference genome sequence. Sequence-finished full-length (FL)cDNAs and large collections of expressed sequence tags (ESTs) are essential for gene discovery, functional genomics, and for future efforts of conifer genome annotation. RESULTS: As part of a conifer genomics program to characterize defense against insects and adaptation to local environments, and to discover genes for the production of biomaterials, we developed 20 standard, normalized or full-length enriched cDNA libraries from Sitka spruce (P. sitchensis), white spruce (P. glauca), and interior spruce (P. glauca-engelmannii complex). We sequenced and analyzed 206,875 3'- or 5'-end ESTs from these libraries, and developed a resource of 6,464 high-quality sequence-finished FLcDNAs from Sitka spruce. Clustering and assembly of 147,146 3'-end ESTs resulted in 19,941 contigs and 26,804 singletons, representing 46,745 putative unique transcripts (PUTs). The 6,464 FLcDNAs were all obtained from a single Sitka spruce genotype and represent 5,718 PUTs. CONCLUSION: This paper provides detailed annotation and quality assessment of a large EST and FLcDNA resource for spruce. The 6,464 Sitka spruce FLcDNAs represent the third largest sequence-verified FLcDNA resource for any plant species, behind only rice (Oryza sativa) and Arabidopsis (Arabidopsis thaliana), and the only substantial FLcDNA resource for a gymnosperm. Our emphasis on capturing FLcDNAs and ESTs from cDNA libraries representing herbivore-, wound- or elicitor-treated induced spruce tissues, along with incorporating normalization to capture rare transcripts, resulted in a rich resource for functional genomics and proteomics studies. Sequence comparisons against five plant genomes and the non-redundant GenBank protein database revealed that a substantial number of spruce transcripts have no obvious similarity to known angiosperm gene sequences. Opportunities for future applications of the sequence and clone resources for comparative and functional genomics are discussed. BioMed Central 2008-10-14 /pmc/articles/PMC2579922/ /pubmed/18854048 http://dx.doi.org/10.1186/1471-2164-9-484 Text en Copyright © 2008 Ralph et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ralph, Steven G
Chun, Hye Jung E
Kolosova, Natalia
Cooper, Dawn
Oddy, Claire
Ritland, Carol E
Kirkpatrick, Robert
Moore, Richard
Barber, Sarah
Holt, Robert A
Jones, Steven JM
Marra, Marco A
Douglas, Carl J
Ritland, Kermit
Bohlmann, Jörg
A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)
title A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)
title_full A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)
title_fullStr A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)
title_full_unstemmed A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)
title_short A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)
title_sort conifer genomics resource of 200,000 spruce (picea spp.) ests and 6,464 high-quality, sequence-finished full-length cdnas for sitka spruce (picea sitchensis)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2579922/
https://www.ncbi.nlm.nih.gov/pubmed/18854048
http://dx.doi.org/10.1186/1471-2164-9-484
work_keys_str_mv AT ralphsteveng aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT chunhyejunge aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT kolosovanatalia aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT cooperdawn aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT oddyclaire aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT ritlandcarole aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT kirkpatrickrobert aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT moorerichard aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT barbersarah aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT holtroberta aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT jonesstevenjm aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT marramarcoa aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT douglascarlj aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT ritlandkermit aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT bohlmannjorg aconifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT ralphsteveng conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT chunhyejunge conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT kolosovanatalia conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT cooperdawn conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT oddyclaire conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT ritlandcarole conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT kirkpatrickrobert conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT moorerichard conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT barbersarah conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT holtroberta conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT jonesstevenjm conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT marramarcoa conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT douglascarlj conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT ritlandkermit conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis
AT bohlmannjorg conifergenomicsresourceof200000sprucepiceasppestsand6464highqualitysequencefinishedfulllengthcdnasforsitkasprucepiceasitchensis