Cargando…

Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak

BACKGROUND: The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the Quercus family. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks, are among the most important deciduous forest tree species in Europe. Despit...

Descripción completa

Detalles Bibliográficos
Autores principales: Ueno, Saneyoshi, Le Provost, Grégoire, Léger, Valérie, Klopp, Christophe, Noirot, Céline, Frigerio, Jean-Marc, Salin, Franck, Salse, Jérôme, Abrouk, Michael, Murat, Florent, Brendel, Oliver, Derory, Jérémy, Abadie, Pierre, Léger, Patrick, Cabane, Cyril, Barré, Aurélien, de Daruvar, Antoine, Couloux, Arnaud, Wincker, Patrick, Reviron, Marie-Pierre, Kremer, Antoine, Plomion, Christophe
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3017864/
https://www.ncbi.nlm.nih.gov/pubmed/21092232
http://dx.doi.org/10.1186/1471-2164-11-650
_version_ 1782195970470051840
author Ueno, Saneyoshi
Le Provost, Grégoire
Léger, Valérie
Klopp, Christophe
Noirot, Céline
Frigerio, Jean-Marc
Salin, Franck
Salse, Jérôme
Abrouk, Michael
Murat, Florent
Brendel, Oliver
Derory, Jérémy
Abadie, Pierre
Léger, Patrick
Cabane, Cyril
Barré, Aurélien
de Daruvar, Antoine
Couloux, Arnaud
Wincker, Patrick
Reviron, Marie-Pierre
Kremer, Antoine
Plomion, Christophe
author_facet Ueno, Saneyoshi
Le Provost, Grégoire
Léger, Valérie
Klopp, Christophe
Noirot, Céline
Frigerio, Jean-Marc
Salin, Franck
Salse, Jérôme
Abrouk, Michael
Murat, Florent
Brendel, Oliver
Derory, Jérémy
Abadie, Pierre
Léger, Patrick
Cabane, Cyril
Barré, Aurélien
de Daruvar, Antoine
Couloux, Arnaud
Wincker, Patrick
Reviron, Marie-Pierre
Kremer, Antoine
Plomion, Christophe
author_sort Ueno, Saneyoshi
collection PubMed
description BACKGROUND: The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the Quercus family. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks, are among the most important deciduous forest tree species in Europe. Despite their ecological and economical importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. RESULTS: We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking lead us to eliminate 19,941 sequences. Finally a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. Blast search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoids biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. Comparative orthologous sequences (COS) with other plant gene models were identified and allow to unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. CONCLUSIONS: This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations.
format Text
id pubmed-3017864
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30178642011-01-24 Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak Ueno, Saneyoshi Le Provost, Grégoire Léger, Valérie Klopp, Christophe Noirot, Céline Frigerio, Jean-Marc Salin, Franck Salse, Jérôme Abrouk, Michael Murat, Florent Brendel, Oliver Derory, Jérémy Abadie, Pierre Léger, Patrick Cabane, Cyril Barré, Aurélien de Daruvar, Antoine Couloux, Arnaud Wincker, Patrick Reviron, Marie-Pierre Kremer, Antoine Plomion, Christophe BMC Genomics Research Article BACKGROUND: The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the Quercus family. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks, are among the most important deciduous forest tree species in Europe. Despite their ecological and economical importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. RESULTS: We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking lead us to eliminate 19,941 sequences. Finally a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. Blast search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoids biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. Comparative orthologous sequences (COS) with other plant gene models were identified and allow to unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. CONCLUSIONS: This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations. BioMed Central 2010-11-23 /pmc/articles/PMC3017864/ /pubmed/21092232 http://dx.doi.org/10.1186/1471-2164-11-650 Text en Copyright ©2010 Ueno et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ueno, Saneyoshi
Le Provost, Grégoire
Léger, Valérie
Klopp, Christophe
Noirot, Céline
Frigerio, Jean-Marc
Salin, Franck
Salse, Jérôme
Abrouk, Michael
Murat, Florent
Brendel, Oliver
Derory, Jérémy
Abadie, Pierre
Léger, Patrick
Cabane, Cyril
Barré, Aurélien
de Daruvar, Antoine
Couloux, Arnaud
Wincker, Patrick
Reviron, Marie-Pierre
Kremer, Antoine
Plomion, Christophe
Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak
title Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak
title_full Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak
title_fullStr Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak
title_full_unstemmed Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak
title_short Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak
title_sort bioinformatic analysis of ests collected by sanger and pyrosequencing methods for a keystone forest tree species: oak
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3017864/
https://www.ncbi.nlm.nih.gov/pubmed/21092232
http://dx.doi.org/10.1186/1471-2164-11-650
work_keys_str_mv AT uenosaneyoshi bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT leprovostgregoire bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT legervalerie bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT kloppchristophe bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT noirotceline bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT frigeriojeanmarc bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT salinfranck bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT salsejerome bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT abroukmichael bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT muratflorent bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT brendeloliver bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT deroryjeremy bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT abadiepierre bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT legerpatrick bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT cabanecyril bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT barreaurelien bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT dedaruvarantoine bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT coulouxarnaud bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT winckerpatrick bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT revironmariepierre bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT kremerantoine bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak
AT plomionchristophe bioinformaticanalysisofestscollectedbysangerandpyrosequencingmethodsforakeystoneforesttreespeciesoak