
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa

BACKGROUND: Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools...


Bibliographic Details
Main authors: Shahin, Arwa, van Kaauwen, Martijn, Esselink, Danny, Bargsten, Joachim W, van Tuyl, Jaap M, Visser, Richard GF, Arens, Paul
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3576253/
https://www.ncbi.nlm.nih.gov/pubmed/23167289
http://dx.doi.org/10.1186/1471-2164-13-640
_version_ 1782259822706556928
author Shahin, Arwa
van Kaauwen, Martijn
Esselink, Danny
Bargsten, Joachim W
van Tuyl, Jaap M
Visser, Richard GF
Arens, Paul
author_sort Shahin, Arwa
collection PubMed
description BACKGROUND: Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, hardly any genetic studies have been performed and genomic resources are lacking. To build genomic resources and develop tools to speed up breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology.
RESULTS: We developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of the lily and 39% of the tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice as abundant in UTRs (UnTranslated Regions) as in coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs, lily and tulip (12,017 groups), and among the three monocot species, lily, tulip, and rice (6,900 groups), were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSRs. The lily and tulip contigs generated were annotated and described according to Gene Ontology terminology.
CONCLUSIONS: Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of the leaf transcriptome is very effective; however, deeper sequencing and the use of more tissues and stages are advisable for extended comparative studies.
format Online
Article
Text
id pubmed-3576253
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3576253 2013-02-20 BMC Genomics Research Article BioMed Central 2012-11-20 /pmc/articles/PMC3576253/ /pubmed/23167289 http://dx.doi.org/10.1186/1471-2164-13-640 Text en Copyright ©2012 Shahin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3576253/
https://www.ncbi.nlm.nih.gov/pubmed/23167289
http://dx.doi.org/10.1186/1471-2164-13-640