Cargando…

The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers

BACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its...

Descripción completa

Detalles Bibliográficos
Autores principales: Lulin, Huang, Xiao, Yang, Pei, Sun, Wen, Tong, Shangqin, Hu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3378585/
https://www.ncbi.nlm.nih.gov/pubmed/22723874
http://dx.doi.org/10.1371/journal.pone.0038653
_version_ 1782236062719934464
author Lulin, Huang
Xiao, Yang
Pei, Sun
Wen, Tong
Shangqin, Hu
author_facet Lulin, Huang
Xiao, Yang
Pei, Sun
Wen, Tong
Shangqin, Hu
author_sort Lulin, Huang
collection PubMed
description BACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its genomic resources has been delayed. Second-generation Illumina sequencing is now an efficient route for generating an enormous volume of sequences that can represent a large number of genes and their expression levels. METHODOLOGY/PRINCIPAL FINDINGS: To investigate the genes and pathways that might control flavonoids and other secondary metabolites in the safflower, we used Illumina sequencing to perform a de novo assembly of the safflower tubular flower tissue transcriptome. We obtained a total of 4.69 Gb in clean nucleotides comprising 52,119,104 clean sequencing reads, 195,320 contigs, and 120,778 unigenes. Based on similarity searches with known proteins, we annotated 70,342 of the unigenes (about 58% of the identified unigenes) with cut-off E-values of 10(−5). In total, 21,943 of the safflower unigenes were found to have COG classifications, and BLAST2GO assigned 26,332 of the unigenes to 1,754 GO term annotations. In addition, we assigned 30,203 of the unigenes to 121 KEGG pathways. When we focused on genes identified as contributing to flavonoid biosynthesis and the biosynthesis of unsaturated fatty acids, which are important pathways that control flower and seed quality, respectively, we found that these genes were fairly well conserved in the safflower genome compared to those of other plants. CONCLUSIONS/SIGNIFICANCE: Our study provides abundant genomic data for Carthamus tinctorius L. and offers comprehensive sequence resources for studying the safflower. We believe that these transcriptome datasets will serve as an important public information platform to accelerate studies of the safflower genome, and may help us define the mechanisms of flower tissue-specific and secondary metabolism in this non-model plant.
format Online
Article
Text
id pubmed-3378585
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33785852012-06-21 The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers Lulin, Huang Xiao, Yang Pei, Sun Wen, Tong Shangqin, Hu PLoS One Research Article BACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its genomic resources has been delayed. Second-generation Illumina sequencing is now an efficient route for generating an enormous volume of sequences that can represent a large number of genes and their expression levels. METHODOLOGY/PRINCIPAL FINDINGS: To investigate the genes and pathways that might control flavonoids and other secondary metabolites in the safflower, we used Illumina sequencing to perform a de novo assembly of the safflower tubular flower tissue transcriptome. We obtained a total of 4.69 Gb in clean nucleotides comprising 52,119,104 clean sequencing reads, 195,320 contigs, and 120,778 unigenes. Based on similarity searches with known proteins, we annotated 70,342 of the unigenes (about 58% of the identified unigenes) with cut-off E-values of 10(−5). In total, 21,943 of the safflower unigenes were found to have COG classifications, and BLAST2GO assigned 26,332 of the unigenes to 1,754 GO term annotations. In addition, we assigned 30,203 of the unigenes to 121 KEGG pathways. When we focused on genes identified as contributing to flavonoid biosynthesis and the biosynthesis of unsaturated fatty acids, which are important pathways that control flower and seed quality, respectively, we found that these genes were fairly well conserved in the safflower genome compared to those of other plants. CONCLUSIONS/SIGNIFICANCE: Our study provides abundant genomic data for Carthamus tinctorius L. and offers comprehensive sequence resources for studying the safflower. We believe that these transcriptome datasets will serve as an important public information platform to accelerate studies of the safflower genome, and may help us define the mechanisms of flower tissue-specific and secondary metabolism in this non-model plant. Public Library of Science 2012-06-19 /pmc/articles/PMC3378585/ /pubmed/22723874 http://dx.doi.org/10.1371/journal.pone.0038653 Text en Huang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lulin, Huang
Xiao, Yang
Pei, Sun
Wen, Tong
Shangqin, Hu
The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers
title The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers
title_full The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers
title_fullStr The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers
title_full_unstemmed The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers
title_short The First Illumina-Based De Novo Transcriptome Sequencing and Analysis of Safflower Flowers
title_sort first illumina-based de novo transcriptome sequencing and analysis of safflower flowers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3378585/
https://www.ncbi.nlm.nih.gov/pubmed/22723874
http://dx.doi.org/10.1371/journal.pone.0038653
work_keys_str_mv AT lulinhuang thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT xiaoyang thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT peisun thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT wentong thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT shangqinhu thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT lulinhuang firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT xiaoyang firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT peisun firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT wentong firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT shangqinhu firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers