Cargando…

De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis

BACKGROUND: Chinese fir (Cunninghamia lanceolata) is an important timber species that accounts for 20–30% of the total commercial timber production in China. However, the available genomic information of Chinese fir is limited, and this severely encumbers functional genomic analysis and molecular br...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Hua-Hong, Xu, Li-Li, Tong, Zai-Kang, Lin, Er-Pei, Liu, Qing-Po, Cheng, Long-Jun, Zhu, Mu-Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561127/
https://www.ncbi.nlm.nih.gov/pubmed/23171398
http://dx.doi.org/10.1186/1471-2164-13-648
_version_ 1782257908496465920
author Huang, Hua-Hong
Xu, Li-Li
Tong, Zai-Kang
Lin, Er-Pei
Liu, Qing-Po
Cheng, Long-Jun
Zhu, Mu-Yuan
author_facet Huang, Hua-Hong
Xu, Li-Li
Tong, Zai-Kang
Lin, Er-Pei
Liu, Qing-Po
Cheng, Long-Jun
Zhu, Mu-Yuan
author_sort Huang, Hua-Hong
collection PubMed
description BACKGROUND: Chinese fir (Cunninghamia lanceolata) is an important timber species that accounts for 20–30% of the total commercial timber production in China. However, the available genomic information of Chinese fir is limited, and this severely encumbers functional genomic analysis and molecular breeding in Chinese fir. Recently, major advances in transcriptome sequencing have provided fast and cost-effective approaches to generate large expression datasets that have proven to be powerful tools to profile the transcriptomes of non-model organisms with undetermined genomes. RESULTS: In this study, the transcriptomes of nine tissues from Chinese fir were analyzed using the Illumina HiSeq™ 2000 sequencing platform. Approximately 40 million paired-end reads were obtained, generating 3.62 gigabase pairs of sequencing data. These reads were assembled into 83,248 unique sequences (i.e. Unigenes) with an average length of 449 bp, amounting to 37.40 Mb. A total of 73,779 Unigenes were supported by more than 5 reads, 42,663 (57.83%) had homologs in the NCBI non-redundant and Swiss-Prot protein databases, corresponding to 27,224 unique protein entries. Of these Unigenes, 16,750 were assigned to Gene Ontology classes, and 14,877 were clustered into orthologous groups. A total of 21,689 (29.40%) were mapped to 119 pathways by BLAST comparison against the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The majority of the genes encoding the enzymes in the biosynthetic pathways of cellulose and lignin were identified in the Unigene dataset by targeted searches of their annotations. And a number of candidate Chinese fir genes in the two metabolic pathways were discovered firstly. Eighteen genes related to cellulose and lignin biosynthesis were cloned for experimental validating of transcriptome data. Overall 49 Unigenes, covering different regions of these selected genes, were found by alignment. Their expression patterns in different tissues were analyzed by qRT-PCR to explore their putative functions. CONCLUSIONS: A substantial fraction of transcript sequences was obtained from the deep sequencing of Chinese fir. The assembled Unigene dataset was used to discover candidate genes of cellulose and lignin biosynthesis. This transcriptome dataset will provide a comprehensive sequence resource for molecular genetics research of C. lanceolata.
format Online
Article
Text
id pubmed-3561127
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35611272013-02-05 De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis Huang, Hua-Hong Xu, Li-Li Tong, Zai-Kang Lin, Er-Pei Liu, Qing-Po Cheng, Long-Jun Zhu, Mu-Yuan BMC Genomics Research Article BACKGROUND: Chinese fir (Cunninghamia lanceolata) is an important timber species that accounts for 20–30% of the total commercial timber production in China. However, the available genomic information of Chinese fir is limited, and this severely encumbers functional genomic analysis and molecular breeding in Chinese fir. Recently, major advances in transcriptome sequencing have provided fast and cost-effective approaches to generate large expression datasets that have proven to be powerful tools to profile the transcriptomes of non-model organisms with undetermined genomes. RESULTS: In this study, the transcriptomes of nine tissues from Chinese fir were analyzed using the Illumina HiSeq™ 2000 sequencing platform. Approximately 40 million paired-end reads were obtained, generating 3.62 gigabase pairs of sequencing data. These reads were assembled into 83,248 unique sequences (i.e. Unigenes) with an average length of 449 bp, amounting to 37.40 Mb. A total of 73,779 Unigenes were supported by more than 5 reads, 42,663 (57.83%) had homologs in the NCBI non-redundant and Swiss-Prot protein databases, corresponding to 27,224 unique protein entries. Of these Unigenes, 16,750 were assigned to Gene Ontology classes, and 14,877 were clustered into orthologous groups. A total of 21,689 (29.40%) were mapped to 119 pathways by BLAST comparison against the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The majority of the genes encoding the enzymes in the biosynthetic pathways of cellulose and lignin were identified in the Unigene dataset by targeted searches of their annotations. And a number of candidate Chinese fir genes in the two metabolic pathways were discovered firstly. Eighteen genes related to cellulose and lignin biosynthesis were cloned for experimental validating of transcriptome data. Overall 49 Unigenes, covering different regions of these selected genes, were found by alignment. Their expression patterns in different tissues were analyzed by qRT-PCR to explore their putative functions. CONCLUSIONS: A substantial fraction of transcript sequences was obtained from the deep sequencing of Chinese fir. The assembled Unigene dataset was used to discover candidate genes of cellulose and lignin biosynthesis. This transcriptome dataset will provide a comprehensive sequence resource for molecular genetics research of C. lanceolata. BioMed Central 2012-11-21 /pmc/articles/PMC3561127/ /pubmed/23171398 http://dx.doi.org/10.1186/1471-2164-13-648 Text en Copyright ©2012 Huang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Huang, Hua-Hong
Xu, Li-Li
Tong, Zai-Kang
Lin, Er-Pei
Liu, Qing-Po
Cheng, Long-Jun
Zhu, Mu-Yuan
De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis
title De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis
title_full De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis
title_fullStr De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis
title_full_unstemmed De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis
title_short De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis
title_sort de novo characterization of the chinese fir (cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561127/
https://www.ncbi.nlm.nih.gov/pubmed/23171398
http://dx.doi.org/10.1186/1471-2164-13-648
work_keys_str_mv AT huanghuahong denovocharacterizationofthechinesefircunninghamialanceolatatranscriptomeandanalysisofcandidategenesinvolvedincelluloseandligninbiosynthesis
AT xulili denovocharacterizationofthechinesefircunninghamialanceolatatranscriptomeandanalysisofcandidategenesinvolvedincelluloseandligninbiosynthesis
AT tongzaikang denovocharacterizationofthechinesefircunninghamialanceolatatranscriptomeandanalysisofcandidategenesinvolvedincelluloseandligninbiosynthesis
AT linerpei denovocharacterizationofthechinesefircunninghamialanceolatatranscriptomeandanalysisofcandidategenesinvolvedincelluloseandligninbiosynthesis
AT liuqingpo denovocharacterizationofthechinesefircunninghamialanceolatatranscriptomeandanalysisofcandidategenesinvolvedincelluloseandligninbiosynthesis
AT chenglongjun denovocharacterizationofthechinesefircunninghamialanceolatatranscriptomeandanalysisofcandidategenesinvolvedincelluloseandligninbiosynthesis
AT zhumuyuan denovocharacterizationofthechinesefircunninghamialanceolatatranscriptomeandanalysisofcandidategenesinvolvedincelluloseandligninbiosynthesis