Cargando…

Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers

BACKGROUND: Sesame is an important oil crop, but limited transcriptomic and genomic data are currently available. This information is essential to clarify the fatty acid and lignan biosynthesis molecular mechanism. In addition, a shortage of sesame molecular markers limits the efficiency and accurac...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wei, Wenliang, Qi, Xiaoqiong, Wang, Linhai, Zhang, Yanxin, Hua, Wei, Li, Donghua, Lv, Haixia, Zhang, Xiurong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184296/ https://www.ncbi.nlm.nih.gov/pubmed/21929789 http://dx.doi.org/10.1186/1471-2164-12-451

_version_	1782213088930430976
author	Wei, Wenliang Qi, Xiaoqiong Wang, Linhai Zhang, Yanxin Hua, Wei Li, Donghua Lv, Haixia Zhang, Xiurong
author_facet	Wei, Wenliang Qi, Xiaoqiong Wang, Linhai Zhang, Yanxin Hua, Wei Li, Donghua Lv, Haixia Zhang, Xiurong
author_sort	Wei, Wenliang
collection	PubMed
description	BACKGROUND: Sesame is an important oil crop, but limited transcriptomic and genomic data are currently available. This information is essential to clarify the fatty acid and lignan biosynthesis molecular mechanism. In addition, a shortage of sesame molecular markers limits the efficiency and accuracy of genetic breeding. High-throughput transcriptomic sequencing is essential to generate a large transcriptome sequence dataset for gene discovery and molecular marker development. RESULTS: Sesame transcriptomes from five tissues were sequenced using Illumina paired-end sequencing technology. The cleaned raw reads were assembled into a total of 86,222 unigenes with an average length of 629 bp. Of the unigenes, 46,584 (54.03%) had significant similarity with proteins in the NCBI nonredundant protein database and Swiss-Prot database (E-value < 10(-5)). Of these annotated unigenes, 10,805 and 27,588 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In total, 22,003 (25.52%) unigenes were mapped onto 119 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). Furthermore, 44,750 unigenes showed homology to 15,460 Arabidopsis genes based on BLASTx analysis against The Arabidopsis Information Resource (TAIR, Version 10) and revealed relatively high gene coverage. In total, 7,702 unigenes were converted into SSR markers (EST-SSR). Dinucleotide SSRs were the dominant repeat motif (67.07%, 5,166), followed by trinucleotide (24.89%, 1,917), tetranucleotide (4.31%, 332), hexanucleotide (2.62%, 202), and pentanucleotide (1.10%, 85) SSRs. AG/CT (46.29%) was the dominant repeat motif, followed by AC/GT (16.07%), AT/AT (10.53%), AAG/CTT (6.23%), and AGG/CCT (3.39%). Fifty EST-SSRs were randomly selected to validate amplification and to determine the degree of polymorphism in the genomic DNA pools. Forty primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among 24 sesame accessions. CONCLUSIONS: This study demonstrates that Illumina paired-end sequencing is a fast and cost-effective approach to gene discovery and molecular marker development in non-model organisms. Our results provide a comprehensive sequence resource for sesame research.
format	Online Article Text
id	pubmed-3184296
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31842962011-10-02 Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers Wei, Wenliang Qi, Xiaoqiong Wang, Linhai Zhang, Yanxin Hua, Wei Li, Donghua Lv, Haixia Zhang, Xiurong BMC Genomics Research Article BACKGROUND: Sesame is an important oil crop, but limited transcriptomic and genomic data are currently available. This information is essential to clarify the fatty acid and lignan biosynthesis molecular mechanism. In addition, a shortage of sesame molecular markers limits the efficiency and accuracy of genetic breeding. High-throughput transcriptomic sequencing is essential to generate a large transcriptome sequence dataset for gene discovery and molecular marker development. RESULTS: Sesame transcriptomes from five tissues were sequenced using Illumina paired-end sequencing technology. The cleaned raw reads were assembled into a total of 86,222 unigenes with an average length of 629 bp. Of the unigenes, 46,584 (54.03%) had significant similarity with proteins in the NCBI nonredundant protein database and Swiss-Prot database (E-value < 10(-5)). Of these annotated unigenes, 10,805 and 27,588 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In total, 22,003 (25.52%) unigenes were mapped onto 119 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). Furthermore, 44,750 unigenes showed homology to 15,460 Arabidopsis genes based on BLASTx analysis against The Arabidopsis Information Resource (TAIR, Version 10) and revealed relatively high gene coverage. In total, 7,702 unigenes were converted into SSR markers (EST-SSR). Dinucleotide SSRs were the dominant repeat motif (67.07%, 5,166), followed by trinucleotide (24.89%, 1,917), tetranucleotide (4.31%, 332), hexanucleotide (2.62%, 202), and pentanucleotide (1.10%, 85) SSRs. AG/CT (46.29%) was the dominant repeat motif, followed by AC/GT (16.07%), AT/AT (10.53%), AAG/CTT (6.23%), and AGG/CCT (3.39%). Fifty EST-SSRs were randomly selected to validate amplification and to determine the degree of polymorphism in the genomic DNA pools. Forty primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among 24 sesame accessions. CONCLUSIONS: This study demonstrates that Illumina paired-end sequencing is a fast and cost-effective approach to gene discovery and molecular marker development in non-model organisms. Our results provide a comprehensive sequence resource for sesame research. BioMed Central 2011-09-19 /pmc/articles/PMC3184296/ /pubmed/21929789 http://dx.doi.org/10.1186/1471-2164-12-451 Text en Copyright ©2011 Wei et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Wei, Wenliang Qi, Xiaoqiong Wang, Linhai Zhang, Yanxin Hua, Wei Li, Donghua Lv, Haixia Zhang, Xiurong Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers
title	Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers
title_full	Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers
title_fullStr	Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers
title_full_unstemmed	Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers
title_short	Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers
title_sort	characterization of the sesame (sesamum indicum l.) global transcriptome using illumina paired-end sequencing and development of est-ssr markers
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184296/ https://www.ncbi.nlm.nih.gov/pubmed/21929789 http://dx.doi.org/10.1186/1471-2164-12-451
work_keys_str_mv	AT weiwenliang characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers AT qixiaoqiong characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers AT wanglinhai characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers AT zhangyanxin characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers AT huawei characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers AT lidonghua characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers AT lvhaixia characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers AT zhangxiurong characterizationofthesesamesesamumindicumlglobaltranscriptomeusingilluminapairedendsequencinganddevelopmentofestssrmarkers

Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers

Ejemplares similares