De Novo Assembly of Transcriptome Sequencing in Caragana korshinskii Kom. and Characterization of EST-SSR Markers

Caragana korshinskii Kom. is widely distributed in various habitats, including gravel desert, clay desert, fixed and semi-fixed sand, and saline land in the Asian and African deserts. To date, no genomic information or EST-SSR markers have been reported for the genus Caragana Fabr. In this study,...

Bibliographic Details
Main Authors: Long, Yan, Wang, Yanyan, Wu, Shanshan, Wang, Jiao, Tian, Xinjie, Pei, Xinwu
Format: Online Article Text
Language: English
Published: Public Library of Science 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4309406/
https://www.ncbi.nlm.nih.gov/pubmed/25629164
http://dx.doi.org/10.1371/journal.pone.0115805
_version_ 1782354686080188416
author Long, Yan
Wang, Yanyan
Wu, Shanshan
Wang, Jiao
Tian, Xinjie
Pei, Xinwu
author_facet Long, Yan
Wang, Yanyan
Wu, Shanshan
Wang, Jiao
Tian, Xinjie
Pei, Xinwu
author_sort Long, Yan
collection PubMed
description Caragana korshinskii Kom. is widely distributed in various habitats, including gravel desert, clay desert, fixed and semi-fixed sand, and saline land in the Asian and African deserts. To date, no genomic information or EST-SSR markers have been reported for the genus Caragana Fabr. In this study, more than two billion bases of high-quality C. korshinskii sequence were generated using Illumina sequencing technology, demonstrating the de novo assembly and annotation of genes without prior genome information. These reads were assembled into 86,265 unigenes (mean length = 709 bp). The similarity search indicated that 33,955 and 21,978 unigenes showed significant similarity to known proteins in the NCBI non-redundant and Swiss-Prot protein databases, respectively. Among these annotated unigenes, 26,232 unigenes were assigned to the Gene Ontology (GO) database. When 22,756 unigenes were searched against the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database, 5,598 unigenes were assigned to 5 main categories comprising 32 KEGG pathways. Among the main KEGG categories, metabolism was the largest (2,862, 43.7%), suggesting active metabolic processes in this desert tree. In addition, a total of 19,150 EST-SSRs were identified from 15,484 unigenes, and the characteristics of these EST-SSRs were compared with those of four other species in Fabaceae. A total of 126 potential marker sites were randomly selected to validate the assembly quality and to develop EST-SSR markers. Among 9 germplasms of the genus Caragana Fabr., the PCR success rate was 93.7%, and a phylogenetic tree was constructed from the genotypic data. This research generated a substantial fraction of the transcriptome sequences, which are useful resources for gene annotation and discovery, molecular marker development, and genome assembly and annotation. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding.
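The abstract reports that EST-SSRs were identified from the unigenes but does not state the tool or thresholds used. As an illustration only, the following minimal Python sketch shows how perfect SSRs can be located in a nucleotide sequence with regular expressions. The function name find_ssrs, the SSR record type, and the minimum repeat counts in MIN_REPEATS are hypothetical choices made for this example and are not taken from the study; mononucleotide repeats are deliberately left out.

```python
import re
from typing import NamedTuple


class SSR(NamedTuple):
    start: int    # 0-based start position of the repeat tract
    motif: str    # repeated unit, e.g. "AG"
    repeats: int  # number of consecutive copies of the motif


# Assumed thresholds (motif length -> minimum copies); NOT from the paper.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq: str, min_repeats: dict = MIN_REPEATS) -> list:
    """Return perfect SSRs in seq, sorted by start position."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # Capture a k-base motif, then require at least n-1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. "ATAT" is really the dinucleotide "AT").
            if any(k % d == 0 and motif == motif[:d] * (k // d)
                   for d in range(1, k)):
                continue
            hits.append(SSR(m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AG" * 7 + "TTCA" + "CTT" * 5 + "G"
    for ssr in find_ssrs(example):
        print(ssr.start, ssr.motif, ssr.repeats)
```

Run on a set of unigenes, a routine like this would give the per-motif counts and SSR-containing sequence totals of the kind summarized in the abstract, although the exact numbers depend on the thresholds chosen.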
format Online
Article
Text
id pubmed-4309406
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-4309406 2015-02-06 De Novo Assembly of Transcriptome Sequencing in Caragana korshinskii Kom. and Characterization of EST-SSR Markers Long, Yan Wang, Yanyan Wu, Shanshan Wang, Jiao Tian, Xinjie Pei, Xinwu PLoS One Research Article Caragana korshinskii Kom. is widely distributed in various habitats, including gravel desert, clay desert, fixed and semi-fixed sand, and saline land in the Asian and African deserts. To date, no genomic information or EST-SSR markers have been reported for the genus Caragana Fabr. In this study, more than two billion bases of high-quality C. korshinskii sequence were generated using Illumina sequencing technology, demonstrating the de novo assembly and annotation of genes without prior genome information. These reads were assembled into 86,265 unigenes (mean length = 709 bp). The similarity search indicated that 33,955 and 21,978 unigenes showed significant similarity to known proteins in the NCBI non-redundant and Swiss-Prot protein databases, respectively. Among these annotated unigenes, 26,232 unigenes were assigned to the Gene Ontology (GO) database. When 22,756 unigenes were searched against the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database, 5,598 unigenes were assigned to 5 main categories comprising 32 KEGG pathways. Among the main KEGG categories, metabolism was the largest (2,862, 43.7%), suggesting active metabolic processes in this desert tree. In addition, a total of 19,150 EST-SSRs were identified from 15,484 unigenes, and the characteristics of these EST-SSRs were compared with those of four other species in Fabaceae. A total of 126 potential marker sites were randomly selected to validate the assembly quality and to develop EST-SSR markers. Among 9 germplasms of the genus Caragana Fabr., the PCR success rate was 93.7%, and a phylogenetic tree was constructed from the genotypic data.
This research generated a substantial fraction of the transcriptome sequences, which are useful resources for gene annotation and discovery, molecular marker development, and genome assembly and annotation. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding. Public Library of Science 2015-01-28 /pmc/articles/PMC4309406/ /pubmed/25629164 http://dx.doi.org/10.1371/journal.pone.0115805 Text en © 2015 Long et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Long, Yan
Wang, Yanyan
Wu, Shanshan
Wang, Jiao
Tian, Xinjie
Pei, Xinwu
De Novo Assembly of Transcriptome Sequencing in Caragana korshinskii Kom. and Characterization of EST-SSR Markers
title De Novo Assembly of Transcriptome Sequencing in Caragana korshinskii Kom. and Characterization of EST-SSR Markers
title_sort de novo assembly of transcriptome sequencing in caragana korshinskii kom. and characterization of est-ssr markers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4309406/
https://www.ncbi.nlm.nih.gov/pubmed/25629164
http://dx.doi.org/10.1371/journal.pone.0115805