Cargando…
Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers
BACKGROUND: The medicinal legume Mucuna pruriens (L.) DC. has attracted attention worldwide as a source of the anti-Parkinson’s drug L-Dopa. It is also a popular green manure cover crop that offers many agronomic benefits including high protein content, nitrogen fixation and soil nutrients. The plan...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445377/ https://www.ncbi.nlm.nih.gov/pubmed/28545396 http://dx.doi.org/10.1186/s12864-017-3780-9 |
_version_ | 1783238876263874560 |
---|---|
author | Sathyanarayana, N. Pittala, Ranjith Kumar Tripathi, Pankaj Kumar Chopra, Ratan Singh, Heikham Russiachand Belamkar, Vikas Bhardwaj, Pardeep Kumar Doyle, Jeff J. Egan, Ashley N. |
author_facet | Sathyanarayana, N. Pittala, Ranjith Kumar Tripathi, Pankaj Kumar Chopra, Ratan Singh, Heikham Russiachand Belamkar, Vikas Bhardwaj, Pardeep Kumar Doyle, Jeff J. Egan, Ashley N. |
author_sort | Sathyanarayana, N. |
collection | PubMed |
description | BACKGROUND: The medicinal legume Mucuna pruriens (L.) DC. has attracted attention worldwide as a source of the anti-Parkinson’s drug L-Dopa. It is also a popular green manure cover crop that offers many agronomic benefits including high protein content, nitrogen fixation and soil nutrients. The plant currently lacks genomic resources and there is limited knowledge on gene expression, metabolic pathways, and genetics of secondary metabolite production. Here, we present transcriptomic resources for M. pruriens, including a de novo transcriptome assembly and annotation, as well as differential transcript expression analyses between root, leaf, and pod tissues. We also develop microsatellite markers and analyze genetic diversity and population structure within a set of Indian germplasm accessions. RESULTS: One-hundred ninety-one million two hundred thirty-three thousand two hundred forty-two bp cleaned reads were assembled into 67,561 transcripts with mean length of 626 bp and N50 of 987 bp. Assembled sequences were annotated using BLASTX against public databases with over 80% of transcripts annotated. We identified 7,493 simple sequence repeat (SSR) motifs, including 787 polymorphic repeats between the parents of a mapping population. 134 SSRs from expressed sequenced tags (ESTs) were screened against 23 M. pruriens accessions from India, with 52 EST-SSRs retained after quality control. Population structure analysis using a Bayesian framework implemented in fastSTRUCTURE showed nearly similar groupings as with distance-based (neighbor-joining) and principal component analyses, with most of the accessions clustering per geographical origins. Pair-wise comparison of transcript expression in leaves, roots and pods identified 4,387 differentially expressed transcripts with the highest number occurring between roots and leaves. Differentially expressed transcripts were enriched with transcription factors and transcripts annotated as belonging to secondary metabolite pathways. CONCLUSIONS: The M. pruriens transcriptomic resources generated in this study provide foundational resources for gene discovery and development of molecular markers. Polymorphic SSRs identified can be used for genetic diversity, marker-trait analyses, and development of functional markers for crop improvement. The results of differential expression studies can be used to investigate genes involved in L-Dopa synthesis and other key metabolic pathways in M. pruriens. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3780-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5445377 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54453772017-05-30 Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers Sathyanarayana, N. Pittala, Ranjith Kumar Tripathi, Pankaj Kumar Chopra, Ratan Singh, Heikham Russiachand Belamkar, Vikas Bhardwaj, Pardeep Kumar Doyle, Jeff J. Egan, Ashley N. BMC Genomics Research Article BACKGROUND: The medicinal legume Mucuna pruriens (L.) DC. has attracted attention worldwide as a source of the anti-Parkinson’s drug L-Dopa. It is also a popular green manure cover crop that offers many agronomic benefits including high protein content, nitrogen fixation and soil nutrients. The plant currently lacks genomic resources and there is limited knowledge on gene expression, metabolic pathways, and genetics of secondary metabolite production. Here, we present transcriptomic resources for M. pruriens, including a de novo transcriptome assembly and annotation, as well as differential transcript expression analyses between root, leaf, and pod tissues. We also develop microsatellite markers and analyze genetic diversity and population structure within a set of Indian germplasm accessions. RESULTS: One-hundred ninety-one million two hundred thirty-three thousand two hundred forty-two bp cleaned reads were assembled into 67,561 transcripts with mean length of 626 bp and N50 of 987 bp. Assembled sequences were annotated using BLASTX against public databases with over 80% of transcripts annotated. We identified 7,493 simple sequence repeat (SSR) motifs, including 787 polymorphic repeats between the parents of a mapping population. 134 SSRs from expressed sequenced tags (ESTs) were screened against 23 M. pruriens accessions from India, with 52 EST-SSRs retained after quality control. Population structure analysis using a Bayesian framework implemented in fastSTRUCTURE showed nearly similar groupings as with distance-based (neighbor-joining) and principal component analyses, with most of the accessions clustering per geographical origins. Pair-wise comparison of transcript expression in leaves, roots and pods identified 4,387 differentially expressed transcripts with the highest number occurring between roots and leaves. Differentially expressed transcripts were enriched with transcription factors and transcripts annotated as belonging to secondary metabolite pathways. CONCLUSIONS: The M. pruriens transcriptomic resources generated in this study provide foundational resources for gene discovery and development of molecular markers. Polymorphic SSRs identified can be used for genetic diversity, marker-trait analyses, and development of functional markers for crop improvement. The results of differential expression studies can be used to investigate genes involved in L-Dopa synthesis and other key metabolic pathways in M. pruriens. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3780-9) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-25 /pmc/articles/PMC5445377/ /pubmed/28545396 http://dx.doi.org/10.1186/s12864-017-3780-9 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Sathyanarayana, N. Pittala, Ranjith Kumar Tripathi, Pankaj Kumar Chopra, Ratan Singh, Heikham Russiachand Belamkar, Vikas Bhardwaj, Pardeep Kumar Doyle, Jeff J. Egan, Ashley N. Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers |
title | Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers |
title_full | Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers |
title_fullStr | Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers |
title_full_unstemmed | Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers |
title_short | Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers |
title_sort | transcriptomic resources for the medicinal legume mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of est-ssr markers |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445377/ https://www.ncbi.nlm.nih.gov/pubmed/28545396 http://dx.doi.org/10.1186/s12864-017-3780-9 |
work_keys_str_mv | AT sathyanarayanan transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT pittalaranjithkumar transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT tripathipankajkumar transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT chopraratan transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT singhheikhamrussiachand transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT belamkarvikas transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT bhardwajpardeepkumar transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT doylejeffj transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers AT eganashleyn transcriptomicresourcesforthemedicinallegumemucunapruriensdenovotranscriptomeassemblyannotationidentificationandvalidationofestssrmarkers |