Cargando…

De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia

BACKGROUND: Sainfoin (Onobrychis viciifolia) is a highly nutritious tannin-containing forage legume. In the diet of ruminants sainfoin can have anti-parasitic effects and reduce methane emissions under in vitro conditions. Many of these benefits have been attributed to condensed tannins or proanthoc...

Descripción completa

Detalles Bibliográficos
Autores principales: Mora-Ortiz, Marina, Swain, Martin T., Vickers, Martin J., Hegarty, Matthew J., Kelly, Rhys, Smith, Lydia M. J., Skøt, Leif
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5037894/
https://www.ncbi.nlm.nih.gov/pubmed/27671367
http://dx.doi.org/10.1186/s12864-016-3083-6
_version_ 1782455836791013376
author Mora-Ortiz, Marina
Swain, Martin T.
Vickers, Martin J.
Hegarty, Matthew J.
Kelly, Rhys
Smith, Lydia M. J.
Skøt, Leif
author_facet Mora-Ortiz, Marina
Swain, Martin T.
Vickers, Martin J.
Hegarty, Matthew J.
Kelly, Rhys
Smith, Lydia M. J.
Skøt, Leif
author_sort Mora-Ortiz, Marina
collection PubMed
description BACKGROUND: Sainfoin (Onobrychis viciifolia) is a highly nutritious tannin-containing forage legume. In the diet of ruminants sainfoin can have anti-parasitic effects and reduce methane emissions under in vitro conditions. Many of these benefits have been attributed to condensed tannins or proanthocyanidins in sainfoin. A combination of increased use of industrially produced nitrogen fertilizer, issues with establishment and productivity in the first year and more reliable alternatives, such as red clover led to a decline in the use of sainfoin since the middle of the last century. In recent years there has been a resurgence of interest in sainfoin due to its potential beneficial nutraceutical and environmental attributes. However, genomic resources are scarce, thus hampering progress in genetic analysis and improvement. To address this we have used next generation RNA sequencing technology to obtain the first transcriptome of sainfoin. We used the library to identify gene-based simple sequence repeats (SSRs) and potential single nucleotide polymorphisms (SNPs). RESULTS: One genotype from each of five sainfoin accessions was sequenced. Paired-end (PE) sequences were generated from cDNA libraries of RNA extracted from 7 day old seedlings. A combined assembly of 92,772 transcripts was produced de novo using the Trinity programme. About 18,000 transcripts were annotated with at least one GO (gene ontology) term. A total of 63 transcripts were annotated as involved in the tannin biosynthesis pathway. We identified 3786 potential SSRs. SNPs were identified by mapping the reads of the individual assemblies against the combined assembly. After stringent filtering a total of 77,000 putative SNPs were identified. A phylogenetic analysis of single copy number genes showed that sainfoin was most closely related to red clover and Medicago truncatula, while Lotus japonicus, bean and soybean are more distant relatives. CONCLUSIONS: This work describes the first transcriptome assembly in sainfoin. The 92 K transcripts provide a rich source of SNP and SSR polymorphisms for future use in genetic studies of this crop. Annotation of genes involved in the condensed tannin biosynthesis pathway has provided the basis for further studies of the genetic control of this important trait in sainfoin. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3083-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5037894
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50378942016-10-05 De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia Mora-Ortiz, Marina Swain, Martin T. Vickers, Martin J. Hegarty, Matthew J. Kelly, Rhys Smith, Lydia M. J. Skøt, Leif BMC Genomics Research Article BACKGROUND: Sainfoin (Onobrychis viciifolia) is a highly nutritious tannin-containing forage legume. In the diet of ruminants sainfoin can have anti-parasitic effects and reduce methane emissions under in vitro conditions. Many of these benefits have been attributed to condensed tannins or proanthocyanidins in sainfoin. A combination of increased use of industrially produced nitrogen fertilizer, issues with establishment and productivity in the first year and more reliable alternatives, such as red clover led to a decline in the use of sainfoin since the middle of the last century. In recent years there has been a resurgence of interest in sainfoin due to its potential beneficial nutraceutical and environmental attributes. However, genomic resources are scarce, thus hampering progress in genetic analysis and improvement. To address this we have used next generation RNA sequencing technology to obtain the first transcriptome of sainfoin. We used the library to identify gene-based simple sequence repeats (SSRs) and potential single nucleotide polymorphisms (SNPs). RESULTS: One genotype from each of five sainfoin accessions was sequenced. Paired-end (PE) sequences were generated from cDNA libraries of RNA extracted from 7 day old seedlings. A combined assembly of 92,772 transcripts was produced de novo using the Trinity programme. About 18,000 transcripts were annotated with at least one GO (gene ontology) term. A total of 63 transcripts were annotated as involved in the tannin biosynthesis pathway. We identified 3786 potential SSRs. SNPs were identified by mapping the reads of the individual assemblies against the combined assembly. After stringent filtering a total of 77,000 putative SNPs were identified. A phylogenetic analysis of single copy number genes showed that sainfoin was most closely related to red clover and Medicago truncatula, while Lotus japonicus, bean and soybean are more distant relatives. CONCLUSIONS: This work describes the first transcriptome assembly in sainfoin. The 92 K transcripts provide a rich source of SNP and SSR polymorphisms for future use in genetic studies of this crop. Annotation of genes involved in the condensed tannin biosynthesis pathway has provided the basis for further studies of the genetic control of this important trait in sainfoin. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3083-6) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-26 /pmc/articles/PMC5037894/ /pubmed/27671367 http://dx.doi.org/10.1186/s12864-016-3083-6 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Mora-Ortiz, Marina
Swain, Martin T.
Vickers, Martin J.
Hegarty, Matthew J.
Kelly, Rhys
Smith, Lydia M. J.
Skøt, Leif
De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia
title De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia
title_full De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia
title_fullStr De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia
title_full_unstemmed De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia
title_short De-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in Onobrychis viciifolia
title_sort de-novo transcriptome assembly for gene identification, analysis, annotation, and molecular marker discovery in onobrychis viciifolia
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5037894/
https://www.ncbi.nlm.nih.gov/pubmed/27671367
http://dx.doi.org/10.1186/s12864-016-3083-6
work_keys_str_mv AT moraortizmarina denovotranscriptomeassemblyforgeneidentificationanalysisannotationandmolecularmarkerdiscoveryinonobrychisviciifolia
AT swainmartint denovotranscriptomeassemblyforgeneidentificationanalysisannotationandmolecularmarkerdiscoveryinonobrychisviciifolia
AT vickersmartinj denovotranscriptomeassemblyforgeneidentificationanalysisannotationandmolecularmarkerdiscoveryinonobrychisviciifolia
AT hegartymatthewj denovotranscriptomeassemblyforgeneidentificationanalysisannotationandmolecularmarkerdiscoveryinonobrychisviciifolia
AT kellyrhys denovotranscriptomeassemblyforgeneidentificationanalysisannotationandmolecularmarkerdiscoveryinonobrychisviciifolia
AT smithlydiamj denovotranscriptomeassemblyforgeneidentificationanalysisannotationandmolecularmarkerdiscoveryinonobrychisviciifolia
AT skøtleif denovotranscriptomeassemblyforgeneidentificationanalysisannotationandmolecularmarkerdiscoveryinonobrychisviciifolia