Cargando…
De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock
BACKGROUND: Arundo donax has attracted renewed interest as a potential candidate energy crop for use in biomass-to-liquid fuel conversion processes and biorefineries. This is due to its high productivity, adaptability to marginal land conditions, and suitability for biofuel and biomaterial productio...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5450047/ https://www.ncbi.nlm.nih.gov/pubmed/28572841 http://dx.doi.org/10.1186/s13068-017-0828-7 |
_version_ | 1783239875429924864 |
---|---|
author | Evangelistella, Chiara Valentini, Alessio Ludovisi, Riccardo Firrincieli, Andrea Fabbrini, Francesco Scalabrin, Simone Cattonaro, Federica Morgante, Michele Mugnozza, Giuseppe Scarascia Keurentjes, Joost J. B. Harfouche, Antoine |
author_facet | Evangelistella, Chiara Valentini, Alessio Ludovisi, Riccardo Firrincieli, Andrea Fabbrini, Francesco Scalabrin, Simone Cattonaro, Federica Morgante, Michele Mugnozza, Giuseppe Scarascia Keurentjes, Joost J. B. Harfouche, Antoine |
author_sort | Evangelistella, Chiara |
collection | PubMed |
description | BACKGROUND: Arundo donax has attracted renewed interest as a potential candidate energy crop for use in biomass-to-liquid fuel conversion processes and biorefineries. This is due to its high productivity, adaptability to marginal land conditions, and suitability for biofuel and biomaterial production. Despite its importance, the genomic resources currently available for supporting the improvement of this species are still limited. RESULTS: We used RNA sequencing (RNA-Seq) to de novo assemble and characterize the A. donax leaf transcriptome. The sequencing generated 1249 million clean reads that were assembled using single-k-mer and multi-k-mer approaches into 62,596 unique sequences (unitranscripts) with an N50 of 1134 bp. TransDecoder and Trinotate software suites were used to obtain putative coding sequences and annotate them by mapping to UniProtKB/Swiss-Prot and UniRef90 databases, searching for known transcripts, proteins, protein domains, and signal peptides. Furthermore, the unitranscripts were annotated by mapping them to the NCBI non-redundant, GO and KEGG pathway databases using Blast2GO. The transcriptome was also characterized by BLAST searches to investigate homologous transcripts of key genes involved in important metabolic pathways, such as lignin, cellulose, purine, and thiamine biosynthesis and carbon fixation. Moreover, a set of homologous transcripts of key genes involved in stomatal development and of genes coding for stress-associated proteins (SAPs) were identified. Additionally, 8364 simple sequence repeat (SSR) markers were identified and surveyed. SSRs appeared more abundant in non-coding regions (63.18%) than in coding regions (36.82%). This SSR dataset represents the first marker catalogue of A. donax. 53 SSRs (PolySSRs) were then predicted to be polymorphic between ecotype-specific assemblies, suggesting genetic variability in the studied ecotypes. CONCLUSIONS: This study provides the first publicly available leaf transcriptome for the A. donax bioenergy crop. The functional annotation and characterization of the transcriptome will be highly useful for providing insight into the molecular mechanisms underlying its extreme adaptability. The identification of homologous transcripts involved in key metabolic pathways offers a platform for directing future efforts in genetic improvement of this species. Finally, the identified SSRs will facilitate the harnessing of untapped genetic diversity. This transcriptome should be of value to ongoing functional genomics and genetic studies in this crop of paramount economic importance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13068-017-0828-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5450047 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54500472017-06-01 De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock Evangelistella, Chiara Valentini, Alessio Ludovisi, Riccardo Firrincieli, Andrea Fabbrini, Francesco Scalabrin, Simone Cattonaro, Federica Morgante, Michele Mugnozza, Giuseppe Scarascia Keurentjes, Joost J. B. Harfouche, Antoine Biotechnol Biofuels Research BACKGROUND: Arundo donax has attracted renewed interest as a potential candidate energy crop for use in biomass-to-liquid fuel conversion processes and biorefineries. This is due to its high productivity, adaptability to marginal land conditions, and suitability for biofuel and biomaterial production. Despite its importance, the genomic resources currently available for supporting the improvement of this species are still limited. RESULTS: We used RNA sequencing (RNA-Seq) to de novo assemble and characterize the A. donax leaf transcriptome. The sequencing generated 1249 million clean reads that were assembled using single-k-mer and multi-k-mer approaches into 62,596 unique sequences (unitranscripts) with an N50 of 1134 bp. TransDecoder and Trinotate software suites were used to obtain putative coding sequences and annotate them by mapping to UniProtKB/Swiss-Prot and UniRef90 databases, searching for known transcripts, proteins, protein domains, and signal peptides. Furthermore, the unitranscripts were annotated by mapping them to the NCBI non-redundant, GO and KEGG pathway databases using Blast2GO. The transcriptome was also characterized by BLAST searches to investigate homologous transcripts of key genes involved in important metabolic pathways, such as lignin, cellulose, purine, and thiamine biosynthesis and carbon fixation. Moreover, a set of homologous transcripts of key genes involved in stomatal development and of genes coding for stress-associated proteins (SAPs) were identified. Additionally, 8364 simple sequence repeat (SSR) markers were identified and surveyed. SSRs appeared more abundant in non-coding regions (63.18%) than in coding regions (36.82%). This SSR dataset represents the first marker catalogue of A. donax. 53 SSRs (PolySSRs) were then predicted to be polymorphic between ecotype-specific assemblies, suggesting genetic variability in the studied ecotypes. CONCLUSIONS: This study provides the first publicly available leaf transcriptome for the A. donax bioenergy crop. The functional annotation and characterization of the transcriptome will be highly useful for providing insight into the molecular mechanisms underlying its extreme adaptability. The identification of homologous transcripts involved in key metabolic pathways offers a platform for directing future efforts in genetic improvement of this species. Finally, the identified SSRs will facilitate the harnessing of untapped genetic diversity. This transcriptome should be of value to ongoing functional genomics and genetic studies in this crop of paramount economic importance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13068-017-0828-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-30 /pmc/articles/PMC5450047/ /pubmed/28572841 http://dx.doi.org/10.1186/s13068-017-0828-7 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Evangelistella, Chiara Valentini, Alessio Ludovisi, Riccardo Firrincieli, Andrea Fabbrini, Francesco Scalabrin, Simone Cattonaro, Federica Morgante, Michele Mugnozza, Giuseppe Scarascia Keurentjes, Joost J. B. Harfouche, Antoine De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock |
title | De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock |
title_full | De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock |
title_fullStr | De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock |
title_full_unstemmed | De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock |
title_short | De novo assembly, functional annotation, and analysis of the giant reed (Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock |
title_sort | de novo assembly, functional annotation, and analysis of the giant reed (arundo donax l.) leaf transcriptome provide tools for the development of a biofuel feedstock |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5450047/ https://www.ncbi.nlm.nih.gov/pubmed/28572841 http://dx.doi.org/10.1186/s13068-017-0828-7 |
work_keys_str_mv | AT evangelistellachiara denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT valentinialessio denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT ludovisiriccardo denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT firrincieliandrea denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT fabbrinifrancesco denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT scalabrinsimone denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT cattonarofederica denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT morgantemichele denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT mugnozzagiuseppescarascia denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT keurentjesjoostjb denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock AT harfoucheantoine denovoassemblyfunctionalannotationandanalysisofthegiantreedarundodonaxlleaftranscriptomeprovidetoolsforthedevelopmentofabiofuelfeedstock |