Cargando…

Shoot transcriptome of the giant reed, Arundo donax

The giant reed, Arundo donax, is a perennial grass species that has become an invasive plant in many countries. Expansive stands of A. donax have significant negative impacts on available water resources and efforts are underway to identify biological control agents against this species. The giant r...

Descripción completa

Detalles Bibliográficos
Autores principales: Barrero, Roberto A., Guerrero, Felix D., Moolhuijzen, Paula, Goolsby, John A., Tidwell, Jason, Bellgard, Stanley E., Bellgard, Matthew I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4509983/
https://www.ncbi.nlm.nih.gov/pubmed/26217707
http://dx.doi.org/10.1016/j.dib.2014.12.007
_version_ 1782382105041305600
author Barrero, Roberto A.
Guerrero, Felix D.
Moolhuijzen, Paula
Goolsby, John A.
Tidwell, Jason
Bellgard, Stanley E.
Bellgard, Matthew I.
author_facet Barrero, Roberto A.
Guerrero, Felix D.
Moolhuijzen, Paula
Goolsby, John A.
Tidwell, Jason
Bellgard, Stanley E.
Bellgard, Matthew I.
author_sort Barrero, Roberto A.
collection PubMed
description The giant reed, Arundo donax, is a perennial grass species that has become an invasive plant in many countries. Expansive stands of A. donax have significant negative impacts on available water resources and efforts are underway to identify biological control agents against this species. The giant reed grows under adverse environmental conditions, displaying insensitivity to drought stress, flooding, heavy metals, salinity and herbaceous competition, thus hampering control programs. To establish a foundational molecular dataset, we used an llumina Hi-Seq protocol to sequence the transcriptome of actively growing shoots from an invasive genotype collected along the Rio Grande River, bordering Texas and Mexico. We report the assembly of 27,491 high confidence transcripts (≥200 bp) with at least 70% coverage of known genes in other Poaceae species. Of these 13,080 (47.58%), 6165 (22.43%) and 8246 (30.0%) transcripts have sequence similarity to known, domain-containing and conserved hypothetical proteins, respectively. We also report 75,590 low confidence transcripts supported by both trans-ABBySS and Velvet-Oases de novo assembly pipelines. Within the low confidence subset of transcripts we identified partial hits to known (19,021; 25.16%), domain-containing (7093; 9.38%) and conserved hypothetical (16,647; 22.02%) proteins. Additionally 32,829 (43.43%) transcripts encode putative hypothetical proteins unique to A. donax. Functional annotation resulted in 5,550 and 6,070 transcripts with assigned Gene Ontology and KEGG pathway information, respectively. The most abundant KEGG pathways are spliceosome, ribosome, ubiquitin mediated proteolysis, plant–pathogen interaction, RNA degradation and oxidative phosphorylation metabolic pathway. Furthermore, we also found 12, 9, and 4 transcripts annotated as stress-related, heat stress, and water stress proteins, respectively. We envisage that these resources will promote and facilitate studies of the abiotic stress capabilities of this exotic plant species, which facilitates its invasive capacity.
format Online
Article
Text
id pubmed-4509983
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-45099832015-07-27 Shoot transcriptome of the giant reed, Arundo donax Barrero, Roberto A. Guerrero, Felix D. Moolhuijzen, Paula Goolsby, John A. Tidwell, Jason Bellgard, Stanley E. Bellgard, Matthew I. Data Brief Data Article The giant reed, Arundo donax, is a perennial grass species that has become an invasive plant in many countries. Expansive stands of A. donax have significant negative impacts on available water resources and efforts are underway to identify biological control agents against this species. The giant reed grows under adverse environmental conditions, displaying insensitivity to drought stress, flooding, heavy metals, salinity and herbaceous competition, thus hampering control programs. To establish a foundational molecular dataset, we used an llumina Hi-Seq protocol to sequence the transcriptome of actively growing shoots from an invasive genotype collected along the Rio Grande River, bordering Texas and Mexico. We report the assembly of 27,491 high confidence transcripts (≥200 bp) with at least 70% coverage of known genes in other Poaceae species. Of these 13,080 (47.58%), 6165 (22.43%) and 8246 (30.0%) transcripts have sequence similarity to known, domain-containing and conserved hypothetical proteins, respectively. We also report 75,590 low confidence transcripts supported by both trans-ABBySS and Velvet-Oases de novo assembly pipelines. Within the low confidence subset of transcripts we identified partial hits to known (19,021; 25.16%), domain-containing (7093; 9.38%) and conserved hypothetical (16,647; 22.02%) proteins. Additionally 32,829 (43.43%) transcripts encode putative hypothetical proteins unique to A. donax. Functional annotation resulted in 5,550 and 6,070 transcripts with assigned Gene Ontology and KEGG pathway information, respectively. The most abundant KEGG pathways are spliceosome, ribosome, ubiquitin mediated proteolysis, plant–pathogen interaction, RNA degradation and oxidative phosphorylation metabolic pathway. Furthermore, we also found 12, 9, and 4 transcripts annotated as stress-related, heat stress, and water stress proteins, respectively. We envisage that these resources will promote and facilitate studies of the abiotic stress capabilities of this exotic plant species, which facilitates its invasive capacity. Elsevier 2015-01-22 /pmc/articles/PMC4509983/ /pubmed/26217707 http://dx.doi.org/10.1016/j.dib.2014.12.007 Text en © 2015 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Barrero, Roberto A.
Guerrero, Felix D.
Moolhuijzen, Paula
Goolsby, John A.
Tidwell, Jason
Bellgard, Stanley E.
Bellgard, Matthew I.
Shoot transcriptome of the giant reed, Arundo donax
title Shoot transcriptome of the giant reed, Arundo donax
title_full Shoot transcriptome of the giant reed, Arundo donax
title_fullStr Shoot transcriptome of the giant reed, Arundo donax
title_full_unstemmed Shoot transcriptome of the giant reed, Arundo donax
title_short Shoot transcriptome of the giant reed, Arundo donax
title_sort shoot transcriptome of the giant reed, arundo donax
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4509983/
https://www.ncbi.nlm.nih.gov/pubmed/26217707
http://dx.doi.org/10.1016/j.dib.2014.12.007
work_keys_str_mv AT barrerorobertoa shoottranscriptomeofthegiantreedarundodonax
AT guerrerofelixd shoottranscriptomeofthegiantreedarundodonax
AT moolhuijzenpaula shoottranscriptomeofthegiantreedarundodonax
AT goolsbyjohna shoottranscriptomeofthegiantreedarundodonax
AT tidwelljason shoottranscriptomeofthegiantreedarundodonax
AT bellgardstanleye shoottranscriptomeofthegiantreedarundodonax
AT bellgardmatthewi shoottranscriptomeofthegiantreedarundodonax