Cargando…

Shotgun metagenomic data of root endophytic microbiome of maize (Zea mays L.)

This dataset represents the root endophytic microbial community profile of maize (Zea mays L.), one of the largest food crops in South Africa, using a shotgun metagenomic approach. To the best of our understanding, this is the first account showcasing the endophytic microbial diversity in maize plan...

Descripción completa

Detalles Bibliográficos
Autores principales: Babalola, Olubukola Oluranti, Fadiji, Ayomide Emmanuel, Ayangbenro, Ayansina Segun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7327808/
https://www.ncbi.nlm.nih.gov/pubmed/32637499
http://dx.doi.org/10.1016/j.dib.2020.105893
Descripción
Sumario:This dataset represents the root endophytic microbial community profile of maize (Zea mays L.), one of the largest food crops in South Africa, using a shotgun metagenomic approach. To the best of our understanding, this is the first account showcasing the endophytic microbial diversity in maize plants via the shotgun metagenomics approach. High throughput sequencing of the whole DNA from the community was carried out using NovaSeq 6000 system (Illumina). The data obtained consists of 10,915,268 sequences accounting for 261,906,948 bps with an average length of 153 base pairs and 43% Guanine+Cytosine content. The metagenome data can be accessed at the National Centre for Biotechnology Information SRA registered with the accession number PRJNA607664. Community analysis was done using an online server called MG-RAST, which showed that 0.12% of the sequences were archaeal associated, eukaryotes were 15.06%, while 84.77% were classified as bacteria. A sum of 28 bacterial, 22 eukaryotic and 4 archaeal phyla were identified. The predominant genera were Bacillus (16%), Chitinophaga (12%), Flavobacterium (4%), Chryseobacterium (4%), Paenibacillus (4%), Pedobacter (3%) and Alphaproteobacteria (3%). Annotation using Cluster of Orthologous Group (COG) revealed that 41.47% of the sequenced data were for metabolic function, 24.10% for chemical process and signaling, while 17.43% of the sequences were in the poorly characterized group. Annotation using the subsystem method showed that 18% of the sequences were associated with carbohydrates, 9% were for clustering-based subsystems, and 9% contain genes coding for amino acids and derivatives, which might be beneficial in plant growth and health improvement.