Cargando…

Soil microbiome dataset from Guanica dry forest in Puerto Rico generated by shotgun sequencing

Guanica dry forest (GDF), located in the southwest area or region of Puerto Rico, is among the most preserved subtropical dry forests in the world [1]. To describe the taxonomic diversity and functional profiles of this environment, metagenomic DNA was extracted from a metagenomic library generated...

Descripción completa

Detalles Bibliográficos
Autores principales: Sotomayor-Mena, Roberto G., Rios-Velazquez, Carlos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6926141/
https://www.ncbi.nlm.nih.gov/pubmed/31890785
http://dx.doi.org/10.1016/j.dib.2019.104919
Descripción
Sumario:Guanica dry forest (GDF), located in the southwest area or region of Puerto Rico, is among the most preserved subtropical dry forests in the world [1]. To describe the taxonomic diversity and functional profiles of this environment, metagenomic DNA was extracted from a metagenomic library generated from the GDF. The DNA was shotgun-sequenced using Illumina and analyzed using the MG-RAST server. The diversity profile revealed that the most abundant domain was Bacteria (97.8%) followed by Archaea (1.12%), Eukaryota (1.02%) and Viruses (0.03%). Out of the 50 phyla present, the most abundant was Proteobacteria (41.6%) followed by Actinobacteria (18.7%) and Acidobacteria (7.06%). Moreover, a total of 213 orders, 384 families and 791 genus were identified. The functional profile showed abundance of genes related to Carbohydrates (13.16%), Clustering-based subsystems (13.0%), Amino Acids and Derivatives (9.9%) and Protein Metabolism (8.24%). Furthermore, more specific grouping showed that NULL (21.5%) was the most abundant function group, followed by Plant-Prokaryote DOE project (6.05%), Protein biosynthesis (4.82%), Central carbohydrate metabolism (3.98%), DNA repair (2.72%) and Resistance to antibiotics and toxic compounds (2.66%). This dataset is useful in bioprospecting studies with application in biomedical sciences, biotechnology and microbial, population and applied ecology fields.