Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir

The Guajataca Water Reservoir (GWR) was constructed for irrigation and to bring potable water to the northwestern region of Puerto Rico. The generation of DNA sequencing data from aquatic bodies (AB) using culture-independent approaches allows the investigation of the total microbial diversity as we...

Bibliographic Details
Main Authors: Soriano, Berliza M., Del Valle-Perez, Laura M., Morales-Vale, Luis, Rios-Velazquez, Carlos
Format: Online Article Text
Language: English
Published: Elsevier 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6288454/
https://www.ncbi.nlm.nih.gov/pubmed/30560161
http://dx.doi.org/10.1016/j.dib.2018.11.114
author Soriano, Berliza M.
Del Valle-Perez, Laura M.
Morales-Vale, Luis
Rios-Velazquez, Carlos
collection PubMed
description The Guajataca Water Reservoir (GWR) was constructed for irrigation and to bring potable water to the northwestern region of Puerto Rico. Generating DNA sequencing data from aquatic bodies (AB) using culture-independent approaches allows the investigation of the total microbial diversity as well as the potential anthropogenic impact. Metagenomic libraries were constructed for two GWR sampling sites, and the genomic information was accessed through shotgun sequencing. After removing the bacterial host cell genome and the library fosmid sequences, the environmental genome was processed through Rapid Annotation using Subsystems Technology for Metagenomes (MG-RAST). The sequences consisted primarily of bacteria (95.70%), followed by viruses (2.94%), other sequences (0.28%) and eukaryotes (0.09%). The most abundant species were Enterobacter cloacae (31%), Enterobacter sp. 638 (20%), Enterobacter cancerogenus (10%) and Escherichia coli (11%). Furthermore, the subsystem data showed that 13% of the genes belong to the carbohydrates subsystem, 12% to clustering-based subsystems, and another 9% to virulence, disease and defense (of which 8% pertain to genes for resistance to antibiotics and toxic compounds). These unique data will serve as a baseline for better understanding not only the microbial communities present in the AB, but also the microbial activities with potential applications in biotechnological and biomedical fields.
format Online
Article
Text
id pubmed-6288454
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-6288454 2018-12-17 Data Brief Agricultural and Biological Science Elsevier 2018-11-27 /pmc/articles/PMC6288454/ /pubmed/30560161 http://dx.doi.org/10.1016/j.dib.2018.11.114 Text en © 2018 The Authors. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir
topic Agricultural and Biological Science