Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir
The Guajataca Water Reservoir (GWR) was constructed for irrigation and to bring potable water to the northwestern region of Puerto Rico. The generation of DNA sequencing data from aquatic bodies (AB) using culture-independent approaches allows the investigation of the total microbial diversity as we...
Main authors: | Soriano, Berliza M.; Del Valle-Perez, Laura M.; Morales-Vale, Luis; Rios-Velazquez, Carlos |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2018 |
Subjects: | Agricultural and Biological Science |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6288454/ https://www.ncbi.nlm.nih.gov/pubmed/30560161 http://dx.doi.org/10.1016/j.dib.2018.11.114 |
_version_ | 1783379799046094848 |
---|---|
author | Soriano, Berliza M. Del Valle-Perez, Laura M. Morales-Vale, Luis Rios-Velazquez, Carlos |
author_facet | Soriano, Berliza M. Del Valle-Perez, Laura M. Morales-Vale, Luis Rios-Velazquez, Carlos |
author_sort | Soriano, Berliza M. |
collection | PubMed |
description | The Guajataca Water Reservoir (GWR) was constructed for irrigation and to bring potable water to the northwestern region of Puerto Rico. The generation of DNA sequencing data from aquatic bodies (AB) using culture-independent approaches allows the investigation of the total microbial diversity as well as the potential anthropogenic impact. Metagenomic libraries were constructed for two GWR sampling sites, and the genomic information was accessed through shotgun sequencing. After removing the bacterial host cell genome and the library fosmid sequences, the environmental genome was processed through Rapid Annotation using Subsystems Technology for Metagenomes (MG-RAST). The sequences consisted primarily of bacteria (95.70%), followed by viruses (2.94%), other sequences (0.28%) and eukaryotes (0.09%). The most abundant species were Enterobacter cloacae (31%), Enterobacter sp. 638 (20%), Enterobacter cancerogenus (10%) and Escherichia coli (11%). Furthermore, the subsystem data showed that 13% of the genes belong to carbohydrates functionality, 12% to clustering-based-subsystems and another 9% are related to virulence-disease-and-defense (out of which 8% pertain to genes of antibiotic resistance and toxic compounds). This unique data input will serve as a baseline for better understanding not only the microbial communities present in the AB, but also the microbial activities with potential application in biotechnological and biomedical fields. |
format | Online Article Text |
id | pubmed-6288454 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-62884542018-12-17 Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir Soriano, Berliza M. Del Valle-Perez, Laura M. Morales-Vale, Luis Rios-Velazquez, Carlos Data Brief Agricultural and Biological Science The Guajataca Water Reservoir (GWR) was constructed for irrigation and to bring potable water to the northwestern region of Puerto Rico. The generation of DNA sequencing data from aquatic bodies (AB) using culture-independent approaches allows the investigation of the total microbial diversity as well as the potential anthropogenic impact. Metagenomic libraries were constructed for two GWR sampling sites, and the genomic information was accessed through shotgun sequencing. After removing the bacterial host cell genome and the library fosmid sequences, the environmental genome was processed through Rapid Annotation using Subsystems Technology for Metagenomes (MG-RAST). The sequences consisted primarily of bacteria (95.70%), followed by viruses (2.94%), other sequences (0.28%) and eukaryotes (0.09%). The most abundant species were Enterobacter cloacae (31%), Enterobacter sp. 638 (20%), Enterobacter cancerogenus (10%) and Escherichia coli (11%). Furthermore, the subsystem data showed that 13% of the genes belong to carbohydrates functionality, 12% to clustering-based-subsystems and another 9% are related to virulence-disease-and-defense (out of which 8% pertain to genes of antibiotic resistance and toxic compounds). This unique data input will serve as a baseline for better understanding not only the microbial communities present in the AB, but also the microbial activities with potential application in biotechnological and biomedical fields. Elsevier 2018-11-27 /pmc/articles/PMC6288454/ /pubmed/30560161 http://dx.doi.org/10.1016/j.dib.2018.11.114 Text en © 2018 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Agricultural and Biological Science Soriano, Berliza M. Del Valle-Perez, Laura M. Morales-Vale, Luis Rios-Velazquez, Carlos Datasets generated by shotgun sequencing of metagenomic libraries of the guajataca water reservoir |
title | Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir |
title_full | Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir |
title_fullStr | Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir |
title_full_unstemmed | Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir |
title_short | Datasets generated by shotgun sequencing of metagenomic libraries of the Guajataca water reservoir |
title_sort | datasets generated by shotgun sequencing of metagenomic libraries of the guajataca water reservoir |
topic | Agricultural and Biological Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6288454/ https://www.ncbi.nlm.nih.gov/pubmed/30560161 http://dx.doi.org/10.1016/j.dib.2018.11.114 |
work_keys_str_mv | AT sorianoberlizam datasetsgeneratedbyshotgunsequencingofmetagenomiclibrariesoftheguajatacawaterreservoir AT delvalleperezlauram datasetsgeneratedbyshotgunsequencingofmetagenomiclibrariesoftheguajatacawaterreservoir AT moralesvaleluis datasetsgeneratedbyshotgunsequencingofmetagenomiclibrariesoftheguajatacawaterreservoir AT riosvelazquezcarlos datasetsgeneratedbyshotgunsequencingofmetagenomiclibrariesoftheguajatacawaterreservoir |