TerrestrialMetagenomeDB: a public repository of curated and standardized metadata for terrestrial metagenomes

Microbiome studies focused on the genetic potential of microbial communities (metagenomics) have become standard within microbial ecology. MG-RAST and the Sequence Read Archive (SRA), the two main metagenome repositories, contain over 202 858 publicly available metagenomes and this number has increased exp...

Bibliographic Details
Main authors: Corrêa, Felipe Borim; Saraiva, João Pedro; Stadler, Peter F; da Rocha, Ulisses Nunes
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Subjects: Database Issue
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7145636/
https://www.ncbi.nlm.nih.gov/pubmed/31728526
http://dx.doi.org/10.1093/nar/gkz994
author Corrêa, Felipe Borim
Saraiva, João Pedro
Stadler, Peter F
da Rocha, Ulisses Nunes
collection PubMed
description Microbiome studies focused on the genetic potential of microbial communities (metagenomics) have become standard within microbial ecology. MG-RAST and the Sequence Read Archive (SRA), the two main metagenome repositories, contain over 202 858 publicly available metagenomes and this number has increased exponentially. However, mining databases can be challenging due to misannotated, misleading and decentralized data. The main goal of TerrestrialMetagenomeDB is to make it easier for scientists to find terrestrial metagenomes of interest that could be compared with novel datasets in meta-analyses. We defined terrestrial metagenomes as those that do not belong to marine environments. Further, we curated the database using text mining to assign potential descriptive keywords that better contextualize environmental aspects of terrestrial metagenomes, such as biomes and materials. TerrestrialMetagenomeDB release 1.0 includes 15 022 terrestrial metagenomes from SRA and MG-RAST. Together, the downloadable data amounts to 68 Tbp. In total, 199 terrestrial terms were divided into 14 categories. These metagenomes span 83 countries, 30 biomes and 7 main source materials. The TerrestrialMetagenomeDB is publicly available at https://webapp.ufz.de/tmdb.
format Online
Article
Text
id pubmed-7145636
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-7145636 2020-04-13 TerrestrialMetagenomeDB: a public repository of curated and standardized metadata for terrestrial metagenomes Corrêa, Felipe Borim; Saraiva, João Pedro; Stadler, Peter F; da Rocha, Ulisses Nunes Nucleic Acids Res Database Issue Oxford University Press 2020-01-08 2019-11-15 /pmc/articles/PMC7145636/ /pubmed/31728526 http://dx.doi.org/10.1093/nar/gkz994 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title TerrestrialMetagenomeDB: a public repository of curated and standardized metadata for terrestrial metagenomes
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7145636/
https://www.ncbi.nlm.nih.gov/pubmed/31728526
http://dx.doi.org/10.1093/nar/gkz994