Cargando…

HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes

Metagenomics became a standard strategy to comprehend the functional potential of microbial communities, including the human microbiome. Currently, the number of metagenomes in public repositories is increasing exponentially. The Sequence Read Archive (SRA) and the MG-RAST are the two main repositor...

Descripción completa

Detalles Bibliográficos
Autores principales: Kasmanas, Jonas Coelho, Bartholomäus, Alexander, Corrêa, Felipe Borim, Tal, Tamara, Jehmlich, Nico, Herberth, Gunda, von Bergen, Martin, Stadler, Peter F, Carvalho, André Carlos Ponce de Leon Ferreira de, Nunes da Rocha, Ulisses
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778935/
https://www.ncbi.nlm.nih.gov/pubmed/33221926
http://dx.doi.org/10.1093/nar/gkaa1031
_version_ 1783631227811528704
author Kasmanas, Jonas Coelho
Bartholomäus, Alexander
Corrêa, Felipe Borim
Tal, Tamara
Jehmlich, Nico
Herberth, Gunda
von Bergen, Martin
Stadler, Peter F
Carvalho, André Carlos Ponce de Leon Ferreira de
Nunes da Rocha, Ulisses
author_facet Kasmanas, Jonas Coelho
Bartholomäus, Alexander
Corrêa, Felipe Borim
Tal, Tamara
Jehmlich, Nico
Herberth, Gunda
von Bergen, Martin
Stadler, Peter F
Carvalho, André Carlos Ponce de Leon Ferreira de
Nunes da Rocha, Ulisses
author_sort Kasmanas, Jonas Coelho
collection PubMed
description Metagenomics became a standard strategy to comprehend the functional potential of microbial communities, including the human microbiome. Currently, the number of metagenomes in public repositories is increasing exponentially. The Sequence Read Archive (SRA) and the MG-RAST are the two main repositories for metagenomic data. These databases allow scientists to reanalyze samples and explore new hypotheses. However, mining samples from them can be a limiting factor, since the metadata available in these repositories is often misannotated, misleading, and decentralized, creating an overly complex environment for sample reanalysis. The main goal of the HumanMetagenomeDB is to simplify the identification and use of public human metagenomes of interest. HumanMetagenomeDB version 1.0 contains metadata of 69 822 metagenomes. We standardized 203 attributes, based on standardized ontologies, describing host characteristics (e.g. sex, age and body mass index), diagnosis information (e.g. cancer, Crohn's disease and Parkinson), location (e.g. country, longitude and latitude), sampling site (e.g. gut, lung and skin) and sequencing attributes (e.g. sequencing platform, average length and sequence quality). Further, HumanMetagenomeDB version 1.0 metagenomes encompass 58 countries, 9 main sample sites (i.e. body parts), 58 diagnoses and multiple ages, ranging from just born to 91 years old. The HumanMetagenomeDB is publicly available at https://webapp.ufz.de/hmgdb/.
format Online
Article
Text
id pubmed-7778935
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77789352021-01-06 HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes Kasmanas, Jonas Coelho Bartholomäus, Alexander Corrêa, Felipe Borim Tal, Tamara Jehmlich, Nico Herberth, Gunda von Bergen, Martin Stadler, Peter F Carvalho, André Carlos Ponce de Leon Ferreira de Nunes da Rocha, Ulisses Nucleic Acids Res Database Issue Metagenomics became a standard strategy to comprehend the functional potential of microbial communities, including the human microbiome. Currently, the number of metagenomes in public repositories is increasing exponentially. The Sequence Read Archive (SRA) and the MG-RAST are the two main repositories for metagenomic data. These databases allow scientists to reanalyze samples and explore new hypotheses. However, mining samples from them can be a limiting factor, since the metadata available in these repositories is often misannotated, misleading, and decentralized, creating an overly complex environment for sample reanalysis. The main goal of the HumanMetagenomeDB is to simplify the identification and use of public human metagenomes of interest. HumanMetagenomeDB version 1.0 contains metadata of 69 822 metagenomes. We standardized 203 attributes, based on standardized ontologies, describing host characteristics (e.g. sex, age and body mass index), diagnosis information (e.g. cancer, Crohn's disease and Parkinson), location (e.g. country, longitude and latitude), sampling site (e.g. gut, lung and skin) and sequencing attributes (e.g. sequencing platform, average length and sequence quality). Further, HumanMetagenomeDB version 1.0 metagenomes encompass 58 countries, 9 main sample sites (i.e. body parts), 58 diagnoses and multiple ages, ranging from just born to 91 years old. The HumanMetagenomeDB is publicly available at https://webapp.ufz.de/hmgdb/. Oxford University Press 2020-11-22 /pmc/articles/PMC7778935/ /pubmed/33221926 http://dx.doi.org/10.1093/nar/gkaa1031 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Kasmanas, Jonas Coelho
Bartholomäus, Alexander
Corrêa, Felipe Borim
Tal, Tamara
Jehmlich, Nico
Herberth, Gunda
von Bergen, Martin
Stadler, Peter F
Carvalho, André Carlos Ponce de Leon Ferreira de
Nunes da Rocha, Ulisses
HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes
title HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes
title_full HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes
title_fullStr HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes
title_full_unstemmed HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes
title_short HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes
title_sort humanmetagenomedb: a public repository of curated and standardized metadata for human metagenomes
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778935/
https://www.ncbi.nlm.nih.gov/pubmed/33221926
http://dx.doi.org/10.1093/nar/gkaa1031
work_keys_str_mv AT kasmanasjonascoelho humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT bartholomausalexander humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT correafelipeborim humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT taltamara humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT jehmlichnico humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT herberthgunda humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT vonbergenmartin humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT stadlerpeterf humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT carvalhoandrecarlosponcedeleonferreirade humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes
AT nunesdarochaulisses humanmetagenomedbapublicrepositoryofcuratedandstandardizedmetadataforhumanmetagenomes