GMrepo v2: a curated human gut microbiome database with special focus on disease markers and cross-dataset comparison

GMrepo (data repository for Gut Microbiota) is a database of curated and consistently annotated human gut metagenomes. Its main purposes are to increase the reusability and accessibility of human gut metagenomic data, and enable cross-project and phenotype comparisons. To achieve these goals, we per...

Bibliographic Details
Main authors: Dai, Die; Zhu, Jiaying; Sun, Chuqing; Li, Min; Liu, Jinxin; Wu, Sicheng; Ning, Kang; He, Li-jie; Zhao, Xing-Ming; Chen, Wei-Hua
Format: Online Article Text
Language: English
Published: Oxford University Press 2021
Subjects: Database Issue
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728112/
https://www.ncbi.nlm.nih.gov/pubmed/34788838
http://dx.doi.org/10.1093/nar/gkab1019
author Dai, Die
Zhu, Jiaying
Sun, Chuqing
Li, Min
Liu, Jinxin
Wu, Sicheng
Ning, Kang
He, Li-jie
Zhao, Xing-Ming
Chen, Wei-Hua
collection PubMed
description GMrepo (data repository for Gut Microbiota) is a database of curated and consistently annotated human gut metagenomes. Its main purposes are to increase the reusability and accessibility of human gut metagenomic data, and enable cross-project and phenotype comparisons. To achieve these goals, we performed manual curation on the meta-data and organized the datasets in a phenotype-centric manner. GMrepo v2 contains 353 projects and 71,642 runs/samples, which are significantly increased from the previous version. Among these runs/samples, 45,111 and 26,531 were obtained by 16S rRNA amplicon and whole-genome metagenomics sequencing, respectively. We also increased the number of phenotypes from 92 to 133. In addition, we introduced disease-marker identification and cross-project/phenotype comparison. We first identified disease markers between two phenotypes (e.g. health versus diseases) on a per-project basis for selected projects. We then compared the identified markers for each phenotype pair across datasets to facilitate the identification of consistent microbial markers across datasets. Finally, we provided a marker-centric view to allow users to check if a marker has different trends in different diseases. So far, GMrepo includes 592 marker taxa (350 species and 242 genera) for 47 phenotype pairs, identified from 83 selected projects. GMrepo v2 is freely available at: https://gmrepo.humangut.info.
format Online
Article
Text
id pubmed-8728112
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87281122022-01-05 GMrepo v2: a curated human gut microbiome database with special focus on disease markers and cross-dataset comparison Dai, Die Zhu, Jiaying Sun, Chuqing Li, Min Liu, Jinxin Wu, Sicheng Ning, Kang He, Li-jie Zhao, Xing-Ming Chen, Wei-Hua Nucleic Acids Res Database Issue GMrepo (data repository for Gut Microbiota) is a database of curated and consistently annotated human gut metagenomes. Its main purposes are to increase the reusability and accessibility of human gut metagenomic data, and enable cross-project and phenotype comparisons. To achieve these goals, we performed manual curation on the meta-data and organized the datasets in a phenotype-centric manner. GMrepo v2 contains 353 projects and 71,642 runs/samples, which are significantly increased from the previous version. Among these runs/samples, 45,111 and 26,531 were obtained by 16S rRNA amplicon and whole-genome metagenomics sequencing, respectively. We also increased the number of phenotypes from 92 to 133. In addition, we introduced disease-marker identification and cross-project/phenotype comparison. We first identified disease markers between two phenotypes (e.g. health versus diseases) on a per-project basis for selected projects. We then compared the identified markers for each phenotype pair across datasets to facilitate the identification of consistent microbial markers across datasets. Finally, we provided a marker-centric view to allow users to check if a marker has different trends in different diseases. So far, GMrepo includes 592 marker taxa (350 species and 242 genera) for 47 phenotype pairs, identified from 83 selected projects. GMrepo v2 is freely available at: https://gmrepo.humangut.info. Oxford University Press 2021-11-12 /pmc/articles/PMC8728112/ /pubmed/34788838 http://dx.doi.org/10.1093/nar/gkab1019 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. 
https://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title GMrepo v2: a curated human gut microbiome database with special focus on disease markers and cross-dataset comparison
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728112/
https://www.ncbi.nlm.nih.gov/pubmed/34788838
http://dx.doi.org/10.1093/nar/gkab1019