Cargando…

A unified catalog of 204,938 reference genomes from the human gut microbiome

Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes e...

Descripción completa

Detalles Bibliográficos
Autores principales: Almeida, Alexandre, Nayfach, Stephen, Boland, Miguel, Strozzi, Francesco, Beracochea, Martin, Shi, Zhou Jason, Pollard, Katherine S., Sakharova, Ekaterina, Parks, Donovan H., Hugenholtz, Philip, Segata, Nicola, Kyrpides, Nikos C., Finn, Robert D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7801254/
https://www.ncbi.nlm.nih.gov/pubmed/32690973
http://dx.doi.org/10.1038/s41587-020-0603-3
_version_ 1783635536109371392
author Almeida, Alexandre
Nayfach, Stephen
Boland, Miguel
Strozzi, Francesco
Beracochea, Martin
Shi, Zhou Jason
Pollard, Katherine S.
Sakharova, Ekaterina
Parks, Donovan H.
Hugenholtz, Philip
Segata, Nicola
Kyrpides, Nikos C.
Finn, Robert D.
author_facet Almeida, Alexandre
Nayfach, Stephen
Boland, Miguel
Strozzi, Francesco
Beracochea, Martin
Shi, Zhou Jason
Pollard, Katherine S.
Sakharova, Ekaterina
Parks, Donovan H.
Hugenholtz, Philip
Segata, Nicola
Kyrpides, Nikos C.
Finn, Robert D.
author_sort Almeida, Alexandre
collection PubMed
description Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes encode >170 million protein sequences, which we collated in the Unified Human Gastrointestinal Protein (UHGP) catalog. The UHGP more than doubles the number of gut proteins in comparison to those present in the Integrated Gene Catalog. More than 70% of the UHGG species lack cultured representatives, and 40% of the UHGP lack functional annotations. Intraspecies genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which are specific to individual human populations. The UHGG and UHGP collections will enable studies linking genotypes to phenotypes in the human gut microbiome.
format Online
Article
Text
id pubmed-7801254
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-78012542021-01-19 A unified catalog of 204,938 reference genomes from the human gut microbiome Almeida, Alexandre Nayfach, Stephen Boland, Miguel Strozzi, Francesco Beracochea, Martin Shi, Zhou Jason Pollard, Katherine S. Sakharova, Ekaterina Parks, Donovan H. Hugenholtz, Philip Segata, Nicola Kyrpides, Nikos C. Finn, Robert D. Nat Biotechnol Resource Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes encode >170 million protein sequences, which we collated in the Unified Human Gastrointestinal Protein (UHGP) catalog. The UHGP more than doubles the number of gut proteins in comparison to those present in the Integrated Gene Catalog. More than 70% of the UHGG species lack cultured representatives, and 40% of the UHGP lack functional annotations. Intraspecies genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which are specific to individual human populations. The UHGG and UHGP collections will enable studies linking genotypes to phenotypes in the human gut microbiome. Nature Publishing Group US 2020-07-20 2021 /pmc/articles/PMC7801254/ /pubmed/32690973 http://dx.doi.org/10.1038/s41587-020-0603-3 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Resource
Almeida, Alexandre
Nayfach, Stephen
Boland, Miguel
Strozzi, Francesco
Beracochea, Martin
Shi, Zhou Jason
Pollard, Katherine S.
Sakharova, Ekaterina
Parks, Donovan H.
Hugenholtz, Philip
Segata, Nicola
Kyrpides, Nikos C.
Finn, Robert D.
A unified catalog of 204,938 reference genomes from the human gut microbiome
title A unified catalog of 204,938 reference genomes from the human gut microbiome
title_full A unified catalog of 204,938 reference genomes from the human gut microbiome
title_fullStr A unified catalog of 204,938 reference genomes from the human gut microbiome
title_full_unstemmed A unified catalog of 204,938 reference genomes from the human gut microbiome
title_short A unified catalog of 204,938 reference genomes from the human gut microbiome
title_sort unified catalog of 204,938 reference genomes from the human gut microbiome
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7801254/
https://www.ncbi.nlm.nih.gov/pubmed/32690973
http://dx.doi.org/10.1038/s41587-020-0603-3
work_keys_str_mv AT almeidaalexandre aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT nayfachstephen aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT bolandmiguel aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT strozzifrancesco aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT beracocheamartin aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT shizhoujason aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT pollardkatherines aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT sakharovaekaterina aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT parksdonovanh aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT hugenholtzphilip aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT segatanicola aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT kyrpidesnikosc aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT finnrobertd aunifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT almeidaalexandre unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT nayfachstephen unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT bolandmiguel unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT strozzifrancesco unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT beracocheamartin unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT shizhoujason unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT pollardkatherines unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT sakharovaekaterina unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT parksdonovanh unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT hugenholtzphilip unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT segatanicola unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT kyrpidesnikosc unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome
AT finnrobertd unifiedcatalogof204938referencegenomesfromthehumangutmicrobiome