Cargando…

Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians

Despite extensive efforts to address it, the vastness of uncharacterized ‘dark matter’ microbial genetic diversity can impact short-read sequencing based metagenomic studies. Population-specific biases in genomic reference databases can further compound this problem. Leveraging advances in hybrid as...

Descripción completa

Detalles Bibliográficos
Autores principales: Gounot, Jean-Sebastien, Chia, Minghao, Bertrand, Denis, Saw, Woei-Yuh, Ravikrishnan, Aarthi, Low, Adrian, Ding, Yichen, Ng, Amanda Hui Qi, Tan, Linda Wei Lin, Teo, Yik-Ying, Seedorf, Henning, Nagarajan, Niranjan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9561172/
https://www.ncbi.nlm.nih.gov/pubmed/36229545
http://dx.doi.org/10.1038/s41467-022-33782-z
_version_ 1784807891362906112
author Gounot, Jean-Sebastien
Chia, Minghao
Bertrand, Denis
Saw, Woei-Yuh
Ravikrishnan, Aarthi
Low, Adrian
Ding, Yichen
Ng, Amanda Hui Qi
Tan, Linda Wei Lin
Teo, Yik-Ying
Seedorf, Henning
Nagarajan, Niranjan
author_facet Gounot, Jean-Sebastien
Chia, Minghao
Bertrand, Denis
Saw, Woei-Yuh
Ravikrishnan, Aarthi
Low, Adrian
Ding, Yichen
Ng, Amanda Hui Qi
Tan, Linda Wei Lin
Teo, Yik-Ying
Seedorf, Henning
Nagarajan, Niranjan
author_sort Gounot, Jean-Sebastien
collection PubMed
description Despite extensive efforts to address it, the vastness of uncharacterized ‘dark matter’ microbial genetic diversity can impact short-read sequencing based metagenomic studies. Population-specific biases in genomic reference databases can further compound this problem. Leveraging advances in hybrid assembly (using short and long reads) and Hi-C technologies in a cross-sectional survey, we deeply characterized 109 gut microbiomes from three ethnicities in Singapore to comprehensively reconstruct 4497 medium and high-quality metagenome assembled genomes, 1708 of which were missing in short-read only analysis and with >28× N50 improvement. Species-level clustering identified 70 (>10% of total) novel gut species out of 685, improved reference genomes for 363 species (53% of total), and discovered 3413 strains unique to these populations. Among the top 10 most abundant gut bacteria in our study, one of the species and >80% of strains were unrepresented in existing databases. Annotation of biosynthetic gene clusters (BGCs) uncovered more than 27,000 BGCs with a large fraction (36–88%) unrepresented in current databases, and with several unique clusters predicted to produce bacteriocins that could significantly alter microbiome community structure. These results reveal significant uncharacterized gut microbial diversity in Southeast Asian populations and highlight the utility of hybrid metagenomic references for bioprospecting and disease-focused studies.
format Online
Article
Text
id pubmed-9561172
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95611722022-10-15 Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians Gounot, Jean-Sebastien Chia, Minghao Bertrand, Denis Saw, Woei-Yuh Ravikrishnan, Aarthi Low, Adrian Ding, Yichen Ng, Amanda Hui Qi Tan, Linda Wei Lin Teo, Yik-Ying Seedorf, Henning Nagarajan, Niranjan Nat Commun Article Despite extensive efforts to address it, the vastness of uncharacterized ‘dark matter’ microbial genetic diversity can impact short-read sequencing based metagenomic studies. Population-specific biases in genomic reference databases can further compound this problem. Leveraging advances in hybrid assembly (using short and long reads) and Hi-C technologies in a cross-sectional survey, we deeply characterized 109 gut microbiomes from three ethnicities in Singapore to comprehensively reconstruct 4497 medium and high-quality metagenome assembled genomes, 1708 of which were missing in short-read only analysis and with >28× N50 improvement. Species-level clustering identified 70 (>10% of total) novel gut species out of 685, improved reference genomes for 363 species (53% of total), and discovered 3413 strains unique to these populations. Among the top 10 most abundant gut bacteria in our study, one of the species and >80% of strains were unrepresented in existing databases. Annotation of biosynthetic gene clusters (BGCs) uncovered more than 27,000 BGCs with a large fraction (36–88%) unrepresented in current databases, and with several unique clusters predicted to produce bacteriocins that could significantly alter microbiome community structure. These results reveal significant uncharacterized gut microbial diversity in Southeast Asian populations and highlight the utility of hybrid metagenomic references for bioprospecting and disease-focused studies. Nature Publishing Group UK 2022-10-13 /pmc/articles/PMC9561172/ /pubmed/36229545 http://dx.doi.org/10.1038/s41467-022-33782-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Gounot, Jean-Sebastien
Chia, Minghao
Bertrand, Denis
Saw, Woei-Yuh
Ravikrishnan, Aarthi
Low, Adrian
Ding, Yichen
Ng, Amanda Hui Qi
Tan, Linda Wei Lin
Teo, Yik-Ying
Seedorf, Henning
Nagarajan, Niranjan
Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians
title Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians
title_full Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians
title_fullStr Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians
title_full_unstemmed Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians
title_short Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians
title_sort genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in southeast asians
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9561172/
https://www.ncbi.nlm.nih.gov/pubmed/36229545
http://dx.doi.org/10.1038/s41467-022-33782-z
work_keys_str_mv AT gounotjeansebastien genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT chiaminghao genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT bertranddenis genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT sawwoeiyuh genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT ravikrishnanaarthi genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT lowadrian genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT dingyichen genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT ngamandahuiqi genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT tanlindaweilin genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT teoyikying genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT seedorfhenning genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians
AT nagarajanniranjan genomecentricanalysisofshortandlongreadmetagenomesrevealsuncharacterizedmicrobiomediversityinsoutheastasians