Cargando…

MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes

Environmental and host‐associated microbial communities are complex ecosystems, of which many members are still unknown. Hence, it is challenging to study community dynamics and important to create model systems of reduced complexity that mimic major community functions. Therefore, we developed MiMi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kumar, Neeraj, Hitch, Thomas C. A., Haller, Dirk, Lagkouvardos, Ilias, Clavel, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8313253/
https://www.ncbi.nlm.nih.gov/pubmed/34081399
http://dx.doi.org/10.1111/1751-7915.13845
_version_ 1783729305827672064
author Kumar, Neeraj
Hitch, Thomas C. A.
Haller, Dirk
Lagkouvardos, Ilias
Clavel, Thomas
author_facet Kumar, Neeraj
Hitch, Thomas C. A.
Haller, Dirk
Lagkouvardos, Ilias
Clavel, Thomas
author_sort Kumar, Neeraj
collection PubMed
description Environmental and host‐associated microbial communities are complex ecosystems, of which many members are still unknown. Hence, it is challenging to study community dynamics and important to create model systems of reduced complexity that mimic major community functions. Therefore, we developed MiMiC, a computational approach for data‐driven design of simplified communities from shotgun metagenomes. We first built a comprehensive database of species‐level bacterial and archaeal genomes (n = 22 627) consisting of binary (presence/absence) vectors of protein families (Pfam = 17 929). MiMiC predicts the composition of minimal consortia using an iterative scoring system based on maximal match‐to‐mismatch ratios between this database and the Pfam binary vector of any input metagenome. Pfam vectorization retained enough resolution to distinguish metagenomic profiles between six environmental and host‐derived microbial communities (n = 937). The calculated number of species per minimal community ranged between 5 and 11, with MiMiC selected communities better recapitulating the functional repertoire of the original samples than randomly selected species. The inferred minimal communities retained habitat‐specific features and were substantially different from communities consisting of most abundant members. The use of a mixture of known microbes revealed the ability to select 23 of 25 target species from the entire genome database. MiMiC is open source and available at https://github.com/ClavelLab/MiMiC.
format Online
Article
Text
id pubmed-8313253
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-83132532021-07-30 MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes Kumar, Neeraj Hitch, Thomas C. A. Haller, Dirk Lagkouvardos, Ilias Clavel, Thomas Microb Biotechnol Research Articles Environmental and host‐associated microbial communities are complex ecosystems, of which many members are still unknown. Hence, it is challenging to study community dynamics and important to create model systems of reduced complexity that mimic major community functions. Therefore, we developed MiMiC, a computational approach for data‐driven design of simplified communities from shotgun metagenomes. We first built a comprehensive database of species‐level bacterial and archaeal genomes (n = 22 627) consisting of binary (presence/absence) vectors of protein families (Pfam = 17 929). MiMiC predicts the composition of minimal consortia using an iterative scoring system based on maximal match‐to‐mismatch ratios between this database and the Pfam binary vector of any input metagenome. Pfam vectorization retained enough resolution to distinguish metagenomic profiles between six environmental and host‐derived microbial communities (n = 937). The calculated number of species per minimal community ranged between 5 and 11, with MiMiC selected communities better recapitulating the functional repertoire of the original samples than randomly selected species. The inferred minimal communities retained habitat‐specific features and were substantially different from communities consisting of most abundant members. The use of a mixture of known microbes revealed the ability to select 23 of 25 target species from the entire genome database. MiMiC is open source and available at https://github.com/ClavelLab/MiMiC. John Wiley and Sons Inc. 2021-06-03 /pmc/articles/PMC8313253/ /pubmed/34081399 http://dx.doi.org/10.1111/1751-7915.13845 Text en © 2021 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
Kumar, Neeraj
Hitch, Thomas C. A.
Haller, Dirk
Lagkouvardos, Ilias
Clavel, Thomas
MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes
title MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes
title_full MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes
title_fullStr MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes
title_full_unstemmed MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes
title_short MiMiC: a bioinformatic approach for generation of synthetic communities from metagenomes
title_sort mimic: a bioinformatic approach for generation of synthetic communities from metagenomes
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8313253/
https://www.ncbi.nlm.nih.gov/pubmed/34081399
http://dx.doi.org/10.1111/1751-7915.13845
work_keys_str_mv AT kumarneeraj mimicabioinformaticapproachforgenerationofsyntheticcommunitiesfrommetagenomes
AT hitchthomasca mimicabioinformaticapproachforgenerationofsyntheticcommunitiesfrommetagenomes
AT hallerdirk mimicabioinformaticapproachforgenerationofsyntheticcommunitiesfrommetagenomes
AT lagkouvardosilias mimicabioinformaticapproachforgenerationofsyntheticcommunitiesfrommetagenomes
AT clavelthomas mimicabioinformaticapproachforgenerationofsyntheticcommunitiesfrommetagenomes