Cargando…

MB-GAN: Microbiome Simulation via Generative Adversarial Network

BACKGROUND: Trillions of microbes inhabit the human body and have a profound effect on human health. The recent development of metagenome-wide association studies and other quantitative analysis methods accelerate the discovery of the associations between human microbiome and diseases. To assess the...

Descripción completa

Detalles Bibliográficos
Autores principales: Rong, Ruichen, Jiang, Shuang, Xu, Lin, Xiao, Guanghua, Xie, Yang, Liu, Dajiang J, Li, Qiwei, Zhan, Xiaowei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931821/
https://www.ncbi.nlm.nih.gov/pubmed/33543271
http://dx.doi.org/10.1093/gigascience/giab005
_version_ 1783660365537607680
author Rong, Ruichen
Jiang, Shuang
Xu, Lin
Xiao, Guanghua
Xie, Yang
Liu, Dajiang J
Li, Qiwei
Zhan, Xiaowei
author_facet Rong, Ruichen
Jiang, Shuang
Xu, Lin
Xiao, Guanghua
Xie, Yang
Liu, Dajiang J
Li, Qiwei
Zhan, Xiaowei
author_sort Rong, Ruichen
collection PubMed
description BACKGROUND: Trillions of microbes inhabit the human body and have a profound effect on human health. The recent development of metagenome-wide association studies and other quantitative analysis methods accelerate the discovery of the associations between human microbiome and diseases. To assess the strengths and limitations of these analytical tools, simulating realistic microbiome datasets is critically important. However, simulating the real microbiome data is challenging because it is difficult to model their correlation structure using explicit statistical models. RESULTS: To address the challenge of simulating realistic microbiome data, we designed a novel simulation framework termed MB-GAN, by using a generative adversarial network (GAN) and utilizing methodology advancements from the deep learning community. MB-GAN can automatically learn from given microbial abundances and compute simulated abundances that are indistinguishable from them. In practice, MB-GAN showed the following advantages. First, MB-GAN avoids explicit statistical modeling assumptions, and it only requires real datasets as inputs. Second, unlike the traditional GANs, MB-GAN is easily applicable and can converge efficiently. CONCLUSIONS: By applying MB-GAN to a case-control gut microbiome study of 396 samples, we demonstrated that the simulated data and the original data had similar first-order and second-order properties, including sparsity, diversities, and taxa-taxa correlations. These advantages are suitable for further microbiome methodology development where high-fidelity microbiome data are needed.
format Online
Article
Text
id pubmed-7931821
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-79318212021-03-09 MB-GAN: Microbiome Simulation via Generative Adversarial Network Rong, Ruichen Jiang, Shuang Xu, Lin Xiao, Guanghua Xie, Yang Liu, Dajiang J Li, Qiwei Zhan, Xiaowei Gigascience Research BACKGROUND: Trillions of microbes inhabit the human body and have a profound effect on human health. The recent development of metagenome-wide association studies and other quantitative analysis methods accelerate the discovery of the associations between human microbiome and diseases. To assess the strengths and limitations of these analytical tools, simulating realistic microbiome datasets is critically important. However, simulating the real microbiome data is challenging because it is difficult to model their correlation structure using explicit statistical models. RESULTS: To address the challenge of simulating realistic microbiome data, we designed a novel simulation framework termed MB-GAN, by using a generative adversarial network (GAN) and utilizing methodology advancements from the deep learning community. MB-GAN can automatically learn from given microbial abundances and compute simulated abundances that are indistinguishable from them. In practice, MB-GAN showed the following advantages. First, MB-GAN avoids explicit statistical modeling assumptions, and it only requires real datasets as inputs. Second, unlike the traditional GANs, MB-GAN is easily applicable and can converge efficiently. CONCLUSIONS: By applying MB-GAN to a case-control gut microbiome study of 396 samples, we demonstrated that the simulated data and the original data had similar first-order and second-order properties, including sparsity, diversities, and taxa-taxa correlations. These advantages are suitable for further microbiome methodology development where high-fidelity microbiome data are needed. Oxford University Press 2021-02-05 /pmc/articles/PMC7931821/ /pubmed/33543271 http://dx.doi.org/10.1093/gigascience/giab005 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Rong, Ruichen
Jiang, Shuang
Xu, Lin
Xiao, Guanghua
Xie, Yang
Liu, Dajiang J
Li, Qiwei
Zhan, Xiaowei
MB-GAN: Microbiome Simulation via Generative Adversarial Network
title MB-GAN: Microbiome Simulation via Generative Adversarial Network
title_full MB-GAN: Microbiome Simulation via Generative Adversarial Network
title_fullStr MB-GAN: Microbiome Simulation via Generative Adversarial Network
title_full_unstemmed MB-GAN: Microbiome Simulation via Generative Adversarial Network
title_short MB-GAN: Microbiome Simulation via Generative Adversarial Network
title_sort mb-gan: microbiome simulation via generative adversarial network
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931821/
https://www.ncbi.nlm.nih.gov/pubmed/33543271
http://dx.doi.org/10.1093/gigascience/giab005
work_keys_str_mv AT rongruichen mbganmicrobiomesimulationviagenerativeadversarialnetwork
AT jiangshuang mbganmicrobiomesimulationviagenerativeadversarialnetwork
AT xulin mbganmicrobiomesimulationviagenerativeadversarialnetwork
AT xiaoguanghua mbganmicrobiomesimulationviagenerativeadversarialnetwork
AT xieyang mbganmicrobiomesimulationviagenerativeadversarialnetwork
AT liudajiangj mbganmicrobiomesimulationviagenerativeadversarialnetwork
AT liqiwei mbganmicrobiomesimulationviagenerativeadversarialnetwork
AT zhanxiaowei mbganmicrobiomesimulationviagenerativeadversarialnetwork