Cargando…

kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors

Identification of significant biological relationships or patterns is central to many metagenomic studies. Methods that estimate association networks have been proposed for this purpose; however, they assume that associations are static, neglecting the fact that relationships in a microbial ecosyste...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yuqing, Wang, Xin, Xie, Kaikun, Zhu, Congmin, Chen, Ning, Chen, Ting
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9170748/
https://www.ncbi.nlm.nih.gov/pubmed/33607296
http://dx.doi.org/10.1016/j.gpb.2020.06.015
_version_ 1784721504247742464
author Yang, Yuqing
Wang, Xin
Xie, Kaikun
Zhu, Congmin
Chen, Ning
Chen, Ting
author_facet Yang, Yuqing
Wang, Xin
Xie, Kaikun
Zhu, Congmin
Chen, Ning
Chen, Ting
author_sort Yang, Yuqing
collection PubMed
description Identification of significant biological relationships or patterns is central to many metagenomic studies. Methods that estimate association networks have been proposed for this purpose; however, they assume that associations are static, neglecting the fact that relationships in a microbial ecosystem may vary with changes in environmental factors (EFs), which can result in inaccurate estimations. Therefore, in this study, we propose a computational model, called the k-Lognormal-Dirichlet-Multinomial (kLDM) model, which estimates multiple association networks that correspond to specific environmental conditions, and simultaneously infers microbe–microbe and EF–microbe associations for each network. The effectiveness of the kLDM model was demonstrated on synthetic data, a colorectal cancer (CRC) dataset, the Tara Oceans dataset, and the American Gut Project dataset. The results revealed that the widely-used Spearman’s rank correlation coefficient method performed much worse than the other methods, indicating the importance of separating samples by environmental conditions. Cancer fecal samples were then compared with cancer-free samples, and the estimation achieved by kLDM exhibited fewer associations among microbes but stronger associations between specific bacteria, especially five CRC-associated operational taxonomic units, indicating gut microbe translocation in cancer patients. Some EF-dependent associations were then found within a marine eukaryotic community. Finally, the gut microbial heterogeneity of inflammatory bowel disease patients was detected. These results demonstrate that kLDM can elucidate the complex associations within microbial ecosystems. The kLDM program, R, and Python scripts, together with all experimental datasets, are accessible at https://github.com/tinglab/kLDM.git.
format Online
Article
Text
id pubmed-9170748
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-91707482022-06-08 kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors Yang, Yuqing Wang, Xin Xie, Kaikun Zhu, Congmin Chen, Ning Chen, Ting Genomics Proteomics Bioinformatics Method Identification of significant biological relationships or patterns is central to many metagenomic studies. Methods that estimate association networks have been proposed for this purpose; however, they assume that associations are static, neglecting the fact that relationships in a microbial ecosystem may vary with changes in environmental factors (EFs), which can result in inaccurate estimations. Therefore, in this study, we propose a computational model, called the k-Lognormal-Dirichlet-Multinomial (kLDM) model, which estimates multiple association networks that correspond to specific environmental conditions, and simultaneously infers microbe–microbe and EF–microbe associations for each network. The effectiveness of the kLDM model was demonstrated on synthetic data, a colorectal cancer (CRC) dataset, the Tara Oceans dataset, and the American Gut Project dataset. The results revealed that the widely-used Spearman’s rank correlation coefficient method performed much worse than the other methods, indicating the importance of separating samples by environmental conditions. Cancer fecal samples were then compared with cancer-free samples, and the estimation achieved by kLDM exhibited fewer associations among microbes but stronger associations between specific bacteria, especially five CRC-associated operational taxonomic units, indicating gut microbe translocation in cancer patients. Some EF-dependent associations were then found within a marine eukaryotic community. Finally, the gut microbial heterogeneity of inflammatory bowel disease patients was detected. These results demonstrate that kLDM can elucidate the complex associations within microbial ecosystems. The kLDM program, R, and Python scripts, together with all experimental datasets, are accessible at https://github.com/tinglab/kLDM.git. Elsevier 2021-10 2021-02-17 /pmc/articles/PMC9170748/ /pubmed/33607296 http://dx.doi.org/10.1016/j.gpb.2020.06.015 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Method
Yang, Yuqing
Wang, Xin
Xie, Kaikun
Zhu, Congmin
Chen, Ning
Chen, Ting
kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors
title kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors
title_full kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors
title_fullStr kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors
title_full_unstemmed kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors
title_short kLDM: Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors
title_sort kldm: inferring multiple metagenomic association networks based on the variation of environmental factors
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9170748/
https://www.ncbi.nlm.nih.gov/pubmed/33607296
http://dx.doi.org/10.1016/j.gpb.2020.06.015
work_keys_str_mv AT yangyuqing kldminferringmultiplemetagenomicassociationnetworksbasedonthevariationofenvironmentalfactors
AT wangxin kldminferringmultiplemetagenomicassociationnetworksbasedonthevariationofenvironmentalfactors
AT xiekaikun kldminferringmultiplemetagenomicassociationnetworksbasedonthevariationofenvironmentalfactors
AT zhucongmin kldminferringmultiplemetagenomicassociationnetworksbasedonthevariationofenvironmentalfactors
AT chenning kldminferringmultiplemetagenomicassociationnetworksbasedonthevariationofenvironmentalfactors
AT chenting kldminferringmultiplemetagenomicassociationnetworksbasedonthevariationofenvironmentalfactors