Cargando…
Compositional zero-inflated network estimation for microbiome data
BACKGROUND: The estimation of microbial networks can provide important insight into the ecological relationships among the organisms that comprise the microbiome. However, there are a number of critical statistical challenges in the inference of such networks from high-throughput data. Since the abu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7768662/ https://www.ncbi.nlm.nih.gov/pubmed/33371887 http://dx.doi.org/10.1186/s12859-020-03911-w |
_version_ | 1783629203266076672 |
---|---|
author | Ha, Min Jin Kim, Junghi Galloway-Peña, Jessica Do, Kim-Anh Peterson, Christine B. |
author_facet | Ha, Min Jin Kim, Junghi Galloway-Peña, Jessica Do, Kim-Anh Peterson, Christine B. |
author_sort | Ha, Min Jin |
collection | PubMed |
description | BACKGROUND: The estimation of microbial networks can provide important insight into the ecological relationships among the organisms that comprise the microbiome. However, there are a number of critical statistical challenges in the inference of such networks from high-throughput data. Since the abundances in each sample are constrained to have a fixed sum and there is incomplete overlap in microbial populations across subjects, the data are both compositional and zero-inflated. RESULTS: We propose the COmpositional Zero-Inflated Network Estimation (COZINE) method for inference of microbial networks which addresses these critical aspects of the data while maintaining computational scalability. COZINE relies on the multivariate Hurdle model to infer a sparse set of conditional dependencies which reflect not only relationships among the continuous values, but also among binary indicators of presence or absence and between the binary and continuous representations of the data. Our simulation results show that the proposed method is better able to capture various types of microbial relationships than existing approaches. We demonstrate the utility of the method with an application to understanding the oral microbiome network in a cohort of leukemic patients. CONCLUSIONS: Our proposed method addresses important challenges in microbiome network estimation, and can be effectively applied to discover various types of dependence relationships in microbial communities. The procedure we have developed, which we refer to as COZINE, is available online at https://github.com/MinJinHa/COZINE. |
format | Online Article Text |
id | pubmed-7768662 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-77686622020-12-29 Compositional zero-inflated network estimation for microbiome data Ha, Min Jin Kim, Junghi Galloway-Peña, Jessica Do, Kim-Anh Peterson, Christine B. BMC Bioinformatics Methodology BACKGROUND: The estimation of microbial networks can provide important insight into the ecological relationships among the organisms that comprise the microbiome. However, there are a number of critical statistical challenges in the inference of such networks from high-throughput data. Since the abundances in each sample are constrained to have a fixed sum and there is incomplete overlap in microbial populations across subjects, the data are both compositional and zero-inflated. RESULTS: We propose the COmpositional Zero-Inflated Network Estimation (COZINE) method for inference of microbial networks which addresses these critical aspects of the data while maintaining computational scalability. COZINE relies on the multivariate Hurdle model to infer a sparse set of conditional dependencies which reflect not only relationships among the continuous values, but also among binary indicators of presence or absence and between the binary and continuous representations of the data. Our simulation results show that the proposed method is better able to capture various types of microbial relationships than existing approaches. We demonstrate the utility of the method with an application to understanding the oral microbiome network in a cohort of leukemic patients. CONCLUSIONS: Our proposed method addresses important challenges in microbiome network estimation, and can be effectively applied to discover various types of dependence relationships in microbial communities. The procedure we have developed, which we refer to as COZINE, is available online at https://github.com/MinJinHa/COZINE. BioMed Central 2020-12-28 /pmc/articles/PMC7768662/ /pubmed/33371887 http://dx.doi.org/10.1186/s12859-020-03911-w Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Ha, Min Jin Kim, Junghi Galloway-Peña, Jessica Do, Kim-Anh Peterson, Christine B. Compositional zero-inflated network estimation for microbiome data |
title | Compositional zero-inflated network estimation for microbiome data |
title_full | Compositional zero-inflated network estimation for microbiome data |
title_fullStr | Compositional zero-inflated network estimation for microbiome data |
title_full_unstemmed | Compositional zero-inflated network estimation for microbiome data |
title_short | Compositional zero-inflated network estimation for microbiome data |
title_sort | compositional zero-inflated network estimation for microbiome data |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7768662/ https://www.ncbi.nlm.nih.gov/pubmed/33371887 http://dx.doi.org/10.1186/s12859-020-03911-w |
work_keys_str_mv | AT haminjin compositionalzeroinflatednetworkestimationformicrobiomedata AT kimjunghi compositionalzeroinflatednetworkestimationformicrobiomedata AT gallowaypenajessica compositionalzeroinflatednetworkestimationformicrobiomedata AT dokimanh compositionalzeroinflatednetworkestimationformicrobiomedata AT petersonchristineb compositionalzeroinflatednetworkestimationformicrobiomedata |