Cargando…

Sparse and Compositionally Robust Inference of Microbial Ecological Networks

16S ribosomal RNA (rRNA) gene and other environmental sequencing techniques provide snapshots of microbial communities, revealing phylogeny and the abundances of microbial populations across diverse ecosystems. While changes in microbial community structure are demonstrably associated with certain e...

Descripción completa

Detalles Bibliográficos
Autores principales: Kurtz, Zachary D., Müller, Christian L., Miraldi, Emily R., Littman, Dan R., Blaser, Martin J., Bonneau, Richard A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4423992/
https://www.ncbi.nlm.nih.gov/pubmed/25950956
http://dx.doi.org/10.1371/journal.pcbi.1004226
_version_ 1782370292378632192
author Kurtz, Zachary D.
Müller, Christian L.
Miraldi, Emily R.
Littman, Dan R.
Blaser, Martin J.
Bonneau, Richard A.
author_facet Kurtz, Zachary D.
Müller, Christian L.
Miraldi, Emily R.
Littman, Dan R.
Blaser, Martin J.
Bonneau, Richard A.
author_sort Kurtz, Zachary D.
collection PubMed
description 16S ribosomal RNA (rRNA) gene and other environmental sequencing techniques provide snapshots of microbial communities, revealing phylogeny and the abundances of microbial populations across diverse ecosystems. While changes in microbial community structure are demonstrably associated with certain environmental conditions (from metabolic and immunological health in mammals to ecological stability in soils and oceans), identification of underlying mechanisms requires new statistical tools, as these datasets present several technical challenges. First, the abundances of microbial operational taxonomic units (OTUs) from amplicon-based datasets are compositional. Counts are normalized to the total number of counts in the sample. Thus, microbial abundances are not independent, and traditional statistical metrics (e.g., correlation) for the detection of OTU-OTU relationships can lead to spurious results. Secondly, microbial sequencing-based studies typically measure hundreds of OTUs on only tens to hundreds of samples; thus, inference of OTU-OTU association networks is severely under-powered, and additional information (or assumptions) are required for accurate inference. Here, we present SPIEC-EASI (SParse InversE Covariance Estimation for Ecological Association Inference), a statistical method for the inference of microbial ecological networks from amplicon sequencing datasets that addresses both of these issues. SPIEC-EASI combines data transformations developed for compositional data analysis with a graphical model inference framework that assumes the underlying ecological association network is sparse. To reconstruct the network, SPIEC-EASI relies on algorithms for sparse neighborhood and inverse covariance selection. To provide a synthetic benchmark in the absence of an experimentally validated gold-standard network, SPIEC-EASI is accompanied by a set of computational tools to generate OTU count data from a set of diverse underlying network topologies. SPIEC-EASI outperforms state-of-the-art methods to recover edges and network properties on synthetic data under a variety of scenarios. SPIEC-EASI also reproducibly predicts previously unknown microbial associations using data from the American Gut project.
format Online
Article
Text
id pubmed-4423992
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44239922015-05-13 Sparse and Compositionally Robust Inference of Microbial Ecological Networks Kurtz, Zachary D. Müller, Christian L. Miraldi, Emily R. Littman, Dan R. Blaser, Martin J. Bonneau, Richard A. PLoS Comput Biol Research Article 16S ribosomal RNA (rRNA) gene and other environmental sequencing techniques provide snapshots of microbial communities, revealing phylogeny and the abundances of microbial populations across diverse ecosystems. While changes in microbial community structure are demonstrably associated with certain environmental conditions (from metabolic and immunological health in mammals to ecological stability in soils and oceans), identification of underlying mechanisms requires new statistical tools, as these datasets present several technical challenges. First, the abundances of microbial operational taxonomic units (OTUs) from amplicon-based datasets are compositional. Counts are normalized to the total number of counts in the sample. Thus, microbial abundances are not independent, and traditional statistical metrics (e.g., correlation) for the detection of OTU-OTU relationships can lead to spurious results. Secondly, microbial sequencing-based studies typically measure hundreds of OTUs on only tens to hundreds of samples; thus, inference of OTU-OTU association networks is severely under-powered, and additional information (or assumptions) are required for accurate inference. Here, we present SPIEC-EASI (SParse InversE Covariance Estimation for Ecological Association Inference), a statistical method for the inference of microbial ecological networks from amplicon sequencing datasets that addresses both of these issues. SPIEC-EASI combines data transformations developed for compositional data analysis with a graphical model inference framework that assumes the underlying ecological association network is sparse. To reconstruct the network, SPIEC-EASI relies on algorithms for sparse neighborhood and inverse covariance selection. To provide a synthetic benchmark in the absence of an experimentally validated gold-standard network, SPIEC-EASI is accompanied by a set of computational tools to generate OTU count data from a set of diverse underlying network topologies. SPIEC-EASI outperforms state-of-the-art methods to recover edges and network properties on synthetic data under a variety of scenarios. SPIEC-EASI also reproducibly predicts previously unknown microbial associations using data from the American Gut project. Public Library of Science 2015-05-07 /pmc/articles/PMC4423992/ /pubmed/25950956 http://dx.doi.org/10.1371/journal.pcbi.1004226 Text en © 2015 Kurtz et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kurtz, Zachary D.
Müller, Christian L.
Miraldi, Emily R.
Littman, Dan R.
Blaser, Martin J.
Bonneau, Richard A.
Sparse and Compositionally Robust Inference of Microbial Ecological Networks
title Sparse and Compositionally Robust Inference of Microbial Ecological Networks
title_full Sparse and Compositionally Robust Inference of Microbial Ecological Networks
title_fullStr Sparse and Compositionally Robust Inference of Microbial Ecological Networks
title_full_unstemmed Sparse and Compositionally Robust Inference of Microbial Ecological Networks
title_short Sparse and Compositionally Robust Inference of Microbial Ecological Networks
title_sort sparse and compositionally robust inference of microbial ecological networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4423992/
https://www.ncbi.nlm.nih.gov/pubmed/25950956
http://dx.doi.org/10.1371/journal.pcbi.1004226
work_keys_str_mv AT kurtzzacharyd sparseandcompositionallyrobustinferenceofmicrobialecologicalnetworks
AT mullerchristianl sparseandcompositionallyrobustinferenceofmicrobialecologicalnetworks
AT miraldiemilyr sparseandcompositionallyrobustinferenceofmicrobialecologicalnetworks
AT littmandanr sparseandcompositionallyrobustinferenceofmicrobialecologicalnetworks
AT blasermartinj sparseandcompositionallyrobustinferenceofmicrobialecologicalnetworks
AT bonneauricharda sparseandcompositionallyrobustinferenceofmicrobialecologicalnetworks