Cargando…

SCNIC: Sparse correlation network investigation for compositional data

Microbiome studies are often limited by a lack of statistical power due to small sample sizes and a large number of features. This problem is exacerbated in correlative studies of multi‐omic datasets. Statistical power can be increased by finding and summarizing modules of correlated observations, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaffer, Michael, Thurimella, Kumar, Sterrett, John D., Lozupone, Catherine A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744196/
https://www.ncbi.nlm.nih.gov/pubmed/36001047
http://dx.doi.org/10.1111/1755-0998.13704
_version_ 1784848868016390144
author Shaffer, Michael
Thurimella, Kumar
Sterrett, John D.
Lozupone, Catherine A.
author_facet Shaffer, Michael
Thurimella, Kumar
Sterrett, John D.
Lozupone, Catherine A.
author_sort Shaffer, Michael
collection PubMed
description Microbiome studies are often limited by a lack of statistical power due to small sample sizes and a large number of features. This problem is exacerbated in correlative studies of multi‐omic datasets. Statistical power can be increased by finding and summarizing modules of correlated observations, which is one dimensionality reduction method. Additionally, modules provide biological insight as correlated groups of microbes can have relationships among themselves. To address these challenges, we developed SCNIC: Sparse Cooccurrence Network Investigation for compositional data. SCNIC is open‐source software that can generate correlation networks and detect and summarize modules of highly correlated features. Modules can be formed using either the Louvain Modularity Maximization (LMM) algorithm or a Shared Minimum Distance algorithm (SMD) that we newly describe here and relate to LMM using simulated data. We applied SCNIC to two published datasets and we achieved increased statistical power and identified microbes that not only differed across groups, but also correlated strongly with each other, suggesting shared environmental drivers or cooperative relationships among them. SCNIC provides an easy way to generate correlation networks, identify modules of correlated features and summarize them for downstream statistical analysis. Although SCNIC was designed considering properties of microbiome data, such as compositionality and sparsity, it can be applied to a variety of data types including metabolomics data and used to integrate multiple data types. SCNIC allows for the identification of functional microbial relationships at scale while increasing statistical power through feature reduction.
format Online
Article
Text
id pubmed-9744196
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-97441962023-04-12 SCNIC: Sparse correlation network investigation for compositional data Shaffer, Michael Thurimella, Kumar Sterrett, John D. Lozupone, Catherine A. Mol Ecol Resour Resource Articles Microbiome studies are often limited by a lack of statistical power due to small sample sizes and a large number of features. This problem is exacerbated in correlative studies of multi‐omic datasets. Statistical power can be increased by finding and summarizing modules of correlated observations, which is one dimensionality reduction method. Additionally, modules provide biological insight as correlated groups of microbes can have relationships among themselves. To address these challenges, we developed SCNIC: Sparse Cooccurrence Network Investigation for compositional data. SCNIC is open‐source software that can generate correlation networks and detect and summarize modules of highly correlated features. Modules can be formed using either the Louvain Modularity Maximization (LMM) algorithm or a Shared Minimum Distance algorithm (SMD) that we newly describe here and relate to LMM using simulated data. We applied SCNIC to two published datasets and we achieved increased statistical power and identified microbes that not only differed across groups, but also correlated strongly with each other, suggesting shared environmental drivers or cooperative relationships among them. SCNIC provides an easy way to generate correlation networks, identify modules of correlated features and summarize them for downstream statistical analysis. Although SCNIC was designed considering properties of microbiome data, such as compositionality and sparsity, it can be applied to a variety of data types including metabolomics data and used to integrate multiple data types. SCNIC allows for the identification of functional microbial relationships at scale while increasing statistical power through feature reduction. John Wiley and Sons Inc. 2022-09-01 2023-01 /pmc/articles/PMC9744196/ /pubmed/36001047 http://dx.doi.org/10.1111/1755-0998.13704 Text en © 2022 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Resource Articles
Shaffer, Michael
Thurimella, Kumar
Sterrett, John D.
Lozupone, Catherine A.
SCNIC: Sparse correlation network investigation for compositional data
title SCNIC: Sparse correlation network investigation for compositional data
title_full SCNIC: Sparse correlation network investigation for compositional data
title_fullStr SCNIC: Sparse correlation network investigation for compositional data
title_full_unstemmed SCNIC: Sparse correlation network investigation for compositional data
title_short SCNIC: Sparse correlation network investigation for compositional data
title_sort scnic: sparse correlation network investigation for compositional data
topic Resource Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744196/
https://www.ncbi.nlm.nih.gov/pubmed/36001047
http://dx.doi.org/10.1111/1755-0998.13704
work_keys_str_mv AT shaffermichael scnicsparsecorrelationnetworkinvestigationforcompositionaldata
AT thurimellakumar scnicsparsecorrelationnetworkinvestigationforcompositionaldata
AT sterrettjohnd scnicsparsecorrelationnetworkinvestigationforcompositionaldata
AT lozuponecatherinea scnicsparsecorrelationnetworkinvestigationforcompositionaldata