Cargando…

FastSpar: rapid and scalable correlation estimation for compositional data

SUMMARY: A common goal of microbiome studies is the elucidation of community composition and member interactions using counts of taxonomic units extracted from sequence data. Inference of interaction networks from sparse and compositional data requires specialized statistical approaches. A popular s...

Descripción completa

Detalles Bibliográficos
Autores principales: Watts, Stephen C, Ritchie, Scott C, Inouye, Michael, Holt, Kathryn E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6419895/
https://www.ncbi.nlm.nih.gov/pubmed/30169561
http://dx.doi.org/10.1093/bioinformatics/bty734
_version_ 1783404020794130432
author Watts, Stephen C
Ritchie, Scott C
Inouye, Michael
Holt, Kathryn E
author_facet Watts, Stephen C
Ritchie, Scott C
Inouye, Michael
Holt, Kathryn E
author_sort Watts, Stephen C
collection PubMed
description SUMMARY: A common goal of microbiome studies is the elucidation of community composition and member interactions using counts of taxonomic units extracted from sequence data. Inference of interaction networks from sparse and compositional data requires specialized statistical approaches. A popular solution is SparCC, however its performance limits the calculation of interaction networks for very high-dimensional datasets. Here we introduce FastSpar, an efficient and parallelizable implementation of the SparCC algorithm which rapidly infers correlation networks and calculates P-values using an unbiased estimator. We further demonstrate that FastSpar reduces network inference wall time by 2–3 orders of magnitude compared to SparCC. AVAILABILITY AND IMPLEMENTATION: FastSpar source code, precompiled binaries and platform packages are freely available on GitHub: github.com/scwatts/FastSpar SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6419895
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-64198952019-03-20 FastSpar: rapid and scalable correlation estimation for compositional data Watts, Stephen C Ritchie, Scott C Inouye, Michael Holt, Kathryn E Bioinformatics Applications Notes SUMMARY: A common goal of microbiome studies is the elucidation of community composition and member interactions using counts of taxonomic units extracted from sequence data. Inference of interaction networks from sparse and compositional data requires specialized statistical approaches. A popular solution is SparCC, however its performance limits the calculation of interaction networks for very high-dimensional datasets. Here we introduce FastSpar, an efficient and parallelizable implementation of the SparCC algorithm which rapidly infers correlation networks and calculates P-values using an unbiased estimator. We further demonstrate that FastSpar reduces network inference wall time by 2–3 orders of magnitude compared to SparCC. AVAILABILITY AND IMPLEMENTATION: FastSpar source code, precompiled binaries and platform packages are freely available on GitHub: github.com/scwatts/FastSpar SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-03-15 2018-08-29 /pmc/articles/PMC6419895/ /pubmed/30169561 http://dx.doi.org/10.1093/bioinformatics/bty734 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Watts, Stephen C
Ritchie, Scott C
Inouye, Michael
Holt, Kathryn E
FastSpar: rapid and scalable correlation estimation for compositional data
title FastSpar: rapid and scalable correlation estimation for compositional data
title_full FastSpar: rapid and scalable correlation estimation for compositional data
title_fullStr FastSpar: rapid and scalable correlation estimation for compositional data
title_full_unstemmed FastSpar: rapid and scalable correlation estimation for compositional data
title_short FastSpar: rapid and scalable correlation estimation for compositional data
title_sort fastspar: rapid and scalable correlation estimation for compositional data
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6419895/
https://www.ncbi.nlm.nih.gov/pubmed/30169561
http://dx.doi.org/10.1093/bioinformatics/bty734
work_keys_str_mv AT wattsstephenc fastsparrapidandscalablecorrelationestimationforcompositionaldata
AT ritchiescottc fastsparrapidandscalablecorrelationestimationforcompositionaldata
AT inouyemichael fastsparrapidandscalablecorrelationestimationforcompositionaldata
AT holtkathryne fastsparrapidandscalablecorrelationestimationforcompositionaldata