Cargando…

Exact hypothesis testing for shrinkage-based Gaussian graphical models

MOTIVATION: One of the main goals in systems biology is to learn molecular regulatory networks from quantitative profile data. In particular, Gaussian graphical models (GGMs) are widely used network models in bioinformatics where variables (e.g. transcripts, metabolites or proteins) are represented...

Descripción completa

Detalles Bibliográficos
Autores principales: Bernal, Victor, Bischoff, Rainer, Guryev, Victor, Grzegorczyk, Marco, Horvatovich, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6901079/
https://www.ncbi.nlm.nih.gov/pubmed/31077287
http://dx.doi.org/10.1093/bioinformatics/btz357
_version_ 1783477450373595136
author Bernal, Victor
Bischoff, Rainer
Guryev, Victor
Grzegorczyk, Marco
Horvatovich, Peter
author_facet Bernal, Victor
Bischoff, Rainer
Guryev, Victor
Grzegorczyk, Marco
Horvatovich, Peter
author_sort Bernal, Victor
collection PubMed
description MOTIVATION: One of the main goals in systems biology is to learn molecular regulatory networks from quantitative profile data. In particular, Gaussian graphical models (GGMs) are widely used network models in bioinformatics where variables (e.g. transcripts, metabolites or proteins) are represented by nodes, and pairs of nodes are connected with an edge according to their partial correlation. Reconstructing a GGM from data is a challenging task when the sample size is smaller than the number of variables. The main problem consists in finding the inverse of the covariance estimator which is ill-conditioned in this case. Shrinkage-based covariance estimators are a popular approach, producing an invertible ‘shrunk’ covariance. However, a proper significance test for the ‘shrunk’ partial correlation (i.e. the GGM edges) is an open challenge as a probability density including the shrinkage is unknown. In this article, we present (i) a geometric reformulation of the shrinkage-based GGM, and (ii) a probability density that naturally includes the shrinkage parameter. RESULTS: Our results show that the inference using this new ‘shrunk’ probability density is as accurate as Monte Carlo estimation (an unbiased non-parametric method) for any shrinkage value, while being computationally more efficient. We show on synthetic data how the novel test for significance allows an accurate control of the Type I error and outperforms the network reconstruction obtained by the widely used R package GeneNet. This is further highlighted in two gene expression datasets from stress response in Eschericha coli, and the effect of influenza infection in Mus musculus. AVAILABILITY AND IMPLEMENTATION: https://github.com/V-Bernal/GGM-Shrinkage SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6901079
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-69010792019-12-16 Exact hypothesis testing for shrinkage-based Gaussian graphical models Bernal, Victor Bischoff, Rainer Guryev, Victor Grzegorczyk, Marco Horvatovich, Peter Bioinformatics Original Papers MOTIVATION: One of the main goals in systems biology is to learn molecular regulatory networks from quantitative profile data. In particular, Gaussian graphical models (GGMs) are widely used network models in bioinformatics where variables (e.g. transcripts, metabolites or proteins) are represented by nodes, and pairs of nodes are connected with an edge according to their partial correlation. Reconstructing a GGM from data is a challenging task when the sample size is smaller than the number of variables. The main problem consists in finding the inverse of the covariance estimator which is ill-conditioned in this case. Shrinkage-based covariance estimators are a popular approach, producing an invertible ‘shrunk’ covariance. However, a proper significance test for the ‘shrunk’ partial correlation (i.e. the GGM edges) is an open challenge as a probability density including the shrinkage is unknown. In this article, we present (i) a geometric reformulation of the shrinkage-based GGM, and (ii) a probability density that naturally includes the shrinkage parameter. RESULTS: Our results show that the inference using this new ‘shrunk’ probability density is as accurate as Monte Carlo estimation (an unbiased non-parametric method) for any shrinkage value, while being computationally more efficient. We show on synthetic data how the novel test for significance allows an accurate control of the Type I error and outperforms the network reconstruction obtained by the widely used R package GeneNet. This is further highlighted in two gene expression datasets from stress response in Eschericha coli, and the effect of influenza infection in Mus musculus. AVAILABILITY AND IMPLEMENTATION: https://github.com/V-Bernal/GGM-Shrinkage SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-12-01 2019-05-11 /pmc/articles/PMC6901079/ /pubmed/31077287 http://dx.doi.org/10.1093/bioinformatics/btz357 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Bernal, Victor
Bischoff, Rainer
Guryev, Victor
Grzegorczyk, Marco
Horvatovich, Peter
Exact hypothesis testing for shrinkage-based Gaussian graphical models
title Exact hypothesis testing for shrinkage-based Gaussian graphical models
title_full Exact hypothesis testing for shrinkage-based Gaussian graphical models
title_fullStr Exact hypothesis testing for shrinkage-based Gaussian graphical models
title_full_unstemmed Exact hypothesis testing for shrinkage-based Gaussian graphical models
title_short Exact hypothesis testing for shrinkage-based Gaussian graphical models
title_sort exact hypothesis testing for shrinkage-based gaussian graphical models
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6901079/
https://www.ncbi.nlm.nih.gov/pubmed/31077287
http://dx.doi.org/10.1093/bioinformatics/btz357
work_keys_str_mv AT bernalvictor exacthypothesistestingforshrinkagebasedgaussiangraphicalmodels
AT bischoffrainer exacthypothesistestingforshrinkagebasedgaussiangraphicalmodels
AT guryevvictor exacthypothesistestingforshrinkagebasedgaussiangraphicalmodels
AT grzegorczykmarco exacthypothesistestingforshrinkagebasedgaussiangraphicalmodels
AT horvatovichpeter exacthypothesistestingforshrinkagebasedgaussiangraphicalmodels