Cargando…

Penalized estimation of the Gaussian graphical model from data with replicates

Gaussian graphical models are usually estimated from unreplicated data. The data are, however, likely to comprise signal and noise. These two cannot be deconvoluted from unreplicated data. Pragmatically, the noise is then ignored in practice. We point out the consequences of this practice for the re...

Descripción completa

Detalles Bibliográficos
Autores principales: van Wieringen, Wessel N., Chen, Yao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360145/
https://www.ncbi.nlm.nih.gov/pubmed/33987868
http://dx.doi.org/10.1002/sim.9028
_version_ 1783737686777921536
author van Wieringen, Wessel N.
Chen, Yao
author_facet van Wieringen, Wessel N.
Chen, Yao
author_sort van Wieringen, Wessel N.
collection PubMed
description Gaussian graphical models are usually estimated from unreplicated data. The data are, however, likely to comprise signal and noise. These two cannot be deconvoluted from unreplicated data. Pragmatically, the noise is then ignored in practice. We point out the consequences of this practice for the reconstruction of the conditional independence graph of the signal. Replicated data allow for the deconvolution of signal and noise and the reconstruction of former's conditional independence graph. Hereto we present a penalized Expectation‐Maximization algorithm. The penalty parameter is chosen to maximize the F‐fold cross‐validated log‐likelihood. Sampling schemes of the folds from replicated data are discussed. By simulation we investigate the effect of replicates on the reconstruction of the signal's conditional independence graph. Moreover, we compare the proposed method to several obvious competitors. In an application we use data from oncogenomic studies with replicates to reconstruct the gene‐gene interaction networks, operationalized as conditional independence graphs. This yields a realistic portrait of the effect of ignoring other sources but sampling variation. In addition, it bears implications on the reproducibility of inferred gene‐gene interaction networks reported in literature.
format Online
Article
Text
id pubmed-8360145
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-83601452021-08-17 Penalized estimation of the Gaussian graphical model from data with replicates van Wieringen, Wessel N. Chen, Yao Stat Med Research Articles Gaussian graphical models are usually estimated from unreplicated data. The data are, however, likely to comprise signal and noise. These two cannot be deconvoluted from unreplicated data. Pragmatically, the noise is then ignored in practice. We point out the consequences of this practice for the reconstruction of the conditional independence graph of the signal. Replicated data allow for the deconvolution of signal and noise and the reconstruction of former's conditional independence graph. Hereto we present a penalized Expectation‐Maximization algorithm. The penalty parameter is chosen to maximize the F‐fold cross‐validated log‐likelihood. Sampling schemes of the folds from replicated data are discussed. By simulation we investigate the effect of replicates on the reconstruction of the signal's conditional independence graph. Moreover, we compare the proposed method to several obvious competitors. In an application we use data from oncogenomic studies with replicates to reconstruct the gene‐gene interaction networks, operationalized as conditional independence graphs. This yields a realistic portrait of the effect of ignoring other sources but sampling variation. In addition, it bears implications on the reproducibility of inferred gene‐gene interaction networks reported in literature. John Wiley and Sons Inc. 2021-05-13 2021-08-30 /pmc/articles/PMC8360145/ /pubmed/33987868 http://dx.doi.org/10.1002/sim.9028 Text en © 2021 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
van Wieringen, Wessel N.
Chen, Yao
Penalized estimation of the Gaussian graphical model from data with replicates
title Penalized estimation of the Gaussian graphical model from data with replicates
title_full Penalized estimation of the Gaussian graphical model from data with replicates
title_fullStr Penalized estimation of the Gaussian graphical model from data with replicates
title_full_unstemmed Penalized estimation of the Gaussian graphical model from data with replicates
title_short Penalized estimation of the Gaussian graphical model from data with replicates
title_sort penalized estimation of the gaussian graphical model from data with replicates
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360145/
https://www.ncbi.nlm.nih.gov/pubmed/33987868
http://dx.doi.org/10.1002/sim.9028
work_keys_str_mv AT vanwieringenwesseln penalizedestimationofthegaussiangraphicalmodelfromdatawithreplicates
AT chenyao penalizedestimationofthegaussiangraphicalmodelfromdatawithreplicates