Cargando…

A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models

Gaussian Graphical Models (GGMs) are extensively used in many research areas, such as genomics, proteomics, neuroimaging, and psychology, to study the partial correlation structure of a set of variables. This structure is visualized by drawing an undirected network, in which the variables constitute...

Descripción completa

Detalles Bibliográficos
Autores principales: Lafit, Ginette, Tuerlinckx, Francis, Myin-Germeys, Inez, Ceulemans, Eva
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882820/
https://www.ncbi.nlm.nih.gov/pubmed/31780817
http://dx.doi.org/10.1038/s41598-019-53795-x
_version_ 1783474244869423104
author Lafit, Ginette
Tuerlinckx, Francis
Myin-Germeys, Inez
Ceulemans, Eva
author_facet Lafit, Ginette
Tuerlinckx, Francis
Myin-Germeys, Inez
Ceulemans, Eva
author_sort Lafit, Ginette
collection PubMed
description Gaussian Graphical Models (GGMs) are extensively used in many research areas, such as genomics, proteomics, neuroimaging, and psychology, to study the partial correlation structure of a set of variables. This structure is visualized by drawing an undirected network, in which the variables constitute the nodes and the partial correlations the edges. In many applications, it makes sense to impose sparsity (i.e., some of the partial correlations are forced to zero) as sparsity is theoretically meaningful and/or because it improves the predictive accuracy of the fitted model. However, as we will show by means of extensive simulations, state-of-the-art estimation approaches for imposing sparsity on GGMs, such as the Graphical lasso, ℓ(1) regularized nodewise regression, and joint sparse regression, fall short because they often yield too many false positives (i.e., partial correlations that are not properly set to zero). In this paper we present a new estimation approach that allows to control the false positive rate better. Our approach consists of two steps: First, we estimate an undirected network using one of the three state-of-the-art estimation approaches. Second, we try to detect the false positives, by flagging the partial correlations that are smaller in absolute value than a given threshold, which is determined through cross-validation; the flagged correlations are set to zero. Applying this new approach to the same simulated data, shows that it indeed performs better. We also illustrate our approach by using it to estimate (1) a gene regulatory network for breast cancer data, (2) a symptom network of patients with a diagnosis within the nonaffective psychotic spectrum and (3) a symptom network of patients with PTSD.
format Online
Article
Text
id pubmed-6882820
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-68828202019-12-06 A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models Lafit, Ginette Tuerlinckx, Francis Myin-Germeys, Inez Ceulemans, Eva Sci Rep Article Gaussian Graphical Models (GGMs) are extensively used in many research areas, such as genomics, proteomics, neuroimaging, and psychology, to study the partial correlation structure of a set of variables. This structure is visualized by drawing an undirected network, in which the variables constitute the nodes and the partial correlations the edges. In many applications, it makes sense to impose sparsity (i.e., some of the partial correlations are forced to zero) as sparsity is theoretically meaningful and/or because it improves the predictive accuracy of the fitted model. However, as we will show by means of extensive simulations, state-of-the-art estimation approaches for imposing sparsity on GGMs, such as the Graphical lasso, ℓ(1) regularized nodewise regression, and joint sparse regression, fall short because they often yield too many false positives (i.e., partial correlations that are not properly set to zero). In this paper we present a new estimation approach that allows to control the false positive rate better. Our approach consists of two steps: First, we estimate an undirected network using one of the three state-of-the-art estimation approaches. Second, we try to detect the false positives, by flagging the partial correlations that are smaller in absolute value than a given threshold, which is determined through cross-validation; the flagged correlations are set to zero. Applying this new approach to the same simulated data, shows that it indeed performs better. We also illustrate our approach by using it to estimate (1) a gene regulatory network for breast cancer data, (2) a symptom network of patients with a diagnosis within the nonaffective psychotic spectrum and (3) a symptom network of patients with PTSD. Nature Publishing Group UK 2019-11-28 /pmc/articles/PMC6882820/ /pubmed/31780817 http://dx.doi.org/10.1038/s41598-019-53795-x Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Lafit, Ginette
Tuerlinckx, Francis
Myin-Germeys, Inez
Ceulemans, Eva
A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models
title A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models
title_full A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models
title_fullStr A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models
title_full_unstemmed A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models
title_short A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models
title_sort partial correlation screening approach for controlling the false positive rate in sparse gaussian graphical models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882820/
https://www.ncbi.nlm.nih.gov/pubmed/31780817
http://dx.doi.org/10.1038/s41598-019-53795-x
work_keys_str_mv AT lafitginette apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels
AT tuerlinckxfrancis apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels
AT myingermeysinez apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels
AT ceulemanseva apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels
AT lafitginette partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels
AT tuerlinckxfrancis partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels
AT myingermeysinez partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels
AT ceulemanseva partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels