Cargando…
A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models
Gaussian Graphical Models (GGMs) are extensively used in many research areas, such as genomics, proteomics, neuroimaging, and psychology, to study the partial correlation structure of a set of variables. This structure is visualized by drawing an undirected network, in which the variables constitute...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882820/ https://www.ncbi.nlm.nih.gov/pubmed/31780817 http://dx.doi.org/10.1038/s41598-019-53795-x |
_version_ | 1783474244869423104 |
---|---|
author | Lafit, Ginette Tuerlinckx, Francis Myin-Germeys, Inez Ceulemans, Eva |
author_facet | Lafit, Ginette Tuerlinckx, Francis Myin-Germeys, Inez Ceulemans, Eva |
author_sort | Lafit, Ginette |
collection | PubMed |
description | Gaussian Graphical Models (GGMs) are extensively used in many research areas, such as genomics, proteomics, neuroimaging, and psychology, to study the partial correlation structure of a set of variables. This structure is visualized by drawing an undirected network, in which the variables constitute the nodes and the partial correlations the edges. In many applications, it makes sense to impose sparsity (i.e., some of the partial correlations are forced to zero) as sparsity is theoretically meaningful and/or because it improves the predictive accuracy of the fitted model. However, as we will show by means of extensive simulations, state-of-the-art estimation approaches for imposing sparsity on GGMs, such as the Graphical lasso, ℓ(1) regularized nodewise regression, and joint sparse regression, fall short because they often yield too many false positives (i.e., partial correlations that are not properly set to zero). In this paper we present a new estimation approach that allows to control the false positive rate better. Our approach consists of two steps: First, we estimate an undirected network using one of the three state-of-the-art estimation approaches. Second, we try to detect the false positives, by flagging the partial correlations that are smaller in absolute value than a given threshold, which is determined through cross-validation; the flagged correlations are set to zero. Applying this new approach to the same simulated data, shows that it indeed performs better. We also illustrate our approach by using it to estimate (1) a gene regulatory network for breast cancer data, (2) a symptom network of patients with a diagnosis within the nonaffective psychotic spectrum and (3) a symptom network of patients with PTSD. |
format | Online Article Text |
id | pubmed-6882820 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-68828202019-12-06 A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models Lafit, Ginette Tuerlinckx, Francis Myin-Germeys, Inez Ceulemans, Eva Sci Rep Article Gaussian Graphical Models (GGMs) are extensively used in many research areas, such as genomics, proteomics, neuroimaging, and psychology, to study the partial correlation structure of a set of variables. This structure is visualized by drawing an undirected network, in which the variables constitute the nodes and the partial correlations the edges. In many applications, it makes sense to impose sparsity (i.e., some of the partial correlations are forced to zero) as sparsity is theoretically meaningful and/or because it improves the predictive accuracy of the fitted model. However, as we will show by means of extensive simulations, state-of-the-art estimation approaches for imposing sparsity on GGMs, such as the Graphical lasso, ℓ(1) regularized nodewise regression, and joint sparse regression, fall short because they often yield too many false positives (i.e., partial correlations that are not properly set to zero). In this paper we present a new estimation approach that allows to control the false positive rate better. Our approach consists of two steps: First, we estimate an undirected network using one of the three state-of-the-art estimation approaches. Second, we try to detect the false positives, by flagging the partial correlations that are smaller in absolute value than a given threshold, which is determined through cross-validation; the flagged correlations are set to zero. Applying this new approach to the same simulated data, shows that it indeed performs better. We also illustrate our approach by using it to estimate (1) a gene regulatory network for breast cancer data, (2) a symptom network of patients with a diagnosis within the nonaffective psychotic spectrum and (3) a symptom network of patients with PTSD. Nature Publishing Group UK 2019-11-28 /pmc/articles/PMC6882820/ /pubmed/31780817 http://dx.doi.org/10.1038/s41598-019-53795-x Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Lafit, Ginette Tuerlinckx, Francis Myin-Germeys, Inez Ceulemans, Eva A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models |
title | A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models |
title_full | A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models |
title_fullStr | A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models |
title_full_unstemmed | A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models |
title_short | A Partial Correlation Screening Approach for Controlling the False Positive Rate in Sparse Gaussian Graphical Models |
title_sort | partial correlation screening approach for controlling the false positive rate in sparse gaussian graphical models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882820/ https://www.ncbi.nlm.nih.gov/pubmed/31780817 http://dx.doi.org/10.1038/s41598-019-53795-x |
work_keys_str_mv | AT lafitginette apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels AT tuerlinckxfrancis apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels AT myingermeysinez apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels AT ceulemanseva apartialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels AT lafitginette partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels AT tuerlinckxfrancis partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels AT myingermeysinez partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels AT ceulemanseva partialcorrelationscreeningapproachforcontrollingthefalsepositiverateinsparsegaussiangraphicalmodels |