Cargando…

On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation

BACKGROUND: Various ℓ (1)-penalised estimation methods such as graphical lasso and CLIME are widely used for sparse precision matrix estimation and learning of undirected network structure from data. Many of these methods have been shown to be consistent under various quantitative assumptions about...

Descripción completa

Detalles Bibliográficos
Autores principales: Heinävaara, Otte, Leppä-aho, Janne, Corander, Jukka, Honkela, Antti
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5249033/
https://www.ncbi.nlm.nih.gov/pubmed/28105909
http://dx.doi.org/10.1186/s12859-016-1309-x
_version_ 1782497376282345472
author Heinävaara, Otte
Leppä-aho, Janne
Corander, Jukka
Honkela, Antti
author_facet Heinävaara, Otte
Leppä-aho, Janne
Corander, Jukka
Honkela, Antti
author_sort Heinävaara, Otte
collection PubMed
description BACKGROUND: Various ℓ (1)-penalised estimation methods such as graphical lasso and CLIME are widely used for sparse precision matrix estimation and learning of undirected network structure from data. Many of these methods have been shown to be consistent under various quantitative assumptions about the underlying true covariance matrix. Intuitively, these conditions are related to situations where the penalty term will dominate the optimisation. RESULTS: We explore the consistency of ℓ (1)-based methods for a class of bipartite graphs motivated by the structure of models commonly used for gene regulatory networks. We show that all ℓ (1)-based methods fail dramatically for models with nearly linear dependencies between the variables. We also study the consistency on models derived from real gene expression data and note that the assumptions needed for consistency never hold even for modest sized gene networks and ℓ (1)-based methods also become unreliable in practice for larger networks. CONCLUSIONS: Our results demonstrate that ℓ (1)-penalised undirected network structure learning methods are unable to reliably learn many sparse bipartite graph structures, which arise often in gene expression data. Users of such methods should be aware of the consistency criteria of the methods and check if they are likely to be met in their application of interest.
format Online
Article
Text
id pubmed-5249033
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52490332017-01-26 On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation Heinävaara, Otte Leppä-aho, Janne Corander, Jukka Honkela, Antti BMC Bioinformatics Research BACKGROUND: Various ℓ (1)-penalised estimation methods such as graphical lasso and CLIME are widely used for sparse precision matrix estimation and learning of undirected network structure from data. Many of these methods have been shown to be consistent under various quantitative assumptions about the underlying true covariance matrix. Intuitively, these conditions are related to situations where the penalty term will dominate the optimisation. RESULTS: We explore the consistency of ℓ (1)-based methods for a class of bipartite graphs motivated by the structure of models commonly used for gene regulatory networks. We show that all ℓ (1)-based methods fail dramatically for models with nearly linear dependencies between the variables. We also study the consistency on models derived from real gene expression data and note that the assumptions needed for consistency never hold even for modest sized gene networks and ℓ (1)-based methods also become unreliable in practice for larger networks. CONCLUSIONS: Our results demonstrate that ℓ (1)-penalised undirected network structure learning methods are unable to reliably learn many sparse bipartite graph structures, which arise often in gene expression data. Users of such methods should be aware of the consistency criteria of the methods and check if they are likely to be met in their application of interest. BioMed Central 2016-12-13 /pmc/articles/PMC5249033/ /pubmed/28105909 http://dx.doi.org/10.1186/s12859-016-1309-x Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Heinävaara, Otte
Leppä-aho, Janne
Corander, Jukka
Honkela, Antti
On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation
title On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation
title_full On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation
title_fullStr On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation
title_full_unstemmed On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation
title_short On the inconsistency of ℓ(1)-penalised sparse precision matrix estimation
title_sort on the inconsistency of ℓ(1)-penalised sparse precision matrix estimation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5249033/
https://www.ncbi.nlm.nih.gov/pubmed/28105909
http://dx.doi.org/10.1186/s12859-016-1309-x
work_keys_str_mv AT heinavaaraotte ontheinconsistencyofl1penalisedsparseprecisionmatrixestimation
AT leppaahojanne ontheinconsistencyofl1penalisedsparseprecisionmatrixestimation
AT coranderjukka ontheinconsistencyofl1penalisedsparseprecisionmatrixestimation
AT honkelaantti ontheinconsistencyofl1penalisedsparseprecisionmatrixestimation