Cargando…

Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure

We introduce a new model selection criterion for sparse complex gene network modeling where gene co-expression relationships are estimated from data. This is a novel formulation of the gap statistic and it can be used for the optimal choice of a regularization parameter in graphical models. Our crit...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuismin, Markku, Dodangeh, Fatemeh, Sillanpää, Mikko J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9210289/
https://www.ncbi.nlm.nih.gov/pubmed/35100338
http://dx.doi.org/10.1093/g3journal/jkab437
_version_ 1784730135287562240
author Kuismin, Markku
Dodangeh, Fatemeh
Sillanpää, Mikko J
author_facet Kuismin, Markku
Dodangeh, Fatemeh
Sillanpää, Mikko J
author_sort Kuismin, Markku
collection PubMed
description We introduce a new model selection criterion for sparse complex gene network modeling where gene co-expression relationships are estimated from data. This is a novel formulation of the gap statistic and it can be used for the optimal choice of a regularization parameter in graphical models. Our criterion favors gene network structure which differs from a trivial gene interaction structure obtained totally at random. We call the criterion the gap-com statistic (gap community statistic). The idea of the gap-com statistic is to examine the difference between the observed and the expected counts of communities (clusters) where the expected counts are evaluated using either data permutations or reference graph (the Erdős-Rényi graph) resampling. The latter represents a trivial gene network structure determined by chance. We put emphasis on complex network inference because the structure of gene networks is usually nontrivial. For example, some of the genes can be clustered together or some genes can be hub genes. We evaluate the performance of the gap-com statistic in graphical model selection and compare its performance to some existing methods using simulated and real biological data examples.
format Online
Article
Text
id pubmed-9210289
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92102892022-06-21 Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure Kuismin, Markku Dodangeh, Fatemeh Sillanpää, Mikko J G3 (Bethesda) Software and Data Resources We introduce a new model selection criterion for sparse complex gene network modeling where gene co-expression relationships are estimated from data. This is a novel formulation of the gap statistic and it can be used for the optimal choice of a regularization parameter in graphical models. Our criterion favors gene network structure which differs from a trivial gene interaction structure obtained totally at random. We call the criterion the gap-com statistic (gap community statistic). The idea of the gap-com statistic is to examine the difference between the observed and the expected counts of communities (clusters) where the expected counts are evaluated using either data permutations or reference graph (the Erdős-Rényi graph) resampling. The latter represents a trivial gene network structure determined by chance. We put emphasis on complex network inference because the structure of gene networks is usually nontrivial. For example, some of the genes can be clustered together or some genes can be hub genes. We evaluate the performance of the gap-com statistic in graphical model selection and compare its performance to some existing methods using simulated and real biological data examples. Oxford University Press 2021-12-21 /pmc/articles/PMC9210289/ /pubmed/35100338 http://dx.doi.org/10.1093/g3journal/jkab437 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software and Data Resources
Kuismin, Markku
Dodangeh, Fatemeh
Sillanpää, Mikko J
Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
title Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
title_full Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
title_fullStr Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
title_full_unstemmed Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
title_short Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
title_sort gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
topic Software and Data Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9210289/
https://www.ncbi.nlm.nih.gov/pubmed/35100338
http://dx.doi.org/10.1093/g3journal/jkab437
work_keys_str_mv AT kuisminmarkku gapcomgeneralmodelselectioncriterionforsparseundirectedgenenetworkswithnontrivialcommunitystructure
AT dodangehfatemeh gapcomgeneralmodelselectioncriterionforsparseundirectedgenenetworkswithnontrivialcommunitystructure
AT sillanpaamikkoj gapcomgeneralmodelselectioncriterionforsparseundirectedgenenetworkswithnontrivialcommunitystructure