Cargando…
Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure
We introduce a new model selection criterion for sparse complex gene network modeling where gene co-expression relationships are estimated from data. This is a novel formulation of the gap statistic and it can be used for the optimal choice of a regularization parameter in graphical models. Our crit...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9210289/ https://www.ncbi.nlm.nih.gov/pubmed/35100338 http://dx.doi.org/10.1093/g3journal/jkab437 |
_version_ | 1784730135287562240 |
---|---|
author | Kuismin, Markku Dodangeh, Fatemeh Sillanpää, Mikko J |
author_facet | Kuismin, Markku Dodangeh, Fatemeh Sillanpää, Mikko J |
author_sort | Kuismin, Markku |
collection | PubMed |
description | We introduce a new model selection criterion for sparse complex gene network modeling where gene co-expression relationships are estimated from data. This is a novel formulation of the gap statistic and it can be used for the optimal choice of a regularization parameter in graphical models. Our criterion favors gene network structure which differs from a trivial gene interaction structure obtained totally at random. We call the criterion the gap-com statistic (gap community statistic). The idea of the gap-com statistic is to examine the difference between the observed and the expected counts of communities (clusters) where the expected counts are evaluated using either data permutations or reference graph (the Erdős-Rényi graph) resampling. The latter represents a trivial gene network structure determined by chance. We put emphasis on complex network inference because the structure of gene networks is usually nontrivial. For example, some of the genes can be clustered together or some genes can be hub genes. We evaluate the performance of the gap-com statistic in graphical model selection and compare its performance to some existing methods using simulated and real biological data examples. |
format | Online Article Text |
id | pubmed-9210289 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-92102892022-06-21 Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure Kuismin, Markku Dodangeh, Fatemeh Sillanpää, Mikko J G3 (Bethesda) Software and Data Resources We introduce a new model selection criterion for sparse complex gene network modeling where gene co-expression relationships are estimated from data. This is a novel formulation of the gap statistic and it can be used for the optimal choice of a regularization parameter in graphical models. Our criterion favors gene network structure which differs from a trivial gene interaction structure obtained totally at random. We call the criterion the gap-com statistic (gap community statistic). The idea of the gap-com statistic is to examine the difference between the observed and the expected counts of communities (clusters) where the expected counts are evaluated using either data permutations or reference graph (the Erdős-Rényi graph) resampling. The latter represents a trivial gene network structure determined by chance. We put emphasis on complex network inference because the structure of gene networks is usually nontrivial. For example, some of the genes can be clustered together or some genes can be hub genes. We evaluate the performance of the gap-com statistic in graphical model selection and compare its performance to some existing methods using simulated and real biological data examples. Oxford University Press 2021-12-21 /pmc/articles/PMC9210289/ /pubmed/35100338 http://dx.doi.org/10.1093/g3journal/jkab437 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software and Data Resources Kuismin, Markku Dodangeh, Fatemeh Sillanpää, Mikko J Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure |
title | Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure |
title_full | Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure |
title_fullStr | Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure |
title_full_unstemmed | Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure |
title_short | Gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure |
title_sort | gap-com: general model selection criterion for sparse undirected gene networks with nontrivial community structure |
topic | Software and Data Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9210289/ https://www.ncbi.nlm.nih.gov/pubmed/35100338 http://dx.doi.org/10.1093/g3journal/jkab437 |
work_keys_str_mv | AT kuisminmarkku gapcomgeneralmodelselectioncriterionforsparseundirectedgenenetworkswithnontrivialcommunitystructure AT dodangehfatemeh gapcomgeneralmodelselectioncriterionforsparseundirectedgenenetworkswithnontrivialcommunitystructure AT sillanpaamikkoj gapcomgeneralmodelselectioncriterionforsparseundirectedgenenetworkswithnontrivialcommunitystructure |