Cargando…

Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM

Despite the abundance of multimodal data, suitable statistical models that can improve our understanding of diseases with genetic underpinnings are challenging to develop. Here, we present SparseGMM, a statistical approach for gene regulatory network discovery. SparseGMM uses latent variable modelin...

Descripción completa

Detalles Bibliográficos
Autores principales: Bakr, Shaimaa, Brennan, Kevin, Mukherjee, Pritam, Argemi, Josepmaria, Hernaez, Mikel, Gevaert, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9939431/
https://www.ncbi.nlm.nih.gov/pubmed/36814838
http://dx.doi.org/10.1016/j.crmeth.2022.100392
_version_ 1784890850754428928
author Bakr, Shaimaa
Brennan, Kevin
Mukherjee, Pritam
Argemi, Josepmaria
Hernaez, Mikel
Gevaert, Olivier
author_facet Bakr, Shaimaa
Brennan, Kevin
Mukherjee, Pritam
Argemi, Josepmaria
Hernaez, Mikel
Gevaert, Olivier
author_sort Bakr, Shaimaa
collection PubMed
description Despite the abundance of multimodal data, suitable statistical models that can improve our understanding of diseases with genetic underpinnings are challenging to develop. Here, we present SparseGMM, a statistical approach for gene regulatory network discovery. SparseGMM uses latent variable modeling with sparsity constraints to learn Gaussian mixtures from multiomic data. By combining coexpression patterns with a Bayesian framework, SparseGMM quantitatively measures confidence in regulators and uncertainty in target gene assignment by computing gene entropy. We apply SparseGMM to liver cancer and normal liver tissue data and evaluate discovered gene modules in an independent single-cell RNA sequencing (scRNA-seq) dataset. SparseGMM identifies PROCR as a regulator of angiogenesis and PDCD1LG2 and HNF4A as regulators of immune response and blood coagulation in cancer. Furthermore, we show that more genes have significantly higher entropy in cancer compared with normal liver. Among high-entropy genes are key multifunctional components shared by critical pathways, including p53 and estrogen signaling.
format Online
Article
Text
id pubmed-9939431
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-99394312023-02-21 Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM Bakr, Shaimaa Brennan, Kevin Mukherjee, Pritam Argemi, Josepmaria Hernaez, Mikel Gevaert, Olivier Cell Rep Methods Article Despite the abundance of multimodal data, suitable statistical models that can improve our understanding of diseases with genetic underpinnings are challenging to develop. Here, we present SparseGMM, a statistical approach for gene regulatory network discovery. SparseGMM uses latent variable modeling with sparsity constraints to learn Gaussian mixtures from multiomic data. By combining coexpression patterns with a Bayesian framework, SparseGMM quantitatively measures confidence in regulators and uncertainty in target gene assignment by computing gene entropy. We apply SparseGMM to liver cancer and normal liver tissue data and evaluate discovered gene modules in an independent single-cell RNA sequencing (scRNA-seq) dataset. SparseGMM identifies PROCR as a regulator of angiogenesis and PDCD1LG2 and HNF4A as regulators of immune response and blood coagulation in cancer. Furthermore, we show that more genes have significantly higher entropy in cancer compared with normal liver. Among high-entropy genes are key multifunctional components shared by critical pathways, including p53 and estrogen signaling. Elsevier 2023-01-16 /pmc/articles/PMC9939431/ /pubmed/36814838 http://dx.doi.org/10.1016/j.crmeth.2022.100392 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Bakr, Shaimaa
Brennan, Kevin
Mukherjee, Pritam
Argemi, Josepmaria
Hernaez, Mikel
Gevaert, Olivier
Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM
title Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM
title_full Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM
title_fullStr Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM
title_full_unstemmed Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM
title_short Identifying key multifunctional components shared by critical cancer and normal liver pathways via SparseGMM
title_sort identifying key multifunctional components shared by critical cancer and normal liver pathways via sparsegmm
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9939431/
https://www.ncbi.nlm.nih.gov/pubmed/36814838
http://dx.doi.org/10.1016/j.crmeth.2022.100392
work_keys_str_mv AT bakrshaimaa identifyingkeymultifunctionalcomponentssharedbycriticalcancerandnormalliverpathwaysviasparsegmm
AT brennankevin identifyingkeymultifunctionalcomponentssharedbycriticalcancerandnormalliverpathwaysviasparsegmm
AT mukherjeepritam identifyingkeymultifunctionalcomponentssharedbycriticalcancerandnormalliverpathwaysviasparsegmm
AT argemijosepmaria identifyingkeymultifunctionalcomponentssharedbycriticalcancerandnormalliverpathwaysviasparsegmm
AT hernaezmikel identifyingkeymultifunctionalcomponentssharedbycriticalcancerandnormalliverpathwaysviasparsegmm
AT gevaertolivier identifyingkeymultifunctionalcomponentssharedbycriticalcancerandnormalliverpathwaysviasparsegmm