Cargando…

Balanced Functional Module Detection in genomic data

MOTIVATION: High-dimensional genomic data can be analyzed to understand the effects of variables on a target variable such as a clinical outcome. For understanding the underlying biological mechanism affecting the target, it is important to discover the complete set of relevant variables. Thus varia...

Descripción completa

Detalles Bibliográficos
Autores principales: Tritchler, David, Towle-Miller, Lorin M, Miecznikowski, Jeffrey C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710612/
https://www.ncbi.nlm.nih.gov/pubmed/36700111
http://dx.doi.org/10.1093/bioadv/vbab018
_version_ 1784841403851866112
author Tritchler, David
Towle-Miller, Lorin M
Miecznikowski, Jeffrey C
author_facet Tritchler, David
Towle-Miller, Lorin M
Miecznikowski, Jeffrey C
author_sort Tritchler, David
collection PubMed
description MOTIVATION: High-dimensional genomic data can be analyzed to understand the effects of variables on a target variable such as a clinical outcome. For understanding the underlying biological mechanism affecting the target, it is important to discover the complete set of relevant variables. Thus variable selection is a primary goal, which differs from a prediction criterion. Of special interest are functional modules, cooperating sets of variables affecting the target which can be characterized by a graph. In applications such as social networks, the concept of balance in undirected signed graphs characterizes the consistency of associations within the network. This property requires that the module variables have a joint effect on the target outcome with no internal conflict, an efficiency that may be applied to biological networks. RESULTS: In this paper, we model genomic variables in signed undirected graphs for applications where the set of predictor variables influences an outcome. Consequences of the balance property are exploited to implement a new module discovery algorithm, balanced Functional Module Detection (bFMD), which selects a subset of variables from high-dimensional data that compose a balanced functional module. Our bFMD algorithm performed favorably in simulations as compared to other module detection methods. Additionally, bFMD detected interpretable results in an application using RNA-seq data obtained from subjects with Uterine Corpus Endometrial Carcinoma using the percentage of tumor invasion as the outcome of interest. The variables selected by bFMD have improved interpretability due to the logical consistency afforded by the balance property. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9710612
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97106122023-01-24 Balanced Functional Module Detection in genomic data Tritchler, David Towle-Miller, Lorin M Miecznikowski, Jeffrey C Bioinform Adv Original Article MOTIVATION: High-dimensional genomic data can be analyzed to understand the effects of variables on a target variable such as a clinical outcome. For understanding the underlying biological mechanism affecting the target, it is important to discover the complete set of relevant variables. Thus variable selection is a primary goal, which differs from a prediction criterion. Of special interest are functional modules, cooperating sets of variables affecting the target which can be characterized by a graph. In applications such as social networks, the concept of balance in undirected signed graphs characterizes the consistency of associations within the network. This property requires that the module variables have a joint effect on the target outcome with no internal conflict, an efficiency that may be applied to biological networks. RESULTS: In this paper, we model genomic variables in signed undirected graphs for applications where the set of predictor variables influences an outcome. Consequences of the balance property are exploited to implement a new module discovery algorithm, balanced Functional Module Detection (bFMD), which selects a subset of variables from high-dimensional data that compose a balanced functional module. Our bFMD algorithm performed favorably in simulations as compared to other module detection methods. Additionally, bFMD detected interpretable results in an application using RNA-seq data obtained from subjects with Uterine Corpus Endometrial Carcinoma using the percentage of tumor invasion as the outcome of interest. The variables selected by bFMD have improved interpretability due to the logical consistency afforded by the balance property. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2021-09-16 /pmc/articles/PMC9710612/ /pubmed/36700111 http://dx.doi.org/10.1093/bioadv/vbab018 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Tritchler, David
Towle-Miller, Lorin M
Miecznikowski, Jeffrey C
Balanced Functional Module Detection in genomic data
title Balanced Functional Module Detection in genomic data
title_full Balanced Functional Module Detection in genomic data
title_fullStr Balanced Functional Module Detection in genomic data
title_full_unstemmed Balanced Functional Module Detection in genomic data
title_short Balanced Functional Module Detection in genomic data
title_sort balanced functional module detection in genomic data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710612/
https://www.ncbi.nlm.nih.gov/pubmed/36700111
http://dx.doi.org/10.1093/bioadv/vbab018
work_keys_str_mv AT tritchlerdavid balancedfunctionalmoduledetectioningenomicdata
AT towlemillerlorinm balancedfunctionalmoduledetectioningenomicdata
AT miecznikowskijeffreyc balancedfunctionalmoduledetectioningenomicdata