Cargando…

Structural Equation Modeling of In silico Perturbations

Gene expression is controlled by multiple regulators and their interactions. Data from genome-wide gene expression assays can be used to estimate molecular activities of regulators within a model organism and extrapolate them to biological processes in humans. This approach is valuable in studies to...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jianying, Bushel, Pierre R., Lin, Lin, Day, Kevin, Wang, Tianyuan, DeMayo, Francesco J., Wu, San-Pin, Li, Jian-Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8652139/
https://www.ncbi.nlm.nih.gov/pubmed/34899830
http://dx.doi.org/10.3389/fgene.2021.727532
_version_ 1784611529505636352
author Li, Jianying
Bushel, Pierre R.
Lin, Lin
Day, Kevin
Wang, Tianyuan
DeMayo, Francesco J.
Wu, San-Pin
Li, Jian-Liang
author_facet Li, Jianying
Bushel, Pierre R.
Lin, Lin
Day, Kevin
Wang, Tianyuan
DeMayo, Francesco J.
Wu, San-Pin
Li, Jian-Liang
author_sort Li, Jianying
collection PubMed
description Gene expression is controlled by multiple regulators and their interactions. Data from genome-wide gene expression assays can be used to estimate molecular activities of regulators within a model organism and extrapolate them to biological processes in humans. This approach is valuable in studies to better understand complex human biological systems which may be involved in diseases and hence, have potential clinical relevance. In order to achieve this, it is necessary to infer gene interactions that are not directly observed (i.e. latent or hidden) by way of structural equation modeling (SEM) on the expression levels or activities of the downstream targets of regulator genes. Here we developed an R Shiny application, termed “Structural Equation Modeling of In silico Perturbations (SEMIPs)” to compute a two-sided t-statistic (T-score) from analysis of gene expression data, as a surrogate to gene activity in a given human specimen. SEMIPs can be used in either correlational studies between outcome variables of interest or subsequent model fitting on multiple variables. This application implements a 3-node SEM model that consists of two upstream regulators as input variables and one downstream reporter as an outcome variable to examine the significance of interactions among these variables. SEMIPs enables scientists to investigate gene interactions among three variables through computational and mathematical modeling (i.e. in silico). In a case study using SEMIPs, we have shown that putative direct downstream genes of the GATA Binding Protein 2 (GATA2) transcription factor are sufficient to infer its activities in silico for the conserved progesterone receptor (PGR)-GATA2-SRY-box transcription factor 17 (SOX17) genetic network in the human uterine endometrium.
format Online
Article
Text
id pubmed-8652139
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86521392021-12-09 Structural Equation Modeling of In silico Perturbations Li, Jianying Bushel, Pierre R. Lin, Lin Day, Kevin Wang, Tianyuan DeMayo, Francesco J. Wu, San-Pin Li, Jian-Liang Front Genet Genetics Gene expression is controlled by multiple regulators and their interactions. Data from genome-wide gene expression assays can be used to estimate molecular activities of regulators within a model organism and extrapolate them to biological processes in humans. This approach is valuable in studies to better understand complex human biological systems which may be involved in diseases and hence, have potential clinical relevance. In order to achieve this, it is necessary to infer gene interactions that are not directly observed (i.e. latent or hidden) by way of structural equation modeling (SEM) on the expression levels or activities of the downstream targets of regulator genes. Here we developed an R Shiny application, termed “Structural Equation Modeling of In silico Perturbations (SEMIPs)” to compute a two-sided t-statistic (T-score) from analysis of gene expression data, as a surrogate to gene activity in a given human specimen. SEMIPs can be used in either correlational studies between outcome variables of interest or subsequent model fitting on multiple variables. This application implements a 3-node SEM model that consists of two upstream regulators as input variables and one downstream reporter as an outcome variable to examine the significance of interactions among these variables. SEMIPs enables scientists to investigate gene interactions among three variables through computational and mathematical modeling (i.e. in silico). In a case study using SEMIPs, we have shown that putative direct downstream genes of the GATA Binding Protein 2 (GATA2) transcription factor are sufficient to infer its activities in silico for the conserved progesterone receptor (PGR)-GATA2-SRY-box transcription factor 17 (SOX17) genetic network in the human uterine endometrium. Frontiers Media S.A. 2021-11-24 /pmc/articles/PMC8652139/ /pubmed/34899830 http://dx.doi.org/10.3389/fgene.2021.727532 Text en Copyright © 2021 Li, Bushel, Lin, Day, Wang, DeMayo, Wu and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Jianying
Bushel, Pierre R.
Lin, Lin
Day, Kevin
Wang, Tianyuan
DeMayo, Francesco J.
Wu, San-Pin
Li, Jian-Liang
Structural Equation Modeling of In silico Perturbations
title Structural Equation Modeling of In silico Perturbations
title_full Structural Equation Modeling of In silico Perturbations
title_fullStr Structural Equation Modeling of In silico Perturbations
title_full_unstemmed Structural Equation Modeling of In silico Perturbations
title_short Structural Equation Modeling of In silico Perturbations
title_sort structural equation modeling of in silico perturbations
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8652139/
https://www.ncbi.nlm.nih.gov/pubmed/34899830
http://dx.doi.org/10.3389/fgene.2021.727532
work_keys_str_mv AT lijianying structuralequationmodelingofinsilicoperturbations
AT bushelpierrer structuralequationmodelingofinsilicoperturbations
AT linlin structuralequationmodelingofinsilicoperturbations
AT daykevin structuralequationmodelingofinsilicoperturbations
AT wangtianyuan structuralequationmodelingofinsilicoperturbations
AT demayofrancescoj structuralequationmodelingofinsilicoperturbations
AT wusanpin structuralequationmodelingofinsilicoperturbations
AT lijianliang structuralequationmodelingofinsilicoperturbations