Cargando…
Executable pathway analysis using ensemble discrete-state modeling for large-scale data
Pathway analysis is widely used to gain mechanistic insights from high-throughput omics data. However, most existing methods do not consider signal integration represented by pathway topology, resulting in enrichment of convergent pathways when downstream genes are modulated. Incorporation of signal...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6743792/ https://www.ncbi.nlm.nih.gov/pubmed/31479446 http://dx.doi.org/10.1371/journal.pcbi.1007317 |
_version_ | 1783451330136768512 |
---|---|
author | Palli, Rohith Palshikar, Mukta G. Thakar, Juilee |
author_facet | Palli, Rohith Palshikar, Mukta G. Thakar, Juilee |
author_sort | Palli, Rohith |
collection | PubMed |
description | Pathway analysis is widely used to gain mechanistic insights from high-throughput omics data. However, most existing methods do not consider signal integration represented by pathway topology, resulting in enrichment of convergent pathways when downstream genes are modulated. Incorporation of signal flow and integration in pathway analysis could rank the pathways based on modulation in key regulatory genes. This implementation can be facilitated for large-scale data by discrete state network modeling due to simplicity in parameterization. Here, we model cellular heterogeneity using discrete state dynamics and measure pathway activities in cross-sectional data. We introduce a new algorithm, Boolean Omics Network Invariant-Time Analysis (BONITA), for signal propagation, signal integration, and pathway analysis. Our signal propagation approach models heterogeneity in transcriptomic data as arising from intercellular heterogeneity rather than intracellular stochasticity, and propagates binary signals repeatedly across networks. Logic rules defining signal integration are inferred by genetic algorithm and are refined by local search. The rules determine the impact of each node in a pathway, which is used to score the probability of the pathway’s modulation by chance. We have comprehensively tested BONITA for application to transcriptomics data from translational studies. Comparison with state-of-the-art pathway analysis methods shows that BONITA has higher sensitivity at lower levels of source node modulation and similar sensitivity at higher levels of source node modulation. Application of BONITA pathway analysis to previously validated RNA-sequencing studies identifies additional relevant pathways in in-vitro human cell line experiments and in-vivo infant studies. Additionally, BONITA successfully detected modulation of disease specific pathways when comparing relevant RNA-sequencing data with healthy controls. Most interestingly, the two highest impact score nodes identified by BONITA included known drug targets. Thus, BONITA is a powerful approach to prioritize not only pathways but also specific mechanistic role of genes compared to existing methods. BONITA is available at: https://github.com/thakar-lab/BONITA. |
format | Online Article Text |
id | pubmed-6743792 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-67437922019-09-20 Executable pathway analysis using ensemble discrete-state modeling for large-scale data Palli, Rohith Palshikar, Mukta G. Thakar, Juilee PLoS Comput Biol Research Article Pathway analysis is widely used to gain mechanistic insights from high-throughput omics data. However, most existing methods do not consider signal integration represented by pathway topology, resulting in enrichment of convergent pathways when downstream genes are modulated. Incorporation of signal flow and integration in pathway analysis could rank the pathways based on modulation in key regulatory genes. This implementation can be facilitated for large-scale data by discrete state network modeling due to simplicity in parameterization. Here, we model cellular heterogeneity using discrete state dynamics and measure pathway activities in cross-sectional data. We introduce a new algorithm, Boolean Omics Network Invariant-Time Analysis (BONITA), for signal propagation, signal integration, and pathway analysis. Our signal propagation approach models heterogeneity in transcriptomic data as arising from intercellular heterogeneity rather than intracellular stochasticity, and propagates binary signals repeatedly across networks. Logic rules defining signal integration are inferred by genetic algorithm and are refined by local search. The rules determine the impact of each node in a pathway, which is used to score the probability of the pathway’s modulation by chance. We have comprehensively tested BONITA for application to transcriptomics data from translational studies. Comparison with state-of-the-art pathway analysis methods shows that BONITA has higher sensitivity at lower levels of source node modulation and similar sensitivity at higher levels of source node modulation. Application of BONITA pathway analysis to previously validated RNA-sequencing studies identifies additional relevant pathways in in-vitro human cell line experiments and in-vivo infant studies. Additionally, BONITA successfully detected modulation of disease specific pathways when comparing relevant RNA-sequencing data with healthy controls. Most interestingly, the two highest impact score nodes identified by BONITA included known drug targets. Thus, BONITA is a powerful approach to prioritize not only pathways but also specific mechanistic role of genes compared to existing methods. BONITA is available at: https://github.com/thakar-lab/BONITA. Public Library of Science 2019-09-03 /pmc/articles/PMC6743792/ /pubmed/31479446 http://dx.doi.org/10.1371/journal.pcbi.1007317 Text en © 2019 Palli et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Palli, Rohith Palshikar, Mukta G. Thakar, Juilee Executable pathway analysis using ensemble discrete-state modeling for large-scale data |
title | Executable pathway analysis using ensemble discrete-state modeling for large-scale data |
title_full | Executable pathway analysis using ensemble discrete-state modeling for large-scale data |
title_fullStr | Executable pathway analysis using ensemble discrete-state modeling for large-scale data |
title_full_unstemmed | Executable pathway analysis using ensemble discrete-state modeling for large-scale data |
title_short | Executable pathway analysis using ensemble discrete-state modeling for large-scale data |
title_sort | executable pathway analysis using ensemble discrete-state modeling for large-scale data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6743792/ https://www.ncbi.nlm.nih.gov/pubmed/31479446 http://dx.doi.org/10.1371/journal.pcbi.1007317 |
work_keys_str_mv | AT pallirohith executablepathwayanalysisusingensemblediscretestatemodelingforlargescaledata AT palshikarmuktag executablepathwayanalysisusingensemblediscretestatemodelingforlargescaledata AT thakarjuilee executablepathwayanalysisusingensemblediscretestatemodelingforlargescaledata |