Cargando…

Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity

BACKGROUND: High throughput technologies have been used to profile genes in multiple different dimensions, such as genetic variation, copy number, gene and protein expression, epigenetics, metabolomics. Computational analyses often treat these different data types as independent, leading to an explo...

Descripción completa

Detalles Bibliográficos
Autores principales: Pavel, Ana B., Sonkin, Dmitriy, Reddy, Anupama
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4750289/
https://www.ncbi.nlm.nih.gov/pubmed/26864072
http://dx.doi.org/10.1186/s12918-016-0260-9
_version_ 1782415413365178368
author Pavel, Ana B.
Sonkin, Dmitriy
Reddy, Anupama
author_facet Pavel, Ana B.
Sonkin, Dmitriy
Reddy, Anupama
author_sort Pavel, Ana B.
collection PubMed
description BACKGROUND: High throughput technologies have been used to profile genes in multiple different dimensions, such as genetic variation, copy number, gene and protein expression, epigenetics, metabolomics. Computational analyses often treat these different data types as independent, leading to an explosion in the number of features making studies under-powered and more importantly do not provide a comprehensive view of the gene’s state. We sought to infer gene activity by integrating different dimensions using biological knowledge of oncogenes and tumor suppressors. RESULTS: This paper proposes an integrative model of oncogene and tumor suppressor activity in cells which is used to identify cancer drivers and compute patient-specific gene activity scores. We have developed a Fuzzy Logic Modeling (FLM) framework to incorporate biological knowledge with multi-omics data such as somatic mutation, gene expression and copy number measurements. The advantage of using a fuzzy logic approach is to abstract meaningful biological rules from low-level numerical data. Biological knowledge is often qualitative, thus combining it with quantitative numerical measurements may leverage new biological insights about a gene’s state. We show that the oncogenic and altered tumor suppressing state of a gene can be better characterized by integrating different molecular measurements with biological knowledge than by each data type alone. We validate the gene activity score using data from the Cancer Cell Line Encyclopedia and drug sensitivity data for five compounds: BYL719 (PIK3CA inhibitor), PLX4720 (BRAF inhibitor), AZD6244 (MEK inhibitor), Erlotinib (EGFR inhibitor), and Nutlin-3 (MDM2 inhibitor). The integrative score improves prediction of drug sensitivity for the known drug targets of these compounds compared to each data type alone. The gene activity scores are also used to cluster colorectal cancer cell lines. Two subtypes of CRCs were found and potential cancer drivers and therapeutic targets for each of the subtypes were identified. CONCLUSIONS: We propose a fuzzy logic based approach to infer gene activity in cancer by integrating numerical data with descriptive biological knowledge. We compute general patient-specific gene-level scores useful to determine the oncogenic or tumor suppressor status of cancer gene drivers and to cluster or classify patients. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-016-0260-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4750289
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47502892016-02-12 Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity Pavel, Ana B. Sonkin, Dmitriy Reddy, Anupama BMC Syst Biol Methodology Article BACKGROUND: High throughput technologies have been used to profile genes in multiple different dimensions, such as genetic variation, copy number, gene and protein expression, epigenetics, metabolomics. Computational analyses often treat these different data types as independent, leading to an explosion in the number of features making studies under-powered and more importantly do not provide a comprehensive view of the gene’s state. We sought to infer gene activity by integrating different dimensions using biological knowledge of oncogenes and tumor suppressors. RESULTS: This paper proposes an integrative model of oncogene and tumor suppressor activity in cells which is used to identify cancer drivers and compute patient-specific gene activity scores. We have developed a Fuzzy Logic Modeling (FLM) framework to incorporate biological knowledge with multi-omics data such as somatic mutation, gene expression and copy number measurements. The advantage of using a fuzzy logic approach is to abstract meaningful biological rules from low-level numerical data. Biological knowledge is often qualitative, thus combining it with quantitative numerical measurements may leverage new biological insights about a gene’s state. We show that the oncogenic and altered tumor suppressing state of a gene can be better characterized by integrating different molecular measurements with biological knowledge than by each data type alone. We validate the gene activity score using data from the Cancer Cell Line Encyclopedia and drug sensitivity data for five compounds: BYL719 (PIK3CA inhibitor), PLX4720 (BRAF inhibitor), AZD6244 (MEK inhibitor), Erlotinib (EGFR inhibitor), and Nutlin-3 (MDM2 inhibitor). The integrative score improves prediction of drug sensitivity for the known drug targets of these compounds compared to each data type alone. The gene activity scores are also used to cluster colorectal cancer cell lines. Two subtypes of CRCs were found and potential cancer drivers and therapeutic targets for each of the subtypes were identified. CONCLUSIONS: We propose a fuzzy logic based approach to infer gene activity in cancer by integrating numerical data with descriptive biological knowledge. We compute general patient-specific gene-level scores useful to determine the oncogenic or tumor suppressor status of cancer gene drivers and to cluster or classify patients. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-016-0260-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-11 /pmc/articles/PMC4750289/ /pubmed/26864072 http://dx.doi.org/10.1186/s12918-016-0260-9 Text en © Pavel et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Pavel, Ana B.
Sonkin, Dmitriy
Reddy, Anupama
Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity
title Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity
title_full Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity
title_fullStr Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity
title_full_unstemmed Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity
title_short Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity
title_sort integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4750289/
https://www.ncbi.nlm.nih.gov/pubmed/26864072
http://dx.doi.org/10.1186/s12918-016-0260-9
work_keys_str_mv AT pavelanab integrativemodelingofmultiomicsdatatoidentifycancerdriversandinferpatientspecificgeneactivity
AT sonkindmitriy integrativemodelingofmultiomicsdatatoidentifycancerdriversandinferpatientspecificgeneactivity
AT reddyanupama integrativemodelingofmultiomicsdatatoidentifycancerdriversandinferpatientspecificgeneactivity