Cargando…

Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference

BACKGROUND: The analysis of integrated multi-omics data enables the identification of disease-related biomarkers that cannot be identified from a single omics profile. Although protein-level data reflects the cellular status of cancer tissue more directly than gene-level data, past studies have main...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Tae Rim, Jeong, Hyun-Hwan, Sohn, Kyung-Ah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6624183/
https://www.ncbi.nlm.nih.gov/pubmed/31296204
http://dx.doi.org/10.1186/s12920-019-0511-x
_version_ 1783434217343942656
author Kim, Tae Rim
Jeong, Hyun-Hwan
Sohn, Kyung-Ah
author_facet Kim, Tae Rim
Jeong, Hyun-Hwan
Sohn, Kyung-Ah
author_sort Kim, Tae Rim
collection PubMed
description BACKGROUND: The analysis of integrated multi-omics data enables the identification of disease-related biomarkers that cannot be identified from a single omics profile. Although protein-level data reflects the cellular status of cancer tissue more directly than gene-level data, past studies have mainly focused on multi-omics integration using gene-level data as opposed to protein-level data. However, the use of protein-level data (such as mass spectrometry) in multi-omics integration has some limitations. For example, the correlation between the characteristics of gene-level data (such as mRNA) and protein-level data is weak, and it is difficult to detect low-abundance signaling proteins that are used to target cancer. The reverse phase protein array (RPPA) is a highly sensitive antibody-based quantification method for signaling proteins. However, the number of protein features in RPPA data is extremely low compared to the number of gene features in gene-level data. In this study, we present a new method for integrating RPPA profiles with RNA-Seq and DNA methylation profiles for survival prediction based on the integrative directed random walk (iDRW) framework proposed in our previous study. In the iDRW framework, each omics profile is merged into a single pathway profile that reflects the topological information of the pathway. In order to address the sparsity of RPPA profiles, we employ the random walk with restart (RWR) approach on the pathway network. RESULTS: Our model was validated using survival prediction analysis for a breast cancer dataset from The Cancer Genome Atlas. Our proposed model exhibited improved performance compared with other methods that utilize pathway information and also out-performed models that did not include the RPPA data utilized in our study. The risk pathways identified for breast cancer in this study were closely related to well-known breast cancer risk pathways. CONCLUSIONS: Our results indicated that RPPA data is useful for survival prediction for breast cancer patients under our framework. We also observed that iDRW effectively integrates RNA-Seq, DNA methylation, and RPPA profiles, while variation in the composition of the omics data can affect both prediction performance and risk pathway identification. These results suggest that omics data composition is a critical parameter for iDRW. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-019-0511-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6624183
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66241832019-07-23 Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference Kim, Tae Rim Jeong, Hyun-Hwan Sohn, Kyung-Ah BMC Med Genomics Research BACKGROUND: The analysis of integrated multi-omics data enables the identification of disease-related biomarkers that cannot be identified from a single omics profile. Although protein-level data reflects the cellular status of cancer tissue more directly than gene-level data, past studies have mainly focused on multi-omics integration using gene-level data as opposed to protein-level data. However, the use of protein-level data (such as mass spectrometry) in multi-omics integration has some limitations. For example, the correlation between the characteristics of gene-level data (such as mRNA) and protein-level data is weak, and it is difficult to detect low-abundance signaling proteins that are used to target cancer. The reverse phase protein array (RPPA) is a highly sensitive antibody-based quantification method for signaling proteins. However, the number of protein features in RPPA data is extremely low compared to the number of gene features in gene-level data. In this study, we present a new method for integrating RPPA profiles with RNA-Seq and DNA methylation profiles for survival prediction based on the integrative directed random walk (iDRW) framework proposed in our previous study. In the iDRW framework, each omics profile is merged into a single pathway profile that reflects the topological information of the pathway. In order to address the sparsity of RPPA profiles, we employ the random walk with restart (RWR) approach on the pathway network. RESULTS: Our model was validated using survival prediction analysis for a breast cancer dataset from The Cancer Genome Atlas. Our proposed model exhibited improved performance compared with other methods that utilize pathway information and also out-performed models that did not include the RPPA data utilized in our study. The risk pathways identified for breast cancer in this study were closely related to well-known breast cancer risk pathways. CONCLUSIONS: Our results indicated that RPPA data is useful for survival prediction for breast cancer patients under our framework. We also observed that iDRW effectively integrates RNA-Seq, DNA methylation, and RPPA profiles, while variation in the composition of the omics data can affect both prediction performance and risk pathway identification. These results suggest that omics data composition is a critical parameter for iDRW. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-019-0511-x) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-11 /pmc/articles/PMC6624183/ /pubmed/31296204 http://dx.doi.org/10.1186/s12920-019-0511-x Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Kim, Tae Rim
Jeong, Hyun-Hwan
Sohn, Kyung-Ah
Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
title Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
title_full Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
title_fullStr Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
title_full_unstemmed Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
title_short Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
title_sort topological integration of rppa proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6624183/
https://www.ncbi.nlm.nih.gov/pubmed/31296204
http://dx.doi.org/10.1186/s12920-019-0511-x
work_keys_str_mv AT kimtaerim topologicalintegrationofrppaproteomicdatawithmultiomicsdataforsurvivalpredictioninbreastcancerviapathwayactivityinference
AT jeonghyunhwan topologicalintegrationofrppaproteomicdatawithmultiomicsdataforsurvivalpredictioninbreastcancerviapathwayactivityinference
AT sohnkyungah topologicalintegrationofrppaproteomicdatawithmultiomicsdataforsurvivalpredictioninbreastcancerviapathwayactivityinference