Cargando…

Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction

Objective Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approa...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Dokyoon, Joung, Je-Gun, Sohn, Kyung-Ah, Shin, Hyunjung, Park, Yu Rang, Ritchie, Marylyn D, Kim, Ju Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433357/
https://www.ncbi.nlm.nih.gov/pubmed/25002459
http://dx.doi.org/10.1136/amiajnl-2013-002481
_version_ 1782371633733828608
author Kim, Dokyoon
Joung, Je-Gun
Sohn, Kyung-Ah
Shin, Hyunjung
Park, Yu Rang
Ritchie, Marylyn D
Kim, Ju Han
author_facet Kim, Dokyoon
Joung, Je-Gun
Sohn, Kyung-Ah
Shin, Hyunjung
Park, Yu Rang
Ritchie, Marylyn D
Kim, Ju Han
author_sort Kim, Dokyoon
collection PubMed
description Objective Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Methods Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Results Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Conclusions Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies.
format Online
Article
Text
id pubmed-4433357
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44333572016-01-01 Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction Kim, Dokyoon Joung, Je-Gun Sohn, Kyung-Ah Shin, Hyunjung Park, Yu Rang Ritchie, Marylyn D Kim, Ju Han J Am Med Inform Assoc Research and Applications Objective Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Methods Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Results Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Conclusions Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies. Oxford University Press 2015-01 2014-07-07 /pmc/articles/PMC4433357/ /pubmed/25002459 http://dx.doi.org/10.1136/amiajnl-2013-002481 Text en © The Author 2014. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.comFor numbered affiliations see end of article.
spellingShingle Research and Applications
Kim, Dokyoon
Joung, Je-Gun
Sohn, Kyung-Ah
Shin, Hyunjung
Park, Yu Rang
Ritchie, Marylyn D
Kim, Ju Han
Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction
title Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction
title_full Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction
title_fullStr Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction
title_full_unstemmed Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction
title_short Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction
title_sort knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433357/
https://www.ncbi.nlm.nih.gov/pubmed/25002459
http://dx.doi.org/10.1136/amiajnl-2013-002481
work_keys_str_mv AT kimdokyoon knowledgeboostingagraphbasedintegrationapproachwithmultiomicsdataandgenomicknowledgeforcancerclinicaloutcomeprediction
AT joungjegun knowledgeboostingagraphbasedintegrationapproachwithmultiomicsdataandgenomicknowledgeforcancerclinicaloutcomeprediction
AT sohnkyungah knowledgeboostingagraphbasedintegrationapproachwithmultiomicsdataandgenomicknowledgeforcancerclinicaloutcomeprediction
AT shinhyunjung knowledgeboostingagraphbasedintegrationapproachwithmultiomicsdataandgenomicknowledgeforcancerclinicaloutcomeprediction
AT parkyurang knowledgeboostingagraphbasedintegrationapproachwithmultiomicsdataandgenomicknowledgeforcancerclinicaloutcomeprediction
AT ritchiemarylynd knowledgeboostingagraphbasedintegrationapproachwithmultiomicsdataandgenomicknowledgeforcancerclinicaloutcomeprediction
AT kimjuhan knowledgeboostingagraphbasedintegrationapproachwithmultiomicsdataandgenomicknowledgeforcancerclinicaloutcomeprediction