Cargando…

Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment

MOTIVATION: Despite the success of recent machine learning algorithms’ applications to survival analysis, their black-box nature hinders interpretability, which is arguably the most important aspect. Similarly, multi-omics data integration for survival analysis is often constrained by the underlying...

Descripción completa

Detalles Bibliográficos
Autores principales: Cho, Hyun Jae, Shu, Mia, Bekiranov, Stefan, Zang, Chongzhi, Zhang, Aidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10079355/
https://www.ncbi.nlm.nih.gov/pubmed/36864611
http://dx.doi.org/10.1093/bioinformatics/btad113
_version_ 1785020711350304768
author Cho, Hyun Jae
Shu, Mia
Bekiranov, Stefan
Zang, Chongzhi
Zhang, Aidong
author_facet Cho, Hyun Jae
Shu, Mia
Bekiranov, Stefan
Zang, Chongzhi
Zhang, Aidong
author_sort Cho, Hyun Jae
collection PubMed
description MOTIVATION: Despite the success of recent machine learning algorithms’ applications to survival analysis, their black-box nature hinders interpretability, which is arguably the most important aspect. Similarly, multi-omics data integration for survival analysis is often constrained by the underlying relationships and correlations that are rarely well understood. The goal of this work is to alleviate the interpretability problem in machine learning approaches for survival analysis and also demonstrate how multi-omics data integration improves survival analysis and pathway enrichment. We use meta-learning, a machine-learning algorithm that is trained on a variety of related datasets and allows quick adaptations to new tasks, to perform survival analysis and pathway enrichment on pan-cancer datasets. In recent machine learning research, meta-learning has been effectively used for knowledge transfer among multiple related datasets. RESULTS: We use meta-learning with Cox hazard loss to show that the integration of TCGA pan-cancer data increases the performance of survival analysis. We also apply advanced model interpretability method called DeepLIFT (Deep Learning Important FeaTures) to show different sets of enriched pathways for multi-omics and transcriptomics data. Our results show that multi-omics cancer survival analysis enhances performance compared with using transcriptomics or clinical data alone. Additionally, we show a correlation between variable importance assignment from DeepLIFT and gene coenrichment, suggesting that genes with higher and similar contribution scores are more likely to be enriched together in the same enrichment sets. AVAILABILITY AND IMPLEMENTATION: https://github.com/berkuva/TCGA-omics-integration.
format Online
Article
Text
id pubmed-10079355
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100793552023-04-07 Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment Cho, Hyun Jae Shu, Mia Bekiranov, Stefan Zang, Chongzhi Zhang, Aidong Bioinformatics Original Paper MOTIVATION: Despite the success of recent machine learning algorithms’ applications to survival analysis, their black-box nature hinders interpretability, which is arguably the most important aspect. Similarly, multi-omics data integration for survival analysis is often constrained by the underlying relationships and correlations that are rarely well understood. The goal of this work is to alleviate the interpretability problem in machine learning approaches for survival analysis and also demonstrate how multi-omics data integration improves survival analysis and pathway enrichment. We use meta-learning, a machine-learning algorithm that is trained on a variety of related datasets and allows quick adaptations to new tasks, to perform survival analysis and pathway enrichment on pan-cancer datasets. In recent machine learning research, meta-learning has been effectively used for knowledge transfer among multiple related datasets. RESULTS: We use meta-learning with Cox hazard loss to show that the integration of TCGA pan-cancer data increases the performance of survival analysis. We also apply advanced model interpretability method called DeepLIFT (Deep Learning Important FeaTures) to show different sets of enriched pathways for multi-omics and transcriptomics data. Our results show that multi-omics cancer survival analysis enhances performance compared with using transcriptomics or clinical data alone. Additionally, we show a correlation between variable importance assignment from DeepLIFT and gene coenrichment, suggesting that genes with higher and similar contribution scores are more likely to be enriched together in the same enrichment sets. AVAILABILITY AND IMPLEMENTATION: https://github.com/berkuva/TCGA-omics-integration. Oxford University Press 2023-03-02 /pmc/articles/PMC10079355/ /pubmed/36864611 http://dx.doi.org/10.1093/bioinformatics/btad113 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Cho, Hyun Jae
Shu, Mia
Bekiranov, Stefan
Zang, Chongzhi
Zhang, Aidong
Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment
title Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment
title_full Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment
title_fullStr Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment
title_full_unstemmed Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment
title_short Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment
title_sort interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10079355/
https://www.ncbi.nlm.nih.gov/pubmed/36864611
http://dx.doi.org/10.1093/bioinformatics/btad113
work_keys_str_mv AT chohyunjae interpretablemetalearningofmultiomicsdataforsurvivalanalysisandpathwayenrichment
AT shumia interpretablemetalearningofmultiomicsdataforsurvivalanalysisandpathwayenrichment
AT bekiranovstefan interpretablemetalearningofmultiomicsdataforsurvivalanalysisandpathwayenrichment
AT zangchongzhi interpretablemetalearningofmultiomicsdataforsurvivalanalysisandpathwayenrichment
AT zhangaidong interpretablemetalearningofmultiomicsdataforsurvivalanalysisandpathwayenrichment