Cargando…

Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning

Motivation: The Cox proportional hazard models are widely used in the study of cancer survival. However, these models often meet challenges such as the large number of features and small sample sizes of cancer data sets. While this issue can be partially solved by applying regularization techniques...

Descripción completa

Detalles Bibliográficos
Autores principales: Malenová, Gabriela, Rowson, Daniel, Boeva, Valentina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8667553/
https://www.ncbi.nlm.nih.gov/pubmed/34912376
http://dx.doi.org/10.3389/fgene.2021.771301
_version_ 1784614404415815680
author Malenová, Gabriela
Rowson, Daniel
Boeva, Valentina
author_facet Malenová, Gabriela
Rowson, Daniel
Boeva, Valentina
author_sort Malenová, Gabriela
collection PubMed
description Motivation: The Cox proportional hazard models are widely used in the study of cancer survival. However, these models often meet challenges such as the large number of features and small sample sizes of cancer data sets. While this issue can be partially solved by applying regularization techniques such as lasso, the models still suffer from unsatisfactory predictive power and low stability. Methods: Here, we investigated two methods to improve survival models. Firstly, we leveraged the biological knowledge that groups of genes act together in pathways and regularized both at the group and gene level using latent group lasso penalty term. Secondly, we designed and applied a multi-task learning penalty that allowed us leveraging the relationship between survival models for different cancers. Results: We observed modest improvements over the simple lasso model with the inclusion of latent group lasso penalty for six of the 16 cancer types tested. The addition of a multi-task penalty, which penalized coefficients in pairs of cancers from diverging too greatly, significantly improved accuracy for a single cancer, lung squamous cell carcinoma, while having minimal effect on other cancer types. Conclusion: While the use of pathway information and multi-tasking shows some promise, these methods do not provide a substantial improvement when compared with standard methods.
format Online
Article
Text
id pubmed-8667553
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86675532021-12-14 Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning Malenová, Gabriela Rowson, Daniel Boeva, Valentina Front Genet Genetics Motivation: The Cox proportional hazard models are widely used in the study of cancer survival. However, these models often meet challenges such as the large number of features and small sample sizes of cancer data sets. While this issue can be partially solved by applying regularization techniques such as lasso, the models still suffer from unsatisfactory predictive power and low stability. Methods: Here, we investigated two methods to improve survival models. Firstly, we leveraged the biological knowledge that groups of genes act together in pathways and regularized both at the group and gene level using latent group lasso penalty term. Secondly, we designed and applied a multi-task learning penalty that allowed us leveraging the relationship between survival models for different cancers. Results: We observed modest improvements over the simple lasso model with the inclusion of latent group lasso penalty for six of the 16 cancer types tested. The addition of a multi-task penalty, which penalized coefficients in pairs of cancers from diverging too greatly, significantly improved accuracy for a single cancer, lung squamous cell carcinoma, while having minimal effect on other cancer types. Conclusion: While the use of pathway information and multi-tasking shows some promise, these methods do not provide a substantial improvement when compared with standard methods. Frontiers Media S.A. 2021-11-29 /pmc/articles/PMC8667553/ /pubmed/34912376 http://dx.doi.org/10.3389/fgene.2021.771301 Text en Copyright © 2021 Malenová, Rowson and Boeva. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Malenová, Gabriela
Rowson, Daniel
Boeva, Valentina
Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning
title Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning
title_full Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning
title_fullStr Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning
title_full_unstemmed Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning
title_short Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning
title_sort exploring pathway-based group lasso for cancer survival analysis: a special case of multi-task learning
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8667553/
https://www.ncbi.nlm.nih.gov/pubmed/34912376
http://dx.doi.org/10.3389/fgene.2021.771301
work_keys_str_mv AT malenovagabriela exploringpathwaybasedgrouplassoforcancersurvivalanalysisaspecialcaseofmultitasklearning
AT rowsondaniel exploringpathwaybasedgrouplassoforcancersurvivalanalysisaspecialcaseofmultitasklearning
AT boevavalentina exploringpathwaybasedgrouplassoforcancersurvivalanalysisaspecialcaseofmultitasklearning