Cargando…

OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer

In many real-world applications, such as those based on electronic health records, prognostic prediction of patient survival is based on heterogeneous sets of clinical laboratory measurements. To address the trade-off between the predictive accuracy of a prognostic model and the costs related to its...

Descripción completa

Detalles Bibliográficos
Autores principales: Halkola, Anni S., Joki, Kaisa, Mirtti, Tuomas, Mäkelä, Marko M., Aittokallio, Tero, Laajala, Teemu D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10032505/
https://www.ncbi.nlm.nih.gov/pubmed/36897911
http://dx.doi.org/10.1371/journal.pcbi.1010333
_version_ 1784910815315361792
author Halkola, Anni S.
Joki, Kaisa
Mirtti, Tuomas
Mäkelä, Marko M.
Aittokallio, Tero
Laajala, Teemu D.
author_facet Halkola, Anni S.
Joki, Kaisa
Mirtti, Tuomas
Mäkelä, Marko M.
Aittokallio, Tero
Laajala, Teemu D.
author_sort Halkola, Anni S.
collection PubMed
description In many real-world applications, such as those based on electronic health records, prognostic prediction of patient survival is based on heterogeneous sets of clinical laboratory measurements. To address the trade-off between the predictive accuracy of a prognostic model and the costs related to its clinical implementation, we propose an optimized L(0)-pseudonorm approach to learn sparse solutions in multivariable regression. The model sparsity is maintained by restricting the number of nonzero coefficients in the model with a cardinality constraint, which makes the optimization problem NP-hard. In addition, we generalize the cardinality constraint for grouped feature selection, which makes it possible to identify key sets of predictors that may be measured together in a kit in clinical practice. We demonstrate the operation of our cardinality constraint-based feature subset selection method, named OSCAR, in the context of prognostic prediction of prostate cancer patients, where it enables one to determine the key explanatory predictors at different levels of model sparsity. We further explore how the model sparsity affects the model accuracy and implementation cost. Lastly, we demonstrate generalization of the presented methodology to high-dimensional transcriptomics data.
format Online
Article
Text
id pubmed-10032505
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-100325052023-03-23 OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer Halkola, Anni S. Joki, Kaisa Mirtti, Tuomas Mäkelä, Marko M. Aittokallio, Tero Laajala, Teemu D. PLoS Comput Biol Research Article In many real-world applications, such as those based on electronic health records, prognostic prediction of patient survival is based on heterogeneous sets of clinical laboratory measurements. To address the trade-off between the predictive accuracy of a prognostic model and the costs related to its clinical implementation, we propose an optimized L(0)-pseudonorm approach to learn sparse solutions in multivariable regression. The model sparsity is maintained by restricting the number of nonzero coefficients in the model with a cardinality constraint, which makes the optimization problem NP-hard. In addition, we generalize the cardinality constraint for grouped feature selection, which makes it possible to identify key sets of predictors that may be measured together in a kit in clinical practice. We demonstrate the operation of our cardinality constraint-based feature subset selection method, named OSCAR, in the context of prognostic prediction of prostate cancer patients, where it enables one to determine the key explanatory predictors at different levels of model sparsity. We further explore how the model sparsity affects the model accuracy and implementation cost. Lastly, we demonstrate generalization of the presented methodology to high-dimensional transcriptomics data. Public Library of Science 2023-03-10 /pmc/articles/PMC10032505/ /pubmed/36897911 http://dx.doi.org/10.1371/journal.pcbi.1010333 Text en © 2023 Halkola et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Halkola, Anni S.
Joki, Kaisa
Mirtti, Tuomas
Mäkelä, Marko M.
Aittokallio, Tero
Laajala, Teemu D.
OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer
title OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer
title_full OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer
title_fullStr OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer
title_full_unstemmed OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer
title_short OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer
title_sort oscar: optimal subset cardinality regression using the l0-pseudonorm with applications to prognostic modelling of prostate cancer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10032505/
https://www.ncbi.nlm.nih.gov/pubmed/36897911
http://dx.doi.org/10.1371/journal.pcbi.1010333
work_keys_str_mv AT halkolaannis oscaroptimalsubsetcardinalityregressionusingthel0pseudonormwithapplicationstoprognosticmodellingofprostatecancer
AT jokikaisa oscaroptimalsubsetcardinalityregressionusingthel0pseudonormwithapplicationstoprognosticmodellingofprostatecancer
AT mirttituomas oscaroptimalsubsetcardinalityregressionusingthel0pseudonormwithapplicationstoprognosticmodellingofprostatecancer
AT makelamarkom oscaroptimalsubsetcardinalityregressionusingthel0pseudonormwithapplicationstoprognosticmodellingofprostatecancer
AT aittokalliotero oscaroptimalsubsetcardinalityregressionusingthel0pseudonormwithapplicationstoprognosticmodellingofprostatecancer
AT laajalateemud oscaroptimalsubsetcardinalityregressionusingthel0pseudonormwithapplicationstoprognosticmodellingofprostatecancer