Cargando…

Using biological constraints to improve prediction in precision oncology

Many gene signatures have been developed by applying machine learning (ML) on omics profiles, however, their clinical utility is often hindered by limited interpretability and unstable performance. Here, we show the importance of embedding prior biological knowledge in the decision rules yielded by...

Descripción completa

Detalles Bibliográficos
Autores principales: Omar, Mohamed, Dinalankara, Wikum, Mulder, Lotte, Coady, Tendai, Zanettini, Claudio, Imada, Eddie Luidy, Younes, Laurent, Geman, Donald, Marchionni, Luigi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9958363/
https://www.ncbi.nlm.nih.gov/pubmed/36852282
http://dx.doi.org/10.1016/j.isci.2023.106108
_version_ 1784895007321227264
author Omar, Mohamed
Dinalankara, Wikum
Mulder, Lotte
Coady, Tendai
Zanettini, Claudio
Imada, Eddie Luidy
Younes, Laurent
Geman, Donald
Marchionni, Luigi
author_facet Omar, Mohamed
Dinalankara, Wikum
Mulder, Lotte
Coady, Tendai
Zanettini, Claudio
Imada, Eddie Luidy
Younes, Laurent
Geman, Donald
Marchionni, Luigi
author_sort Omar, Mohamed
collection PubMed
description Many gene signatures have been developed by applying machine learning (ML) on omics profiles, however, their clinical utility is often hindered by limited interpretability and unstable performance. Here, we show the importance of embedding prior biological knowledge in the decision rules yielded by ML approaches to build robust classifiers. We tested this by applying different ML algorithms on gene expression data to predict three difficult cancer phenotypes: bladder cancer progression to muscle-invasive disease, response to neoadjuvant chemotherapy in triple-negative breast cancer, and prostate cancer metastatic progression. We developed two sets of classifiers: mechanistic, by restricting the training to features capturing specific biological mechanisms; and agnostic, in which the training did not use any a priori biological information. Mechanistic models had a similar or better testing performance than their agnostic counterparts, with enhanced interpretability. Our findings support the use of biological constraints to develop robust gene signatures with high translational potential.
format Online
Article
Text
id pubmed-9958363
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-99583632023-02-26 Using biological constraints to improve prediction in precision oncology Omar, Mohamed Dinalankara, Wikum Mulder, Lotte Coady, Tendai Zanettini, Claudio Imada, Eddie Luidy Younes, Laurent Geman, Donald Marchionni, Luigi iScience Article Many gene signatures have been developed by applying machine learning (ML) on omics profiles, however, their clinical utility is often hindered by limited interpretability and unstable performance. Here, we show the importance of embedding prior biological knowledge in the decision rules yielded by ML approaches to build robust classifiers. We tested this by applying different ML algorithms on gene expression data to predict three difficult cancer phenotypes: bladder cancer progression to muscle-invasive disease, response to neoadjuvant chemotherapy in triple-negative breast cancer, and prostate cancer metastatic progression. We developed two sets of classifiers: mechanistic, by restricting the training to features capturing specific biological mechanisms; and agnostic, in which the training did not use any a priori biological information. Mechanistic models had a similar or better testing performance than their agnostic counterparts, with enhanced interpretability. Our findings support the use of biological constraints to develop robust gene signatures with high translational potential. Elsevier 2023-02-02 /pmc/articles/PMC9958363/ /pubmed/36852282 http://dx.doi.org/10.1016/j.isci.2023.106108 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Omar, Mohamed
Dinalankara, Wikum
Mulder, Lotte
Coady, Tendai
Zanettini, Claudio
Imada, Eddie Luidy
Younes, Laurent
Geman, Donald
Marchionni, Luigi
Using biological constraints to improve prediction in precision oncology
title Using biological constraints to improve prediction in precision oncology
title_full Using biological constraints to improve prediction in precision oncology
title_fullStr Using biological constraints to improve prediction in precision oncology
title_full_unstemmed Using biological constraints to improve prediction in precision oncology
title_short Using biological constraints to improve prediction in precision oncology
title_sort using biological constraints to improve prediction in precision oncology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9958363/
https://www.ncbi.nlm.nih.gov/pubmed/36852282
http://dx.doi.org/10.1016/j.isci.2023.106108
work_keys_str_mv AT omarmohamed usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT dinalankarawikum usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT mulderlotte usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT coadytendai usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT zanettiniclaudio usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT imadaeddieluidy usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT youneslaurent usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT gemandonald usingbiologicalconstraintstoimprovepredictioninprecisiononcology
AT marchionniluigi usingbiologicalconstraintstoimprovepredictioninprecisiononcology