Cargando…

Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma

BACKGROUND: Over the past several decades, metrics have been defined to assess the quality of various types of models and to compare their performance depending on their capacity to explain the variance found in real-life data. However, available validation methods are mostly designed for statistica...

Descripción completa

Detalles Bibliográficos
Autores principales: Jacob, Evgueni, Perrillat-Mercerot, Angélique, Palgen, Jean-Louis, L’Hostis, Adèle, Ceres, Nicoletta, Boissel, Jean-Pierre, Bosley, Jim, Monteiro, Claudio, Kahoul, Riad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10478282/
https://www.ncbi.nlm.nih.gov/pubmed/37667175
http://dx.doi.org/10.1186/s12859-023-05430-w
_version_ 1785101314786590720
author Jacob, Evgueni
Perrillat-Mercerot, Angélique
Palgen, Jean-Louis
L’Hostis, Adèle
Ceres, Nicoletta
Boissel, Jean-Pierre
Bosley, Jim
Monteiro, Claudio
Kahoul, Riad
author_facet Jacob, Evgueni
Perrillat-Mercerot, Angélique
Palgen, Jean-Louis
L’Hostis, Adèle
Ceres, Nicoletta
Boissel, Jean-Pierre
Bosley, Jim
Monteiro, Claudio
Kahoul, Riad
author_sort Jacob, Evgueni
collection PubMed
description BACKGROUND: Over the past several decades, metrics have been defined to assess the quality of various types of models and to compare their performance depending on their capacity to explain the variance found in real-life data. However, available validation methods are mostly designed for statistical regressions rather than for mechanistic models. To our knowledge, in the latter case, there are no consensus standards, for instance for the validation of predictions against real-world data given the variability and uncertainty of the data. In this work, we focus on the prediction of time-to-event curves using as an application example a mechanistic model of non-small cell lung cancer. We designed four empirical methods to assess both model performance and reliability of predictions: two methods based on bootstrapped versions of parametric statistical tests: log-rank and combined weighted log-ranks (MaxCombo); and two methods based on bootstrapped prediction intervals, referred to here as raw coverage and the juncture metric. We also introduced the notion of observation time uncertainty to take into consideration the real life delay between the moment when an event happens, and the moment when it is observed and reported. RESULTS: We highlight the advantages and disadvantages of these methods according to their application context. We have shown that the context of use of the model has an impact on the model validation process. Thanks to the use of several validation metrics we have highlighted the limit of the model to predict the evolution of the disease in the whole population of mutations at the same time, and that it was more efficient with specific predictions in the target mutation populations. The choice and use of a single metric could have led to an erroneous validation of the model and its context of use. CONCLUSIONS: With this work, we stress the importance of making judicious choices for a metric, and how using a combination of metrics could be more relevant, with the objective of validating a given model and its predictions within a specific context of use. We also show how the reliability of the results depends both on the metric and on the statistical comparisons, and that the conditions of application and the type of available information need to be taken into account to choose the best validation strategy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05430-w.
format Online
Article
Text
id pubmed-10478282
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104782822023-09-06 Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma Jacob, Evgueni Perrillat-Mercerot, Angélique Palgen, Jean-Louis L’Hostis, Adèle Ceres, Nicoletta Boissel, Jean-Pierre Bosley, Jim Monteiro, Claudio Kahoul, Riad BMC Bioinformatics Research BACKGROUND: Over the past several decades, metrics have been defined to assess the quality of various types of models and to compare their performance depending on their capacity to explain the variance found in real-life data. However, available validation methods are mostly designed for statistical regressions rather than for mechanistic models. To our knowledge, in the latter case, there are no consensus standards, for instance for the validation of predictions against real-world data given the variability and uncertainty of the data. In this work, we focus on the prediction of time-to-event curves using as an application example a mechanistic model of non-small cell lung cancer. We designed four empirical methods to assess both model performance and reliability of predictions: two methods based on bootstrapped versions of parametric statistical tests: log-rank and combined weighted log-ranks (MaxCombo); and two methods based on bootstrapped prediction intervals, referred to here as raw coverage and the juncture metric. We also introduced the notion of observation time uncertainty to take into consideration the real life delay between the moment when an event happens, and the moment when it is observed and reported. RESULTS: We highlight the advantages and disadvantages of these methods according to their application context. We have shown that the context of use of the model has an impact on the model validation process. Thanks to the use of several validation metrics we have highlighted the limit of the model to predict the evolution of the disease in the whole population of mutations at the same time, and that it was more efficient with specific predictions in the target mutation populations. The choice and use of a single metric could have led to an erroneous validation of the model and its context of use. CONCLUSIONS: With this work, we stress the importance of making judicious choices for a metric, and how using a combination of metrics could be more relevant, with the objective of validating a given model and its predictions within a specific context of use. We also show how the reliability of the results depends both on the metric and on the statistical comparisons, and that the conditions of application and the type of available information need to be taken into account to choose the best validation strategy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05430-w. BioMed Central 2023-09-04 /pmc/articles/PMC10478282/ /pubmed/37667175 http://dx.doi.org/10.1186/s12859-023-05430-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Jacob, Evgueni
Perrillat-Mercerot, Angélique
Palgen, Jean-Louis
L’Hostis, Adèle
Ceres, Nicoletta
Boissel, Jean-Pierre
Bosley, Jim
Monteiro, Claudio
Kahoul, Riad
Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma
title Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma
title_full Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma
title_fullStr Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma
title_full_unstemmed Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma
title_short Empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to EGFR + lung adenocarcinoma
title_sort empirical methods for the validation of time-to-event mathematical models taking into account uncertainty and variability: application to egfr + lung adenocarcinoma
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10478282/
https://www.ncbi.nlm.nih.gov/pubmed/37667175
http://dx.doi.org/10.1186/s12859-023-05430-w
work_keys_str_mv AT jacobevgueni empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT perrillatmercerotangelique empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT palgenjeanlouis empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT lhostisadele empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT ceresnicoletta empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT boisseljeanpierre empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT bosleyjim empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT monteiroclaudio empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma
AT kahoulriad empiricalmethodsforthevalidationoftimetoeventmathematicalmodelstakingintoaccountuncertaintyandvariabilityapplicationtoegfrlungadenocarcinoma