Cargando…

Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine

OBJECTIVE: Precision medicine requires reliable identification of variation in patient-level outcomes with different available treatments, often termed treatment effect heterogeneity. We aimed to evaluate the comparative utility of individualized treatment selection strategies based on predicted ind...

Descripción completa

Detalles Bibliográficos
Autores principales: Venkatasubramaniam, Ashwini, Mateen, Bilal A., Shields, Beverley M., Hattersley, Andrew T., Jones, Angus G., Vollmer, Sebastian J., Dennis, John M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10276367/
https://www.ncbi.nlm.nih.gov/pubmed/37328784
http://dx.doi.org/10.1186/s12911-023-02207-2
_version_ 1785060061162242048
author Venkatasubramaniam, Ashwini
Mateen, Bilal A.
Shields, Beverley M.
Hattersley, Andrew T.
Jones, Angus G.
Vollmer, Sebastian J.
Dennis, John M.
author_facet Venkatasubramaniam, Ashwini
Mateen, Bilal A.
Shields, Beverley M.
Hattersley, Andrew T.
Jones, Angus G.
Vollmer, Sebastian J.
Dennis, John M.
author_sort Venkatasubramaniam, Ashwini
collection PubMed
description OBJECTIVE: Precision medicine requires reliable identification of variation in patient-level outcomes with different available treatments, often termed treatment effect heterogeneity. We aimed to evaluate the comparative utility of individualized treatment selection strategies based on predicted individual-level treatment effects from a causal forest machine learning algorithm and a penalized regression model. METHODS: Cohort study characterizing individual-level glucose-lowering response (6 month reduction in HbA1c) in people with type 2 diabetes initiating SGLT2-inhibitor or DPP4-inhibitor therapy. Model development set comprised 1,428 participants in the CANTATA-D and CANTATA-D2 randomised clinical trials of SGLT2-inhibitors versus DPP4-inhibitors. For external validation, calibration of observed versus predicted differences in HbA1c in patient strata defined by size of predicted HbA1c benefit was evaluated in 18,741 patients in UK primary care (Clinical Practice Research Datalink). RESULTS: Heterogeneity in treatment effects was detected in clinical trial participants with both approaches (proportion predicted to have a benefit on SGLT2-inhibitor therapy over DPP4-inhibitor therapy: causal forest: 98.6%; penalized regression: 81.7%). In validation, calibration was good with penalized regression but sub-optimal with causal forest. A strata with an HbA1c benefit > 10 mmol/mol with SGLT2-inhibitors (3.7% of patients, observed benefit 11.0 mmol/mol [95%CI 8.0–14.0]) was identified using penalized regression but not causal forest, and a much larger strata with an HbA1c benefit 5–10 mmol with SGLT2-inhibitors was identified with penalized regression (regression: 20.9% of patients, observed benefit 7.8 mmol/mol (95%CI 6.7–8.9); causal forest 11.6%, observed benefit 8.7 mmol/mol (95%CI 7.4–10.1). CONCLUSIONS: Consistent with recent results for outcome prediction with clinical data, when evaluating treatment effect heterogeneity researchers should not rely on causal forest or other similar machine learning algorithms alone, and must compare outputs with standard regression, which in this evaluation was superior. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02207-2.
format Online
Article
Text
id pubmed-10276367
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102763672023-06-18 Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine Venkatasubramaniam, Ashwini Mateen, Bilal A. Shields, Beverley M. Hattersley, Andrew T. Jones, Angus G. Vollmer, Sebastian J. Dennis, John M. BMC Med Inform Decis Mak Research OBJECTIVE: Precision medicine requires reliable identification of variation in patient-level outcomes with different available treatments, often termed treatment effect heterogeneity. We aimed to evaluate the comparative utility of individualized treatment selection strategies based on predicted individual-level treatment effects from a causal forest machine learning algorithm and a penalized regression model. METHODS: Cohort study characterizing individual-level glucose-lowering response (6 month reduction in HbA1c) in people with type 2 diabetes initiating SGLT2-inhibitor or DPP4-inhibitor therapy. Model development set comprised 1,428 participants in the CANTATA-D and CANTATA-D2 randomised clinical trials of SGLT2-inhibitors versus DPP4-inhibitors. For external validation, calibration of observed versus predicted differences in HbA1c in patient strata defined by size of predicted HbA1c benefit was evaluated in 18,741 patients in UK primary care (Clinical Practice Research Datalink). RESULTS: Heterogeneity in treatment effects was detected in clinical trial participants with both approaches (proportion predicted to have a benefit on SGLT2-inhibitor therapy over DPP4-inhibitor therapy: causal forest: 98.6%; penalized regression: 81.7%). In validation, calibration was good with penalized regression but sub-optimal with causal forest. A strata with an HbA1c benefit > 10 mmol/mol with SGLT2-inhibitors (3.7% of patients, observed benefit 11.0 mmol/mol [95%CI 8.0–14.0]) was identified using penalized regression but not causal forest, and a much larger strata with an HbA1c benefit 5–10 mmol with SGLT2-inhibitors was identified with penalized regression (regression: 20.9% of patients, observed benefit 7.8 mmol/mol (95%CI 6.7–8.9); causal forest 11.6%, observed benefit 8.7 mmol/mol (95%CI 7.4–10.1). CONCLUSIONS: Consistent with recent results for outcome prediction with clinical data, when evaluating treatment effect heterogeneity researchers should not rely on causal forest or other similar machine learning algorithms alone, and must compare outputs with standard regression, which in this evaluation was superior. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02207-2. BioMed Central 2023-06-16 /pmc/articles/PMC10276367/ /pubmed/37328784 http://dx.doi.org/10.1186/s12911-023-02207-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Venkatasubramaniam, Ashwini
Mateen, Bilal A.
Shields, Beverley M.
Hattersley, Andrew T.
Jones, Angus G.
Vollmer, Sebastian J.
Dennis, John M.
Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine
title Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine
title_full Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine
title_fullStr Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine
title_full_unstemmed Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine
title_short Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine
title_sort comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10276367/
https://www.ncbi.nlm.nih.gov/pubmed/37328784
http://dx.doi.org/10.1186/s12911-023-02207-2
work_keys_str_mv AT venkatasubramaniamashwini comparisonofcausalforestandregressionbasedapproachestoevaluatetreatmenteffectheterogeneityanapplicationfortype2diabetesprecisionmedicine
AT mateenbilala comparisonofcausalforestandregressionbasedapproachestoevaluatetreatmenteffectheterogeneityanapplicationfortype2diabetesprecisionmedicine
AT shieldsbeverleym comparisonofcausalforestandregressionbasedapproachestoevaluatetreatmenteffectheterogeneityanapplicationfortype2diabetesprecisionmedicine
AT hattersleyandrewt comparisonofcausalforestandregressionbasedapproachestoevaluatetreatmenteffectheterogeneityanapplicationfortype2diabetesprecisionmedicine
AT jonesangusg comparisonofcausalforestandregressionbasedapproachestoevaluatetreatmenteffectheterogeneityanapplicationfortype2diabetesprecisionmedicine
AT vollmersebastianj comparisonofcausalforestandregressionbasedapproachestoevaluatetreatmenteffectheterogeneityanapplicationfortype2diabetesprecisionmedicine
AT dennisjohnm comparisonofcausalforestandregressionbasedapproachestoevaluatetreatmenteffectheterogeneityanapplicationfortype2diabetesprecisionmedicine