Cargando…

Predicting outcomes at the individual patient level: what is the best method?

OBJECTIVE: When developing prediction models, researchers commonly employ a single model which uses all the available data (end-to-end approach). Alternatively, a similarity-based approach has been previously proposed, in which patients with similar clinical characteristics are first grouped into cl...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Qiang, Ostinelli, Edoardo Giuseppe, De Crescenzo, Franco, Li, Zhenpeng, Tomlinson, Anneka, Salanti, Georgia, Cipriani, Andrea, Efthimiou, Orestis
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BMJ Publishing Group 2023
Materias:	Statistics in Practice
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10277128/ https://www.ncbi.nlm.nih.gov/pubmed/37316257 http://dx.doi.org/10.1136/bmjment-2023-300701

_version_	1785060223965200384
author	Liu, Qiang Ostinelli, Edoardo Giuseppe De Crescenzo, Franco Li, Zhenpeng Tomlinson, Anneka Salanti, Georgia Cipriani, Andrea Efthimiou, Orestis
author_facet	Liu, Qiang Ostinelli, Edoardo Giuseppe De Crescenzo, Franco Li, Zhenpeng Tomlinson, Anneka Salanti, Georgia Cipriani, Andrea Efthimiou, Orestis
author_sort	Liu, Qiang
collection	PubMed
description	OBJECTIVE: When developing prediction models, researchers commonly employ a single model which uses all the available data (end-to-end approach). Alternatively, a similarity-based approach has been previously proposed, in which patients with similar clinical characteristics are first grouped into clusters, then prediction models are developed within each cluster. The potential advantage of the similarity-based approach is that it may better address heterogeneity in patient characteristics. However, it remains unclear whether it improves the overall predictive performance. We illustrate the similarity-based approach using data from people with depression and empirically compare its performance with the end-to-end approach. METHODS: We used primary care data collected in general practices in the UK. Using 31 predefined baseline variables, we aimed to predict the severity of depressive symptoms, measured by Patient Health Questionnaire-9, 60 days after initiation of antidepressant treatment. Following the similarity-based approach, we used k-means to cluster patients based on their baseline characteristics. We derived the optimal number of clusters using the Silhouette coefficient. We used ridge regression to build prediction models in both approaches. To compare the models’ performance, we calculated the mean absolute error (MAE) and the coefficient of determination (R(2)) using bootstrapping. RESULTS: We analysed data from 16 384 patients. The end-to-end approach resulted in an MAE of 4.64 and R(2) of 0.20. The best-performing similarity-based model was for four clusters, with MAE of 4.65 and R(2) of 0.19. CONCLUSIONS: The end-to-end and the similarity-based model yielded comparable performance. Due to its simplicity, the end-to-end approach can be favoured when using demographic and clinical data to build prediction models on pharmacological treatments for depression.
format	Online Article Text
id	pubmed-10277128
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	BMJ Publishing Group
record_format	MEDLINE/PubMed
spelling	pubmed-102771282023-08-21 Predicting outcomes at the individual patient level: what is the best method? Liu, Qiang Ostinelli, Edoardo Giuseppe De Crescenzo, Franco Li, Zhenpeng Tomlinson, Anneka Salanti, Georgia Cipriani, Andrea Efthimiou, Orestis BMJ Ment Health Statistics in Practice OBJECTIVE: When developing prediction models, researchers commonly employ a single model which uses all the available data (end-to-end approach). Alternatively, a similarity-based approach has been previously proposed, in which patients with similar clinical characteristics are first grouped into clusters, then prediction models are developed within each cluster. The potential advantage of the similarity-based approach is that it may better address heterogeneity in patient characteristics. However, it remains unclear whether it improves the overall predictive performance. We illustrate the similarity-based approach using data from people with depression and empirically compare its performance with the end-to-end approach. METHODS: We used primary care data collected in general practices in the UK. Using 31 predefined baseline variables, we aimed to predict the severity of depressive symptoms, measured by Patient Health Questionnaire-9, 60 days after initiation of antidepressant treatment. Following the similarity-based approach, we used k-means to cluster patients based on their baseline characteristics. We derived the optimal number of clusters using the Silhouette coefficient. We used ridge regression to build prediction models in both approaches. To compare the models’ performance, we calculated the mean absolute error (MAE) and the coefficient of determination (R(2)) using bootstrapping. RESULTS: We analysed data from 16 384 patients. The end-to-end approach resulted in an MAE of 4.64 and R(2) of 0.20. The best-performing similarity-based model was for four clusters, with MAE of 4.65 and R(2) of 0.19. CONCLUSIONS: The end-to-end and the similarity-based model yielded comparable performance. Due to its simplicity, the end-to-end approach can be favoured when using demographic and clinical data to build prediction models on pharmacological treatments for depression. BMJ Publishing Group 2023-06-14 /pmc/articles/PMC10277128/ /pubmed/37316257 http://dx.doi.org/10.1136/bmjment-2023-300701 Text en © Author(s) (or their employer(s)) 2023. Re-use permitted under CC BY. Published by BMJ. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.
spellingShingle	Statistics in Practice Liu, Qiang Ostinelli, Edoardo Giuseppe De Crescenzo, Franco Li, Zhenpeng Tomlinson, Anneka Salanti, Georgia Cipriani, Andrea Efthimiou, Orestis Predicting outcomes at the individual patient level: what is the best method?
title	Predicting outcomes at the individual patient level: what is the best method?
title_full	Predicting outcomes at the individual patient level: what is the best method?
title_fullStr	Predicting outcomes at the individual patient level: what is the best method?
title_full_unstemmed	Predicting outcomes at the individual patient level: what is the best method?
title_short	Predicting outcomes at the individual patient level: what is the best method?
title_sort	predicting outcomes at the individual patient level: what is the best method?
topic	Statistics in Practice
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10277128/ https://www.ncbi.nlm.nih.gov/pubmed/37316257 http://dx.doi.org/10.1136/bmjment-2023-300701
work_keys_str_mv	AT liuqiang predictingoutcomesattheindividualpatientlevelwhatisthebestmethod AT ostinelliedoardogiuseppe predictingoutcomesattheindividualpatientlevelwhatisthebestmethod AT decrescenzofranco predictingoutcomesattheindividualpatientlevelwhatisthebestmethod AT lizhenpeng predictingoutcomesattheindividualpatientlevelwhatisthebestmethod AT tomlinsonanneka predictingoutcomesattheindividualpatientlevelwhatisthebestmethod AT salantigeorgia predictingoutcomesattheindividualpatientlevelwhatisthebestmethod AT ciprianiandrea predictingoutcomesattheindividualpatientlevelwhatisthebestmethod AT efthimiouorestis predictingoutcomesattheindividualpatientlevelwhatisthebestmethod

Predicting outcomes at the individual patient level: what is the best method?

Ejemplares similares