Cargando…

Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression

BACKGROUND: The debate of whether machine learning models offer advantages over standard statistical methods when making predictions is ongoing. We discuss the use of a meta-learner model combining both approaches as an alternative. METHODS: To illustrate the development of a meta-learner, we used a...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Qiang, Salanti, Georgia, De Crescenzo, Franco, Ostinelli, Edoardo Giuseppe, Li, Zhenpeng, Tomlinson, Anneka, Cipriani, Andrea, Efthimiou, Orestis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9112573/
https://www.ncbi.nlm.nih.gov/pubmed/35578254
http://dx.doi.org/10.1186/s12888-022-03986-0
_version_ 1784709439117328384
author Liu, Qiang
Salanti, Georgia
De Crescenzo, Franco
Ostinelli, Edoardo Giuseppe
Li, Zhenpeng
Tomlinson, Anneka
Cipriani, Andrea
Efthimiou, Orestis
author_facet Liu, Qiang
Salanti, Georgia
De Crescenzo, Franco
Ostinelli, Edoardo Giuseppe
Li, Zhenpeng
Tomlinson, Anneka
Cipriani, Andrea
Efthimiou, Orestis
author_sort Liu, Qiang
collection PubMed
description BACKGROUND: The debate of whether machine learning models offer advantages over standard statistical methods when making predictions is ongoing. We discuss the use of a meta-learner model combining both approaches as an alternative. METHODS: To illustrate the development of a meta-learner, we used a dataset of 187,757 people with depression. Using 31 variables, we aimed to predict two outcomes measured 60 days after initiation of antidepressant treatment: severity of depressive symptoms (continuous) and all-cause dropouts (binary). We fitted a ridge regression and a multi-layer perceptron (MLP) deep neural network as two separate prediction models (“base-learners”). We then developed two “meta-learners”, combining predictions from the two base-learners. To compare the performance across the different methods, we calculated mean absolute error (MAE, for continuous outcome) and the area under the receiver operating characteristic curve (AUC, for binary outcome) using bootstrapping. RESULTS: Compared to the best performing base-learner (MLP base-learner, MAE at 4.63, AUC at 0.59), the best performing meta-learner showed a 2.49% decrease in MAE at 4.52 for the continuous outcome and a 6.47% increase in AUC at 0.60 for the binary outcome. CONCLUSIONS: A meta-learner approach may effectively combine multiple prediction models. Choosing between statistical and machine learning models may not be necessary in practice. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12888-022-03986-0.
format Online
Article
Text
id pubmed-9112573
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91125732022-05-18 Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression Liu, Qiang Salanti, Georgia De Crescenzo, Franco Ostinelli, Edoardo Giuseppe Li, Zhenpeng Tomlinson, Anneka Cipriani, Andrea Efthimiou, Orestis BMC Psychiatry Research BACKGROUND: The debate of whether machine learning models offer advantages over standard statistical methods when making predictions is ongoing. We discuss the use of a meta-learner model combining both approaches as an alternative. METHODS: To illustrate the development of a meta-learner, we used a dataset of 187,757 people with depression. Using 31 variables, we aimed to predict two outcomes measured 60 days after initiation of antidepressant treatment: severity of depressive symptoms (continuous) and all-cause dropouts (binary). We fitted a ridge regression and a multi-layer perceptron (MLP) deep neural network as two separate prediction models (“base-learners”). We then developed two “meta-learners”, combining predictions from the two base-learners. To compare the performance across the different methods, we calculated mean absolute error (MAE, for continuous outcome) and the area under the receiver operating characteristic curve (AUC, for binary outcome) using bootstrapping. RESULTS: Compared to the best performing base-learner (MLP base-learner, MAE at 4.63, AUC at 0.59), the best performing meta-learner showed a 2.49% decrease in MAE at 4.52 for the continuous outcome and a 6.47% increase in AUC at 0.60 for the binary outcome. CONCLUSIONS: A meta-learner approach may effectively combine multiple prediction models. Choosing between statistical and machine learning models may not be necessary in practice. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12888-022-03986-0. BioMed Central 2022-05-16 /pmc/articles/PMC9112573/ /pubmed/35578254 http://dx.doi.org/10.1186/s12888-022-03986-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liu, Qiang
Salanti, Georgia
De Crescenzo, Franco
Ostinelli, Edoardo Giuseppe
Li, Zhenpeng
Tomlinson, Anneka
Cipriani, Andrea
Efthimiou, Orestis
Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
title Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
title_full Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
title_fullStr Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
title_full_unstemmed Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
title_short Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
title_sort development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9112573/
https://www.ncbi.nlm.nih.gov/pubmed/35578254
http://dx.doi.org/10.1186/s12888-022-03986-0
work_keys_str_mv AT liuqiang developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression
AT salantigeorgia developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression
AT decrescenzofranco developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression
AT ostinelliedoardogiuseppe developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression
AT lizhenpeng developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression
AT tomlinsonanneka developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression
AT ciprianiandrea developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression
AT efthimiouorestis developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression