Cargando…
Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression
BACKGROUND: The debate of whether machine learning models offer advantages over standard statistical methods when making predictions is ongoing. We discuss the use of a meta-learner model combining both approaches as an alternative. METHODS: To illustrate the development of a meta-learner, we used a...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9112573/ https://www.ncbi.nlm.nih.gov/pubmed/35578254 http://dx.doi.org/10.1186/s12888-022-03986-0 |
_version_ | 1784709439117328384 |
---|---|
author | Liu, Qiang Salanti, Georgia De Crescenzo, Franco Ostinelli, Edoardo Giuseppe Li, Zhenpeng Tomlinson, Anneka Cipriani, Andrea Efthimiou, Orestis |
author_facet | Liu, Qiang Salanti, Georgia De Crescenzo, Franco Ostinelli, Edoardo Giuseppe Li, Zhenpeng Tomlinson, Anneka Cipriani, Andrea Efthimiou, Orestis |
author_sort | Liu, Qiang |
collection | PubMed |
description | BACKGROUND: The debate of whether machine learning models offer advantages over standard statistical methods when making predictions is ongoing. We discuss the use of a meta-learner model combining both approaches as an alternative. METHODS: To illustrate the development of a meta-learner, we used a dataset of 187,757 people with depression. Using 31 variables, we aimed to predict two outcomes measured 60 days after initiation of antidepressant treatment: severity of depressive symptoms (continuous) and all-cause dropouts (binary). We fitted a ridge regression and a multi-layer perceptron (MLP) deep neural network as two separate prediction models (“base-learners”). We then developed two “meta-learners”, combining predictions from the two base-learners. To compare the performance across the different methods, we calculated mean absolute error (MAE, for continuous outcome) and the area under the receiver operating characteristic curve (AUC, for binary outcome) using bootstrapping. RESULTS: Compared to the best performing base-learner (MLP base-learner, MAE at 4.63, AUC at 0.59), the best performing meta-learner showed a 2.49% decrease in MAE at 4.52 for the continuous outcome and a 6.47% increase in AUC at 0.60 for the binary outcome. CONCLUSIONS: A meta-learner approach may effectively combine multiple prediction models. Choosing between statistical and machine learning models may not be necessary in practice. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12888-022-03986-0. |
format | Online Article Text |
id | pubmed-9112573 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-91125732022-05-18 Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression Liu, Qiang Salanti, Georgia De Crescenzo, Franco Ostinelli, Edoardo Giuseppe Li, Zhenpeng Tomlinson, Anneka Cipriani, Andrea Efthimiou, Orestis BMC Psychiatry Research BACKGROUND: The debate of whether machine learning models offer advantages over standard statistical methods when making predictions is ongoing. We discuss the use of a meta-learner model combining both approaches as an alternative. METHODS: To illustrate the development of a meta-learner, we used a dataset of 187,757 people with depression. Using 31 variables, we aimed to predict two outcomes measured 60 days after initiation of antidepressant treatment: severity of depressive symptoms (continuous) and all-cause dropouts (binary). We fitted a ridge regression and a multi-layer perceptron (MLP) deep neural network as two separate prediction models (“base-learners”). We then developed two “meta-learners”, combining predictions from the two base-learners. To compare the performance across the different methods, we calculated mean absolute error (MAE, for continuous outcome) and the area under the receiver operating characteristic curve (AUC, for binary outcome) using bootstrapping. RESULTS: Compared to the best performing base-learner (MLP base-learner, MAE at 4.63, AUC at 0.59), the best performing meta-learner showed a 2.49% decrease in MAE at 4.52 for the continuous outcome and a 6.47% increase in AUC at 0.60 for the binary outcome. CONCLUSIONS: A meta-learner approach may effectively combine multiple prediction models. Choosing between statistical and machine learning models may not be necessary in practice. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12888-022-03986-0. BioMed Central 2022-05-16 /pmc/articles/PMC9112573/ /pubmed/35578254 http://dx.doi.org/10.1186/s12888-022-03986-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Liu, Qiang Salanti, Georgia De Crescenzo, Franco Ostinelli, Edoardo Giuseppe Li, Zhenpeng Tomlinson, Anneka Cipriani, Andrea Efthimiou, Orestis Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression |
title | Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression |
title_full | Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression |
title_fullStr | Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression |
title_full_unstemmed | Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression |
title_short | Development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression |
title_sort | development and validation of a meta-learner for combining statistical and machine learning prediction models in individuals with depression |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9112573/ https://www.ncbi.nlm.nih.gov/pubmed/35578254 http://dx.doi.org/10.1186/s12888-022-03986-0 |
work_keys_str_mv | AT liuqiang developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression AT salantigeorgia developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression AT decrescenzofranco developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression AT ostinelliedoardogiuseppe developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression AT lizhenpeng developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression AT tomlinsonanneka developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression AT ciprianiandrea developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression AT efthimiouorestis developmentandvalidationofametalearnerforcombiningstatisticalandmachinelearningpredictionmodelsinindividualswithdepression |