Cargando…
Use of a machine learning framework to predict substance use disorder treatment success
There are several methods for building prediction models. The wealth of currently available modeling techniques usually forces the researcher to judge, a priori, what will likely be the best method. Super learning (SL) is a methodology that facilitates this decision by combining all identified predi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5386258/ https://www.ncbi.nlm.nih.gov/pubmed/28394905 http://dx.doi.org/10.1371/journal.pone.0175383 |
_version_ | 1782520735612272640 |
---|---|
author | Acion, Laura Kelmansky, Diana van der Laan, Mark Sahker, Ethan Jones, DeShauna Arndt, Stephan |
author_facet | Acion, Laura Kelmansky, Diana van der Laan, Mark Sahker, Ethan Jones, DeShauna Arndt, Stephan |
author_sort | Acion, Laura |
collection | PubMed |
description | There are several methods for building prediction models. The wealth of currently available modeling techniques usually forces the researcher to judge, a priori, what will likely be the best method. Super learning (SL) is a methodology that facilitates this decision by combining all identified prediction algorithms pertinent for a particular prediction problem. SL generates a final model that is at least as good as any of the other models considered for predicting the outcome. The overarching aim of this work is to introduce SL to analysts and practitioners. This work compares the performance of logistic regression, penalized regression, random forests, deep learning neural networks, and SL to predict successful substance use disorders (SUD) treatment. A nationwide database including 99,013 SUD treatment patients was used. All algorithms were evaluated using the area under the receiver operating characteristic curve (AUC) in a test sample that was not included in the training sample used to fit the prediction models. AUC for the models ranged between 0.793 and 0.820. SL was superior to all but one of the algorithms compared. An explanation of SL steps is provided. SL is the first step in targeted learning, an analytic framework that yields double robust effect estimation and inference with fewer assumptions than the usual parametric methods. Different aspects of SL depending on the context, its function within the targeted learning framework, and the benefits of this methodology in the addiction field are discussed. |
format | Online Article Text |
id | pubmed-5386258 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53862582017-05-03 Use of a machine learning framework to predict substance use disorder treatment success Acion, Laura Kelmansky, Diana van der Laan, Mark Sahker, Ethan Jones, DeShauna Arndt, Stephan PLoS One Research Article There are several methods for building prediction models. The wealth of currently available modeling techniques usually forces the researcher to judge, a priori, what will likely be the best method. Super learning (SL) is a methodology that facilitates this decision by combining all identified prediction algorithms pertinent for a particular prediction problem. SL generates a final model that is at least as good as any of the other models considered for predicting the outcome. The overarching aim of this work is to introduce SL to analysts and practitioners. This work compares the performance of logistic regression, penalized regression, random forests, deep learning neural networks, and SL to predict successful substance use disorders (SUD) treatment. A nationwide database including 99,013 SUD treatment patients was used. All algorithms were evaluated using the area under the receiver operating characteristic curve (AUC) in a test sample that was not included in the training sample used to fit the prediction models. AUC for the models ranged between 0.793 and 0.820. SL was superior to all but one of the algorithms compared. An explanation of SL steps is provided. SL is the first step in targeted learning, an analytic framework that yields double robust effect estimation and inference with fewer assumptions than the usual parametric methods. Different aspects of SL depending on the context, its function within the targeted learning framework, and the benefits of this methodology in the addiction field are discussed. Public Library of Science 2017-04-10 /pmc/articles/PMC5386258/ /pubmed/28394905 http://dx.doi.org/10.1371/journal.pone.0175383 Text en © 2017 Acion et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Acion, Laura Kelmansky, Diana van der Laan, Mark Sahker, Ethan Jones, DeShauna Arndt, Stephan Use of a machine learning framework to predict substance use disorder treatment success |
title | Use of a machine learning framework to predict substance use disorder treatment success |
title_full | Use of a machine learning framework to predict substance use disorder treatment success |
title_fullStr | Use of a machine learning framework to predict substance use disorder treatment success |
title_full_unstemmed | Use of a machine learning framework to predict substance use disorder treatment success |
title_short | Use of a machine learning framework to predict substance use disorder treatment success |
title_sort | use of a machine learning framework to predict substance use disorder treatment success |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5386258/ https://www.ncbi.nlm.nih.gov/pubmed/28394905 http://dx.doi.org/10.1371/journal.pone.0175383 |
work_keys_str_mv | AT acionlaura useofamachinelearningframeworktopredictsubstanceusedisordertreatmentsuccess AT kelmanskydiana useofamachinelearningframeworktopredictsubstanceusedisordertreatmentsuccess AT vanderlaanmark useofamachinelearningframeworktopredictsubstanceusedisordertreatmentsuccess AT sahkerethan useofamachinelearningframeworktopredictsubstanceusedisordertreatmentsuccess AT jonesdeshauna useofamachinelearningframeworktopredictsubstanceusedisordertreatmentsuccess AT arndtstephan useofamachinelearningframeworktopredictsubstanceusedisordertreatmentsuccess |