Cargando…

4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm

OBJECTIVES/GOALS: To test if a machine learning algorithm could predict a person’s capacity to binge drink and explore what measures might be important for identifying individuals at risk for high-intensity binge drinking behaviors. METHODS/STUDY POPULATION: The sample included 1177 (474 female) non...

Descripción completa

Detalles Bibliográficos
Autores principales: Morris, James Keoni, Gowin, Josh L., Schwandt, Melanie L., Diazgranados, Nancy, Ramchandani, Vijay A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823439/
http://dx.doi.org/10.1017/cts.2020.399
_version_ 1784646802519097344
author Morris, James Keoni
Gowin, Josh L.
Schwandt, Melanie L.
Diazgranados, Nancy
Ramchandani, Vijay A.
author_facet Morris, James Keoni
Gowin, Josh L.
Schwandt, Melanie L.
Diazgranados, Nancy
Ramchandani, Vijay A.
author_sort Morris, James Keoni
collection PubMed
description OBJECTIVES/GOALS: To test if a machine learning algorithm could predict a person’s capacity to binge drink and explore what measures might be important for identifying individuals at risk for high-intensity binge drinking behaviors. METHODS/STUDY POPULATION: The sample included 1177 (474 female) non-treatment-seeking drinkers (age: 18-91 years), that were assigned to a group based on their heaviest drinking day reported in a 90-Day Alcohol Timeline Followback questionnaire. The groups were Non-Bingers (female: 12 drinks, male:>15 drinks). The sample was divided into a training sample (N = 884) and a testing sample (N = 293). A machine learning algorithm called random forest was then used to generate a predictive model based on measures of substance use, personality traits, and trauma. The model was applied to the testing sample to determine accuracy. RESULTS/ANTICIPATED RESULTS: The first model correctly assigned 190 out of 293 subjects, giving it a total error rate of 0.35, with lowest rates for non-binge (0.19) and high-intensity (0.18), while medium-intensity had the highest error rate (0.86). The most important variables for the accuracy of the model included: total score on the Alcohol Use Disorder Identification Test, first five sub-score of the Self-Reported Effects of Alcohol, Compulsive Drinking subscale, and presence of a current psychiatric diagnosis. As a follow-up analysis, we built and tested another random forest model without the use of drinking dependence measures. This model had a total error rate of 0.39, and introduced other important variables such as smoking behaviors, perceived stress, IQ, and number of negative life events. DISCUSSION/SIGNIFICANCE OF IMPACT: Our study showed that it was possible for a machine learning algorithm to predict binge drinking intensity better than chance. Drinking patterns were the most robust predictors, and stress, IQ, and psychiatric diagnoses were also useful in predicting binge drinking intensity.
format Online
Article
Text
id pubmed-8823439
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-88234392022-02-18 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm Morris, James Keoni Gowin, Josh L. Schwandt, Melanie L. Diazgranados, Nancy Ramchandani, Vijay A. J Clin Transl Sci Translational Science, Policy, & Health Outcomes Science OBJECTIVES/GOALS: To test if a machine learning algorithm could predict a person’s capacity to binge drink and explore what measures might be important for identifying individuals at risk for high-intensity binge drinking behaviors. METHODS/STUDY POPULATION: The sample included 1177 (474 female) non-treatment-seeking drinkers (age: 18-91 years), that were assigned to a group based on their heaviest drinking day reported in a 90-Day Alcohol Timeline Followback questionnaire. The groups were Non-Bingers (female: 12 drinks, male:>15 drinks). The sample was divided into a training sample (N = 884) and a testing sample (N = 293). A machine learning algorithm called random forest was then used to generate a predictive model based on measures of substance use, personality traits, and trauma. The model was applied to the testing sample to determine accuracy. RESULTS/ANTICIPATED RESULTS: The first model correctly assigned 190 out of 293 subjects, giving it a total error rate of 0.35, with lowest rates for non-binge (0.19) and high-intensity (0.18), while medium-intensity had the highest error rate (0.86). The most important variables for the accuracy of the model included: total score on the Alcohol Use Disorder Identification Test, first five sub-score of the Self-Reported Effects of Alcohol, Compulsive Drinking subscale, and presence of a current psychiatric diagnosis. As a follow-up analysis, we built and tested another random forest model without the use of drinking dependence measures. This model had a total error rate of 0.39, and introduced other important variables such as smoking behaviors, perceived stress, IQ, and number of negative life events. DISCUSSION/SIGNIFICANCE OF IMPACT: Our study showed that it was possible for a machine learning algorithm to predict binge drinking intensity better than chance. Drinking patterns were the most robust predictors, and stress, IQ, and psychiatric diagnoses were also useful in predicting binge drinking intensity. Cambridge University Press 2020-07-29 /pmc/articles/PMC8823439/ http://dx.doi.org/10.1017/cts.2020.399 Text en © The Association for Clinical and Translational Science 2020 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Translational Science, Policy, & Health Outcomes Science
Morris, James Keoni
Gowin, Josh L.
Schwandt, Melanie L.
Diazgranados, Nancy
Ramchandani, Vijay A.
4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm
title 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm
title_full 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm
title_fullStr 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm
title_full_unstemmed 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm
title_short 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm
title_sort 4288 identifying predictive variables of high-intensity binge drinking through the use of a machine learning algorithm
topic Translational Science, Policy, & Health Outcomes Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823439/
http://dx.doi.org/10.1017/cts.2020.399
work_keys_str_mv AT morrisjameskeoni 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm
AT gowinjoshl 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm
AT schwandtmelaniel 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm
AT diazgranadosnancy 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm
AT ramchandanivijaya 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm