Cargando…
4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm
OBJECTIVES/GOALS: To test if a machine learning algorithm could predict a person’s capacity to binge drink and explore what measures might be important for identifying individuals at risk for high-intensity binge drinking behaviors. METHODS/STUDY POPULATION: The sample included 1177 (474 female) non...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823439/ http://dx.doi.org/10.1017/cts.2020.399 |
_version_ | 1784646802519097344 |
---|---|
author | Morris, James Keoni Gowin, Josh L. Schwandt, Melanie L. Diazgranados, Nancy Ramchandani, Vijay A. |
author_facet | Morris, James Keoni Gowin, Josh L. Schwandt, Melanie L. Diazgranados, Nancy Ramchandani, Vijay A. |
author_sort | Morris, James Keoni |
collection | PubMed |
description | OBJECTIVES/GOALS: To test if a machine learning algorithm could predict a person’s capacity to binge drink and explore what measures might be important for identifying individuals at risk for high-intensity binge drinking behaviors. METHODS/STUDY POPULATION: The sample included 1177 (474 female) non-treatment-seeking drinkers (age: 18-91 years), that were assigned to a group based on their heaviest drinking day reported in a 90-Day Alcohol Timeline Followback questionnaire. The groups were Non-Bingers (female: 12 drinks, male:>15 drinks). The sample was divided into a training sample (N = 884) and a testing sample (N = 293). A machine learning algorithm called random forest was then used to generate a predictive model based on measures of substance use, personality traits, and trauma. The model was applied to the testing sample to determine accuracy. RESULTS/ANTICIPATED RESULTS: The first model correctly assigned 190 out of 293 subjects, giving it a total error rate of 0.35, with lowest rates for non-binge (0.19) and high-intensity (0.18), while medium-intensity had the highest error rate (0.86). The most important variables for the accuracy of the model included: total score on the Alcohol Use Disorder Identification Test, first five sub-score of the Self-Reported Effects of Alcohol, Compulsive Drinking subscale, and presence of a current psychiatric diagnosis. As a follow-up analysis, we built and tested another random forest model without the use of drinking dependence measures. This model had a total error rate of 0.39, and introduced other important variables such as smoking behaviors, perceived stress, IQ, and number of negative life events. DISCUSSION/SIGNIFICANCE OF IMPACT: Our study showed that it was possible for a machine learning algorithm to predict binge drinking intensity better than chance. Drinking patterns were the most robust predictors, and stress, IQ, and psychiatric diagnoses were also useful in predicting binge drinking intensity. |
format | Online Article Text |
id | pubmed-8823439 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-88234392022-02-18 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm Morris, James Keoni Gowin, Josh L. Schwandt, Melanie L. Diazgranados, Nancy Ramchandani, Vijay A. J Clin Transl Sci Translational Science, Policy, & Health Outcomes Science OBJECTIVES/GOALS: To test if a machine learning algorithm could predict a person’s capacity to binge drink and explore what measures might be important for identifying individuals at risk for high-intensity binge drinking behaviors. METHODS/STUDY POPULATION: The sample included 1177 (474 female) non-treatment-seeking drinkers (age: 18-91 years), that were assigned to a group based on their heaviest drinking day reported in a 90-Day Alcohol Timeline Followback questionnaire. The groups were Non-Bingers (female: 12 drinks, male:>15 drinks). The sample was divided into a training sample (N = 884) and a testing sample (N = 293). A machine learning algorithm called random forest was then used to generate a predictive model based on measures of substance use, personality traits, and trauma. The model was applied to the testing sample to determine accuracy. RESULTS/ANTICIPATED RESULTS: The first model correctly assigned 190 out of 293 subjects, giving it a total error rate of 0.35, with lowest rates for non-binge (0.19) and high-intensity (0.18), while medium-intensity had the highest error rate (0.86). The most important variables for the accuracy of the model included: total score on the Alcohol Use Disorder Identification Test, first five sub-score of the Self-Reported Effects of Alcohol, Compulsive Drinking subscale, and presence of a current psychiatric diagnosis. As a follow-up analysis, we built and tested another random forest model without the use of drinking dependence measures. This model had a total error rate of 0.39, and introduced other important variables such as smoking behaviors, perceived stress, IQ, and number of negative life events. DISCUSSION/SIGNIFICANCE OF IMPACT: Our study showed that it was possible for a machine learning algorithm to predict binge drinking intensity better than chance. Drinking patterns were the most robust predictors, and stress, IQ, and psychiatric diagnoses were also useful in predicting binge drinking intensity. Cambridge University Press 2020-07-29 /pmc/articles/PMC8823439/ http://dx.doi.org/10.1017/cts.2020.399 Text en © The Association for Clinical and Translational Science 2020 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Translational Science, Policy, & Health Outcomes Science Morris, James Keoni Gowin, Josh L. Schwandt, Melanie L. Diazgranados, Nancy Ramchandani, Vijay A. 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm |
title | 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm |
title_full | 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm |
title_fullStr | 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm |
title_full_unstemmed | 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm |
title_short | 4288 Identifying Predictive Variables of High-Intensity Binge Drinking Through the Use of a Machine Learning Algorithm |
title_sort | 4288 identifying predictive variables of high-intensity binge drinking through the use of a machine learning algorithm |
topic | Translational Science, Policy, & Health Outcomes Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823439/ http://dx.doi.org/10.1017/cts.2020.399 |
work_keys_str_mv | AT morrisjameskeoni 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm AT gowinjoshl 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm AT schwandtmelaniel 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm AT diazgranadosnancy 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm AT ramchandanivijaya 4288identifyingpredictivevariablesofhighintensitybingedrinkingthroughtheuseofamachinelearningalgorithm |