Cargando…
Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data
BACKGROUND: High rates of mental disorders, suicidality, and interpersonal violence early in the military career have raised interest in implementing preventive interventions with high-risk new enlistees. The Army Study to Assess Risk and Resilience in Servicemembers (STARRS) developed risk-targetin...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5883887/ https://www.ncbi.nlm.nih.gov/pubmed/29615005 http://dx.doi.org/10.1186/s12888-018-1656-4 |
_version_ | 1783311740529803264 |
---|---|
author | Bernecker, Samantha L. Rosellini, Anthony J. Nock, Matthew K. Chiu, Wai Tat Gutierrez, Peter M. Hwang, Irving Joiner, Thomas E. Naifeh, James A. Sampson, Nancy A. Zaslavsky, Alan M. Stein, Murray B. Ursano, Robert J. Kessler, Ronald C. |
author_facet | Bernecker, Samantha L. Rosellini, Anthony J. Nock, Matthew K. Chiu, Wai Tat Gutierrez, Peter M. Hwang, Irving Joiner, Thomas E. Naifeh, James A. Sampson, Nancy A. Zaslavsky, Alan M. Stein, Murray B. Ursano, Robert J. Kessler, Ronald C. |
author_sort | Bernecker, Samantha L. |
collection | PubMed |
description | BACKGROUND: High rates of mental disorders, suicidality, and interpersonal violence early in the military career have raised interest in implementing preventive interventions with high-risk new enlistees. The Army Study to Assess Risk and Resilience in Servicemembers (STARRS) developed risk-targeting systems for these outcomes based on machine learning methods using administrative data predictors. However, administrative data omit many risk factors, raising the question whether risk targeting could be improved by adding self-report survey data to prediction models. If so, the Army may gain from routinely administering surveys that assess additional risk factors. METHODS: The STARRS New Soldier Survey was administered to 21,790 Regular Army soldiers who agreed to have survey data linked to administrative records. As reported previously, machine learning models using administrative data as predictors found that small proportions of high-risk soldiers accounted for high proportions of negative outcomes. Other machine learning models using self-report survey data as predictors were developed previously for three of these outcomes: major physical violence and sexual violence perpetration among men and sexual violence victimization among women. Here we examined the extent to which this survey information increases prediction accuracy, over models based solely on administrative data, for those three outcomes. We used discrete-time survival analysis to estimate a series of models predicting first occurrence, assessing how model fit improved and concentration of risk increased when adding the predicted risk score based on survey data to the predicted risk score based on administrative data. RESULTS: The addition of survey data improved prediction significantly for all outcomes. In the most extreme case, the percentage of reported sexual violence victimization among the 5% of female soldiers with highest predicted risk increased from 17.5% using only administrative predictors to 29.4% adding survey predictors, a 67.9% proportional increase in prediction accuracy. Other proportional increases in concentration of risk ranged from 4.8% to 49.5% (median = 26.0%). CONCLUSIONS: Data from an ongoing New Soldier Survey could substantially improve accuracy of risk models compared to models based exclusively on administrative predictors. Depending upon the characteristics of interventions used, the increase in targeting accuracy from survey data might offset survey administration costs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12888-018-1656-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5883887 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58838872018-04-09 Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data Bernecker, Samantha L. Rosellini, Anthony J. Nock, Matthew K. Chiu, Wai Tat Gutierrez, Peter M. Hwang, Irving Joiner, Thomas E. Naifeh, James A. Sampson, Nancy A. Zaslavsky, Alan M. Stein, Murray B. Ursano, Robert J. Kessler, Ronald C. BMC Psychiatry Research Article BACKGROUND: High rates of mental disorders, suicidality, and interpersonal violence early in the military career have raised interest in implementing preventive interventions with high-risk new enlistees. The Army Study to Assess Risk and Resilience in Servicemembers (STARRS) developed risk-targeting systems for these outcomes based on machine learning methods using administrative data predictors. However, administrative data omit many risk factors, raising the question whether risk targeting could be improved by adding self-report survey data to prediction models. If so, the Army may gain from routinely administering surveys that assess additional risk factors. METHODS: The STARRS New Soldier Survey was administered to 21,790 Regular Army soldiers who agreed to have survey data linked to administrative records. As reported previously, machine learning models using administrative data as predictors found that small proportions of high-risk soldiers accounted for high proportions of negative outcomes. Other machine learning models using self-report survey data as predictors were developed previously for three of these outcomes: major physical violence and sexual violence perpetration among men and sexual violence victimization among women. Here we examined the extent to which this survey information increases prediction accuracy, over models based solely on administrative data, for those three outcomes. We used discrete-time survival analysis to estimate a series of models predicting first occurrence, assessing how model fit improved and concentration of risk increased when adding the predicted risk score based on survey data to the predicted risk score based on administrative data. RESULTS: The addition of survey data improved prediction significantly for all outcomes. In the most extreme case, the percentage of reported sexual violence victimization among the 5% of female soldiers with highest predicted risk increased from 17.5% using only administrative predictors to 29.4% adding survey predictors, a 67.9% proportional increase in prediction accuracy. Other proportional increases in concentration of risk ranged from 4.8% to 49.5% (median = 26.0%). CONCLUSIONS: Data from an ongoing New Soldier Survey could substantially improve accuracy of risk models compared to models based exclusively on administrative predictors. Depending upon the characteristics of interventions used, the increase in targeting accuracy from survey data might offset survey administration costs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12888-018-1656-4) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-03 /pmc/articles/PMC5883887/ /pubmed/29615005 http://dx.doi.org/10.1186/s12888-018-1656-4 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Bernecker, Samantha L. Rosellini, Anthony J. Nock, Matthew K. Chiu, Wai Tat Gutierrez, Peter M. Hwang, Irving Joiner, Thomas E. Naifeh, James A. Sampson, Nancy A. Zaslavsky, Alan M. Stein, Murray B. Ursano, Robert J. Kessler, Ronald C. Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data |
title | Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data |
title_full | Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data |
title_fullStr | Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data |
title_full_unstemmed | Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data |
title_short | Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data |
title_sort | improving risk prediction accuracy for new soldiers in the u.s. army by adding self-report survey data to administrative data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5883887/ https://www.ncbi.nlm.nih.gov/pubmed/29615005 http://dx.doi.org/10.1186/s12888-018-1656-4 |
work_keys_str_mv | AT berneckersamanthal improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT rosellinianthonyj improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT nockmatthewk improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT chiuwaitat improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT gutierrezpeterm improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT hwangirving improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT joinerthomase improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT naifehjamesa improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT sampsonnancya improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT zaslavskyalanm improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT steinmurrayb improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT ursanorobertj improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata AT kesslerronaldc improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata |