Cargando…

Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models

IMPORTANCE: Nonfatal gunshot injuries are the most common firearm injury, but where they frequently occur remains unclear owing to data limitations. Natural language processing can be applied to medical text narratives of gunshot injury records to classify injury location and inform prevention effor...

Descripción completa

Detalles Bibliográficos
Autor principal: Parker, Susan T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Association 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7557517/
https://www.ncbi.nlm.nih.gov/pubmed/33052403
http://dx.doi.org/10.1001/jamanetworkopen.2020.20664
_version_ 1783594437460361216
author Parker, Susan T.
author_facet Parker, Susan T.
author_sort Parker, Susan T.
collection PubMed
description IMPORTANCE: Nonfatal gunshot injuries are the most common firearm injury, but where they frequently occur remains unclear owing to data limitations. Natural language processing can be applied to medical text narratives of gunshot injury records to classify injury location and inform prevention efforts. OBJECTIVE: To examine the performance of natural language processing (NLP) and machine learning models to predict nonfatal gunshot injury locations and generate new national estimates of the locations in which these injuries occur. DESIGN, SETTING, AND PARTICIPANTS: Cross-sectional study of data from the National Electronic Injury Surveillance System Firearm Injury Surveillance Study on nonfatal gunshot injuries that occurred in the US between January 1, 1993, and December 31, 2015. The unweighted sample included 59 025 gunshot injuries that were initially treated at emergency departments. Data were analyzed from June 1, 2019 to July 24, 2020. MAIN OUTCOMES AND MEASURES: The primary outcomes were classification of injury location and subsequent estimation of nonfatal gunshot injury location. The NLP was used to generate 6 sets of predictors, and 4 machine learning models were trained to classify the missing locations: multinomial support vector machines, lasso regression, XgBoost gradient descent, and feed-forward neural networks. For each of the 6 sets of NLP predictors, 70% of records with locations were randomly sampled to form the training set and the remaining 30% of records composed the test set. The best-performing model was validated by comparing the predicted locations were with those from existing national estimates of nonfatal gunshot injuries stratified by location and intent. RESULTS: The unweighted sample included 59 025 nonfatal gunshot injuries; patients with these injuries were predominantly male (n = 52 630, [89.2%]), of Black race/ethnicity (n = 29 304 [49.6%]), and young (15-24 years; n = 27 037 [45.8%]). In total, 54 089 nonfatal gunshot injuries that were weighted to approximate national estimates were included in the analysis. Existing national estimates suggest that the most prevalent nonfatal gunshot injury location is the home (n = 14 764 [23.4%]), followed by the street or highway (n = 14 402 [22.9%]), and other public places (n = 7276 [11.6%]). After implementation of NLP classification, the most frequent gunshot injury location was the street or highway (n = 27 200 [46.1%]), followed by the home (n = 23 738 [37.7%]), and other public places (n = 10 439 [15.1%]). CONCLUSIONS AND RELEVANCE: The findings of this study suggest that NLP and machine learning models may be useful for classifying gunshot injury location and that most nonfatal gunshot injuries occur in the street or highway rather than in the home; these findings can inform future firearm injury prevention efforts.
format Online
Article
Text
id pubmed-7557517
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Medical Association
record_format MEDLINE/PubMed
spelling pubmed-75575172020-10-19 Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models Parker, Susan T. JAMA Netw Open Original Investigation IMPORTANCE: Nonfatal gunshot injuries are the most common firearm injury, but where they frequently occur remains unclear owing to data limitations. Natural language processing can be applied to medical text narratives of gunshot injury records to classify injury location and inform prevention efforts. OBJECTIVE: To examine the performance of natural language processing (NLP) and machine learning models to predict nonfatal gunshot injury locations and generate new national estimates of the locations in which these injuries occur. DESIGN, SETTING, AND PARTICIPANTS: Cross-sectional study of data from the National Electronic Injury Surveillance System Firearm Injury Surveillance Study on nonfatal gunshot injuries that occurred in the US between January 1, 1993, and December 31, 2015. The unweighted sample included 59 025 gunshot injuries that were initially treated at emergency departments. Data were analyzed from June 1, 2019 to July 24, 2020. MAIN OUTCOMES AND MEASURES: The primary outcomes were classification of injury location and subsequent estimation of nonfatal gunshot injury location. The NLP was used to generate 6 sets of predictors, and 4 machine learning models were trained to classify the missing locations: multinomial support vector machines, lasso regression, XgBoost gradient descent, and feed-forward neural networks. For each of the 6 sets of NLP predictors, 70% of records with locations were randomly sampled to form the training set and the remaining 30% of records composed the test set. The best-performing model was validated by comparing the predicted locations were with those from existing national estimates of nonfatal gunshot injuries stratified by location and intent. RESULTS: The unweighted sample included 59 025 nonfatal gunshot injuries; patients with these injuries were predominantly male (n = 52 630, [89.2%]), of Black race/ethnicity (n = 29 304 [49.6%]), and young (15-24 years; n = 27 037 [45.8%]). In total, 54 089 nonfatal gunshot injuries that were weighted to approximate national estimates were included in the analysis. Existing national estimates suggest that the most prevalent nonfatal gunshot injury location is the home (n = 14 764 [23.4%]), followed by the street or highway (n = 14 402 [22.9%]), and other public places (n = 7276 [11.6%]). After implementation of NLP classification, the most frequent gunshot injury location was the street or highway (n = 27 200 [46.1%]), followed by the home (n = 23 738 [37.7%]), and other public places (n = 10 439 [15.1%]). CONCLUSIONS AND RELEVANCE: The findings of this study suggest that NLP and machine learning models may be useful for classifying gunshot injury location and that most nonfatal gunshot injuries occur in the street or highway rather than in the home; these findings can inform future firearm injury prevention efforts. American Medical Association 2020-10-14 /pmc/articles/PMC7557517/ /pubmed/33052403 http://dx.doi.org/10.1001/jamanetworkopen.2020.20664 Text en Copyright 2020 Parker ST. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License.
spellingShingle Original Investigation
Parker, Susan T.
Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_full Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_fullStr Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_full_unstemmed Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_short Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_sort estimating nonfatal gunshot injury locations with natural language processing and machine learning models
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7557517/
https://www.ncbi.nlm.nih.gov/pubmed/33052403
http://dx.doi.org/10.1001/jamanetworkopen.2020.20664
work_keys_str_mv AT parkersusant estimatingnonfatalgunshotinjurylocationswithnaturallanguageprocessingandmachinelearningmodels