
Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models

IMPORTANCE: Nonfatal gunshot injuries are the most common firearm injury, but where they frequently occur remains unclear owing to data limitations. Natural language processing can be applied to medical text narratives of gunshot injury records to classify injury location and inform prevention effor...


Bibliographic Details
Main Author:	Parker, Susan T.
Format:	Online Article Text
Language:	English
Published:	American Medical Association 2020
Subjects:	Original Investigation
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7557517/ https://www.ncbi.nlm.nih.gov/pubmed/33052403 http://dx.doi.org/10.1001/jamanetworkopen.2020.20664

_version_	1783594437460361216
author	Parker, Susan T.
author_facet	Parker, Susan T.
author_sort	Parker, Susan T.
collection	PubMed
description	IMPORTANCE: Nonfatal gunshot injuries are the most common firearm injury, but where they frequently occur remains unclear owing to data limitations. Natural language processing can be applied to medical text narratives of gunshot injury records to classify injury location and inform prevention efforts. OBJECTIVE: To examine the performance of natural language processing (NLP) and machine learning models to predict nonfatal gunshot injury locations and generate new national estimates of the locations in which these injuries occur. DESIGN, SETTING, AND PARTICIPANTS: Cross-sectional study of data from the National Electronic Injury Surveillance System Firearm Injury Surveillance Study on nonfatal gunshot injuries that occurred in the US between January 1, 1993, and December 31, 2015. The unweighted sample included 59 025 gunshot injuries that were initially treated at emergency departments. Data were analyzed from June 1, 2019, to July 24, 2020. MAIN OUTCOMES AND MEASURES: The primary outcomes were classification of injury location and subsequent estimation of nonfatal gunshot injury location. The NLP was used to generate 6 sets of predictors, and 4 machine learning models were trained to classify the missing locations: multinomial support vector machines, lasso regression, XgBoost gradient descent, and feed-forward neural networks. For each of the 6 sets of NLP predictors, 70% of records with locations were randomly sampled to form the training set and the remaining 30% of records composed the test set. The best-performing model was validated by comparing the predicted locations with those from existing national estimates of nonfatal gunshot injuries stratified by location and intent. RESULTS: The unweighted sample included 59 025 nonfatal gunshot injuries; patients with these injuries were predominantly male (n = 52 630 [89.2%]), of Black race/ethnicity (n = 29 304 [49.6%]), and young (15-24 years; n = 27 037 [45.8%]). In total, 54 089 nonfatal gunshot injuries that were weighted to approximate national estimates were included in the analysis. Existing national estimates suggest that the most prevalent nonfatal gunshot injury location is the home (n = 14 764 [23.4%]), followed by the street or highway (n = 14 402 [22.9%]), and other public places (n = 7276 [11.6%]). After implementation of NLP classification, the most frequent gunshot injury location was the street or highway (n = 27 200 [46.1%]), followed by the home (n = 23 738 [37.7%]), and other public places (n = 10 439 [15.1%]). CONCLUSIONS AND RELEVANCE: The findings of this study suggest that NLP and machine learning models may be useful for classifying gunshot injury location and that most nonfatal gunshot injuries occur in the street or highway rather than in the home; these findings can inform future firearm injury prevention efforts.
format	Online Article Text
id	pubmed-7557517
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	American Medical Association
record_format	MEDLINE/PubMed
spelling	pubmed-7557517 2020-10-19 Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models Parker, Susan T. JAMA Netw Open Original Investigation American Medical Association 2020-10-14 /pmc/articles/PMC7557517/ /pubmed/33052403 http://dx.doi.org/10.1001/jamanetworkopen.2020.20664 Text en Copyright 2020 Parker ST. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License.
spellingShingle	Original Investigation Parker, Susan T. Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title	Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_full	Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_fullStr	Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_full_unstemmed	Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_short	Estimating Nonfatal Gunshot Injury Locations With Natural Language Processing and Machine Learning Models
title_sort	estimating nonfatal gunshot injury locations with natural language processing and machine learning models
topic	Original Investigation
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7557517/ https://www.ncbi.nlm.nih.gov/pubmed/33052403 http://dx.doi.org/10.1001/jamanetworkopen.2020.20664
work_keys_str_mv	AT parkersusant estimatingnonfatalgunshotinjurylocationswithnaturallanguageprocessingandmachinelearningmodels
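
The abstract in this record describes turning injury narratives into text predictors, randomly sampling 70% of labeled records for training, and holding out the remaining 30% as a test set. The following is a minimal sketch of that workflow only, not the author's implementation. It makes several assumptions: the narratives and labels are invented toy strings (not NEISS records), bag-of-words counts stand in for the 6 NLP predictor sets, and a multinomial naive Bayes classifier is swapped in for the 4 models named in the abstract so the example runs with the standard library alone. All function names are hypothetical.

```python
import math
import random
from collections import Counter, defaultdict

# Invented toy narratives with known locations; NOT real surveillance data.
RECORDS = [
    ("pt shot in leg while walking on street", "street"),
    ("gsw to arm drive by on highway", "street"),
    ("shot in foot standing on street corner", "street"),
    ("gsw to chest on street outside bar", "street"),
    ("shot on highway while driving car", "street"),
    ("pt cleaning gun at home shot hand", "home"),
    ("gsw to thigh at home in bedroom", "home"),
    ("shot in kitchen at home by relative", "home"),
    ("gun went off at home shot in foot", "home"),
    ("pt shot at home in living room", "home"),
]

def tokenize(text):
    """Lowercase whitespace tokenizer (assumed stand-in for the NLP step)."""
    return text.lower().split()

def split_70_30(records, seed=0):
    """Randomly assign 70% of labeled records to training, 30% to test."""
    rng = random.Random(seed)
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(round(0.7 * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]

def train_nb(train):
    """Collect class and per-class word counts for multinomial naive Bayes."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in train:
        class_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return class_counts, word_counts, vocab

def predict(model, text):
    """Return the label with the highest add-one-smoothed log posterior."""
    class_counts, word_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in class_counts.items():
        score = math.log(n / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # ignore words never seen in training
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

train, test = split_70_30(RECORDS)
model = train_nb(train)
accuracy = sum(predict(model, t) == y for t, y in test) / len(test)
print(f"train={len(train)} test={len(test)} accuracy={accuracy:.2f}")
```

In the study's setting, a model fitted this way would then be applied to records whose location is missing; the toy accuracy printed here says nothing about the performance reported in the article.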
