Cargando…

2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study

BACKGROUND: Outpatient antibiotic misuse is common, yet it is difficult to identify and prevent. Novel methods are needed to better identify unnecessary antibiotic use in the outpatient setting. METHODS: The Twitter developer platform was accessed to identify Tweets describing outpatient antibiotic...

Descripción completa

Detalles Bibliográficos
Autor principal: Sullivan, Timothy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6809283/
http://dx.doi.org/10.1093/ofid/ofz360.1743
_version_ 1783461948186165248
author Sullivan, Timothy
author_facet Sullivan, Timothy
author_sort Sullivan, Timothy
collection PubMed
description BACKGROUND: Outpatient antibiotic misuse is common, yet it is difficult to identify and prevent. Novel methods are needed to better identify unnecessary antibiotic use in the outpatient setting. METHODS: The Twitter developer platform was accessed to identify Tweets describing outpatient antibiotic use in the United States between November 2018 and March 2019. Unique English-language Tweets reporting recent antibiotic use were aggregated, reviewed, and labeled as describing possible misuse or not describing misuse. Possible misuse was defined as antibiotic use for a diagnosis or symptoms for which antibiotics are not indicated based on national guidelines, or the use of antibiotics without evaluation by a healthcare provider (Figure 1). Tweets were randomly divided into training and testing sets consisting of 80% and 20% of the data, respectively. Training set Tweets were preprocessed via a natural language processing pipeline, converted into numerical vectors, and used to generate a logistic regression algorithm to predict misuse in the testing set. Analyses were performed in Python using the scikit-learn and nltk libraries. RESULTS: 4000 Tweets were included, of which 1028 were labeled as describing possible outpatient antibiotic misuse. The algorithm correctly identified Tweets describing possible antibiotic misuse in the testing set with specificity = 94%, sensitivity = 55%, PPV = 75%, NPV = 87%, and area under the ROC curve = 0.91 (Figure 2). CONCLUSION: A machine learning algorithm using Twitter data identified episodes of self-reported antibiotic misuse with good test performance, as defined by the area under the ROC curve. Analysis of Twitter data captured some episodes of antibiotic misuses, such as the use of non-prescribed antibiotics, that are not easily identified by other methods. This approach could be used to generate novel insights into the causes and extent of antibiotic misuse in the United States, and to monitor antibiotic misuse in real time. [Image: see text] [Image: see text] DISCLOSURES: All authors: No reported disclosures.
format Online
Article
Text
id pubmed-6809283
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68092832019-10-28 2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study Sullivan, Timothy Open Forum Infect Dis Abstracts BACKGROUND: Outpatient antibiotic misuse is common, yet it is difficult to identify and prevent. Novel methods are needed to better identify unnecessary antibiotic use in the outpatient setting. METHODS: The Twitter developer platform was accessed to identify Tweets describing outpatient antibiotic use in the United States between November 2018 and March 2019. Unique English-language Tweets reporting recent antibiotic use were aggregated, reviewed, and labeled as describing possible misuse or not describing misuse. Possible misuse was defined as antibiotic use for a diagnosis or symptoms for which antibiotics are not indicated based on national guidelines, or the use of antibiotics without evaluation by a healthcare provider (Figure 1). Tweets were randomly divided into training and testing sets consisting of 80% and 20% of the data, respectively. Training set Tweets were preprocessed via a natural language processing pipeline, converted into numerical vectors, and used to generate a logistic regression algorithm to predict misuse in the testing set. Analyses were performed in Python using the scikit-learn and nltk libraries. RESULTS: 4000 Tweets were included, of which 1028 were labeled as describing possible outpatient antibiotic misuse. The algorithm correctly identified Tweets describing possible antibiotic misuse in the testing set with specificity = 94%, sensitivity = 55%, PPV = 75%, NPV = 87%, and area under the ROC curve = 0.91 (Figure 2). CONCLUSION: A machine learning algorithm using Twitter data identified episodes of self-reported antibiotic misuse with good test performance, as defined by the area under the ROC curve. Analysis of Twitter data captured some episodes of antibiotic misuses, such as the use of non-prescribed antibiotics, that are not easily identified by other methods. This approach could be used to generate novel insights into the causes and extent of antibiotic misuse in the United States, and to monitor antibiotic misuse in real time. [Image: see text] [Image: see text] DISCLOSURES: All authors: No reported disclosures. Oxford University Press 2019-10-23 /pmc/articles/PMC6809283/ http://dx.doi.org/10.1093/ofid/ofz360.1743 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Infectious Diseases Society of America. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Abstracts
Sullivan, Timothy
2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study
title 2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study
title_full 2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study
title_fullStr 2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study
title_full_unstemmed 2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study
title_short 2063. Using Twitter Data and Machine Learning to Identify Outpatient Antibiotic Misuse: A Proof-of-Concept Study
title_sort 2063. using twitter data and machine learning to identify outpatient antibiotic misuse: a proof-of-concept study
topic Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6809283/
http://dx.doi.org/10.1093/ofid/ofz360.1743
work_keys_str_mv AT sullivantimothy 2063usingtwitterdataandmachinelearningtoidentifyoutpatientantibioticmisuseaproofofconceptstudy