Cargando…

Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study

BACKGROUND: Smartphones have made it possible for patients to digitally report symptoms before physical primary care visits. Using machine learning (ML), these data offer an opportunity to support decisions about the appropriate level of care (triage). OBJECTIVE: The purpose of this study was to exp...

Descripción completa

Detalles Bibliográficos
Autores principales: Entezarjou, Artin, Bonamy, Anna-Karin Edstedt, Benjaminsson, Simon, Herman, Pawel, Midlöv, Patrik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7499160/
https://www.ncbi.nlm.nih.gov/pubmed/32880578
http://dx.doi.org/10.2196/18930
_version_ 1783583659922554880
author Entezarjou, Artin
Bonamy, Anna-Karin Edstedt
Benjaminsson, Simon
Herman, Pawel
Midlöv, Patrik
author_facet Entezarjou, Artin
Bonamy, Anna-Karin Edstedt
Benjaminsson, Simon
Herman, Pawel
Midlöv, Patrik
author_sort Entezarjou, Artin
collection PubMed
description BACKGROUND: Smartphones have made it possible for patients to digitally report symptoms before physical primary care visits. Using machine learning (ML), these data offer an opportunity to support decisions about the appropriate level of care (triage). OBJECTIVE: The purpose of this study was to explore the interrater reliability between human physicians and an automated ML-based triage method. METHODS: After testing several models, a naïve Bayes triage model was created using data from digital medical histories, capable of classifying digital medical history reports as either in need of urgent physical examination or not in need of urgent physical examination. The model was tested on 300 digital medical history reports and classification was compared with the majority vote of an expert panel of 5 primary care physicians (PCPs). Reliability between raters was measured using both Cohen κ (adjusted for chance agreement) and percentage agreement (not adjusted for chance agreement). RESULTS: Interrater reliability as measured by Cohen κ was 0.17 when comparing the majority vote of the reference group with the model. Agreement was 74% (138/186) for cases judged not in need of urgent physical examination and 42% (38/90) for cases judged to be in need of urgent physical examination. No specific features linked to the model’s triage decision could be identified. Between physicians within the panel, Cohen κ was 0.2. Intrarater reliability when 1 physician retriaged 50 reports resulted in Cohen κ of 0.55. CONCLUSIONS: Low interrater and intrarater agreement in triage decisions among PCPs limits the possibility to use human decisions as a reference for ML to automate triage in primary care.
format Online
Article
Text
id pubmed-7499160
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-74991602020-09-30 Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study Entezarjou, Artin Bonamy, Anna-Karin Edstedt Benjaminsson, Simon Herman, Pawel Midlöv, Patrik JMIR Med Inform Original Paper BACKGROUND: Smartphones have made it possible for patients to digitally report symptoms before physical primary care visits. Using machine learning (ML), these data offer an opportunity to support decisions about the appropriate level of care (triage). OBJECTIVE: The purpose of this study was to explore the interrater reliability between human physicians and an automated ML-based triage method. METHODS: After testing several models, a naïve Bayes triage model was created using data from digital medical histories, capable of classifying digital medical history reports as either in need of urgent physical examination or not in need of urgent physical examination. The model was tested on 300 digital medical history reports and classification was compared with the majority vote of an expert panel of 5 primary care physicians (PCPs). Reliability between raters was measured using both Cohen κ (adjusted for chance agreement) and percentage agreement (not adjusted for chance agreement). RESULTS: Interrater reliability as measured by Cohen κ was 0.17 when comparing the majority vote of the reference group with the model. Agreement was 74% (138/186) for cases judged not in need of urgent physical examination and 42% (38/90) for cases judged to be in need of urgent physical examination. No specific features linked to the model’s triage decision could be identified. Between physicians within the panel, Cohen κ was 0.2. Intrarater reliability when 1 physician retriaged 50 reports resulted in Cohen κ of 0.55. CONCLUSIONS: Low interrater and intrarater agreement in triage decisions among PCPs limits the possibility to use human decisions as a reference for ML to automate triage in primary care. JMIR Publications 2020-09-03 /pmc/articles/PMC7499160/ /pubmed/32880578 http://dx.doi.org/10.2196/18930 Text en ©Artin Entezarjou, Anna-Karin Edstedt Bonamy, Simon Benjaminsson, Pawel Herman, Patrik Midlöv. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 03.09.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Entezarjou, Artin
Bonamy, Anna-Karin Edstedt
Benjaminsson, Simon
Herman, Pawel
Midlöv, Patrik
Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study
title Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study
title_full Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study
title_fullStr Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study
title_full_unstemmed Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study
title_short Human- Versus Machine Learning–Based Triage Using Digitalized Patient Histories in Primary Care: Comparative Study
title_sort human- versus machine learning–based triage using digitalized patient histories in primary care: comparative study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7499160/
https://www.ncbi.nlm.nih.gov/pubmed/32880578
http://dx.doi.org/10.2196/18930
work_keys_str_mv AT entezarjouartin humanversusmachinelearningbasedtriageusingdigitalizedpatienthistoriesinprimarycarecomparativestudy
AT bonamyannakarinedstedt humanversusmachinelearningbasedtriageusingdigitalizedpatienthistoriesinprimarycarecomparativestudy
AT benjaminssonsimon humanversusmachinelearningbasedtriageusingdigitalizedpatienthistoriesinprimarycarecomparativestudy
AT hermanpawel humanversusmachinelearningbasedtriageusingdigitalizedpatienthistoriesinprimarycarecomparativestudy
AT midlovpatrik humanversusmachinelearningbasedtriageusingdigitalizedpatienthistoriesinprimarycarecomparativestudy