Cargando…

Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy

OBJECTIVE: Machine learning (ML) is expected to play an increasing role within primary health care (PHC) in coming years. No peer-reviewed studies exist that evaluate the diagnostic accuracy of ML models compared to general practitioners (GPs). The aim of this study was to evaluate the diagnostic ac...

Descripción completa

Detalles Bibliográficos
Autores principales: Ellertsson, Steindor, Loftsson, Hrafn, Sigurdsson, Emil L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8725823/
https://www.ncbi.nlm.nih.gov/pubmed/34585629
http://dx.doi.org/10.1080/02813432.2021.1973255
_version_ 1784626191295053824
author Ellertsson, Steindor
Loftsson, Hrafn
Sigurdsson, Emil L.
author_facet Ellertsson, Steindor
Loftsson, Hrafn
Sigurdsson, Emil L.
author_sort Ellertsson, Steindor
collection PubMed
description OBJECTIVE: Machine learning (ML) is expected to play an increasing role within primary health care (PHC) in coming years. No peer-reviewed studies exist that evaluate the diagnostic accuracy of ML models compared to general practitioners (GPs). The aim of this study was to evaluate the diagnostic accuracy of an ML classifier on primary headache diagnoses in PHC, compare its performance to GPs, and examine the most impactful signs and symptoms when making a prediction. DESIGN: A retrospective study on diagnostic accuracy, using electronic health records from the database of the Primary Health Care Service of the Capital Area (PHCCA) in Iceland. SETTING: Fifteen primary health care centers of the PHCCA. SUBJECTS: All patients that consulted a physician, from 1 January 2006 to 30 April 2020, and received one of the selected diagnoses. MAIN OUTCOME MEASURES: Sensitivity, Specificity, Positive Predictive Value, Matthews Correlation Coefficient, Receiver Operating Characteristic (ROC) curve, and Area under the ROC curve (AUROC) score for primary headache diagnoses, as well as Shapley Additive Explanations (SHAP) values of the ML classifier. RESULTS: The classifier outperformed the GPs on all metrics except specificity. The SHAP values indicate that the classifier uses the same signs and symptoms (features) as a physician would, when distinguishing between headache diagnoses. CONCLUSION: In a retrospective comparison, the diagnostic accuracy of the ML classifier for primary headache diagnoses is superior to GPs. According to SHAP values, the ML classifier relies on the same signs and symptoms as a physician when making a diagnostic prediction. KEYPOINTS: Little is known about the diagnostic accuracy of machine learning (ML) in the context of primary health care, despite its considerable potential to aid in clinical work. This novel research sheds light on the diagnostic accuracy of ML in a clinical context, as well as the interpretation of its predictions. If the vast potential of ML is to be utilized in primary health care, its performance, safety, and inner workings need to be understood by clinicians.
format Online
Article
Text
id pubmed-8725823
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-87258232022-01-05 Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy Ellertsson, Steindor Loftsson, Hrafn Sigurdsson, Emil L. Scand J Prim Health Care Original Articles OBJECTIVE: Machine learning (ML) is expected to play an increasing role within primary health care (PHC) in coming years. No peer-reviewed studies exist that evaluate the diagnostic accuracy of ML models compared to general practitioners (GPs). The aim of this study was to evaluate the diagnostic accuracy of an ML classifier on primary headache diagnoses in PHC, compare its performance to GPs, and examine the most impactful signs and symptoms when making a prediction. DESIGN: A retrospective study on diagnostic accuracy, using electronic health records from the database of the Primary Health Care Service of the Capital Area (PHCCA) in Iceland. SETTING: Fifteen primary health care centers of the PHCCA. SUBJECTS: All patients that consulted a physician, from 1 January 2006 to 30 April 2020, and received one of the selected diagnoses. MAIN OUTCOME MEASURES: Sensitivity, Specificity, Positive Predictive Value, Matthews Correlation Coefficient, Receiver Operating Characteristic (ROC) curve, and Area under the ROC curve (AUROC) score for primary headache diagnoses, as well as Shapley Additive Explanations (SHAP) values of the ML classifier. RESULTS: The classifier outperformed the GPs on all metrics except specificity. The SHAP values indicate that the classifier uses the same signs and symptoms (features) as a physician would, when distinguishing between headache diagnoses. CONCLUSION: In a retrospective comparison, the diagnostic accuracy of the ML classifier for primary headache diagnoses is superior to GPs. According to SHAP values, the ML classifier relies on the same signs and symptoms as a physician when making a diagnostic prediction. KEYPOINTS: Little is known about the diagnostic accuracy of machine learning (ML) in the context of primary health care, despite its considerable potential to aid in clinical work. This novel research sheds light on the diagnostic accuracy of ML in a clinical context, as well as the interpretation of its predictions. If the vast potential of ML is to be utilized in primary health care, its performance, safety, and inner workings need to be understood by clinicians. Taylor & Francis 2021-09-29 /pmc/articles/PMC8725823/ /pubmed/34585629 http://dx.doi.org/10.1080/02813432.2021.1973255 Text en © 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Articles
Ellertsson, Steindor
Loftsson, Hrafn
Sigurdsson, Emil L.
Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy
title Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy
title_full Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy
title_fullStr Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy
title_full_unstemmed Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy
title_short Artificial intelligence in the GPs office: a retrospective study on diagnostic accuracy
title_sort artificial intelligence in the gps office: a retrospective study on diagnostic accuracy
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8725823/
https://www.ncbi.nlm.nih.gov/pubmed/34585629
http://dx.doi.org/10.1080/02813432.2021.1973255
work_keys_str_mv AT ellertssonsteindor artificialintelligenceinthegpsofficearetrospectivestudyondiagnosticaccuracy
AT loftssonhrafn artificialintelligenceinthegpsofficearetrospectivestudyondiagnosticaccuracy
AT sigurdssonemill artificialintelligenceinthegpsofficearetrospectivestudyondiagnosticaccuracy