Cargando…

The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study

BACKGROUND: Artificial intelligence (AI) has gained tremendous popularity recently, especially the use of natural language processing (NLP). ChatGPT is a state-of-the-art chatbot capable of creating natural conversations using NLP. The use of AI in medicine can have a tremendous impact on health car...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuroiwa, Tomoyuki, Sarcon, Aida, Ibara, Takuya, Yamada, Eriku, Yamamoto, Akiko, Tsukamoto, Kazuya, Fujita, Koji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10541638/
https://www.ncbi.nlm.nih.gov/pubmed/37713254
http://dx.doi.org/10.2196/47621
_version_ 1785113938743001088
author Kuroiwa, Tomoyuki
Sarcon, Aida
Ibara, Takuya
Yamada, Eriku
Yamamoto, Akiko
Tsukamoto, Kazuya
Fujita, Koji
author_facet Kuroiwa, Tomoyuki
Sarcon, Aida
Ibara, Takuya
Yamada, Eriku
Yamamoto, Akiko
Tsukamoto, Kazuya
Fujita, Koji
author_sort Kuroiwa, Tomoyuki
collection PubMed
description BACKGROUND: Artificial intelligence (AI) has gained tremendous popularity recently, especially the use of natural language processing (NLP). ChatGPT is a state-of-the-art chatbot capable of creating natural conversations using NLP. The use of AI in medicine can have a tremendous impact on health care delivery. Although some studies have evaluated ChatGPT’s accuracy in self-diagnosis, there is no research regarding its precision and the degree to which it recommends medical consultations. OBJECTIVE: The aim of this study was to evaluate ChatGPT’s ability to accurately and precisely self-diagnose common orthopedic diseases, as well as the degree of recommendation it provides for medical consultations. METHODS: Over a 5-day course, each of the study authors submitted the same questions to ChatGPT. The conditions evaluated were carpal tunnel syndrome (CTS), cervical myelopathy (CM), lumbar spinal stenosis (LSS), knee osteoarthritis (KOA), and hip osteoarthritis (HOA). Answers were categorized as either correct, partially correct, incorrect, or a differential diagnosis. The percentage of correct answers and reproducibility were calculated. The reproducibility between days and raters were calculated using the Fleiss κ coefficient. Answers that recommended that the patient seek medical attention were recategorized according to the strength of the recommendation as defined by the study. RESULTS: The ratios of correct answers were 25/25, 1/25, 24/25, 16/25, and 17/25 for CTS, CM, LSS, KOA, and HOA, respectively. The ratios of incorrect answers were 23/25 for CM and 0/25 for all other conditions. The reproducibility between days was 1.0, 0.15, 0.7, 0.6, and 0.6 for CTS, CM, LSS, KOA, and HOA, respectively. The reproducibility between raters was 1.0, 0.1, 0.64, –0.12, and 0.04 for CTS, CM, LSS, KOA, and HOA, respectively. Among the answers recommending medical attention, the phrases “essential,” “recommended,” “best,” and “important” were used. Specifically, “essential” occurred in 4 out of 125, “recommended” in 12 out of 125, “best” in 6 out of 125, and “important” in 94 out of 125 answers. Additionally, 7 out of the 125 answers did not include a recommendation to seek medical attention. CONCLUSIONS: The accuracy and reproducibility of ChatGPT to self-diagnose five common orthopedic conditions were inconsistent. The accuracy could potentially be improved by adding symptoms that could easily identify a specific location. Only a few answers were accompanied by a strong recommendation to seek medical attention according to our study standards. Although ChatGPT could serve as a potential first step in accessing care, we found variability in accurate self-diagnosis. Given the risk of harm with self-diagnosis without medical follow-up, it would be prudent for an NLP to include clear language alerting patients to seek expert medical opinions. We hope to shed further light on the use of AI in a future clinical study.
format Online
Article
Text
id pubmed-10541638
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-105416382023-10-02 The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study Kuroiwa, Tomoyuki Sarcon, Aida Ibara, Takuya Yamada, Eriku Yamamoto, Akiko Tsukamoto, Kazuya Fujita, Koji J Med Internet Res Original Paper BACKGROUND: Artificial intelligence (AI) has gained tremendous popularity recently, especially the use of natural language processing (NLP). ChatGPT is a state-of-the-art chatbot capable of creating natural conversations using NLP. The use of AI in medicine can have a tremendous impact on health care delivery. Although some studies have evaluated ChatGPT’s accuracy in self-diagnosis, there is no research regarding its precision and the degree to which it recommends medical consultations. OBJECTIVE: The aim of this study was to evaluate ChatGPT’s ability to accurately and precisely self-diagnose common orthopedic diseases, as well as the degree of recommendation it provides for medical consultations. METHODS: Over a 5-day course, each of the study authors submitted the same questions to ChatGPT. The conditions evaluated were carpal tunnel syndrome (CTS), cervical myelopathy (CM), lumbar spinal stenosis (LSS), knee osteoarthritis (KOA), and hip osteoarthritis (HOA). Answers were categorized as either correct, partially correct, incorrect, or a differential diagnosis. The percentage of correct answers and reproducibility were calculated. The reproducibility between days and raters were calculated using the Fleiss κ coefficient. Answers that recommended that the patient seek medical attention were recategorized according to the strength of the recommendation as defined by the study. RESULTS: The ratios of correct answers were 25/25, 1/25, 24/25, 16/25, and 17/25 for CTS, CM, LSS, KOA, and HOA, respectively. The ratios of incorrect answers were 23/25 for CM and 0/25 for all other conditions. The reproducibility between days was 1.0, 0.15, 0.7, 0.6, and 0.6 for CTS, CM, LSS, KOA, and HOA, respectively. The reproducibility between raters was 1.0, 0.1, 0.64, –0.12, and 0.04 for CTS, CM, LSS, KOA, and HOA, respectively. Among the answers recommending medical attention, the phrases “essential,” “recommended,” “best,” and “important” were used. Specifically, “essential” occurred in 4 out of 125, “recommended” in 12 out of 125, “best” in 6 out of 125, and “important” in 94 out of 125 answers. Additionally, 7 out of the 125 answers did not include a recommendation to seek medical attention. CONCLUSIONS: The accuracy and reproducibility of ChatGPT to self-diagnose five common orthopedic conditions were inconsistent. The accuracy could potentially be improved by adding symptoms that could easily identify a specific location. Only a few answers were accompanied by a strong recommendation to seek medical attention according to our study standards. Although ChatGPT could serve as a potential first step in accessing care, we found variability in accurate self-diagnosis. Given the risk of harm with self-diagnosis without medical follow-up, it would be prudent for an NLP to include clear language alerting patients to seek expert medical opinions. We hope to shed further light on the use of AI in a future clinical study. JMIR Publications 2023-09-15 /pmc/articles/PMC10541638/ /pubmed/37713254 http://dx.doi.org/10.2196/47621 Text en ©Tomoyuki Kuroiwa, Aida Sarcon, Takuya Ibara, Eriku Yamada, Akiko Yamamoto, Kazuya Tsukamoto, Koji Fujita. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 15.09.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Kuroiwa, Tomoyuki
Sarcon, Aida
Ibara, Takuya
Yamada, Eriku
Yamamoto, Akiko
Tsukamoto, Kazuya
Fujita, Koji
The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study
title The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study
title_full The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study
title_fullStr The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study
title_full_unstemmed The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study
title_short The Potential of ChatGPT as a Self-Diagnostic Tool in Common Orthopedic Diseases: Exploratory Study
title_sort potential of chatgpt as a self-diagnostic tool in common orthopedic diseases: exploratory study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10541638/
https://www.ncbi.nlm.nih.gov/pubmed/37713254
http://dx.doi.org/10.2196/47621
work_keys_str_mv AT kuroiwatomoyuki thepotentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT sarconaida thepotentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT ibaratakuya thepotentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT yamadaeriku thepotentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT yamamotoakiko thepotentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT tsukamotokazuya thepotentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT fujitakoji thepotentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT kuroiwatomoyuki potentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT sarconaida potentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT ibaratakuya potentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT yamadaeriku potentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT yamamotoakiko potentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT tsukamotokazuya potentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy
AT fujitakoji potentialofchatgptasaselfdiagnostictoolincommonorthopedicdiseasesexploratorystudy