Cargando…
Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4
BACKGROUND: The spread of artificial intelligence (AI) has led to transformative advancements in diverse sectors, including healthcare. Specifically, generative writing systems have shown potential in various applications, but their effectiveness in clinical settings has been barely investigated. In...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10684473/ https://www.ncbi.nlm.nih.gov/pubmed/38015298 http://dx.doi.org/10.1186/s10195-023-00740-4 |
_version_ | 1785151407410642944 |
---|---|
author | Pagano, Stefano Holzapfel, Sabrina Kappenschneider, Tobias Meyer, Matthias Maderbacher, Günther Grifka, Joachim Holzapfel, Dominik Emanuel |
author_facet | Pagano, Stefano Holzapfel, Sabrina Kappenschneider, Tobias Meyer, Matthias Maderbacher, Günther Grifka, Joachim Holzapfel, Dominik Emanuel |
author_sort | Pagano, Stefano |
collection | PubMed |
description | BACKGROUND: The spread of artificial intelligence (AI) has led to transformative advancements in diverse sectors, including healthcare. Specifically, generative writing systems have shown potential in various applications, but their effectiveness in clinical settings has been barely investigated. In this context, we evaluated the proficiency of ChatGPT-4 in diagnosing gonarthrosis and coxarthrosis and recommending appropriate treatments compared with orthopaedic specialists. METHODS: A retrospective review was conducted using anonymized medical records of 100 patients previously diagnosed with either knee or hip arthrosis. ChatGPT-4 was employed to analyse these historical records, formulating both a diagnosis and potential treatment suggestions. Subsequently, a comparative analysis was conducted to assess the concordance between the AI’s conclusions and the original clinical decisions made by the physicians. RESULTS: In diagnostic evaluations, ChatGPT-4 consistently aligned with the conclusions previously drawn by physicians. In terms of treatment recommendations, there was an 83% agreement between the AI and orthopaedic specialists. The therapeutic concordance was verified by the calculation of a Cohen’s Kappa coefficient of 0.580 (p < 0.001). This indicates a moderate-to-good level of agreement. In recommendations pertaining to surgical treatment, the AI demonstrated a sensitivity and specificity of 78% and 80%, respectively. Multivariable logistic regression demonstrated that the variables reduced quality of life (OR 49.97, p < 0.001) and start-up pain (OR 12.54, p = 0.028) have an influence on ChatGPT-4’s recommendation for a surgery. CONCLUSION: This study emphasises ChatGPT-4’s notable potential in diagnosing conditions such as gonarthrosis and coxarthrosis and in aligning its treatment recommendations with those of orthopaedic specialists. However, it is crucial to acknowledge that AI tools such as ChatGPT-4 are not meant to replace the nuanced expertise and clinical judgment of seasoned orthopaedic surgeons, particularly in complex decision-making scenarios regarding treatment indications. Due to the exploratory nature of the study, further research with larger patient populations and more complex diagnoses is necessary to validate the findings and explore the broader potential of AI in healthcare. Level of Evidence: Level III evidence. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s10195-023-00740-4. |
format | Online Article Text |
id | pubmed-10684473 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-106844732023-11-30 Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4 Pagano, Stefano Holzapfel, Sabrina Kappenschneider, Tobias Meyer, Matthias Maderbacher, Günther Grifka, Joachim Holzapfel, Dominik Emanuel J Orthop Traumatol Original Article BACKGROUND: The spread of artificial intelligence (AI) has led to transformative advancements in diverse sectors, including healthcare. Specifically, generative writing systems have shown potential in various applications, but their effectiveness in clinical settings has been barely investigated. In this context, we evaluated the proficiency of ChatGPT-4 in diagnosing gonarthrosis and coxarthrosis and recommending appropriate treatments compared with orthopaedic specialists. METHODS: A retrospective review was conducted using anonymized medical records of 100 patients previously diagnosed with either knee or hip arthrosis. ChatGPT-4 was employed to analyse these historical records, formulating both a diagnosis and potential treatment suggestions. Subsequently, a comparative analysis was conducted to assess the concordance between the AI’s conclusions and the original clinical decisions made by the physicians. RESULTS: In diagnostic evaluations, ChatGPT-4 consistently aligned with the conclusions previously drawn by physicians. In terms of treatment recommendations, there was an 83% agreement between the AI and orthopaedic specialists. The therapeutic concordance was verified by the calculation of a Cohen’s Kappa coefficient of 0.580 (p < 0.001). This indicates a moderate-to-good level of agreement. In recommendations pertaining to surgical treatment, the AI demonstrated a sensitivity and specificity of 78% and 80%, respectively. Multivariable logistic regression demonstrated that the variables reduced quality of life (OR 49.97, p < 0.001) and start-up pain (OR 12.54, p = 0.028) have an influence on ChatGPT-4’s recommendation for a surgery. CONCLUSION: This study emphasises ChatGPT-4’s notable potential in diagnosing conditions such as gonarthrosis and coxarthrosis and in aligning its treatment recommendations with those of orthopaedic specialists. However, it is crucial to acknowledge that AI tools such as ChatGPT-4 are not meant to replace the nuanced expertise and clinical judgment of seasoned orthopaedic surgeons, particularly in complex decision-making scenarios regarding treatment indications. Due to the exploratory nature of the study, further research with larger patient populations and more complex diagnoses is necessary to validate the findings and explore the broader potential of AI in healthcare. Level of Evidence: Level III evidence. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s10195-023-00740-4. Springer International Publishing 2023-11-28 2023-12 /pmc/articles/PMC10684473/ /pubmed/38015298 http://dx.doi.org/10.1186/s10195-023-00740-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Article Pagano, Stefano Holzapfel, Sabrina Kappenschneider, Tobias Meyer, Matthias Maderbacher, Günther Grifka, Joachim Holzapfel, Dominik Emanuel Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4 |
title | Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4 |
title_full | Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4 |
title_fullStr | Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4 |
title_full_unstemmed | Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4 |
title_short | Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4 |
title_sort | arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative ai model gpt-4 |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10684473/ https://www.ncbi.nlm.nih.gov/pubmed/38015298 http://dx.doi.org/10.1186/s10195-023-00740-4 |
work_keys_str_mv | AT paganostefano arthrosisdiagnosisandtreatmentrecommendationsinclinicalpracticeanexploratoryinvestigationwiththegenerativeaimodelgpt4 AT holzapfelsabrina arthrosisdiagnosisandtreatmentrecommendationsinclinicalpracticeanexploratoryinvestigationwiththegenerativeaimodelgpt4 AT kappenschneidertobias arthrosisdiagnosisandtreatmentrecommendationsinclinicalpracticeanexploratoryinvestigationwiththegenerativeaimodelgpt4 AT meyermatthias arthrosisdiagnosisandtreatmentrecommendationsinclinicalpracticeanexploratoryinvestigationwiththegenerativeaimodelgpt4 AT maderbachergunther arthrosisdiagnosisandtreatmentrecommendationsinclinicalpracticeanexploratoryinvestigationwiththegenerativeaimodelgpt4 AT grifkajoachim arthrosisdiagnosisandtreatmentrecommendationsinclinicalpracticeanexploratoryinvestigationwiththegenerativeaimodelgpt4 AT holzapfeldominikemanuel arthrosisdiagnosisandtreatmentrecommendationsinclinicalpracticeanexploratoryinvestigationwiththegenerativeaimodelgpt4 |