Cargando…

Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma

BACKGROUND/AIMS: Patients with cirrhosis and hepatocellular carcinoma (HCC) require extensive and personalized care to improve outcomes. ChatGPT (Generative Pre-trained Transformer), a large language model, holds the potential to provide professional yet patient-friendly support. We aimed to examine...

Descripción completa

Detalles Bibliográficos
Autores principales: Yeo, Yee Hui, Samaan, Jamil S., Ng, Wee Han, Ting, Peng-Sheng, Trivedi, Hirsh, Vipani, Aarshi, Ayoub, Walid, Yang, Ju Dong, Liran, Omer, Spiegel, Brennan, Kuo, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Korean Association for the Study of the Liver 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10366809/
https://www.ncbi.nlm.nih.gov/pubmed/36946005
http://dx.doi.org/10.3350/cmh.2023.0089
_version_ 1785077251086221312
author Yeo, Yee Hui
Samaan, Jamil S.
Ng, Wee Han
Ting, Peng-Sheng
Trivedi, Hirsh
Vipani, Aarshi
Ayoub, Walid
Yang, Ju Dong
Liran, Omer
Spiegel, Brennan
Kuo, Alexander
author_facet Yeo, Yee Hui
Samaan, Jamil S.
Ng, Wee Han
Ting, Peng-Sheng
Trivedi, Hirsh
Vipani, Aarshi
Ayoub, Walid
Yang, Ju Dong
Liran, Omer
Spiegel, Brennan
Kuo, Alexander
author_sort Yeo, Yee Hui
collection PubMed
description BACKGROUND/AIMS: Patients with cirrhosis and hepatocellular carcinoma (HCC) require extensive and personalized care to improve outcomes. ChatGPT (Generative Pre-trained Transformer), a large language model, holds the potential to provide professional yet patient-friendly support. We aimed to examine the accuracy and reproducibility of ChatGPT in answering questions regarding knowledge, management, and emotional support for cirrhosis and HCC. METHODS: ChatGPT’s responses to 164 questions were independently graded by two transplant hepatologists and resolved by a third reviewer. The performance of ChatGPT was also assessed using two published questionnaires and 26 questions formulated from the quality measures of cirrhosis management. Finally, its emotional support capacity was tested. RESULTS: We showed that ChatGPT regurgitated extensive knowledge of cirrhosis (79.1% correct) and HCC (74.0% correct), but only small proportions (47.3% in cirrhosis, 41.1% in HCC) were labeled as comprehensive. The performance was better in basic knowledge, lifestyle, and treatment than in the domains of diagnosis and preventive medicine. For the quality measures, the model answered 76.9% of questions correctly but failed to specify decision-making cut-offs and treatment durations. ChatGPT lacked knowledge of regional guidelines variations, such as HCC screening criteria. However, it provided practical and multifaceted advice to patients and caregivers regarding the next steps and adjusting to a new diagnosis. CONCLUSIONS: We analyzed the areas of robustness and limitations of ChatGPT’s responses on the management of cirrhosis and HCC and relevant emotional support. ChatGPT may have a role as an adjunct informational tool for patients and physicians to improve outcomes.
format Online
Article
Text
id pubmed-10366809
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Korean Association for the Study of the Liver
record_format MEDLINE/PubMed
spelling pubmed-103668092023-07-26 Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma Yeo, Yee Hui Samaan, Jamil S. Ng, Wee Han Ting, Peng-Sheng Trivedi, Hirsh Vipani, Aarshi Ayoub, Walid Yang, Ju Dong Liran, Omer Spiegel, Brennan Kuo, Alexander Clin Mol Hepatol Original Article BACKGROUND/AIMS: Patients with cirrhosis and hepatocellular carcinoma (HCC) require extensive and personalized care to improve outcomes. ChatGPT (Generative Pre-trained Transformer), a large language model, holds the potential to provide professional yet patient-friendly support. We aimed to examine the accuracy and reproducibility of ChatGPT in answering questions regarding knowledge, management, and emotional support for cirrhosis and HCC. METHODS: ChatGPT’s responses to 164 questions were independently graded by two transplant hepatologists and resolved by a third reviewer. The performance of ChatGPT was also assessed using two published questionnaires and 26 questions formulated from the quality measures of cirrhosis management. Finally, its emotional support capacity was tested. RESULTS: We showed that ChatGPT regurgitated extensive knowledge of cirrhosis (79.1% correct) and HCC (74.0% correct), but only small proportions (47.3% in cirrhosis, 41.1% in HCC) were labeled as comprehensive. The performance was better in basic knowledge, lifestyle, and treatment than in the domains of diagnosis and preventive medicine. For the quality measures, the model answered 76.9% of questions correctly but failed to specify decision-making cut-offs and treatment durations. ChatGPT lacked knowledge of regional guidelines variations, such as HCC screening criteria. However, it provided practical and multifaceted advice to patients and caregivers regarding the next steps and adjusting to a new diagnosis. CONCLUSIONS: We analyzed the areas of robustness and limitations of ChatGPT’s responses on the management of cirrhosis and HCC and relevant emotional support. ChatGPT may have a role as an adjunct informational tool for patients and physicians to improve outcomes. The Korean Association for the Study of the Liver 2023-07 2023-03-22 /pmc/articles/PMC10366809/ /pubmed/36946005 http://dx.doi.org/10.3350/cmh.2023.0089 Text en Copyright © 2023 by The Korean Association for the Study of the Liver https://creativecommons.org/licenses/by-nc/3.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Yeo, Yee Hui
Samaan, Jamil S.
Ng, Wee Han
Ting, Peng-Sheng
Trivedi, Hirsh
Vipani, Aarshi
Ayoub, Walid
Yang, Ju Dong
Liran, Omer
Spiegel, Brennan
Kuo, Alexander
Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma
title Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma
title_full Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma
title_fullStr Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma
title_full_unstemmed Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma
title_short Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma
title_sort assessing the performance of chatgpt in answering questions regarding cirrhosis and hepatocellular carcinoma
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10366809/
https://www.ncbi.nlm.nih.gov/pubmed/36946005
http://dx.doi.org/10.3350/cmh.2023.0089
work_keys_str_mv AT yeoyeehui assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT samaanjamils assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT ngweehan assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT tingpengsheng assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT trivedihirsh assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT vipaniaarshi assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT ayoubwalid assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT yangjudong assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT liranomer assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT spiegelbrennan assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma
AT kuoalexander assessingtheperformanceofchatgptinansweringquestionsregardingcirrhosisandhepatocellularcarcinoma