Cargando…

Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting

OBJECTIVE: To assess large language models on their ability to accurately infer cancer disease response from free-text radiology reports. MATERIALS AND METHODS: We assembled 10 602 computed tomography reports from cancer patients seen at a single institution. All reports were classified into: no evi...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Ryan Shea Ying Cong, Lin, Qian, Low, Guat Hwa, Lin, Ruixi, Goh, Tzer Chew, Chang, Christopher Chu En, Lee, Fung Fung, Chan, Wei Yin, Tan, Wei Chong, Tey, Han Jieh, Leong, Fun Loon, Tan, Hong Qi, Nei, Wen Long, Chay, Wen Yee, Tai, David Wai Meng, Lai, Gillianne Geet Yi, Cheng, Lionel Tim-Ee, Wong, Fuh Yong, Chua, Matthew Chin Heng, Chua, Melvin Lee Kiang, Tan, Daniel Shao Weng, Thng, Choon Hua, Tan, Iain Bee Huat, Ng, Hwee Tou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531105/
https://www.ncbi.nlm.nih.gov/pubmed/37451682
http://dx.doi.org/10.1093/jamia/ocad133
_version_ 1785111641923256320
author Tan, Ryan Shea Ying Cong
Lin, Qian
Low, Guat Hwa
Lin, Ruixi
Goh, Tzer Chew
Chang, Christopher Chu En
Lee, Fung Fung
Chan, Wei Yin
Tan, Wei Chong
Tey, Han Jieh
Leong, Fun Loon
Tan, Hong Qi
Nei, Wen Long
Chay, Wen Yee
Tai, David Wai Meng
Lai, Gillianne Geet Yi
Cheng, Lionel Tim-Ee
Wong, Fuh Yong
Chua, Matthew Chin Heng
Chua, Melvin Lee Kiang
Tan, Daniel Shao Weng
Thng, Choon Hua
Tan, Iain Bee Huat
Ng, Hwee Tou
author_facet Tan, Ryan Shea Ying Cong
Lin, Qian
Low, Guat Hwa
Lin, Ruixi
Goh, Tzer Chew
Chang, Christopher Chu En
Lee, Fung Fung
Chan, Wei Yin
Tan, Wei Chong
Tey, Han Jieh
Leong, Fun Loon
Tan, Hong Qi
Nei, Wen Long
Chay, Wen Yee
Tai, David Wai Meng
Lai, Gillianne Geet Yi
Cheng, Lionel Tim-Ee
Wong, Fuh Yong
Chua, Matthew Chin Heng
Chua, Melvin Lee Kiang
Tan, Daniel Shao Weng
Thng, Choon Hua
Tan, Iain Bee Huat
Ng, Hwee Tou
author_sort Tan, Ryan Shea Ying Cong
collection PubMed
description OBJECTIVE: To assess large language models on their ability to accurately infer cancer disease response from free-text radiology reports. MATERIALS AND METHODS: We assembled 10 602 computed tomography reports from cancer patients seen at a single institution. All reports were classified into: no evidence of disease, partial response, stable disease, or progressive disease. We applied transformer models, a bidirectional long short-term memory model, a convolutional neural network model, and conventional machine learning methods to this task. Data augmentation using sentence permutation with consistency loss as well as prompt-based fine-tuning were used on the best-performing models. Models were validated on a hold-out test set and an external validation set based on Response Evaluation Criteria in Solid Tumors (RECIST) classifications. RESULTS: The best-performing model was the GatorTron transformer which achieved an accuracy of 0.8916 on the test set and 0.8919 on the RECIST validation set. Data augmentation further improved the accuracy to 0.8976. Prompt-based fine-tuning did not further improve accuracy but was able to reduce the number of training reports to 500 while still achieving good performance. DISCUSSION: These models could be used by researchers to derive progression-free survival in large datasets. It may also serve as a decision support tool by providing clinicians an automated second opinion of disease response. CONCLUSIONS: Large clinical language models demonstrate potential to infer cancer disease response from radiology reports at scale. Data augmentation techniques are useful to further improve performance. Prompt-based fine-tuning can significantly reduce the size of the training dataset.
format Online
Article
Text
id pubmed-10531105
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105311052023-09-28 Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting Tan, Ryan Shea Ying Cong Lin, Qian Low, Guat Hwa Lin, Ruixi Goh, Tzer Chew Chang, Christopher Chu En Lee, Fung Fung Chan, Wei Yin Tan, Wei Chong Tey, Han Jieh Leong, Fun Loon Tan, Hong Qi Nei, Wen Long Chay, Wen Yee Tai, David Wai Meng Lai, Gillianne Geet Yi Cheng, Lionel Tim-Ee Wong, Fuh Yong Chua, Matthew Chin Heng Chua, Melvin Lee Kiang Tan, Daniel Shao Weng Thng, Choon Hua Tan, Iain Bee Huat Ng, Hwee Tou J Am Med Inform Assoc Research and Applications OBJECTIVE: To assess large language models on their ability to accurately infer cancer disease response from free-text radiology reports. MATERIALS AND METHODS: We assembled 10 602 computed tomography reports from cancer patients seen at a single institution. All reports were classified into: no evidence of disease, partial response, stable disease, or progressive disease. We applied transformer models, a bidirectional long short-term memory model, a convolutional neural network model, and conventional machine learning methods to this task. Data augmentation using sentence permutation with consistency loss as well as prompt-based fine-tuning were used on the best-performing models. Models were validated on a hold-out test set and an external validation set based on Response Evaluation Criteria in Solid Tumors (RECIST) classifications. RESULTS: The best-performing model was the GatorTron transformer which achieved an accuracy of 0.8916 on the test set and 0.8919 on the RECIST validation set. Data augmentation further improved the accuracy to 0.8976. Prompt-based fine-tuning did not further improve accuracy but was able to reduce the number of training reports to 500 while still achieving good performance. DISCUSSION: These models could be used by researchers to derive progression-free survival in large datasets. It may also serve as a decision support tool by providing clinicians an automated second opinion of disease response. CONCLUSIONS: Large clinical language models demonstrate potential to infer cancer disease response from radiology reports at scale. Data augmentation techniques are useful to further improve performance. Prompt-based fine-tuning can significantly reduce the size of the training dataset. Oxford University Press 2023-07-14 /pmc/articles/PMC10531105/ /pubmed/37451682 http://dx.doi.org/10.1093/jamia/ocad133 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Tan, Ryan Shea Ying Cong
Lin, Qian
Low, Guat Hwa
Lin, Ruixi
Goh, Tzer Chew
Chang, Christopher Chu En
Lee, Fung Fung
Chan, Wei Yin
Tan, Wei Chong
Tey, Han Jieh
Leong, Fun Loon
Tan, Hong Qi
Nei, Wen Long
Chay, Wen Yee
Tai, David Wai Meng
Lai, Gillianne Geet Yi
Cheng, Lionel Tim-Ee
Wong, Fuh Yong
Chua, Matthew Chin Heng
Chua, Melvin Lee Kiang
Tan, Daniel Shao Weng
Thng, Choon Hua
Tan, Iain Bee Huat
Ng, Hwee Tou
Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
title Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
title_full Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
title_fullStr Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
title_full_unstemmed Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
title_short Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
title_sort inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531105/
https://www.ncbi.nlm.nih.gov/pubmed/37451682
http://dx.doi.org/10.1093/jamia/ocad133
work_keys_str_mv AT tanryansheayingcong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT linqian inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT lowguathwa inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT linruixi inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT gohtzerchew inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT changchristopherchuen inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT leefungfung inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT chanweiyin inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT tanweichong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT teyhanjieh inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT leongfunloon inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT tanhongqi inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT neiwenlong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT chaywenyee inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT taidavidwaimeng inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT laigilliannegeetyi inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT chenglioneltimee inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT wongfuhyong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT chuamatthewchinheng inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT chuamelvinleekiang inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT tandanielshaoweng inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT thngchoonhua inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT taniainbeehuat inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting
AT nghweetou inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting