Cargando…
Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting
OBJECTIVE: To assess large language models on their ability to accurately infer cancer disease response from free-text radiology reports. MATERIALS AND METHODS: We assembled 10 602 computed tomography reports from cancer patients seen at a single institution. All reports were classified into: no evi...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531105/ https://www.ncbi.nlm.nih.gov/pubmed/37451682 http://dx.doi.org/10.1093/jamia/ocad133 |
_version_ | 1785111641923256320 |
---|---|
author | Tan, Ryan Shea Ying Cong Lin, Qian Low, Guat Hwa Lin, Ruixi Goh, Tzer Chew Chang, Christopher Chu En Lee, Fung Fung Chan, Wei Yin Tan, Wei Chong Tey, Han Jieh Leong, Fun Loon Tan, Hong Qi Nei, Wen Long Chay, Wen Yee Tai, David Wai Meng Lai, Gillianne Geet Yi Cheng, Lionel Tim-Ee Wong, Fuh Yong Chua, Matthew Chin Heng Chua, Melvin Lee Kiang Tan, Daniel Shao Weng Thng, Choon Hua Tan, Iain Bee Huat Ng, Hwee Tou |
author_facet | Tan, Ryan Shea Ying Cong Lin, Qian Low, Guat Hwa Lin, Ruixi Goh, Tzer Chew Chang, Christopher Chu En Lee, Fung Fung Chan, Wei Yin Tan, Wei Chong Tey, Han Jieh Leong, Fun Loon Tan, Hong Qi Nei, Wen Long Chay, Wen Yee Tai, David Wai Meng Lai, Gillianne Geet Yi Cheng, Lionel Tim-Ee Wong, Fuh Yong Chua, Matthew Chin Heng Chua, Melvin Lee Kiang Tan, Daniel Shao Weng Thng, Choon Hua Tan, Iain Bee Huat Ng, Hwee Tou |
author_sort | Tan, Ryan Shea Ying Cong |
collection | PubMed |
description | OBJECTIVE: To assess large language models on their ability to accurately infer cancer disease response from free-text radiology reports. MATERIALS AND METHODS: We assembled 10 602 computed tomography reports from cancer patients seen at a single institution. All reports were classified into: no evidence of disease, partial response, stable disease, or progressive disease. We applied transformer models, a bidirectional long short-term memory model, a convolutional neural network model, and conventional machine learning methods to this task. Data augmentation using sentence permutation with consistency loss as well as prompt-based fine-tuning were used on the best-performing models. Models were validated on a hold-out test set and an external validation set based on Response Evaluation Criteria in Solid Tumors (RECIST) classifications. RESULTS: The best-performing model was the GatorTron transformer which achieved an accuracy of 0.8916 on the test set and 0.8919 on the RECIST validation set. Data augmentation further improved the accuracy to 0.8976. Prompt-based fine-tuning did not further improve accuracy but was able to reduce the number of training reports to 500 while still achieving good performance. DISCUSSION: These models could be used by researchers to derive progression-free survival in large datasets. It may also serve as a decision support tool by providing clinicians an automated second opinion of disease response. CONCLUSIONS: Large clinical language models demonstrate potential to infer cancer disease response from radiology reports at scale. Data augmentation techniques are useful to further improve performance. Prompt-based fine-tuning can significantly reduce the size of the training dataset. |
format | Online Article Text |
id | pubmed-10531105 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-105311052023-09-28 Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting Tan, Ryan Shea Ying Cong Lin, Qian Low, Guat Hwa Lin, Ruixi Goh, Tzer Chew Chang, Christopher Chu En Lee, Fung Fung Chan, Wei Yin Tan, Wei Chong Tey, Han Jieh Leong, Fun Loon Tan, Hong Qi Nei, Wen Long Chay, Wen Yee Tai, David Wai Meng Lai, Gillianne Geet Yi Cheng, Lionel Tim-Ee Wong, Fuh Yong Chua, Matthew Chin Heng Chua, Melvin Lee Kiang Tan, Daniel Shao Weng Thng, Choon Hua Tan, Iain Bee Huat Ng, Hwee Tou J Am Med Inform Assoc Research and Applications OBJECTIVE: To assess large language models on their ability to accurately infer cancer disease response from free-text radiology reports. MATERIALS AND METHODS: We assembled 10 602 computed tomography reports from cancer patients seen at a single institution. All reports were classified into: no evidence of disease, partial response, stable disease, or progressive disease. We applied transformer models, a bidirectional long short-term memory model, a convolutional neural network model, and conventional machine learning methods to this task. Data augmentation using sentence permutation with consistency loss as well as prompt-based fine-tuning were used on the best-performing models. Models were validated on a hold-out test set and an external validation set based on Response Evaluation Criteria in Solid Tumors (RECIST) classifications. RESULTS: The best-performing model was the GatorTron transformer which achieved an accuracy of 0.8916 on the test set and 0.8919 on the RECIST validation set. Data augmentation further improved the accuracy to 0.8976. Prompt-based fine-tuning did not further improve accuracy but was able to reduce the number of training reports to 500 while still achieving good performance. DISCUSSION: These models could be used by researchers to derive progression-free survival in large datasets. It may also serve as a decision support tool by providing clinicians an automated second opinion of disease response. CONCLUSIONS: Large clinical language models demonstrate potential to infer cancer disease response from radiology reports at scale. Data augmentation techniques are useful to further improve performance. Prompt-based fine-tuning can significantly reduce the size of the training dataset. Oxford University Press 2023-07-14 /pmc/articles/PMC10531105/ /pubmed/37451682 http://dx.doi.org/10.1093/jamia/ocad133 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research and Applications Tan, Ryan Shea Ying Cong Lin, Qian Low, Guat Hwa Lin, Ruixi Goh, Tzer Chew Chang, Christopher Chu En Lee, Fung Fung Chan, Wei Yin Tan, Wei Chong Tey, Han Jieh Leong, Fun Loon Tan, Hong Qi Nei, Wen Long Chay, Wen Yee Tai, David Wai Meng Lai, Gillianne Geet Yi Cheng, Lionel Tim-Ee Wong, Fuh Yong Chua, Matthew Chin Heng Chua, Melvin Lee Kiang Tan, Daniel Shao Weng Thng, Choon Hua Tan, Iain Bee Huat Ng, Hwee Tou Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting |
title | Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting |
title_full | Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting |
title_fullStr | Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting |
title_full_unstemmed | Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting |
title_short | Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting |
title_sort | inferring cancer disease response from radiology reports using large language models with data augmentation and prompting |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531105/ https://www.ncbi.nlm.nih.gov/pubmed/37451682 http://dx.doi.org/10.1093/jamia/ocad133 |
work_keys_str_mv | AT tanryansheayingcong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT linqian inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT lowguathwa inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT linruixi inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT gohtzerchew inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT changchristopherchuen inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT leefungfung inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT chanweiyin inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT tanweichong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT teyhanjieh inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT leongfunloon inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT tanhongqi inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT neiwenlong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT chaywenyee inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT taidavidwaimeng inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT laigilliannegeetyi inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT chenglioneltimee inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT wongfuhyong inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT chuamatthewchinheng inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT chuamelvinleekiang inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT tandanielshaoweng inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT thngchoonhua inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT taniainbeehuat inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting AT nghweetou inferringcancerdiseaseresponsefromradiologyreportsusinglargelanguagemodelswithdataaugmentationandprompting |