Cargando…

Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing

IMPORTANCE: Predicting short- and long-term survival of patients with cancer may improve their care. Prior predictive models either use data with limited availability or predict the outcome of only 1 type of cancer. OBJECTIVE: To investigate whether natural language processing can predict survival o...

Descripción completa

Detalles Bibliográficos
Autores principales: Nunez, John-Jose, Leung, Bonnie, Ho, Cheryl, Bates, Alan T., Ng, Raymond T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Association 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9972192/
https://www.ncbi.nlm.nih.gov/pubmed/36848085
http://dx.doi.org/10.1001/jamanetworkopen.2023.0813
_version_ 1784898270878760960
author Nunez, John-Jose
Leung, Bonnie
Ho, Cheryl
Bates, Alan T.
Ng, Raymond T.
author_facet Nunez, John-Jose
Leung, Bonnie
Ho, Cheryl
Bates, Alan T.
Ng, Raymond T.
author_sort Nunez, John-Jose
collection PubMed
description IMPORTANCE: Predicting short- and long-term survival of patients with cancer may improve their care. Prior predictive models either use data with limited availability or predict the outcome of only 1 type of cancer. OBJECTIVE: To investigate whether natural language processing can predict survival of patients with general cancer from a patient’s initial oncologist consultation document. DESIGN, SETTING, AND PARTICIPANTS: This retrospective prognostic study used data from 47 625 of 59 800 patients who started cancer care at any of the 6 BC Cancer sites located in the province of British Columbia between April 1, 2011, and December 31, 2016. Mortality data were updated until April 6, 2022, and data were analyzed from update until September 30, 2022. All patients with a medical or radiation oncologist consultation document generated within 180 days of diagnosis were included; patients seen for multiple cancers were excluded. EXPOSURES: Initial oncologist consultation documents were analyzed using traditional and neural language models. MAIN OUTCOMES AND MEASURES: The primary outcome was the performance of the predictive models, including balanced accuracy and receiver operating characteristics area under the curve (AUC). The secondary outcome was investigating what words the models used. RESULTS: Of the 47 625 patients in the sample, 25 428 (53.4%) were female and 22 197 (46.6%) were male, with a mean (SD) age of 64.9 (13.7) years. A total of 41 447 patients (87.0%) survived 6 months, 31 143 (65.4%) survived 36 months, and 27 880 (58.5%) survived 60 months, calculated from their initial oncologist consultation. The best models achieved a balanced accuracy of 0.856 (AUC, 0.928) for predicting 6-month survival, 0.842 (AUC, 0.918) for 36-month survival, and 0.837 (AUC, 0.918) for 60-month survival, on a holdout test set. Differences in what words were important for predicting 6- vs 60-month survival were found. CONCLUSIONS AND RELEVANCE: These findings suggest that models performed comparably with or better than previous models predicting cancer survival and that they may be able to predict survival using readily available data without focusing on 1 cancer type.
format Online
Article
Text
id pubmed-9972192
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Medical Association
record_format MEDLINE/PubMed
spelling pubmed-99721922023-03-01 Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing Nunez, John-Jose Leung, Bonnie Ho, Cheryl Bates, Alan T. Ng, Raymond T. JAMA Netw Open Original Investigation IMPORTANCE: Predicting short- and long-term survival of patients with cancer may improve their care. Prior predictive models either use data with limited availability or predict the outcome of only 1 type of cancer. OBJECTIVE: To investigate whether natural language processing can predict survival of patients with general cancer from a patient’s initial oncologist consultation document. DESIGN, SETTING, AND PARTICIPANTS: This retrospective prognostic study used data from 47 625 of 59 800 patients who started cancer care at any of the 6 BC Cancer sites located in the province of British Columbia between April 1, 2011, and December 31, 2016. Mortality data were updated until April 6, 2022, and data were analyzed from update until September 30, 2022. All patients with a medical or radiation oncologist consultation document generated within 180 days of diagnosis were included; patients seen for multiple cancers were excluded. EXPOSURES: Initial oncologist consultation documents were analyzed using traditional and neural language models. MAIN OUTCOMES AND MEASURES: The primary outcome was the performance of the predictive models, including balanced accuracy and receiver operating characteristics area under the curve (AUC). The secondary outcome was investigating what words the models used. RESULTS: Of the 47 625 patients in the sample, 25 428 (53.4%) were female and 22 197 (46.6%) were male, with a mean (SD) age of 64.9 (13.7) years. A total of 41 447 patients (87.0%) survived 6 months, 31 143 (65.4%) survived 36 months, and 27 880 (58.5%) survived 60 months, calculated from their initial oncologist consultation. The best models achieved a balanced accuracy of 0.856 (AUC, 0.928) for predicting 6-month survival, 0.842 (AUC, 0.918) for 36-month survival, and 0.837 (AUC, 0.918) for 60-month survival, on a holdout test set. Differences in what words were important for predicting 6- vs 60-month survival were found. CONCLUSIONS AND RELEVANCE: These findings suggest that models performed comparably with or better than previous models predicting cancer survival and that they may be able to predict survival using readily available data without focusing on 1 cancer type. American Medical Association 2023-02-27 /pmc/articles/PMC9972192/ /pubmed/36848085 http://dx.doi.org/10.1001/jamanetworkopen.2023.0813 Text en Copyright 2023 Nunez JJ et al. JAMA Network Open. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the CC-BY License.
spellingShingle Original Investigation
Nunez, John-Jose
Leung, Bonnie
Ho, Cheryl
Bates, Alan T.
Ng, Raymond T.
Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing
title Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing
title_full Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing
title_fullStr Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing
title_full_unstemmed Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing
title_short Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing
title_sort predicting the survival of patients with cancer from their initial oncology consultation document using natural language processing
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9972192/
https://www.ncbi.nlm.nih.gov/pubmed/36848085
http://dx.doi.org/10.1001/jamanetworkopen.2023.0813
work_keys_str_mv AT nunezjohnjose predictingthesurvivalofpatientswithcancerfromtheirinitialoncologyconsultationdocumentusingnaturallanguageprocessing
AT leungbonnie predictingthesurvivalofpatientswithcancerfromtheirinitialoncologyconsultationdocumentusingnaturallanguageprocessing
AT hocheryl predictingthesurvivalofpatientswithcancerfromtheirinitialoncologyconsultationdocumentusingnaturallanguageprocessing
AT batesalant predictingthesurvivalofpatientswithcancerfromtheirinitialoncologyconsultationdocumentusingnaturallanguageprocessing
AT ngraymondt predictingthesurvivalofpatientswithcancerfromtheirinitialoncologyconsultationdocumentusingnaturallanguageprocessing