Cargando…

Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry

OBJECTIVES: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical...

Descripción completa

Detalles Bibliográficos
Autores principales: Gupta, Sunil, Tran, Truyen, Luo, Wei, Phung, Dinh, Kennedy, Richard Lee, Broad, Adam, Campbell, David, Kipp, David, Singh, Madhu, Khasraw, Mustafa, Matheson, Leigh, Ashley, David M, Venkatesh, Svetha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963101/
https://www.ncbi.nlm.nih.gov/pubmed/24643167
http://dx.doi.org/10.1136/bmjopen-2013-004007
_version_ 1782308473253396480
author Gupta, Sunil
Tran, Truyen
Luo, Wei
Phung, Dinh
Kennedy, Richard Lee
Broad, Adam
Campbell, David
Kipp, David
Singh, Madhu
Khasraw, Mustafa
Matheson, Leigh
Ashley, David M
Venkatesh, Svetha
author_facet Gupta, Sunil
Tran, Truyen
Luo, Wei
Phung, Dinh
Kennedy, Richard Lee
Broad, Adam
Campbell, David
Kipp, David
Singh, Madhu
Khasraw, Mustafa
Matheson, Leigh
Ashley, David M
Venkatesh, Svetha
author_sort Gupta, Sunil
collection PubMed
description OBJECTIVES: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical outcomes. SETTING: A regional cancer centre in Australia. PARTICIPANTS: Disease-specific data from a purpose-built cancer registry (Evaluation of Cancer Outcomes (ECO)) from 869 patients were used to predict survival at 6, 12 and 24 months. The model was validated with data from a further 94 patients, and results compared to the assessment of five specialist oncologists. Machine-learning prediction using ECO data was compared with that using EAR and a model combining ECO and EAR data. PRIMARY AND SECONDARY OUTCOME MEASURES: Survival prediction accuracy in terms of the area under the receiver operating characteristic curve (AUC). RESULTS: The ECO model yielded AUCs of 0.87 (95% CI 0.848 to 0.890) at 6 months, 0.796 (95% CI 0.774 to 0.823) at 12 months and 0.764 (95% CI 0.737 to 0.789) at 24 months. Each was slightly better than the performance of the clinician panel. The model performed consistently across a range of cancers, including rare cancers. Combining ECO and EAR data yielded better prediction than the ECO-based model (AUCs ranging from 0.757 to 0.997 for 6 months, AUCs from 0.689 to 0.988 for 12 months and AUCs from 0.713 to 0.973 for 24 months). The best prediction was for genitourinary, head and neck, lung, skin, and upper gastrointestinal tumours. CONCLUSIONS: Machine learning applied to information from a disease-specific (cancer) database and the EAR can be used to predict clinical outcomes. Importantly, the approach described made use of digital data that is already routinely collected but underexploited by clinical health systems.
format Online
Article
Text
id pubmed-3963101
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-39631012014-03-24 Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry Gupta, Sunil Tran, Truyen Luo, Wei Phung, Dinh Kennedy, Richard Lee Broad, Adam Campbell, David Kipp, David Singh, Madhu Khasraw, Mustafa Matheson, Leigh Ashley, David M Venkatesh, Svetha BMJ Open Oncology OBJECTIVES: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical outcomes. SETTING: A regional cancer centre in Australia. PARTICIPANTS: Disease-specific data from a purpose-built cancer registry (Evaluation of Cancer Outcomes (ECO)) from 869 patients were used to predict survival at 6, 12 and 24 months. The model was validated with data from a further 94 patients, and results compared to the assessment of five specialist oncologists. Machine-learning prediction using ECO data was compared with that using EAR and a model combining ECO and EAR data. PRIMARY AND SECONDARY OUTCOME MEASURES: Survival prediction accuracy in terms of the area under the receiver operating characteristic curve (AUC). RESULTS: The ECO model yielded AUCs of 0.87 (95% CI 0.848 to 0.890) at 6 months, 0.796 (95% CI 0.774 to 0.823) at 12 months and 0.764 (95% CI 0.737 to 0.789) at 24 months. Each was slightly better than the performance of the clinician panel. The model performed consistently across a range of cancers, including rare cancers. Combining ECO and EAR data yielded better prediction than the ECO-based model (AUCs ranging from 0.757 to 0.997 for 6 months, AUCs from 0.689 to 0.988 for 12 months and AUCs from 0.713 to 0.973 for 24 months). The best prediction was for genitourinary, head and neck, lung, skin, and upper gastrointestinal tumours. CONCLUSIONS: Machine learning applied to information from a disease-specific (cancer) database and the EAR can be used to predict clinical outcomes. Importantly, the approach described made use of digital data that is already routinely collected but underexploited by clinical health systems. BMJ Publishing Group 2014-03-15 /pmc/articles/PMC3963101/ /pubmed/24643167 http://dx.doi.org/10.1136/bmjopen-2013-004007 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Oncology
Gupta, Sunil
Tran, Truyen
Luo, Wei
Phung, Dinh
Kennedy, Richard Lee
Broad, Adam
Campbell, David
Kipp, David
Singh, Madhu
Khasraw, Mustafa
Matheson, Leigh
Ashley, David M
Venkatesh, Svetha
Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
title Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
title_full Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
title_fullStr Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
title_full_unstemmed Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
title_short Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
title_sort machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963101/
https://www.ncbi.nlm.nih.gov/pubmed/24643167
http://dx.doi.org/10.1136/bmjopen-2013-004007
work_keys_str_mv AT guptasunil machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT trantruyen machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT luowei machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT phungdinh machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT kennedyrichardlee machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT broadadam machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT campbelldavid machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT kippdavid machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT singhmadhu machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT khasrawmustafa machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT mathesonleigh machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT ashleydavidm machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry
AT venkateshsvetha machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry