Cargando…
Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
OBJECTIVES: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963101/ https://www.ncbi.nlm.nih.gov/pubmed/24643167 http://dx.doi.org/10.1136/bmjopen-2013-004007 |
_version_ | 1782308473253396480 |
---|---|
author | Gupta, Sunil Tran, Truyen Luo, Wei Phung, Dinh Kennedy, Richard Lee Broad, Adam Campbell, David Kipp, David Singh, Madhu Khasraw, Mustafa Matheson, Leigh Ashley, David M Venkatesh, Svetha |
author_facet | Gupta, Sunil Tran, Truyen Luo, Wei Phung, Dinh Kennedy, Richard Lee Broad, Adam Campbell, David Kipp, David Singh, Madhu Khasraw, Mustafa Matheson, Leigh Ashley, David M Venkatesh, Svetha |
author_sort | Gupta, Sunil |
collection | PubMed |
description | OBJECTIVES: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical outcomes. SETTING: A regional cancer centre in Australia. PARTICIPANTS: Disease-specific data from a purpose-built cancer registry (Evaluation of Cancer Outcomes (ECO)) from 869 patients were used to predict survival at 6, 12 and 24 months. The model was validated with data from a further 94 patients, and results compared to the assessment of five specialist oncologists. Machine-learning prediction using ECO data was compared with that using EAR and a model combining ECO and EAR data. PRIMARY AND SECONDARY OUTCOME MEASURES: Survival prediction accuracy in terms of the area under the receiver operating characteristic curve (AUC). RESULTS: The ECO model yielded AUCs of 0.87 (95% CI 0.848 to 0.890) at 6 months, 0.796 (95% CI 0.774 to 0.823) at 12 months and 0.764 (95% CI 0.737 to 0.789) at 24 months. Each was slightly better than the performance of the clinician panel. The model performed consistently across a range of cancers, including rare cancers. Combining ECO and EAR data yielded better prediction than the ECO-based model (AUCs ranging from 0.757 to 0.997 for 6 months, AUCs from 0.689 to 0.988 for 12 months and AUCs from 0.713 to 0.973 for 24 months). The best prediction was for genitourinary, head and neck, lung, skin, and upper gastrointestinal tumours. CONCLUSIONS: Machine learning applied to information from a disease-specific (cancer) database and the EAR can be used to predict clinical outcomes. Importantly, the approach described made use of digital data that is already routinely collected but underexploited by clinical health systems. |
format | Online Article Text |
id | pubmed-3963101 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-39631012014-03-24 Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry Gupta, Sunil Tran, Truyen Luo, Wei Phung, Dinh Kennedy, Richard Lee Broad, Adam Campbell, David Kipp, David Singh, Madhu Khasraw, Mustafa Matheson, Leigh Ashley, David M Venkatesh, Svetha BMJ Open Oncology OBJECTIVES: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical outcomes. SETTING: A regional cancer centre in Australia. PARTICIPANTS: Disease-specific data from a purpose-built cancer registry (Evaluation of Cancer Outcomes (ECO)) from 869 patients were used to predict survival at 6, 12 and 24 months. The model was validated with data from a further 94 patients, and results compared to the assessment of five specialist oncologists. Machine-learning prediction using ECO data was compared with that using EAR and a model combining ECO and EAR data. PRIMARY AND SECONDARY OUTCOME MEASURES: Survival prediction accuracy in terms of the area under the receiver operating characteristic curve (AUC). RESULTS: The ECO model yielded AUCs of 0.87 (95% CI 0.848 to 0.890) at 6 months, 0.796 (95% CI 0.774 to 0.823) at 12 months and 0.764 (95% CI 0.737 to 0.789) at 24 months. Each was slightly better than the performance of the clinician panel. The model performed consistently across a range of cancers, including rare cancers. Combining ECO and EAR data yielded better prediction than the ECO-based model (AUCs ranging from 0.757 to 0.997 for 6 months, AUCs from 0.689 to 0.988 for 12 months and AUCs from 0.713 to 0.973 for 24 months). The best prediction was for genitourinary, head and neck, lung, skin, and upper gastrointestinal tumours. CONCLUSIONS: Machine learning applied to information from a disease-specific (cancer) database and the EAR can be used to predict clinical outcomes. Importantly, the approach described made use of digital data that is already routinely collected but underexploited by clinical health systems. BMJ Publishing Group 2014-03-15 /pmc/articles/PMC3963101/ /pubmed/24643167 http://dx.doi.org/10.1136/bmjopen-2013-004007 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Oncology Gupta, Sunil Tran, Truyen Luo, Wei Phung, Dinh Kennedy, Richard Lee Broad, Adam Campbell, David Kipp, David Singh, Madhu Khasraw, Mustafa Matheson, Leigh Ashley, David M Venkatesh, Svetha Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry |
title | Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry |
title_full | Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry |
title_fullStr | Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry |
title_full_unstemmed | Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry |
title_short | Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry |
title_sort | machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3963101/ https://www.ncbi.nlm.nih.gov/pubmed/24643167 http://dx.doi.org/10.1136/bmjopen-2013-004007 |
work_keys_str_mv | AT guptasunil machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT trantruyen machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT luowei machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT phungdinh machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT kennedyrichardlee machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT broadadam machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT campbelldavid machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT kippdavid machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT singhmadhu machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT khasrawmustafa machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT mathesonleigh machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT ashleydavidm machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry AT venkateshsvetha machinelearningpredictionofcancersurvivalaretrospectivestudyusingelectronicadministrativerecordsandacancerregistry |