Cargando…

Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study

BACKGROUND: Current online prognostic prediction models for breast cancer, such as Adjuvant! Online and PREDICT, are based on specific populations. They have been well validated and widely used in the United States and Western Europe; however, several validation attempts in non-European countries ha...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Xiaorong, Luo, Ting, Deng, Ling, Liu, Pei, Hu, Kejia, Lu, Donghao, Zheng, Dan, Luo, Chuanxu, Xie, Yuxin, Li, Jiayuan, He, Ping, Pu, Tianjie, Ye, Feng, Bu, Hong, Fu, Bo, Zheng, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7683252/
https://www.ncbi.nlm.nih.gov/pubmed/33164899
http://dx.doi.org/10.2196/19069
_version_ 1783612840358182912
author Zhong, Xiaorong
Luo, Ting
Deng, Ling
Liu, Pei
Hu, Kejia
Lu, Donghao
Zheng, Dan
Luo, Chuanxu
Xie, Yuxin
Li, Jiayuan
He, Ping
Pu, Tianjie
Ye, Feng
Bu, Hong
Fu, Bo
Zheng, Hong
author_facet Zhong, Xiaorong
Luo, Ting
Deng, Ling
Liu, Pei
Hu, Kejia
Lu, Donghao
Zheng, Dan
Luo, Chuanxu
Xie, Yuxin
Li, Jiayuan
He, Ping
Pu, Tianjie
Ye, Feng
Bu, Hong
Fu, Bo
Zheng, Hong
author_sort Zhong, Xiaorong
collection PubMed
description BACKGROUND: Current online prognostic prediction models for breast cancer, such as Adjuvant! Online and PREDICT, are based on specific populations. They have been well validated and widely used in the United States and Western Europe; however, several validation attempts in non-European countries have revealed suboptimal predictions. OBJECTIVE: We aimed to develop an advanced breast cancer prognosis model for disease progression, cancer-specific mortality, and all-cause mortality by integrating tumor, demographic, and treatment characteristics from a large breast cancer cohort in China. METHODS: This study was approved by the Clinical Test and Biomedical Ethics Committee of West China Hospital, Sichuan University on May 17, 2012. Data collection for this project was started in May 2017 and ended in March 2019. Data on 5293 women diagnosed with stage I to III invasive breast cancer between 2000 and 2013 were collected. Disease progression, cancer-specific mortality, all-cause mortality, and the likelihood of disease progression or death within a 5-year period were predicted. Extreme gradient boosting was used to develop the prediction model. Model performance was assessed by calculating the area under the receiver operating characteristic curve (AUROC), and the model was calibrated and compared with PREDICT. RESULTS: The training, test, and validation sets comprised 3276 (499 progressions, 202 breast cancer-specific deaths, and 261 all-cause deaths within 5-year follow-up), 1405 (211 progressions, 94 breast cancer-specific deaths, and 129 all-cause deaths), and 612 (109 progressions, 33 breast cancer-specific deaths, and 37 all-cause deaths) women, respectively. The AUROC values for disease progression, cancer-specific mortality, and all-cause mortality were 0.76, 0.88, and 0.82 for training set; 0.79, 0.80, and 0.83 for the test set; and 0.79, 0.84, and 0.88 for the validation set, respectively. Calibration analysis demonstrated good agreement between predicted and observed events within 5 years. Comparable AUROC and calibration results were confirmed in different age, residence status, and receptor status subgroups. Compared with PREDICT, our model showed similar AUROC and improved calibration values. CONCLUSIONS: Our prognostic model exhibits high discrimination and good calibration. It may facilitate prognosis prediction and clinical decision making for patients with breast cancer in China.
format Online
Article
Text
id pubmed-7683252
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-76832522020-11-27 Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study Zhong, Xiaorong Luo, Ting Deng, Ling Liu, Pei Hu, Kejia Lu, Donghao Zheng, Dan Luo, Chuanxu Xie, Yuxin Li, Jiayuan He, Ping Pu, Tianjie Ye, Feng Bu, Hong Fu, Bo Zheng, Hong JMIR Med Inform Original Paper BACKGROUND: Current online prognostic prediction models for breast cancer, such as Adjuvant! Online and PREDICT, are based on specific populations. They have been well validated and widely used in the United States and Western Europe; however, several validation attempts in non-European countries have revealed suboptimal predictions. OBJECTIVE: We aimed to develop an advanced breast cancer prognosis model for disease progression, cancer-specific mortality, and all-cause mortality by integrating tumor, demographic, and treatment characteristics from a large breast cancer cohort in China. METHODS: This study was approved by the Clinical Test and Biomedical Ethics Committee of West China Hospital, Sichuan University on May 17, 2012. Data collection for this project was started in May 2017 and ended in March 2019. Data on 5293 women diagnosed with stage I to III invasive breast cancer between 2000 and 2013 were collected. Disease progression, cancer-specific mortality, all-cause mortality, and the likelihood of disease progression or death within a 5-year period were predicted. Extreme gradient boosting was used to develop the prediction model. Model performance was assessed by calculating the area under the receiver operating characteristic curve (AUROC), and the model was calibrated and compared with PREDICT. RESULTS: The training, test, and validation sets comprised 3276 (499 progressions, 202 breast cancer-specific deaths, and 261 all-cause deaths within 5-year follow-up), 1405 (211 progressions, 94 breast cancer-specific deaths, and 129 all-cause deaths), and 612 (109 progressions, 33 breast cancer-specific deaths, and 37 all-cause deaths) women, respectively. The AUROC values for disease progression, cancer-specific mortality, and all-cause mortality were 0.76, 0.88, and 0.82 for training set; 0.79, 0.80, and 0.83 for the test set; and 0.79, 0.84, and 0.88 for the validation set, respectively. Calibration analysis demonstrated good agreement between predicted and observed events within 5 years. Comparable AUROC and calibration results were confirmed in different age, residence status, and receptor status subgroups. Compared with PREDICT, our model showed similar AUROC and improved calibration values. CONCLUSIONS: Our prognostic model exhibits high discrimination and good calibration. It may facilitate prognosis prediction and clinical decision making for patients with breast cancer in China. JMIR Publications 2020-11-09 /pmc/articles/PMC7683252/ /pubmed/33164899 http://dx.doi.org/10.2196/19069 Text en ©Xiaorong Zhong, Ting Luo, Ling Deng, Pei Liu, Kejia Hu, Donghao Lu, Dan Zheng, Chuanxu Luo, Yuxin Xie, Jiayuan Li, Ping He, Tianjie Pu, Feng Ye, Hong Bu, Bo Fu, Hong Zheng. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 09.11.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Zhong, Xiaorong
Luo, Ting
Deng, Ling
Liu, Pei
Hu, Kejia
Lu, Donghao
Zheng, Dan
Luo, Chuanxu
Xie, Yuxin
Li, Jiayuan
He, Ping
Pu, Tianjie
Ye, Feng
Bu, Hong
Fu, Bo
Zheng, Hong
Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study
title Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study
title_full Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study
title_fullStr Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study
title_full_unstemmed Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study
title_short Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study
title_sort multidimensional machine learning personalized prognostic model in an early invasive breast cancer population-based cohort in china: algorithm validation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7683252/
https://www.ncbi.nlm.nih.gov/pubmed/33164899
http://dx.doi.org/10.2196/19069
work_keys_str_mv AT zhongxiaorong multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT luoting multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT dengling multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT liupei multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT hukejia multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT ludonghao multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT zhengdan multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT luochuanxu multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT xieyuxin multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT lijiayuan multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT heping multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT putianjie multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT yefeng multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT buhong multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT fubo multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy
AT zhenghong multidimensionalmachinelearningpersonalizedprognosticmodelinanearlyinvasivebreastcancerpopulationbasedcohortinchinaalgorithmvalidationstudy