Cargando…

Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer

OBJECTIVES: Survival machine learning (ML) has been suggested as a useful approach for forecasting future events, but a growing concern exists that ML models have the potential to cause racial disparities through the data used to train them. This study aims to develop race/ethnicity-specific surviva...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Jung In, Bozkurt, Selen, Park, Jong Won, Lee, Sunmin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9853120/
https://www.ncbi.nlm.nih.gov/pubmed/36653067
http://dx.doi.org/10.1136/bmjhci-2022-100666
_version_ 1784872824528175104
author Park, Jung In
Bozkurt, Selen
Park, Jong Won
Lee, Sunmin
author_facet Park, Jung In
Bozkurt, Selen
Park, Jong Won
Lee, Sunmin
author_sort Park, Jung In
collection PubMed
description OBJECTIVES: Survival machine learning (ML) has been suggested as a useful approach for forecasting future events, but a growing concern exists that ML models have the potential to cause racial disparities through the data used to train them. This study aims to develop race/ethnicity-specific survival ML models for Hispanic and black women diagnosed with breast cancer to examine whether race/ethnicity-specific ML models outperform the general models trained with all races/ethnicity data. METHODS: We used the data from the US National Cancer Institute’s Surveillance, Epidemiology and End Results programme registries. We developed the Hispanic-specific and black-specific models and compared them with the general model using the Cox proportional-hazards model, Gradient Boost Tree, survival tree and survival support vector machine. RESULTS: A total of 322 348 female patients who had breast cancer diagnoses between 1 January 2000 and 31 December 2017 were identified. The race/ethnicity-specific models for Hispanic and black women consistently outperformed the general model when predicting the outcomes of specific race/ethnicity. DISCUSSION: Accurately predicting the survival outcome of a patient is critical in determining treatment options and providing appropriate cancer care. The high-performing models developed in this study can contribute to providing individualised oncology care and improving the survival outcome of black and Hispanic women. CONCLUSION: Predicting the individualised survival outcome of breast cancer can provide the evidence necessary for determining treatment options and high-quality, patient-centred cancer care delivery for under-represented populations. Also, the race/ethnicity-specific ML models can mitigate representation bias and contribute to addressing health disparities.
format Online
Article
Text
id pubmed-9853120
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-98531202023-01-21 Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer Park, Jung In Bozkurt, Selen Park, Jong Won Lee, Sunmin BMJ Health Care Inform Original Research OBJECTIVES: Survival machine learning (ML) has been suggested as a useful approach for forecasting future events, but a growing concern exists that ML models have the potential to cause racial disparities through the data used to train them. This study aims to develop race/ethnicity-specific survival ML models for Hispanic and black women diagnosed with breast cancer to examine whether race/ethnicity-specific ML models outperform the general models trained with all races/ethnicity data. METHODS: We used the data from the US National Cancer Institute’s Surveillance, Epidemiology and End Results programme registries. We developed the Hispanic-specific and black-specific models and compared them with the general model using the Cox proportional-hazards model, Gradient Boost Tree, survival tree and survival support vector machine. RESULTS: A total of 322 348 female patients who had breast cancer diagnoses between 1 January 2000 and 31 December 2017 were identified. The race/ethnicity-specific models for Hispanic and black women consistently outperformed the general model when predicting the outcomes of specific race/ethnicity. DISCUSSION: Accurately predicting the survival outcome of a patient is critical in determining treatment options and providing appropriate cancer care. The high-performing models developed in this study can contribute to providing individualised oncology care and improving the survival outcome of black and Hispanic women. CONCLUSION: Predicting the individualised survival outcome of breast cancer can provide the evidence necessary for determining treatment options and high-quality, patient-centred cancer care delivery for under-represented populations. Also, the race/ethnicity-specific ML models can mitigate representation bias and contribute to addressing health disparities. BMJ Publishing Group 2023-01-18 /pmc/articles/PMC9853120/ /pubmed/36653067 http://dx.doi.org/10.1136/bmjhci-2022-100666 Text en © Author(s) (or their employer(s)) 2023. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Original Research
Park, Jung In
Bozkurt, Selen
Park, Jong Won
Lee, Sunmin
Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer
title Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer
title_full Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer
title_fullStr Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer
title_full_unstemmed Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer
title_short Evaluation of race/ethnicity-specific survival machine learning models for Hispanic and Black patients with breast cancer
title_sort evaluation of race/ethnicity-specific survival machine learning models for hispanic and black patients with breast cancer
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9853120/
https://www.ncbi.nlm.nih.gov/pubmed/36653067
http://dx.doi.org/10.1136/bmjhci-2022-100666
work_keys_str_mv AT parkjungin evaluationofraceethnicityspecificsurvivalmachinelearningmodelsforhispanicandblackpatientswithbreastcancer
AT bozkurtselen evaluationofraceethnicityspecificsurvivalmachinelearningmodelsforhispanicandblackpatientswithbreastcancer
AT parkjongwon evaluationofraceethnicityspecificsurvivalmachinelearningmodelsforhispanicandblackpatientswithbreastcancer
AT leesunmin evaluationofraceethnicityspecificsurvivalmachinelearningmodelsforhispanicandblackpatientswithbreastcancer