Cargando…

Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors

Due to the high effectiveness of cancer screening and therapies, the diagnosis of second primary cancers (SPCs) has increased in women with breast cancer. The present study was conducted to develop a novel machine learning–based classification scheme for predicting the risk factors of SPCs in breast...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Chi-Chang, Chen, Ssu-Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6759630/
https://www.ncbi.nlm.nih.gov/pubmed/31620166
http://dx.doi.org/10.3389/fgene.2019.00848
_version_ 1783453731539386368
author Chang, Chi-Chang
Chen, Ssu-Han
author_facet Chang, Chi-Chang
Chen, Ssu-Han
author_sort Chang, Chi-Chang
collection PubMed
description Due to the high effectiveness of cancer screening and therapies, the diagnosis of second primary cancers (SPCs) has increased in women with breast cancer. The present study was conducted to develop a novel machine learning–based classification scheme for predicting the risk factors of SPCs in breast cancer survivors. The proposed scheme was based on the XGBoost classifier with the following four comparable strategies: transformation, resampling, clustering, and ensemble learning, to improve the training balanced accuracy. Results suggested that the best prediction accuracy for an empirical case is the XGBoost associated with the strategies of resampling and clustering. The experimental results showed that age, sequence of radiotherapy and surgery, surgical margins of the primary site, human epidermal growth factor, high-dose clinical target volume, and estrogen receptors are relatively more important risk factors associated with SPCs in patients with breast cancer. These risk factors should be monitored for the early detection of breast cancer. In conclusion, the proposed scheme can support the important influence of personality and clinical symptom representations in all phases of the primary treatment trajectory. Our results further suggested that adaptive machine learning techniques require the incorporation of significant variables for optimal predictions.
format Online
Article
Text
id pubmed-6759630
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67596302019-10-16 Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors Chang, Chi-Chang Chen, Ssu-Han Front Genet Genetics Due to the high effectiveness of cancer screening and therapies, the diagnosis of second primary cancers (SPCs) has increased in women with breast cancer. The present study was conducted to develop a novel machine learning–based classification scheme for predicting the risk factors of SPCs in breast cancer survivors. The proposed scheme was based on the XGBoost classifier with the following four comparable strategies: transformation, resampling, clustering, and ensemble learning, to improve the training balanced accuracy. Results suggested that the best prediction accuracy for an empirical case is the XGBoost associated with the strategies of resampling and clustering. The experimental results showed that age, sequence of radiotherapy and surgery, surgical margins of the primary site, human epidermal growth factor, high-dose clinical target volume, and estrogen receptors are relatively more important risk factors associated with SPCs in patients with breast cancer. These risk factors should be monitored for the early detection of breast cancer. In conclusion, the proposed scheme can support the important influence of personality and clinical symptom representations in all phases of the primary treatment trajectory. Our results further suggested that adaptive machine learning techniques require the incorporation of significant variables for optimal predictions. Frontiers Media S.A. 2019-09-18 /pmc/articles/PMC6759630/ /pubmed/31620166 http://dx.doi.org/10.3389/fgene.2019.00848 Text en Copyright © 2019 Chang and Chen http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Chang, Chi-Chang
Chen, Ssu-Han
Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors
title Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors
title_full Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors
title_fullStr Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors
title_full_unstemmed Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors
title_short Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors
title_sort developing a novel machine learning-based classification scheme for predicting spcs in breast cancer survivors
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6759630/
https://www.ncbi.nlm.nih.gov/pubmed/31620166
http://dx.doi.org/10.3389/fgene.2019.00848
work_keys_str_mv AT changchichang developinganovelmachinelearningbasedclassificationschemeforpredictingspcsinbreastcancersurvivors
AT chenssuhan developinganovelmachinelearningbasedclassificationschemeforpredictingspcsinbreastcancersurvivors