Cargando…
Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning
BACKGROUND: Integrating additional factors into the TNM staging system is needed for more accurate risk classification and survival prediction for patients with cutaneous melanoma. In the present study, we introduce machine learning as a novel tool that incorporates additional prognostic factors to...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8483349/ https://www.ncbi.nlm.nih.gov/pubmed/34591891 http://dx.doi.org/10.1371/journal.pone.0257949 |
_version_ | 1784577103817080832 |
---|---|
author | Yang, Charles Q. Wang, Huan Liu, Zhenqiu Hueman, Matthew T. Bhaskaran, Aadya Henson, Donald E. Sheng, Li Chen, Dechang |
author_facet | Yang, Charles Q. Wang, Huan Liu, Zhenqiu Hueman, Matthew T. Bhaskaran, Aadya Henson, Donald E. Sheng, Li Chen, Dechang |
author_sort | Yang, Charles Q. |
collection | PubMed |
description | BACKGROUND: Integrating additional factors into the TNM staging system is needed for more accurate risk classification and survival prediction for patients with cutaneous melanoma. In the present study, we introduce machine learning as a novel tool that incorporates additional prognostic factors to improve the current TNM staging system. METHODS AND FINDINGS: Cancer-specific survival data for cutaneous melanoma with at least a 5 years follow-up were extracted from the Surveillance, Epidemiology, and End Results Program of the National Cancer Institute and split into the training set (40,781 cases) and validation set (5,390 cases). Five factors were studied: the primary tumor (T), regional lymph nodes (N), distant metastasis (M), age (A), and sex (S). The Ensemble Algorithm for Clustering Cancer Data (EACCD) was applied to the training set to generate prognostic groups. Utilizing only T, N, and M, a basic prognostic system was built where patients were stratified into 10 prognostic groups with well-separated survival curves, similar to 10 AJCC stages. These 10 groups had a significantly higher accuracy in survival prediction than 10 stages (C-index = 0.7682 vs 0.7643; increase in C-index = 0.0039, 95% CI = (0.0032, 0.0047); p-value = 7.2×10(−23)). Nevertheless, a positive association remained between the EACCD grouping and the AJCC staging (Spearman’s rank correlation coefficient = 0.8316; p-value = 4.5×10(−13)). With additional information from A and S, a more advanced prognostic system was established using the training data that stratified patients into 10 groups and further improved the prediction accuracy (C-index = 0.7865 vs 0.7643; increase in C-index = 0.0222, 95% CI = (0.0191, 0.0254); p-value = 8.8×10(−43)). Both internal validation using the training set and temporal validation using the validation set showed good stratification and a high predictive accuracy of the prognostic systems. CONCLUSIONS: The EACCD allows additional factors to be integrated into the TNM to create a prognostic system that improves patient stratification and survival prediction for cutaneous melanoma. This integration separates favorable from unfavorable clinical outcomes for patients and improves both cohort selection for clinical trials and treatment management. |
format | Online Article Text |
id | pubmed-8483349 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-84833492021-10-01 Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning Yang, Charles Q. Wang, Huan Liu, Zhenqiu Hueman, Matthew T. Bhaskaran, Aadya Henson, Donald E. Sheng, Li Chen, Dechang PLoS One Research Article BACKGROUND: Integrating additional factors into the TNM staging system is needed for more accurate risk classification and survival prediction for patients with cutaneous melanoma. In the present study, we introduce machine learning as a novel tool that incorporates additional prognostic factors to improve the current TNM staging system. METHODS AND FINDINGS: Cancer-specific survival data for cutaneous melanoma with at least a 5 years follow-up were extracted from the Surveillance, Epidemiology, and End Results Program of the National Cancer Institute and split into the training set (40,781 cases) and validation set (5,390 cases). Five factors were studied: the primary tumor (T), regional lymph nodes (N), distant metastasis (M), age (A), and sex (S). The Ensemble Algorithm for Clustering Cancer Data (EACCD) was applied to the training set to generate prognostic groups. Utilizing only T, N, and M, a basic prognostic system was built where patients were stratified into 10 prognostic groups with well-separated survival curves, similar to 10 AJCC stages. These 10 groups had a significantly higher accuracy in survival prediction than 10 stages (C-index = 0.7682 vs 0.7643; increase in C-index = 0.0039, 95% CI = (0.0032, 0.0047); p-value = 7.2×10(−23)). Nevertheless, a positive association remained between the EACCD grouping and the AJCC staging (Spearman’s rank correlation coefficient = 0.8316; p-value = 4.5×10(−13)). With additional information from A and S, a more advanced prognostic system was established using the training data that stratified patients into 10 groups and further improved the prediction accuracy (C-index = 0.7865 vs 0.7643; increase in C-index = 0.0222, 95% CI = (0.0191, 0.0254); p-value = 8.8×10(−43)). Both internal validation using the training set and temporal validation using the validation set showed good stratification and a high predictive accuracy of the prognostic systems. CONCLUSIONS: The EACCD allows additional factors to be integrated into the TNM to create a prognostic system that improves patient stratification and survival prediction for cutaneous melanoma. This integration separates favorable from unfavorable clinical outcomes for patients and improves both cohort selection for clinical trials and treatment management. Public Library of Science 2021-09-30 /pmc/articles/PMC8483349/ /pubmed/34591891 http://dx.doi.org/10.1371/journal.pone.0257949 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Yang, Charles Q. Wang, Huan Liu, Zhenqiu Hueman, Matthew T. Bhaskaran, Aadya Henson, Donald E. Sheng, Li Chen, Dechang Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning |
title | Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning |
title_full | Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning |
title_fullStr | Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning |
title_full_unstemmed | Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning |
title_short | Integrating additional factors into the TNM staging for cutaneous melanoma by machine learning |
title_sort | integrating additional factors into the tnm staging for cutaneous melanoma by machine learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8483349/ https://www.ncbi.nlm.nih.gov/pubmed/34591891 http://dx.doi.org/10.1371/journal.pone.0257949 |
work_keys_str_mv | AT yangcharlesq integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning AT wanghuan integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning AT liuzhenqiu integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning AT huemanmatthewt integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning AT bhaskaranaadya integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning AT hensondonalde integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning AT shengli integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning AT chendechang integratingadditionalfactorsintothetnmstagingforcutaneousmelanomabymachinelearning |