Cargando…
A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers
Background and Objective: Primary lung cancer is a lethal and rapidly-developing cancer type and is one of the most leading causes of cancer deaths. Materials and Methods: Statistical methods such as Cox regression are usually used to detect the prognosis factors of a disease. This study investigate...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7911834/ https://www.ncbi.nlm.nih.gov/pubmed/33499377 http://dx.doi.org/10.3390/medicina57020099 |
_version_ | 1783656435081543680 |
---|---|
author | Wang, Yueying Liu, Shuai Wang, Zhao Fan, Yusi Huang, Jingxuan Huang, Lan Li, Zhijun Li, Xinwei Jin, Mengdi Yu, Qiong Zhou, Fengfeng |
author_facet | Wang, Yueying Liu, Shuai Wang, Zhao Fan, Yusi Huang, Jingxuan Huang, Lan Li, Zhijun Li, Xinwei Jin, Mengdi Yu, Qiong Zhou, Fengfeng |
author_sort | Wang, Yueying |
collection | PubMed |
description | Background and Objective: Primary lung cancer is a lethal and rapidly-developing cancer type and is one of the most leading causes of cancer deaths. Materials and Methods: Statistical methods such as Cox regression are usually used to detect the prognosis factors of a disease. This study investigated survival prediction using machine learning algorithms. The clinical data of 28,458 patients with primary lung cancers were collected from the Surveillance, Epidemiology, and End Results (SEER) database. Results: This study indicated that the survival rate of women with primary lung cancer was often higher than that of men (p < 0.001). Seven popular machine learning algorithms were utilized to evaluate one-year, three-year, and five-year survival prediction The two classifiers extreme gradient boosting (XGB) and logistic regression (LR) achieved the best prediction accuracies. The importance variable of the trained XGB models suggested that surgical removal (feature “Surgery”) made the largest contribution to the one-year survival prediction models, while the metastatic status (feature “N” stage) of the regional lymph nodes was the most important contributor to three-year and five-year survival prediction. The female patients’ three-year prognosis model achieved a prediction accuracy of 0.8297 on the independent future samples, while the male model only achieved the accuracy 0.7329. Conclusions: This data suggested that male patients may have more complicated factors in lung cancer than females, and it is necessary to develop gender-specific diagnosis and prognosis models. |
format | Online Article Text |
id | pubmed-7911834 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-79118342021-02-28 A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers Wang, Yueying Liu, Shuai Wang, Zhao Fan, Yusi Huang, Jingxuan Huang, Lan Li, Zhijun Li, Xinwei Jin, Mengdi Yu, Qiong Zhou, Fengfeng Medicina (Kaunas) Article Background and Objective: Primary lung cancer is a lethal and rapidly-developing cancer type and is one of the most leading causes of cancer deaths. Materials and Methods: Statistical methods such as Cox regression are usually used to detect the prognosis factors of a disease. This study investigated survival prediction using machine learning algorithms. The clinical data of 28,458 patients with primary lung cancers were collected from the Surveillance, Epidemiology, and End Results (SEER) database. Results: This study indicated that the survival rate of women with primary lung cancer was often higher than that of men (p < 0.001). Seven popular machine learning algorithms were utilized to evaluate one-year, three-year, and five-year survival prediction The two classifiers extreme gradient boosting (XGB) and logistic regression (LR) achieved the best prediction accuracies. The importance variable of the trained XGB models suggested that surgical removal (feature “Surgery”) made the largest contribution to the one-year survival prediction models, while the metastatic status (feature “N” stage) of the regional lymph nodes was the most important contributor to three-year and five-year survival prediction. The female patients’ three-year prognosis model achieved a prediction accuracy of 0.8297 on the independent future samples, while the male model only achieved the accuracy 0.7329. Conclusions: This data suggested that male patients may have more complicated factors in lung cancer than females, and it is necessary to develop gender-specific diagnosis and prognosis models. MDPI 2021-01-22 /pmc/articles/PMC7911834/ /pubmed/33499377 http://dx.doi.org/10.3390/medicina57020099 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Yueying Liu, Shuai Wang, Zhao Fan, Yusi Huang, Jingxuan Huang, Lan Li, Zhijun Li, Xinwei Jin, Mengdi Yu, Qiong Zhou, Fengfeng A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers |
title | A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers |
title_full | A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers |
title_fullStr | A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers |
title_full_unstemmed | A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers |
title_short | A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers |
title_sort | machine learning-based investigation of gender-specific prognosis of lung cancers |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7911834/ https://www.ncbi.nlm.nih.gov/pubmed/33499377 http://dx.doi.org/10.3390/medicina57020099 |
work_keys_str_mv | AT wangyueying amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT liushuai amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT wangzhao amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT fanyusi amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT huangjingxuan amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT huanglan amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT lizhijun amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT lixinwei amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT jinmengdi amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT yuqiong amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT zhoufengfeng amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT wangyueying machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT liushuai machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT wangzhao machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT fanyusi machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT huangjingxuan machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT huanglan machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT lizhijun machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT lixinwei machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT jinmengdi machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT yuqiong machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers AT zhoufengfeng machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers |