Cargando…

A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers

Background and Objective: Primary lung cancer is a lethal and rapidly-developing cancer type and is one of the most leading causes of cancer deaths. Materials and Methods: Statistical methods such as Cox regression are usually used to detect the prognosis factors of a disease. This study investigate...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yueying, Liu, Shuai, Wang, Zhao, Fan, Yusi, Huang, Jingxuan, Huang, Lan, Li, Zhijun, Li, Xinwei, Jin, Mengdi, Yu, Qiong, Zhou, Fengfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7911834/
https://www.ncbi.nlm.nih.gov/pubmed/33499377
http://dx.doi.org/10.3390/medicina57020099
_version_ 1783656435081543680
author Wang, Yueying
Liu, Shuai
Wang, Zhao
Fan, Yusi
Huang, Jingxuan
Huang, Lan
Li, Zhijun
Li, Xinwei
Jin, Mengdi
Yu, Qiong
Zhou, Fengfeng
author_facet Wang, Yueying
Liu, Shuai
Wang, Zhao
Fan, Yusi
Huang, Jingxuan
Huang, Lan
Li, Zhijun
Li, Xinwei
Jin, Mengdi
Yu, Qiong
Zhou, Fengfeng
author_sort Wang, Yueying
collection PubMed
description Background and Objective: Primary lung cancer is a lethal and rapidly-developing cancer type and is one of the most leading causes of cancer deaths. Materials and Methods: Statistical methods such as Cox regression are usually used to detect the prognosis factors of a disease. This study investigated survival prediction using machine learning algorithms. The clinical data of 28,458 patients with primary lung cancers were collected from the Surveillance, Epidemiology, and End Results (SEER) database. Results: This study indicated that the survival rate of women with primary lung cancer was often higher than that of men (p < 0.001). Seven popular machine learning algorithms were utilized to evaluate one-year, three-year, and five-year survival prediction The two classifiers extreme gradient boosting (XGB) and logistic regression (LR) achieved the best prediction accuracies. The importance variable of the trained XGB models suggested that surgical removal (feature “Surgery”) made the largest contribution to the one-year survival prediction models, while the metastatic status (feature “N” stage) of the regional lymph nodes was the most important contributor to three-year and five-year survival prediction. The female patients’ three-year prognosis model achieved a prediction accuracy of 0.8297 on the independent future samples, while the male model only achieved the accuracy 0.7329. Conclusions: This data suggested that male patients may have more complicated factors in lung cancer than females, and it is necessary to develop gender-specific diagnosis and prognosis models.
format Online
Article
Text
id pubmed-7911834
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79118342021-02-28 A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers Wang, Yueying Liu, Shuai Wang, Zhao Fan, Yusi Huang, Jingxuan Huang, Lan Li, Zhijun Li, Xinwei Jin, Mengdi Yu, Qiong Zhou, Fengfeng Medicina (Kaunas) Article Background and Objective: Primary lung cancer is a lethal and rapidly-developing cancer type and is one of the most leading causes of cancer deaths. Materials and Methods: Statistical methods such as Cox regression are usually used to detect the prognosis factors of a disease. This study investigated survival prediction using machine learning algorithms. The clinical data of 28,458 patients with primary lung cancers were collected from the Surveillance, Epidemiology, and End Results (SEER) database. Results: This study indicated that the survival rate of women with primary lung cancer was often higher than that of men (p < 0.001). Seven popular machine learning algorithms were utilized to evaluate one-year, three-year, and five-year survival prediction The two classifiers extreme gradient boosting (XGB) and logistic regression (LR) achieved the best prediction accuracies. The importance variable of the trained XGB models suggested that surgical removal (feature “Surgery”) made the largest contribution to the one-year survival prediction models, while the metastatic status (feature “N” stage) of the regional lymph nodes was the most important contributor to three-year and five-year survival prediction. The female patients’ three-year prognosis model achieved a prediction accuracy of 0.8297 on the independent future samples, while the male model only achieved the accuracy 0.7329. Conclusions: This data suggested that male patients may have more complicated factors in lung cancer than females, and it is necessary to develop gender-specific diagnosis and prognosis models. MDPI 2021-01-22 /pmc/articles/PMC7911834/ /pubmed/33499377 http://dx.doi.org/10.3390/medicina57020099 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Yueying
Liu, Shuai
Wang, Zhao
Fan, Yusi
Huang, Jingxuan
Huang, Lan
Li, Zhijun
Li, Xinwei
Jin, Mengdi
Yu, Qiong
Zhou, Fengfeng
A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers
title A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers
title_full A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers
title_fullStr A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers
title_full_unstemmed A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers
title_short A Machine Learning-Based Investigation of Gender-Specific Prognosis of Lung Cancers
title_sort machine learning-based investigation of gender-specific prognosis of lung cancers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7911834/
https://www.ncbi.nlm.nih.gov/pubmed/33499377
http://dx.doi.org/10.3390/medicina57020099
work_keys_str_mv AT wangyueying amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT liushuai amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT wangzhao amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT fanyusi amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT huangjingxuan amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT huanglan amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT lizhijun amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT lixinwei amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT jinmengdi amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT yuqiong amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT zhoufengfeng amachinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT wangyueying machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT liushuai machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT wangzhao machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT fanyusi machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT huangjingxuan machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT huanglan machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT lizhijun machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT lixinwei machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT jinmengdi machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT yuqiong machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers
AT zhoufengfeng machinelearningbasedinvestigationofgenderspecificprognosisoflungcancers