Cargando…

Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery

BACKGROUND: Data mining, a new concept introduced in the mid-1990s, can help researchers to gain new, profound insights and facilitate access to unanticipated knowledge sources in biomedical datasets. Many issues in the medical field are concerned with the diagnosis of diseases based on tests conduc...

Descripción completa

Detalles Bibliográficos
Autores principales: Hachesu, Peyman Rezaei, Moftian, Nazila, Dehghani, Mahsa, Soltani, Taha Samad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: West Asia Organization for Cancer Prevention 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6373791/
https://www.ncbi.nlm.nih.gov/pubmed/28669163
http://dx.doi.org/10.22034/APJCP.2017.18.6.1531
_version_ 1783395044073406464
author Hachesu, Peyman Rezaei
Moftian, Nazila
Dehghani, Mahsa
Soltani, Taha Samad
author_facet Hachesu, Peyman Rezaei
Moftian, Nazila
Dehghani, Mahsa
Soltani, Taha Samad
author_sort Hachesu, Peyman Rezaei
collection PubMed
description BACKGROUND: Data mining, a new concept introduced in the mid-1990s, can help researchers to gain new, profound insights and facilitate access to unanticipated knowledge sources in biomedical datasets. Many issues in the medical field are concerned with the diagnosis of diseases based on tests conducted on individuals at risk. Early diagnosis and treatment can provide a better outcome regarding the survival of lung cancer patients. Researchers can use data mining techniques to create effective diagnostic models. The aim of this study was to evaluate patterns existing in risk factor data of for mortality one year after thoracic surgery for lung cancer. METHODS: The dataset used in this study contained 470 records and 17 features. First, the most important variables involved in the incidence of lung cancer were extracted using knowledge discovery and datamining algorithms such as naive Bayes, maximum expectation and then, using a regression analysis algorithm, a questionnaire was developed to predict the risk of death one year after lung surgery. Outliers in the data were excluded and reported using the clustering algorithm. Finally, a calculator was designed to estimate the risk for one-year post-operative mortality based on a scorecard algorithm. RESULTS: The results revealed the most important factor involved in increased mortality to be large tumor size. Roles for type II diabetes and preoperative dyspnea in lower survival were also identified. The greatest commonality in classification of patients was Forced expiratory volume in first second (FEV1), based on levels of which patients could be classified into different categories. CONCLUSION: Development of a questionnaire based on calculations to diagnose disease can be used to identify and fill knowledge gaps in clinical practice guidelines.
format Online
Article
Text
id pubmed-6373791
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher West Asia Organization for Cancer Prevention
record_format MEDLINE/PubMed
spelling pubmed-63737912019-03-19 Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery Hachesu, Peyman Rezaei Moftian, Nazila Dehghani, Mahsa Soltani, Taha Samad Asian Pac J Cancer Prev Research Article BACKGROUND: Data mining, a new concept introduced in the mid-1990s, can help researchers to gain new, profound insights and facilitate access to unanticipated knowledge sources in biomedical datasets. Many issues in the medical field are concerned with the diagnosis of diseases based on tests conducted on individuals at risk. Early diagnosis and treatment can provide a better outcome regarding the survival of lung cancer patients. Researchers can use data mining techniques to create effective diagnostic models. The aim of this study was to evaluate patterns existing in risk factor data of for mortality one year after thoracic surgery for lung cancer. METHODS: The dataset used in this study contained 470 records and 17 features. First, the most important variables involved in the incidence of lung cancer were extracted using knowledge discovery and datamining algorithms such as naive Bayes, maximum expectation and then, using a regression analysis algorithm, a questionnaire was developed to predict the risk of death one year after lung surgery. Outliers in the data were excluded and reported using the clustering algorithm. Finally, a calculator was designed to estimate the risk for one-year post-operative mortality based on a scorecard algorithm. RESULTS: The results revealed the most important factor involved in increased mortality to be large tumor size. Roles for type II diabetes and preoperative dyspnea in lower survival were also identified. The greatest commonality in classification of patients was Forced expiratory volume in first second (FEV1), based on levels of which patients could be classified into different categories. CONCLUSION: Development of a questionnaire based on calculations to diagnose disease can be used to identify and fill knowledge gaps in clinical practice guidelines. West Asia Organization for Cancer Prevention 2017 /pmc/articles/PMC6373791/ /pubmed/28669163 http://dx.doi.org/10.22034/APJCP.2017.18.6.1531 Text en Copyright: © Asian Pacific Journal of Cancer Prevention http://creativecommons.org/licenses/BY-SA/4.0 This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
spellingShingle Research Article
Hachesu, Peyman Rezaei
Moftian, Nazila
Dehghani, Mahsa
Soltani, Taha Samad
Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery
title Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery
title_full Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery
title_fullStr Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery
title_full_unstemmed Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery
title_short Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery
title_sort analyzing a lung cancer patient dataset with the focus on predicting survival rate one year after thoracic surgery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6373791/
https://www.ncbi.nlm.nih.gov/pubmed/28669163
http://dx.doi.org/10.22034/APJCP.2017.18.6.1531
work_keys_str_mv AT hachesupeymanrezaei analyzingalungcancerpatientdatasetwiththefocusonpredictingsurvivalrateoneyearafterthoracicsurgery
AT moftiannazila analyzingalungcancerpatientdatasetwiththefocusonpredictingsurvivalrateoneyearafterthoracicsurgery
AT dehghanimahsa analyzingalungcancerpatientdatasetwiththefocusonpredictingsurvivalrateoneyearafterthoracicsurgery
AT soltanitahasamad analyzingalungcancerpatientdatasetwiththefocusonpredictingsurvivalrateoneyearafterthoracicsurgery