Cargando…

Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models

This paper presents the research results on the contribution of user-centered data mining based on the standard principles, focusing on the analysis of survival and mortality of lung cancer cases. Researchers used anonymized data from previously diagnosed instances in the health database to predict...

Descripción completa

Detalles Bibliográficos
Autores principales: Saleh Ibrahim, Yousif, Muhammed, Yasser, Al-Douri, Asaad T., Faisal, Muhammad Shahzad, Mohamad, Abdulsattar Abdullah H., Al-Husban, Abdallah, Birhan, Mequanint
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9173921/
https://www.ncbi.nlm.nih.gov/pubmed/35685154
http://dx.doi.org/10.1155/2022/6058213
_version_ 1784722124221448192
author Saleh Ibrahim, Yousif
Muhammed, Yasser
Al-Douri, Asaad T.
Faisal, Muhammad Shahzad
Mohamad, Abdulsattar Abdullah H.
Al-Husban, Abdallah
Birhan, Mequanint
author_facet Saleh Ibrahim, Yousif
Muhammed, Yasser
Al-Douri, Asaad T.
Faisal, Muhammad Shahzad
Mohamad, Abdulsattar Abdullah H.
Al-Husban, Abdallah
Birhan, Mequanint
author_sort Saleh Ibrahim, Yousif
collection PubMed
description This paper presents the research results on the contribution of user-centered data mining based on the standard principles, focusing on the analysis of survival and mortality of lung cancer cases. Researchers used anonymized data from previously diagnosed instances in the health database to predict the condition of new patients who have not had their results yet. Medical professionals specializing in this field provided feedback on the usefulness of the new software, which was constructed using WEKA data mining tools and the Naive Bayes method. The results of this article provide elements of interest to discuss the value of identifying or discovering relationships in apparently “hidden” information to propose strategies to counteract health problems or prevent future complications and thus contribute to improving the quality of care. Life of the population, as would be the case of data mining in the health area, has shown applicability in the early detection and prevention of diseases for the analysis of genetic markers to determine the probability of a satisfactory response to medical treatment, and the most accurate model was Naive Bayes (91.1%). The Naive Bayes algorithm's closest competitor, bagging, came in second with 90.8%. The analysis found that the ZeroR algorithm had the lowest success rate at 80%.
format Online
Article
Text
id pubmed-9173921
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-91739212022-06-08 Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models Saleh Ibrahim, Yousif Muhammed, Yasser Al-Douri, Asaad T. Faisal, Muhammad Shahzad Mohamad, Abdulsattar Abdullah H. Al-Husban, Abdallah Birhan, Mequanint Comput Intell Neurosci Research Article This paper presents the research results on the contribution of user-centered data mining based on the standard principles, focusing on the analysis of survival and mortality of lung cancer cases. Researchers used anonymized data from previously diagnosed instances in the health database to predict the condition of new patients who have not had their results yet. Medical professionals specializing in this field provided feedback on the usefulness of the new software, which was constructed using WEKA data mining tools and the Naive Bayes method. The results of this article provide elements of interest to discuss the value of identifying or discovering relationships in apparently “hidden” information to propose strategies to counteract health problems or prevent future complications and thus contribute to improving the quality of care. Life of the population, as would be the case of data mining in the health area, has shown applicability in the early detection and prevention of diseases for the analysis of genetic markers to determine the probability of a satisfactory response to medical treatment, and the most accurate model was Naive Bayes (91.1%). The Naive Bayes algorithm's closest competitor, bagging, came in second with 90.8%. The analysis found that the ZeroR algorithm had the lowest success rate at 80%. Hindawi 2022-05-31 /pmc/articles/PMC9173921/ /pubmed/35685154 http://dx.doi.org/10.1155/2022/6058213 Text en Copyright © 2022 Yousif Saleh Ibrahim et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Saleh Ibrahim, Yousif
Muhammed, Yasser
Al-Douri, Asaad T.
Faisal, Muhammad Shahzad
Mohamad, Abdulsattar Abdullah H.
Al-Husban, Abdallah
Birhan, Mequanint
Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models
title Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models
title_full Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models
title_fullStr Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models
title_full_unstemmed Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models
title_short Discovery of Knowledge in the Incidence of a Type of Lung Cancer for Patients through Data Mining Models
title_sort discovery of knowledge in the incidence of a type of lung cancer for patients through data mining models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9173921/
https://www.ncbi.nlm.nih.gov/pubmed/35685154
http://dx.doi.org/10.1155/2022/6058213
work_keys_str_mv AT salehibrahimyousif discoveryofknowledgeintheincidenceofatypeoflungcancerforpatientsthroughdataminingmodels
AT muhammedyasser discoveryofknowledgeintheincidenceofatypeoflungcancerforpatientsthroughdataminingmodels
AT aldouriasaadt discoveryofknowledgeintheincidenceofatypeoflungcancerforpatientsthroughdataminingmodels
AT faisalmuhammadshahzad discoveryofknowledgeintheincidenceofatypeoflungcancerforpatientsthroughdataminingmodels
AT mohamadabdulsattarabdullahh discoveryofknowledgeintheincidenceofatypeoflungcancerforpatientsthroughdataminingmodels
AT alhusbanabdallah discoveryofknowledgeintheincidenceofatypeoflungcancerforpatientsthroughdataminingmodels
AT birhanmequanint discoveryofknowledgeintheincidenceofatypeoflungcancerforpatientsthroughdataminingmodels