Cargando…

Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications

Using a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Chandrashekar, K, Setlur, Anagha S, Sabhapathi C, Adithya, Raiker, Satyam Suresh, Singh, Satyam, Niranjan, Vidya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9880585/
https://www.ncbi.nlm.nih.gov/pubmed/36714384
http://dx.doi.org/10.1177/11769351221147244
_version_ 1784878947031318528
author Chandrashekar, K
Setlur, Anagha S
Sabhapathi C, Adithya
Raiker, Satyam Suresh
Singh, Satyam
Niranjan, Vidya
author_facet Chandrashekar, K
Setlur, Anagha S
Sabhapathi C, Adithya
Raiker, Satyam Suresh
Singh, Satyam
Niranjan, Vidya
author_sort Chandrashekar, K
collection PubMed
description Using a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model that can predict accurately for 5 different cancer types comprising of 20 cancer exomes, using the mutations identified from whole exome cancer analysis. Initially, a basic model was designed using supervised machine learning classification algorithms such as K-nearest neighbor (KNN), support vector machine (SVM), decision tree, naïve bayes and random forest (RF), among which decision tree and random forest performed better in terms of preliminary model accuracy. However, output predictions were incorrect due to less training scores. Thus, 16 essential features were then selected for model improvement using 2 approaches. All imbalanced datasets were balanced using SMOTE. In the first approach, all features from 20 cancer exome datasets were trained and models were designed using decision tree and random forest. Balanced datasets for decision tree model showed an accuracy of 77%, while with the RF model, the accuracy improved to 82% where all 5 cancer types were predicted correctly. Area under the curve for RF model was closer to 1, than decision tree model. In the second approach, all 15 datasets were trained, while 5 were tested. However, only 2 cancer types were predicted correctly. To cross validate RF model, Matthew’s correlation co-efficient (MCC) test was performed. For method 1, the MCC test and MCC cross validation was found to be 0.7796 and 0.9356 respectively. Likewise, for second approach, MCC was observed to be 0.9365, corroborating the accuracy of the designed model. The model was successfully deployed using Streamlit as a web application for easy use. This study presents insights for allowing easy cancer classifications.
format Online
Article
Text
id pubmed-9880585
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-98805852023-01-28 Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications Chandrashekar, K Setlur, Anagha S Sabhapathi C, Adithya Raiker, Satyam Suresh Singh, Satyam Niranjan, Vidya Cancer Inform Original Research Using a decision support system (DSS) that classifies various cancers provides support to the clinicians/researchers to make better decisions that can aid in early cancer diagnosis, thereby reducing chances of incorrect disease diagnosis. Thus, this work aimed at designing a classification model that can predict accurately for 5 different cancer types comprising of 20 cancer exomes, using the mutations identified from whole exome cancer analysis. Initially, a basic model was designed using supervised machine learning classification algorithms such as K-nearest neighbor (KNN), support vector machine (SVM), decision tree, naïve bayes and random forest (RF), among which decision tree and random forest performed better in terms of preliminary model accuracy. However, output predictions were incorrect due to less training scores. Thus, 16 essential features were then selected for model improvement using 2 approaches. All imbalanced datasets were balanced using SMOTE. In the first approach, all features from 20 cancer exome datasets were trained and models were designed using decision tree and random forest. Balanced datasets for decision tree model showed an accuracy of 77%, while with the RF model, the accuracy improved to 82% where all 5 cancer types were predicted correctly. Area under the curve for RF model was closer to 1, than decision tree model. In the second approach, all 15 datasets were trained, while 5 were tested. However, only 2 cancer types were predicted correctly. To cross validate RF model, Matthew’s correlation co-efficient (MCC) test was performed. For method 1, the MCC test and MCC cross validation was found to be 0.7796 and 0.9356 respectively. Likewise, for second approach, MCC was observed to be 0.9365, corroborating the accuracy of the designed model. The model was successfully deployed using Streamlit as a web application for easy use. This study presents insights for allowing easy cancer classifications. SAGE Publications 2023-01-23 /pmc/articles/PMC9880585/ /pubmed/36714384 http://dx.doi.org/10.1177/11769351221147244 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Chandrashekar, K
Setlur, Anagha S
Sabhapathi C, Adithya
Raiker, Satyam Suresh
Singh, Satyam
Niranjan, Vidya
Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_full Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_fullStr Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_full_unstemmed Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_short Decision Support System and Web-Application Using Supervised Machine Learning Algorithms for Easy Cancer Classifications
title_sort decision support system and web-application using supervised machine learning algorithms for easy cancer classifications
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9880585/
https://www.ncbi.nlm.nih.gov/pubmed/36714384
http://dx.doi.org/10.1177/11769351221147244
work_keys_str_mv AT chandrashekark decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT setluranaghas decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT sabhapathicadithya decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT raikersatyamsuresh decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT singhsatyam decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications
AT niranjanvidya decisionsupportsystemandwebapplicationusingsupervisedmachinelearningalgorithmsforeasycancerclassifications