Cargando…

Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences

Breast cancer is the leading cancer in women, which accounts for millions of deaths worldwide. Early and accurate detection, prognosis, cure, and prevention of breast cancer is a major challenge to society. Hence, a precise and reliable system is vital for the classification of cancerous sequences....

Descripción completa

Detalles Bibliográficos
Autores principales: Kurian, Babymol, Jyothi, V. L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9507671/
https://www.ncbi.nlm.nih.gov/pubmed/36159773
http://dx.doi.org/10.1155/2022/7199290
_version_ 1784796888868847616
author Kurian, Babymol
Jyothi, V. L
author_facet Kurian, Babymol
Jyothi, V. L
author_sort Kurian, Babymol
collection PubMed
description Breast cancer is the leading cancer in women, which accounts for millions of deaths worldwide. Early and accurate detection, prognosis, cure, and prevention of breast cancer is a major challenge to society. Hence, a precise and reliable system is vital for the classification of cancerous sequences. Machine learning classifiers contribute much to the process of early prediction and diagnosis of cancer. In this paper, a comparative study of four machine learning classifiers such as random forest, decision tree, AdaBoost, and gradient boosting is implemented for the classification of a benign and malignant tumor. To derive the most efficient machine learning model, NCBI datasets are utilized. Performance evaluation is conducted, and all four classifiers are compared based on the results. The aim of the work is to derive the most efficient machine-learning model for the diagnosis of breast cancer. It was observed that gradient boosting outperformed all other models and achieved a classification accuracy of 95.82%.
format Online
Article
Text
id pubmed-9507671
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-95076712022-09-24 Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences Kurian, Babymol Jyothi, V. L J Environ Public Health Research Article Breast cancer is the leading cancer in women, which accounts for millions of deaths worldwide. Early and accurate detection, prognosis, cure, and prevention of breast cancer is a major challenge to society. Hence, a precise and reliable system is vital for the classification of cancerous sequences. Machine learning classifiers contribute much to the process of early prediction and diagnosis of cancer. In this paper, a comparative study of four machine learning classifiers such as random forest, decision tree, AdaBoost, and gradient boosting is implemented for the classification of a benign and malignant tumor. To derive the most efficient machine learning model, NCBI datasets are utilized. Performance evaluation is conducted, and all four classifiers are compared based on the results. The aim of the work is to derive the most efficient machine-learning model for the diagnosis of breast cancer. It was observed that gradient boosting outperformed all other models and achieved a classification accuracy of 95.82%. Hindawi 2022-09-16 /pmc/articles/PMC9507671/ /pubmed/36159773 http://dx.doi.org/10.1155/2022/7199290 Text en Copyright © 2022 Babymol Kurian and V.L Jyothi. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kurian, Babymol
Jyothi, V. L
Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences
title Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences
title_full Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences
title_fullStr Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences
title_full_unstemmed Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences
title_short Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences
title_sort comparative analysis of machine learning methods for breast cancer classification in genetic sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9507671/
https://www.ncbi.nlm.nih.gov/pubmed/36159773
http://dx.doi.org/10.1155/2022/7199290
work_keys_str_mv AT kurianbabymol comparativeanalysisofmachinelearningmethodsforbreastcancerclassificationingeneticsequences
AT jyothivl comparativeanalysisofmachinelearningmethodsforbreastcancerclassificationingeneticsequences