Cargando…

Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records

This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malays...

Descripción completa

Detalles Bibliográficos
Autores principales: Hanis, Tengku Muhammad, Ruhaiyem, Nur Intan Raihana, Arifin, Wan Nor, Haron, Juhara, Wan Abdul Rahman, Wan Faiziah, Abdullah, Rosni, Musa, Kamarul Imran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689364/
https://www.ncbi.nlm.nih.gov/pubmed/36428886
http://dx.doi.org/10.3390/diagnostics12112826
_version_ 1784836514877800448
author Hanis, Tengku Muhammad
Ruhaiyem, Nur Intan Raihana
Arifin, Wan Nor
Haron, Juhara
Wan Abdul Rahman, Wan Faiziah
Abdullah, Rosni
Musa, Kamarul Imran
author_facet Hanis, Tengku Muhammad
Ruhaiyem, Nur Intan Raihana
Arifin, Wan Nor
Haron, Juhara
Wan Abdul Rahman, Wan Faiziah
Abdullah, Rosni
Musa, Kamarul Imran
author_sort Hanis, Tengku Muhammad
collection PubMed
description This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer.
format Online
Article
Text
id pubmed-9689364
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96893642022-11-25 Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records Hanis, Tengku Muhammad Ruhaiyem, Nur Intan Raihana Arifin, Wan Nor Haron, Juhara Wan Abdul Rahman, Wan Faiziah Abdullah, Rosni Musa, Kamarul Imran Diagnostics (Basel) Article This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer. MDPI 2022-11-16 /pmc/articles/PMC9689364/ /pubmed/36428886 http://dx.doi.org/10.3390/diagnostics12112826 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hanis, Tengku Muhammad
Ruhaiyem, Nur Intan Raihana
Arifin, Wan Nor
Haron, Juhara
Wan Abdul Rahman, Wan Faiziah
Abdullah, Rosni
Musa, Kamarul Imran
Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records
title Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records
title_full Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records
title_fullStr Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records
title_full_unstemmed Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records
title_short Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records
title_sort over-the-counter breast cancer classification using machine learning and patient registration records
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689364/
https://www.ncbi.nlm.nih.gov/pubmed/36428886
http://dx.doi.org/10.3390/diagnostics12112826
work_keys_str_mv AT hanistengkumuhammad overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords
AT ruhaiyemnurintanraihana overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords
AT arifinwannor overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords
AT haronjuhara overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords
AT wanabdulrahmanwanfaiziah overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords
AT abdullahrosni overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords
AT musakamarulimran overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords