Cargando…
Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records
This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malays...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689364/ https://www.ncbi.nlm.nih.gov/pubmed/36428886 http://dx.doi.org/10.3390/diagnostics12112826 |
_version_ | 1784836514877800448 |
---|---|
author | Hanis, Tengku Muhammad Ruhaiyem, Nur Intan Raihana Arifin, Wan Nor Haron, Juhara Wan Abdul Rahman, Wan Faiziah Abdullah, Rosni Musa, Kamarul Imran |
author_facet | Hanis, Tengku Muhammad Ruhaiyem, Nur Intan Raihana Arifin, Wan Nor Haron, Juhara Wan Abdul Rahman, Wan Faiziah Abdullah, Rosni Musa, Kamarul Imran |
author_sort | Hanis, Tengku Muhammad |
collection | PubMed |
description | This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer. |
format | Online Article Text |
id | pubmed-9689364 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96893642022-11-25 Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records Hanis, Tengku Muhammad Ruhaiyem, Nur Intan Raihana Arifin, Wan Nor Haron, Juhara Wan Abdul Rahman, Wan Faiziah Abdullah, Rosni Musa, Kamarul Imran Diagnostics (Basel) Article This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer. MDPI 2022-11-16 /pmc/articles/PMC9689364/ /pubmed/36428886 http://dx.doi.org/10.3390/diagnostics12112826 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Hanis, Tengku Muhammad Ruhaiyem, Nur Intan Raihana Arifin, Wan Nor Haron, Juhara Wan Abdul Rahman, Wan Faiziah Abdullah, Rosni Musa, Kamarul Imran Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records |
title | Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records |
title_full | Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records |
title_fullStr | Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records |
title_full_unstemmed | Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records |
title_short | Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records |
title_sort | over-the-counter breast cancer classification using machine learning and patient registration records |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689364/ https://www.ncbi.nlm.nih.gov/pubmed/36428886 http://dx.doi.org/10.3390/diagnostics12112826 |
work_keys_str_mv | AT hanistengkumuhammad overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords AT ruhaiyemnurintanraihana overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords AT arifinwannor overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords AT haronjuhara overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords AT wanabdulrahmanwanfaiziah overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords AT abdullahrosni overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords AT musakamarulimran overthecounterbreastcancerclassificationusingmachinelearningandpatientregistrationrecords |