Cargando…

Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction

Disease risk prediction is a rising challenge in the medical domain. Researchers have widely used machine learning algorithms to solve this challenge. The k-nearest neighbour (KNN) algorithm is the most frequently used among the wide range of machine learning algorithms. This paper presents a study...

Descripción completa

Detalles Bibliográficos
Autores principales:	Uddin, Shahadat, Haque, Ibtisham, Lu, Haohui, Moni, Mohammad Ali, Gide, Ergun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9012855/ https://www.ncbi.nlm.nih.gov/pubmed/35428863 http://dx.doi.org/10.1038/s41598-022-10358-x

_version_	1784687879459438592
author	Uddin, Shahadat Haque, Ibtisham Lu, Haohui Moni, Mohammad Ali Gide, Ergun
author_facet	Uddin, Shahadat Haque, Ibtisham Lu, Haohui Moni, Mohammad Ali Gide, Ergun
author_sort	Uddin, Shahadat
collection	PubMed
description	Disease risk prediction is a rising challenge in the medical domain. Researchers have widely used machine learning algorithms to solve this challenge. The k-nearest neighbour (KNN) algorithm is the most frequently used among the wide range of machine learning algorithms. This paper presents a study on different KNN variants (Classic one, Adaptive, Locally adaptive, k-means clustering, Fuzzy, Mutual, Ensemble, Hassanat and Generalised mean distance) and their performance comparison for disease prediction. This study analysed these variants in-depth through implementations and experimentations using eight machine learning benchmark datasets obtained from Kaggle, UCI Machine learning repository and OpenML. The datasets were related to different disease contexts. We considered the performance measures of accuracy, precision and recall for comparative analysis. The average accuracy values of these variants ranged from 64.22% to 83.62%. The Hassanaat KNN showed the highest average accuracy (83.62%), followed by the ensemble approach KNN (82.34%). A relative performance index is also proposed based on each performance measure to assess each variant and compare the results. This study identified Hassanat KNN as the best performing variant based on the accuracy-based version of this index, followed by the ensemble approach KNN. This study also provided a relative comparison among KNN variants based on precision and recall measures. Finally, this paper summarises which KNN variant is the most promising candidate to follow under the consideration of three performance measures (accuracy, precision and recall) for disease prediction. Healthcare researchers and stakeholders could use the findings of this study to select the appropriate KNN variant for predictive disease risk analytics.
format	Online Article Text
id	pubmed-9012855
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-90128552022-04-18 Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction Uddin, Shahadat Haque, Ibtisham Lu, Haohui Moni, Mohammad Ali Gide, Ergun Sci Rep Article Disease risk prediction is a rising challenge in the medical domain. Researchers have widely used machine learning algorithms to solve this challenge. The k-nearest neighbour (KNN) algorithm is the most frequently used among the wide range of machine learning algorithms. This paper presents a study on different KNN variants (Classic one, Adaptive, Locally adaptive, k-means clustering, Fuzzy, Mutual, Ensemble, Hassanat and Generalised mean distance) and their performance comparison for disease prediction. This study analysed these variants in-depth through implementations and experimentations using eight machine learning benchmark datasets obtained from Kaggle, UCI Machine learning repository and OpenML. The datasets were related to different disease contexts. We considered the performance measures of accuracy, precision and recall for comparative analysis. The average accuracy values of these variants ranged from 64.22% to 83.62%. The Hassanaat KNN showed the highest average accuracy (83.62%), followed by the ensemble approach KNN (82.34%). A relative performance index is also proposed based on each performance measure to assess each variant and compare the results. This study identified Hassanat KNN as the best performing variant based on the accuracy-based version of this index, followed by the ensemble approach KNN. This study also provided a relative comparison among KNN variants based on precision and recall measures. Finally, this paper summarises which KNN variant is the most promising candidate to follow under the consideration of three performance measures (accuracy, precision and recall) for disease prediction. Healthcare researchers and stakeholders could use the findings of this study to select the appropriate KNN variant for predictive disease risk analytics. Nature Publishing Group UK 2022-04-15 /pmc/articles/PMC9012855/ /pubmed/35428863 http://dx.doi.org/10.1038/s41598-022-10358-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Uddin, Shahadat Haque, Ibtisham Lu, Haohui Moni, Mohammad Ali Gide, Ergun Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
title	Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
title_full	Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
title_fullStr	Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
title_full_unstemmed	Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
title_short	Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction
title_sort	comparative performance analysis of k-nearest neighbour (knn) algorithm and its different variants for disease prediction
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9012855/ https://www.ncbi.nlm.nih.gov/pubmed/35428863 http://dx.doi.org/10.1038/s41598-022-10358-x
work_keys_str_mv	AT uddinshahadat comparativeperformanceanalysisofknearestneighbourknnalgorithmanditsdifferentvariantsfordiseaseprediction AT haqueibtisham comparativeperformanceanalysisofknearestneighbourknnalgorithmanditsdifferentvariantsfordiseaseprediction AT luhaohui comparativeperformanceanalysisofknearestneighbourknnalgorithmanditsdifferentvariantsfordiseaseprediction AT monimohammadali comparativeperformanceanalysisofknearestneighbourknnalgorithmanditsdifferentvariantsfordiseaseprediction AT gideergun comparativeperformanceanalysisofknearestneighbourknnalgorithmanditsdifferentvariantsfordiseaseprediction

Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction

Ejemplares similares