Cargando…

Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem

Background and Objectives: Recently, many studies have focused on the early detection of Parkinson’s disease (PD). This disease belongs to a group of neurological problems that immediately affect brain cells and influence the movement, hearing, and various cognitive functions. Medical data sets are...

Descripción completa

Detalles Bibliográficos
Autores principales: Qasim, Hayder Mohammed, Ata, Oguz, Ansari, Mohammad Azam, Alomary, Mohammad N., Alghamdi, Saad, Almehmadi, Mazen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8619928/
https://www.ncbi.nlm.nih.gov/pubmed/34833435
http://dx.doi.org/10.3390/medicina57111217
_version_ 1784605102453030912
author Qasim, Hayder Mohammed
Ata, Oguz
Ansari, Mohammad Azam
Alomary, Mohammad N.
Alghamdi, Saad
Almehmadi, Mazen
author_facet Qasim, Hayder Mohammed
Ata, Oguz
Ansari, Mohammad Azam
Alomary, Mohammad N.
Alghamdi, Saad
Almehmadi, Mazen
author_sort Qasim, Hayder Mohammed
collection PubMed
description Background and Objectives: Recently, many studies have focused on the early detection of Parkinson’s disease (PD). This disease belongs to a group of neurological problems that immediately affect brain cells and influence the movement, hearing, and various cognitive functions. Medical data sets are often not equally distributed in their classes and this gives a bias in the classification of patients. We performed a Hybrid feature selection framework that can deal with imbalanced datasets like PD. Use the SOMTE algorithm to deal with unbalanced datasets. Removing the contradiction from the features in the dataset and decrease the processing time by using Recursive Feature Elimination (RFE), and Principle Component Analysis (PCA). Materials and Methods: PD acoustic datasets and the characteristics of control subjects were used to construct classification models such as Bagging, K-nearest neighbour (KNN), multilayer perceptron, and the support vector machine (SVM). In the prepressing stage, the synthetic minority over-sampling technique (SMOTE) with two-feature selection RFE and PCA were used. The PD dataset comprises a large difference between the numbers of the infected and uninfected patients, which causes the classification bias problem. Therefore, SMOTE was used to resolve this problem. Results: For model evaluation, the train–test split technique was used for the experiment. All the models were Grid-search tuned, the evaluation results of the SVM model showed the highest accuracy of 98.2%, and the KNN model exhibited the highest specificity of 99%. Conclusions: the proposed method is compared with the current modern methods of detecting Parkinson’s disease and other methods for medical diseases, it was noted that our developed system could treat data bias and reach a high prediction of PD and this can be beneficial for health organizations to properly prioritize assets.
format Online
Article
Text
id pubmed-8619928
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86199282021-11-27 Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem Qasim, Hayder Mohammed Ata, Oguz Ansari, Mohammad Azam Alomary, Mohammad N. Alghamdi, Saad Almehmadi, Mazen Medicina (Kaunas) Article Background and Objectives: Recently, many studies have focused on the early detection of Parkinson’s disease (PD). This disease belongs to a group of neurological problems that immediately affect brain cells and influence the movement, hearing, and various cognitive functions. Medical data sets are often not equally distributed in their classes and this gives a bias in the classification of patients. We performed a Hybrid feature selection framework that can deal with imbalanced datasets like PD. Use the SOMTE algorithm to deal with unbalanced datasets. Removing the contradiction from the features in the dataset and decrease the processing time by using Recursive Feature Elimination (RFE), and Principle Component Analysis (PCA). Materials and Methods: PD acoustic datasets and the characteristics of control subjects were used to construct classification models such as Bagging, K-nearest neighbour (KNN), multilayer perceptron, and the support vector machine (SVM). In the prepressing stage, the synthetic minority over-sampling technique (SMOTE) with two-feature selection RFE and PCA were used. The PD dataset comprises a large difference between the numbers of the infected and uninfected patients, which causes the classification bias problem. Therefore, SMOTE was used to resolve this problem. Results: For model evaluation, the train–test split technique was used for the experiment. All the models were Grid-search tuned, the evaluation results of the SVM model showed the highest accuracy of 98.2%, and the KNN model exhibited the highest specificity of 99%. Conclusions: the proposed method is compared with the current modern methods of detecting Parkinson’s disease and other methods for medical diseases, it was noted that our developed system could treat data bias and reach a high prediction of PD and this can be beneficial for health organizations to properly prioritize assets. MDPI 2021-11-08 /pmc/articles/PMC8619928/ /pubmed/34833435 http://dx.doi.org/10.3390/medicina57111217 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Qasim, Hayder Mohammed
Ata, Oguz
Ansari, Mohammad Azam
Alomary, Mohammad N.
Alghamdi, Saad
Almehmadi, Mazen
Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem
title Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem
title_full Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem
title_fullStr Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem
title_full_unstemmed Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem
title_short Hybrid Feature Selection Framework for the Parkinson Imbalanced Dataset Prediction Problem
title_sort hybrid feature selection framework for the parkinson imbalanced dataset prediction problem
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8619928/
https://www.ncbi.nlm.nih.gov/pubmed/34833435
http://dx.doi.org/10.3390/medicina57111217
work_keys_str_mv AT qasimhaydermohammed hybridfeatureselectionframeworkfortheparkinsonimbalanceddatasetpredictionproblem
AT ataoguz hybridfeatureselectionframeworkfortheparkinsonimbalanceddatasetpredictionproblem
AT ansarimohammadazam hybridfeatureselectionframeworkfortheparkinsonimbalanceddatasetpredictionproblem
AT alomarymohammadn hybridfeatureselectionframeworkfortheparkinsonimbalanceddatasetpredictionproblem
AT alghamdisaad hybridfeatureselectionframeworkfortheparkinsonimbalanceddatasetpredictionproblem
AT almehmadimazen hybridfeatureselectionframeworkfortheparkinsonimbalanceddatasetpredictionproblem