Cargando…

An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant

Disease diagnosis faces challenges such as misdiagnosis, lack of diagnosis, and slow diagnosis. There are several machine learning techniques that have been applied to address these challenges, where a set of symptoms is applied to a classification model that predicts the presence or absence of a di...

Descripción completa

Detalles Bibliográficos
Autores principales: Jerop, Brenda, Segera, Davies Rene
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550829/
https://www.ncbi.nlm.nih.gov/pubmed/34722764
http://dx.doi.org/10.1155/2021/4784057
_version_ 1784591037592764416
author Jerop, Brenda
Segera, Davies Rene
author_facet Jerop, Brenda
Segera, Davies Rene
author_sort Jerop, Brenda
collection PubMed
description Disease diagnosis faces challenges such as misdiagnosis, lack of diagnosis, and slow diagnosis. There are several machine learning techniques that have been applied to address these challenges, where a set of symptoms is applied to a classification model that predicts the presence or absence of a disease. To improve on the performance of these techniques, this paper presents a technique which involves feature selection using principal component analysis (PCA), a hybrid kernel-based support vector machine (HKSVM) classification model and hyperparameter optimization using genetic algorithm (GA). The HKSVM in this paper introduces a new way of combining three kernels: Radial basis function (RBF), linear, and polynomial. Combining local (RBF) and global (linear and polynomial) kernels has the effect of improved model performance. This is because the local kernels are better able to distinguish points closer to each other while the global kernels are more suited to distinguish points that are far away from each other. The PCA-GA-HKSVM is used on 7 different medical datasets, with two datasets being multiclass datasets and 5 datasets being binary. Performance evaluation metrics used were accuracy, precision, and recall. It was observed that the PCA-GA-HKSVM offered better performance than the single kernel support vector machines (SVMs).
format Online
Article
Text
id pubmed-8550829
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-85508292021-10-28 An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant Jerop, Brenda Segera, Davies Rene Biomed Res Int Research Article Disease diagnosis faces challenges such as misdiagnosis, lack of diagnosis, and slow diagnosis. There are several machine learning techniques that have been applied to address these challenges, where a set of symptoms is applied to a classification model that predicts the presence or absence of a disease. To improve on the performance of these techniques, this paper presents a technique which involves feature selection using principal component analysis (PCA), a hybrid kernel-based support vector machine (HKSVM) classification model and hyperparameter optimization using genetic algorithm (GA). The HKSVM in this paper introduces a new way of combining three kernels: Radial basis function (RBF), linear, and polynomial. Combining local (RBF) and global (linear and polynomial) kernels has the effect of improved model performance. This is because the local kernels are better able to distinguish points closer to each other while the global kernels are more suited to distinguish points that are far away from each other. The PCA-GA-HKSVM is used on 7 different medical datasets, with two datasets being multiclass datasets and 5 datasets being binary. Performance evaluation metrics used were accuracy, precision, and recall. It was observed that the PCA-GA-HKSVM offered better performance than the single kernel support vector machines (SVMs). Hindawi 2021-10-20 /pmc/articles/PMC8550829/ /pubmed/34722764 http://dx.doi.org/10.1155/2021/4784057 Text en Copyright © 2021 Brenda Jerop and Davies Rene Segera. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jerop, Brenda
Segera, Davies Rene
An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant
title An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant
title_full An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant
title_fullStr An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant
title_full_unstemmed An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant
title_short An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant
title_sort efficient pca-ga-hksvm-based disease diagnostic assistant
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550829/
https://www.ncbi.nlm.nih.gov/pubmed/34722764
http://dx.doi.org/10.1155/2021/4784057
work_keys_str_mv AT jeropbrenda anefficientpcagahksvmbaseddiseasediagnosticassistant
AT segeradaviesrene anefficientpcagahksvmbaseddiseasediagnosticassistant
AT jeropbrenda efficientpcagahksvmbaseddiseasediagnosticassistant
AT segeradaviesrene efficientpcagahksvmbaseddiseasediagnosticassistant