Cargando…
An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant
Disease diagnosis faces challenges such as misdiagnosis, lack of diagnosis, and slow diagnosis. There are several machine learning techniques that have been applied to address these challenges, where a set of symptoms is applied to a classification model that predicts the presence or absence of a di...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550829/ https://www.ncbi.nlm.nih.gov/pubmed/34722764 http://dx.doi.org/10.1155/2021/4784057 |
_version_ | 1784591037592764416 |
---|---|
author | Jerop, Brenda Segera, Davies Rene |
author_facet | Jerop, Brenda Segera, Davies Rene |
author_sort | Jerop, Brenda |
collection | PubMed |
description | Disease diagnosis faces challenges such as misdiagnosis, lack of diagnosis, and slow diagnosis. There are several machine learning techniques that have been applied to address these challenges, where a set of symptoms is applied to a classification model that predicts the presence or absence of a disease. To improve on the performance of these techniques, this paper presents a technique which involves feature selection using principal component analysis (PCA), a hybrid kernel-based support vector machine (HKSVM) classification model and hyperparameter optimization using genetic algorithm (GA). The HKSVM in this paper introduces a new way of combining three kernels: Radial basis function (RBF), linear, and polynomial. Combining local (RBF) and global (linear and polynomial) kernels has the effect of improved model performance. This is because the local kernels are better able to distinguish points closer to each other while the global kernels are more suited to distinguish points that are far away from each other. The PCA-GA-HKSVM is used on 7 different medical datasets, with two datasets being multiclass datasets and 5 datasets being binary. Performance evaluation metrics used were accuracy, precision, and recall. It was observed that the PCA-GA-HKSVM offered better performance than the single kernel support vector machines (SVMs). |
format | Online Article Text |
id | pubmed-8550829 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-85508292021-10-28 An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant Jerop, Brenda Segera, Davies Rene Biomed Res Int Research Article Disease diagnosis faces challenges such as misdiagnosis, lack of diagnosis, and slow diagnosis. There are several machine learning techniques that have been applied to address these challenges, where a set of symptoms is applied to a classification model that predicts the presence or absence of a disease. To improve on the performance of these techniques, this paper presents a technique which involves feature selection using principal component analysis (PCA), a hybrid kernel-based support vector machine (HKSVM) classification model and hyperparameter optimization using genetic algorithm (GA). The HKSVM in this paper introduces a new way of combining three kernels: Radial basis function (RBF), linear, and polynomial. Combining local (RBF) and global (linear and polynomial) kernels has the effect of improved model performance. This is because the local kernels are better able to distinguish points closer to each other while the global kernels are more suited to distinguish points that are far away from each other. The PCA-GA-HKSVM is used on 7 different medical datasets, with two datasets being multiclass datasets and 5 datasets being binary. Performance evaluation metrics used were accuracy, precision, and recall. It was observed that the PCA-GA-HKSVM offered better performance than the single kernel support vector machines (SVMs). Hindawi 2021-10-20 /pmc/articles/PMC8550829/ /pubmed/34722764 http://dx.doi.org/10.1155/2021/4784057 Text en Copyright © 2021 Brenda Jerop and Davies Rene Segera. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Jerop, Brenda Segera, Davies Rene An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant |
title | An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant |
title_full | An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant |
title_fullStr | An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant |
title_full_unstemmed | An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant |
title_short | An Efficient PCA-GA-HKSVM-Based Disease Diagnostic Assistant |
title_sort | efficient pca-ga-hksvm-based disease diagnostic assistant |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550829/ https://www.ncbi.nlm.nih.gov/pubmed/34722764 http://dx.doi.org/10.1155/2021/4784057 |
work_keys_str_mv | AT jeropbrenda anefficientpcagahksvmbaseddiseasediagnosticassistant AT segeradaviesrene anefficientpcagahksvmbaseddiseasediagnosticassistant AT jeropbrenda efficientpcagahksvmbaseddiseasediagnosticassistant AT segeradaviesrene efficientpcagahksvmbaseddiseasediagnosticassistant |