Cargando…

Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance

Diabetes is a life-threatening, non-communicable disease. Diabetes mellitus is a prevalent chronic disease with a significant global impact. The timely detection of diabetes in patients is necessary for an effective treatment. The primary objective of this study is to propose a novel approach for id...

Descripción completa

Detalles Bibliográficos
Autores principales: Chellappan, Dinesh, Rajaguru, Harikumar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10453776/
https://www.ncbi.nlm.nih.gov/pubmed/37627916
http://dx.doi.org/10.3390/diagnostics13162654
_version_ 1785096024421826560
author Chellappan, Dinesh
Rajaguru, Harikumar
author_facet Chellappan, Dinesh
Rajaguru, Harikumar
author_sort Chellappan, Dinesh
collection PubMed
description Diabetes is a life-threatening, non-communicable disease. Diabetes mellitus is a prevalent chronic disease with a significant global impact. The timely detection of diabetes in patients is necessary for an effective treatment. The primary objective of this study is to propose a novel approach for identifying type II diabetes mellitus using microarray gene data. Specifically, our research focuses on the performance enhancement of methods for detecting diabetes. Four different Dimensionality Reduction techniques, Detrend Fluctuation Analysis (DFA), the Chi-square probability density function (Chi2pdf), the Firefly algorithm, and Cuckoo Search, are used to reduce high dimensional data. Metaheuristic algorithms like Particle Swarm Optimization (PSO) and Harmonic Search (HS) are used for feature selection. Seven classifiers, Non-Linear Regression (NLR), Linear Regression (LR), Logistics Regression (LoR), Gaussian Mixture Model (GMM), Bayesian Linear Discriminant Classifier (BLDC), Softmax Discriminant Classifier (SDC), and Support Vector Machine—Radial Basis Function (SVM-RBF), are utilized to classify the diabetic and non-diabetic classes. The classifiers’ performances are analyzed through parameters such as accuracy, recall, precision, F1 score, error rate, Matthews Correlation Coefficient (MCC), Jaccard metric, and kappa. The SVM (RBF) classifier with the Chi2pdf Dimensionality Reduction technique with a PSO feature selection method attained a high accuracy of 91% with a Kappa of 0.7961, outperforming all of the other classifiers.
format Online
Article
Text
id pubmed-10453776
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104537762023-08-26 Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance Chellappan, Dinesh Rajaguru, Harikumar Diagnostics (Basel) Article Diabetes is a life-threatening, non-communicable disease. Diabetes mellitus is a prevalent chronic disease with a significant global impact. The timely detection of diabetes in patients is necessary for an effective treatment. The primary objective of this study is to propose a novel approach for identifying type II diabetes mellitus using microarray gene data. Specifically, our research focuses on the performance enhancement of methods for detecting diabetes. Four different Dimensionality Reduction techniques, Detrend Fluctuation Analysis (DFA), the Chi-square probability density function (Chi2pdf), the Firefly algorithm, and Cuckoo Search, are used to reduce high dimensional data. Metaheuristic algorithms like Particle Swarm Optimization (PSO) and Harmonic Search (HS) are used for feature selection. Seven classifiers, Non-Linear Regression (NLR), Linear Regression (LR), Logistics Regression (LoR), Gaussian Mixture Model (GMM), Bayesian Linear Discriminant Classifier (BLDC), Softmax Discriminant Classifier (SDC), and Support Vector Machine—Radial Basis Function (SVM-RBF), are utilized to classify the diabetic and non-diabetic classes. The classifiers’ performances are analyzed through parameters such as accuracy, recall, precision, F1 score, error rate, Matthews Correlation Coefficient (MCC), Jaccard metric, and kappa. The SVM (RBF) classifier with the Chi2pdf Dimensionality Reduction technique with a PSO feature selection method attained a high accuracy of 91% with a Kappa of 0.7961, outperforming all of the other classifiers. MDPI 2023-08-11 /pmc/articles/PMC10453776/ /pubmed/37627916 http://dx.doi.org/10.3390/diagnostics13162654 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chellappan, Dinesh
Rajaguru, Harikumar
Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_full Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_fullStr Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_full_unstemmed Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_short Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance
title_sort detection of diabetes through microarray genes with enhancement of classifiers performance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10453776/
https://www.ncbi.nlm.nih.gov/pubmed/37627916
http://dx.doi.org/10.3390/diagnostics13162654
work_keys_str_mv AT chellappandinesh detectionofdiabetesthroughmicroarraygeneswithenhancementofclassifiersperformance
AT rajaguruharikumar detectionofdiabetesthroughmicroarraygeneswithenhancementofclassifiersperformance