Cargando…

Comparative Study of Classification Algorithms for Various DNA Microarray Data

Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Jingeun, Yoon, Yourim, Park, Hye-Jin, Kim, Yong-Hyuk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8951024/
https://www.ncbi.nlm.nih.gov/pubmed/35328048
http://dx.doi.org/10.3390/genes13030494
_version_ 1784675284278050816
author Kim, Jingeun
Yoon, Yourim
Park, Hye-Jin
Kim, Yong-Hyuk
author_facet Kim, Jingeun
Yoon, Yourim
Park, Hye-Jin
Kim, Yong-Hyuk
author_sort Kim, Jingeun
collection PubMed
description Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of classification algorithms over different data traits. The datasets were classified into test and control groups based on five utilized machine learning methods, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and k-Nearest Neighbors (KNN), and the resulting accuracies were compared. k-fold cross-validation was used in evaluating the performance and the result was analyzed by comparing the performances of the five machine learning methods. Through the experiments, it was observed that the two tree-based methods, DT and RF, showed similar trends in results and the remaining three methods, MLP, SVM, and DT, showed similar trends. DT and RF generally showed worse performance than other methods except for one dataset. This suggests that, for the effective classification of microarray data, selecting a classification algorithm that is suitable for data traits is crucial to ensure optimum performance.
format Online
Article
Text
id pubmed-8951024
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89510242022-03-26 Comparative Study of Classification Algorithms for Various DNA Microarray Data Kim, Jingeun Yoon, Yourim Park, Hye-Jin Kim, Yong-Hyuk Genes (Basel) Article Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of classification algorithms over different data traits. The datasets were classified into test and control groups based on five utilized machine learning methods, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and k-Nearest Neighbors (KNN), and the resulting accuracies were compared. k-fold cross-validation was used in evaluating the performance and the result was analyzed by comparing the performances of the five machine learning methods. Through the experiments, it was observed that the two tree-based methods, DT and RF, showed similar trends in results and the remaining three methods, MLP, SVM, and DT, showed similar trends. DT and RF generally showed worse performance than other methods except for one dataset. This suggests that, for the effective classification of microarray data, selecting a classification algorithm that is suitable for data traits is crucial to ensure optimum performance. MDPI 2022-03-11 /pmc/articles/PMC8951024/ /pubmed/35328048 http://dx.doi.org/10.3390/genes13030494 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kim, Jingeun
Yoon, Yourim
Park, Hye-Jin
Kim, Yong-Hyuk
Comparative Study of Classification Algorithms for Various DNA Microarray Data
title Comparative Study of Classification Algorithms for Various DNA Microarray Data
title_full Comparative Study of Classification Algorithms for Various DNA Microarray Data
title_fullStr Comparative Study of Classification Algorithms for Various DNA Microarray Data
title_full_unstemmed Comparative Study of Classification Algorithms for Various DNA Microarray Data
title_short Comparative Study of Classification Algorithms for Various DNA Microarray Data
title_sort comparative study of classification algorithms for various dna microarray data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8951024/
https://www.ncbi.nlm.nih.gov/pubmed/35328048
http://dx.doi.org/10.3390/genes13030494
work_keys_str_mv AT kimjingeun comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata
AT yoonyourim comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata
AT parkhyejin comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata
AT kimyonghyuk comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata