Cargando…
Comparative Study of Classification Algorithms for Various DNA Microarray Data
Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8951024/ https://www.ncbi.nlm.nih.gov/pubmed/35328048 http://dx.doi.org/10.3390/genes13030494 |
_version_ | 1784675284278050816 |
---|---|
author | Kim, Jingeun Yoon, Yourim Park, Hye-Jin Kim, Yong-Hyuk |
author_facet | Kim, Jingeun Yoon, Yourim Park, Hye-Jin Kim, Yong-Hyuk |
author_sort | Kim, Jingeun |
collection | PubMed |
description | Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of classification algorithms over different data traits. The datasets were classified into test and control groups based on five utilized machine learning methods, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and k-Nearest Neighbors (KNN), and the resulting accuracies were compared. k-fold cross-validation was used in evaluating the performance and the result was analyzed by comparing the performances of the five machine learning methods. Through the experiments, it was observed that the two tree-based methods, DT and RF, showed similar trends in results and the remaining three methods, MLP, SVM, and DT, showed similar trends. DT and RF generally showed worse performance than other methods except for one dataset. This suggests that, for the effective classification of microarray data, selecting a classification algorithm that is suitable for data traits is crucial to ensure optimum performance. |
format | Online Article Text |
id | pubmed-8951024 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-89510242022-03-26 Comparative Study of Classification Algorithms for Various DNA Microarray Data Kim, Jingeun Yoon, Yourim Park, Hye-Jin Kim, Yong-Hyuk Genes (Basel) Article Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of classification algorithms over different data traits. The datasets were classified into test and control groups based on five utilized machine learning methods, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and k-Nearest Neighbors (KNN), and the resulting accuracies were compared. k-fold cross-validation was used in evaluating the performance and the result was analyzed by comparing the performances of the five machine learning methods. Through the experiments, it was observed that the two tree-based methods, DT and RF, showed similar trends in results and the remaining three methods, MLP, SVM, and DT, showed similar trends. DT and RF generally showed worse performance than other methods except for one dataset. This suggests that, for the effective classification of microarray data, selecting a classification algorithm that is suitable for data traits is crucial to ensure optimum performance. MDPI 2022-03-11 /pmc/articles/PMC8951024/ /pubmed/35328048 http://dx.doi.org/10.3390/genes13030494 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kim, Jingeun Yoon, Yourim Park, Hye-Jin Kim, Yong-Hyuk Comparative Study of Classification Algorithms for Various DNA Microarray Data |
title | Comparative Study of Classification Algorithms for Various DNA Microarray Data |
title_full | Comparative Study of Classification Algorithms for Various DNA Microarray Data |
title_fullStr | Comparative Study of Classification Algorithms for Various DNA Microarray Data |
title_full_unstemmed | Comparative Study of Classification Algorithms for Various DNA Microarray Data |
title_short | Comparative Study of Classification Algorithms for Various DNA Microarray Data |
title_sort | comparative study of classification algorithms for various dna microarray data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8951024/ https://www.ncbi.nlm.nih.gov/pubmed/35328048 http://dx.doi.org/10.3390/genes13030494 |
work_keys_str_mv | AT kimjingeun comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata AT yoonyourim comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata AT parkhyejin comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata AT kimyonghyuk comparativestudyofclassificationalgorithmsforvariousdnamicroarraydata |