Cargando…

Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes

In the microarray gene expression data, there are a large number of genes that are expressed at varying levels of expression. Given that there are only a few critically significant genes, it is challenging to analyze and categorize datasets that span the whole gene space. In order to aid in the diag...

Descripción completa

Detalles Bibliográficos
Autores principales: Sathya, M., Jeyaselvi, M., Joshi, Shubham, Pandey, Ekta, Pareek, Piyush Kumar, Jamal, Sajjad Shaukat, Kumar, Vinay, Atiglah, Henry Kwame
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8888099/
https://www.ncbi.nlm.nih.gov/pubmed/35242297
http://dx.doi.org/10.1155/2022/5821938
_version_ 1784661057794473984
author Sathya, M.
Jeyaselvi, M.
Joshi, Shubham
Pandey, Ekta
Pareek, Piyush Kumar
Jamal, Sajjad Shaukat
Kumar, Vinay
Atiglah, Henry Kwame
author_facet Sathya, M.
Jeyaselvi, M.
Joshi, Shubham
Pandey, Ekta
Pareek, Piyush Kumar
Jamal, Sajjad Shaukat
Kumar, Vinay
Atiglah, Henry Kwame
author_sort Sathya, M.
collection PubMed
description In the microarray gene expression data, there are a large number of genes that are expressed at varying levels of expression. Given that there are only a few critically significant genes, it is challenging to analyze and categorize datasets that span the whole gene space. In order to aid in the diagnosis of cancer disease and, as a consequence, the suggestion of individualized treatment, the discovery of biomarker genes is essential. Starting with a large pool of candidates, the parallelized minimal redundancy and maximum relevance ensemble (mRMRe) is used to choose the top m informative genes from a huge pool of candidates. A Genetic Algorithm (GA) is used to heuristically compute the ideal set of genes by applying the Mahalanobis Distance (MD) as a distance metric. Once the genes have been identified, they are input into the GA. It is used as a classifier to four microarray datasets using the approved approach (mRMRe-GA), with the Support Vector Machine (SVM) serving as the classification basis. Leave-One-Out-Cross-Validation (LOOCV) is a cross-validation technique for assessing the performance of a classifier. It is now being investigated if the proposed mRMRe-GA strategy can be compared to other approaches. It has been shown that the proposed mRMRe-GA approach enhances classification accuracy while employing less genetic material than previous methods. Microarray, Gene Expression Data, GA, Feature Selection, SVM, and Cancer Classification are some of the terms used in this paper.
format Online
Article
Text
id pubmed-8888099
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-88880992022-03-02 Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes Sathya, M. Jeyaselvi, M. Joshi, Shubham Pandey, Ekta Pareek, Piyush Kumar Jamal, Sajjad Shaukat Kumar, Vinay Atiglah, Henry Kwame J Healthc Eng Research Article In the microarray gene expression data, there are a large number of genes that are expressed at varying levels of expression. Given that there are only a few critically significant genes, it is challenging to analyze and categorize datasets that span the whole gene space. In order to aid in the diagnosis of cancer disease and, as a consequence, the suggestion of individualized treatment, the discovery of biomarker genes is essential. Starting with a large pool of candidates, the parallelized minimal redundancy and maximum relevance ensemble (mRMRe) is used to choose the top m informative genes from a huge pool of candidates. A Genetic Algorithm (GA) is used to heuristically compute the ideal set of genes by applying the Mahalanobis Distance (MD) as a distance metric. Once the genes have been identified, they are input into the GA. It is used as a classifier to four microarray datasets using the approved approach (mRMRe-GA), with the Support Vector Machine (SVM) serving as the classification basis. Leave-One-Out-Cross-Validation (LOOCV) is a cross-validation technique for assessing the performance of a classifier. It is now being investigated if the proposed mRMRe-GA strategy can be compared to other approaches. It has been shown that the proposed mRMRe-GA approach enhances classification accuracy while employing less genetic material than previous methods. Microarray, Gene Expression Data, GA, Feature Selection, SVM, and Cancer Classification are some of the terms used in this paper. Hindawi 2022-02-22 /pmc/articles/PMC8888099/ /pubmed/35242297 http://dx.doi.org/10.1155/2022/5821938 Text en Copyright © 2022 M. Sathya et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sathya, M.
Jeyaselvi, M.
Joshi, Shubham
Pandey, Ekta
Pareek, Piyush Kumar
Jamal, Sajjad Shaukat
Kumar, Vinay
Atiglah, Henry Kwame
Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes
title Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes
title_full Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes
title_fullStr Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes
title_full_unstemmed Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes
title_short Cancer Categorization Using Genetic Algorithm to Identify Biomarker Genes
title_sort cancer categorization using genetic algorithm to identify biomarker genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8888099/
https://www.ncbi.nlm.nih.gov/pubmed/35242297
http://dx.doi.org/10.1155/2022/5821938
work_keys_str_mv AT sathyam cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes
AT jeyaselvim cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes
AT joshishubham cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes
AT pandeyekta cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes
AT pareekpiyushkumar cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes
AT jamalsajjadshaukat cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes
AT kumarvinay cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes
AT atiglahhenrykwame cancercategorizationusinggeneticalgorithmtoidentifybiomarkergenes