Cargando…

An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer

One of the deadliest diseases which affects the large intestine is colon cancer. Older adults are typically affected by colon cancer though it can happen at any age. It generally starts as small benign growth of cells that forms on the inside of the colon, and later, it develops into cancer. Due to...

Descripción completa

Detalles Bibliográficos
Autores principales: Prabhakar, Sunil Kumar, Rajaguru, Harikumar, Kim, Sun-Hee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7578727/
https://www.ncbi.nlm.nih.gov/pubmed/33102596
http://dx.doi.org/10.1155/2020/8427574
_version_ 1783598428947742720
author Prabhakar, Sunil Kumar
Rajaguru, Harikumar
Kim, Sun-Hee
author_facet Prabhakar, Sunil Kumar
Rajaguru, Harikumar
Kim, Sun-Hee
author_sort Prabhakar, Sunil Kumar
collection PubMed
description One of the deadliest diseases which affects the large intestine is colon cancer. Older adults are typically affected by colon cancer though it can happen at any age. It generally starts as small benign growth of cells that forms on the inside of the colon, and later, it develops into cancer. Due to the propagation of somatic alterations that affects the gene expression, colon cancer is caused. A standardized format for assessing the expression levels of thousands of genes is provided by the DNA microarray technology. The tumors of various anatomical regions can be distinguished by the patterns of gene expression in microarray technology. As the microarray data is too huge to process due to the curse of dimensionality problem, an amalgamated approach of utilizing bilevel feature selection techniques is proposed in this paper. In the first level, the genes or the features are dimensionally reduced with the help of Multivariate Minimum Redundancy–Maximum Relevance (MRMR) technique. Then, in the second level, six optimization techniques are utilized in this work for selecting the best genes or features before proceeding to classification process. The optimization techniques considered in this work are Invasive Weed Optimization (IWO), Teaching Learning-Based Optimization (TLBO), League Championship Optimization (LCO), Beetle Antennae Search Optimization (BASO), Crow Search Optimization (CSO), and Fruit Fly Optimization (FFO). Finally, it is classified with five suitable classifiers, and the best results show when IWO is utilized with MRMR, and then classified with Quadratic Discriminant Analysis (QDA), a classification accuracy of 99.16% is obtained.
format Online
Article
Text
id pubmed-7578727
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-75787272020-10-22 An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer Prabhakar, Sunil Kumar Rajaguru, Harikumar Kim, Sun-Hee Biomed Res Int Research Article One of the deadliest diseases which affects the large intestine is colon cancer. Older adults are typically affected by colon cancer though it can happen at any age. It generally starts as small benign growth of cells that forms on the inside of the colon, and later, it develops into cancer. Due to the propagation of somatic alterations that affects the gene expression, colon cancer is caused. A standardized format for assessing the expression levels of thousands of genes is provided by the DNA microarray technology. The tumors of various anatomical regions can be distinguished by the patterns of gene expression in microarray technology. As the microarray data is too huge to process due to the curse of dimensionality problem, an amalgamated approach of utilizing bilevel feature selection techniques is proposed in this paper. In the first level, the genes or the features are dimensionally reduced with the help of Multivariate Minimum Redundancy–Maximum Relevance (MRMR) technique. Then, in the second level, six optimization techniques are utilized in this work for selecting the best genes or features before proceeding to classification process. The optimization techniques considered in this work are Invasive Weed Optimization (IWO), Teaching Learning-Based Optimization (TLBO), League Championship Optimization (LCO), Beetle Antennae Search Optimization (BASO), Crow Search Optimization (CSO), and Fruit Fly Optimization (FFO). Finally, it is classified with five suitable classifiers, and the best results show when IWO is utilized with MRMR, and then classified with Quadratic Discriminant Analysis (QDA), a classification accuracy of 99.16% is obtained. Hindawi 2020-10-13 /pmc/articles/PMC7578727/ /pubmed/33102596 http://dx.doi.org/10.1155/2020/8427574 Text en Copyright © 2020 Sunil Kumar Prabhakar et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Prabhakar, Sunil Kumar
Rajaguru, Harikumar
Kim, Sun-Hee
An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer
title An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer
title_full An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer
title_fullStr An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer
title_full_unstemmed An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer
title_short An Amalgamated Approach to Bilevel Feature Selection Techniques Utilizing Soft Computing Methods for Classifying Colon Cancer
title_sort amalgamated approach to bilevel feature selection techniques utilizing soft computing methods for classifying colon cancer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7578727/
https://www.ncbi.nlm.nih.gov/pubmed/33102596
http://dx.doi.org/10.1155/2020/8427574
work_keys_str_mv AT prabhakarsunilkumar anamalgamatedapproachtobilevelfeatureselectiontechniquesutilizingsoftcomputingmethodsforclassifyingcoloncancer
AT rajaguruharikumar anamalgamatedapproachtobilevelfeatureselectiontechniquesutilizingsoftcomputingmethodsforclassifyingcoloncancer
AT kimsunhee anamalgamatedapproachtobilevelfeatureselectiontechniquesutilizingsoftcomputingmethodsforclassifyingcoloncancer
AT prabhakarsunilkumar amalgamatedapproachtobilevelfeatureselectiontechniquesutilizingsoftcomputingmethodsforclassifyingcoloncancer
AT rajaguruharikumar amalgamatedapproachtobilevelfeatureselectiontechniquesutilizingsoftcomputingmethodsforclassifyingcoloncancer
AT kimsunhee amalgamatedapproachtobilevelfeatureselectiontechniquesutilizingsoftcomputingmethodsforclassifyingcoloncancer