Cargando…

Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers

DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that m...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Hualong, Hong, Shufang, Yang, Xibei, Ni, Jun, Dan, Yuanyuan, Qin, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3770038/
https://www.ncbi.nlm.nih.gov/pubmed/24078908
http://dx.doi.org/10.1155/2013/239628
_version_ 1782284058385973248
author Yu, Hualong
Hong, Shufang
Yang, Xibei
Ni, Jun
Dan, Yuanyuan
Qin, Bin
author_facet Yu, Hualong
Hong, Shufang
Yang, Xibei
Ni, Jun
Dan, Yuanyuan
Qin, Bin
author_sort Yu, Hualong
collection PubMed
description DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that most DNA microarray datasets are skewed, which causes traditional learning algorithms to produce inaccurate results. Some studies have considered this problem, yet they merely focus on binary-class problem. In this paper, we dealt with multiclass imbalanced classification problem, as encountered in cancer DNA microarray, by using ensemble learning. We utilized one-against-all coding strategy to transform multiclass to multiple binary classes, each of them carrying out feature subspace, which is an evolving version of random subspace that generates multiple diverse training subsets. Next, we introduced one of two different correction technologies, namely, decision threshold adjustment or random undersampling, into each training subset to alleviate the damage of class imbalance. Specifically, support vector machine was used as base classifier, and a novel voting rule called counter voting was presented for making a final decision. Experimental results on eight skewed multiclass cancer microarray datasets indicate that unlike many traditional classification approaches, our methods are insensitive to class imbalance.
format Online
Article
Text
id pubmed-3770038
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-37700382013-09-29 Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers Yu, Hualong Hong, Shufang Yang, Xibei Ni, Jun Dan, Yuanyuan Qin, Bin Biomed Res Int Research Article DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that most DNA microarray datasets are skewed, which causes traditional learning algorithms to produce inaccurate results. Some studies have considered this problem, yet they merely focus on binary-class problem. In this paper, we dealt with multiclass imbalanced classification problem, as encountered in cancer DNA microarray, by using ensemble learning. We utilized one-against-all coding strategy to transform multiclass to multiple binary classes, each of them carrying out feature subspace, which is an evolving version of random subspace that generates multiple diverse training subsets. Next, we introduced one of two different correction technologies, namely, decision threshold adjustment or random undersampling, into each training subset to alleviate the damage of class imbalance. Specifically, support vector machine was used as base classifier, and a novel voting rule called counter voting was presented for making a final decision. Experimental results on eight skewed multiclass cancer microarray datasets indicate that unlike many traditional classification approaches, our methods are insensitive to class imbalance. Hindawi Publishing Corporation 2013 2013-08-26 /pmc/articles/PMC3770038/ /pubmed/24078908 http://dx.doi.org/10.1155/2013/239628 Text en Copyright © 2013 Hualong Yu et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yu, Hualong
Hong, Shufang
Yang, Xibei
Ni, Jun
Dan, Yuanyuan
Qin, Bin
Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
title Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
title_full Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
title_fullStr Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
title_full_unstemmed Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
title_short Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
title_sort recognition of multiple imbalanced cancer types based on dna microarray data using ensemble classifiers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3770038/
https://www.ncbi.nlm.nih.gov/pubmed/24078908
http://dx.doi.org/10.1155/2013/239628
work_keys_str_mv AT yuhualong recognitionofmultipleimbalancedcancertypesbasedondnamicroarraydatausingensembleclassifiers
AT hongshufang recognitionofmultipleimbalancedcancertypesbasedondnamicroarraydatausingensembleclassifiers
AT yangxibei recognitionofmultipleimbalancedcancertypesbasedondnamicroarraydatausingensembleclassifiers
AT nijun recognitionofmultipleimbalancedcancertypesbasedondnamicroarraydatausingensembleclassifiers
AT danyuanyuan recognitionofmultipleimbalancedcancertypesbasedondnamicroarraydatausingensembleclassifiers
AT qinbin recognitionofmultipleimbalancedcancertypesbasedondnamicroarraydatausingensembleclassifiers