Cargando…

Gene masking - a technique to improve accuracy for cancer classification with high dimensionality in microarray data

BACKGROUND: High dimensional feature space generally degrades classification in several applications. In this paper, we propose a strategy called gene masking, in which non-contributing dimensions are heuristically removed from the data to improve classification accuracy. METHODS: Gene masking is im...

Descripción completa

Detalles Bibliográficos
Autores principales: Saini, Harsh, Lal, Sunil Pranit, Naidu, Vimal Vikash, Pickering, Vincel Wince, Singh, Gurmeet, Tsunoda, Tatsuhiko, Sharma, Alok
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5260793/
https://www.ncbi.nlm.nih.gov/pubmed/28117659
http://dx.doi.org/10.1186/s12920-016-0233-2
Descripción
Sumario:BACKGROUND: High dimensional feature space generally degrades classification in several applications. In this paper, we propose a strategy called gene masking, in which non-contributing dimensions are heuristically removed from the data to improve classification accuracy. METHODS: Gene masking is implemented via a binary encoded genetic algorithm that can be integrated seamlessly with classifiers during the training phase of classification to perform feature selection. It can also be used to discriminate between features that contribute most to the classification, thereby, allowing researchers to isolate features that may have special significance. RESULTS: This technique was applied on publicly available datasets whereby it substantially reduced the number of features used for classification while maintaining high accuracies. CONCLUSION: The proposed technique can be extremely useful in feature selection as it heuristically removes non-contributing features to improve the performance of classifiers.