Cargando…

An efficient approach for feature construction of high-dimensional microarray data by random projections

Dimensionality reduction of microarray data is a very challenging task due to high computational time and the large amount of memory required to train and test a model. Genetic programming (GP) is a stochastic approach to solving a problem. For high dimensional datasets, GP does not perform as well...

Descripción completa

Detalles Bibliográficos
Autores principales: Tariq, Hassan, Eldridge, Elf, Welch, Ian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5922581/
https://www.ncbi.nlm.nih.gov/pubmed/29702670
http://dx.doi.org/10.1371/journal.pone.0196385
_version_ 1783318225509941248
author Tariq, Hassan
Eldridge, Elf
Welch, Ian
author_facet Tariq, Hassan
Eldridge, Elf
Welch, Ian
author_sort Tariq, Hassan
collection PubMed
description Dimensionality reduction of microarray data is a very challenging task due to high computational time and the large amount of memory required to train and test a model. Genetic programming (GP) is a stochastic approach to solving a problem. For high dimensional datasets, GP does not perform as well as other machine learning algorithms. To explore the inherent property of GP to generalize models from low dimensional data, we need to consider dimensionality reduction approaches. Random projections (RPs) have gained attention for reducing the dimensionality of data with reduced computational cost, compared to other dimensionality reduction approaches. We report that the features constructed from RPs perform extremely well when combined with a GP approach. We used eight datasets out of which seven have not been reported as being used in any machine learning research before. We have also compared our results by using the same full and constructed features for decision trees, random forest, naive Bayes, support vector machines and k-nearest neighbor methods.
format Online
Article
Text
id pubmed-5922581
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59225812018-05-11 An efficient approach for feature construction of high-dimensional microarray data by random projections Tariq, Hassan Eldridge, Elf Welch, Ian PLoS One Research Article Dimensionality reduction of microarray data is a very challenging task due to high computational time and the large amount of memory required to train and test a model. Genetic programming (GP) is a stochastic approach to solving a problem. For high dimensional datasets, GP does not perform as well as other machine learning algorithms. To explore the inherent property of GP to generalize models from low dimensional data, we need to consider dimensionality reduction approaches. Random projections (RPs) have gained attention for reducing the dimensionality of data with reduced computational cost, compared to other dimensionality reduction approaches. We report that the features constructed from RPs perform extremely well when combined with a GP approach. We used eight datasets out of which seven have not been reported as being used in any machine learning research before. We have also compared our results by using the same full and constructed features for decision trees, random forest, naive Bayes, support vector machines and k-nearest neighbor methods. Public Library of Science 2018-04-27 /pmc/articles/PMC5922581/ /pubmed/29702670 http://dx.doi.org/10.1371/journal.pone.0196385 Text en © 2018 Tariq et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tariq, Hassan
Eldridge, Elf
Welch, Ian
An efficient approach for feature construction of high-dimensional microarray data by random projections
title An efficient approach for feature construction of high-dimensional microarray data by random projections
title_full An efficient approach for feature construction of high-dimensional microarray data by random projections
title_fullStr An efficient approach for feature construction of high-dimensional microarray data by random projections
title_full_unstemmed An efficient approach for feature construction of high-dimensional microarray data by random projections
title_short An efficient approach for feature construction of high-dimensional microarray data by random projections
title_sort efficient approach for feature construction of high-dimensional microarray data by random projections
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5922581/
https://www.ncbi.nlm.nih.gov/pubmed/29702670
http://dx.doi.org/10.1371/journal.pone.0196385
work_keys_str_mv AT tariqhassan anefficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections
AT eldridgeelf anefficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections
AT welchian anefficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections
AT tariqhassan efficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections
AT eldridgeelf efficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections
AT welchian efficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections