Cargando…
An efficient approach for feature construction of high-dimensional microarray data by random projections
Dimensionality reduction of microarray data is a very challenging task due to high computational time and the large amount of memory required to train and test a model. Genetic programming (GP) is a stochastic approach to solving a problem. For high dimensional datasets, GP does not perform as well...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5922581/ https://www.ncbi.nlm.nih.gov/pubmed/29702670 http://dx.doi.org/10.1371/journal.pone.0196385 |
_version_ | 1783318225509941248 |
---|---|
author | Tariq, Hassan Eldridge, Elf Welch, Ian |
author_facet | Tariq, Hassan Eldridge, Elf Welch, Ian |
author_sort | Tariq, Hassan |
collection | PubMed |
description | Dimensionality reduction of microarray data is a very challenging task due to high computational time and the large amount of memory required to train and test a model. Genetic programming (GP) is a stochastic approach to solving a problem. For high dimensional datasets, GP does not perform as well as other machine learning algorithms. To explore the inherent property of GP to generalize models from low dimensional data, we need to consider dimensionality reduction approaches. Random projections (RPs) have gained attention for reducing the dimensionality of data with reduced computational cost, compared to other dimensionality reduction approaches. We report that the features constructed from RPs perform extremely well when combined with a GP approach. We used eight datasets out of which seven have not been reported as being used in any machine learning research before. We have also compared our results by using the same full and constructed features for decision trees, random forest, naive Bayes, support vector machines and k-nearest neighbor methods. |
format | Online Article Text |
id | pubmed-5922581 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-59225812018-05-11 An efficient approach for feature construction of high-dimensional microarray data by random projections Tariq, Hassan Eldridge, Elf Welch, Ian PLoS One Research Article Dimensionality reduction of microarray data is a very challenging task due to high computational time and the large amount of memory required to train and test a model. Genetic programming (GP) is a stochastic approach to solving a problem. For high dimensional datasets, GP does not perform as well as other machine learning algorithms. To explore the inherent property of GP to generalize models from low dimensional data, we need to consider dimensionality reduction approaches. Random projections (RPs) have gained attention for reducing the dimensionality of data with reduced computational cost, compared to other dimensionality reduction approaches. We report that the features constructed from RPs perform extremely well when combined with a GP approach. We used eight datasets out of which seven have not been reported as being used in any machine learning research before. We have also compared our results by using the same full and constructed features for decision trees, random forest, naive Bayes, support vector machines and k-nearest neighbor methods. Public Library of Science 2018-04-27 /pmc/articles/PMC5922581/ /pubmed/29702670 http://dx.doi.org/10.1371/journal.pone.0196385 Text en © 2018 Tariq et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Tariq, Hassan Eldridge, Elf Welch, Ian An efficient approach for feature construction of high-dimensional microarray data by random projections |
title | An efficient approach for feature construction of high-dimensional microarray data by random projections |
title_full | An efficient approach for feature construction of high-dimensional microarray data by random projections |
title_fullStr | An efficient approach for feature construction of high-dimensional microarray data by random projections |
title_full_unstemmed | An efficient approach for feature construction of high-dimensional microarray data by random projections |
title_short | An efficient approach for feature construction of high-dimensional microarray data by random projections |
title_sort | efficient approach for feature construction of high-dimensional microarray data by random projections |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5922581/ https://www.ncbi.nlm.nih.gov/pubmed/29702670 http://dx.doi.org/10.1371/journal.pone.0196385 |
work_keys_str_mv | AT tariqhassan anefficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections AT eldridgeelf anefficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections AT welchian anefficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections AT tariqhassan efficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections AT eldridgeelf efficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections AT welchian efficientapproachforfeatureconstructionofhighdimensionalmicroarraydatabyrandomprojections |