Cargando…

Application of wavelet-based neural network on DNA microarray data

The advantage of using DNA microarray data when investigating human cancer gene expressions is its ability to generate enormous amount of information from a single assay in order to speed up the scientific evaluation process. The number of variables from the gene expression data coupled with compara...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Jack, Zee, Benny
Formato:	Texto
Lenguaje:	English
Publicado:	Biomedical Informatics Publishing Group 2008
Materias:	Hypothesis
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646193/ https://www.ncbi.nlm.nih.gov/pubmed/19255638

_version_	1782164828282945536
author	Lee, Jack Zee, Benny
author_facet	Lee, Jack Zee, Benny
author_sort	Lee, Jack
collection	PubMed
description	The advantage of using DNA microarray data when investigating human cancer gene expressions is its ability to generate enormous amount of information from a single assay in order to speed up the scientific evaluation process. The number of variables from the gene expression data coupled with comparably much less number of samples creates new challenges to scientists and statisticians. In particular, the problems include enormous degree of collinearity among genes expressions, likely violation of model assumptions as well as high level of noise with potential outliers. To deal with these problems, we propose a block wavelet shrinkage principal component (BWSPCA) analysis method to optimize the information during the noise reduction process. This paper firstly uses the National Cancer Institute database (NC160) as an illustration and shows a significant improvement in dimension reduction. Secondly we combine BWSPCA with an artificial neural network-based gene minimization strategy to establish a Block Wavelet-based Neural Network model in a robust and accurate cancer classification process (BWNN). Our extensive experiments on six public cancer datasets have shown that the method of BWNN for tumor classification performed well, especially on some difficult instances with large-class (more than two) expression data. This proposed method is extremely useful for data denoising and is competitiveness with respect to other methods such as BagBoost, RandomForest (RanFor), Support Vector Machines (SVM), K-Nearest Neighbor (KNN) and Artificial Neural Network (ANN).
format	Text
id	pubmed-2646193
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	Biomedical Informatics Publishing Group
record_format	MEDLINE/PubMed
spelling	pubmed-26461932009-03-02 Application of wavelet-based neural network on DNA microarray data Lee, Jack Zee, Benny Bioinformation Hypothesis The advantage of using DNA microarray data when investigating human cancer gene expressions is its ability to generate enormous amount of information from a single assay in order to speed up the scientific evaluation process. The number of variables from the gene expression data coupled with comparably much less number of samples creates new challenges to scientists and statisticians. In particular, the problems include enormous degree of collinearity among genes expressions, likely violation of model assumptions as well as high level of noise with potential outliers. To deal with these problems, we propose a block wavelet shrinkage principal component (BWSPCA) analysis method to optimize the information during the noise reduction process. This paper firstly uses the National Cancer Institute database (NC160) as an illustration and shows a significant improvement in dimension reduction. Secondly we combine BWSPCA with an artificial neural network-based gene minimization strategy to establish a Block Wavelet-based Neural Network model in a robust and accurate cancer classification process (BWNN). Our extensive experiments on six public cancer datasets have shown that the method of BWNN for tumor classification performed well, especially on some difficult instances with large-class (more than two) expression data. This proposed method is extremely useful for data denoising and is competitiveness with respect to other methods such as BagBoost, RandomForest (RanFor), Support Vector Machines (SVM), K-Nearest Neighbor (KNN) and Artificial Neural Network (ANN). Biomedical Informatics Publishing Group 2008-12-31 /pmc/articles/PMC2646193/ /pubmed/19255638 Text en © 2008 Biomedical Informatics Publishing Group This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle	Hypothesis Lee, Jack Zee, Benny Application of wavelet-based neural network on DNA microarray data
title	Application of wavelet-based neural network on DNA microarray data
title_full	Application of wavelet-based neural network on DNA microarray data
title_fullStr	Application of wavelet-based neural network on DNA microarray data
title_full_unstemmed	Application of wavelet-based neural network on DNA microarray data
title_short	Application of wavelet-based neural network on DNA microarray data
title_sort	application of wavelet-based neural network on dna microarray data
topic	Hypothesis
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646193/ https://www.ncbi.nlm.nih.gov/pubmed/19255638
work_keys_str_mv	AT leejack applicationofwaveletbasedneuralnetworkondnamicroarraydata AT zeebenny applicationofwaveletbasedneuralnetworkondnamicroarraydata

Application of wavelet-based neural network on DNA microarray data

Ejemplares similares