Cargando…

Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data

BACKGROUND: Normalization is essential in dual-labelled microarray data analysis to remove non-biological variations and systematic biases. Many normalization methods have been used to remove such biases within slides (Global, Lowess) and across slides (Scale, Quantile and VSN). However, all these p...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Huiling, Zhang, Dapeng, Martyniuk, Christopher J, Trudeau, Vance L, Xia, Xuhua
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2275243/
https://www.ncbi.nlm.nih.gov/pubmed/18199333
http://dx.doi.org/10.1186/1471-2105-9-25
_version_ 1782151840225296384
author Xiong, Huiling
Zhang, Dapeng
Martyniuk, Christopher J
Trudeau, Vance L
Xia, Xuhua
author_facet Xiong, Huiling
Zhang, Dapeng
Martyniuk, Christopher J
Trudeau, Vance L
Xia, Xuhua
author_sort Xiong, Huiling
collection PubMed
description BACKGROUND: Normalization is essential in dual-labelled microarray data analysis to remove non-biological variations and systematic biases. Many normalization methods have been used to remove such biases within slides (Global, Lowess) and across slides (Scale, Quantile and VSN). However, all these popular approaches have critical assumptions about data distribution, which is often not valid in practice. RESULTS: In this study, we propose a novel assumption-free normalization method based on the Generalized Procrustes Analysis (GPA) algorithm. Using experimental and simulated normal microarray data and boutique array data, we systemically evaluate the ability of the GPA method in normalization compared with six other popular normalization methods including Global, Lowess, Scale, Quantile, VSN, and one boutique array-specific housekeeping gene method. The assessment of these methods is based on three different empirical criteria: across-slide variability, the Kolmogorov-Smirnov (K-S) statistic and the mean square error (MSE). Compared with other methods, the GPA method performs effectively and consistently better in reducing across-slide variability and removing systematic bias. CONCLUSION: The GPA method is an effective normalization approach for microarray data analysis. In particular, it is free from the statistical and biological assumptions inherent in other normalization methods that are often difficult to validate. Therefore, the GPA method has a major advantage in that it can be applied to diverse types of array sets, especially to the boutique array where the majority of genes may be differentially expressed.
format Text
id pubmed-2275243
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22752432008-03-26 Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data Xiong, Huiling Zhang, Dapeng Martyniuk, Christopher J Trudeau, Vance L Xia, Xuhua BMC Bioinformatics Methodology Article BACKGROUND: Normalization is essential in dual-labelled microarray data analysis to remove non-biological variations and systematic biases. Many normalization methods have been used to remove such biases within slides (Global, Lowess) and across slides (Scale, Quantile and VSN). However, all these popular approaches have critical assumptions about data distribution, which is often not valid in practice. RESULTS: In this study, we propose a novel assumption-free normalization method based on the Generalized Procrustes Analysis (GPA) algorithm. Using experimental and simulated normal microarray data and boutique array data, we systemically evaluate the ability of the GPA method in normalization compared with six other popular normalization methods including Global, Lowess, Scale, Quantile, VSN, and one boutique array-specific housekeeping gene method. The assessment of these methods is based on three different empirical criteria: across-slide variability, the Kolmogorov-Smirnov (K-S) statistic and the mean square error (MSE). Compared with other methods, the GPA method performs effectively and consistently better in reducing across-slide variability and removing systematic bias. CONCLUSION: The GPA method is an effective normalization approach for microarray data analysis. In particular, it is free from the statistical and biological assumptions inherent in other normalization methods that are often difficult to validate. Therefore, the GPA method has a major advantage in that it can be applied to diverse types of array sets, especially to the boutique array where the majority of genes may be differentially expressed. BioMed Central 2008-01-16 /pmc/articles/PMC2275243/ /pubmed/18199333 http://dx.doi.org/10.1186/1471-2105-9-25 Text en Copyright © 2008 Xiong et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Xiong, Huiling
Zhang, Dapeng
Martyniuk, Christopher J
Trudeau, Vance L
Xia, Xuhua
Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data
title Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data
title_full Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data
title_fullStr Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data
title_full_unstemmed Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data
title_short Using Generalized Procrustes Analysis (GPA) for normalization of cDNA microarray data
title_sort using generalized procrustes analysis (gpa) for normalization of cdna microarray data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2275243/
https://www.ncbi.nlm.nih.gov/pubmed/18199333
http://dx.doi.org/10.1186/1471-2105-9-25
work_keys_str_mv AT xionghuiling usinggeneralizedprocrustesanalysisgpafornormalizationofcdnamicroarraydata
AT zhangdapeng usinggeneralizedprocrustesanalysisgpafornormalizationofcdnamicroarraydata
AT martyniukchristopherj usinggeneralizedprocrustesanalysisgpafornormalizationofcdnamicroarraydata
AT trudeauvancel usinggeneralizedprocrustesanalysisgpafornormalizationofcdnamicroarraydata
AT xiaxuhua usinggeneralizedprocrustesanalysisgpafornormalizationofcdnamicroarraydata