Cargando…

Permutation-validated principal components analysis of microarray data

BACKGROUND: In microarray data analysis, the comparison of gene-expression profiles with respect to different conditions and the selection of biologically interesting genes are crucial tasks. Multivariate statistical methods have been applied to analyze these large datasets. Less work has been publi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Landgrebe, Jobst, Wurst, Wolfgang, Welzl, Gerhard
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2002
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC115254/ https://www.ncbi.nlm.nih.gov/pubmed/11983060

_version_	1782120249226690560
author	Landgrebe, Jobst Wurst, Wolfgang Welzl, Gerhard
author_facet	Landgrebe, Jobst Wurst, Wolfgang Welzl, Gerhard
author_sort	Landgrebe, Jobst
collection	PubMed
description	BACKGROUND: In microarray data analysis, the comparison of gene-expression profiles with respect to different conditions and the selection of biologically interesting genes are crucial tasks. Multivariate statistical methods have been applied to analyze these large datasets. Less work has been published concerning the assessment of the reliability of gene-selection procedures. Here we describe a method to assess reliability in multivariate microarray data analysis using permutation-validated principal components analysis (PCA). The approach is designed for microarray data with a group structure. RESULTS: We used PCA to detect the major sources of variance underlying the hybridization conditions followed by gene selection based on PCA-derived and permutation-based test statistics. We validated our method by applying it to well characterized yeast cell-cycle data and to two datasets from our laboratory. We could describe the major sources of variance, select informative genes and visualize the relationship of genes and arrays. We observed differences in the level of the explained variance and the interpretability of the selected genes. CONCLUSIONS: Combining data visualization and permutation-based gene selection, permutation-validated PCA enables one to illustrate gene-expression variance between several conditions and to select genes by taking into account the relationship of between-group to within-group variance of genes. The method can be used to extract the leading sources of variance from microarray data, to visualize relationships between genes and hybridizations and to select informative genes in a statistically reliable manner. This selection accounts for the level of reproducibility of replicates or group structure as well as gene-specific scatter. Visualization of the data can support a straightforward biological interpretation.
format	Text
id	pubmed-115254
institution	National Center for Biotechnology Information
language	English
publishDate	2002
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-1152542002-06-10 Permutation-validated principal components analysis of microarray data Landgrebe, Jobst Wurst, Wolfgang Welzl, Gerhard Genome Biol Research BACKGROUND: In microarray data analysis, the comparison of gene-expression profiles with respect to different conditions and the selection of biologically interesting genes are crucial tasks. Multivariate statistical methods have been applied to analyze these large datasets. Less work has been published concerning the assessment of the reliability of gene-selection procedures. Here we describe a method to assess reliability in multivariate microarray data analysis using permutation-validated principal components analysis (PCA). The approach is designed for microarray data with a group structure. RESULTS: We used PCA to detect the major sources of variance underlying the hybridization conditions followed by gene selection based on PCA-derived and permutation-based test statistics. We validated our method by applying it to well characterized yeast cell-cycle data and to two datasets from our laboratory. We could describe the major sources of variance, select informative genes and visualize the relationship of genes and arrays. We observed differences in the level of the explained variance and the interpretability of the selected genes. CONCLUSIONS: Combining data visualization and permutation-based gene selection, permutation-validated PCA enables one to illustrate gene-expression variance between several conditions and to select genes by taking into account the relationship of between-group to within-group variance of genes. The method can be used to extract the leading sources of variance from microarray data, to visualize relationships between genes and hybridizations and to select informative genes in a statistically reliable manner. This selection accounts for the level of reproducibility of replicates or group structure as well as gene-specific scatter. Visualization of the data can support a straightforward biological interpretation. BioMed Central 2002 2002-03-22 /pmc/articles/PMC115254/ /pubmed/11983060 Text en Copyright © 2002 BioMed Central Ltd
spellingShingle	Research Landgrebe, Jobst Wurst, Wolfgang Welzl, Gerhard Permutation-validated principal components analysis of microarray data
title	Permutation-validated principal components analysis of microarray data
title_full	Permutation-validated principal components analysis of microarray data
title_fullStr	Permutation-validated principal components analysis of microarray data
title_full_unstemmed	Permutation-validated principal components analysis of microarray data
title_short	Permutation-validated principal components analysis of microarray data
title_sort	permutation-validated principal components analysis of microarray data
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC115254/ https://www.ncbi.nlm.nih.gov/pubmed/11983060
work_keys_str_mv	AT landgrebejobst permutationvalidatedprincipalcomponentsanalysisofmicroarraydata AT wurstwolfgang permutationvalidatedprincipalcomponentsanalysisofmicroarraydata AT welzlgerhard permutationvalidatedprincipalcomponentsanalysisofmicroarraydata

Permutation-validated principal components analysis of microarray data

Ejemplares similares