Cargando…

Correlation test to assess low-level processing of high-density oligonucleotide microarray data

BACKGROUND: There are currently a number of competing techniques for low-level processing of oligonucleotide array data. The choice of technique has a profound effect on subsequent statistical analyses, but there is no method to assess whether a particular technique is appropriate for a specific dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Ploner, Alexander, Miller, Lance D, Hall, Per, Bergh, Jonas, Pawitan, Yudi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1084343/
https://www.ncbi.nlm.nih.gov/pubmed/15799785
http://dx.doi.org/10.1186/1471-2105-6-80
_version_ 1782123797023817728
author Ploner, Alexander
Miller, Lance D
Hall, Per
Bergh, Jonas
Pawitan, Yudi
author_facet Ploner, Alexander
Miller, Lance D
Hall, Per
Bergh, Jonas
Pawitan, Yudi
author_sort Ploner, Alexander
collection PubMed
description BACKGROUND: There are currently a number of competing techniques for low-level processing of oligonucleotide array data. The choice of technique has a profound effect on subsequent statistical analyses, but there is no method to assess whether a particular technique is appropriate for a specific data set, without reference to external data. RESULTS: We analyzed coregulation between genes in order to detect insufficient normalization between arrays, where coregulation is measured in terms of statistical correlation. In a large collection of genes, a random pair of genes should have on average zero correlation, hence allowing a correlation test. For all data sets that we evaluated, and the three most commonly used low-level processing procedures including MAS5, RMA and MBEI, the housekeeping-gene normalization failed the test. For a real clinical data set, RMA and MBEI showed significant correlation for absent genes. We also found that a second round of normalization on the probe set level improved normalization significantly throughout. CONCLUSION: Previous evaluation of low-level processing in the literature has been limited to artificial spike-in and mixture data sets. In the absence of a known gold-standard, the correlation criterion allows us to assess the appropriateness of low-level processing of a specific data set and the success of normalization for subsets of genes.
format Text
id pubmed-1084343
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-10843432005-04-23 Correlation test to assess low-level processing of high-density oligonucleotide microarray data Ploner, Alexander Miller, Lance D Hall, Per Bergh, Jonas Pawitan, Yudi BMC Bioinformatics Methodology Article BACKGROUND: There are currently a number of competing techniques for low-level processing of oligonucleotide array data. The choice of technique has a profound effect on subsequent statistical analyses, but there is no method to assess whether a particular technique is appropriate for a specific data set, without reference to external data. RESULTS: We analyzed coregulation between genes in order to detect insufficient normalization between arrays, where coregulation is measured in terms of statistical correlation. In a large collection of genes, a random pair of genes should have on average zero correlation, hence allowing a correlation test. For all data sets that we evaluated, and the three most commonly used low-level processing procedures including MAS5, RMA and MBEI, the housekeeping-gene normalization failed the test. For a real clinical data set, RMA and MBEI showed significant correlation for absent genes. We also found that a second round of normalization on the probe set level improved normalization significantly throughout. CONCLUSION: Previous evaluation of low-level processing in the literature has been limited to artificial spike-in and mixture data sets. In the absence of a known gold-standard, the correlation criterion allows us to assess the appropriateness of low-level processing of a specific data set and the success of normalization for subsets of genes. BioMed Central 2005-03-31 /pmc/articles/PMC1084343/ /pubmed/15799785 http://dx.doi.org/10.1186/1471-2105-6-80 Text en Copyright © 2005 Ploner et al; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Ploner, Alexander
Miller, Lance D
Hall, Per
Bergh, Jonas
Pawitan, Yudi
Correlation test to assess low-level processing of high-density oligonucleotide microarray data
title Correlation test to assess low-level processing of high-density oligonucleotide microarray data
title_full Correlation test to assess low-level processing of high-density oligonucleotide microarray data
title_fullStr Correlation test to assess low-level processing of high-density oligonucleotide microarray data
title_full_unstemmed Correlation test to assess low-level processing of high-density oligonucleotide microarray data
title_short Correlation test to assess low-level processing of high-density oligonucleotide microarray data
title_sort correlation test to assess low-level processing of high-density oligonucleotide microarray data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1084343/
https://www.ncbi.nlm.nih.gov/pubmed/15799785
http://dx.doi.org/10.1186/1471-2105-6-80
work_keys_str_mv AT ploneralexander correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata
AT millerlanced correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata
AT hallper correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata
AT berghjonas correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata
AT pawitanyudi correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata