Cargando…
Correlation test to assess low-level processing of high-density oligonucleotide microarray data
BACKGROUND: There are currently a number of competing techniques for low-level processing of oligonucleotide array data. The choice of technique has a profound effect on subsequent statistical analyses, but there is no method to assess whether a particular technique is appropriate for a specific dat...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1084343/ https://www.ncbi.nlm.nih.gov/pubmed/15799785 http://dx.doi.org/10.1186/1471-2105-6-80 |
_version_ | 1782123797023817728 |
---|---|
author | Ploner, Alexander Miller, Lance D Hall, Per Bergh, Jonas Pawitan, Yudi |
author_facet | Ploner, Alexander Miller, Lance D Hall, Per Bergh, Jonas Pawitan, Yudi |
author_sort | Ploner, Alexander |
collection | PubMed |
description | BACKGROUND: There are currently a number of competing techniques for low-level processing of oligonucleotide array data. The choice of technique has a profound effect on subsequent statistical analyses, but there is no method to assess whether a particular technique is appropriate for a specific data set, without reference to external data. RESULTS: We analyzed coregulation between genes in order to detect insufficient normalization between arrays, where coregulation is measured in terms of statistical correlation. In a large collection of genes, a random pair of genes should have on average zero correlation, hence allowing a correlation test. For all data sets that we evaluated, and the three most commonly used low-level processing procedures including MAS5, RMA and MBEI, the housekeeping-gene normalization failed the test. For a real clinical data set, RMA and MBEI showed significant correlation for absent genes. We also found that a second round of normalization on the probe set level improved normalization significantly throughout. CONCLUSION: Previous evaluation of low-level processing in the literature has been limited to artificial spike-in and mixture data sets. In the absence of a known gold-standard, the correlation criterion allows us to assess the appropriateness of low-level processing of a specific data set and the success of normalization for subsets of genes. |
format | Text |
id | pubmed-1084343 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-10843432005-04-23 Correlation test to assess low-level processing of high-density oligonucleotide microarray data Ploner, Alexander Miller, Lance D Hall, Per Bergh, Jonas Pawitan, Yudi BMC Bioinformatics Methodology Article BACKGROUND: There are currently a number of competing techniques for low-level processing of oligonucleotide array data. The choice of technique has a profound effect on subsequent statistical analyses, but there is no method to assess whether a particular technique is appropriate for a specific data set, without reference to external data. RESULTS: We analyzed coregulation between genes in order to detect insufficient normalization between arrays, where coregulation is measured in terms of statistical correlation. In a large collection of genes, a random pair of genes should have on average zero correlation, hence allowing a correlation test. For all data sets that we evaluated, and the three most commonly used low-level processing procedures including MAS5, RMA and MBEI, the housekeeping-gene normalization failed the test. For a real clinical data set, RMA and MBEI showed significant correlation for absent genes. We also found that a second round of normalization on the probe set level improved normalization significantly throughout. CONCLUSION: Previous evaluation of low-level processing in the literature has been limited to artificial spike-in and mixture data sets. In the absence of a known gold-standard, the correlation criterion allows us to assess the appropriateness of low-level processing of a specific data set and the success of normalization for subsets of genes. BioMed Central 2005-03-31 /pmc/articles/PMC1084343/ /pubmed/15799785 http://dx.doi.org/10.1186/1471-2105-6-80 Text en Copyright © 2005 Ploner et al; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Ploner, Alexander Miller, Lance D Hall, Per Bergh, Jonas Pawitan, Yudi Correlation test to assess low-level processing of high-density oligonucleotide microarray data |
title | Correlation test to assess low-level processing of high-density oligonucleotide microarray data |
title_full | Correlation test to assess low-level processing of high-density oligonucleotide microarray data |
title_fullStr | Correlation test to assess low-level processing of high-density oligonucleotide microarray data |
title_full_unstemmed | Correlation test to assess low-level processing of high-density oligonucleotide microarray data |
title_short | Correlation test to assess low-level processing of high-density oligonucleotide microarray data |
title_sort | correlation test to assess low-level processing of high-density oligonucleotide microarray data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1084343/ https://www.ncbi.nlm.nih.gov/pubmed/15799785 http://dx.doi.org/10.1186/1471-2105-6-80 |
work_keys_str_mv | AT ploneralexander correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata AT millerlanced correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata AT hallper correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata AT berghjonas correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata AT pawitanyudi correlationtesttoassesslowlevelprocessingofhighdensityoligonucleotidemicroarraydata |