Cargando…

Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays

BACKGROUND: Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF), for every gene on an array....

Descripción completa

Detalles Bibliográficos
Autor principal: Lu, Chao
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC509236/
https://www.ncbi.nlm.nih.gov/pubmed/15283861
http://dx.doi.org/10.1186/1471-2105-5-103
_version_ 1782121690491256832
author Lu, Chao
author_facet Lu, Chao
author_sort Lu, Chao
collection PubMed
description BACKGROUND: Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF), for every gene on an array. The SF is obtained from a trimmed average signal of the array after excluding the 2% of the probe sets with the highest and the lowest values. RESULTS: Among the 76 U34A GeneChip experiments, the total signals on each array showed 25.8% variations in terms of the coefficient of variation, although all microarrays were hybridized with the same amount of biotin-labeled cRNA. The 2% of the probe sets with the highest signals that were normally excluded from SF calculation accounted for 34% to 54% of the total signals (40.7% ± 4.4%, mean ± sd). In comparison with normalization factors obtained from the median signal or from the mean of the log transformed signal, SF showed the greatest variation. The normalization factors obtained from log transformed signals showed least variation. CONCLUSIONS: Eliminating 40% of the signal data during SF calculation failed to show any benefit. Normalization factors obtained with log transformed signals performed the best. Thus, it is suggested to use the mean of the logarithm transformed data for normalization, rather than the arithmetic mean of signals in GeneChip gene expression microarrays.
format Text
id pubmed-509236
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5092362004-08-12 Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays Lu, Chao BMC Bioinformatics Methodology Article BACKGROUND: Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF), for every gene on an array. The SF is obtained from a trimmed average signal of the array after excluding the 2% of the probe sets with the highest and the lowest values. RESULTS: Among the 76 U34A GeneChip experiments, the total signals on each array showed 25.8% variations in terms of the coefficient of variation, although all microarrays were hybridized with the same amount of biotin-labeled cRNA. The 2% of the probe sets with the highest signals that were normally excluded from SF calculation accounted for 34% to 54% of the total signals (40.7% ± 4.4%, mean ± sd). In comparison with normalization factors obtained from the median signal or from the mean of the log transformed signal, SF showed the greatest variation. The normalization factors obtained from log transformed signals showed least variation. CONCLUSIONS: Eliminating 40% of the signal data during SF calculation failed to show any benefit. Normalization factors obtained with log transformed signals performed the best. Thus, it is suggested to use the mean of the logarithm transformed data for normalization, rather than the arithmetic mean of signals in GeneChip gene expression microarrays. BioMed Central 2004-07-29 /pmc/articles/PMC509236/ /pubmed/15283861 http://dx.doi.org/10.1186/1471-2105-5-103 Text en Copyright © 2004 Lu; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Lu, Chao
Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays
title Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays
title_full Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays
title_fullStr Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays
title_full_unstemmed Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays
title_short Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays
title_sort improving the scaling normalization for high-density oligonucleotide genechip expression microarrays
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC509236/
https://www.ncbi.nlm.nih.gov/pubmed/15283861
http://dx.doi.org/10.1186/1471-2105-5-103
work_keys_str_mv AT luchao improvingthescalingnormalizationforhighdensityoligonucleotidegenechipexpressionmicroarrays