Cargando…
Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays
BACKGROUND: Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF), for every gene on an array....
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2004
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC509236/ https://www.ncbi.nlm.nih.gov/pubmed/15283861 http://dx.doi.org/10.1186/1471-2105-5-103 |
_version_ | 1782121690491256832 |
---|---|
author | Lu, Chao |
author_facet | Lu, Chao |
author_sort | Lu, Chao |
collection | PubMed |
description | BACKGROUND: Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF), for every gene on an array. The SF is obtained from a trimmed average signal of the array after excluding the 2% of the probe sets with the highest and the lowest values. RESULTS: Among the 76 U34A GeneChip experiments, the total signals on each array showed 25.8% variations in terms of the coefficient of variation, although all microarrays were hybridized with the same amount of biotin-labeled cRNA. The 2% of the probe sets with the highest signals that were normally excluded from SF calculation accounted for 34% to 54% of the total signals (40.7% ± 4.4%, mean ± sd). In comparison with normalization factors obtained from the median signal or from the mean of the log transformed signal, SF showed the greatest variation. The normalization factors obtained from log transformed signals showed least variation. CONCLUSIONS: Eliminating 40% of the signal data during SF calculation failed to show any benefit. Normalization factors obtained with log transformed signals performed the best. Thus, it is suggested to use the mean of the logarithm transformed data for normalization, rather than the arithmetic mean of signals in GeneChip gene expression microarrays. |
format | Text |
id | pubmed-509236 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2004 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5092362004-08-12 Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays Lu, Chao BMC Bioinformatics Methodology Article BACKGROUND: Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF), for every gene on an array. The SF is obtained from a trimmed average signal of the array after excluding the 2% of the probe sets with the highest and the lowest values. RESULTS: Among the 76 U34A GeneChip experiments, the total signals on each array showed 25.8% variations in terms of the coefficient of variation, although all microarrays were hybridized with the same amount of biotin-labeled cRNA. The 2% of the probe sets with the highest signals that were normally excluded from SF calculation accounted for 34% to 54% of the total signals (40.7% ± 4.4%, mean ± sd). In comparison with normalization factors obtained from the median signal or from the mean of the log transformed signal, SF showed the greatest variation. The normalization factors obtained from log transformed signals showed least variation. CONCLUSIONS: Eliminating 40% of the signal data during SF calculation failed to show any benefit. Normalization factors obtained with log transformed signals performed the best. Thus, it is suggested to use the mean of the logarithm transformed data for normalization, rather than the arithmetic mean of signals in GeneChip gene expression microarrays. BioMed Central 2004-07-29 /pmc/articles/PMC509236/ /pubmed/15283861 http://dx.doi.org/10.1186/1471-2105-5-103 Text en Copyright © 2004 Lu; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Lu, Chao Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays |
title | Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays |
title_full | Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays |
title_fullStr | Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays |
title_full_unstemmed | Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays |
title_short | Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays |
title_sort | improving the scaling normalization for high-density oligonucleotide genechip expression microarrays |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC509236/ https://www.ncbi.nlm.nih.gov/pubmed/15283861 http://dx.doi.org/10.1186/1471-2105-5-103 |
work_keys_str_mv | AT luchao improvingthescalingnormalizationforhighdensityoligonucleotidegenechipexpressionmicroarrays |