Cargando…

Normalization of oligonucleotide arrays based on the least-variant set of genes

BACKGROUND: It is well known that the normalization step of microarray data makes a difference in the downstream analysis. All normalization methods rely on certain assumptions, so differences in results can be traced to different sensitivities to violation of the assumptions. Illustrating the lack...

Descripción completa

Detalles Bibliográficos
Autores principales: Calza, Stefano, Valentini, Davide, Pawitan, Yudi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2324100/
https://www.ncbi.nlm.nih.gov/pubmed/18318917
http://dx.doi.org/10.1186/1471-2105-9-140
_version_ 1782152718358413312
author Calza, Stefano
Valentini, Davide
Pawitan, Yudi
author_facet Calza, Stefano
Valentini, Davide
Pawitan, Yudi
author_sort Calza, Stefano
collection PubMed
description BACKGROUND: It is well known that the normalization step of microarray data makes a difference in the downstream analysis. All normalization methods rely on certain assumptions, so differences in results can be traced to different sensitivities to violation of the assumptions. Illustrating the lack of robustness, in a striking spike-in experiment all existing normalization methods fail because of an imbalance between up- and down-regulated genes. This means it is still important to develop a normalization method that is robust against violation of the standard assumptions RESULTS: We develop a new algorithm based on identification of the least-variant set (LVS) of genes across the arrays. The array-to-array variation is evaluated in the robust linear model fit of pre-normalized probe-level data. The genes are then used as a reference set for a non-linear normalization. The method is applicable to any existing expression summaries, such as MAS5 or RMA. CONCLUSION: We show that LVS normalization outperforms other normalization methods when the standard assumptions are not satisfied. In the complex spike-in study, LVS performs similarly to the ideal (in practice unknown) housekeeping-gene normalization. An R package called lvs is available in .
format Text
id pubmed-2324100
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23241002008-04-22 Normalization of oligonucleotide arrays based on the least-variant set of genes Calza, Stefano Valentini, Davide Pawitan, Yudi BMC Bioinformatics Research Article BACKGROUND: It is well known that the normalization step of microarray data makes a difference in the downstream analysis. All normalization methods rely on certain assumptions, so differences in results can be traced to different sensitivities to violation of the assumptions. Illustrating the lack of robustness, in a striking spike-in experiment all existing normalization methods fail because of an imbalance between up- and down-regulated genes. This means it is still important to develop a normalization method that is robust against violation of the standard assumptions RESULTS: We develop a new algorithm based on identification of the least-variant set (LVS) of genes across the arrays. The array-to-array variation is evaluated in the robust linear model fit of pre-normalized probe-level data. The genes are then used as a reference set for a non-linear normalization. The method is applicable to any existing expression summaries, such as MAS5 or RMA. CONCLUSION: We show that LVS normalization outperforms other normalization methods when the standard assumptions are not satisfied. In the complex spike-in study, LVS performs similarly to the ideal (in practice unknown) housekeeping-gene normalization. An R package called lvs is available in . BioMed Central 2008-03-05 /pmc/articles/PMC2324100/ /pubmed/18318917 http://dx.doi.org/10.1186/1471-2105-9-140 Text en Copyright © 2008 Calza et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Calza, Stefano
Valentini, Davide
Pawitan, Yudi
Normalization of oligonucleotide arrays based on the least-variant set of genes
title Normalization of oligonucleotide arrays based on the least-variant set of genes
title_full Normalization of oligonucleotide arrays based on the least-variant set of genes
title_fullStr Normalization of oligonucleotide arrays based on the least-variant set of genes
title_full_unstemmed Normalization of oligonucleotide arrays based on the least-variant set of genes
title_short Normalization of oligonucleotide arrays based on the least-variant set of genes
title_sort normalization of oligonucleotide arrays based on the least-variant set of genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2324100/
https://www.ncbi.nlm.nih.gov/pubmed/18318917
http://dx.doi.org/10.1186/1471-2105-9-140
work_keys_str_mv AT calzastefano normalizationofoligonucleotidearraysbasedontheleastvariantsetofgenes
AT valentinidavide normalizationofoligonucleotidearraysbasedontheleastvariantsetofgenes
AT pawitanyudi normalizationofoligonucleotidearraysbasedontheleastvariantsetofgenes