Cargando…

Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios

BACKGROUND: Illumina Infinium whole genome genotyping (WGG) arrays are increasingly being applied in cancer genomics to study gene copy number alterations and allele-specific aberrations such as loss-of-heterozygosity (LOH). Methods developed for normalization of WGG arrays have mostly focused on di...

Descripción completa

Detalles Bibliográficos
Autores principales: Staaf, Johan, Vallon-Christersson, Johan, Lindgren, David, Juliusson, Gunnar, Rosenquist, Richard, Höglund, Mattias, Borg, Åke, Ringnér, Markus
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2572624/
https://www.ncbi.nlm.nih.gov/pubmed/18831757
http://dx.doi.org/10.1186/1471-2105-9-409
_version_ 1782160262638338048
author Staaf, Johan
Vallon-Christersson, Johan
Lindgren, David
Juliusson, Gunnar
Rosenquist, Richard
Höglund, Mattias
Borg, Åke
Ringnér, Markus
author_facet Staaf, Johan
Vallon-Christersson, Johan
Lindgren, David
Juliusson, Gunnar
Rosenquist, Richard
Höglund, Mattias
Borg, Åke
Ringnér, Markus
author_sort Staaf, Johan
collection PubMed
description BACKGROUND: Illumina Infinium whole genome genotyping (WGG) arrays are increasingly being applied in cancer genomics to study gene copy number alterations and allele-specific aberrations such as loss-of-heterozygosity (LOH). Methods developed for normalization of WGG arrays have mostly focused on diploid, normal samples. However, for cancer samples genomic aberrations may confound normalization and data interpretation. Therefore, we examined the effects of the conventionally used normalization method for Illumina Infinium arrays when applied to cancer samples. RESULTS: We demonstrate an asymmetry in the detection of the two alleles for each SNP, which deleteriously influences both allelic proportions and copy number estimates. The asymmetry is caused by a remaining bias between the two dyes used in the Infinium II assay after using the normalization method in Illumina's proprietary software (BeadStudio). We propose a quantile normalization strategy for correction of this dye bias. We tested the normalization strategy using 535 individual hybridizations from 10 data sets from the analysis of cancer genomes and normal blood samples generated on Illumina Infinium II 300 k version 1 and 2, 370 k and 550 k BeadChips. We show that the proposed normalization strategy successfully removes asymmetry in estimates of both allelic proportions and copy numbers. Additionally, the normalization strategy reduces the technical variation for copy number estimates while retaining the response to copy number alterations. CONCLUSION: The proposed normalization strategy represents a valuable tool that improves the quality of data obtained from Illumina Infinium arrays, in particular when used for LOH and copy number variation studies.
format Text
id pubmed-2572624
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25726242008-10-25 Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios Staaf, Johan Vallon-Christersson, Johan Lindgren, David Juliusson, Gunnar Rosenquist, Richard Höglund, Mattias Borg, Åke Ringnér, Markus BMC Bioinformatics Methodology Article BACKGROUND: Illumina Infinium whole genome genotyping (WGG) arrays are increasingly being applied in cancer genomics to study gene copy number alterations and allele-specific aberrations such as loss-of-heterozygosity (LOH). Methods developed for normalization of WGG arrays have mostly focused on diploid, normal samples. However, for cancer samples genomic aberrations may confound normalization and data interpretation. Therefore, we examined the effects of the conventionally used normalization method for Illumina Infinium arrays when applied to cancer samples. RESULTS: We demonstrate an asymmetry in the detection of the two alleles for each SNP, which deleteriously influences both allelic proportions and copy number estimates. The asymmetry is caused by a remaining bias between the two dyes used in the Infinium II assay after using the normalization method in Illumina's proprietary software (BeadStudio). We propose a quantile normalization strategy for correction of this dye bias. We tested the normalization strategy using 535 individual hybridizations from 10 data sets from the analysis of cancer genomes and normal blood samples generated on Illumina Infinium II 300 k version 1 and 2, 370 k and 550 k BeadChips. We show that the proposed normalization strategy successfully removes asymmetry in estimates of both allelic proportions and copy numbers. Additionally, the normalization strategy reduces the technical variation for copy number estimates while retaining the response to copy number alterations. CONCLUSION: The proposed normalization strategy represents a valuable tool that improves the quality of data obtained from Illumina Infinium arrays, in particular when used for LOH and copy number variation studies. BioMed Central 2008-10-02 /pmc/articles/PMC2572624/ /pubmed/18831757 http://dx.doi.org/10.1186/1471-2105-9-409 Text en Copyright © 2008 Staaf et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Staaf, Johan
Vallon-Christersson, Johan
Lindgren, David
Juliusson, Gunnar
Rosenquist, Richard
Höglund, Mattias
Borg, Åke
Ringnér, Markus
Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios
title Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios
title_full Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios
title_fullStr Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios
title_full_unstemmed Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios
title_short Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios
title_sort normalization of illumina infinium whole-genome snp data improves copy number estimates and allelic intensity ratios
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2572624/
https://www.ncbi.nlm.nih.gov/pubmed/18831757
http://dx.doi.org/10.1186/1471-2105-9-409
work_keys_str_mv AT staafjohan normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios
AT vallonchristerssonjohan normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios
AT lindgrendavid normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios
AT juliussongunnar normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios
AT rosenquistrichard normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios
AT hoglundmattias normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios
AT borgake normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios
AT ringnermarkus normalizationofilluminainfiniumwholegenomesnpdataimprovescopynumberestimatesandallelicintensityratios