Cargando…

Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles

BACKGROUND: Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays o...

Descripción completa

Detalles Bibliográficos
Autores principales: Kitchen, Robert R, Sabine, Vicky S, Sims, Andrew H, Macaskill, E Jane, Renshaw, Lorna, Thomas, Jeremy S, van Hemert, Jano I, Dixon, J Michael, Bartlett, John MS
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2843619/
https://www.ncbi.nlm.nih.gov/pubmed/20181233
http://dx.doi.org/10.1186/1471-2164-11-134
_version_ 1782179237551144960
author Kitchen, Robert R
Sabine, Vicky S
Sims, Andrew H
Macaskill, E Jane
Renshaw, Lorna
Thomas, Jeremy S
van Hemert, Jano I
Dixon, J Michael
Bartlett, John MS
author_facet Kitchen, Robert R
Sabine, Vicky S
Sims, Andrew H
Macaskill, E Jane
Renshaw, Lorna
Thomas, Jeremy S
van Hemert, Jano I
Dixon, J Michael
Bartlett, John MS
author_sort Kitchen, Robert R
collection PubMed
description BACKGROUND: Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study. RESULTS: A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples were greatly reduced. Correlation between replicate UHRR samples improved by two orders of magnitude following batch-correction using ComBat (ranging from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%. CONCLUSION: In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data.
format Text
id pubmed-2843619
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28436192010-03-23 Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles Kitchen, Robert R Sabine, Vicky S Sims, Andrew H Macaskill, E Jane Renshaw, Lorna Thomas, Jeremy S van Hemert, Jano I Dixon, J Michael Bartlett, John MS BMC Genomics Research Article BACKGROUND: Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study. RESULTS: A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples were greatly reduced. Correlation between replicate UHRR samples improved by two orders of magnitude following batch-correction using ComBat (ranging from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%. CONCLUSION: In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data. BioMed Central 2010-02-24 /pmc/articles/PMC2843619/ /pubmed/20181233 http://dx.doi.org/10.1186/1471-2164-11-134 Text en Copyright ©2010 Kitchen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kitchen, Robert R
Sabine, Vicky S
Sims, Andrew H
Macaskill, E Jane
Renshaw, Lorna
Thomas, Jeremy S
van Hemert, Jano I
Dixon, J Michael
Bartlett, John MS
Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles
title Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles
title_full Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles
title_fullStr Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles
title_full_unstemmed Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles
title_short Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles
title_sort correcting for intra-experiment variation in illumina beadchip data is necessary to generate robust gene-expression profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2843619/
https://www.ncbi.nlm.nih.gov/pubmed/20181233
http://dx.doi.org/10.1186/1471-2164-11-134
work_keys_str_mv AT kitchenrobertr correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT sabinevickys correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT simsandrewh correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT macaskillejane correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT renshawlorna correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT thomasjeremys correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT vanhemertjanoi correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT dixonjmichael correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles
AT bartlettjohnms correctingforintraexperimentvariationinilluminabeadchipdataisnecessarytogeneraterobustgeneexpressionprofiles