Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples: application to RNA expression profiling of blood from Duchenne muscular dystrophy patients

Bibliographic Details
Main authors: Walker, Wynn L; Liao, Isaac H; Gilbert, Donald L; Wong, Brenda; Pollard, Katherine S; McCulloch, Charles E; Lit, Lisa; Sharp, Frank R
Format: Text
Language: English
Published: BioMed Central 2008
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2576259/
https://www.ncbi.nlm.nih.gov/pubmed/18937867
http://dx.doi.org/10.1186/1471-2164-9-494
author Walker, Wynn L
Liao, Isaac H
Gilbert, Donald L
Wong, Brenda
Pollard, Katherine S
McCulloch, Charles E
Lit, Lisa
Sharp, Frank R
collection PubMed
description BACKGROUND: Non-biological experimental error routinely occurs in microarray data collected in different batches. It is often impossible to compare groups of samples from independent experiments because batch effects confound true gene expression differences. Existing methods can correct for batch effects only when samples from all biological groups are represented in every batch. RESULTS: In this report, we describe a generalized empirical Bayes approach to correct for cross-experimental batch effects, allowing direct comparisons of gene expression between biological groups from independent experiments. The proposed experimental design uses identical reference samples in each batch in every experiment. These reference samples are from the same tissue as the experimental samples. This design with tissue-matched reference samples allows a gene-by-gene correction to be performed using fewer arrays than currently available methods. We examine the effects of non-biological variation within a single experiment and between experiments. CONCLUSION: Batch correction has a significant impact on which genes are identified as differentially regulated. Using this method, gene expression in the blood of patients with Duchenne Muscular Dystrophy is shown to differ for hundreds of genes when compared to controls. The numbers of specific genes differ depending upon whether between-experiment and/or between-batch corrections are performed.
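The design described in the abstract (identical reference samples run in every batch, followed by a gene-by-gene correction) can be illustrated with a rough sketch. The code below is not the authors' model: it assumes log-scale expression values, at least two reference samples per batch, and a simple normal-normal shrinkage of each gene's batch offset as a stand-in for the empirical Bayes step. The function name `reference_batch_correct` and all parameter names are hypothetical.

```python
import numpy as np

def reference_batch_correct(expr, batch, is_ref, shrink=True):
    """Illustrative gene-by-gene batch correction using reference samples.

    expr   : genes x samples array of log-expression values (assumed layout)
    batch  : length-samples array of batch labels
    is_ref : length-samples boolean mask marking the identical reference samples
    shrink : if True, shrink each offset toward zero (simplified stand-in for
             an empirical Bayes step; NOT the published model)
    """
    expr = np.asarray(expr, dtype=float)
    batch = np.asarray(batch)
    is_ref = np.asarray(is_ref, dtype=bool)
    batches = np.unique(batch)

    # Per-gene mean of the reference samples within each batch (genes x batches).
    ref_means = np.column_stack(
        [expr[:, is_ref & (batch == b)].mean(axis=1) for b in batches]
    )
    # Raw offset: how far each batch's reference mean sits from the
    # across-batch average of the reference means, gene by gene.
    offsets = ref_means - ref_means.mean(axis=1, keepdims=True)

    if shrink:
        for j, b in enumerate(batches):
            ref_cols = is_ref & (batch == b)
            # Approximate sampling variance of each gene's offset, taken from
            # the replicate variance of the reference samples in this batch.
            s2 = expr[:, ref_cols].var(axis=1, ddof=1) / ref_cols.sum()
            # Method-of-moments estimate of the between-gene offset variance.
            tau2 = max(offsets[:, j].var(ddof=1) - s2.mean(), 0.0)
            denom = tau2 + s2
            safe = np.where(denom > 0, denom, 1.0)
            weight = np.where(denom > 0, tau2 / safe, 0.0)
            offsets[:, j] *= weight  # weight lies in [0, 1]

    # Subtract each batch's per-gene offset from every sample in that batch.
    corrected = expr.copy()
    for j, b in enumerate(batches):
        corrected[:, batch == b] -= offsets[:, [j]]
    return corrected
```

With `shrink=False` the reference means of every batch are forced to agree exactly; with `shrink=True` noisy offsets are pulled toward zero, which is the general motivation for borrowing information across genes.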
format Text
id pubmed-2576259
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2576259 2008-10-31 BMC Genomics Methodology Article BioMed Central 2008-10-20 Text en Copyright © 2008 Walker et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples: application to RNA expression profiling of blood from Duchenne muscular dystrophy patients
topic Methodology Article