Cargando…

Batch effect correction for genome-wide methylation data with Illumina Infinium platform

BACKGROUND: Genome-wide methylation profiling has led to more comprehensive insights into gene regulation mechanisms and potential therapeutic targets. Illumina Human Methylation BeadChip is one of the most commonly used genome-wide methylation platforms. Similar to other microarray experiments, met...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Zhifu, Chai, High Seng, Wu, Yanhong, White, Wendy M, Donkena, Krishna V, Klein, Christopher J, Garovic, Vesna D, Therneau, Terry M, Kocher, Jean-Pierre A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3265417/
https://www.ncbi.nlm.nih.gov/pubmed/22171553
http://dx.doi.org/10.1186/1755-8794-4-84
_version_ 1782222084004380672
author Sun, Zhifu
Chai, High Seng
Wu, Yanhong
White, Wendy M
Donkena, Krishna V
Klein, Christopher J
Garovic, Vesna D
Therneau, Terry M
Kocher, Jean-Pierre A
author_facet Sun, Zhifu
Chai, High Seng
Wu, Yanhong
White, Wendy M
Donkena, Krishna V
Klein, Christopher J
Garovic, Vesna D
Therneau, Terry M
Kocher, Jean-Pierre A
author_sort Sun, Zhifu
collection PubMed
description BACKGROUND: Genome-wide methylation profiling has led to more comprehensive insights into gene regulation mechanisms and potential therapeutic targets. Illumina Human Methylation BeadChip is one of the most commonly used genome-wide methylation platforms. Similar to other microarray experiments, methylation data is susceptible to various technical artifacts, particularly batch effects. To date, little attention has been given to issues related to normalization and batch effect correction for this kind of data. METHODS: We evaluated three common normalization approaches and investigated their performance in batch effect removal using three datasets with different degrees of batch effects generated from HumanMethylation27 platform: quantile normalization at average β value (QNβ); two step quantile normalization at probe signals implemented in "lumi" package of R (lumi); and quantile normalization of A and B signal separately (ABnorm). Subsequent Empirical Bayes (EB) batch adjustment was also evaluated. RESULTS: Each normalization could remove a portion of batch effects and their effectiveness differed depending on the severity of batch effects in a dataset. For the dataset with minor batch effects (Dataset 1), normalization alone appeared adequate and "lumi" showed the best performance. However, all methods left substantial batch effects intact in the datasets with obvious batch effects and further correction was necessary. Without any correction, 50 and 66 percent of CpGs were associated with batch effects in Dataset 2 and 3, respectively. After QNβ, lumi or ABnorm, the number of CpGs associated with batch effects were reduced to 24, 32, and 26 percent for Dataset 2; and 37, 46, and 35 percent for Dataset 3, respectively. Additional EB correction effectively removed such remaining non-biological effects. More importantly, the two-step procedure almost tripled the numbers of CpGs associated with the outcome of interest for the two datasets. CONCLUSION: Genome-wide methylation data from Infinium Methylation BeadChip can be susceptible to batch effects with profound impacts on downstream analyses and conclusions. Normalization can reduce part but not all batch effects. EB correction along with normalization is recommended for effective batch effect removal.
format Online
Article
Text
id pubmed-3265417
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32654172012-01-25 Batch effect correction for genome-wide methylation data with Illumina Infinium platform Sun, Zhifu Chai, High Seng Wu, Yanhong White, Wendy M Donkena, Krishna V Klein, Christopher J Garovic, Vesna D Therneau, Terry M Kocher, Jean-Pierre A BMC Med Genomics Research Article BACKGROUND: Genome-wide methylation profiling has led to more comprehensive insights into gene regulation mechanisms and potential therapeutic targets. Illumina Human Methylation BeadChip is one of the most commonly used genome-wide methylation platforms. Similar to other microarray experiments, methylation data is susceptible to various technical artifacts, particularly batch effects. To date, little attention has been given to issues related to normalization and batch effect correction for this kind of data. METHODS: We evaluated three common normalization approaches and investigated their performance in batch effect removal using three datasets with different degrees of batch effects generated from HumanMethylation27 platform: quantile normalization at average β value (QNβ); two step quantile normalization at probe signals implemented in "lumi" package of R (lumi); and quantile normalization of A and B signal separately (ABnorm). Subsequent Empirical Bayes (EB) batch adjustment was also evaluated. RESULTS: Each normalization could remove a portion of batch effects and their effectiveness differed depending on the severity of batch effects in a dataset. For the dataset with minor batch effects (Dataset 1), normalization alone appeared adequate and "lumi" showed the best performance. However, all methods left substantial batch effects intact in the datasets with obvious batch effects and further correction was necessary. Without any correction, 50 and 66 percent of CpGs were associated with batch effects in Dataset 2 and 3, respectively. After QNβ, lumi or ABnorm, the number of CpGs associated with batch effects were reduced to 24, 32, and 26 percent for Dataset 2; and 37, 46, and 35 percent for Dataset 3, respectively. Additional EB correction effectively removed such remaining non-biological effects. More importantly, the two-step procedure almost tripled the numbers of CpGs associated with the outcome of interest for the two datasets. CONCLUSION: Genome-wide methylation data from Infinium Methylation BeadChip can be susceptible to batch effects with profound impacts on downstream analyses and conclusions. Normalization can reduce part but not all batch effects. EB correction along with normalization is recommended for effective batch effect removal. BioMed Central 2011-12-16 /pmc/articles/PMC3265417/ /pubmed/22171553 http://dx.doi.org/10.1186/1755-8794-4-84 Text en Copyright ©2011 Sun et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sun, Zhifu
Chai, High Seng
Wu, Yanhong
White, Wendy M
Donkena, Krishna V
Klein, Christopher J
Garovic, Vesna D
Therneau, Terry M
Kocher, Jean-Pierre A
Batch effect correction for genome-wide methylation data with Illumina Infinium platform
title Batch effect correction for genome-wide methylation data with Illumina Infinium platform
title_full Batch effect correction for genome-wide methylation data with Illumina Infinium platform
title_fullStr Batch effect correction for genome-wide methylation data with Illumina Infinium platform
title_full_unstemmed Batch effect correction for genome-wide methylation data with Illumina Infinium platform
title_short Batch effect correction for genome-wide methylation data with Illumina Infinium platform
title_sort batch effect correction for genome-wide methylation data with illumina infinium platform
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3265417/
https://www.ncbi.nlm.nih.gov/pubmed/22171553
http://dx.doi.org/10.1186/1755-8794-4-84
work_keys_str_mv AT sunzhifu batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT chaihighseng batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT wuyanhong batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT whitewendym batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT donkenakrishnav batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT kleinchristopherj batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT garovicvesnad batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT therneauterrym batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform
AT kocherjeanpierrea batcheffectcorrectionforgenomewidemethylationdatawithilluminainfiniumplatform