Cargando…

Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses

Removal of, or adjustment for, batch effects or center differences is generally required when such effects are present in data. In particular, when preparing microarray gene expression data from multiple cohorts, array platforms, or batches for later analyses, batch effects can have confounding effe...

Descripción completa

Detalles Bibliográficos
Autores principales: Nygaard, Vegard, Rødland, Einar Andreas, Hovig, Eivind
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679072/
https://www.ncbi.nlm.nih.gov/pubmed/26272994
http://dx.doi.org/10.1093/biostatistics/kxv027
_version_ 1782405541413257216
author Nygaard, Vegard
Rødland, Einar Andreas
Hovig, Eivind
author_facet Nygaard, Vegard
Rødland, Einar Andreas
Hovig, Eivind
author_sort Nygaard, Vegard
collection PubMed
description Removal of, or adjustment for, batch effects or center differences is generally required when such effects are present in data. In particular, when preparing microarray gene expression data from multiple cohorts, array platforms, or batches for later analyses, batch effects can have confounding effects, inducing spurious differences between study groups. Many methods and tools exist for removing batch effects from data. However, when study groups are not evenly distributed across batches, actual group differences may induce apparent batch differences, in which case batch adjustments may bias, usually deflate, group differences. Some tools therefore have the option of preserving the difference between study groups, e.g. using a two-way ANOVA model to simultaneously estimate both group and batch effects. Unfortunately, this approach may systematically induce incorrect group differences in downstream analyses when groups are distributed between the batches in an unbalanced manner. The scientific community seems to be largely unaware of how this approach may lead to false discoveries.
format Online
Article
Text
id pubmed-4679072
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46790722015-12-16 Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses Nygaard, Vegard Rødland, Einar Andreas Hovig, Eivind Biostatistics Articles Removal of, or adjustment for, batch effects or center differences is generally required when such effects are present in data. In particular, when preparing microarray gene expression data from multiple cohorts, array platforms, or batches for later analyses, batch effects can have confounding effects, inducing spurious differences between study groups. Many methods and tools exist for removing batch effects from data. However, when study groups are not evenly distributed across batches, actual group differences may induce apparent batch differences, in which case batch adjustments may bias, usually deflate, group differences. Some tools therefore have the option of preserving the difference between study groups, e.g. using a two-way ANOVA model to simultaneously estimate both group and batch effects. Unfortunately, this approach may systematically induce incorrect group differences in downstream analyses when groups are distributed between the batches in an unbalanced manner. The scientific community seems to be largely unaware of how this approach may lead to false discoveries. Oxford University Press 2016-01 2015-08-13 /pmc/articles/PMC4679072/ /pubmed/26272994 http://dx.doi.org/10.1093/biostatistics/kxv027 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Nygaard, Vegard
Rødland, Einar Andreas
Hovig, Eivind
Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses
title Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses
title_full Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses
title_fullStr Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses
title_full_unstemmed Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses
title_short Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses
title_sort methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679072/
https://www.ncbi.nlm.nih.gov/pubmed/26272994
http://dx.doi.org/10.1093/biostatistics/kxv027
work_keys_str_mv AT nygaardvegard methodsthatremovebatcheffectswhileretaininggroupdifferencesmayleadtoexaggeratedconfidenceindownstreamanalyses
AT rødlandeinarandreas methodsthatremovebatcheffectswhileretaininggroupdifferencesmayleadtoexaggeratedconfidenceindownstreamanalyses
AT hovigeivind methodsthatremovebatcheffectswhileretaininggroupdifferencesmayleadtoexaggeratedconfidenceindownstreamanalyses