Cargando…

Correcting for batch effects in case-control microbiome studies

High-throughput data generation platforms, like mass-spectrometry, microarrays, and second-generation sequencing are susceptible to batch effects due to run-to-run variation in reagents, equipment, protocols, or personnel. Currently, batch correction methods are not commonly applied to microbiome se...

Descripción completa

Detalles Bibliográficos
Autores principales: Gibbons, Sean M., Duvallet, Claire, Alm, Eric J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940237/
https://www.ncbi.nlm.nih.gov/pubmed/29684016
http://dx.doi.org/10.1371/journal.pcbi.1006102
_version_ 1783321077956476928
author Gibbons, Sean M.
Duvallet, Claire
Alm, Eric J.
author_facet Gibbons, Sean M.
Duvallet, Claire
Alm, Eric J.
author_sort Gibbons, Sean M.
collection PubMed
description High-throughput data generation platforms, like mass-spectrometry, microarrays, and second-generation sequencing are susceptible to batch effects due to run-to-run variation in reagents, equipment, protocols, or personnel. Currently, batch correction methods are not commonly applied to microbiome sequencing datasets. In this paper, we compare different batch-correction methods applied to microbiome case-control studies. We introduce a model-free normalization procedure where features (i.e. bacterial taxa) in case samples are converted to percentiles of the equivalent features in control samples within a study prior to pooling data across studies. We look at how this percentile-normalization method compares to traditional meta-analysis methods for combining independent p-values and to limma and ComBat, widely used batch-correction models developed for RNA microarray data. Overall, we show that percentile-normalization is a simple, non-parametric approach for correcting batch effects and improving sensitivity in case-control meta-analyses.
format Online
Article
Text
id pubmed-5940237
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59402372018-05-18 Correcting for batch effects in case-control microbiome studies Gibbons, Sean M. Duvallet, Claire Alm, Eric J. PLoS Comput Biol Research Article High-throughput data generation platforms, like mass-spectrometry, microarrays, and second-generation sequencing are susceptible to batch effects due to run-to-run variation in reagents, equipment, protocols, or personnel. Currently, batch correction methods are not commonly applied to microbiome sequencing datasets. In this paper, we compare different batch-correction methods applied to microbiome case-control studies. We introduce a model-free normalization procedure where features (i.e. bacterial taxa) in case samples are converted to percentiles of the equivalent features in control samples within a study prior to pooling data across studies. We look at how this percentile-normalization method compares to traditional meta-analysis methods for combining independent p-values and to limma and ComBat, widely used batch-correction models developed for RNA microarray data. Overall, we show that percentile-normalization is a simple, non-parametric approach for correcting batch effects and improving sensitivity in case-control meta-analyses. Public Library of Science 2018-04-23 /pmc/articles/PMC5940237/ /pubmed/29684016 http://dx.doi.org/10.1371/journal.pcbi.1006102 Text en © 2018 Gibbons et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gibbons, Sean M.
Duvallet, Claire
Alm, Eric J.
Correcting for batch effects in case-control microbiome studies
title Correcting for batch effects in case-control microbiome studies
title_full Correcting for batch effects in case-control microbiome studies
title_fullStr Correcting for batch effects in case-control microbiome studies
title_full_unstemmed Correcting for batch effects in case-control microbiome studies
title_short Correcting for batch effects in case-control microbiome studies
title_sort correcting for batch effects in case-control microbiome studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940237/
https://www.ncbi.nlm.nih.gov/pubmed/29684016
http://dx.doi.org/10.1371/journal.pcbi.1006102
work_keys_str_mv AT gibbonsseanm correctingforbatcheffectsincasecontrolmicrobiomestudies
AT duvalletclaire correctingforbatcheffectsincasecontrolmicrobiomestudies
AT almericj correctingforbatcheffectsincasecontrolmicrobiomestudies