Cargando…

Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota

BACKGROUND: Quality control including assessment of batch variabilities and confirmation of repeatability and reproducibility are integral component of high throughput omics studies including microbiome research. Batch effects can mask true biological results and/or result in irreproducible conclusi...

Descripción completa

Detalles Bibliográficos
Autores principales: Moossavi, Shirin, Fehr, Kelsey, Khafipour, Ehsan, Azad, Meghan B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7877029/
https://www.ncbi.nlm.nih.gov/pubmed/33568231
http://dx.doi.org/10.1186/s40168-020-00998-4
_version_ 1783650084120952832
author Moossavi, Shirin
Fehr, Kelsey
Khafipour, Ehsan
Azad, Meghan B.
author_facet Moossavi, Shirin
Fehr, Kelsey
Khafipour, Ehsan
Azad, Meghan B.
author_sort Moossavi, Shirin
collection PubMed
description BACKGROUND: Quality control including assessment of batch variabilities and confirmation of repeatability and reproducibility are integral component of high throughput omics studies including microbiome research. Batch effects can mask true biological results and/or result in irreproducible conclusions and interpretations. Low biomass samples in microbiome research are prone to reagent contamination; yet, quality control procedures for low biomass samples in large-scale microbiome studies are not well established. RESULTS: In this study, we have proposed a framework for an in-depth step-by-step approach to address this gap. The framework consists of three independent stages: (1) verification of sequencing accuracy by assessing technical repeatability and reproducibility of the results using mock communities and biological controls; (2) contaminant removal and batch variability correction by applying a two-tier strategy using statistical algorithms (e.g. decontam) followed by comparison of the data structure between batches; and (3) corroborating the repeatability and reproducibility of microbiome composition and downstream statistical analysis. Using this approach on the milk microbiota data from the CHILD Cohort generated in two batches (extracted and sequenced in 2016 and 2019), we were able to identify potential reagent contaminants that were missed with standard algorithms and substantially reduce contaminant-induced batch variability. Additionally, we confirmed the repeatability and reproducibility of our results in each batch before merging them for downstream analysis. CONCLUSION: This study provides important insight to advance quality control efforts in low biomass microbiome research. Within-study quality control that takes advantage of the data structure (i.e. differential prevalence of contaminants between batches) would enhance the overall reliability and reproducibility of research in this field. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-020-00998-4.
format Online
Article
Text
id pubmed-7877029
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-78770292021-02-11 Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota Moossavi, Shirin Fehr, Kelsey Khafipour, Ehsan Azad, Meghan B. Microbiome Short Report BACKGROUND: Quality control including assessment of batch variabilities and confirmation of repeatability and reproducibility are integral component of high throughput omics studies including microbiome research. Batch effects can mask true biological results and/or result in irreproducible conclusions and interpretations. Low biomass samples in microbiome research are prone to reagent contamination; yet, quality control procedures for low biomass samples in large-scale microbiome studies are not well established. RESULTS: In this study, we have proposed a framework for an in-depth step-by-step approach to address this gap. The framework consists of three independent stages: (1) verification of sequencing accuracy by assessing technical repeatability and reproducibility of the results using mock communities and biological controls; (2) contaminant removal and batch variability correction by applying a two-tier strategy using statistical algorithms (e.g. decontam) followed by comparison of the data structure between batches; and (3) corroborating the repeatability and reproducibility of microbiome composition and downstream statistical analysis. Using this approach on the milk microbiota data from the CHILD Cohort generated in two batches (extracted and sequenced in 2016 and 2019), we were able to identify potential reagent contaminants that were missed with standard algorithms and substantially reduce contaminant-induced batch variability. Additionally, we confirmed the repeatability and reproducibility of our results in each batch before merging them for downstream analysis. CONCLUSION: This study provides important insight to advance quality control efforts in low biomass microbiome research. Within-study quality control that takes advantage of the data structure (i.e. differential prevalence of contaminants between batches) would enhance the overall reliability and reproducibility of research in this field. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-020-00998-4. BioMed Central 2021-02-10 /pmc/articles/PMC7877029/ /pubmed/33568231 http://dx.doi.org/10.1186/s40168-020-00998-4 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Short Report
Moossavi, Shirin
Fehr, Kelsey
Khafipour, Ehsan
Azad, Meghan B.
Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_full Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_fullStr Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_full_unstemmed Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_short Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_sort repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7877029/
https://www.ncbi.nlm.nih.gov/pubmed/33568231
http://dx.doi.org/10.1186/s40168-020-00998-4
work_keys_str_mv AT moossavishirin repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota
AT fehrkelsey repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota
AT khafipourehsan repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota
AT azadmeghanb repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota