Cargando…

An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics

Untargeted metabolomics studies are unbiased but identifying the same feature across studies is complicated by environmental variation, batch effects, and instrument variability. Ideally, several studies that assay the same set of metabolic features would be used to select recurring features to purs...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaver, Amanda O., Garcia, Brianna M., Gouveia, Goncalo J., Morse, Alison M., Liu, Zihao, Asef, Carter K., Borges, Ricardo M., Leach, Franklin E., Andersen, Erik C., Amster, I. Jonathan, Fernández, Facundo M., Edison, Arthur S., McIntyre, Lauren M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9682135/
https://www.ncbi.nlm.nih.gov/pubmed/36438654
http://dx.doi.org/10.3389/fmolb.2022.930204
_version_ 1784834781987471360
author Shaver, Amanda O.
Garcia, Brianna M.
Gouveia, Goncalo J.
Morse, Alison M.
Liu, Zihao
Asef, Carter K.
Borges, Ricardo M.
Leach, Franklin E.
Andersen, Erik C.
Amster, I. Jonathan
Fernández, Facundo M.
Edison, Arthur S.
McIntyre, Lauren M.
author_facet Shaver, Amanda O.
Garcia, Brianna M.
Gouveia, Goncalo J.
Morse, Alison M.
Liu, Zihao
Asef, Carter K.
Borges, Ricardo M.
Leach, Franklin E.
Andersen, Erik C.
Amster, I. Jonathan
Fernández, Facundo M.
Edison, Arthur S.
McIntyre, Lauren M.
author_sort Shaver, Amanda O.
collection PubMed
description Untargeted metabolomics studies are unbiased but identifying the same feature across studies is complicated by environmental variation, batch effects, and instrument variability. Ideally, several studies that assay the same set of metabolic features would be used to select recurring features to pursue for identification. Here, we developed an anchored experimental design. This generalizable approach enabled us to integrate three genetic studies consisting of 14 test strains of Caenorhabditis elegans prior to the compound identification process. An anchor strain, PD1074, was included in every sample collection, resulting in a large set of biological replicates of a genetically identical strain that anchored each study. This enables us to estimate treatment effects within each batch and apply straightforward meta-analytic approaches to combine treatment effects across batches without the need for estimation of batch effects and complex normalization strategies. We collected 104 test samples for three genetic studies across six batches to produce five analytical datasets from two complementary technologies commonly used in untargeted metabolomics. Here, we use the model system C. elegans to demonstrate that an augmented design combined with experimental blocks and other metabolomic QC approaches can be used to anchor studies and enable comparisons of stable spectral features across time without the need for compound identification. This approach is generalizable to systems where the same genotype can be assayed in multiple environments and provides biologically relevant features for downstream compound identification efforts. All methods are included in the newest release of the publicly available SECIMTools based on the open-source Galaxy platform.
format Online
Article
Text
id pubmed-9682135
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96821352022-11-24 An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics Shaver, Amanda O. Garcia, Brianna M. Gouveia, Goncalo J. Morse, Alison M. Liu, Zihao Asef, Carter K. Borges, Ricardo M. Leach, Franklin E. Andersen, Erik C. Amster, I. Jonathan Fernández, Facundo M. Edison, Arthur S. McIntyre, Lauren M. Front Mol Biosci Molecular Biosciences Untargeted metabolomics studies are unbiased but identifying the same feature across studies is complicated by environmental variation, batch effects, and instrument variability. Ideally, several studies that assay the same set of metabolic features would be used to select recurring features to pursue for identification. Here, we developed an anchored experimental design. This generalizable approach enabled us to integrate three genetic studies consisting of 14 test strains of Caenorhabditis elegans prior to the compound identification process. An anchor strain, PD1074, was included in every sample collection, resulting in a large set of biological replicates of a genetically identical strain that anchored each study. This enables us to estimate treatment effects within each batch and apply straightforward meta-analytic approaches to combine treatment effects across batches without the need for estimation of batch effects and complex normalization strategies. We collected 104 test samples for three genetic studies across six batches to produce five analytical datasets from two complementary technologies commonly used in untargeted metabolomics. Here, we use the model system C. elegans to demonstrate that an augmented design combined with experimental blocks and other metabolomic QC approaches can be used to anchor studies and enable comparisons of stable spectral features across time without the need for compound identification. This approach is generalizable to systems where the same genotype can be assayed in multiple environments and provides biologically relevant features for downstream compound identification efforts. All methods are included in the newest release of the publicly available SECIMTools based on the open-source Galaxy platform. Frontiers Media S.A. 2022-11-09 /pmc/articles/PMC9682135/ /pubmed/36438654 http://dx.doi.org/10.3389/fmolb.2022.930204 Text en Copyright © 2022 Shaver, Garcia, Gouveia, Morse, Liu, Asef, Borges, Leach, Andersen, Amster, Fernández, Edison and McIntyre. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Shaver, Amanda O.
Garcia, Brianna M.
Gouveia, Goncalo J.
Morse, Alison M.
Liu, Zihao
Asef, Carter K.
Borges, Ricardo M.
Leach, Franklin E.
Andersen, Erik C.
Amster, I. Jonathan
Fernández, Facundo M.
Edison, Arthur S.
McIntyre, Lauren M.
An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
title An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
title_full An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
title_fullStr An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
title_full_unstemmed An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
title_short An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
title_sort anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9682135/
https://www.ncbi.nlm.nih.gov/pubmed/36438654
http://dx.doi.org/10.3389/fmolb.2022.930204
work_keys_str_mv AT shaveramandao ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT garciabriannam ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT gouveiagoncaloj ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT morsealisonm ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT liuzihao ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT asefcarterk ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT borgesricardom ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT leachfrankline ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT andersenerikc ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT amsterijonathan ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT fernandezfacundom ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT edisonarthurs ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT mcintyrelaurenm ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT shaveramandao anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT garciabriannam anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT gouveiagoncaloj anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT morsealisonm anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT liuzihao anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT asefcarterk anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT borgesricardom anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT leachfrankline anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT andersenerikc anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT amsterijonathan anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT fernandezfacundom anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT edisonarthurs anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics
AT mcintyrelaurenm anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics