Cargando…
An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics
Untargeted metabolomics studies are unbiased but identifying the same feature across studies is complicated by environmental variation, batch effects, and instrument variability. Ideally, several studies that assay the same set of metabolic features would be used to select recurring features to purs...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9682135/ https://www.ncbi.nlm.nih.gov/pubmed/36438654 http://dx.doi.org/10.3389/fmolb.2022.930204 |
_version_ | 1784834781987471360 |
---|---|
author | Shaver, Amanda O. Garcia, Brianna M. Gouveia, Goncalo J. Morse, Alison M. Liu, Zihao Asef, Carter K. Borges, Ricardo M. Leach, Franklin E. Andersen, Erik C. Amster, I. Jonathan Fernández, Facundo M. Edison, Arthur S. McIntyre, Lauren M. |
author_facet | Shaver, Amanda O. Garcia, Brianna M. Gouveia, Goncalo J. Morse, Alison M. Liu, Zihao Asef, Carter K. Borges, Ricardo M. Leach, Franklin E. Andersen, Erik C. Amster, I. Jonathan Fernández, Facundo M. Edison, Arthur S. McIntyre, Lauren M. |
author_sort | Shaver, Amanda O. |
collection | PubMed |
description | Untargeted metabolomics studies are unbiased but identifying the same feature across studies is complicated by environmental variation, batch effects, and instrument variability. Ideally, several studies that assay the same set of metabolic features would be used to select recurring features to pursue for identification. Here, we developed an anchored experimental design. This generalizable approach enabled us to integrate three genetic studies consisting of 14 test strains of Caenorhabditis elegans prior to the compound identification process. An anchor strain, PD1074, was included in every sample collection, resulting in a large set of biological replicates of a genetically identical strain that anchored each study. This enables us to estimate treatment effects within each batch and apply straightforward meta-analytic approaches to combine treatment effects across batches without the need for estimation of batch effects and complex normalization strategies. We collected 104 test samples for three genetic studies across six batches to produce five analytical datasets from two complementary technologies commonly used in untargeted metabolomics. Here, we use the model system C. elegans to demonstrate that an augmented design combined with experimental blocks and other metabolomic QC approaches can be used to anchor studies and enable comparisons of stable spectral features across time without the need for compound identification. This approach is generalizable to systems where the same genotype can be assayed in multiple environments and provides biologically relevant features for downstream compound identification efforts. All methods are included in the newest release of the publicly available SECIMTools based on the open-source Galaxy platform. |
format | Online Article Text |
id | pubmed-9682135 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-96821352022-11-24 An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics Shaver, Amanda O. Garcia, Brianna M. Gouveia, Goncalo J. Morse, Alison M. Liu, Zihao Asef, Carter K. Borges, Ricardo M. Leach, Franklin E. Andersen, Erik C. Amster, I. Jonathan Fernández, Facundo M. Edison, Arthur S. McIntyre, Lauren M. Front Mol Biosci Molecular Biosciences Untargeted metabolomics studies are unbiased but identifying the same feature across studies is complicated by environmental variation, batch effects, and instrument variability. Ideally, several studies that assay the same set of metabolic features would be used to select recurring features to pursue for identification. Here, we developed an anchored experimental design. This generalizable approach enabled us to integrate three genetic studies consisting of 14 test strains of Caenorhabditis elegans prior to the compound identification process. An anchor strain, PD1074, was included in every sample collection, resulting in a large set of biological replicates of a genetically identical strain that anchored each study. This enables us to estimate treatment effects within each batch and apply straightforward meta-analytic approaches to combine treatment effects across batches without the need for estimation of batch effects and complex normalization strategies. We collected 104 test samples for three genetic studies across six batches to produce five analytical datasets from two complementary technologies commonly used in untargeted metabolomics. Here, we use the model system C. elegans to demonstrate that an augmented design combined with experimental blocks and other metabolomic QC approaches can be used to anchor studies and enable comparisons of stable spectral features across time without the need for compound identification. This approach is generalizable to systems where the same genotype can be assayed in multiple environments and provides biologically relevant features for downstream compound identification efforts. All methods are included in the newest release of the publicly available SECIMTools based on the open-source Galaxy platform. Frontiers Media S.A. 2022-11-09 /pmc/articles/PMC9682135/ /pubmed/36438654 http://dx.doi.org/10.3389/fmolb.2022.930204 Text en Copyright © 2022 Shaver, Garcia, Gouveia, Morse, Liu, Asef, Borges, Leach, Andersen, Amster, Fernández, Edison and McIntyre. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Shaver, Amanda O. Garcia, Brianna M. Gouveia, Goncalo J. Morse, Alison M. Liu, Zihao Asef, Carter K. Borges, Ricardo M. Leach, Franklin E. Andersen, Erik C. Amster, I. Jonathan Fernández, Facundo M. Edison, Arthur S. McIntyre, Lauren M. An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics |
title | An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics |
title_full | An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics |
title_fullStr | An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics |
title_full_unstemmed | An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics |
title_short | An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics |
title_sort | anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9682135/ https://www.ncbi.nlm.nih.gov/pubmed/36438654 http://dx.doi.org/10.3389/fmolb.2022.930204 |
work_keys_str_mv | AT shaveramandao ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT garciabriannam ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT gouveiagoncaloj ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT morsealisonm ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT liuzihao ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT asefcarterk ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT borgesricardom ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT leachfrankline ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT andersenerikc ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT amsterijonathan ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT fernandezfacundom ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT edisonarthurs ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT mcintyrelaurenm ananchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT shaveramandao anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT garciabriannam anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT gouveiagoncaloj anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT morsealisonm anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT liuzihao anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT asefcarterk anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT borgesricardom anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT leachfrankline anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT andersenerikc anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT amsterijonathan anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT fernandezfacundom anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT edisonarthurs anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics AT mcintyrelaurenm anchoredexperimentaldesignandmetaanalysisapproachtoaddressbatcheffectsinlargescalemetabolomics |