Cargando…

Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics

INTRODUCTION: Availability of large cohorts of samples with related metadata provides scientists with extensive material for studies. At the same time, recent development of modern high-throughput ‘omics’ technologies, including metabolomics, has resulted in the potential for analysis of large sampl...

Descripción completa

Detalles Bibliográficos
Autores principales: Surowiec, Izabella, Johansson, Erik, Torell, Frida, Idborg, Helena, Gunnarsson, Iva, Svenungsson, Elisabet, Jakobsson, Per-Johan, Trygg, Johan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570768/
https://www.ncbi.nlm.nih.gov/pubmed/28890672
http://dx.doi.org/10.1007/s11306-017-1248-1
_version_ 1783259213864108032
author Surowiec, Izabella
Johansson, Erik
Torell, Frida
Idborg, Helena
Gunnarsson, Iva
Svenungsson, Elisabet
Jakobsson, Per-Johan
Trygg, Johan
author_facet Surowiec, Izabella
Johansson, Erik
Torell, Frida
Idborg, Helena
Gunnarsson, Iva
Svenungsson, Elisabet
Jakobsson, Per-Johan
Trygg, Johan
author_sort Surowiec, Izabella
collection PubMed
description INTRODUCTION: Availability of large cohorts of samples with related metadata provides scientists with extensive material for studies. At the same time, recent development of modern high-throughput ‘omics’ technologies, including metabolomics, has resulted in the potential for analysis of large sample sizes. Representative subset selection becomes critical for selection of samples from bigger cohorts and their division into analytical batches. This especially holds true when relative quantification of compound levels is used. OBJECTIVES: We present a multivariate strategy for representative sample selection and integration of results from multi-batch experiments in metabolomics. METHODS: Multivariate characterization was applied for design of experiment based sample selection and subsequent subdivision into four analytical batches which were analyzed on different days by metabolomics profiling using gas-chromatography time-of-flight mass spectrometry (GC–TOF–MS). For each batch OPLS-DA(®) was used and its p(corr) vectors were averaged to obtain combined metabolic profile. Jackknifed standard errors were used to calculate confidence intervals for each metabolite in the average p(corr) profile. RESULTS: A combined, representative metabolic profile describing differences between systemic lupus erythematosus (SLE) patients and controls was obtained and used for elucidation of metabolic pathways that could be disturbed in SLE. CONCLUSION: Design of experiment based representative sample selection ensured diversity and minimized bias that could be introduced at this step. Combined metabolic profile enabled unified analysis and interpretation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-017-1248-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5570768
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-55707682017-09-07 Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics Surowiec, Izabella Johansson, Erik Torell, Frida Idborg, Helena Gunnarsson, Iva Svenungsson, Elisabet Jakobsson, Per-Johan Trygg, Johan Metabolomics Original Article INTRODUCTION: Availability of large cohorts of samples with related metadata provides scientists with extensive material for studies. At the same time, recent development of modern high-throughput ‘omics’ technologies, including metabolomics, has resulted in the potential for analysis of large sample sizes. Representative subset selection becomes critical for selection of samples from bigger cohorts and their division into analytical batches. This especially holds true when relative quantification of compound levels is used. OBJECTIVES: We present a multivariate strategy for representative sample selection and integration of results from multi-batch experiments in metabolomics. METHODS: Multivariate characterization was applied for design of experiment based sample selection and subsequent subdivision into four analytical batches which were analyzed on different days by metabolomics profiling using gas-chromatography time-of-flight mass spectrometry (GC–TOF–MS). For each batch OPLS-DA(®) was used and its p(corr) vectors were averaged to obtain combined metabolic profile. Jackknifed standard errors were used to calculate confidence intervals for each metabolite in the average p(corr) profile. RESULTS: A combined, representative metabolic profile describing differences between systemic lupus erythematosus (SLE) patients and controls was obtained and used for elucidation of metabolic pathways that could be disturbed in SLE. CONCLUSION: Design of experiment based representative sample selection ensured diversity and minimized bias that could be introduced at this step. Combined metabolic profile enabled unified analysis and interpretation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-017-1248-1) contains supplementary material, which is available to authorized users. Springer US 2017-08-24 2017 /pmc/articles/PMC5570768/ /pubmed/28890672 http://dx.doi.org/10.1007/s11306-017-1248-1 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Surowiec, Izabella
Johansson, Erik
Torell, Frida
Idborg, Helena
Gunnarsson, Iva
Svenungsson, Elisabet
Jakobsson, Per-Johan
Trygg, Johan
Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics
title Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics
title_full Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics
title_fullStr Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics
title_full_unstemmed Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics
title_short Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics
title_sort multivariate strategy for the sample selection and integration of multi-batch data in metabolomics
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5570768/
https://www.ncbi.nlm.nih.gov/pubmed/28890672
http://dx.doi.org/10.1007/s11306-017-1248-1
work_keys_str_mv AT surowiecizabella multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics
AT johanssonerik multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics
AT torellfrida multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics
AT idborghelena multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics
AT gunnarssoniva multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics
AT svenungssonelisabet multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics
AT jakobssonperjohan multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics
AT tryggjohan multivariatestrategyforthesampleselectionandintegrationofmultibatchdatainmetabolomics