Cargando…

Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples

Analytical drift is a major source of bias in mass spectrometry based metabolomics confounding interpretation and biomarker detection. So far, standard protocols for sample and data analysis have not been able to fully resolve this. We present a combined approach for minimizing the influence of anal...

Descripción completa

Detalles Bibliográficos
Autores principales: Jonsson, Pär, Wuolikainen, Anna, Thysell, Elin, Chorell, Elin, Stattin, Pär, Wikström, Pernilla, Antti, Henrik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4605978/
https://www.ncbi.nlm.nih.gov/pubmed/26491420
http://dx.doi.org/10.1007/s11306-015-0818-3
_version_ 1782395287349755904
author Jonsson, Pär
Wuolikainen, Anna
Thysell, Elin
Chorell, Elin
Stattin, Pär
Wikström, Pernilla
Antti, Henrik
author_facet Jonsson, Pär
Wuolikainen, Anna
Thysell, Elin
Chorell, Elin
Stattin, Pär
Wikström, Pernilla
Antti, Henrik
author_sort Jonsson, Pär
collection PubMed
description Analytical drift is a major source of bias in mass spectrometry based metabolomics confounding interpretation and biomarker detection. So far, standard protocols for sample and data analysis have not been able to fully resolve this. We present a combined approach for minimizing the influence of analytical drift on multivariate comparisons of matched or dependent samples in mass spectrometry based metabolomics studies. The approach is building on a randomization procedure for sample run order, constrained to independent randomizations between and within dependent sample pairs (e.g. pre/post intervention). This is followed by a novel multivariate statistical analysis strategy allowing paired or dependent analyses of individual effects named OPLS-effect projections (OPLS-EP). We show, using simulated data that OPLS-EP gives improved interpretation over existing methods and that constrained randomization of sample run order in combination with an appropriate dependent statistical test increase the accuracy and sensitivity and decrease the false omission rate in biomarker detection. We verify these findings and prove the strength of the suggested approach in a clinical data set consisting of LC/MS data of blood plasma samples from patients before and after radical prostatectomy. Here OPLS-EP compared to traditional (independent) OPLS-discriminant analysis (OPLS-DA) on constrained randomized data gives a less complex model (3 versus 5 components) as well a higher predictive ability (Q2 = 0.80 versus Q2 = 0.55). We explain this by showing that paired statistical analysis detects 37 unique significant metabolites that were masked for the independent test due to bias, including analytical drift and inter-individual variation.
format Online
Article
Text
id pubmed-4605978
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-46059782015-10-19 Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples Jonsson, Pär Wuolikainen, Anna Thysell, Elin Chorell, Elin Stattin, Pär Wikström, Pernilla Antti, Henrik Metabolomics Original Article Analytical drift is a major source of bias in mass spectrometry based metabolomics confounding interpretation and biomarker detection. So far, standard protocols for sample and data analysis have not been able to fully resolve this. We present a combined approach for minimizing the influence of analytical drift on multivariate comparisons of matched or dependent samples in mass spectrometry based metabolomics studies. The approach is building on a randomization procedure for sample run order, constrained to independent randomizations between and within dependent sample pairs (e.g. pre/post intervention). This is followed by a novel multivariate statistical analysis strategy allowing paired or dependent analyses of individual effects named OPLS-effect projections (OPLS-EP). We show, using simulated data that OPLS-EP gives improved interpretation over existing methods and that constrained randomization of sample run order in combination with an appropriate dependent statistical test increase the accuracy and sensitivity and decrease the false omission rate in biomarker detection. We verify these findings and prove the strength of the suggested approach in a clinical data set consisting of LC/MS data of blood plasma samples from patients before and after radical prostatectomy. Here OPLS-EP compared to traditional (independent) OPLS-discriminant analysis (OPLS-DA) on constrained randomized data gives a less complex model (3 versus 5 components) as well a higher predictive ability (Q2 = 0.80 versus Q2 = 0.55). We explain this by showing that paired statistical analysis detects 37 unique significant metabolites that were masked for the independent test due to bias, including analytical drift and inter-individual variation. Springer US 2015-06-02 2015 /pmc/articles/PMC4605978/ /pubmed/26491420 http://dx.doi.org/10.1007/s11306-015-0818-3 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Jonsson, Pär
Wuolikainen, Anna
Thysell, Elin
Chorell, Elin
Stattin, Pär
Wikström, Pernilla
Antti, Henrik
Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
title Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
title_full Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
title_fullStr Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
title_full_unstemmed Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
title_short Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
title_sort constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4605978/
https://www.ncbi.nlm.nih.gov/pubmed/26491420
http://dx.doi.org/10.1007/s11306-015-0818-3
work_keys_str_mv AT jonssonpar constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples
AT wuolikainenanna constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples
AT thysellelin constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples
AT chorellelin constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples
AT stattinpar constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples
AT wikstrompernilla constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples
AT anttihenrik constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples