Cargando…
Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples
Analytical drift is a major source of bias in mass spectrometry based metabolomics confounding interpretation and biomarker detection. So far, standard protocols for sample and data analysis have not been able to fully resolve this. We present a combined approach for minimizing the influence of anal...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4605978/ https://www.ncbi.nlm.nih.gov/pubmed/26491420 http://dx.doi.org/10.1007/s11306-015-0818-3 |
_version_ | 1782395287349755904 |
---|---|
author | Jonsson, Pär Wuolikainen, Anna Thysell, Elin Chorell, Elin Stattin, Pär Wikström, Pernilla Antti, Henrik |
author_facet | Jonsson, Pär Wuolikainen, Anna Thysell, Elin Chorell, Elin Stattin, Pär Wikström, Pernilla Antti, Henrik |
author_sort | Jonsson, Pär |
collection | PubMed |
description | Analytical drift is a major source of bias in mass spectrometry based metabolomics confounding interpretation and biomarker detection. So far, standard protocols for sample and data analysis have not been able to fully resolve this. We present a combined approach for minimizing the influence of analytical drift on multivariate comparisons of matched or dependent samples in mass spectrometry based metabolomics studies. The approach is building on a randomization procedure for sample run order, constrained to independent randomizations between and within dependent sample pairs (e.g. pre/post intervention). This is followed by a novel multivariate statistical analysis strategy allowing paired or dependent analyses of individual effects named OPLS-effect projections (OPLS-EP). We show, using simulated data that OPLS-EP gives improved interpretation over existing methods and that constrained randomization of sample run order in combination with an appropriate dependent statistical test increase the accuracy and sensitivity and decrease the false omission rate in biomarker detection. We verify these findings and prove the strength of the suggested approach in a clinical data set consisting of LC/MS data of blood plasma samples from patients before and after radical prostatectomy. Here OPLS-EP compared to traditional (independent) OPLS-discriminant analysis (OPLS-DA) on constrained randomized data gives a less complex model (3 versus 5 components) as well a higher predictive ability (Q2 = 0.80 versus Q2 = 0.55). We explain this by showing that paired statistical analysis detects 37 unique significant metabolites that were masked for the independent test due to bias, including analytical drift and inter-individual variation. |
format | Online Article Text |
id | pubmed-4605978 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-46059782015-10-19 Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples Jonsson, Pär Wuolikainen, Anna Thysell, Elin Chorell, Elin Stattin, Pär Wikström, Pernilla Antti, Henrik Metabolomics Original Article Analytical drift is a major source of bias in mass spectrometry based metabolomics confounding interpretation and biomarker detection. So far, standard protocols for sample and data analysis have not been able to fully resolve this. We present a combined approach for minimizing the influence of analytical drift on multivariate comparisons of matched or dependent samples in mass spectrometry based metabolomics studies. The approach is building on a randomization procedure for sample run order, constrained to independent randomizations between and within dependent sample pairs (e.g. pre/post intervention). This is followed by a novel multivariate statistical analysis strategy allowing paired or dependent analyses of individual effects named OPLS-effect projections (OPLS-EP). We show, using simulated data that OPLS-EP gives improved interpretation over existing methods and that constrained randomization of sample run order in combination with an appropriate dependent statistical test increase the accuracy and sensitivity and decrease the false omission rate in biomarker detection. We verify these findings and prove the strength of the suggested approach in a clinical data set consisting of LC/MS data of blood plasma samples from patients before and after radical prostatectomy. Here OPLS-EP compared to traditional (independent) OPLS-discriminant analysis (OPLS-DA) on constrained randomized data gives a less complex model (3 versus 5 components) as well a higher predictive ability (Q2 = 0.80 versus Q2 = 0.55). We explain this by showing that paired statistical analysis detects 37 unique significant metabolites that were masked for the independent test due to bias, including analytical drift and inter-individual variation. Springer US 2015-06-02 2015 /pmc/articles/PMC4605978/ /pubmed/26491420 http://dx.doi.org/10.1007/s11306-015-0818-3 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Original Article Jonsson, Pär Wuolikainen, Anna Thysell, Elin Chorell, Elin Stattin, Pär Wikström, Pernilla Antti, Henrik Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples |
title | Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples |
title_full | Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples |
title_fullStr | Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples |
title_full_unstemmed | Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples |
title_short | Constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples |
title_sort | constrained randomization and multivariate effect projections improve information extraction and biomarker pattern discovery in metabolomics studies involving dependent samples |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4605978/ https://www.ncbi.nlm.nih.gov/pubmed/26491420 http://dx.doi.org/10.1007/s11306-015-0818-3 |
work_keys_str_mv | AT jonssonpar constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples AT wuolikainenanna constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples AT thysellelin constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples AT chorellelin constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples AT stattinpar constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples AT wikstrompernilla constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples AT anttihenrik constrainedrandomizationandmultivariateeffectprojectionsimproveinformationextractionandbiomarkerpatterndiscoveryinmetabolomicsstudiesinvolvingdependentsamples |