Improving the analysis of designed studies by combining statistical modelling with study design information

BACKGROUND: In the fields of life sciences, so-called designed studies are used for studying complex biological systems. The data derived from these studies comply with a study design aimed at generating relevant information while diminishing unwanted variation (noise). Knowledge about the study des...

Bibliographic Details
Main Authors: Thissen, Uwe, Wopereis, Suzan, van den Berg, Sjoerd AA, Bobeldijk, Ivana, Kleemann, Robert, Kooistra, Teake, Willems van Dijk, Ko, van Ommen, Ben, Smilde, Age K
Format: Text
Language: English
Published: BioMed Central 2009
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657790/
https://www.ncbi.nlm.nih.gov/pubmed/19200393
http://dx.doi.org/10.1186/1471-2105-10-52
author Thissen, Uwe
Wopereis, Suzan
van den Berg, Sjoerd AA
Bobeldijk, Ivana
Kleemann, Robert
Kooistra, Teake
Willems van Dijk, Ko
van Ommen, Ben
Smilde, Age K
author_sort Thissen, Uwe
collection PubMed
description BACKGROUND: In the life sciences, so-called designed studies are used to study complex biological systems. The data derived from these studies comply with a study design aimed at generating relevant information while reducing unwanted variation (noise). Knowledge about the study design can be used to decompose the total data into data blocks that are associated with specific effects. Subsequent statistical analysis can be improved by this decomposition if it is applied to selected combinations of effects. RESULTS: The benefit of this approach was demonstrated with an analysis that combines multivariate PLS (Partial Least Squares) regression with data decomposition from ANOVA (Analysis of Variance): ANOVA-PLS. As a case study, a nutritional intervention study on Apolipoprotein E3-Leiden (APOE3Leiden) transgenic mice is used to study the relation between liver lipidomics and a plasma inflammation marker, Serum Amyloid A. The ANOVA-PLS performance was compared to PLS regression on the non-decomposed data with respect to the quality of the modelled relation, model reliability, and interpretability. CONCLUSION: ANOVA-PLS was shown to yield a better statistical model, one that is more reliable and easier to interpret than standard PLS analysis. In a subsequent biological interpretation, more relevant metabolites were derived from the model. The concept of combining data decomposition with a subsequent statistical analysis, as in ANOVA-PLS, is however not limited to PLS regression in metabolomics but can be applied to many statistical methods and many different types of data.
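The decomposition-then-regression idea in the description can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single design factor, uses invented toy numbers and hypothetical group labels ("control", "diet"), and computes only the first PLS1 component by hand instead of a full cross-validated PLS model. The function names `anova_decompose` and `pls1_first_component` are likewise illustrative.

```python
from collections import defaultdict


def column_means(rows):
    """Mean of each column of a list-of-rows matrix."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def anova_decompose(X, groups):
    """Split X into grand-mean, group-effect and residual blocks.

    Each block has the same shape as X and the three blocks sum back to X.
    Only one design factor is handled here (an assumption of this sketch).
    """
    grand = column_means(X)
    by_group = defaultdict(list)
    for row, g in zip(X, groups):
        by_group[g].append(row)
    group_means = {g: column_means(rows) for g, rows in by_group.items()}

    mean_block = [list(grand) for _ in X]
    effect_block = [
        [gm - m for gm, m in zip(group_means[g], grand)] for g in groups
    ]
    residual_block = [
        [x - gm for x, gm in zip(row, group_means[g])]
        for row, g in zip(X, groups)
    ]
    return mean_block, effect_block, residual_block


def pls1_first_component(X, y):
    """First PLS1 weight vector and sample scores.

    X is expected to be column-centred (ANOVA effect and residual blocks
    already are); y is centred inside the function.
    """
    y_mean = sum(y) / len(y)
    y_centred = [v - y_mean for v in y]
    # Weight of each variable is its covariance-like product with y.
    w = [sum(x * v for x, v in zip(col, y_centred)) for col in zip(*X)]
    norm = sum(v * v for v in w) ** 0.5
    if norm == 0.0:
        raise ValueError("X carries no covariance with y")
    w = [v / norm for v in w]
    scores = [sum(x * wj for x, wj in zip(row, w)) for row in X]
    return w, scores


# Toy data (invented for illustration): 4 samples, 2 variables, 2 groups.
X = [[1.0, 2.0], [3.0, 4.0], [5.0, 8.0], [7.0, 10.0]]
groups = ["control", "control", "diet", "diet"]
y = [1.0, 2.0, 5.0, 6.0]  # hypothetical response values

mean_block, effect_block, residual_block = anova_decompose(X, groups)

# Regress the response on the selected block only (here: the group effect).
weights, scores = pls1_first_component(effect_block, y)
```

Choosing which block, or which sum of blocks, to pass to the regression step is where the study design knowledge enters; with several design factors the decomposition would produce one block per main effect and interaction.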
format Text
id pubmed-2657790
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2657790 2009-03-19 BMC Bioinformatics Research Article
BioMed Central 2009-02-07 /pmc/articles/PMC2657790/ /pubmed/19200393 http://dx.doi.org/10.1186/1471-2105-10-52 Text en Copyright © 2009 Thissen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Improving the analysis of designed studies by combining statistical modelling with study design information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657790/
https://www.ncbi.nlm.nih.gov/pubmed/19200393
http://dx.doi.org/10.1186/1471-2105-10-52