Cargando…

Scaling in ANOVA-simultaneous component analysis

In omics research often high-dimensional data is collected according to an experimental design. Typically, the manipulations involved yield differential effects on subsets of variables. An effective approach to identify those effects is ANOVA-simultaneous component analysis (ASCA), which combines an...

Descripción completa

Detalles Bibliográficos
Autores principales: Timmerman, Marieke E., Hoefsloot, Huub C. J., Smilde, Age K., Ceulemans, Eva
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559107/
https://www.ncbi.nlm.nih.gov/pubmed/26366136
http://dx.doi.org/10.1007/s11306-015-0785-8
_version_ 1782388723901530112
author Timmerman, Marieke E.
Hoefsloot, Huub C. J.
Smilde, Age K.
Ceulemans, Eva
author_facet Timmerman, Marieke E.
Hoefsloot, Huub C. J.
Smilde, Age K.
Ceulemans, Eva
author_sort Timmerman, Marieke E.
collection PubMed
description In omics research often high-dimensional data is collected according to an experimental design. Typically, the manipulations involved yield differential effects on subsets of variables. An effective approach to identify those effects is ANOVA-simultaneous component analysis (ASCA), which combines analysis of variance with principal component analysis. So far, pre-treatment in ASCA received hardly any attention, whereas its effects can be huge. In this paper, we describe various strategies for scaling, and identify a rational approach. We present the approaches in matrix algebra terms and illustrate them with an insightful simulated example. We show that scaling directly influences which data aspects are stressed in the analysis, and hence become apparent in the solution. Therefore, the cornerstone for proper scaling is to use a scaling factor that is free from the effect of interest. This implies that proper scaling depends on the effect(s) of interest, and that different types of scaling may be proper for the different effect matrices. We illustrate that different scaling approaches can greatly affect the ASCA interpretation with a real-life example from nutritional research. The principle that scaling factors should be free from the effect of interest generalizes to other statistical methods that involve scaling, as classification methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-015-0785-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4559107
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-45591072015-09-09 Scaling in ANOVA-simultaneous component analysis Timmerman, Marieke E. Hoefsloot, Huub C. J. Smilde, Age K. Ceulemans, Eva Metabolomics Original Article In omics research often high-dimensional data is collected according to an experimental design. Typically, the manipulations involved yield differential effects on subsets of variables. An effective approach to identify those effects is ANOVA-simultaneous component analysis (ASCA), which combines analysis of variance with principal component analysis. So far, pre-treatment in ASCA received hardly any attention, whereas its effects can be huge. In this paper, we describe various strategies for scaling, and identify a rational approach. We present the approaches in matrix algebra terms and illustrate them with an insightful simulated example. We show that scaling directly influences which data aspects are stressed in the analysis, and hence become apparent in the solution. Therefore, the cornerstone for proper scaling is to use a scaling factor that is free from the effect of interest. This implies that proper scaling depends on the effect(s) of interest, and that different types of scaling may be proper for the different effect matrices. We illustrate that different scaling approaches can greatly affect the ASCA interpretation with a real-life example from nutritional research. The principle that scaling factors should be free from the effect of interest generalizes to other statistical methods that involve scaling, as classification methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-015-0785-8) contains supplementary material, which is available to authorized users. Springer US 2015-02-14 2015 /pmc/articles/PMC4559107/ /pubmed/26366136 http://dx.doi.org/10.1007/s11306-015-0785-8 Text en © The Author(s) 2015 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
spellingShingle Original Article
Timmerman, Marieke E.
Hoefsloot, Huub C. J.
Smilde, Age K.
Ceulemans, Eva
Scaling in ANOVA-simultaneous component analysis
title Scaling in ANOVA-simultaneous component analysis
title_full Scaling in ANOVA-simultaneous component analysis
title_fullStr Scaling in ANOVA-simultaneous component analysis
title_full_unstemmed Scaling in ANOVA-simultaneous component analysis
title_short Scaling in ANOVA-simultaneous component analysis
title_sort scaling in anova-simultaneous component analysis
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559107/
https://www.ncbi.nlm.nih.gov/pubmed/26366136
http://dx.doi.org/10.1007/s11306-015-0785-8
work_keys_str_mv AT timmermanmariekee scalinginanovasimultaneouscomponentanalysis
AT hoefsloothuubcj scalinginanovasimultaneouscomponentanalysis
AT smildeagek scalinginanovasimultaneouscomponentanalysis
AT ceulemanseva scalinginanovasimultaneouscomponentanalysis