Cargando…

Variable-selection ANOVA Simultaneous Component Analysis (VASCA)

MOTIVATION: ANOVA Simultaneous Component Analysis (ASCA) is a popular method for the analysis of multivariate data yielded by designed experiments. Meaningful associations between factors/interactions of the experimental design and measured variables in the dataset are typically identified via signi...

Descripción completa

Detalles Bibliográficos
Autores principales: Camacho, José, Vitale, Raffaele, Morales-Jiménez, David, Gómez-Llorente, Carolina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825241/
https://www.ncbi.nlm.nih.gov/pubmed/36495189
http://dx.doi.org/10.1093/bioinformatics/btac795
_version_ 1784866596238393344
author Camacho, José
Vitale, Raffaele
Morales-Jiménez, David
Gómez-Llorente, Carolina
author_facet Camacho, José
Vitale, Raffaele
Morales-Jiménez, David
Gómez-Llorente, Carolina
author_sort Camacho, José
collection PubMed
description MOTIVATION: ANOVA Simultaneous Component Analysis (ASCA) is a popular method for the analysis of multivariate data yielded by designed experiments. Meaningful associations between factors/interactions of the experimental design and measured variables in the dataset are typically identified via significance testing, with permutation tests being the standard go-to choice. However, in settings with large numbers of variables, like omics (genomics, transcriptomics, proteomics and metabolomics) experiments, the ‘holistic’ testing approach of ASCA (all variables considered) often overlooks statistically significant effects encoded by only a few variables (biomarkers). RESULTS: We hereby propose Variable-selection ASCA (VASCA), a method that generalizes ASCA through variable selection, augmenting its statistical power without inflating the Type-I error risk. The method is evaluated with simulations and with a real dataset from a multi-omic clinical experiment. We show that VASCA is more powerful than both ASCA and the widely adopted false discovery rate controlling procedure; the latter is used as a benchmark for variable selection based on multiple significance testing. We further illustrate the usefulness of VASCA for exploratory data analysis in comparison to the popular partial least squares discriminant analysis method and its sparse counterpart. AVAILABILITY AND IMPLEMENTATION: The code for VASCA is available in the MEDA Toolbox at https://github.com/josecamachop/MEDA-Toolbox (release v1.3). The simulation results and motivating example can be reproduced using the repository at https://github.com/josecamachop/VASCA/tree/v1.0.0 (DOI 10.5281/zenodo.7410623). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9825241
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98252412023-01-09 Variable-selection ANOVA Simultaneous Component Analysis (VASCA) Camacho, José Vitale, Raffaele Morales-Jiménez, David Gómez-Llorente, Carolina Bioinformatics Original Paper MOTIVATION: ANOVA Simultaneous Component Analysis (ASCA) is a popular method for the analysis of multivariate data yielded by designed experiments. Meaningful associations between factors/interactions of the experimental design and measured variables in the dataset are typically identified via significance testing, with permutation tests being the standard go-to choice. However, in settings with large numbers of variables, like omics (genomics, transcriptomics, proteomics and metabolomics) experiments, the ‘holistic’ testing approach of ASCA (all variables considered) often overlooks statistically significant effects encoded by only a few variables (biomarkers). RESULTS: We hereby propose Variable-selection ASCA (VASCA), a method that generalizes ASCA through variable selection, augmenting its statistical power without inflating the Type-I error risk. The method is evaluated with simulations and with a real dataset from a multi-omic clinical experiment. We show that VASCA is more powerful than both ASCA and the widely adopted false discovery rate controlling procedure; the latter is used as a benchmark for variable selection based on multiple significance testing. We further illustrate the usefulness of VASCA for exploratory data analysis in comparison to the popular partial least squares discriminant analysis method and its sparse counterpart. AVAILABILITY AND IMPLEMENTATION: The code for VASCA is available in the MEDA Toolbox at https://github.com/josecamachop/MEDA-Toolbox (release v1.3). The simulation results and motivating example can be reproduced using the repository at https://github.com/josecamachop/VASCA/tree/v1.0.0 (DOI 10.5281/zenodo.7410623). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-12-10 /pmc/articles/PMC9825241/ /pubmed/36495189 http://dx.doi.org/10.1093/bioinformatics/btac795 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Camacho, José
Vitale, Raffaele
Morales-Jiménez, David
Gómez-Llorente, Carolina
Variable-selection ANOVA Simultaneous Component Analysis (VASCA)
title Variable-selection ANOVA Simultaneous Component Analysis (VASCA)
title_full Variable-selection ANOVA Simultaneous Component Analysis (VASCA)
title_fullStr Variable-selection ANOVA Simultaneous Component Analysis (VASCA)
title_full_unstemmed Variable-selection ANOVA Simultaneous Component Analysis (VASCA)
title_short Variable-selection ANOVA Simultaneous Component Analysis (VASCA)
title_sort variable-selection anova simultaneous component analysis (vasca)
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825241/
https://www.ncbi.nlm.nih.gov/pubmed/36495189
http://dx.doi.org/10.1093/bioinformatics/btac795
work_keys_str_mv AT camachojose variableselectionanovasimultaneouscomponentanalysisvasca
AT vitaleraffaele variableselectionanovasimultaneouscomponentanalysisvasca
AT moralesjimenezdavid variableselectionanovasimultaneouscomponentanalysisvasca
AT gomezllorentecarolina variableselectionanovasimultaneouscomponentanalysisvasca