Cargando…

Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches

In many statistical applications, composite variables are constructed to reduce the number of variables and improve the performances of statistical analyses of these variables, especially when some of the variables are highly correlated. Principal component analysis (PCA) and factor analysis (FA) ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Chenyu, Zhang, Xinlian, Nguyen, Tanya T, Liu, Jinyuan, Wu, Tsungchin, Lee, Ellen, Tu, Xin M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8796256/
https://www.ncbi.nlm.nih.gov/pubmed/35146334
http://dx.doi.org/10.1136/gpsych-2021-100662
Descripción
Sumario:In many statistical applications, composite variables are constructed to reduce the number of variables and improve the performances of statistical analyses of these variables, especially when some of the variables are highly correlated. Principal component analysis (PCA) and factor analysis (FA) are generally used for such purposes. If the variables are used as explanatory or independent variables in linear regression analysis, partial least squares (PLS) regression is a better alternative. Unlike PCA and FA, PLS creates composite variables by also taking into account the response, or dependent variable, so that they have higher correlations with the response than composites from their PCA and FA counterparts. In this report, we provide an introduction to this useful approach and illustrate it with data from a real study.