Cargando…

Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches

In many statistical applications, composite variables are constructed to reduce the number of variables and improve the performances of statistical analyses of these variables, especially when some of the variables are highly correlated. Principal component analysis (PCA) and factor analysis (FA) ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Chenyu, Zhang, Xinlian, Nguyen, Tanya T, Liu, Jinyuan, Wu, Tsungchin, Lee, Ellen, Tu, Xin M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8796256/
https://www.ncbi.nlm.nih.gov/pubmed/35146334
http://dx.doi.org/10.1136/gpsych-2021-100662
_version_ 1784641264732340224
author Liu, Chenyu
Zhang, Xinlian
Nguyen, Tanya T
Liu, Jinyuan
Wu, Tsungchin
Lee, Ellen
Tu, Xin M
author_facet Liu, Chenyu
Zhang, Xinlian
Nguyen, Tanya T
Liu, Jinyuan
Wu, Tsungchin
Lee, Ellen
Tu, Xin M
author_sort Liu, Chenyu
collection PubMed
description In many statistical applications, composite variables are constructed to reduce the number of variables and improve the performances of statistical analyses of these variables, especially when some of the variables are highly correlated. Principal component analysis (PCA) and factor analysis (FA) are generally used for such purposes. If the variables are used as explanatory or independent variables in linear regression analysis, partial least squares (PLS) regression is a better alternative. Unlike PCA and FA, PLS creates composite variables by also taking into account the response, or dependent variable, so that they have higher correlations with the response than composites from their PCA and FA counterparts. In this report, we provide an introduction to this useful approach and illustrate it with data from a real study.
format Online
Article
Text
id pubmed-8796256
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-87962562022-02-09 Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches Liu, Chenyu Zhang, Xinlian Nguyen, Tanya T Liu, Jinyuan Wu, Tsungchin Lee, Ellen Tu, Xin M Gen Psychiatr Biostatistical Methods in Psychiatry In many statistical applications, composite variables are constructed to reduce the number of variables and improve the performances of statistical analyses of these variables, especially when some of the variables are highly correlated. Principal component analysis (PCA) and factor analysis (FA) are generally used for such purposes. If the variables are used as explanatory or independent variables in linear regression analysis, partial least squares (PLS) regression is a better alternative. Unlike PCA and FA, PLS creates composite variables by also taking into account the response, or dependent variable, so that they have higher correlations with the response than composites from their PCA and FA counterparts. In this report, we provide an introduction to this useful approach and illustrate it with data from a real study. BMJ Publishing Group 2022-01-27 /pmc/articles/PMC8796256/ /pubmed/35146334 http://dx.doi.org/10.1136/gpsych-2021-100662 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Biostatistical Methods in Psychiatry
Liu, Chenyu
Zhang, Xinlian
Nguyen, Tanya T
Liu, Jinyuan
Wu, Tsungchin
Lee, Ellen
Tu, Xin M
Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches
title Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches
title_full Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches
title_fullStr Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches
title_full_unstemmed Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches
title_short Partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches
title_sort partial least squares regression and principal component analysis: similarity and differences between two popular variable reduction approaches
topic Biostatistical Methods in Psychiatry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8796256/
https://www.ncbi.nlm.nih.gov/pubmed/35146334
http://dx.doi.org/10.1136/gpsych-2021-100662
work_keys_str_mv AT liuchenyu partialleastsquaresregressionandprincipalcomponentanalysissimilarityanddifferencesbetweentwopopularvariablereductionapproaches
AT zhangxinlian partialleastsquaresregressionandprincipalcomponentanalysissimilarityanddifferencesbetweentwopopularvariablereductionapproaches
AT nguyentanyat partialleastsquaresregressionandprincipalcomponentanalysissimilarityanddifferencesbetweentwopopularvariablereductionapproaches
AT liujinyuan partialleastsquaresregressionandprincipalcomponentanalysissimilarityanddifferencesbetweentwopopularvariablereductionapproaches
AT wutsungchin partialleastsquaresregressionandprincipalcomponentanalysissimilarityanddifferencesbetweentwopopularvariablereductionapproaches
AT leeellen partialleastsquaresregressionandprincipalcomponentanalysissimilarityanddifferencesbetweentwopopularvariablereductionapproaches
AT tuxinm partialleastsquaresregressionandprincipalcomponentanalysissimilarityanddifferencesbetweentwopopularvariablereductionapproaches