Cargando…

Validation in Principal Components Analysis Applied to EEG Data

The well-known multivariate technique Principal Components Analysis (PCA) is usually applied to a sample, and so component scores are subjected to sampling variability. However, few studies address their stability, an important topic when the sample size is small. This work presents three validation...

Descripción completa

Detalles Bibliográficos
Autores principales: Costa, João Carlos G. D., Da-Silva, Paulo José G., Almeida, Renan Moritz V. R., Infantosi, Antonio Fernando C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4170877/
https://www.ncbi.nlm.nih.gov/pubmed/25276221
http://dx.doi.org/10.1155/2014/413801
_version_ 1782335862356312064
author Costa, João Carlos G. D.
Da-Silva, Paulo José G.
Almeida, Renan Moritz V. R.
Infantosi, Antonio Fernando C.
author_facet Costa, João Carlos G. D.
Da-Silva, Paulo José G.
Almeida, Renan Moritz V. R.
Infantosi, Antonio Fernando C.
author_sort Costa, João Carlos G. D.
collection PubMed
description The well-known multivariate technique Principal Components Analysis (PCA) is usually applied to a sample, and so component scores are subjected to sampling variability. However, few studies address their stability, an important topic when the sample size is small. This work presents three validation procedures applied to PCA, based on confidence regions generated by a variant of a nonparametric bootstrap called the partial bootstrap: (i) the assessment of PC scores variability by the spread and overlapping of “confidence regions” plotted around these scores; (ii) the use of the confidence regions centroids as a validation set; and (iii) the definition of the number of nontrivial axes to be retained for analysis. The methods were applied to EEG data collected during a postural control protocol with twenty-four volunteers. Two axes were retained for analysis, with 91.6% of explained variance. Results showed that the area of the confidence regions provided useful insights on the variability of scores and suggested that some subjects were not distinguishable from others, which was not evident from the principal planes. In addition, potential outliers, initially suggested by an analysis of the first principal plane, could not be confirmed by the confidence regions.
format Online
Article
Text
id pubmed-4170877
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41708772014-09-28 Validation in Principal Components Analysis Applied to EEG Data Costa, João Carlos G. D. Da-Silva, Paulo José G. Almeida, Renan Moritz V. R. Infantosi, Antonio Fernando C. Comput Math Methods Med Research Article The well-known multivariate technique Principal Components Analysis (PCA) is usually applied to a sample, and so component scores are subjected to sampling variability. However, few studies address their stability, an important topic when the sample size is small. This work presents three validation procedures applied to PCA, based on confidence regions generated by a variant of a nonparametric bootstrap called the partial bootstrap: (i) the assessment of PC scores variability by the spread and overlapping of “confidence regions” plotted around these scores; (ii) the use of the confidence regions centroids as a validation set; and (iii) the definition of the number of nontrivial axes to be retained for analysis. The methods were applied to EEG data collected during a postural control protocol with twenty-four volunteers. Two axes were retained for analysis, with 91.6% of explained variance. Results showed that the area of the confidence regions provided useful insights on the variability of scores and suggested that some subjects were not distinguishable from others, which was not evident from the principal planes. In addition, potential outliers, initially suggested by an analysis of the first principal plane, could not be confirmed by the confidence regions. Hindawi Publishing Corporation 2014 2014-09-08 /pmc/articles/PMC4170877/ /pubmed/25276221 http://dx.doi.org/10.1155/2014/413801 Text en Copyright © 2014 João Carlos G. D. Costa et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Costa, João Carlos G. D.
Da-Silva, Paulo José G.
Almeida, Renan Moritz V. R.
Infantosi, Antonio Fernando C.
Validation in Principal Components Analysis Applied to EEG Data
title Validation in Principal Components Analysis Applied to EEG Data
title_full Validation in Principal Components Analysis Applied to EEG Data
title_fullStr Validation in Principal Components Analysis Applied to EEG Data
title_full_unstemmed Validation in Principal Components Analysis Applied to EEG Data
title_short Validation in Principal Components Analysis Applied to EEG Data
title_sort validation in principal components analysis applied to eeg data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4170877/
https://www.ncbi.nlm.nih.gov/pubmed/25276221
http://dx.doi.org/10.1155/2014/413801
work_keys_str_mv AT costajoaocarlosgd validationinprincipalcomponentsanalysisappliedtoeegdata
AT dasilvapaulojoseg validationinprincipalcomponentsanalysisappliedtoeegdata
AT almeidarenanmoritzvr validationinprincipalcomponentsanalysisappliedtoeegdata
AT infantosiantoniofernandoc validationinprincipalcomponentsanalysisappliedtoeegdata