Cargando…

Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment

An essential factor influencing the efficiency of the predictive models built with principal component analysis (PCA) is the quality of the data clustering revealed by the score plots. The sensitivity and selectivity of the class assignment are strongly influenced by the relative position of the clu...

Descripción completa

Detalles Bibliográficos
Autores principales: Praisler, Mirela, Ciochina, Stefanut
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4158469/
https://www.ncbi.nlm.nih.gov/pubmed/25210644
http://dx.doi.org/10.1155/2014/342497
_version_ 1782334050626699264
author Praisler, Mirela
Ciochina, Stefanut
author_facet Praisler, Mirela
Ciochina, Stefanut
author_sort Praisler, Mirela
collection PubMed
description An essential factor influencing the efficiency of the predictive models built with principal component analysis (PCA) is the quality of the data clustering revealed by the score plots. The sensitivity and selectivity of the class assignment are strongly influenced by the relative position of the clusters and by their dispersion. We are proposing a set of indicators inspired from analytical geometry that may be used for an objective quantitative assessment of the data clustering quality as well as a global clustering quality coefficient (GCQC) that is a measure of the overall predictive power of the PCA models. The use of these indicators for evaluating the efficiency of the PCA class assignment is illustrated by a comparative study performed for the identification of the preprocessing function that is generating the most efficient PCA system screening for amphetamines based on their GC-FTIR spectra. The GCQC ranking of the tested feature weights is explained based on estimated density distributions and validated by using quadratic discriminant analysis (QDA).
format Online
Article
Text
id pubmed-4158469
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41584692014-09-10 Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment Praisler, Mirela Ciochina, Stefanut J Anal Methods Chem Research Article An essential factor influencing the efficiency of the predictive models built with principal component analysis (PCA) is the quality of the data clustering revealed by the score plots. The sensitivity and selectivity of the class assignment are strongly influenced by the relative position of the clusters and by their dispersion. We are proposing a set of indicators inspired from analytical geometry that may be used for an objective quantitative assessment of the data clustering quality as well as a global clustering quality coefficient (GCQC) that is a measure of the overall predictive power of the PCA models. The use of these indicators for evaluating the efficiency of the PCA class assignment is illustrated by a comparative study performed for the identification of the preprocessing function that is generating the most efficient PCA system screening for amphetamines based on their GC-FTIR spectra. The GCQC ranking of the tested feature weights is explained based on estimated density distributions and validated by using quadratic discriminant analysis (QDA). Hindawi Publishing Corporation 2014 2014-08-26 /pmc/articles/PMC4158469/ /pubmed/25210644 http://dx.doi.org/10.1155/2014/342497 Text en Copyright © 2014 M. Praisler and S. Ciochina. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Praisler, Mirela
Ciochina, Stefanut
Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment
title Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment
title_full Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment
title_fullStr Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment
title_full_unstemmed Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment
title_short Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment
title_sort global clustering quality coefficient assessing the efficiency of pca class assignment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4158469/
https://www.ncbi.nlm.nih.gov/pubmed/25210644
http://dx.doi.org/10.1155/2014/342497
work_keys_str_mv AT praislermirela globalclusteringqualitycoefficientassessingtheefficiencyofpcaclassassignment
AT ciochinastefanut globalclusteringqualitycoefficientassessingtheefficiencyofpcaclassassignment