Cargando…
Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment
An essential factor influencing the efficiency of the predictive models built with principal component analysis (PCA) is the quality of the data clustering revealed by the score plots. The sensitivity and selectivity of the class assignment are strongly influenced by the relative position of the clu...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4158469/ https://www.ncbi.nlm.nih.gov/pubmed/25210644 http://dx.doi.org/10.1155/2014/342497 |
_version_ | 1782334050626699264 |
---|---|
author | Praisler, Mirela Ciochina, Stefanut |
author_facet | Praisler, Mirela Ciochina, Stefanut |
author_sort | Praisler, Mirela |
collection | PubMed |
description | An essential factor influencing the efficiency of the predictive models built with principal component analysis (PCA) is the quality of the data clustering revealed by the score plots. The sensitivity and selectivity of the class assignment are strongly influenced by the relative position of the clusters and by their dispersion. We are proposing a set of indicators inspired from analytical geometry that may be used for an objective quantitative assessment of the data clustering quality as well as a global clustering quality coefficient (GCQC) that is a measure of the overall predictive power of the PCA models. The use of these indicators for evaluating the efficiency of the PCA class assignment is illustrated by a comparative study performed for the identification of the preprocessing function that is generating the most efficient PCA system screening for amphetamines based on their GC-FTIR spectra. The GCQC ranking of the tested feature weights is explained based on estimated density distributions and validated by using quadratic discriminant analysis (QDA). |
format | Online Article Text |
id | pubmed-4158469 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-41584692014-09-10 Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment Praisler, Mirela Ciochina, Stefanut J Anal Methods Chem Research Article An essential factor influencing the efficiency of the predictive models built with principal component analysis (PCA) is the quality of the data clustering revealed by the score plots. The sensitivity and selectivity of the class assignment are strongly influenced by the relative position of the clusters and by their dispersion. We are proposing a set of indicators inspired from analytical geometry that may be used for an objective quantitative assessment of the data clustering quality as well as a global clustering quality coefficient (GCQC) that is a measure of the overall predictive power of the PCA models. The use of these indicators for evaluating the efficiency of the PCA class assignment is illustrated by a comparative study performed for the identification of the preprocessing function that is generating the most efficient PCA system screening for amphetamines based on their GC-FTIR spectra. The GCQC ranking of the tested feature weights is explained based on estimated density distributions and validated by using quadratic discriminant analysis (QDA). Hindawi Publishing Corporation 2014 2014-08-26 /pmc/articles/PMC4158469/ /pubmed/25210644 http://dx.doi.org/10.1155/2014/342497 Text en Copyright © 2014 M. Praisler and S. Ciochina. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Praisler, Mirela Ciochina, Stefanut Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment |
title | Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment |
title_full | Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment |
title_fullStr | Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment |
title_full_unstemmed | Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment |
title_short | Global Clustering Quality Coefficient Assessing the Efficiency of PCA Class Assignment |
title_sort | global clustering quality coefficient assessing the efficiency of pca class assignment |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4158469/ https://www.ncbi.nlm.nih.gov/pubmed/25210644 http://dx.doi.org/10.1155/2014/342497 |
work_keys_str_mv | AT praislermirela globalclusteringqualitycoefficientassessingtheefficiencyofpcaclassassignment AT ciochinastefanut globalclusteringqualitycoefficientassessingtheefficiencyofpcaclassassignment |