Cargando…

Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis

[Image: see text] Partial least squares discriminant analysis (PLS-DA) is a well-known technique for feature extraction and discriminant analysis in chemometrics. Despite its popularity, it has been observed that PLS-DA does not automatically lead to extraction of relevant features. Feature learning...

Descripción completa

Detalles Bibliográficos
Autores principales: Aminu, Muhammad, Ahmad, Noor Atinah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2020
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581267/
https://www.ncbi.nlm.nih.gov/pubmed/33110988
http://dx.doi.org/10.1021/acsomega.0c03362
_version_ 1783598943464062976
author Aminu, Muhammad
Ahmad, Noor Atinah
author_facet Aminu, Muhammad
Ahmad, Noor Atinah
author_sort Aminu, Muhammad
collection PubMed
description [Image: see text] Partial least squares discriminant analysis (PLS-DA) is a well-known technique for feature extraction and discriminant analysis in chemometrics. Despite its popularity, it has been observed that PLS-DA does not automatically lead to extraction of relevant features. Feature learning and extraction depends on how well the discriminant subspace is captured. In this paper, discriminant subspace learning of chemical data is discussed from the perspective of PLS-DA and a recent extension of PLS-DA, which is known as the locality preserving partial least squares discriminant analysis (LPPLS-DA). The objective is twofold: (a) to introduce the LPPLS-DA algorithm to the chemometrics community and (b) to demonstrate the superior discrimination capabilities of LPPLS-DA and how it can be a powerful alternative to PLS-DA. Four chemical data sets are used: three spectroscopic data sets and one that contains compositional data. Comparative performances are measured based on discrimination and classification of these data sets. To compare the classification performances, the data samples are projected onto the PLS-DA and LPPLS-DA subspaces, and classification of the projected samples into one of the different groups (classes) is done using the nearest-neighbor classifier. We also compare the two techniques in data visualization (discrimination) task. The ability of LPPLS-DA to group samples from the same class while at the same time maximizing the between-class separation is clearly shown in our results. In comparison with PLS-DA, separation of data in the projected LPPLS-DA subspace is more well defined.
format Online
Article
Text
id pubmed-7581267
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-75812672020-10-26 Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis Aminu, Muhammad Ahmad, Noor Atinah ACS Omega [Image: see text] Partial least squares discriminant analysis (PLS-DA) is a well-known technique for feature extraction and discriminant analysis in chemometrics. Despite its popularity, it has been observed that PLS-DA does not automatically lead to extraction of relevant features. Feature learning and extraction depends on how well the discriminant subspace is captured. In this paper, discriminant subspace learning of chemical data is discussed from the perspective of PLS-DA and a recent extension of PLS-DA, which is known as the locality preserving partial least squares discriminant analysis (LPPLS-DA). The objective is twofold: (a) to introduce the LPPLS-DA algorithm to the chemometrics community and (b) to demonstrate the superior discrimination capabilities of LPPLS-DA and how it can be a powerful alternative to PLS-DA. Four chemical data sets are used: three spectroscopic data sets and one that contains compositional data. Comparative performances are measured based on discrimination and classification of these data sets. To compare the classification performances, the data samples are projected onto the PLS-DA and LPPLS-DA subspaces, and classification of the projected samples into one of the different groups (classes) is done using the nearest-neighbor classifier. We also compare the two techniques in data visualization (discrimination) task. The ability of LPPLS-DA to group samples from the same class while at the same time maximizing the between-class separation is clearly shown in our results. In comparison with PLS-DA, separation of data in the projected LPPLS-DA subspace is more well defined. American Chemical Society 2020-10-09 /pmc/articles/PMC7581267/ /pubmed/33110988 http://dx.doi.org/10.1021/acsomega.0c03362 Text en © 2020 American Chemical Society This is an open access article published under a Creative Commons Attribution (CC-BY) License (http://pubs.acs.org/page/policy/authorchoice_ccby_termsofuse.html) , which permits unrestricted use, distribution and reproduction in any medium, provided the author and source are cited.
spellingShingle Aminu, Muhammad
Ahmad, Noor Atinah
Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis
title Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis
title_full Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis
title_fullStr Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis
title_full_unstemmed Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis
title_short Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis
title_sort complex chemical data classification and discrimination using locality preserving partial least squares discriminant analysis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581267/
https://www.ncbi.nlm.nih.gov/pubmed/33110988
http://dx.doi.org/10.1021/acsomega.0c03362
work_keys_str_mv AT aminumuhammad complexchemicaldataclassificationanddiscriminationusinglocalitypreservingpartialleastsquaresdiscriminantanalysis
AT ahmadnooratinah complexchemicaldataclassificationanddiscriminationusinglocalitypreservingpartialleastsquaresdiscriminantanalysis