Cargando…
Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis
[Image: see text] Partial least squares discriminant analysis (PLS-DA) is a well-known technique for feature extraction and discriminant analysis in chemometrics. Despite its popularity, it has been observed that PLS-DA does not automatically lead to extraction of relevant features. Feature learning...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2020
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581267/ https://www.ncbi.nlm.nih.gov/pubmed/33110988 http://dx.doi.org/10.1021/acsomega.0c03362 |
_version_ | 1783598943464062976 |
---|---|
author | Aminu, Muhammad Ahmad, Noor Atinah |
author_facet | Aminu, Muhammad Ahmad, Noor Atinah |
author_sort | Aminu, Muhammad |
collection | PubMed |
description | [Image: see text] Partial least squares discriminant analysis (PLS-DA) is a well-known technique for feature extraction and discriminant analysis in chemometrics. Despite its popularity, it has been observed that PLS-DA does not automatically lead to extraction of relevant features. Feature learning and extraction depends on how well the discriminant subspace is captured. In this paper, discriminant subspace learning of chemical data is discussed from the perspective of PLS-DA and a recent extension of PLS-DA, which is known as the locality preserving partial least squares discriminant analysis (LPPLS-DA). The objective is twofold: (a) to introduce the LPPLS-DA algorithm to the chemometrics community and (b) to demonstrate the superior discrimination capabilities of LPPLS-DA and how it can be a powerful alternative to PLS-DA. Four chemical data sets are used: three spectroscopic data sets and one that contains compositional data. Comparative performances are measured based on discrimination and classification of these data sets. To compare the classification performances, the data samples are projected onto the PLS-DA and LPPLS-DA subspaces, and classification of the projected samples into one of the different groups (classes) is done using the nearest-neighbor classifier. We also compare the two techniques in data visualization (discrimination) task. The ability of LPPLS-DA to group samples from the same class while at the same time maximizing the between-class separation is clearly shown in our results. In comparison with PLS-DA, separation of data in the projected LPPLS-DA subspace is more well defined. |
format | Online Article Text |
id | pubmed-7581267 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-75812672020-10-26 Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis Aminu, Muhammad Ahmad, Noor Atinah ACS Omega [Image: see text] Partial least squares discriminant analysis (PLS-DA) is a well-known technique for feature extraction and discriminant analysis in chemometrics. Despite its popularity, it has been observed that PLS-DA does not automatically lead to extraction of relevant features. Feature learning and extraction depends on how well the discriminant subspace is captured. In this paper, discriminant subspace learning of chemical data is discussed from the perspective of PLS-DA and a recent extension of PLS-DA, which is known as the locality preserving partial least squares discriminant analysis (LPPLS-DA). The objective is twofold: (a) to introduce the LPPLS-DA algorithm to the chemometrics community and (b) to demonstrate the superior discrimination capabilities of LPPLS-DA and how it can be a powerful alternative to PLS-DA. Four chemical data sets are used: three spectroscopic data sets and one that contains compositional data. Comparative performances are measured based on discrimination and classification of these data sets. To compare the classification performances, the data samples are projected onto the PLS-DA and LPPLS-DA subspaces, and classification of the projected samples into one of the different groups (classes) is done using the nearest-neighbor classifier. We also compare the two techniques in data visualization (discrimination) task. The ability of LPPLS-DA to group samples from the same class while at the same time maximizing the between-class separation is clearly shown in our results. In comparison with PLS-DA, separation of data in the projected LPPLS-DA subspace is more well defined. American Chemical Society 2020-10-09 /pmc/articles/PMC7581267/ /pubmed/33110988 http://dx.doi.org/10.1021/acsomega.0c03362 Text en © 2020 American Chemical Society This is an open access article published under a Creative Commons Attribution (CC-BY) License (http://pubs.acs.org/page/policy/authorchoice_ccby_termsofuse.html) , which permits unrestricted use, distribution and reproduction in any medium, provided the author and source are cited. |
spellingShingle | Aminu, Muhammad Ahmad, Noor Atinah Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis |
title | Complex Chemical Data Classification and Discrimination
Using Locality Preserving Partial Least Squares Discriminant Analysis |
title_full | Complex Chemical Data Classification and Discrimination
Using Locality Preserving Partial Least Squares Discriminant Analysis |
title_fullStr | Complex Chemical Data Classification and Discrimination
Using Locality Preserving Partial Least Squares Discriminant Analysis |
title_full_unstemmed | Complex Chemical Data Classification and Discrimination
Using Locality Preserving Partial Least Squares Discriminant Analysis |
title_short | Complex Chemical Data Classification and Discrimination
Using Locality Preserving Partial Least Squares Discriminant Analysis |
title_sort | complex chemical data classification and discrimination
using locality preserving partial least squares discriminant analysis |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581267/ https://www.ncbi.nlm.nih.gov/pubmed/33110988 http://dx.doi.org/10.1021/acsomega.0c03362 |
work_keys_str_mv | AT aminumuhammad complexchemicaldataclassificationanddiscriminationusinglocalitypreservingpartialleastsquaresdiscriminantanalysis AT ahmadnooratinah complexchemicaldataclassificationanddiscriminationusinglocalitypreservingpartialleastsquaresdiscriminantanalysis |