Cargando…

An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA

Classification studies are widely applied, e.g. in biomedical research to classify objects/patients into predefined groups. The goal is to find a classification function/rule which assigns each object/patient to a unique group with the greatest possible accuracy (classification error). Especially in...

Descripción completa

Detalles Bibliográficos
Autores principales: Telaar, Anna, Liland, Kristian Hovde, Repsilber, Dirk, Nürnberg, Gerd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3569448/
https://www.ncbi.nlm.nih.gov/pubmed/23408965
http://dx.doi.org/10.1371/journal.pone.0055267
_version_ 1782258908858941440
author Telaar, Anna
Liland, Kristian Hovde
Repsilber, Dirk
Nürnberg, Gerd
author_facet Telaar, Anna
Liland, Kristian Hovde
Repsilber, Dirk
Nürnberg, Gerd
author_sort Telaar, Anna
collection PubMed
description Classification studies are widely applied, e.g. in biomedical research to classify objects/patients into predefined groups. The goal is to find a classification function/rule which assigns each object/patient to a unique group with the greatest possible accuracy (classification error). Especially in gene expression experiments often a lot of variables (genes) are measured for only few objects/patients. A suitable approach is the well-known method PLS-DA, which searches for a transformation to a lower dimensional space. Resulting new components are linear combinations of the original variables. An advancement of PLS-DA leads to PPLS-DA, introducing a so called ‘power parameter’, which is maximized towards the correlation between the components and the group-membership. We introduce an extension of PPLS-DA for optimizing this power parameter towards the final aim, namely towards a minimal classification error. We compare this new extension with the original PPLS-DA and also with the ordinary PLS-DA using simulated and experimental datasets. For the investigated data sets with weak linear dependency between features/variables, no improvement is shown for PPLS-DA and for the extensions compared to PLS-DA. A very weak linear dependency, a low proportion of differentially expressed genes for simulated data, does not lead to an improvement of PPLS-DA over PLS-DA, but our extension shows a lower prediction error. On the contrary, for the data set with strong between-feature collinearity and a low proportion of differentially expressed genes and a large total number of genes, the prediction error of PPLS-DA and the extensions is clearly lower than for PLS-DA. Moreover we compare these prediction results with results of support vector machines with linear kernel and linear discriminant analysis.
format Online
Article
Text
id pubmed-3569448
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35694482013-02-13 An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA Telaar, Anna Liland, Kristian Hovde Repsilber, Dirk Nürnberg, Gerd PLoS One Research Article Classification studies are widely applied, e.g. in biomedical research to classify objects/patients into predefined groups. The goal is to find a classification function/rule which assigns each object/patient to a unique group with the greatest possible accuracy (classification error). Especially in gene expression experiments often a lot of variables (genes) are measured for only few objects/patients. A suitable approach is the well-known method PLS-DA, which searches for a transformation to a lower dimensional space. Resulting new components are linear combinations of the original variables. An advancement of PLS-DA leads to PPLS-DA, introducing a so called ‘power parameter’, which is maximized towards the correlation between the components and the group-membership. We introduce an extension of PPLS-DA for optimizing this power parameter towards the final aim, namely towards a minimal classification error. We compare this new extension with the original PPLS-DA and also with the ordinary PLS-DA using simulated and experimental datasets. For the investigated data sets with weak linear dependency between features/variables, no improvement is shown for PPLS-DA and for the extensions compared to PLS-DA. A very weak linear dependency, a low proportion of differentially expressed genes for simulated data, does not lead to an improvement of PPLS-DA over PLS-DA, but our extension shows a lower prediction error. On the contrary, for the data set with strong between-feature collinearity and a low proportion of differentially expressed genes and a large total number of genes, the prediction error of PPLS-DA and the extensions is clearly lower than for PLS-DA. Moreover we compare these prediction results with results of support vector machines with linear kernel and linear discriminant analysis. Public Library of Science 2013-02-11 /pmc/articles/PMC3569448/ /pubmed/23408965 http://dx.doi.org/10.1371/journal.pone.0055267 Text en © 2013 Telaar et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Telaar, Anna
Liland, Kristian Hovde
Repsilber, Dirk
Nürnberg, Gerd
An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA
title An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA
title_full An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA
title_fullStr An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA
title_full_unstemmed An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA
title_short An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA
title_sort extension of ppls-da for classification and comparison to ordinary pls-da
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3569448/
https://www.ncbi.nlm.nih.gov/pubmed/23408965
http://dx.doi.org/10.1371/journal.pone.0055267
work_keys_str_mv AT telaaranna anextensionofpplsdaforclassificationandcomparisontoordinaryplsda
AT lilandkristianhovde anextensionofpplsdaforclassificationandcomparisontoordinaryplsda
AT repsilberdirk anextensionofpplsdaforclassificationandcomparisontoordinaryplsda
AT nurnberggerd anextensionofpplsdaforclassificationandcomparisontoordinaryplsda
AT telaaranna extensionofpplsdaforclassificationandcomparisontoordinaryplsda
AT lilandkristianhovde extensionofpplsdaforclassificationandcomparisontoordinaryplsda
AT repsilberdirk extensionofpplsdaforclassificationandcomparisontoordinaryplsda
AT nurnberggerd extensionofpplsdaforclassificationandcomparisontoordinaryplsda