Cargando…
Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments
BACKGROUND: As microarray technology has become mature and popular, the selection and use of a small number of relevant genes for accurate classification of samples has arisen as a hot topic in the circles of biostatistics and bioinformatics. However, most of the developed algorithms lack the abilit...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3833980/ https://www.ncbi.nlm.nih.gov/pubmed/24260109 http://dx.doi.org/10.1371/journal.pone.0078302 |
_version_ | 1782291922632572928 |
---|---|
author | Tian, Suyan Suárez-Fariñas, Mayte |
author_facet | Tian, Suyan Suárez-Fariñas, Mayte |
author_sort | Tian, Suyan |
collection | PubMed |
description | BACKGROUND: As microarray technology has become mature and popular, the selection and use of a small number of relevant genes for accurate classification of samples has arisen as a hot topic in the circles of biostatistics and bioinformatics. However, most of the developed algorithms lack the ability to handle multiple classes, arguably a common application. Here, we propose an extension to an existing regularization algorithm, called Threshold Gradient Descent Regularization (TGDR), to specifically tackle multi-class classification of microarray data. When there are several microarray experiments addressing the same/similar objectives, one option is to use a meta-analysis version of TGDR (Meta-TGDR), which considers the classification task as a combination of classifiers with the same structure/model while allowing the parameters to vary across studies. However, the original Meta-TGDR extension did not offer a solution to the prediction on independent samples. Here, we propose an explicit method to estimate the overall coefficients of the biomarkers selected by Meta-TGDR. This extension permits broader applicability and allows a comparison between the predictive performance of Meta-TGDR and TGDR using an independent testing set. RESULTS: Using real-world applications, we demonstrated the proposed multi-TGDR framework works well and the number of selected genes is less than the sum of all individualized binary TGDRs. Additionally, Meta-TGDR and TGDR on the batch-effect adjusted pooled data approximately provided same results. By adding Bagging procedure in each application, the stability and good predictive performance are warranted. CONCLUSIONS: Compared with Meta-TGDR, TGDR is less computing time intensive, and requires no samples of all classes in each study. On the adjusted data, it has approximate same predictive performance with Meta-TGDR. Thus, it is highly recommended. |
format | Online Article Text |
id | pubmed-3833980 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38339802013-11-20 Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments Tian, Suyan Suárez-Fariñas, Mayte PLoS One Research Article BACKGROUND: As microarray technology has become mature and popular, the selection and use of a small number of relevant genes for accurate classification of samples has arisen as a hot topic in the circles of biostatistics and bioinformatics. However, most of the developed algorithms lack the ability to handle multiple classes, arguably a common application. Here, we propose an extension to an existing regularization algorithm, called Threshold Gradient Descent Regularization (TGDR), to specifically tackle multi-class classification of microarray data. When there are several microarray experiments addressing the same/similar objectives, one option is to use a meta-analysis version of TGDR (Meta-TGDR), which considers the classification task as a combination of classifiers with the same structure/model while allowing the parameters to vary across studies. However, the original Meta-TGDR extension did not offer a solution to the prediction on independent samples. Here, we propose an explicit method to estimate the overall coefficients of the biomarkers selected by Meta-TGDR. This extension permits broader applicability and allows a comparison between the predictive performance of Meta-TGDR and TGDR using an independent testing set. RESULTS: Using real-world applications, we demonstrated the proposed multi-TGDR framework works well and the number of selected genes is less than the sum of all individualized binary TGDRs. Additionally, Meta-TGDR and TGDR on the batch-effect adjusted pooled data approximately provided same results. By adding Bagging procedure in each application, the stability and good predictive performance are warranted. CONCLUSIONS: Compared with Meta-TGDR, TGDR is less computing time intensive, and requires no samples of all classes in each study. On the adjusted data, it has approximate same predictive performance with Meta-TGDR. Thus, it is highly recommended. Public Library of Science 2013-11-19 /pmc/articles/PMC3833980/ /pubmed/24260109 http://dx.doi.org/10.1371/journal.pone.0078302 Text en © 2013 Tian, Suárez-Fariñas http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Tian, Suyan Suárez-Fariñas, Mayte Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments |
title | Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments |
title_full | Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments |
title_fullStr | Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments |
title_full_unstemmed | Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments |
title_short | Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments |
title_sort | multi-tgdr: a regularization method for multi-class classification in microarray experiments |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3833980/ https://www.ncbi.nlm.nih.gov/pubmed/24260109 http://dx.doi.org/10.1371/journal.pone.0078302 |
work_keys_str_mv | AT tiansuyan multitgdraregularizationmethodformulticlassclassificationinmicroarrayexperiments AT suarezfarinasmayte multitgdraregularizationmethodformulticlassclassificationinmicroarrayexperiments |