Cargando…

Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments

BACKGROUND: As microarray technology has become mature and popular, the selection and use of a small number of relevant genes for accurate classification of samples has arisen as a hot topic in the circles of biostatistics and bioinformatics. However, most of the developed algorithms lack the abilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Suyan, Suárez-Fariñas, Mayte
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3833980/
https://www.ncbi.nlm.nih.gov/pubmed/24260109
http://dx.doi.org/10.1371/journal.pone.0078302
_version_ 1782291922632572928
author Tian, Suyan
Suárez-Fariñas, Mayte
author_facet Tian, Suyan
Suárez-Fariñas, Mayte
author_sort Tian, Suyan
collection PubMed
description BACKGROUND: As microarray technology has become mature and popular, the selection and use of a small number of relevant genes for accurate classification of samples has arisen as a hot topic in the circles of biostatistics and bioinformatics. However, most of the developed algorithms lack the ability to handle multiple classes, arguably a common application. Here, we propose an extension to an existing regularization algorithm, called Threshold Gradient Descent Regularization (TGDR), to specifically tackle multi-class classification of microarray data. When there are several microarray experiments addressing the same/similar objectives, one option is to use a meta-analysis version of TGDR (Meta-TGDR), which considers the classification task as a combination of classifiers with the same structure/model while allowing the parameters to vary across studies. However, the original Meta-TGDR extension did not offer a solution to the prediction on independent samples. Here, we propose an explicit method to estimate the overall coefficients of the biomarkers selected by Meta-TGDR. This extension permits broader applicability and allows a comparison between the predictive performance of Meta-TGDR and TGDR using an independent testing set. RESULTS: Using real-world applications, we demonstrated the proposed multi-TGDR framework works well and the number of selected genes is less than the sum of all individualized binary TGDRs. Additionally, Meta-TGDR and TGDR on the batch-effect adjusted pooled data approximately provided same results. By adding Bagging procedure in each application, the stability and good predictive performance are warranted. CONCLUSIONS: Compared with Meta-TGDR, TGDR is less computing time intensive, and requires no samples of all classes in each study. On the adjusted data, it has approximate same predictive performance with Meta-TGDR. Thus, it is highly recommended.
format Online
Article
Text
id pubmed-3833980
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38339802013-11-20 Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments Tian, Suyan Suárez-Fariñas, Mayte PLoS One Research Article BACKGROUND: As microarray technology has become mature and popular, the selection and use of a small number of relevant genes for accurate classification of samples has arisen as a hot topic in the circles of biostatistics and bioinformatics. However, most of the developed algorithms lack the ability to handle multiple classes, arguably a common application. Here, we propose an extension to an existing regularization algorithm, called Threshold Gradient Descent Regularization (TGDR), to specifically tackle multi-class classification of microarray data. When there are several microarray experiments addressing the same/similar objectives, one option is to use a meta-analysis version of TGDR (Meta-TGDR), which considers the classification task as a combination of classifiers with the same structure/model while allowing the parameters to vary across studies. However, the original Meta-TGDR extension did not offer a solution to the prediction on independent samples. Here, we propose an explicit method to estimate the overall coefficients of the biomarkers selected by Meta-TGDR. This extension permits broader applicability and allows a comparison between the predictive performance of Meta-TGDR and TGDR using an independent testing set. RESULTS: Using real-world applications, we demonstrated the proposed multi-TGDR framework works well and the number of selected genes is less than the sum of all individualized binary TGDRs. Additionally, Meta-TGDR and TGDR on the batch-effect adjusted pooled data approximately provided same results. By adding Bagging procedure in each application, the stability and good predictive performance are warranted. CONCLUSIONS: Compared with Meta-TGDR, TGDR is less computing time intensive, and requires no samples of all classes in each study. On the adjusted data, it has approximate same predictive performance with Meta-TGDR. Thus, it is highly recommended. Public Library of Science 2013-11-19 /pmc/articles/PMC3833980/ /pubmed/24260109 http://dx.doi.org/10.1371/journal.pone.0078302 Text en © 2013 Tian, Suárez-Fariñas http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Tian, Suyan
Suárez-Fariñas, Mayte
Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments
title Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments
title_full Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments
title_fullStr Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments
title_full_unstemmed Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments
title_short Multi-TGDR: A Regularization Method for Multi-Class Classification in Microarray Experiments
title_sort multi-tgdr: a regularization method for multi-class classification in microarray experiments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3833980/
https://www.ncbi.nlm.nih.gov/pubmed/24260109
http://dx.doi.org/10.1371/journal.pone.0078302
work_keys_str_mv AT tiansuyan multitgdraregularizationmethodformulticlassclassificationinmicroarrayexperiments
AT suarezfarinasmayte multitgdraregularizationmethodformulticlassclassificationinmicroarrayexperiments