Cargando…

Inferring latent task structure for Multitask Learning by Multiple Kernel Learning

BACKGROUND: The lack of sufficient training data is the limiting factor for many Machine Learning applications in Computational Biology. If data is available for several different but related problem domains, Multitask Learning algorithms can be used to learn a model based on all available informati...

Descripción completa

Detalles Bibliográficos
Autores principales: Widmer, Christian, Toussaint, Nora C, Altun, Yasemin, Rätsch, Gunnar
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2966292/
https://www.ncbi.nlm.nih.gov/pubmed/21034430
http://dx.doi.org/10.1186/1471-2105-11-S8-S5
_version_ 1782189567115264000
author Widmer, Christian
Toussaint, Nora C
Altun, Yasemin
Rätsch, Gunnar
author_facet Widmer, Christian
Toussaint, Nora C
Altun, Yasemin
Rätsch, Gunnar
author_sort Widmer, Christian
collection PubMed
description BACKGROUND: The lack of sufficient training data is the limiting factor for many Machine Learning applications in Computational Biology. If data is available for several different but related problem domains, Multitask Learning algorithms can be used to learn a model based on all available information. In Bioinformatics, many problems can be cast into the Multitask Learning scenario by incorporating data from several organisms. However, combining information from several tasks requires careful consideration of the degree of similarity between tasks. Our proposed method simultaneously learns or refines the similarity between tasks along with the Multitask Learning classifier. This is done by formulating the Multitask Learning problem as Multiple Kernel Learning, using the recently published q-Norm MKL algorithm. RESULTS: We demonstrate the performance of our method on two problems from Computational Biology. First, we show that our method is able to improve performance on a splice site dataset with given hierarchical task structure by refining the task relationships. Second, we consider an MHC-I dataset, for which we assume no knowledge about the degree of task relatedness. Here, we are able to learn the task similarities ab initio along with the Multitask classifiers. In both cases, we outperform baseline methods that we compare against. CONCLUSIONS: We present a novel approach to Multitask Learning that is capable of learning task similarity along with the classifiers. The framework is very general as it allows to incorporate prior knowledge about tasks relationships if available, but is also able to identify task similarities in absence of such prior information. Both variants show promising results in applications from Computational Biology.
format Text
id pubmed-2966292
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29662922010-10-30 Inferring latent task structure for Multitask Learning by Multiple Kernel Learning Widmer, Christian Toussaint, Nora C Altun, Yasemin Rätsch, Gunnar BMC Bioinformatics Research BACKGROUND: The lack of sufficient training data is the limiting factor for many Machine Learning applications in Computational Biology. If data is available for several different but related problem domains, Multitask Learning algorithms can be used to learn a model based on all available information. In Bioinformatics, many problems can be cast into the Multitask Learning scenario by incorporating data from several organisms. However, combining information from several tasks requires careful consideration of the degree of similarity between tasks. Our proposed method simultaneously learns or refines the similarity between tasks along with the Multitask Learning classifier. This is done by formulating the Multitask Learning problem as Multiple Kernel Learning, using the recently published q-Norm MKL algorithm. RESULTS: We demonstrate the performance of our method on two problems from Computational Biology. First, we show that our method is able to improve performance on a splice site dataset with given hierarchical task structure by refining the task relationships. Second, we consider an MHC-I dataset, for which we assume no knowledge about the degree of task relatedness. Here, we are able to learn the task similarities ab initio along with the Multitask classifiers. In both cases, we outperform baseline methods that we compare against. CONCLUSIONS: We present a novel approach to Multitask Learning that is capable of learning task similarity along with the classifiers. The framework is very general as it allows to incorporate prior knowledge about tasks relationships if available, but is also able to identify task similarities in absence of such prior information. Both variants show promising results in applications from Computational Biology. BioMed Central 2010-10-26 /pmc/articles/PMC2966292/ /pubmed/21034430 http://dx.doi.org/10.1186/1471-2105-11-S8-S5 Text en Copyright ©2010 Widmer et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Widmer, Christian
Toussaint, Nora C
Altun, Yasemin
Rätsch, Gunnar
Inferring latent task structure for Multitask Learning by Multiple Kernel Learning
title Inferring latent task structure for Multitask Learning by Multiple Kernel Learning
title_full Inferring latent task structure for Multitask Learning by Multiple Kernel Learning
title_fullStr Inferring latent task structure for Multitask Learning by Multiple Kernel Learning
title_full_unstemmed Inferring latent task structure for Multitask Learning by Multiple Kernel Learning
title_short Inferring latent task structure for Multitask Learning by Multiple Kernel Learning
title_sort inferring latent task structure for multitask learning by multiple kernel learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2966292/
https://www.ncbi.nlm.nih.gov/pubmed/21034430
http://dx.doi.org/10.1186/1471-2105-11-S8-S5
work_keys_str_mv AT widmerchristian inferringlatenttaskstructureformultitasklearningbymultiplekernellearning
AT toussaintnorac inferringlatenttaskstructureformultitasklearningbymultiplekernellearning
AT altunyasemin inferringlatenttaskstructureformultitasklearningbymultiplekernellearning
AT ratschgunnar inferringlatenttaskstructureformultitasklearningbymultiplekernellearning