Cargando…

Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models

We consider multi-response and multi-task regression models, where the parameter matrix to be estimated is expected to have an unknown grouping structure. The groupings can be along tasks, or features, or both, the last one indicating a bi-cluster or “checkerboard” structure. Discovering this groupi...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Ming, Natesan Ramamurthy, Karthikeyan, Thompson, Addie, Lozano, Aurélie C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931892/
https://www.ncbi.nlm.nih.gov/pubmed/33693350
http://dx.doi.org/10.3389/fdata.2019.00027
_version_ 1783660376593793024
author Yu, Ming
Natesan Ramamurthy, Karthikeyan
Thompson, Addie
Lozano, Aurélie C.
author_facet Yu, Ming
Natesan Ramamurthy, Karthikeyan
Thompson, Addie
Lozano, Aurélie C.
author_sort Yu, Ming
collection PubMed
description We consider multi-response and multi-task regression models, where the parameter matrix to be estimated is expected to have an unknown grouping structure. The groupings can be along tasks, or features, or both, the last one indicating a bi-cluster or “checkerboard” structure. Discovering this grouping structure along with parameter inference makes sense in several applications, such as multi-response Genome-Wide Association Studies (GWAS). By inferring this additional structure we can obtain valuable information on the underlying data mechanisms (e.g., relationships among genotypes and phenotypes in GWAS). In this paper, we propose two formulations to simultaneously learn the parameter matrix and its group structures, based on convex regularization penalties. We present optimization approaches to solve the resulting problems and provide numerical convergence guarantees. Extensive experiments demonstrate much better clustering quality compared to other methods, and our approaches are also validated on real datasets concerning phenotypes and genotypes of plant varieties.
format Online
Article
Text
id pubmed-7931892
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79318922021-03-09 Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models Yu, Ming Natesan Ramamurthy, Karthikeyan Thompson, Addie Lozano, Aurélie C. Front Big Data Big Data We consider multi-response and multi-task regression models, where the parameter matrix to be estimated is expected to have an unknown grouping structure. The groupings can be along tasks, or features, or both, the last one indicating a bi-cluster or “checkerboard” structure. Discovering this grouping structure along with parameter inference makes sense in several applications, such as multi-response Genome-Wide Association Studies (GWAS). By inferring this additional structure we can obtain valuable information on the underlying data mechanisms (e.g., relationships among genotypes and phenotypes in GWAS). In this paper, we propose two formulations to simultaneously learn the parameter matrix and its group structures, based on convex regularization penalties. We present optimization approaches to solve the resulting problems and provide numerical convergence guarantees. Extensive experiments demonstrate much better clustering quality compared to other methods, and our approaches are also validated on real datasets concerning phenotypes and genotypes of plant varieties. Frontiers Media S.A. 2019-08-14 /pmc/articles/PMC7931892/ /pubmed/33693350 http://dx.doi.org/10.3389/fdata.2019.00027 Text en Copyright © 2019 Yu, Natesan Ramamurthy, Thompson and Lozano. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Yu, Ming
Natesan Ramamurthy, Karthikeyan
Thompson, Addie
Lozano, Aurélie C.
Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models
title Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models
title_full Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models
title_fullStr Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models
title_full_unstemmed Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models
title_short Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models
title_sort simultaneous parameter learning and bi-clustering for multi-response models
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931892/
https://www.ncbi.nlm.nih.gov/pubmed/33693350
http://dx.doi.org/10.3389/fdata.2019.00027
work_keys_str_mv AT yuming simultaneousparameterlearningandbiclusteringformultiresponsemodels
AT natesanramamurthykarthikeyan simultaneousparameterlearningandbiclusteringformultiresponsemodels
AT thompsonaddie simultaneousparameterlearningandbiclusteringformultiresponsemodels
AT lozanoaureliec simultaneousparameterlearningandbiclusteringformultiresponsemodels