Latent Model-Based Clustering for Biological Discovery
LOVE, a robust, scalable latent model-based clustering method for biological discovery, can be used across a range of datasets to generate both overlapping and non-overlapping clusters. In our formulation, a cluster comprises variables associated with the same latent factor and is determined from an...
Main authors: | Bing, Xin; Bunea, Florentina; Royer, Martin; Das, Jishnu |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2019 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6449745/ https://www.ncbi.nlm.nih.gov/pubmed/30954780 http://dx.doi.org/10.1016/j.isci.2019.03.018 |
_version_ | 1783408915326697472 |
---|---|
author | Bing, Xin; Bunea, Florentina; Royer, Martin; Das, Jishnu |
collection | PubMed |
description | LOVE, a robust, scalable latent model-based clustering method for biological discovery, can be used across a range of datasets to generate both overlapping and non-overlapping clusters. In our formulation, a cluster comprises variables associated with the same latent factor and is determined from an allocation matrix that indexes our latent model. We prove that the allocation matrix and corresponding clusters are uniquely defined. We apply LOVE to biological datasets (gene expression, serological responses measured from HIV controllers and chronic progressors, vaccine-induced humoral immune responses) resulting in meaningful biological output. For all three datasets, the clusters generated by LOVE remain stable across tuning parameters. Finally, we compared LOVE's performance to that of 13 state-of-the-art methods using previously established benchmarks and found that LOVE outperformed these methods across datasets. Our results demonstrate that LOVE can be broadly used across large-scale biological datasets to generate accurate and meaningful overlapping and non-overlapping clusters. |
format | Online Article Text |
id | pubmed-6449745 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-6449745 2019-04-16. Latent Model-Based Clustering for Biological Discovery. Bing, Xin; Bunea, Florentina; Royer, Martin; Das, Jishnu. iScience. Article. Elsevier, 2019-03-21. /pmc/articles/PMC6449745/ /pubmed/30954780 http://dx.doi.org/10.1016/j.isci.2019.03.018 Text en. © 2019 The Authors. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
title | Latent Model-Based Clustering for Biological Discovery |
topic | Article |