Cargando…
Directionally dependent multi-view clustering using copula model
Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for reveali...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7584221/ https://www.ncbi.nlm.nih.gov/pubmed/33095785 http://dx.doi.org/10.1371/journal.pone.0238996 |
_version_ | 1783599552772702208 |
---|---|
author | Afrin, Kahkashan Iquebal, Ashif S. Karimi, Mostafa Souris, Allyson Lee, Se Yoon Mallick, Bani K. |
author_facet | Afrin, Kahkashan Iquebal, Ashif S. Karimi, Mostafa Souris, Allyson Lee, Se Yoon Mallick, Bani K. |
author_sort | Afrin, Kahkashan |
collection | PubMed |
description | Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results. |
format | Online Article Text |
id | pubmed-7584221 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-75842212020-10-28 Directionally dependent multi-view clustering using copula model Afrin, Kahkashan Iquebal, Ashif S. Karimi, Mostafa Souris, Allyson Lee, Se Yoon Mallick, Bani K. PLoS One Research Article Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results. Public Library of Science 2020-10-23 /pmc/articles/PMC7584221/ /pubmed/33095785 http://dx.doi.org/10.1371/journal.pone.0238996 Text en © 2020 Afrin et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Afrin, Kahkashan Iquebal, Ashif S. Karimi, Mostafa Souris, Allyson Lee, Se Yoon Mallick, Bani K. Directionally dependent multi-view clustering using copula model |
title | Directionally dependent multi-view clustering using copula model |
title_full | Directionally dependent multi-view clustering using copula model |
title_fullStr | Directionally dependent multi-view clustering using copula model |
title_full_unstemmed | Directionally dependent multi-view clustering using copula model |
title_short | Directionally dependent multi-view clustering using copula model |
title_sort | directionally dependent multi-view clustering using copula model |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7584221/ https://www.ncbi.nlm.nih.gov/pubmed/33095785 http://dx.doi.org/10.1371/journal.pone.0238996 |
work_keys_str_mv | AT afrinkahkashan directionallydependentmultiviewclusteringusingcopulamodel AT iquebalashifs directionallydependentmultiviewclusteringusingcopulamodel AT karimimostafa directionallydependentmultiviewclusteringusingcopulamodel AT sourisallyson directionallydependentmultiviewclusteringusingcopulamodel AT leeseyoon directionallydependentmultiviewclusteringusingcopulamodel AT mallickbanik directionallydependentmultiviewclusteringusingcopulamodel |