Cargando…

Directionally dependent multi-view clustering using copula model

Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for reveali...

Descripción completa

Detalles Bibliográficos
Autores principales: Afrin, Kahkashan, Iquebal, Ashif S., Karimi, Mostafa, Souris, Allyson, Lee, Se Yoon, Mallick, Bani K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7584221/
https://www.ncbi.nlm.nih.gov/pubmed/33095785
http://dx.doi.org/10.1371/journal.pone.0238996
_version_ 1783599552772702208
author Afrin, Kahkashan
Iquebal, Ashif S.
Karimi, Mostafa
Souris, Allyson
Lee, Se Yoon
Mallick, Bani K.
author_facet Afrin, Kahkashan
Iquebal, Ashif S.
Karimi, Mostafa
Souris, Allyson
Lee, Se Yoon
Mallick, Bani K.
author_sort Afrin, Kahkashan
collection PubMed
description Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results.
format Online
Article
Text
id pubmed-7584221
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75842212020-10-28 Directionally dependent multi-view clustering using copula model Afrin, Kahkashan Iquebal, Ashif S. Karimi, Mostafa Souris, Allyson Lee, Se Yoon Mallick, Bani K. PLoS One Research Article Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results. Public Library of Science 2020-10-23 /pmc/articles/PMC7584221/ /pubmed/33095785 http://dx.doi.org/10.1371/journal.pone.0238996 Text en © 2020 Afrin et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Afrin, Kahkashan
Iquebal, Ashif S.
Karimi, Mostafa
Souris, Allyson
Lee, Se Yoon
Mallick, Bani K.
Directionally dependent multi-view clustering using copula model
title Directionally dependent multi-view clustering using copula model
title_full Directionally dependent multi-view clustering using copula model
title_fullStr Directionally dependent multi-view clustering using copula model
title_full_unstemmed Directionally dependent multi-view clustering using copula model
title_short Directionally dependent multi-view clustering using copula model
title_sort directionally dependent multi-view clustering using copula model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7584221/
https://www.ncbi.nlm.nih.gov/pubmed/33095785
http://dx.doi.org/10.1371/journal.pone.0238996
work_keys_str_mv AT afrinkahkashan directionallydependentmultiviewclusteringusingcopulamodel
AT iquebalashifs directionallydependentmultiviewclusteringusingcopulamodel
AT karimimostafa directionallydependentmultiviewclusteringusingcopulamodel
AT sourisallyson directionallydependentmultiviewclusteringusingcopulamodel
AT leeseyoon directionallydependentmultiviewclusteringusingcopulamodel
AT mallickbanik directionallydependentmultiviewclusteringusingcopulamodel