Cargando…

Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces

Co-clustering, often called biclustering for two-dimensional data, has found many applications, such as gene expression data analysis and text mining. Nowadays, a variety of multi-dimensional arrays (tensors) frequently occur in data analysis tasks, and co-clustering techniques play a key role in de...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Hongya, Wang, Debby D., Chen, Long, Liu, Xinyu, Yan, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5012624/
https://www.ncbi.nlm.nih.gov/pubmed/27598575
http://dx.doi.org/10.1371/journal.pone.0162293
_version_ 1782452022171140096
author Zhao, Hongya
Wang, Debby D.
Chen, Long
Liu, Xinyu
Yan, Hong
author_facet Zhao, Hongya
Wang, Debby D.
Chen, Long
Liu, Xinyu
Yan, Hong
author_sort Zhao, Hongya
collection PubMed
description Co-clustering, often called biclustering for two-dimensional data, has found many applications, such as gene expression data analysis and text mining. Nowadays, a variety of multi-dimensional arrays (tensors) frequently occur in data analysis tasks, and co-clustering techniques play a key role in dealing with such datasets. Co-clusters represent coherent patterns and exhibit important properties along all the modes. Development of robust co-clustering techniques is important for the detection and analysis of these patterns. In this paper, a co-clustering method based on hyperplane detection in singular vector spaces (HDSVS) is proposed. Specifically in this method, higher-order singular value decomposition (HOSVD) transforms a tensor into a core part and a singular vector matrix along each mode, whose row vectors can be clustered by a linear grouping algorithm (LGA). Meanwhile, hyperplanar patterns are extracted and successfully supported the identification of multi-dimensional co-clusters. To validate HDSVS, a number of synthetic and biological tensors were adopted. The synthetic tensors attested a favorable performance of this algorithm on noisy or overlapped data. Experiments with gene expression data and lineage data of embryonic cells further verified the reliability of HDSVS to practical problems. Moreover, the detected co-clusters are well consistent with important genetic pathways and gene ontology annotations. Finally, a series of comparisons between HDSVS and state-of-the-art methods on synthetic tensors and a yeast gene expression tensor were implemented, verifying the robust and stable performance of our method.
format Online
Article
Text
id pubmed-5012624
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-50126242016-09-27 Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces Zhao, Hongya Wang, Debby D. Chen, Long Liu, Xinyu Yan, Hong PLoS One Research Article Co-clustering, often called biclustering for two-dimensional data, has found many applications, such as gene expression data analysis and text mining. Nowadays, a variety of multi-dimensional arrays (tensors) frequently occur in data analysis tasks, and co-clustering techniques play a key role in dealing with such datasets. Co-clusters represent coherent patterns and exhibit important properties along all the modes. Development of robust co-clustering techniques is important for the detection and analysis of these patterns. In this paper, a co-clustering method based on hyperplane detection in singular vector spaces (HDSVS) is proposed. Specifically in this method, higher-order singular value decomposition (HOSVD) transforms a tensor into a core part and a singular vector matrix along each mode, whose row vectors can be clustered by a linear grouping algorithm (LGA). Meanwhile, hyperplanar patterns are extracted and successfully supported the identification of multi-dimensional co-clusters. To validate HDSVS, a number of synthetic and biological tensors were adopted. The synthetic tensors attested a favorable performance of this algorithm on noisy or overlapped data. Experiments with gene expression data and lineage data of embryonic cells further verified the reliability of HDSVS to practical problems. Moreover, the detected co-clusters are well consistent with important genetic pathways and gene ontology annotations. Finally, a series of comparisons between HDSVS and state-of-the-art methods on synthetic tensors and a yeast gene expression tensor were implemented, verifying the robust and stable performance of our method. Public Library of Science 2016-09-06 /pmc/articles/PMC5012624/ /pubmed/27598575 http://dx.doi.org/10.1371/journal.pone.0162293 Text en © 2016 Zhao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhao, Hongya
Wang, Debby D.
Chen, Long
Liu, Xinyu
Yan, Hong
Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces
title Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces
title_full Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces
title_fullStr Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces
title_full_unstemmed Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces
title_short Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces
title_sort identifying multi-dimensional co-clusters in tensors based on hyperplane detection in singular vector spaces
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5012624/
https://www.ncbi.nlm.nih.gov/pubmed/27598575
http://dx.doi.org/10.1371/journal.pone.0162293
work_keys_str_mv AT zhaohongya identifyingmultidimensionalcoclustersintensorsbasedonhyperplanedetectioninsingularvectorspaces
AT wangdebbyd identifyingmultidimensionalcoclustersintensorsbasedonhyperplanedetectioninsingularvectorspaces
AT chenlong identifyingmultidimensionalcoclustersintensorsbasedonhyperplanedetectioninsingularvectorspaces
AT liuxinyu identifyingmultidimensionalcoclustersintensorsbasedonhyperplanedetectioninsingularvectorspaces
AT yanhong identifyingmultidimensionalcoclustersintensorsbasedonhyperplanedetectioninsingularvectorspaces