Cargando…

Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery

Clustering analysis has a growing role in the study of co-expressed genes for gene discovery. Conventional binary and fuzzy clustering do not embrace the biological reality that some genes may be irrelevant for a problem and not be assigned to a cluster, while other genes may participate in several...

Descripción completa

Detalles Bibliográficos
Autores principales: Abu-Jamous, Basel, Fa, Rui, Roberts, David J., Nandi, Asoke K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3569426/
https://www.ncbi.nlm.nih.gov/pubmed/23409186
http://dx.doi.org/10.1371/journal.pone.0056432
_version_ 1782258903634935808
author Abu-Jamous, Basel
Fa, Rui
Roberts, David J.
Nandi, Asoke K.
author_facet Abu-Jamous, Basel
Fa, Rui
Roberts, David J.
Nandi, Asoke K.
author_sort Abu-Jamous, Basel
collection PubMed
description Clustering analysis has a growing role in the study of co-expressed genes for gene discovery. Conventional binary and fuzzy clustering do not embrace the biological reality that some genes may be irrelevant for a problem and not be assigned to a cluster, while other genes may participate in several biological functions and should simultaneously belong to multiple clusters. Also, these algorithms cannot generate tight clusters that focus on their cores or wide clusters that overlap and contain all possibly relevant genes. In this paper, a new clustering paradigm is proposed. In this paradigm, all three eventualities of a gene being exclusively assigned to a single cluster, being assigned to multiple clusters, and being not assigned to any cluster are possible. These possibilities are realised through the primary novelty of the introduction of tunable binarization techniques. Results from multiple clustering experiments are aggregated to generate one fuzzy consensus partition matrix (CoPaM), which is then binarized to obtain the final binary partitions. This is referred to as Binarization of Consensus Partition Matrices (Bi-CoPaM). The method has been tested with a set of synthetic datasets and a set of five real yeast cell-cycle datasets. The results demonstrate its validity in generating relevant tight, wide, and complementary clusters that can meet requirements of different gene discovery studies.
format Online
Article
Text
id pubmed-3569426
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35694262013-02-13 Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery Abu-Jamous, Basel Fa, Rui Roberts, David J. Nandi, Asoke K. PLoS One Research Article Clustering analysis has a growing role in the study of co-expressed genes for gene discovery. Conventional binary and fuzzy clustering do not embrace the biological reality that some genes may be irrelevant for a problem and not be assigned to a cluster, while other genes may participate in several biological functions and should simultaneously belong to multiple clusters. Also, these algorithms cannot generate tight clusters that focus on their cores or wide clusters that overlap and contain all possibly relevant genes. In this paper, a new clustering paradigm is proposed. In this paradigm, all three eventualities of a gene being exclusively assigned to a single cluster, being assigned to multiple clusters, and being not assigned to any cluster are possible. These possibilities are realised through the primary novelty of the introduction of tunable binarization techniques. Results from multiple clustering experiments are aggregated to generate one fuzzy consensus partition matrix (CoPaM), which is then binarized to obtain the final binary partitions. This is referred to as Binarization of Consensus Partition Matrices (Bi-CoPaM). The method has been tested with a set of synthetic datasets and a set of five real yeast cell-cycle datasets. The results demonstrate its validity in generating relevant tight, wide, and complementary clusters that can meet requirements of different gene discovery studies. Public Library of Science 2013-02-11 /pmc/articles/PMC3569426/ /pubmed/23409186 http://dx.doi.org/10.1371/journal.pone.0056432 Text en © 2013 Abu-Jamous et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Abu-Jamous, Basel
Fa, Rui
Roberts, David J.
Nandi, Asoke K.
Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery
title Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery
title_full Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery
title_fullStr Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery
title_full_unstemmed Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery
title_short Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery
title_sort paradigm of tunable clustering using binarization of consensus partition matrices (bi-copam) for gene discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3569426/
https://www.ncbi.nlm.nih.gov/pubmed/23409186
http://dx.doi.org/10.1371/journal.pone.0056432
work_keys_str_mv AT abujamousbasel paradigmoftunableclusteringusingbinarizationofconsensuspartitionmatricesbicopamforgenediscovery
AT farui paradigmoftunableclusteringusingbinarizationofconsensuspartitionmatricesbicopamforgenediscovery
AT robertsdavidj paradigmoftunableclusteringusingbinarizationofconsensuspartitionmatricesbicopamforgenediscovery
AT nandiasokek paradigmoftunableclusteringusingbinarizationofconsensuspartitionmatricesbicopamforgenediscovery