Cargando…

A multiobjective multi-view cluster ensemble technique: Application in patient subclassification

Recent high throughput omics technology has been used to assemble large biomedical omics datasets. Clustering of single omics data has proven invaluable in biomedical research. For the task of patient sub-classification, all the available omics data should be utilized combinedly rather than treating...

Descripción completa

Detalles Bibliográficos
Autores principales: Mitra, Sayantan, Saha, Sriparna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6533037/
https://www.ncbi.nlm.nih.gov/pubmed/31120942
http://dx.doi.org/10.1371/journal.pone.0216904
_version_ 1783421114475610112
author Mitra, Sayantan
Saha, Sriparna
author_facet Mitra, Sayantan
Saha, Sriparna
author_sort Mitra, Sayantan
collection PubMed
description Recent high throughput omics technology has been used to assemble large biomedical omics datasets. Clustering of single omics data has proven invaluable in biomedical research. For the task of patient sub-classification, all the available omics data should be utilized combinedly rather than treating them individually. Clustering of multi-omics datasets has the potential to reveal deep insights. Here, we propose a late integration based multiobjective multi-view clustering algorithm which uses a special perturbation operator. Initially, a large number of diverse clustering solutions (called base partitionings) are generated for each omic dataset using four clustering algorithms, viz., k means, complete linkage, spectral and fast search clustering. These base partitionings of multi-omic datasets are suitably combined using a special perturbation operator. The perturbation operator uses an ensemble technique to generate new solutions from the base partitionings. The optimal combination of multiple partitioning solutions across different views is determined after optimizing the objective functions, namely conn-XB, for checking the quality of partitionings for different views, and agreement index, for checking agreement between the views. The search capability of a multiobjective simulated annealing approach, namely AMOSA is used for this purpose. Lastly, the non-dominated solutions of the different views are combined based on similarity to generate a single set of non-dominated solutions. The proposed algorithm is evaluated on 13 multi-view cancer datasets. An elaborated comparative study with several baseline methods and five state-of-the-art models is performed to show the effectiveness of the algorithm.
format Online
Article
Text
id pubmed-6533037
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-65330372019-06-05 A multiobjective multi-view cluster ensemble technique: Application in patient subclassification Mitra, Sayantan Saha, Sriparna PLoS One Research Article Recent high throughput omics technology has been used to assemble large biomedical omics datasets. Clustering of single omics data has proven invaluable in biomedical research. For the task of patient sub-classification, all the available omics data should be utilized combinedly rather than treating them individually. Clustering of multi-omics datasets has the potential to reveal deep insights. Here, we propose a late integration based multiobjective multi-view clustering algorithm which uses a special perturbation operator. Initially, a large number of diverse clustering solutions (called base partitionings) are generated for each omic dataset using four clustering algorithms, viz., k means, complete linkage, spectral and fast search clustering. These base partitionings of multi-omic datasets are suitably combined using a special perturbation operator. The perturbation operator uses an ensemble technique to generate new solutions from the base partitionings. The optimal combination of multiple partitioning solutions across different views is determined after optimizing the objective functions, namely conn-XB, for checking the quality of partitionings for different views, and agreement index, for checking agreement between the views. The search capability of a multiobjective simulated annealing approach, namely AMOSA is used for this purpose. Lastly, the non-dominated solutions of the different views are combined based on similarity to generate a single set of non-dominated solutions. The proposed algorithm is evaluated on 13 multi-view cancer datasets. An elaborated comparative study with several baseline methods and five state-of-the-art models is performed to show the effectiveness of the algorithm. Public Library of Science 2019-05-23 /pmc/articles/PMC6533037/ /pubmed/31120942 http://dx.doi.org/10.1371/journal.pone.0216904 Text en © 2019 Mitra, Saha https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mitra, Sayantan
Saha, Sriparna
A multiobjective multi-view cluster ensemble technique: Application in patient subclassification
title A multiobjective multi-view cluster ensemble technique: Application in patient subclassification
title_full A multiobjective multi-view cluster ensemble technique: Application in patient subclassification
title_fullStr A multiobjective multi-view cluster ensemble technique: Application in patient subclassification
title_full_unstemmed A multiobjective multi-view cluster ensemble technique: Application in patient subclassification
title_short A multiobjective multi-view cluster ensemble technique: Application in patient subclassification
title_sort multiobjective multi-view cluster ensemble technique: application in patient subclassification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6533037/
https://www.ncbi.nlm.nih.gov/pubmed/31120942
http://dx.doi.org/10.1371/journal.pone.0216904
work_keys_str_mv AT mitrasayantan amultiobjectivemultiviewclusterensembletechniqueapplicationinpatientsubclassification
AT sahasriparna amultiobjectivemultiviewclusterensembletechniqueapplicationinpatientsubclassification
AT mitrasayantan multiobjectivemultiviewclusterensembletechniqueapplicationinpatientsubclassification
AT sahasriparna multiobjectivemultiviewclusterensembletechniqueapplicationinpatientsubclassification