Cargando…

Multi-omic and multi-view clustering algorithms: review and cancer benchmark

Recent high throughput experimental methods have been used to collect large biomedical omics datasets. Clustering of single omic datasets has proven invaluable for biological and medical research. The decreasing cost and development of additional high throughput methods now enable measurement of mul...

Descripción completa

Detalles Bibliográficos
Autores principales: Rappoport, Nimrod, Shamir, Ron
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6237755/
https://www.ncbi.nlm.nih.gov/pubmed/30295871
http://dx.doi.org/10.1093/nar/gky889
_version_ 1783371232701317120
author Rappoport, Nimrod
Shamir, Ron
author_facet Rappoport, Nimrod
Shamir, Ron
author_sort Rappoport, Nimrod
collection PubMed
description Recent high throughput experimental methods have been used to collect large biomedical omics datasets. Clustering of single omic datasets has proven invaluable for biological and medical research. The decreasing cost and development of additional high throughput methods now enable measurement of multi-omic data. Clustering multi-omic data has the potential to reveal further systems-level insights, but raises computational and biological challenges. Here, we review algorithms for multi-omics clustering, and discuss key issues in applying these algorithms. Our review covers methods developed specifically for omic data as well as generic multi-view methods developed in the machine learning community for joint clustering of multiple data types. In addition, using cancer data from TCGA, we perform an extensive benchmark spanning ten different cancer types, providing the first systematic comparison of leading multi-omics and multi-view clustering algorithms. The results highlight key issues regarding the use of single- versus multi-omics, the choice of clustering strategy, the power of generic multi-view methods and the use of approximated p-values for gauging solution quality. Due to the growing use of multi-omics data, we expect these issues to be important for future progress in the field.
format Online
Article
Text
id pubmed-6237755
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62377552018-11-21 Multi-omic and multi-view clustering algorithms: review and cancer benchmark Rappoport, Nimrod Shamir, Ron Nucleic Acids Res Survey and Summary Recent high throughput experimental methods have been used to collect large biomedical omics datasets. Clustering of single omic datasets has proven invaluable for biological and medical research. The decreasing cost and development of additional high throughput methods now enable measurement of multi-omic data. Clustering multi-omic data has the potential to reveal further systems-level insights, but raises computational and biological challenges. Here, we review algorithms for multi-omics clustering, and discuss key issues in applying these algorithms. Our review covers methods developed specifically for omic data as well as generic multi-view methods developed in the machine learning community for joint clustering of multiple data types. In addition, using cancer data from TCGA, we perform an extensive benchmark spanning ten different cancer types, providing the first systematic comparison of leading multi-omics and multi-view clustering algorithms. The results highlight key issues regarding the use of single- versus multi-omics, the choice of clustering strategy, the power of generic multi-view methods and the use of approximated p-values for gauging solution quality. Due to the growing use of multi-omics data, we expect these issues to be important for future progress in the field. Oxford University Press 2018-11-16 2018-10-08 /pmc/articles/PMC6237755/ /pubmed/30295871 http://dx.doi.org/10.1093/nar/gky889 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Survey and Summary
Rappoport, Nimrod
Shamir, Ron
Multi-omic and multi-view clustering algorithms: review and cancer benchmark
title Multi-omic and multi-view clustering algorithms: review and cancer benchmark
title_full Multi-omic and multi-view clustering algorithms: review and cancer benchmark
title_fullStr Multi-omic and multi-view clustering algorithms: review and cancer benchmark
title_full_unstemmed Multi-omic and multi-view clustering algorithms: review and cancer benchmark
title_short Multi-omic and multi-view clustering algorithms: review and cancer benchmark
title_sort multi-omic and multi-view clustering algorithms: review and cancer benchmark
topic Survey and Summary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6237755/
https://www.ncbi.nlm.nih.gov/pubmed/30295871
http://dx.doi.org/10.1093/nar/gky889
work_keys_str_mv AT rappoportnimrod multiomicandmultiviewclusteringalgorithmsreviewandcancerbenchmark
AT shamirron multiomicandmultiviewclusteringalgorithmsreviewandcancerbenchmark