Cargando…
Omics community detection using multi-resolution clustering
MOTIVATION: The discovery of biologically interpretable and clinically actionable communities in heterogeneous omics data is a necessary first step toward deriving mechanistic insights into complex biological phenomena. Here, we present a novel clustering approach, omeClust, for community detection...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545346/ https://www.ncbi.nlm.nih.gov/pubmed/33974004 http://dx.doi.org/10.1093/bioinformatics/btab317 |
_version_ | 1784589997052002304 |
---|---|
author | Rahnavard, Ali Chatterjee, Suvo Sayoldin, Bahar Crandall, Keith A Tekola-Ayele, Fasil Mallick, Himel |
author_facet | Rahnavard, Ali Chatterjee, Suvo Sayoldin, Bahar Crandall, Keith A Tekola-Ayele, Fasil Mallick, Himel |
author_sort | Rahnavard, Ali |
collection | PubMed |
description | MOTIVATION: The discovery of biologically interpretable and clinically actionable communities in heterogeneous omics data is a necessary first step toward deriving mechanistic insights into complex biological phenomena. Here, we present a novel clustering approach, omeClust, for community detection in omics profiles by simultaneously incorporating similarities among measurements and the overall complex structure of the data. RESULTS: We show that omeClust outperforms published methods in inferring the true community structure as measured by both sensitivity and misclassification rate on simulated datasets. We further validated omeClust in diverse, multiple omics datasets, revealing new communities and functionally related groups in microbial strains, cell line gene expression patterns and fetal genomic variation. We also derived enrichment scores attributable to putatively meaningful biological factors in these datasets that can serve as hypothesis generators facilitating new sets of testable hypotheses. AVAILABILITY AND IMPLEMENTATION: omeClust is open-source software, and the implementation is available online at http://github.com/omicsEye/omeClust. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8545346 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-85453462021-10-26 Omics community detection using multi-resolution clustering Rahnavard, Ali Chatterjee, Suvo Sayoldin, Bahar Crandall, Keith A Tekola-Ayele, Fasil Mallick, Himel Bioinformatics Original Papers MOTIVATION: The discovery of biologically interpretable and clinically actionable communities in heterogeneous omics data is a necessary first step toward deriving mechanistic insights into complex biological phenomena. Here, we present a novel clustering approach, omeClust, for community detection in omics profiles by simultaneously incorporating similarities among measurements and the overall complex structure of the data. RESULTS: We show that omeClust outperforms published methods in inferring the true community structure as measured by both sensitivity and misclassification rate on simulated datasets. We further validated omeClust in diverse, multiple omics datasets, revealing new communities and functionally related groups in microbial strains, cell line gene expression patterns and fetal genomic variation. We also derived enrichment scores attributable to putatively meaningful biological factors in these datasets that can serve as hypothesis generators facilitating new sets of testable hypotheses. AVAILABILITY AND IMPLEMENTATION: omeClust is open-source software, and the implementation is available online at http://github.com/omicsEye/omeClust. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-05-11 /pmc/articles/PMC8545346/ /pubmed/33974004 http://dx.doi.org/10.1093/bioinformatics/btab317 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Rahnavard, Ali Chatterjee, Suvo Sayoldin, Bahar Crandall, Keith A Tekola-Ayele, Fasil Mallick, Himel Omics community detection using multi-resolution clustering |
title | Omics community detection using multi-resolution clustering |
title_full | Omics community detection using multi-resolution clustering |
title_fullStr | Omics community detection using multi-resolution clustering |
title_full_unstemmed | Omics community detection using multi-resolution clustering |
title_short | Omics community detection using multi-resolution clustering |
title_sort | omics community detection using multi-resolution clustering |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545346/ https://www.ncbi.nlm.nih.gov/pubmed/33974004 http://dx.doi.org/10.1093/bioinformatics/btab317 |
work_keys_str_mv | AT rahnavardali omicscommunitydetectionusingmultiresolutionclustering AT chatterjeesuvo omicscommunitydetectionusingmultiresolutionclustering AT sayoldinbahar omicscommunitydetectionusingmultiresolutionclustering AT crandallkeitha omicscommunitydetectionusingmultiresolutionclustering AT tekolaayelefasil omicscommunitydetectionusingmultiresolutionclustering AT mallickhimel omicscommunitydetectionusingmultiresolutionclustering |