Cargando…

Jumping across biomedical contexts using compressive data fusion

Motivation: The rapid growth of diverse biological data allows us to consider interactions between a variety of objects, such as genes, chemicals, molecular signatures, diseases, pathways and environmental exposures. Often, any pair of objects—such as a gene and a disease—can be related in different...

Descripción completa

Detalles Bibliográficos
Autores principales: Zitnik, Marinka, Zupan, Blaz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908331/
https://www.ncbi.nlm.nih.gov/pubmed/27307649
http://dx.doi.org/10.1093/bioinformatics/btw247
_version_ 1782437660968615936
author Zitnik, Marinka
Zupan, Blaz
author_facet Zitnik, Marinka
Zupan, Blaz
author_sort Zitnik, Marinka
collection PubMed
description Motivation: The rapid growth of diverse biological data allows us to consider interactions between a variety of objects, such as genes, chemicals, molecular signatures, diseases, pathways and environmental exposures. Often, any pair of objects—such as a gene and a disease—can be related in different ways, for example, directly via gene–disease associations or indirectly via functional annotations, chemicals and pathways. Different ways of relating these objects carry different semantic meanings. However, traditional methods disregard these semantics and thus cannot fully exploit their value in data modeling. Results: We present Medusa, an approach to detect size-k modules of objects that, taken together, appear most significant to another set of objects. Medusa operates on large-scale collections of heterogeneous datasets and explicitly distinguishes between diverse data semantics. It advances research along two dimensions: it builds on collective matrix factorization to derive different semantics, and it formulates the growing of the modules as a submodular optimization program. Medusa is flexible in choosing or combining semantic meanings and provides theoretical guarantees about detection quality. In a systematic study on 310 complex diseases, we show the effectiveness of Medusa in associating genes with diseases and detecting disease modules. We demonstrate that in predicting gene–disease associations Medusa compares favorably to methods that ignore diverse semantic meanings. We find that the utility of different semantics depends on disease categories and that, overall, Medusa recovers disease modules more accurately when combining different semantics. Availability and implementation: Source code is at http://github.com/marinkaz/medusa Contact: marinka@cs.stanford.edu, blaz.zupan@fri.uni-lj.si
format Online
Article
Text
id pubmed-4908331
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49083312016-06-17 Jumping across biomedical contexts using compressive data fusion Zitnik, Marinka Zupan, Blaz Bioinformatics Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida Motivation: The rapid growth of diverse biological data allows us to consider interactions between a variety of objects, such as genes, chemicals, molecular signatures, diseases, pathways and environmental exposures. Often, any pair of objects—such as a gene and a disease—can be related in different ways, for example, directly via gene–disease associations or indirectly via functional annotations, chemicals and pathways. Different ways of relating these objects carry different semantic meanings. However, traditional methods disregard these semantics and thus cannot fully exploit their value in data modeling. Results: We present Medusa, an approach to detect size-k modules of objects that, taken together, appear most significant to another set of objects. Medusa operates on large-scale collections of heterogeneous datasets and explicitly distinguishes between diverse data semantics. It advances research along two dimensions: it builds on collective matrix factorization to derive different semantics, and it formulates the growing of the modules as a submodular optimization program. Medusa is flexible in choosing or combining semantic meanings and provides theoretical guarantees about detection quality. In a systematic study on 310 complex diseases, we show the effectiveness of Medusa in associating genes with diseases and detecting disease modules. We demonstrate that in predicting gene–disease associations Medusa compares favorably to methods that ignore diverse semantic meanings. We find that the utility of different semantics depends on disease categories and that, overall, Medusa recovers disease modules more accurately when combining different semantics. Availability and implementation: Source code is at http://github.com/marinkaz/medusa Contact: marinka@cs.stanford.edu, blaz.zupan@fri.uni-lj.si Oxford University Press 2016-06-15 2016-06-11 /pmc/articles/PMC4908331/ /pubmed/27307649 http://dx.doi.org/10.1093/bioinformatics/btw247 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
Zitnik, Marinka
Zupan, Blaz
Jumping across biomedical contexts using compressive data fusion
title Jumping across biomedical contexts using compressive data fusion
title_full Jumping across biomedical contexts using compressive data fusion
title_fullStr Jumping across biomedical contexts using compressive data fusion
title_full_unstemmed Jumping across biomedical contexts using compressive data fusion
title_short Jumping across biomedical contexts using compressive data fusion
title_sort jumping across biomedical contexts using compressive data fusion
topic Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908331/
https://www.ncbi.nlm.nih.gov/pubmed/27307649
http://dx.doi.org/10.1093/bioinformatics/btw247
work_keys_str_mv AT zitnikmarinka jumpingacrossbiomedicalcontextsusingcompressivedatafusion
AT zupanblaz jumpingacrossbiomedicalcontextsusingcompressivedatafusion