Cargando…

Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes

MOTIVATION: In the past years, several methods have been developed to incorporate information about phenotypes into computational disease gene prioritization methods. These methods commonly compute the similarity between a disease’s (or patient’s) phenotypes and a database of gene-to-phenotype assoc...

Descripción completa

Detalles Bibliográficos
Autores principales: Alshahrani, Mona, Hoehndorf, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129260/
https://www.ncbi.nlm.nih.gov/pubmed/30423077
http://dx.doi.org/10.1093/bioinformatics/bty559
_version_ 1783353769542549504
author Alshahrani, Mona
Hoehndorf, Robert
author_facet Alshahrani, Mona
Hoehndorf, Robert
author_sort Alshahrani, Mona
collection PubMed
description MOTIVATION: In the past years, several methods have been developed to incorporate information about phenotypes into computational disease gene prioritization methods. These methods commonly compute the similarity between a disease’s (or patient’s) phenotypes and a database of gene-to-phenotype associations to find the phenotypically most similar match. A key limitation of these methods is their reliance on knowledge about phenotypes associated with particular genes which is highly incomplete in humans as well as in many model organisms such as the mouse. RESULTS: We developed SmuDGE, a method that uses feature learning to generate vector-based representations of phenotypes associated with an entity. SmuDGE can be used as a trainable semantic similarity measure to compare two sets of phenotypes (such as between a disease and gene, or a disease and patient). More importantly, SmuDGE can generate phenotype representations for entities that are only indirectly associated with phenotypes through an interaction network; for this purpose, SmuDGE exploits background knowledge in interaction networks comprised of multiple types of interactions. We demonstrate that SmuDGE can match or outperform semantic similarity in phenotype-based disease gene prioritization, and furthermore significantly extends the coverage of phenotype-based methods to all genes in a connected interaction network. AVAILABILITY AND IMPLEMENTATION: https://github.com/bio-ontology-research-group/SmuDGE
format Online
Article
Text
id pubmed-6129260
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61292602018-09-12 Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes Alshahrani, Mona Hoehndorf, Robert Bioinformatics Eccb 2018: European Conference on Computational Biology Proceedings MOTIVATION: In the past years, several methods have been developed to incorporate information about phenotypes into computational disease gene prioritization methods. These methods commonly compute the similarity between a disease’s (or patient’s) phenotypes and a database of gene-to-phenotype associations to find the phenotypically most similar match. A key limitation of these methods is their reliance on knowledge about phenotypes associated with particular genes which is highly incomplete in humans as well as in many model organisms such as the mouse. RESULTS: We developed SmuDGE, a method that uses feature learning to generate vector-based representations of phenotypes associated with an entity. SmuDGE can be used as a trainable semantic similarity measure to compare two sets of phenotypes (such as between a disease and gene, or a disease and patient). More importantly, SmuDGE can generate phenotype representations for entities that are only indirectly associated with phenotypes through an interaction network; for this purpose, SmuDGE exploits background knowledge in interaction networks comprised of multiple types of interactions. We demonstrate that SmuDGE can match or outperform semantic similarity in phenotype-based disease gene prioritization, and furthermore significantly extends the coverage of phenotype-based methods to all genes in a connected interaction network. AVAILABILITY AND IMPLEMENTATION: https://github.com/bio-ontology-research-group/SmuDGE Oxford University Press 2018-09-01 2018-09-08 /pmc/articles/PMC6129260/ /pubmed/30423077 http://dx.doi.org/10.1093/bioinformatics/bty559 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2018: European Conference on Computational Biology Proceedings
Alshahrani, Mona
Hoehndorf, Robert
Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes
title Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes
title_full Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes
title_fullStr Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes
title_full_unstemmed Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes
title_short Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes
title_sort semantic disease gene embeddings (smudge): phenotype-based disease gene prioritization without phenotypes
topic Eccb 2018: European Conference on Computational Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129260/
https://www.ncbi.nlm.nih.gov/pubmed/30423077
http://dx.doi.org/10.1093/bioinformatics/bty559
work_keys_str_mv AT alshahranimona semanticdiseasegeneembeddingssmudgephenotypebaseddiseasegeneprioritizationwithoutphenotypes
AT hoehndorfrobert semanticdiseasegeneembeddingssmudgephenotypebaseddiseasegeneprioritizationwithoutphenotypes