Cargando…

Neuro-symbolic representation learning on biological knowledge graphs

MOTIVATION: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but h...

Descripción completa

Detalles Bibliográficos
Autores principales: Alshahrani, Mona, Khan, Mohammad Asif, Maddouri, Omar, Kinjo, Akira R, Queralt-Rosinach, Núria, Hoehndorf, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860058/
https://www.ncbi.nlm.nih.gov/pubmed/28449114
http://dx.doi.org/10.1093/bioinformatics/btx275
_version_ 1783307939439706112
author Alshahrani, Mona
Khan, Mohammad Asif
Maddouri, Omar
Kinjo, Akira R
Queralt-Rosinach, Núria
Hoehndorf, Robert
author_facet Alshahrani, Mona
Khan, Mohammad Asif
Maddouri, Omar
Kinjo, Akira R
Queralt-Rosinach, Núria
Hoehndorf, Robert
author_sort Alshahrani, Mona
collection PubMed
description MOTIVATION: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but have not yet widely been applied and evaluated on structured biological knowledge. Results: We develop a novel method for feature learning on biological knowledge graphs. Our method combines symbolic methods, in particular knowledge representation using symbolic logic and automated reasoning, with neural networks to generate embeddings of nodes that encode for related information within knowledge graphs. Through the use of symbolic logic, these embeddings contain both explicit and implicit information. We apply these embeddings to the prediction of edges in the knowledge graph representing problems of function prediction, finding candidate genes of diseases, protein-protein interactions, or drug target relations, and demonstrate performance that matches and sometimes outperforms traditional approaches based on manually crafted features. Our method can be applied to any biological knowledge graph, and will thereby open up the increasing amount of Semantic Web based knowledge bases in biology to use in machine learning and data analytics. AVAILABILITY AND IMPLEMENTATION: https://github.com/bio-ontology-research-group/walking-rdf-and-owl SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5860058
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58600582018-03-23 Neuro-symbolic representation learning on biological knowledge graphs Alshahrani, Mona Khan, Mohammad Asif Maddouri, Omar Kinjo, Akira R Queralt-Rosinach, Núria Hoehndorf, Robert Bioinformatics Original Papers MOTIVATION: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but have not yet widely been applied and evaluated on structured biological knowledge. Results: We develop a novel method for feature learning on biological knowledge graphs. Our method combines symbolic methods, in particular knowledge representation using symbolic logic and automated reasoning, with neural networks to generate embeddings of nodes that encode for related information within knowledge graphs. Through the use of symbolic logic, these embeddings contain both explicit and implicit information. We apply these embeddings to the prediction of edges in the knowledge graph representing problems of function prediction, finding candidate genes of diseases, protein-protein interactions, or drug target relations, and demonstrate performance that matches and sometimes outperforms traditional approaches based on manually crafted features. Our method can be applied to any biological knowledge graph, and will thereby open up the increasing amount of Semantic Web based knowledge bases in biology to use in machine learning and data analytics. AVAILABILITY AND IMPLEMENTATION: https://github.com/bio-ontology-research-group/walking-rdf-and-owl SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-09-01 2017-04-25 /pmc/articles/PMC5860058/ /pubmed/28449114 http://dx.doi.org/10.1093/bioinformatics/btx275 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Alshahrani, Mona
Khan, Mohammad Asif
Maddouri, Omar
Kinjo, Akira R
Queralt-Rosinach, Núria
Hoehndorf, Robert
Neuro-symbolic representation learning on biological knowledge graphs
title Neuro-symbolic representation learning on biological knowledge graphs
title_full Neuro-symbolic representation learning on biological knowledge graphs
title_fullStr Neuro-symbolic representation learning on biological knowledge graphs
title_full_unstemmed Neuro-symbolic representation learning on biological knowledge graphs
title_short Neuro-symbolic representation learning on biological knowledge graphs
title_sort neuro-symbolic representation learning on biological knowledge graphs
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860058/
https://www.ncbi.nlm.nih.gov/pubmed/28449114
http://dx.doi.org/10.1093/bioinformatics/btx275
work_keys_str_mv AT alshahranimona neurosymbolicrepresentationlearningonbiologicalknowledgegraphs
AT khanmohammadasif neurosymbolicrepresentationlearningonbiologicalknowledgegraphs
AT maddouriomar neurosymbolicrepresentationlearningonbiologicalknowledgegraphs
AT kinjoakirar neurosymbolicrepresentationlearningonbiologicalknowledgegraphs
AT queraltrosinachnuria neurosymbolicrepresentationlearningonbiologicalknowledgegraphs
AT hoehndorfrobert neurosymbolicrepresentationlearningonbiologicalknowledgegraphs