Cargando…

Biomedical ontology alignment: an approach based on representation learning

BACKGROUND: While representation learning techniques have shown great promise in application to a number of different NLP tasks, they have had little impact on the problem of ontology matching. Unlike past work that has focused on feature engineering, we present a novel representation learning appro...

Descripción completa

Detalles Bibliográficos
Autores principales: Kolyvakis, Prodromos, Kalousis, Alexandros, Smith, Barry, Kiritsis, Dimitris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6094585/
https://www.ncbi.nlm.nih.gov/pubmed/30111369
http://dx.doi.org/10.1186/s13326-018-0187-8
_version_ 1783347848694202368
author Kolyvakis, Prodromos
Kalousis, Alexandros
Smith, Barry
Kiritsis, Dimitris
author_facet Kolyvakis, Prodromos
Kalousis, Alexandros
Smith, Barry
Kiritsis, Dimitris
author_sort Kolyvakis, Prodromos
collection PubMed
description BACKGROUND: While representation learning techniques have shown great promise in application to a number of different NLP tasks, they have had little impact on the problem of ontology matching. Unlike past work that has focused on feature engineering, we present a novel representation learning approach that is tailored to the ontology matching task. Our approach is based on embedding ontological terms in a high-dimensional Euclidean space. This embedding is derived on the basis of a novel phrase retrofitting strategy through which semantic similarity information becomes inscribed onto fields of pre-trained word vectors. The resulting framework also incorporates a novel outlier detection mechanism based on a denoising autoencoder that is shown to improve performance. RESULTS: An ontology matching system derived using the proposed framework achieved an F-score of 94% on an alignment scenario involving the Adult Mouse Anatomical Dictionary and the Foundational Model of Anatomy ontology (FMA) as targets. This compares favorably with the best performing systems on the Ontology Alignment Evaluation Initiative anatomy challenge. We performed additional experiments on aligning FMA to NCI Thesaurus and to SNOMED CT based on a reference alignment extracted from the UMLS Metathesaurus. Our system obtained overall F-scores of 93.2% and 89.2% for these experiments, thus achieving state-of-the-art results. CONCLUSIONS: Our proposed representation learning approach leverages terminological embeddings to capture semantic similarity. Our results provide evidence that the approach produces embeddings that are especially well tailored to the ontology matching task, demonstrating a novel pathway for the problem.
format Online
Article
Text
id pubmed-6094585
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60945852018-08-24 Biomedical ontology alignment: an approach based on representation learning Kolyvakis, Prodromos Kalousis, Alexandros Smith, Barry Kiritsis, Dimitris J Biomed Semantics Research BACKGROUND: While representation learning techniques have shown great promise in application to a number of different NLP tasks, they have had little impact on the problem of ontology matching. Unlike past work that has focused on feature engineering, we present a novel representation learning approach that is tailored to the ontology matching task. Our approach is based on embedding ontological terms in a high-dimensional Euclidean space. This embedding is derived on the basis of a novel phrase retrofitting strategy through which semantic similarity information becomes inscribed onto fields of pre-trained word vectors. The resulting framework also incorporates a novel outlier detection mechanism based on a denoising autoencoder that is shown to improve performance. RESULTS: An ontology matching system derived using the proposed framework achieved an F-score of 94% on an alignment scenario involving the Adult Mouse Anatomical Dictionary and the Foundational Model of Anatomy ontology (FMA) as targets. This compares favorably with the best performing systems on the Ontology Alignment Evaluation Initiative anatomy challenge. We performed additional experiments on aligning FMA to NCI Thesaurus and to SNOMED CT based on a reference alignment extracted from the UMLS Metathesaurus. Our system obtained overall F-scores of 93.2% and 89.2% for these experiments, thus achieving state-of-the-art results. CONCLUSIONS: Our proposed representation learning approach leverages terminological embeddings to capture semantic similarity. Our results provide evidence that the approach produces embeddings that are especially well tailored to the ontology matching task, demonstrating a novel pathway for the problem. BioMed Central 2018-08-15 /pmc/articles/PMC6094585/ /pubmed/30111369 http://dx.doi.org/10.1186/s13326-018-0187-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Kolyvakis, Prodromos
Kalousis, Alexandros
Smith, Barry
Kiritsis, Dimitris
Biomedical ontology alignment: an approach based on representation learning
title Biomedical ontology alignment: an approach based on representation learning
title_full Biomedical ontology alignment: an approach based on representation learning
title_fullStr Biomedical ontology alignment: an approach based on representation learning
title_full_unstemmed Biomedical ontology alignment: an approach based on representation learning
title_short Biomedical ontology alignment: an approach based on representation learning
title_sort biomedical ontology alignment: an approach based on representation learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6094585/
https://www.ncbi.nlm.nih.gov/pubmed/30111369
http://dx.doi.org/10.1186/s13326-018-0187-8
work_keys_str_mv AT kolyvakisprodromos biomedicalontologyalignmentanapproachbasedonrepresentationlearning
AT kalousisalexandros biomedicalontologyalignmentanapproachbasedonrepresentationlearning
AT smithbarry biomedicalontologyalignmentanapproachbasedonrepresentationlearning
AT kiritsisdimitris biomedicalontologyalignmentanapproachbasedonrepresentationlearning