Cargando…

Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation

BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for auto...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Lingyun, Feng, Jingtao, Yu, Huijun, Wang, Jiaolong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725650/
https://www.ncbi.nlm.nih.gov/pubmed/33127601
http://dx.doi.org/10.2196/22333
_version_ 1783620743623344128
author Luo, Lingyun
Feng, Jingtao
Yu, Huijun
Wang, Jiaolong
author_facet Luo, Lingyun
Feng, Jingtao
Yu, Huijun
Wang, Jiaolong
author_sort Luo, Lingyun
collection PubMed
description BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for automatically generating the taxonomy and the partonomy structures among them, respectively. METHODS: Our approach comprises 2 main tasks: The first task is predicting the direct relation between 2 given concept names by utilizing word embedding methods and training 2 machine learning models, Convolutional Neural Networks (CNN) and Bidirectional Long Short-term Memory Networks (Bi-LSTM). The second task is the introduction of an original granularity-based method to identify the semantic structures among a group of given concept names by leveraging these trained models. RESULTS: Results show that both CNN and Bi-LSTM perform well on the first task, with F1 measures above 0.91. For the second task, our approach achieves an average F1 measure of 0.79 on 100 case studies in the FMA using Bi-LSTM, which outperforms the primitive pairwise-based method. CONCLUSIONS: We have investigated an automatic way of predicting a hierarchical relationship between 2 concept names; based on this, we have further invented a methodology to structure a group of concept names automatically. This study is an initial investigation that will shed light on further work on the automatic creation and enrichment of biomedical ontologies.
format Online
Article
Text
id pubmed-7725650
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-77256502020-12-30 Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong JMIR Med Inform Original Paper BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for automatically generating the taxonomy and the partonomy structures among them, respectively. METHODS: Our approach comprises 2 main tasks: The first task is predicting the direct relation between 2 given concept names by utilizing word embedding methods and training 2 machine learning models, Convolutional Neural Networks (CNN) and Bidirectional Long Short-term Memory Networks (Bi-LSTM). The second task is the introduction of an original granularity-based method to identify the semantic structures among a group of given concept names by leveraging these trained models. RESULTS: Results show that both CNN and Bi-LSTM perform well on the first task, with F1 measures above 0.91. For the second task, our approach achieves an average F1 measure of 0.79 on 100 case studies in the FMA using Bi-LSTM, which outperforms the primitive pairwise-based method. CONCLUSIONS: We have investigated an automatic way of predicting a hierarchical relationship between 2 concept names; based on this, we have further invented a methodology to structure a group of concept names automatically. This study is an initial investigation that will shed light on further work on the automatic creation and enrichment of biomedical ontologies. JMIR Publications 2020-11-25 /pmc/articles/PMC7725650/ /pubmed/33127601 http://dx.doi.org/10.2196/22333 Text en ©Lingyun Luo, Jingtao Feng, Huijun Yu, Jiaolong Wang. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 25.11.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Luo, Lingyun
Feng, Jingtao
Yu, Huijun
Wang, Jiaolong
Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_full Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_fullStr Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_full_unstemmed Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_short Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_sort automatic structuring of ontology terms based on lexical granularity and machine learning: algorithm development and validation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725650/
https://www.ncbi.nlm.nih.gov/pubmed/33127601
http://dx.doi.org/10.2196/22333
work_keys_str_mv AT luolingyun automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation
AT fengjingtao automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation
AT yuhuijun automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation
AT wangjiaolong automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation