Cargando…
Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for auto...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725650/ https://www.ncbi.nlm.nih.gov/pubmed/33127601 http://dx.doi.org/10.2196/22333 |
_version_ | 1783620743623344128 |
---|---|
author | Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong |
author_facet | Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong |
author_sort | Luo, Lingyun |
collection | PubMed |
description | BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for automatically generating the taxonomy and the partonomy structures among them, respectively. METHODS: Our approach comprises 2 main tasks: The first task is predicting the direct relation between 2 given concept names by utilizing word embedding methods and training 2 machine learning models, Convolutional Neural Networks (CNN) and Bidirectional Long Short-term Memory Networks (Bi-LSTM). The second task is the introduction of an original granularity-based method to identify the semantic structures among a group of given concept names by leveraging these trained models. RESULTS: Results show that both CNN and Bi-LSTM perform well on the first task, with F1 measures above 0.91. For the second task, our approach achieves an average F1 measure of 0.79 on 100 case studies in the FMA using Bi-LSTM, which outperforms the primitive pairwise-based method. CONCLUSIONS: We have investigated an automatic way of predicting a hierarchical relationship between 2 concept names; based on this, we have further invented a methodology to structure a group of concept names automatically. This study is an initial investigation that will shed light on further work on the automatic creation and enrichment of biomedical ontologies. |
format | Online Article Text |
id | pubmed-7725650 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-77256502020-12-30 Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong JMIR Med Inform Original Paper BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for automatically generating the taxonomy and the partonomy structures among them, respectively. METHODS: Our approach comprises 2 main tasks: The first task is predicting the direct relation between 2 given concept names by utilizing word embedding methods and training 2 machine learning models, Convolutional Neural Networks (CNN) and Bidirectional Long Short-term Memory Networks (Bi-LSTM). The second task is the introduction of an original granularity-based method to identify the semantic structures among a group of given concept names by leveraging these trained models. RESULTS: Results show that both CNN and Bi-LSTM perform well on the first task, with F1 measures above 0.91. For the second task, our approach achieves an average F1 measure of 0.79 on 100 case studies in the FMA using Bi-LSTM, which outperforms the primitive pairwise-based method. CONCLUSIONS: We have investigated an automatic way of predicting a hierarchical relationship between 2 concept names; based on this, we have further invented a methodology to structure a group of concept names automatically. This study is an initial investigation that will shed light on further work on the automatic creation and enrichment of biomedical ontologies. JMIR Publications 2020-11-25 /pmc/articles/PMC7725650/ /pubmed/33127601 http://dx.doi.org/10.2196/22333 Text en ©Lingyun Luo, Jingtao Feng, Huijun Yu, Jiaolong Wang. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 25.11.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation |
title | Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation |
title_full | Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation |
title_fullStr | Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation |
title_full_unstemmed | Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation |
title_short | Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation |
title_sort | automatic structuring of ontology terms based on lexical granularity and machine learning: algorithm development and validation |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725650/ https://www.ncbi.nlm.nih.gov/pubmed/33127601 http://dx.doi.org/10.2196/22333 |
work_keys_str_mv | AT luolingyun automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation AT fengjingtao automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation AT yuhuijun automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation AT wangjiaolong automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation |