Cargando…

Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation

BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for auto...

Descripción completa

Detalles Bibliográficos
Autores principales:	Luo, Lingyun, Feng, Jingtao, Yu, Huijun, Wang, Jiaolong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2020
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725650/ https://www.ncbi.nlm.nih.gov/pubmed/33127601 http://dx.doi.org/10.2196/22333

_version_	1783620743623344128
author	Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong
author_facet	Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong
author_sort	Luo, Lingyun
collection	PubMed
description	BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for automatically generating the taxonomy and the partonomy structures among them, respectively. METHODS: Our approach comprises 2 main tasks: The first task is predicting the direct relation between 2 given concept names by utilizing word embedding methods and training 2 machine learning models, Convolutional Neural Networks (CNN) and Bidirectional Long Short-term Memory Networks (Bi-LSTM). The second task is the introduction of an original granularity-based method to identify the semantic structures among a group of given concept names by leveraging these trained models. RESULTS: Results show that both CNN and Bi-LSTM perform well on the first task, with F1 measures above 0.91. For the second task, our approach achieves an average F1 measure of 0.79 on 100 case studies in the FMA using Bi-LSTM, which outperforms the primitive pairwise-based method. CONCLUSIONS: We have investigated an automatic way of predicting a hierarchical relationship between 2 concept names; based on this, we have further invented a methodology to structure a group of concept names automatically. This study is an initial investigation that will shed light on further work on the automatic creation and enrichment of biomedical ontologies.
format	Online Article Text
id	pubmed-7725650
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-77256502020-12-30 Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong JMIR Med Inform Original Paper BACKGROUND: As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. OBJECTIVE: Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for automatically generating the taxonomy and the partonomy structures among them, respectively. METHODS: Our approach comprises 2 main tasks: The first task is predicting the direct relation between 2 given concept names by utilizing word embedding methods and training 2 machine learning models, Convolutional Neural Networks (CNN) and Bidirectional Long Short-term Memory Networks (Bi-LSTM). The second task is the introduction of an original granularity-based method to identify the semantic structures among a group of given concept names by leveraging these trained models. RESULTS: Results show that both CNN and Bi-LSTM perform well on the first task, with F1 measures above 0.91. For the second task, our approach achieves an average F1 measure of 0.79 on 100 case studies in the FMA using Bi-LSTM, which outperforms the primitive pairwise-based method. CONCLUSIONS: We have investigated an automatic way of predicting a hierarchical relationship between 2 concept names; based on this, we have further invented a methodology to structure a group of concept names automatically. This study is an initial investigation that will shed light on further work on the automatic creation and enrichment of biomedical ontologies. JMIR Publications 2020-11-25 /pmc/articles/PMC7725650/ /pubmed/33127601 http://dx.doi.org/10.2196/22333 Text en ©Lingyun Luo, Jingtao Feng, Huijun Yu, Jiaolong Wang. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 25.11.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Luo, Lingyun Feng, Jingtao Yu, Huijun Wang, Jiaolong Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title	Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_full	Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_fullStr	Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_full_unstemmed	Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_short	Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
title_sort	automatic structuring of ontology terms based on lexical granularity and machine learning: algorithm development and validation
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725650/ https://www.ncbi.nlm.nih.gov/pubmed/33127601 http://dx.doi.org/10.2196/22333
work_keys_str_mv	AT luolingyun automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation AT fengjingtao automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation AT yuhuijun automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation AT wangjiaolong automaticstructuringofontologytermsbasedonlexicalgranularityandmachinelearningalgorithmdevelopmentandvalidation

Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation

Ejemplares similares