Cargando…

Towards natural language question generation for the validation of ontologies and mappings

BACKGROUND: The increasing number of open-access ontologies and their key role in several applications such as decision-support systems highlight the importance of their validation. Human expertise is crucial for the validation of ontologies from a domain point-of-view. However, the growing number o...

Descripción completa

Detalles Bibliográficos
Autores principales: Ben Abacha, Asma, Dos Reis, Julio Cesar, Mrabet, Yassine, Pruski, Cédric, Da Silveira, Marcos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4976517/
https://www.ncbi.nlm.nih.gov/pubmed/27502477
http://dx.doi.org/10.1186/s13326-016-0089-6
_version_ 1782446884080582656
author Ben Abacha, Asma
Dos Reis, Julio Cesar
Mrabet, Yassine
Pruski, Cédric
Da Silveira, Marcos
author_facet Ben Abacha, Asma
Dos Reis, Julio Cesar
Mrabet, Yassine
Pruski, Cédric
Da Silveira, Marcos
author_sort Ben Abacha, Asma
collection PubMed
description BACKGROUND: The increasing number of open-access ontologies and their key role in several applications such as decision-support systems highlight the importance of their validation. Human expertise is crucial for the validation of ontologies from a domain point-of-view. However, the growing number of ontologies and their fast evolution over time make manual validation challenging. METHODS: We propose a novel semi-automatic approach based on the generation of natural language (NL) questions to support the validation of ontologies and their evolution. The proposed approach includes the automatic generation, factorization and ordering of NL questions from medical ontologies. The final validation and correction is performed by submitting these questions to domain experts and automatically analyzing their feedback. We also propose a second approach for the validation of mappings impacted by ontology changes. The method exploits the context of the changes to propose correction alternatives presented as Multiple Choice Questions. RESULTS: This research provides a question optimization strategy to maximize the validation of ontology entities with a reduced number of questions. We evaluate our approach for the validation of three medical ontologies. We also evaluate the feasibility and efficiency of our mappings validation approach in the context of ontology evolution. These experiments are performed with different versions of SNOMED-CT and ICD9. CONCLUSIONS: The obtained experimental results suggest the feasibility and adequacy of our approach to support the validation of interconnected and evolving ontologies. Results also suggest that taking into account RDFS and OWL entailment helps reducing the number of questions and validation time. The application of our approach to validate mapping evolution also shows the difficulty of adapting mapping evolution over time and highlights the importance of semi-automatic validation.
format Online
Article
Text
id pubmed-4976517
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49765172016-08-09 Towards natural language question generation for the validation of ontologies and mappings Ben Abacha, Asma Dos Reis, Julio Cesar Mrabet, Yassine Pruski, Cédric Da Silveira, Marcos J Biomed Semantics Research BACKGROUND: The increasing number of open-access ontologies and their key role in several applications such as decision-support systems highlight the importance of their validation. Human expertise is crucial for the validation of ontologies from a domain point-of-view. However, the growing number of ontologies and their fast evolution over time make manual validation challenging. METHODS: We propose a novel semi-automatic approach based on the generation of natural language (NL) questions to support the validation of ontologies and their evolution. The proposed approach includes the automatic generation, factorization and ordering of NL questions from medical ontologies. The final validation and correction is performed by submitting these questions to domain experts and automatically analyzing their feedback. We also propose a second approach for the validation of mappings impacted by ontology changes. The method exploits the context of the changes to propose correction alternatives presented as Multiple Choice Questions. RESULTS: This research provides a question optimization strategy to maximize the validation of ontology entities with a reduced number of questions. We evaluate our approach for the validation of three medical ontologies. We also evaluate the feasibility and efficiency of our mappings validation approach in the context of ontology evolution. These experiments are performed with different versions of SNOMED-CT and ICD9. CONCLUSIONS: The obtained experimental results suggest the feasibility and adequacy of our approach to support the validation of interconnected and evolving ontologies. Results also suggest that taking into account RDFS and OWL entailment helps reducing the number of questions and validation time. The application of our approach to validate mapping evolution also shows the difficulty of adapting mapping evolution over time and highlights the importance of semi-automatic validation. BioMed Central 2016-08-08 /pmc/articles/PMC4976517/ /pubmed/27502477 http://dx.doi.org/10.1186/s13326-016-0089-6 Text en © Ben Abacha et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ben Abacha, Asma
Dos Reis, Julio Cesar
Mrabet, Yassine
Pruski, Cédric
Da Silveira, Marcos
Towards natural language question generation for the validation of ontologies and mappings
title Towards natural language question generation for the validation of ontologies and mappings
title_full Towards natural language question generation for the validation of ontologies and mappings
title_fullStr Towards natural language question generation for the validation of ontologies and mappings
title_full_unstemmed Towards natural language question generation for the validation of ontologies and mappings
title_short Towards natural language question generation for the validation of ontologies and mappings
title_sort towards natural language question generation for the validation of ontologies and mappings
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4976517/
https://www.ncbi.nlm.nih.gov/pubmed/27502477
http://dx.doi.org/10.1186/s13326-016-0089-6
work_keys_str_mv AT benabachaasma towardsnaturallanguagequestiongenerationforthevalidationofontologiesandmappings
AT dosreisjuliocesar towardsnaturallanguagequestiongenerationforthevalidationofontologiesandmappings
AT mrabetyassine towardsnaturallanguagequestiongenerationforthevalidationofontologiesandmappings
AT pruskicedric towardsnaturallanguagequestiongenerationforthevalidationofontologiesandmappings
AT dasilveiramarcos towardsnaturallanguagequestiongenerationforthevalidationofontologiesandmappings