Cargando…
Semantic biomedical resource discovery: a Natural Language Processing framework
BACKGROUND: A plethora of publicly available biomedical resources do currently exist and are constantly increasing at a fast rate. In parallel, specialized repositories are been developed, indexing numerous clinical and biomedical tools. The main drawback of such repositories is the difficulty in lo...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4591066/ https://www.ncbi.nlm.nih.gov/pubmed/26423616 http://dx.doi.org/10.1186/s12911-015-0200-4 |
_version_ | 1782393027391651840 |
---|---|
author | Sfakianaki, Pepi Koumakis, Lefteris Sfakianakis, Stelios Iatraki, Galatia Zacharioudakis, Giorgos Graf, Norbert Marias, Kostas Tsiknakis, Manolis |
author_facet | Sfakianaki, Pepi Koumakis, Lefteris Sfakianakis, Stelios Iatraki, Galatia Zacharioudakis, Giorgos Graf, Norbert Marias, Kostas Tsiknakis, Manolis |
author_sort | Sfakianaki, Pepi |
collection | PubMed |
description | BACKGROUND: A plethora of publicly available biomedical resources do currently exist and are constantly increasing at a fast rate. In parallel, specialized repositories are been developed, indexing numerous clinical and biomedical tools. The main drawback of such repositories is the difficulty in locating appropriate resources for a clinical or biomedical decision task, especially for non-Information Technology expert users. In parallel, although NLP research in the clinical domain has been active since the 1960s, progress in the development of NLP applications has been slow and lags behind progress in the general NLP domain. The aim of the present study is to investigate the use of semantics for biomedical resources annotation with domain specific ontologies and exploit Natural Language Processing methods in empowering the non-Information Technology expert users to efficiently search for biomedical resources using natural language. METHODS: A Natural Language Processing engine which can “translate” free text into targeted queries, automatically transforming a clinical research question into a request description that contains only terms of ontologies, has been implemented. The implementation is based on information extraction techniques for text in natural language, guided by integrated ontologies. Furthermore, knowledge from robust text mining methods has been incorporated to map descriptions into suitable domain ontologies in order to ensure that the biomedical resources descriptions are domain oriented and enhance the accuracy of services discovery. The framework is freely available as a web application at (http://calchas.ics.forth.gr/). RESULTS: For our experiments, a range of clinical questions were established based on descriptions of clinical trials from the ClinicalTrials.gov registry as well as recommendations from clinicians. Domain experts manually identified the available tools in a tools repository which are suitable for addressing the clinical questions at hand, either individually or as a set of tools forming a computational pipeline. The results were compared with those obtained from an automated discovery of candidate biomedical tools. For the evaluation of the results, precision and recall measurements were used. Our results indicate that the proposed framework has a high precision and low recall, implying that the system returns essentially more relevant results than irrelevant. CONCLUSIONS: There are adequate biomedical ontologies already available, sufficiency of existing NLP tools and quality of biomedical annotation systems for the implementation of a biomedical resources discovery framework, based on the semantic annotation of resources and the use on NLP techniques. The results of the present study demonstrate the clinical utility of the application of the proposed framework which aims to bridge the gap between clinical question in natural language and efficient dynamic biomedical resources discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-015-0200-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4591066 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45910662015-10-03 Semantic biomedical resource discovery: a Natural Language Processing framework Sfakianaki, Pepi Koumakis, Lefteris Sfakianakis, Stelios Iatraki, Galatia Zacharioudakis, Giorgos Graf, Norbert Marias, Kostas Tsiknakis, Manolis BMC Med Inform Decis Mak Research Article BACKGROUND: A plethora of publicly available biomedical resources do currently exist and are constantly increasing at a fast rate. In parallel, specialized repositories are been developed, indexing numerous clinical and biomedical tools. The main drawback of such repositories is the difficulty in locating appropriate resources for a clinical or biomedical decision task, especially for non-Information Technology expert users. In parallel, although NLP research in the clinical domain has been active since the 1960s, progress in the development of NLP applications has been slow and lags behind progress in the general NLP domain. The aim of the present study is to investigate the use of semantics for biomedical resources annotation with domain specific ontologies and exploit Natural Language Processing methods in empowering the non-Information Technology expert users to efficiently search for biomedical resources using natural language. METHODS: A Natural Language Processing engine which can “translate” free text into targeted queries, automatically transforming a clinical research question into a request description that contains only terms of ontologies, has been implemented. The implementation is based on information extraction techniques for text in natural language, guided by integrated ontologies. Furthermore, knowledge from robust text mining methods has been incorporated to map descriptions into suitable domain ontologies in order to ensure that the biomedical resources descriptions are domain oriented and enhance the accuracy of services discovery. The framework is freely available as a web application at (http://calchas.ics.forth.gr/). RESULTS: For our experiments, a range of clinical questions were established based on descriptions of clinical trials from the ClinicalTrials.gov registry as well as recommendations from clinicians. Domain experts manually identified the available tools in a tools repository which are suitable for addressing the clinical questions at hand, either individually or as a set of tools forming a computational pipeline. The results were compared with those obtained from an automated discovery of candidate biomedical tools. For the evaluation of the results, precision and recall measurements were used. Our results indicate that the proposed framework has a high precision and low recall, implying that the system returns essentially more relevant results than irrelevant. CONCLUSIONS: There are adequate biomedical ontologies already available, sufficiency of existing NLP tools and quality of biomedical annotation systems for the implementation of a biomedical resources discovery framework, based on the semantic annotation of resources and the use on NLP techniques. The results of the present study demonstrate the clinical utility of the application of the proposed framework which aims to bridge the gap between clinical question in natural language and efficient dynamic biomedical resources discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-015-0200-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-30 /pmc/articles/PMC4591066/ /pubmed/26423616 http://dx.doi.org/10.1186/s12911-015-0200-4 Text en © Sfakianaki et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Sfakianaki, Pepi Koumakis, Lefteris Sfakianakis, Stelios Iatraki, Galatia Zacharioudakis, Giorgos Graf, Norbert Marias, Kostas Tsiknakis, Manolis Semantic biomedical resource discovery: a Natural Language Processing framework |
title | Semantic biomedical resource discovery: a Natural Language Processing framework |
title_full | Semantic biomedical resource discovery: a Natural Language Processing framework |
title_fullStr | Semantic biomedical resource discovery: a Natural Language Processing framework |
title_full_unstemmed | Semantic biomedical resource discovery: a Natural Language Processing framework |
title_short | Semantic biomedical resource discovery: a Natural Language Processing framework |
title_sort | semantic biomedical resource discovery: a natural language processing framework |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4591066/ https://www.ncbi.nlm.nih.gov/pubmed/26423616 http://dx.doi.org/10.1186/s12911-015-0200-4 |
work_keys_str_mv | AT sfakianakipepi semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework AT koumakislefteris semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework AT sfakianakisstelios semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework AT iatrakigalatia semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework AT zacharioudakisgiorgos semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework AT grafnorbert semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework AT mariaskostas semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework AT tsiknakismanolis semanticbiomedicalresourcediscoveryanaturallanguageprocessingframework |