Cargando…

Biomedical question answering using semantic relations

BACKGROUND: The proliferation of the scientific literature in the field of biomedicine makes it difficult to keep abreast of current knowledge, even for domain experts. While general Web search engines and specialized information retrieval (IR) systems have made important strides in recent decades,...

Descripción completa

Detalles Bibliográficos
Autores principales: Hristovski, Dimitar, Dinevski, Dejan, Kastrin, Andrej, Rindflesch, Thomas C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4307891/
https://www.ncbi.nlm.nih.gov/pubmed/25592675
http://dx.doi.org/10.1186/s12859-014-0365-3
_version_ 1782354512231530496
author Hristovski, Dimitar
Dinevski, Dejan
Kastrin, Andrej
Rindflesch, Thomas C
author_facet Hristovski, Dimitar
Dinevski, Dejan
Kastrin, Andrej
Rindflesch, Thomas C
author_sort Hristovski, Dimitar
collection PubMed
description BACKGROUND: The proliferation of the scientific literature in the field of biomedicine makes it difficult to keep abreast of current knowledge, even for domain experts. While general Web search engines and specialized information retrieval (IR) systems have made important strides in recent decades, the problem of accurate knowledge extraction from the biomedical literature is far from solved. Classical IR systems usually return a list of documents that have to be read by the user to extract relevant information. This tedious and time-consuming work can be lessened with automatic Question Answering (QA) systems, which aim to provide users with direct and precise answers to their questions. In this work we propose a novel methodology for QA based on semantic relations extracted from the biomedical literature. RESULTS: We extracted semantic relations with the SemRep natural language processing system from 122,421,765 sentences, which came from 21,014,382 MEDLINE citations (i.e., the complete MEDLINE distribution up to the end of 2012). A total of 58,879,300 semantic relation instances were extracted and organized in a relational database. The QA process is implemented as a search in this database, which is accessed through a Web-based application, called SemBT (available at http://sembt.mf.uni-lj.si). We conducted an extensive evaluation of the proposed methodology in order to estimate the accuracy of extracting a particular semantic relation from a particular sentence. Evaluation was performed by 80 domain experts. In total 7,510 semantic relation instances belonging to 2,675 distinct relations were evaluated 12,083 times. The instances were evaluated as correct 8,228 times (68%). CONCLUSIONS: In this work we propose an innovative methodology for biomedical QA. The system is implemented as a Web-based application that is able to provide precise answers to a wide range of questions. A typical question is answered within a few seconds. The tool has some extensions that make it especially useful for interpretation of DNA microarray results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0365-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4307891
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43078912015-01-28 Biomedical question answering using semantic relations Hristovski, Dimitar Dinevski, Dejan Kastrin, Andrej Rindflesch, Thomas C BMC Bioinformatics Research Article BACKGROUND: The proliferation of the scientific literature in the field of biomedicine makes it difficult to keep abreast of current knowledge, even for domain experts. While general Web search engines and specialized information retrieval (IR) systems have made important strides in recent decades, the problem of accurate knowledge extraction from the biomedical literature is far from solved. Classical IR systems usually return a list of documents that have to be read by the user to extract relevant information. This tedious and time-consuming work can be lessened with automatic Question Answering (QA) systems, which aim to provide users with direct and precise answers to their questions. In this work we propose a novel methodology for QA based on semantic relations extracted from the biomedical literature. RESULTS: We extracted semantic relations with the SemRep natural language processing system from 122,421,765 sentences, which came from 21,014,382 MEDLINE citations (i.e., the complete MEDLINE distribution up to the end of 2012). A total of 58,879,300 semantic relation instances were extracted and organized in a relational database. The QA process is implemented as a search in this database, which is accessed through a Web-based application, called SemBT (available at http://sembt.mf.uni-lj.si). We conducted an extensive evaluation of the proposed methodology in order to estimate the accuracy of extracting a particular semantic relation from a particular sentence. Evaluation was performed by 80 domain experts. In total 7,510 semantic relation instances belonging to 2,675 distinct relations were evaluated 12,083 times. The instances were evaluated as correct 8,228 times (68%). CONCLUSIONS: In this work we propose an innovative methodology for biomedical QA. The system is implemented as a Web-based application that is able to provide precise answers to a wide range of questions. A typical question is answered within a few seconds. The tool has some extensions that make it especially useful for interpretation of DNA microarray results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0365-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-01-16 /pmc/articles/PMC4307891/ /pubmed/25592675 http://dx.doi.org/10.1186/s12859-014-0365-3 Text en © Hristovski et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hristovski, Dimitar
Dinevski, Dejan
Kastrin, Andrej
Rindflesch, Thomas C
Biomedical question answering using semantic relations
title Biomedical question answering using semantic relations
title_full Biomedical question answering using semantic relations
title_fullStr Biomedical question answering using semantic relations
title_full_unstemmed Biomedical question answering using semantic relations
title_short Biomedical question answering using semantic relations
title_sort biomedical question answering using semantic relations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4307891/
https://www.ncbi.nlm.nih.gov/pubmed/25592675
http://dx.doi.org/10.1186/s12859-014-0365-3
work_keys_str_mv AT hristovskidimitar biomedicalquestionansweringusingsemanticrelations
AT dinevskidejan biomedicalquestionansweringusingsemanticrelations
AT kastrinandrej biomedicalquestionansweringusingsemanticrelations
AT rindfleschthomasc biomedicalquestionansweringusingsemanticrelations