
Combination of conditional random field with a rule based method in the extraction of PICO elements


Bibliographic Details
Main authors: Chabou, Samir; Iglewski, Michal
Format: Online Article Text
Language: English
Published: BioMed Central 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6278016/
https://www.ncbi.nlm.nih.gov/pubmed/30509272
http://dx.doi.org/10.1186/s12911-018-0699-2
author Chabou, Samir
Iglewski, Michal
collection PubMed
description BACKGROUND: Extracting primary care information in terms of Patient/Problem, Intervention, Comparison and Outcome, known as PICO elements, is difficult because the volume of medical information keeps expanding and health semantics are hard to capture from unstructured information. Combining machine learning methods (MLMs) with rule based methods (RBMs) could facilitate and improve PICO extraction. This paper studies methods for extracting PICO elements. The goal is to combine MLMs with RBMs to extract PICO elements from medical papers and so make it easier to answer clinical questions formulated with the PICO framework. METHODS: First, we analyze the aspects of the MLM model that influence the quality of PICO element extraction. Second, we combine the MLM approach with the RBMs in order to improve the PICO element retrieval process. We conduct our experiments on a corpus of 1000 abstracts. RESULTS: We obtain an F-score of 80% for the P element, 64% for the I element and 92% for the O element. Given that P and I elements account for only 6.5% and 5.8% of the sentences in the training corpus, respectively, the results are competitive with previously published ones. CONCLUSIONS: Our study of PICO element extraction shows that the task is very challenging. The MLMs tend to have an acceptable precision rate but a low recall rate when the corpus is not representative. The RBMs backed up the MLMs to increase the recall rate, and consequently the combination of the two methods gave better results.
format Online
Article
Text
id pubmed-6278016
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6278016 2018-12-06 Combination of conditional random field with a rule based method in the extraction of PICO elements Chabou, Samir Iglewski, Michal BMC Med Inform Decis Mak Research Article BioMed Central 2018-12-04 /pmc/articles/PMC6278016/ /pubmed/30509272 http://dx.doi.org/10.1186/s12911-018-0699-2 Text en © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Combination of conditional random field with a rule based method in the extraction of PICO elements
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6278016/
https://www.ncbi.nlm.nih.gov/pubmed/30509272
http://dx.doi.org/10.1186/s12911-018-0699-2
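
The description field above reports that rule based methods backed up the machine learning model to raise recall. A minimal sketch of that general idea follows, in Python. It is not the authors' implementation: the cue patterns in RULES, the helper names, the example sentences, and the stand-in ml_labels list (in place of real CRF output) are all assumptions made for illustration.

```python
import re
from typing import List, Optional, Tuple

# Hypothetical cue patterns, invented for this sketch; not the paper's rules.
RULES = {
    "P": re.compile(r"\b(patients?|participants?|subjects?)\b", re.I),
    "I": re.compile(r"\b(received|treated with|randomi[sz]ed to)\b", re.I),
    "O": re.compile(r"\b(outcomes?|mortality|improved|reduced)\b", re.I),
}


def rule_label(sentence: str) -> Optional[str]:
    """Return the first PICO label whose cue pattern matches, else None."""
    for label, pattern in RULES.items():
        if pattern.search(sentence):
            return label
    return None


def combine(sentences: List[str], ml_labels: List[Optional[str]]) -> List[Optional[str]]:
    """Keep the ML label when there is one; otherwise fall back to the rules."""
    return [ml if ml is not None else rule_label(s)
            for s, ml in zip(sentences, ml_labels)]


def prf(gold: List[Optional[str]], pred: List[Optional[str]],
        label: str) -> Tuple[float, float, float]:
    """Precision, recall and F-score for one label over aligned sentences."""
    tp = sum(g == label and p == label for g, p in zip(gold, pred))
    fp = sum(g != label and p == label for g, p in zip(gold, pred))
    fn = sum(g == label and p != label for g, p in zip(gold, pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Made-up example data; ml_labels imitates a model that misses I and O.
sentences = [
    "Sixty patients with asthma were enrolled.",
    "The treatment group received inhaled budesonide.",
    "Mortality was reduced at 12 months.",
    "Funding was provided by a public grant.",
]
gold = ["P", "I", "O", None]
ml_labels = ["P", None, None, None]

combined = combine(sentences, ml_labels)
print("ML only, I element:", prf(gold, ml_labels, "I"))
print("ML + rules, I element:", prf(gold, combined, "I"))
```

In this toy data the rules only fill in sentences the model left unlabeled, so recall can rise without overriding any model decision. A real system would also have to handle rules that fire wrongly, which lowers precision.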