Cargando…
Combination of conditional random field with a rule based method in the extraction of PICO elements
BACKGROUND: Extracting primary care information in terms of Patient/Problem, Intervention, Comparison and Outcome, known as PICO elements, is difficult as the volume of medical information expands and the health semantics is complex to capture it from unstructured information. The combination of the...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6278016/ https://www.ncbi.nlm.nih.gov/pubmed/30509272 http://dx.doi.org/10.1186/s12911-018-0699-2 |
_version_ | 1783378266916126720 |
---|---|
author | Chabou, Samir Iglewski, Michal |
author_facet | Chabou, Samir Iglewski, Michal |
author_sort | Chabou, Samir |
collection | PubMed |
description | BACKGROUND: Extracting primary care information in terms of Patient/Problem, Intervention, Comparison and Outcome, known as PICO elements, is difficult as the volume of medical information expands and the health semantics is complex to capture it from unstructured information. The combination of the machine learning methods (MLMs) with rule based methods (RBMs) could facilitate and improve the PICO extraction. This paper studies the PICO elements extraction methods. The goal is to combine the MLMs with the RBMs to extract PICO elements in medical papers to facilitate answering clinical questions formulated with the PICO framework. METHODS: First, we analyze the aspects of the MLM model that influence the quality of the PICO elements extraction. Secondly, we combine the MLM approach with the RBMs in order to improve the PICO elements retrieval process. To conduct our experiments, we use a corpus of 1000 abstracts. RESULTS: We obtain an F-score of 80% for P element, 64% for the I element and 92% for the O element. Given the nature of the used training corpus where P and I elements represent respectively only 6.5 and 5.8% of total sentences, the results are competitive with previously published ones. CONCLUSIONS: Our study of the PICO element extraction shows that the task is very challenging. The MLMs tend to have an acceptable precision rate but they have a low recall rate when the corpus is not representative. The RBMs backed up the MLMs to increase the recall rate and consequently the combination of the two methods gave better results. |
format | Online Article Text |
id | pubmed-6278016 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62780162018-12-06 Combination of conditional random field with a rule based method in the extraction of PICO elements Chabou, Samir Iglewski, Michal BMC Med Inform Decis Mak Research Article BACKGROUND: Extracting primary care information in terms of Patient/Problem, Intervention, Comparison and Outcome, known as PICO elements, is difficult as the volume of medical information expands and the health semantics is complex to capture it from unstructured information. The combination of the machine learning methods (MLMs) with rule based methods (RBMs) could facilitate and improve the PICO extraction. This paper studies the PICO elements extraction methods. The goal is to combine the MLMs with the RBMs to extract PICO elements in medical papers to facilitate answering clinical questions formulated with the PICO framework. METHODS: First, we analyze the aspects of the MLM model that influence the quality of the PICO elements extraction. Secondly, we combine the MLM approach with the RBMs in order to improve the PICO elements retrieval process. To conduct our experiments, we use a corpus of 1000 abstracts. RESULTS: We obtain an F-score of 80% for P element, 64% for the I element and 92% for the O element. Given the nature of the used training corpus where P and I elements represent respectively only 6.5 and 5.8% of total sentences, the results are competitive with previously published ones. CONCLUSIONS: Our study of the PICO element extraction shows that the task is very challenging. The MLMs tend to have an acceptable precision rate but they have a low recall rate when the corpus is not representative. The RBMs backed up the MLMs to increase the recall rate and consequently the combination of the two methods gave better results. BioMed Central 2018-12-04 /pmc/articles/PMC6278016/ /pubmed/30509272 http://dx.doi.org/10.1186/s12911-018-0699-2 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Chabou, Samir Iglewski, Michal Combination of conditional random field with a rule based method in the extraction of PICO elements |
title | Combination of conditional random field with a rule based method in the extraction of PICO elements |
title_full | Combination of conditional random field with a rule based method in the extraction of PICO elements |
title_fullStr | Combination of conditional random field with a rule based method in the extraction of PICO elements |
title_full_unstemmed | Combination of conditional random field with a rule based method in the extraction of PICO elements |
title_short | Combination of conditional random field with a rule based method in the extraction of PICO elements |
title_sort | combination of conditional random field with a rule based method in the extraction of pico elements |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6278016/ https://www.ncbi.nlm.nih.gov/pubmed/30509272 http://dx.doi.org/10.1186/s12911-018-0699-2 |
work_keys_str_mv | AT chabousamir combinationofconditionalrandomfieldwitharulebasedmethodintheextractionofpicoelements AT iglewskimichal combinationofconditionalrandomfieldwitharulebasedmethodintheextractionofpicoelements |