Cargando…
Improving reference prioritisation with PICO recognition
BACKGROUND: Machine learning can assist with multiple tasks during systematic reviews to facilitate the rapid retrieval of relevant references during screening and to identify and extract information relevant to the study characteristics, which include the PICO elements of patient/population, interv...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6896258/ https://www.ncbi.nlm.nih.gov/pubmed/31805934 http://dx.doi.org/10.1186/s12911-019-0992-8 |
_version_ | 1783476739238789120 |
---|---|
author | Brockmeier, Austin J. Ju, Meizhi Przybyła, Piotr Ananiadou, Sophia |
author_facet | Brockmeier, Austin J. Ju, Meizhi Przybyła, Piotr Ananiadou, Sophia |
author_sort | Brockmeier, Austin J. |
collection | PubMed |
description | BACKGROUND: Machine learning can assist with multiple tasks during systematic reviews to facilitate the rapid retrieval of relevant references during screening and to identify and extract information relevant to the study characteristics, which include the PICO elements of patient/population, intervention, comparator, and outcomes. The latter requires techniques for identifying and categorising fragments of text, known as named entity recognition. METHODS: A publicly available corpus of PICO annotations on biomedical abstracts is used to train a named entity recognition model, which is implemented as a recurrent neural network. This model is then applied to a separate collection of abstracts for references from systematic reviews within biomedical and health domains. The occurrences of words tagged in the context of specific PICO contexts are used as additional features for a relevancy classification model. Simulations of the machine learning-assisted screening are used to evaluate the work saved by the relevancy model with and without the PICO features. Chi-squared and statistical significance of positive predicted values are used to identify words that are more indicative of relevancy within PICO contexts. RESULTS: Inclusion of PICO features improves the performance metric on 15 of the 20 collections, with substantial gains on certain systematic reviews. Examples of words whose PICO context are more precise can explain this increase. CONCLUSIONS: Words within PICO tagged segments in abstracts are predictive features for determining inclusion. Combining PICO annotation model into the relevancy classification pipeline is a promising approach. The annotations may be useful on their own to aid users in pinpointing necessary information for data extraction, or to facilitate semantic search. |
format | Online Article Text |
id | pubmed-6896258 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68962582019-12-11 Improving reference prioritisation with PICO recognition Brockmeier, Austin J. Ju, Meizhi Przybyła, Piotr Ananiadou, Sophia BMC Med Inform Decis Mak Technical Advance BACKGROUND: Machine learning can assist with multiple tasks during systematic reviews to facilitate the rapid retrieval of relevant references during screening and to identify and extract information relevant to the study characteristics, which include the PICO elements of patient/population, intervention, comparator, and outcomes. The latter requires techniques for identifying and categorising fragments of text, known as named entity recognition. METHODS: A publicly available corpus of PICO annotations on biomedical abstracts is used to train a named entity recognition model, which is implemented as a recurrent neural network. This model is then applied to a separate collection of abstracts for references from systematic reviews within biomedical and health domains. The occurrences of words tagged in the context of specific PICO contexts are used as additional features for a relevancy classification model. Simulations of the machine learning-assisted screening are used to evaluate the work saved by the relevancy model with and without the PICO features. Chi-squared and statistical significance of positive predicted values are used to identify words that are more indicative of relevancy within PICO contexts. RESULTS: Inclusion of PICO features improves the performance metric on 15 of the 20 collections, with substantial gains on certain systematic reviews. Examples of words whose PICO context are more precise can explain this increase. CONCLUSIONS: Words within PICO tagged segments in abstracts are predictive features for determining inclusion. Combining PICO annotation model into the relevancy classification pipeline is a promising approach. The annotations may be useful on their own to aid users in pinpointing necessary information for data extraction, or to facilitate semantic search. BioMed Central 2019-12-05 /pmc/articles/PMC6896258/ /pubmed/31805934 http://dx.doi.org/10.1186/s12911-019-0992-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Advance Brockmeier, Austin J. Ju, Meizhi Przybyła, Piotr Ananiadou, Sophia Improving reference prioritisation with PICO recognition |
title | Improving reference prioritisation with PICO recognition |
title_full | Improving reference prioritisation with PICO recognition |
title_fullStr | Improving reference prioritisation with PICO recognition |
title_full_unstemmed | Improving reference prioritisation with PICO recognition |
title_short | Improving reference prioritisation with PICO recognition |
title_sort | improving reference prioritisation with pico recognition |
topic | Technical Advance |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6896258/ https://www.ncbi.nlm.nih.gov/pubmed/31805934 http://dx.doi.org/10.1186/s12911-019-0992-8 |
work_keys_str_mv | AT brockmeieraustinj improvingreferenceprioritisationwithpicorecognition AT jumeizhi improvingreferenceprioritisationwithpicorecognition AT przybyłapiotr improvingreferenceprioritisationwithpicorecognition AT ananiadousophia improvingreferenceprioritisationwithpicorecognition |