Cargando…

Improving reference prioritisation with PICO recognition

BACKGROUND: Machine learning can assist with multiple tasks during systematic reviews to facilitate the rapid retrieval of relevant references during screening and to identify and extract information relevant to the study characteristics, which include the PICO elements of patient/population, interv...

Descripción completa

Detalles Bibliográficos
Autores principales: Brockmeier, Austin J., Ju, Meizhi, Przybyła, Piotr, Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6896258/
https://www.ncbi.nlm.nih.gov/pubmed/31805934
http://dx.doi.org/10.1186/s12911-019-0992-8
_version_ 1783476739238789120
author Brockmeier, Austin J.
Ju, Meizhi
Przybyła, Piotr
Ananiadou, Sophia
author_facet Brockmeier, Austin J.
Ju, Meizhi
Przybyła, Piotr
Ananiadou, Sophia
author_sort Brockmeier, Austin J.
collection PubMed
description BACKGROUND: Machine learning can assist with multiple tasks during systematic reviews to facilitate the rapid retrieval of relevant references during screening and to identify and extract information relevant to the study characteristics, which include the PICO elements of patient/population, intervention, comparator, and outcomes. The latter requires techniques for identifying and categorising fragments of text, known as named entity recognition. METHODS: A publicly available corpus of PICO annotations on biomedical abstracts is used to train a named entity recognition model, which is implemented as a recurrent neural network. This model is then applied to a separate collection of abstracts for references from systematic reviews within biomedical and health domains. The occurrences of words tagged in the context of specific PICO contexts are used as additional features for a relevancy classification model. Simulations of the machine learning-assisted screening are used to evaluate the work saved by the relevancy model with and without the PICO features. Chi-squared and statistical significance of positive predicted values are used to identify words that are more indicative of relevancy within PICO contexts. RESULTS: Inclusion of PICO features improves the performance metric on 15 of the 20 collections, with substantial gains on certain systematic reviews. Examples of words whose PICO context are more precise can explain this increase. CONCLUSIONS: Words within PICO tagged segments in abstracts are predictive features for determining inclusion. Combining PICO annotation model into the relevancy classification pipeline is a promising approach. The annotations may be useful on their own to aid users in pinpointing necessary information for data extraction, or to facilitate semantic search.
format Online
Article
Text
id pubmed-6896258
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68962582019-12-11 Improving reference prioritisation with PICO recognition Brockmeier, Austin J. Ju, Meizhi Przybyła, Piotr Ananiadou, Sophia BMC Med Inform Decis Mak Technical Advance BACKGROUND: Machine learning can assist with multiple tasks during systematic reviews to facilitate the rapid retrieval of relevant references during screening and to identify and extract information relevant to the study characteristics, which include the PICO elements of patient/population, intervention, comparator, and outcomes. The latter requires techniques for identifying and categorising fragments of text, known as named entity recognition. METHODS: A publicly available corpus of PICO annotations on biomedical abstracts is used to train a named entity recognition model, which is implemented as a recurrent neural network. This model is then applied to a separate collection of abstracts for references from systematic reviews within biomedical and health domains. The occurrences of words tagged in the context of specific PICO contexts are used as additional features for a relevancy classification model. Simulations of the machine learning-assisted screening are used to evaluate the work saved by the relevancy model with and without the PICO features. Chi-squared and statistical significance of positive predicted values are used to identify words that are more indicative of relevancy within PICO contexts. RESULTS: Inclusion of PICO features improves the performance metric on 15 of the 20 collections, with substantial gains on certain systematic reviews. Examples of words whose PICO context are more precise can explain this increase. CONCLUSIONS: Words within PICO tagged segments in abstracts are predictive features for determining inclusion. Combining PICO annotation model into the relevancy classification pipeline is a promising approach. The annotations may be useful on their own to aid users in pinpointing necessary information for data extraction, or to facilitate semantic search. BioMed Central 2019-12-05 /pmc/articles/PMC6896258/ /pubmed/31805934 http://dx.doi.org/10.1186/s12911-019-0992-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Advance
Brockmeier, Austin J.
Ju, Meizhi
Przybyła, Piotr
Ananiadou, Sophia
Improving reference prioritisation with PICO recognition
title Improving reference prioritisation with PICO recognition
title_full Improving reference prioritisation with PICO recognition
title_fullStr Improving reference prioritisation with PICO recognition
title_full_unstemmed Improving reference prioritisation with PICO recognition
title_short Improving reference prioritisation with PICO recognition
title_sort improving reference prioritisation with pico recognition
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6896258/
https://www.ncbi.nlm.nih.gov/pubmed/31805934
http://dx.doi.org/10.1186/s12911-019-0992-8
work_keys_str_mv AT brockmeieraustinj improvingreferenceprioritisationwithpicorecognition
AT jumeizhi improvingreferenceprioritisationwithpicorecognition
AT przybyłapiotr improvingreferenceprioritisationwithpicorecognition
AT ananiadousophia improvingreferenceprioritisationwithpicorecognition