Cargando…

Applying machine classifiers to update searches: Analysis from two case studies

Manual screening of citation records could be reduced by using machine classifiers to remove records of very low relevance. This seems particularly feasible for update searches, where a machine classifier can be trained from past screening decisions. However, feasibility is unclear for broad topics....

Descripción completa

Detalles Bibliográficos
Autores principales:	Stansfield, Claire, Stokes, Gillian, Thomas, James
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2021
Materias:	Research Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9299040/ https://www.ncbi.nlm.nih.gov/pubmed/34747151 http://dx.doi.org/10.1002/jrsm.1537

_version_	1784750853919342592
author	Stansfield, Claire Stokes, Gillian Thomas, James
author_facet	Stansfield, Claire Stokes, Gillian Thomas, James
author_sort	Stansfield, Claire
collection	PubMed
description	Manual screening of citation records could be reduced by using machine classifiers to remove records of very low relevance. This seems particularly feasible for update searches, where a machine classifier can be trained from past screening decisions. However, feasibility is unclear for broad topics. We evaluate the performance and implementation of machine classifiers for update searches of public health research using two case studies. The first study evaluates the impact of using different sets of training data on classifier performance, comparing recall and screening reduction with a manual screening ‘gold standard’. The second study uses screening decisions from a review to train a classifier that is applied to rank the update search results. A stopping threshold was applied in the absence of a gold standard. Time spent screening titles and abstracts of different relevancy‐ranked records was measured. Results: Study one: Classifier performance varies according to the training data used; all custom‐built classifiers had a recall above 93% at the same threshold, achieving screening reductions between 41% and 74%. Study two: applying a classifier provided a solution for tackling a large volume of search results from the update search, and screening volume was reduced by 61%. A tentative estimate indicates over 25 h screening time was saved. In conclusion, custom‐built machine classifiers are feasible for reducing screening workload from update searches across a range of public health interventions, with some limitation on recall. Key considerations include selecting a training dataset, agreeing stopping thresholds and processes to ensure smooth workflows.
format	Online Article Text
id	pubmed-9299040
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-92990402022-07-21 Applying machine classifiers to update searches: Analysis from two case studies Stansfield, Claire Stokes, Gillian Thomas, James Res Synth Methods Research Articles Manual screening of citation records could be reduced by using machine classifiers to remove records of very low relevance. This seems particularly feasible for update searches, where a machine classifier can be trained from past screening decisions. However, feasibility is unclear for broad topics. We evaluate the performance and implementation of machine classifiers for update searches of public health research using two case studies. The first study evaluates the impact of using different sets of training data on classifier performance, comparing recall and screening reduction with a manual screening ‘gold standard’. The second study uses screening decisions from a review to train a classifier that is applied to rank the update search results. A stopping threshold was applied in the absence of a gold standard. Time spent screening titles and abstracts of different relevancy‐ranked records was measured. Results: Study one: Classifier performance varies according to the training data used; all custom‐built classifiers had a recall above 93% at the same threshold, achieving screening reductions between 41% and 74%. Study two: applying a classifier provided a solution for tackling a large volume of search results from the update search, and screening volume was reduced by 61%. A tentative estimate indicates over 25 h screening time was saved. In conclusion, custom‐built machine classifiers are feasible for reducing screening workload from update searches across a range of public health interventions, with some limitation on recall. Key considerations include selecting a training dataset, agreeing stopping thresholds and processes to ensure smooth workflows. John Wiley and Sons Inc. 2021-11-25 2022-01 /pmc/articles/PMC9299040/ /pubmed/34747151 http://dx.doi.org/10.1002/jrsm.1537 Text en © 2021 The Authors. Research Synthesis Methods published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Articles Stansfield, Claire Stokes, Gillian Thomas, James Applying machine classifiers to update searches: Analysis from two case studies
title	Applying machine classifiers to update searches: Analysis from two case studies
title_full	Applying machine classifiers to update searches: Analysis from two case studies
title_fullStr	Applying machine classifiers to update searches: Analysis from two case studies
title_full_unstemmed	Applying machine classifiers to update searches: Analysis from two case studies
title_short	Applying machine classifiers to update searches: Analysis from two case studies
title_sort	applying machine classifiers to update searches: analysis from two case studies
topic	Research Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9299040/ https://www.ncbi.nlm.nih.gov/pubmed/34747151 http://dx.doi.org/10.1002/jrsm.1537
work_keys_str_mv	AT stansfieldclaire applyingmachineclassifierstoupdatesearchesanalysisfromtwocasestudies AT stokesgillian applyingmachineclassifierstoupdatesearchesanalysisfromtwocasestudies AT thomasjames applyingmachineclassifierstoupdatesearchesanalysisfromtwocasestudies

Applying machine classifiers to update searches: Analysis from two case studies

Ejemplares similares