Cargando…

SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation

BACKGROUND: In the screening phase of systematic review, researchers use detailed inclusion/exclusion criteria to decide whether each article in a set of candidate articles is relevant to the research question under consideration. A typical review may require screening thousands or tens of thousands...

Descripción completa

Detalles Bibliográficos
Autores principales: Howard, Brian E., Phillips, Jason, Tandon, Arpit, Maharana, Adyasha, Elmore, Rebecca, Mav, Deepak, Sedykh, Alex, Thayer, Kristina, Merrick, B. Alex, Walker, Vickie, Rooney, Andrew, Shah, Ruchir R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8082972/
https://www.ncbi.nlm.nih.gov/pubmed/32203803
http://dx.doi.org/10.1016/j.envint.2020.105623
_version_ 1783685939878428672
author Howard, Brian E.
Phillips, Jason
Tandon, Arpit
Maharana, Adyasha
Elmore, Rebecca
Mav, Deepak
Sedykh, Alex
Thayer, Kristina
Merrick, B. Alex
Walker, Vickie
Rooney, Andrew
Shah, Ruchir R.
author_facet Howard, Brian E.
Phillips, Jason
Tandon, Arpit
Maharana, Adyasha
Elmore, Rebecca
Mav, Deepak
Sedykh, Alex
Thayer, Kristina
Merrick, B. Alex
Walker, Vickie
Rooney, Andrew
Shah, Ruchir R.
author_sort Howard, Brian E.
collection PubMed
description BACKGROUND: In the screening phase of systematic review, researchers use detailed inclusion/exclusion criteria to decide whether each article in a set of candidate articles is relevant to the research question under consideration. A typical review may require screening thousands or tens of thousands of articles in and can utilize hundreds of person-hours of labor. METHODS: Here we introduce SWIFT-Active Screener, a web-based, collaborative systematic review software application, designed to reduce the overall screening burden required during this resource-intensive phase of the review process. To prioritize articles for review, SWIFT-Active Screener uses active learning, a type of machine learning that incorporates user feedback during screening. Meanwhile, a negative binomial model is employed to estimate the number of relevant articles remaining in the unscreened document list. Using a simulation involving 26 diverse systematic review datasets that were previously screened by reviewers, we evaluated both the document prioritization and recall estimation methods. RESULTS: On average, 95% of the relevant articles were identified after screening only 40% of the total reference list. In the 5 document sets with 5,000 or more references, 95% recall was achieved after screening only 34% of the available references, on average. Furthermore, the recall estimator we have proposed provides a useful, conservative estimate of the percentage of relevant documents identified during the screening process. CONCLUSION: SWIFT-Active Screener can result in significant time savings compared to traditional screening and the savings are increased for larger project sizes. Moreover, the integration of explicit recall estimation during screening solves an important challenge faced by all machine learning systems for document screening: when to stop screening a prioritized reference list. The software is currently available in the form of a multi-user, collaborative, online web application.
format Online
Article
Text
id pubmed-8082972
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-80829722021-04-29 SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation Howard, Brian E. Phillips, Jason Tandon, Arpit Maharana, Adyasha Elmore, Rebecca Mav, Deepak Sedykh, Alex Thayer, Kristina Merrick, B. Alex Walker, Vickie Rooney, Andrew Shah, Ruchir R. Environ Int Article BACKGROUND: In the screening phase of systematic review, researchers use detailed inclusion/exclusion criteria to decide whether each article in a set of candidate articles is relevant to the research question under consideration. A typical review may require screening thousands or tens of thousands of articles in and can utilize hundreds of person-hours of labor. METHODS: Here we introduce SWIFT-Active Screener, a web-based, collaborative systematic review software application, designed to reduce the overall screening burden required during this resource-intensive phase of the review process. To prioritize articles for review, SWIFT-Active Screener uses active learning, a type of machine learning that incorporates user feedback during screening. Meanwhile, a negative binomial model is employed to estimate the number of relevant articles remaining in the unscreened document list. Using a simulation involving 26 diverse systematic review datasets that were previously screened by reviewers, we evaluated both the document prioritization and recall estimation methods. RESULTS: On average, 95% of the relevant articles were identified after screening only 40% of the total reference list. In the 5 document sets with 5,000 or more references, 95% recall was achieved after screening only 34% of the available references, on average. Furthermore, the recall estimator we have proposed provides a useful, conservative estimate of the percentage of relevant documents identified during the screening process. CONCLUSION: SWIFT-Active Screener can result in significant time savings compared to traditional screening and the savings are increased for larger project sizes. Moreover, the integration of explicit recall estimation during screening solves an important challenge faced by all machine learning systems for document screening: when to stop screening a prioritized reference list. The software is currently available in the form of a multi-user, collaborative, online web application. 2020-03-20 2020-05 /pmc/articles/PMC8082972/ /pubmed/32203803 http://dx.doi.org/10.1016/j.envint.2020.105623 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/BY-NC-ND/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ).
spellingShingle Article
Howard, Brian E.
Phillips, Jason
Tandon, Arpit
Maharana, Adyasha
Elmore, Rebecca
Mav, Deepak
Sedykh, Alex
Thayer, Kristina
Merrick, B. Alex
Walker, Vickie
Rooney, Andrew
Shah, Ruchir R.
SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation
title SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation
title_full SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation
title_fullStr SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation
title_full_unstemmed SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation
title_short SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation
title_sort swift-active screener: accelerated document screening through active learning and integrated recall estimation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8082972/
https://www.ncbi.nlm.nih.gov/pubmed/32203803
http://dx.doi.org/10.1016/j.envint.2020.105623
work_keys_str_mv AT howardbriane swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT phillipsjason swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT tandonarpit swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT maharanaadyasha swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT elmorerebecca swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT mavdeepak swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT sedykhalex swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT thayerkristina swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT merrickbalex swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT walkervickie swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT rooneyandrew swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation
AT shahruchirr swiftactivescreeneraccelerateddocumentscreeningthroughactivelearningandintegratedrecallestimation