Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error

BACKGROUND: Here, we outline a method of applying existing machine learning (ML) approaches to aid citation screening in an on-going broad and shallow systematic review of preclinical animal studies. The aim is to achieve a high-performing algorithm comparable to human screening that can reduce huma...

Bibliographic Details
Main Authors:	Bannach-Brown, Alexandra; Przybyła, Piotr; Thomas, James; Rice, Andrew S. C.; Ananiadou, Sophia; Liao, Jing; Macleod, Malcolm Robert
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2019
Subjects:	Methodology
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6334440/ https://www.ncbi.nlm.nih.gov/pubmed/30646959 http://dx.doi.org/10.1186/s13643-019-0942-7

_version_	1783387716708204544
author	Bannach-Brown, Alexandra; Przybyła, Piotr; Thomas, James; Rice, Andrew S. C.; Ananiadou, Sophia; Liao, Jing; Macleod, Malcolm Robert
author_facet	Bannach-Brown, Alexandra; Przybyła, Piotr; Thomas, James; Rice, Andrew S. C.; Ananiadou, Sophia; Liao, Jing; Macleod, Malcolm Robert
author_sort	Bannach-Brown, Alexandra
collection	PubMed
description	BACKGROUND: Here, we outline a method of applying existing machine learning (ML) approaches to aid citation screening in an on-going broad and shallow systematic review of preclinical animal studies. The aim is to achieve a high-performing algorithm comparable to human screening that can reduce human resources required for carrying out this step of a systematic review. METHODS: We applied ML approaches to a broad systematic review of animal models of depression at the citation screening stage. We tested two independently developed ML approaches which used different classification models and feature sets. We recorded the performance of the ML approaches on an unseen validation set of papers using sensitivity, specificity and accuracy. We aimed to achieve 95% sensitivity and to maximise specificity. The classification model providing the most accurate predictions was applied to the remaining unseen records in the dataset and will be used in the next stage of the preclinical biomedical sciences systematic review. We used a cross-validation technique to assign ML inclusion likelihood scores to the human screened records, to identify potential errors made during the human screening process (error analysis). RESULTS: ML approaches reached 98.7% sensitivity based on learning from a training set of 5749 records, with an inclusion prevalence of 13.2%. The highest level of specificity reached was 86%. Performance was assessed on an independent validation dataset. Human errors in the training and validation sets were successfully identified using the assigned inclusion likelihood from the ML model to highlight discrepancies. Training the ML algorithm on the corrected dataset improved the specificity of the algorithm without compromising sensitivity. Error analysis correction leads to a 3% improvement in sensitivity and specificity, which increases precision and accuracy of the ML algorithm. CONCLUSIONS: This work has confirmed the performance and application of ML algorithms for screening in systematic reviews of preclinical animal studies. It has highlighted the novel use of ML algorithms to identify human error. This needs to be confirmed in other reviews with different inclusion prevalence levels, but represents a promising approach to integrating human decisions and automation in systematic review methodology. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13643-019-0942-7) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6334440
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-6334440 2019-01-23 Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error Bannach-Brown, Alexandra; Przybyła, Piotr; Thomas, James; Rice, Andrew S. C.; Ananiadou, Sophia; Liao, Jing; Macleod, Malcolm Robert Syst Rev Methodology BACKGROUND: Here, we outline a method of applying existing machine learning (ML) approaches to aid citation screening in an on-going broad and shallow systematic review of preclinical animal studies. The aim is to achieve a high-performing algorithm comparable to human screening that can reduce human resources required for carrying out this step of a systematic review. METHODS: We applied ML approaches to a broad systematic review of animal models of depression at the citation screening stage. We tested two independently developed ML approaches which used different classification models and feature sets. We recorded the performance of the ML approaches on an unseen validation set of papers using sensitivity, specificity and accuracy. We aimed to achieve 95% sensitivity and to maximise specificity. The classification model providing the most accurate predictions was applied to the remaining unseen records in the dataset and will be used in the next stage of the preclinical biomedical sciences systematic review. We used a cross-validation technique to assign ML inclusion likelihood scores to the human screened records, to identify potential errors made during the human screening process (error analysis). RESULTS: ML approaches reached 98.7% sensitivity based on learning from a training set of 5749 records, with an inclusion prevalence of 13.2%. The highest level of specificity reached was 86%. Performance was assessed on an independent validation dataset. Human errors in the training and validation sets were successfully identified using the assigned inclusion likelihood from the ML model to highlight discrepancies. Training the ML algorithm on the corrected dataset improved the specificity of the algorithm without compromising sensitivity. Error analysis correction leads to a 3% improvement in sensitivity and specificity, which increases precision and accuracy of the ML algorithm. CONCLUSIONS: This work has confirmed the performance and application of ML algorithms for screening in systematic reviews of preclinical animal studies. It has highlighted the novel use of ML algorithms to identify human error. This needs to be confirmed in other reviews with different inclusion prevalence levels, but represents a promising approach to integrating human decisions and automation in systematic review methodology. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13643-019-0942-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-15 /pmc/articles/PMC6334440/ /pubmed/30646959 http://dx.doi.org/10.1186/s13643-019-0942-7 Text en © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Methodology Bannach-Brown, Alexandra Przybyła, Piotr Thomas, James Rice, Andrew S. C. Ananiadou, Sophia Liao, Jing Macleod, Malcolm Robert Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error
title	Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error
title_full	Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error
title_fullStr	Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error
title_full_unstemmed	Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error
title_short	Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error
title_sort	machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error
topic	Methodology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6334440/ https://www.ncbi.nlm.nih.gov/pubmed/30646959 http://dx.doi.org/10.1186/s13643-019-0942-7
work_keys_str_mv	AT bannachbrownalexandra machinelearningalgorithmsforsystematicreviewreducingworkloadinapreclinicalreviewofanimalstudiesandreducinghumanscreeningerror AT przybyłapiotr machinelearningalgorithmsforsystematicreviewreducingworkloadinapreclinicalreviewofanimalstudiesandreducinghumanscreeningerror AT thomasjames machinelearningalgorithmsforsystematicreviewreducingworkloadinapreclinicalreviewofanimalstudiesandreducinghumanscreeningerror AT riceandrewsc machinelearningalgorithmsforsystematicreviewreducingworkloadinapreclinicalreviewofanimalstudiesandreducinghumanscreeningerror AT ananiadousophia machinelearningalgorithmsforsystematicreviewreducingworkloadinapreclinicalreviewofanimalstudiesandreducinghumanscreeningerror AT liaojing machinelearningalgorithmsforsystematicreviewreducingworkloadinapreclinicalreviewofanimalstudiesandreducinghumanscreeningerror AT macleodmalcolmrobert machinelearningalgorithmsforsystematicreviewreducingworkloadinapreclinicalreviewofanimalstudiesandreducinghumanscreeningerror
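
The METHODS text in the description field above reports sensitivity, specificity and accuracy on a validation set, with a target of 95% sensitivity and specificity maximised. As a rough illustration of what that involves, the Python sketch below computes the three measures from human include/exclude labels and ML inclusion scores, then picks the highest score cut-off that still meets a sensitivity target. It is not the authors' code: the function names, the threshold-selection rule and the toy data are all assumptions made for illustration.

```python
from typing import List, Tuple


def screening_metrics(labels: List[int], scores: List[float],
                      threshold: float) -> Tuple[float, float, float]:
    """Return (sensitivity, specificity, accuracy) when every record
    scoring at or above `threshold` is predicted as an include (1)."""
    tp = fp = tn = fn = 0
    for label, score in zip(labels, scores):
        predicted = 1 if score >= threshold else 0
        if label == 1 and predicted == 1:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted == 1:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / len(labels) if labels else 0.0
    return sensitivity, specificity, accuracy


def pick_threshold(labels: List[int], scores: List[float],
                   target_sensitivity: float = 0.95) -> float:
    """Hypothetical helper: scan cut-offs from high to low and return the
    first one whose sensitivity meets the target. Lowering the cut-off can
    only lower specificity, so the first hit has the best specificity."""
    for threshold in sorted(set(scores), reverse=True):
        if screening_metrics(labels, scores, threshold)[0] >= target_sensitivity:
            return threshold
    return min(scores)


# Made-up toy data (1 = included by human screeners, 0 = excluded).
labels = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2, 0.1, 0.6]

cutoff = pick_threshold(labels, scores)
print(cutoff, screening_metrics(labels, scores, cutoff))
```

On the toy data the cut-off falls to 0.4 so that all four included records are caught, at the cost of one false positive (the excluded record scored 0.5). Records where the human label and the score disagree strongly are the kind of discrepancy the description's error analysis would send back for re-checking.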
