Cargando…

Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study

BACKGROUND: Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of...

Descripción completa

Detalles Bibliográficos
Autores principales: Zimmerman, John, Soler, Robin E., Lavinder, James, Murphy, Sarah, Atkins, Charisma, Hulbert, LaShonda, Lusk, Richard, Ng, Boon Peng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8017891/
https://www.ncbi.nlm.nih.gov/pubmed/33810798
http://dx.doi.org/10.1186/s13643-021-01640-6
_version_ 1783674137289424896
author Zimmerman, John
Soler, Robin E.
Lavinder, James
Murphy, Sarah
Atkins, Charisma
Hulbert, LaShonda
Lusk, Richard
Ng, Boon Peng
author_facet Zimmerman, John
Soler, Robin E.
Lavinder, James
Murphy, Sarah
Atkins, Charisma
Hulbert, LaShonda
Lusk, Richard
Ng, Boon Peng
author_sort Zimmerman, John
collection PubMed
description BACKGROUND: Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of research and evaluation has expanded and the process of identifying key insights becomes more time consuming. Text analytics and machine learning (ML) techniques may help overcome this problem of scale while still maintaining the level of rigor expected of SRs. METHODS: In this article, we discuss an approach that uses existing examples of SRs to build and test a method for assisting the SR title and abstract pre-screening by reducing the initial pool of potential articles down to articles that meet inclusion criteria. Our approach differs from previous approaches to using ML as a SR tool in that it incorporates ML configurations guided by previously conducted SRs, and human confirmation on ML predictions of relevant articles during multiple iterative reviews on smaller tranches of citations. We applied the tailored method to a new SR review effort to validate performance. RESULTS: The case study test of the approach proved a sensitivity (recall) in finding relevant articles during down selection that may rival many traditional processes and show ability to overcome most type II errors. The study achieved a sensitivity of 99.5% (213 out of 214) of total relevant articles while only conducting a human review of 31% of total articles available for review. CONCLUSIONS: We believe this iterative method can help overcome bias in initial ML model training by having humans reinforce ML models with new and relevant information, and is an applied step towards transfer learning for ML in SR. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13643-021-01640-6.
format Online
Article
Text
id pubmed-8017891
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80178912021-04-05 Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study Zimmerman, John Soler, Robin E. Lavinder, James Murphy, Sarah Atkins, Charisma Hulbert, LaShonda Lusk, Richard Ng, Boon Peng Syst Rev Methodology BACKGROUND: Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of research and evaluation has expanded and the process of identifying key insights becomes more time consuming. Text analytics and machine learning (ML) techniques may help overcome this problem of scale while still maintaining the level of rigor expected of SRs. METHODS: In this article, we discuss an approach that uses existing examples of SRs to build and test a method for assisting the SR title and abstract pre-screening by reducing the initial pool of potential articles down to articles that meet inclusion criteria. Our approach differs from previous approaches to using ML as a SR tool in that it incorporates ML configurations guided by previously conducted SRs, and human confirmation on ML predictions of relevant articles during multiple iterative reviews on smaller tranches of citations. We applied the tailored method to a new SR review effort to validate performance. RESULTS: The case study test of the approach proved a sensitivity (recall) in finding relevant articles during down selection that may rival many traditional processes and show ability to overcome most type II errors. The study achieved a sensitivity of 99.5% (213 out of 214) of total relevant articles while only conducting a human review of 31% of total articles available for review. CONCLUSIONS: We believe this iterative method can help overcome bias in initial ML model training by having humans reinforce ML models with new and relevant information, and is an applied step towards transfer learning for ML in SR. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13643-021-01640-6. BioMed Central 2021-04-02 /pmc/articles/PMC8017891/ /pubmed/33810798 http://dx.doi.org/10.1186/s13643-021-01640-6 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Zimmerman, John
Soler, Robin E.
Lavinder, James
Murphy, Sarah
Atkins, Charisma
Hulbert, LaShonda
Lusk, Richard
Ng, Boon Peng
Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
title Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
title_full Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
title_fullStr Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
title_full_unstemmed Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
title_short Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
title_sort iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8017891/
https://www.ncbi.nlm.nih.gov/pubmed/33810798
http://dx.doi.org/10.1186/s13643-021-01640-6
work_keys_str_mv AT zimmermanjohn iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy
AT solerrobine iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy
AT lavinderjames iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy
AT murphysarah iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy
AT atkinscharisma iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy
AT hulbertlashonda iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy
AT luskrichard iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy
AT ngboonpeng iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy