Cargando…
Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study
BACKGROUND: Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8017891/ https://www.ncbi.nlm.nih.gov/pubmed/33810798 http://dx.doi.org/10.1186/s13643-021-01640-6 |
_version_ | 1783674137289424896 |
---|---|
author | Zimmerman, John Soler, Robin E. Lavinder, James Murphy, Sarah Atkins, Charisma Hulbert, LaShonda Lusk, Richard Ng, Boon Peng |
author_facet | Zimmerman, John Soler, Robin E. Lavinder, James Murphy, Sarah Atkins, Charisma Hulbert, LaShonda Lusk, Richard Ng, Boon Peng |
author_sort | Zimmerman, John |
collection | PubMed |
description | BACKGROUND: Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of research and evaluation has expanded and the process of identifying key insights becomes more time consuming. Text analytics and machine learning (ML) techniques may help overcome this problem of scale while still maintaining the level of rigor expected of SRs. METHODS: In this article, we discuss an approach that uses existing examples of SRs to build and test a method for assisting the SR title and abstract pre-screening by reducing the initial pool of potential articles down to articles that meet inclusion criteria. Our approach differs from previous approaches to using ML as a SR tool in that it incorporates ML configurations guided by previously conducted SRs, and human confirmation on ML predictions of relevant articles during multiple iterative reviews on smaller tranches of citations. We applied the tailored method to a new SR review effort to validate performance. RESULTS: The case study test of the approach proved a sensitivity (recall) in finding relevant articles during down selection that may rival many traditional processes and show ability to overcome most type II errors. The study achieved a sensitivity of 99.5% (213 out of 214) of total relevant articles while only conducting a human review of 31% of total articles available for review. CONCLUSIONS: We believe this iterative method can help overcome bias in initial ML model training by having humans reinforce ML models with new and relevant information, and is an applied step towards transfer learning for ML in SR. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13643-021-01640-6. |
format | Online Article Text |
id | pubmed-8017891 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80178912021-04-05 Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study Zimmerman, John Soler, Robin E. Lavinder, James Murphy, Sarah Atkins, Charisma Hulbert, LaShonda Lusk, Richard Ng, Boon Peng Syst Rev Methodology BACKGROUND: Systematic Reviews (SR), studies of studies, use a formal process to evaluate the quality of scientific literature and determine ensuing effectiveness from qualifying articles to establish consensus findings around a hypothesis. Their value is increasing as the conduct and publication of research and evaluation has expanded and the process of identifying key insights becomes more time consuming. Text analytics and machine learning (ML) techniques may help overcome this problem of scale while still maintaining the level of rigor expected of SRs. METHODS: In this article, we discuss an approach that uses existing examples of SRs to build and test a method for assisting the SR title and abstract pre-screening by reducing the initial pool of potential articles down to articles that meet inclusion criteria. Our approach differs from previous approaches to using ML as a SR tool in that it incorporates ML configurations guided by previously conducted SRs, and human confirmation on ML predictions of relevant articles during multiple iterative reviews on smaller tranches of citations. We applied the tailored method to a new SR review effort to validate performance. RESULTS: The case study test of the approach proved a sensitivity (recall) in finding relevant articles during down selection that may rival many traditional processes and show ability to overcome most type II errors. The study achieved a sensitivity of 99.5% (213 out of 214) of total relevant articles while only conducting a human review of 31% of total articles available for review. CONCLUSIONS: We believe this iterative method can help overcome bias in initial ML model training by having humans reinforce ML models with new and relevant information, and is an applied step towards transfer learning for ML in SR. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13643-021-01640-6. BioMed Central 2021-04-02 /pmc/articles/PMC8017891/ /pubmed/33810798 http://dx.doi.org/10.1186/s13643-021-01640-6 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Zimmerman, John Soler, Robin E. Lavinder, James Murphy, Sarah Atkins, Charisma Hulbert, LaShonda Lusk, Richard Ng, Boon Peng Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study |
title | Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study |
title_full | Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study |
title_fullStr | Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study |
title_full_unstemmed | Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study |
title_short | Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study |
title_sort | iterative guided machine learning-assisted systematic literature reviews: a diabetes case study |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8017891/ https://www.ncbi.nlm.nih.gov/pubmed/33810798 http://dx.doi.org/10.1186/s13643-021-01640-6 |
work_keys_str_mv | AT zimmermanjohn iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy AT solerrobine iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy AT lavinderjames iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy AT murphysarah iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy AT atkinscharisma iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy AT hulbertlashonda iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy AT luskrichard iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy AT ngboonpeng iterativeguidedmachinelearningassistedsystematicliteraturereviewsadiabetescasestudy |