Cargando…
Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240481/ https://www.ncbi.nlm.nih.gov/pubmed/37277872 http://dx.doi.org/10.1186/s13643-023-02247-9 |
_version_ | 1785053765894668288 |
---|---|
author | Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas |
author_facet | Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas |
author_sort | Knafou, Julien |
collection | PubMed |
description | BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging for systematic reviewers to keep up with the evidence in electronic databases. We aimed to investigate deep learning-based machine learning algorithms to classify COVID-19-related publications to help scale up the epidemiological curation process. METHODS: In this retrospective study, five different pre-trained deep learning-based language models were fine-tuned on a dataset of 6365 publications manually classified into two classes, three subclasses, and 22 sub-subclasses relevant for epidemiological triage purposes. In a k-fold cross-validation setting, each standalone model was assessed on a classification task and compared against an ensemble, which takes the standalone model predictions as input and uses different strategies to infer the optimal article class. A ranking task was also considered, in which the model outputs a ranked list of sub-subclasses associated with the article. RESULTS: The ensemble model significantly outperformed the standalone classifiers, achieving a F1-score of 89.2 at the class level of the classification task. The difference between the standalone and ensemble models increases at the sub-subclass level, where the ensemble reaches a micro F1-score of 70% against 67% for the best-performing standalone model. For the ranking task, the ensemble obtained the highest recall@3, with a performance of 89%. Using an unanimity voting rule, the ensemble can provide predictions with higher confidence on a subset of the data, achieving detection of original papers with a F1-score up to 97% on a subset of 80% of the collection instead of 93% on the whole dataset. CONCLUSION: This study shows the potential of using deep learning language models to perform triage of COVID-19 references efficiently and support epidemiological curation and review. The ensemble consistently and significantly outperforms any standalone model. Fine-tuning the voting strategy thresholds is an interesting alternative to annotate a subset with higher predictive confidence. |
format | Online Article Text |
id | pubmed-10240481 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-102404812023-06-06 Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas Syst Rev Research BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging for systematic reviewers to keep up with the evidence in electronic databases. We aimed to investigate deep learning-based machine learning algorithms to classify COVID-19-related publications to help scale up the epidemiological curation process. METHODS: In this retrospective study, five different pre-trained deep learning-based language models were fine-tuned on a dataset of 6365 publications manually classified into two classes, three subclasses, and 22 sub-subclasses relevant for epidemiological triage purposes. In a k-fold cross-validation setting, each standalone model was assessed on a classification task and compared against an ensemble, which takes the standalone model predictions as input and uses different strategies to infer the optimal article class. A ranking task was also considered, in which the model outputs a ranked list of sub-subclasses associated with the article. RESULTS: The ensemble model significantly outperformed the standalone classifiers, achieving a F1-score of 89.2 at the class level of the classification task. The difference between the standalone and ensemble models increases at the sub-subclass level, where the ensemble reaches a micro F1-score of 70% against 67% for the best-performing standalone model. For the ranking task, the ensemble obtained the highest recall@3, with a performance of 89%. Using an unanimity voting rule, the ensemble can provide predictions with higher confidence on a subset of the data, achieving detection of original papers with a F1-score up to 97% on a subset of 80% of the collection instead of 93% on the whole dataset. CONCLUSION: This study shows the potential of using deep learning language models to perform triage of COVID-19 references efficiently and support epidemiological curation and review. The ensemble consistently and significantly outperforms any standalone model. Fine-tuning the voting strategy thresholds is an interesting alternative to annotate a subset with higher predictive confidence. BioMed Central 2023-06-05 /pmc/articles/PMC10240481/ /pubmed/37277872 http://dx.doi.org/10.1186/s13643-023-02247-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature |
title | Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature |
title_full | Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature |
title_fullStr | Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature |
title_full_unstemmed | Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature |
title_short | Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature |
title_sort | ensemble of deep learning language models to support the creation of living systematic reviews for the covid-19 literature |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240481/ https://www.ncbi.nlm.nih.gov/pubmed/37277872 http://dx.doi.org/10.1186/s13643-023-02247-9 |
work_keys_str_mv | AT knafoujulien ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT haasquentin ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT borissovnikolay ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT counottemichel ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT lownicola ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT imerihira ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT ipekciazizmert ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT buitragogarciadiana ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT heronleonie ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT aminipoorya ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT teodorodouglas ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature |