Cargando…

Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature

BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging...

Descripción completa

Detalles Bibliográficos
Autores principales: Knafou, Julien, Haas, Quentin, Borissov, Nikolay, Counotte, Michel, Low, Nicola, Imeri, Hira, Ipekci, Aziz Mert, Buitrago-Garcia, Diana, Heron, Leonie, Amini, Poorya, Teodoro, Douglas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240481/
https://www.ncbi.nlm.nih.gov/pubmed/37277872
http://dx.doi.org/10.1186/s13643-023-02247-9
_version_ 1785053765894668288
author Knafou, Julien
Haas, Quentin
Borissov, Nikolay
Counotte, Michel
Low, Nicola
Imeri, Hira
Ipekci, Aziz Mert
Buitrago-Garcia, Diana
Heron, Leonie
Amini, Poorya
Teodoro, Douglas
author_facet Knafou, Julien
Haas, Quentin
Borissov, Nikolay
Counotte, Michel
Low, Nicola
Imeri, Hira
Ipekci, Aziz Mert
Buitrago-Garcia, Diana
Heron, Leonie
Amini, Poorya
Teodoro, Douglas
author_sort Knafou, Julien
collection PubMed
description BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging for systematic reviewers to keep up with the evidence in electronic databases. We aimed to investigate deep learning-based machine learning algorithms to classify COVID-19-related publications to help scale up the epidemiological curation process. METHODS: In this retrospective study, five different pre-trained deep learning-based language models were fine-tuned on a dataset of 6365 publications manually classified into two classes, three subclasses, and 22 sub-subclasses relevant for epidemiological triage purposes. In a k-fold cross-validation setting, each standalone model was assessed on a classification task and compared against an ensemble, which takes the standalone model predictions as input and uses different strategies to infer the optimal article class. A ranking task was also considered, in which the model outputs a ranked list of sub-subclasses associated with the article. RESULTS: The ensemble model significantly outperformed the standalone classifiers, achieving a F1-score of 89.2 at the class level of the classification task. The difference between the standalone and ensemble models increases at the sub-subclass level, where the ensemble reaches a micro F1-score of 70% against 67% for the best-performing standalone model. For the ranking task, the ensemble obtained the highest recall@3, with a performance of 89%. Using an unanimity voting rule, the ensemble can provide predictions with higher confidence on a subset of the data, achieving detection of original papers with a F1-score up to 97% on a subset of 80% of the collection instead of 93% on the whole dataset. CONCLUSION: This study shows the potential of using deep learning language models to perform triage of COVID-19 references efficiently and support epidemiological curation and review. The ensemble consistently and significantly outperforms any standalone model. Fine-tuning the voting strategy thresholds is an interesting alternative to annotate a subset with higher predictive confidence.
format Online
Article
Text
id pubmed-10240481
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102404812023-06-06 Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas Syst Rev Research BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging for systematic reviewers to keep up with the evidence in electronic databases. We aimed to investigate deep learning-based machine learning algorithms to classify COVID-19-related publications to help scale up the epidemiological curation process. METHODS: In this retrospective study, five different pre-trained deep learning-based language models were fine-tuned on a dataset of 6365 publications manually classified into two classes, three subclasses, and 22 sub-subclasses relevant for epidemiological triage purposes. In a k-fold cross-validation setting, each standalone model was assessed on a classification task and compared against an ensemble, which takes the standalone model predictions as input and uses different strategies to infer the optimal article class. A ranking task was also considered, in which the model outputs a ranked list of sub-subclasses associated with the article. RESULTS: The ensemble model significantly outperformed the standalone classifiers, achieving a F1-score of 89.2 at the class level of the classification task. The difference between the standalone and ensemble models increases at the sub-subclass level, where the ensemble reaches a micro F1-score of 70% against 67% for the best-performing standalone model. For the ranking task, the ensemble obtained the highest recall@3, with a performance of 89%. Using an unanimity voting rule, the ensemble can provide predictions with higher confidence on a subset of the data, achieving detection of original papers with a F1-score up to 97% on a subset of 80% of the collection instead of 93% on the whole dataset. CONCLUSION: This study shows the potential of using deep learning language models to perform triage of COVID-19 references efficiently and support epidemiological curation and review. The ensemble consistently and significantly outperforms any standalone model. Fine-tuning the voting strategy thresholds is an interesting alternative to annotate a subset with higher predictive confidence. BioMed Central 2023-06-05 /pmc/articles/PMC10240481/ /pubmed/37277872 http://dx.doi.org/10.1186/s13643-023-02247-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Knafou, Julien
Haas, Quentin
Borissov, Nikolay
Counotte, Michel
Low, Nicola
Imeri, Hira
Ipekci, Aziz Mert
Buitrago-Garcia, Diana
Heron, Leonie
Amini, Poorya
Teodoro, Douglas
Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_full Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_fullStr Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_full_unstemmed Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_short Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_sort ensemble of deep learning language models to support the creation of living systematic reviews for the covid-19 literature
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240481/
https://www.ncbi.nlm.nih.gov/pubmed/37277872
http://dx.doi.org/10.1186/s13643-023-02247-9
work_keys_str_mv AT knafoujulien ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT haasquentin ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT borissovnikolay ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT counottemichel ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT lownicola ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT imerihira ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT ipekciazizmert ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT buitragogarciadiana ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT heronleonie ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT aminipoorya ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature
AT teodorodouglas ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature