Cargando…

Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature

BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging...

Descripción completa

Detalles Bibliográficos
Autores principales:	Knafou, Julien, Haas, Quentin, Borissov, Nikolay, Counotte, Michel, Low, Nicola, Imeri, Hira, Ipekci, Aziz Mert, Buitrago-Garcia, Diana, Heron, Leonie, Amini, Poorya, Teodoro, Douglas
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2023
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240481/ https://www.ncbi.nlm.nih.gov/pubmed/37277872 http://dx.doi.org/10.1186/s13643-023-02247-9

_version_	1785053765894668288
author	Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas
author_facet	Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas
author_sort	Knafou, Julien
collection	PubMed
description	BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging for systematic reviewers to keep up with the evidence in electronic databases. We aimed to investigate deep learning-based machine learning algorithms to classify COVID-19-related publications to help scale up the epidemiological curation process. METHODS: In this retrospective study, five different pre-trained deep learning-based language models were fine-tuned on a dataset of 6365 publications manually classified into two classes, three subclasses, and 22 sub-subclasses relevant for epidemiological triage purposes. In a k-fold cross-validation setting, each standalone model was assessed on a classification task and compared against an ensemble, which takes the standalone model predictions as input and uses different strategies to infer the optimal article class. A ranking task was also considered, in which the model outputs a ranked list of sub-subclasses associated with the article. RESULTS: The ensemble model significantly outperformed the standalone classifiers, achieving a F1-score of 89.2 at the class level of the classification task. The difference between the standalone and ensemble models increases at the sub-subclass level, where the ensemble reaches a micro F1-score of 70% against 67% for the best-performing standalone model. For the ranking task, the ensemble obtained the highest recall@3, with a performance of 89%. Using an unanimity voting rule, the ensemble can provide predictions with higher confidence on a subset of the data, achieving detection of original papers with a F1-score up to 97% on a subset of 80% of the collection instead of 93% on the whole dataset. CONCLUSION: This study shows the potential of using deep learning language models to perform triage of COVID-19 references efficiently and support epidemiological curation and review. The ensemble consistently and significantly outperforms any standalone model. Fine-tuning the voting strategy thresholds is an interesting alternative to annotate a subset with higher predictive confidence.
format	Online Article Text
id	pubmed-10240481
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-102404812023-06-06 Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas Syst Rev Research BACKGROUND: The COVID-19 pandemic has led to an unprecedented amount of scientific publications, growing at a pace never seen before. Multiple living systematic reviews have been developed to assist professionals with up-to-date and trustworthy health information, but it is increasingly challenging for systematic reviewers to keep up with the evidence in electronic databases. We aimed to investigate deep learning-based machine learning algorithms to classify COVID-19-related publications to help scale up the epidemiological curation process. METHODS: In this retrospective study, five different pre-trained deep learning-based language models were fine-tuned on a dataset of 6365 publications manually classified into two classes, three subclasses, and 22 sub-subclasses relevant for epidemiological triage purposes. In a k-fold cross-validation setting, each standalone model was assessed on a classification task and compared against an ensemble, which takes the standalone model predictions as input and uses different strategies to infer the optimal article class. A ranking task was also considered, in which the model outputs a ranked list of sub-subclasses associated with the article. RESULTS: The ensemble model significantly outperformed the standalone classifiers, achieving a F1-score of 89.2 at the class level of the classification task. The difference between the standalone and ensemble models increases at the sub-subclass level, where the ensemble reaches a micro F1-score of 70% against 67% for the best-performing standalone model. For the ranking task, the ensemble obtained the highest recall@3, with a performance of 89%. Using an unanimity voting rule, the ensemble can provide predictions with higher confidence on a subset of the data, achieving detection of original papers with a F1-score up to 97% on a subset of 80% of the collection instead of 93% on the whole dataset. CONCLUSION: This study shows the potential of using deep learning language models to perform triage of COVID-19 references efficiently and support epidemiological curation and review. The ensemble consistently and significantly outperforms any standalone model. Fine-tuning the voting strategy thresholds is an interesting alternative to annotate a subset with higher predictive confidence. BioMed Central 2023-06-05 /pmc/articles/PMC10240481/ /pubmed/37277872 http://dx.doi.org/10.1186/s13643-023-02247-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Knafou, Julien Haas, Quentin Borissov, Nikolay Counotte, Michel Low, Nicola Imeri, Hira Ipekci, Aziz Mert Buitrago-Garcia, Diana Heron, Leonie Amini, Poorya Teodoro, Douglas Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title	Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_full	Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_fullStr	Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_full_unstemmed	Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_short	Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature
title_sort	ensemble of deep learning language models to support the creation of living systematic reviews for the covid-19 literature
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240481/ https://www.ncbi.nlm.nih.gov/pubmed/37277872 http://dx.doi.org/10.1186/s13643-023-02247-9
work_keys_str_mv	AT knafoujulien ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT haasquentin ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT borissovnikolay ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT counottemichel ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT lownicola ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT imerihira ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT ipekciazizmert ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT buitragogarciadiana ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT heronleonie ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT aminipoorya ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature AT teodorodouglas ensembleofdeeplearninglanguagemodelstosupportthecreationoflivingsystematicreviewsforthecovid19literature

Ensemble of deep learning language models to support the creation of living systematic reviews for the COVID-19 literature

Ejemplares similares