Cargando…

An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2

To build a full picture of previous studies on the origins of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), this paper exploits an active learning-based approach to screen scholarly articles about the origins of SARS-CoV-2 from many scientific publications. In more detail, six seed a...

Descripción completa

Detalles Bibliográficos
Autores principales: An, Xin, Zhang, Mengmeng, Xu, Shuo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9480989/
https://www.ncbi.nlm.nih.gov/pubmed/36112646
http://dx.doi.org/10.1371/journal.pone.0273725
_version_ 1784791161743867904
author An, Xin
Zhang, Mengmeng
Xu, Shuo
author_facet An, Xin
Zhang, Mengmeng
Xu, Shuo
author_sort An, Xin
collection PubMed
description To build a full picture of previous studies on the origins of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), this paper exploits an active learning-based approach to screen scholarly articles about the origins of SARS-CoV-2 from many scientific publications. In more detail, six seed articles were utilized to manually curate 170 relevant articles and 300 nonrelevant articles. Then, an active learning-based approach with three query strategies and three base classifiers is trained to screen the articles about the origins of SARS-CoV-2. Extensive experimental results show that our active learning-based approach outperforms traditional counterparts, and the uncertain sampling query strategy performs best among the three strategies. By manually checking the top 1,000 articles of each base classifier, we ultimately screened 715 unique scholarly articles to create a publicly available peer-reviewed literature corpus, COVID-Origin. This indicates that our approach for screening articles about the origins of SARS-CoV-2 is feasible.
format Online
Article
Text
id pubmed-9480989
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-94809892022-09-17 An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2 An, Xin Zhang, Mengmeng Xu, Shuo PLoS One Research Article To build a full picture of previous studies on the origins of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), this paper exploits an active learning-based approach to screen scholarly articles about the origins of SARS-CoV-2 from many scientific publications. In more detail, six seed articles were utilized to manually curate 170 relevant articles and 300 nonrelevant articles. Then, an active learning-based approach with three query strategies and three base classifiers is trained to screen the articles about the origins of SARS-CoV-2. Extensive experimental results show that our active learning-based approach outperforms traditional counterparts, and the uncertain sampling query strategy performs best among the three strategies. By manually checking the top 1,000 articles of each base classifier, we ultimately screened 715 unique scholarly articles to create a publicly available peer-reviewed literature corpus, COVID-Origin. This indicates that our approach for screening articles about the origins of SARS-CoV-2 is feasible. Public Library of Science 2022-09-16 /pmc/articles/PMC9480989/ /pubmed/36112646 http://dx.doi.org/10.1371/journal.pone.0273725 Text en © 2022 An et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
An, Xin
Zhang, Mengmeng
Xu, Shuo
An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2
title An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2
title_full An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2
title_fullStr An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2
title_full_unstemmed An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2
title_short An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2
title_sort active learning-based approach for screening scholarly articles about the origins of sars-cov-2
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9480989/
https://www.ncbi.nlm.nih.gov/pubmed/36112646
http://dx.doi.org/10.1371/journal.pone.0273725
work_keys_str_mv AT anxin anactivelearningbasedapproachforscreeningscholarlyarticlesabouttheoriginsofsarscov2
AT zhangmengmeng anactivelearningbasedapproachforscreeningscholarlyarticlesabouttheoriginsofsarscov2
AT xushuo anactivelearningbasedapproachforscreeningscholarlyarticlesabouttheoriginsofsarscov2
AT anxin activelearningbasedapproachforscreeningscholarlyarticlesabouttheoriginsofsarscov2
AT zhangmengmeng activelearningbasedapproachforscreeningscholarlyarticlesabouttheoriginsofsarscov2
AT xushuo activelearningbasedapproachforscreeningscholarlyarticlesabouttheoriginsofsarscov2