An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2
To build a full picture of previous studies on the origins of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), this paper exploits an active learning-based approach to screen scholarly articles about the origins of SARS-CoV-2 from many scientific publications. In more detail, six seed a...
Main authors: | An, Xin; Zhang, Mengmeng; Xu, Shuo |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science, 2022 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9480989/ https://www.ncbi.nlm.nih.gov/pubmed/36112646 http://dx.doi.org/10.1371/journal.pone.0273725 |
_version_ | 1784791161743867904 |
---|---|
author | An, Xin Zhang, Mengmeng Xu, Shuo |
collection | PubMed |
description | To build a full picture of previous studies on the origins of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), this paper exploits an active learning-based approach to screen scholarly articles about the origins of SARS-CoV-2 from many scientific publications. In more detail, six seed articles were utilized to manually curate 170 relevant articles and 300 nonrelevant articles. Then, an active learning-based approach with three query strategies and three base classifiers is trained to screen the articles about the origins of SARS-CoV-2. Extensive experimental results show that our active learning-based approach outperforms traditional counterparts, and the uncertain sampling query strategy performs best among the three strategies. By manually checking the top 1,000 articles of each base classifier, we ultimately screened 715 unique scholarly articles to create a publicly available peer-reviewed literature corpus, COVID-Origin. This indicates that our approach for screening articles about the origins of SARS-CoV-2 is feasible. |
format | Online Article Text |
id | pubmed-9480989 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-9480989 2022-09-17 An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2 An, Xin Zhang, Mengmeng Xu, Shuo PLoS One Research Article To build a full picture of previous studies on the origins of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), this paper exploits an active learning-based approach to screen scholarly articles about the origins of SARS-CoV-2 from many scientific publications. In more detail, six seed articles were utilized to manually curate 170 relevant articles and 300 nonrelevant articles. Then, an active learning-based approach with three query strategies and three base classifiers is trained to screen the articles about the origins of SARS-CoV-2. Extensive experimental results show that our active learning-based approach outperforms traditional counterparts, and the uncertain sampling query strategy performs best among the three strategies. By manually checking the top 1,000 articles of each base classifier, we ultimately screened 715 unique scholarly articles to create a publicly available peer-reviewed literature corpus, COVID-Origin. This indicates that our approach for screening articles about the origins of SARS-CoV-2 is feasible. Public Library of Science 2022-09-16 /pmc/articles/PMC9480989/ /pubmed/36112646 http://dx.doi.org/10.1371/journal.pone.0273725 Text en © 2022 An et al https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
title | An active learning-based approach for screening scholarly articles about the origins of SARS-CoV-2 |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9480989/ https://www.ncbi.nlm.nih.gov/pubmed/36112646 http://dx.doi.org/10.1371/journal.pone.0273725 |
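
The description above mentions an active learning loop in which an uncertainty-based query strategy performed best. The record does not name the three base classifiers or the other two query strategies, so the following is only a minimal sketch of pool-based active learning with uncertainty sampling, not the authors' implementation. The logistic regression stand-in, the synthetic two-feature data, the `oracle` labelling rule, and all function names are assumptions made for illustration.

```python
import math
import random


def predict_proba(w, b, x):
    """Probability that x is relevant under a logistic model (w, b)."""
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))


def train_logreg(X, y, epochs=200, lr=0.5):
    """Fit a small logistic regression by batch gradient descent (assumed stand-in classifier)."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi
            grad_b += err
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def most_uncertain(w, b, pool):
    """Uncertainty sampling: index of the pool item whose probability is closest to 0.5."""
    return min(range(len(pool)), key=lambda i: abs(predict_proba(w, b, pool[i]) - 0.5))


def active_learning(seed_X, seed_y, pool_X, oracle, budget):
    """Repeatedly retrain, query the most uncertain unlabeled item, and add its label."""
    X, y = list(seed_X), list(seed_y)
    pool = list(pool_X)  # copy so the caller's pool is left untouched
    for _ in range(budget):
        if not pool:
            break
        w, b = train_logreg(X, y)
        x = pool.pop(most_uncertain(w, b, pool))
        X.append(x)
        y.append(oracle(x))  # in practice, a human annotator supplies this label
    return train_logreg(X, y), X, y


def demo():
    # Synthetic data: an item counts as "relevant" when its two features sum above 1.
    random.seed(0)
    pool = [[random.random(), random.random()] for _ in range(200)]
    oracle = lambda x: int(x[0] + x[1] > 1.0)
    (w, b), X, y = active_learning([[0.9, 0.9], [0.1, 0.1]], [1, 0], pool, oracle, budget=20)
    correct = sum(int(predict_proba(w, b, x) > 0.5) == oracle(x) for x in pool)
    print(f"labeled {len(y)} items; pool accuracy {correct / len(pool):.2f}")


if __name__ == "__main__":
    demo()
```

In a real screening task the feature vectors would come from the article text and the oracle would be a human reviewer; the point of the query step is that labelling effort goes to the items the current model is least sure about.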