Cargando…

Text mining approaches for dealing with the rapidly expanding literature on COVID-19

More than 50 000 papers have been published about COVID-19 since the beginning of 2020 and several hundred new papers continue to be published every day. This incredible rate of scientific productivity leads to information overload, making it difficult for researchers, clinicians and public health o...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Lucy Lu, Lo, Kyle
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2020
Materias:	Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7799291/ https://www.ncbi.nlm.nih.gov/pubmed/33279995 http://dx.doi.org/10.1093/bib/bbaa296

_version_	1783635113473474560
author	Wang, Lucy Lu Lo, Kyle
author_facet	Wang, Lucy Lu Lo, Kyle
author_sort	Wang, Lucy Lu
collection	PubMed
description	More than 50 000 papers have been published about COVID-19 since the beginning of 2020 and several hundred new papers continue to be published every day. This incredible rate of scientific productivity leads to information overload, making it difficult for researchers, clinicians and public health officials to keep up with the latest findings. Automated text mining techniques for searching, reading and summarizing papers are helpful for addressing information overload. In this review, we describe the many resources that have been introduced to support text mining applications over the COVID-19 literature; specifically, we discuss the corpora, modeling resources, systems and shared tasks that have been introduced for COVID-19. We compile a list of 39 systems that provide functionality such as search, discovery, visualization and summarization over the COVID-19 literature. For each system, we provide a qualitative description and assessment of the system’s performance, unique data or user interface features and modeling decisions. Many systems focus on search and discovery, though several systems provide novel features, such as the ability to summarize findings over multiple documents or linking between scientific articles and clinical trials. We also describe the public corpora, models and shared tasks that have been introduced to help reduce repeated effort among community members; some of these resources (especially shared tasks) can provide a basis for comparing the performance of different systems. Finally, we summarize promising results and open challenges for text mining the COVID-19 literature.
format	Online Article Text
id	pubmed-7799291
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-77992912021-01-25 Text mining approaches for dealing with the rapidly expanding literature on COVID-19 Wang, Lucy Lu Lo, Kyle Brief Bioinform Articles More than 50 000 papers have been published about COVID-19 since the beginning of 2020 and several hundred new papers continue to be published every day. This incredible rate of scientific productivity leads to information overload, making it difficult for researchers, clinicians and public health officials to keep up with the latest findings. Automated text mining techniques for searching, reading and summarizing papers are helpful for addressing information overload. In this review, we describe the many resources that have been introduced to support text mining applications over the COVID-19 literature; specifically, we discuss the corpora, modeling resources, systems and shared tasks that have been introduced for COVID-19. We compile a list of 39 systems that provide functionality such as search, discovery, visualization and summarization over the COVID-19 literature. For each system, we provide a qualitative description and assessment of the system’s performance, unique data or user interface features and modeling decisions. Many systems focus on search and discovery, though several systems provide novel features, such as the ability to summarize findings over multiple documents or linking between scientific articles and clinical trials. We also describe the public corpora, models and shared tasks that have been introduced to help reduce repeated effort among community members; some of these resources (especially shared tasks) can provide a basis for comparing the performance of different systems. Finally, we summarize promising results and open challenges for text mining the COVID-19 literature. Oxford University Press 2020-12-07 /pmc/articles/PMC7799291/ /pubmed/33279995 http://dx.doi.org/10.1093/bib/bbaa296 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Articles Wang, Lucy Lu Lo, Kyle Text mining approaches for dealing with the rapidly expanding literature on COVID-19
title	Text mining approaches for dealing with the rapidly expanding literature on COVID-19
title_full	Text mining approaches for dealing with the rapidly expanding literature on COVID-19
title_fullStr	Text mining approaches for dealing with the rapidly expanding literature on COVID-19
title_full_unstemmed	Text mining approaches for dealing with the rapidly expanding literature on COVID-19
title_short	Text mining approaches for dealing with the rapidly expanding literature on COVID-19
title_sort	text mining approaches for dealing with the rapidly expanding literature on covid-19
topic	Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7799291/ https://www.ncbi.nlm.nih.gov/pubmed/33279995 http://dx.doi.org/10.1093/bib/bbaa296
work_keys_str_mv	AT wanglucylu textminingapproachesfordealingwiththerapidlyexpandingliteratureoncovid19 AT lokyle textminingapproachesfordealingwiththerapidlyexpandingliteratureoncovid19

Text mining approaches for dealing with the rapidly expanding literature on COVID-19

Ejemplares similares