
Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature

The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in late 2019 has led not only to the world-wide coronavirus disease 2019 (COVID-19) pandemic but also to a deluge of biomedical literature. Following the release of the COVID-19 open research dataset (CORD-19) comprisin...


Bibliographic Details
Main Authors: Reddy, Sandeep; Bhaskar, Ravi; Padmanabhan, Sandosh; Verspoor, Karin; Mamillapalli, Chaitanya; Lahoti, Rani; Makinen, Ville-Petteri; Pradhan, Smitan; Kushwah, Puru; Sinha, Saumya
Format: Online Article Text
Language: English
Published: The Authors. Published by Elsevier B.V. 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8050406/
https://www.ncbi.nlm.nih.gov/pubmed/34337589
http://dx.doi.org/10.1016/j.cmpbup.2021.100010
author Reddy, Sandeep
Bhaskar, Ravi
Padmanabhan, Sandosh
Verspoor, Karin
Mamillapalli, Chaitanya
Lahoti, Rani
Makinen, Ville-Petteri
Pradhan, Smitan
Kushwah, Puru
Sinha, Saumya
collection PubMed
description The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in late 2019 has led not only to the world-wide coronavirus disease 2019 (COVID-19) pandemic but also to a deluge of biomedical literature. Following the release of the COVID-19 open research dataset (CORD-19) comprising over 200,000 scholarly articles, we, a multi-disciplinary team of data scientists, clinicians, medical researchers and software engineers, developed an innovative natural language processing (NLP) platform that combines an advanced search engine with a biomedical named entity recognition extraction package. In particular, the platform was developed to extract information relating to clinical risk factors for COVID-19 by presenting the results in a cluster format to support knowledge discovery. Here we describe the principles behind the development, the model and the results we obtained.
format Online
Article
Text
id pubmed-8050406
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Authors. Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-8050406 2021-04-16 Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature Reddy, Sandeep Bhaskar, Ravi Padmanabhan, Sandosh Verspoor, Karin Mamillapalli, Chaitanya Lahoti, Rani Makinen, Ville-Petteri Pradhan, Smitan Kushwah, Puru Sinha, Saumya Comput Methods Programs Biomed Update Article The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in late 2019 has led not only to the world-wide coronavirus disease 2019 (COVID-19) pandemic but also to a deluge of biomedical literature. Following the release of the COVID-19 open research dataset (CORD-19) comprising over 200,000 scholarly articles, we, a multi-disciplinary team of data scientists, clinicians, medical researchers and software engineers, developed an innovative natural language processing (NLP) platform that combines an advanced search engine with a biomedical named entity recognition extraction package. In particular, the platform was developed to extract information relating to clinical risk factors for COVID-19 by presenting the results in a cluster format to support knowledge discovery. Here we describe the principles behind the development, the model and the results we obtained. The Authors. Published by Elsevier B.V. 2021 2021-04-16 /pmc/articles/PMC8050406/ /pubmed/34337589 http://dx.doi.org/10.1016/j.cmpbup.2021.100010 Text en © 2021 The Authors. Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website.
Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
title Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature
topic Article