Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature
The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) late last year has not only led to the world-wide coronavirus disease 2019 (COVID-19) pandemic but also a deluge of biomedical literature. Following the release of the COVID-19 open research dataset (CORD-19) comprisin...
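Main authors: | Reddy, Sandeep; Bhaskar, Ravi; Padmanabhan, Sandosh; Verspoor, Karin; Mamillapalli, Chaitanya; Lahoti, Rani; Makinen, Ville-Petteri; Pradhan, Smitan; Kushwah, Puru; Sinha, Saumya |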
---|---|
Format: | Online Article Text |
Language: | English |
Published: | The Authors. Published by Elsevier B.V. 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8050406/ https://www.ncbi.nlm.nih.gov/pubmed/34337589 http://dx.doi.org/10.1016/j.cmpbup.2021.100010 |
_version_ | 1783679596546228224 |
---|---|
author | Reddy, Sandeep Bhaskar, Ravi Padmanabhan, Sandosh Verspoor, Karin Mamillapalli, Chaitanya Lahoti, Rani Makinen, Ville-Petteri Pradhan, Smitan Kushwah, Puru Sinha, Saumya |
author_facet | Reddy, Sandeep Bhaskar, Ravi Padmanabhan, Sandosh Verspoor, Karin Mamillapalli, Chaitanya Lahoti, Rani Makinen, Ville-Petteri Pradhan, Smitan Kushwah, Puru Sinha, Saumya |
author_sort | Reddy, Sandeep |
collection | PubMed |
description | The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) late last year has led not only to the worldwide coronavirus disease 2019 (COVID-19) pandemic but also to a deluge of biomedical literature. Following the release of the COVID-19 open research dataset (CORD-19) comprising over 200,000 scholarly articles, we, a multi-disciplinary team of data scientists, clinicians, medical researchers and software engineers, developed an innovative natural language processing (NLP) platform that combines an advanced search engine with a biomedical named entity recognition extraction package. In particular, the platform was developed to extract information relating to clinical risk factors for COVID-19 by presenting the results in a cluster format to support knowledge discovery. Here we describe the principles behind the development, the model and the results we obtained. |
format | Online Article Text |
id | pubmed-8050406 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | The Authors. Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80504062021-04-16 Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature Reddy, Sandeep Bhaskar, Ravi Padmanabhan, Sandosh Verspoor, Karin Mamillapalli, Chaitanya Lahoti, Rani Makinen, Ville-Petteri Pradhan, Smitan Kushwah, Puru Sinha, Saumya Comput Methods Programs Biomed Update Article The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) late last year has led not only to the worldwide coronavirus disease 2019 (COVID-19) pandemic but also to a deluge of biomedical literature. Following the release of the COVID-19 open research dataset (CORD-19) comprising over 200,000 scholarly articles, we, a multi-disciplinary team of data scientists, clinicians, medical researchers and software engineers, developed an innovative natural language processing (NLP) platform that combines an advanced search engine with a biomedical named entity recognition extraction package. In particular, the platform was developed to extract information relating to clinical risk factors for COVID-19 by presenting the results in a cluster format to support knowledge discovery. Here we describe the principles behind the development, the model and the results we obtained. The Authors. Published by Elsevier B.V. 2021 2021-04-16 /pmc/articles/PMC8050406/ /pubmed/34337589 http://dx.doi.org/10.1016/j.cmpbup.2021.100010 Text en © 2021 The Authors. Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. 
Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Reddy, Sandeep Bhaskar, Ravi Padmanabhan, Sandosh Verspoor, Karin Mamillapalli, Chaitanya Lahoti, Rani Makinen, Ville-Petteri Pradhan, Smitan Kushwah, Puru Sinha, Saumya Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature |
title | Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8050406/ https://www.ncbi.nlm.nih.gov/pubmed/34337589 http://dx.doi.org/10.1016/j.cmpbup.2021.100010 |