Cargando…

Data analytics for novel coronavirus disease

This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors...

Descripción completa

Detalles Bibliográficos
Autores principales: Mondal, M. Rubaiyat Hossain, Bharati, Subrato, Podder, Prajoy, Podder, Priya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Authors. Published by Elsevier Ltd. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295495/
https://www.ncbi.nlm.nih.gov/pubmed/32835073
http://dx.doi.org/10.1016/j.imu.2020.100374
_version_ 1783546658480455680
author Mondal, M. Rubaiyat Hossain
Bharati, Subrato
Podder, Prajoy
Podder, Priya
author_facet Mondal, M. Rubaiyat Hossain
Bharati, Subrato
Podder, Prajoy
Podder, Priya
author_sort Mondal, M. Rubaiyat Hossain
collection PubMed
description This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors including its origin, its similarity with previous coronaviruses, its transmission capacity, its symptoms, etc. Secondly, data analytics is applied on a dataset of Johns Hopkins University to find out the spread of the viral infection. It is shown here that although the disease started in China in December 2019, the highest number of confirmed cases up to June 04, 2020 is in the USA. Thirdly, the worldwide increase in the number of confirmed cases over time is modelled here using a polynomial regression algorithm with degree 2. Fourthly, classification algorithms are applied on a dataset of 5644 samples provided by Hospital Israelita Albert Einstein of Brazil in order to diagnose COVID-19. It is shown here that multilayer perceptron (MLP), XGBoost and logistic regression can classify COVID-19 patients at an accuracy above 91%. Finally, a discussion is presented on the potential applications of data analytics in several important factors of COVID-19.
format Online
Article
Text
id pubmed-7295495
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Authors. Published by Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-72954952020-06-16 Data analytics for novel coronavirus disease Mondal, M. Rubaiyat Hossain Bharati, Subrato Podder, Prajoy Podder, Priya Inform Med Unlocked Article This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors including its origin, its similarity with previous coronaviruses, its transmission capacity, its symptoms, etc. Secondly, data analytics is applied on a dataset of Johns Hopkins University to find out the spread of the viral infection. It is shown here that although the disease started in China in December 2019, the highest number of confirmed cases up to June 04, 2020 is in the USA. Thirdly, the worldwide increase in the number of confirmed cases over time is modelled here using a polynomial regression algorithm with degree 2. Fourthly, classification algorithms are applied on a dataset of 5644 samples provided by Hospital Israelita Albert Einstein of Brazil in order to diagnose COVID-19. It is shown here that multilayer perceptron (MLP), XGBoost and logistic regression can classify COVID-19 patients at an accuracy above 91%. Finally, a discussion is presented on the potential applications of data analytics in several important factors of COVID-19. The Authors. Published by Elsevier Ltd. 2020 2020-06-15 /pmc/articles/PMC7295495/ /pubmed/32835073 http://dx.doi.org/10.1016/j.imu.2020.100374 Text en © 2020 The Authors Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Mondal, M. Rubaiyat Hossain
Bharati, Subrato
Podder, Prajoy
Podder, Priya
Data analytics for novel coronavirus disease
title Data analytics for novel coronavirus disease
title_full Data analytics for novel coronavirus disease
title_fullStr Data analytics for novel coronavirus disease
title_full_unstemmed Data analytics for novel coronavirus disease
title_short Data analytics for novel coronavirus disease
title_sort data analytics for novel coronavirus disease
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295495/
https://www.ncbi.nlm.nih.gov/pubmed/32835073
http://dx.doi.org/10.1016/j.imu.2020.100374
work_keys_str_mv AT mondalmrubaiyathossain dataanalyticsfornovelcoronavirusdisease
AT bharatisubrato dataanalyticsfornovelcoronavirusdisease
AT podderprajoy dataanalyticsfornovelcoronavirusdisease
AT podderpriya dataanalyticsfornovelcoronavirusdisease