Cargando…
Data analytics for novel coronavirus disease
This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Authors. Published by Elsevier Ltd.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295495/ https://www.ncbi.nlm.nih.gov/pubmed/32835073 http://dx.doi.org/10.1016/j.imu.2020.100374 |
_version_ | 1783546658480455680 |
---|---|
author | Mondal, M. Rubaiyat Hossain Bharati, Subrato Podder, Prajoy Podder, Priya |
author_facet | Mondal, M. Rubaiyat Hossain Bharati, Subrato Podder, Prajoy Podder, Priya |
author_sort | Mondal, M. Rubaiyat Hossain |
collection | PubMed |
description | This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors including its origin, its similarity with previous coronaviruses, its transmission capacity, its symptoms, etc. Secondly, data analytics is applied on a dataset of Johns Hopkins University to find out the spread of the viral infection. It is shown here that although the disease started in China in December 2019, the highest number of confirmed cases up to June 04, 2020 is in the USA. Thirdly, the worldwide increase in the number of confirmed cases over time is modelled here using a polynomial regression algorithm with degree 2. Fourthly, classification algorithms are applied on a dataset of 5644 samples provided by Hospital Israelita Albert Einstein of Brazil in order to diagnose COVID-19. It is shown here that multilayer perceptron (MLP), XGBoost and logistic regression can classify COVID-19 patients at an accuracy above 91%. Finally, a discussion is presented on the potential applications of data analytics in several important factors of COVID-19. |
format | Online Article Text |
id | pubmed-7295495 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | The Authors. Published by Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-72954952020-06-16 Data analytics for novel coronavirus disease Mondal, M. Rubaiyat Hossain Bharati, Subrato Podder, Prajoy Podder, Priya Inform Med Unlocked Article This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors including its origin, its similarity with previous coronaviruses, its transmission capacity, its symptoms, etc. Secondly, data analytics is applied on a dataset of Johns Hopkins University to find out the spread of the viral infection. It is shown here that although the disease started in China in December 2019, the highest number of confirmed cases up to June 04, 2020 is in the USA. Thirdly, the worldwide increase in the number of confirmed cases over time is modelled here using a polynomial regression algorithm with degree 2. Fourthly, classification algorithms are applied on a dataset of 5644 samples provided by Hospital Israelita Albert Einstein of Brazil in order to diagnose COVID-19. It is shown here that multilayer perceptron (MLP), XGBoost and logistic regression can classify COVID-19 patients at an accuracy above 91%. Finally, a discussion is presented on the potential applications of data analytics in several important factors of COVID-19. The Authors. Published by Elsevier Ltd. 2020 2020-06-15 /pmc/articles/PMC7295495/ /pubmed/32835073 http://dx.doi.org/10.1016/j.imu.2020.100374 Text en © 2020 The Authors Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Mondal, M. Rubaiyat Hossain Bharati, Subrato Podder, Prajoy Podder, Priya Data analytics for novel coronavirus disease |
title | Data analytics for novel coronavirus disease |
title_full | Data analytics for novel coronavirus disease |
title_fullStr | Data analytics for novel coronavirus disease |
title_full_unstemmed | Data analytics for novel coronavirus disease |
title_short | Data analytics for novel coronavirus disease |
title_sort | data analytics for novel coronavirus disease |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295495/ https://www.ncbi.nlm.nih.gov/pubmed/32835073 http://dx.doi.org/10.1016/j.imu.2020.100374 |
work_keys_str_mv | AT mondalmrubaiyathossain dataanalyticsfornovelcoronavirusdisease AT bharatisubrato dataanalyticsfornovelcoronavirusdisease AT podderprajoy dataanalyticsfornovelcoronavirusdisease AT podderpriya dataanalyticsfornovelcoronavirusdisease |