Cargando…

A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic

BACKGROUND: To understand the impact and volume of coronavirus (COVID-19) crisis, univariate analysis is tedious for describing the datasets reported daily. However, to capture the full picture and be able to compare situations and consequences for different countries, multivariate analytical models...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramadan, Ahmed, Kamel, Ahmed, Taha, Alaa, El-Shabrawy, Abdelhamid, Abdel-Fatah, Noura Anwar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7685045/
https://www.ncbi.nlm.nih.gov/pubmed/33251372
http://dx.doi.org/10.1016/j.heliyon.2020.e05575
_version_ 1783613119766986752
author Ramadan, Ahmed
Kamel, Ahmed
Taha, Alaa
El-Shabrawy, Abdelhamid
Abdel-Fatah, Noura Anwar
author_facet Ramadan, Ahmed
Kamel, Ahmed
Taha, Alaa
El-Shabrawy, Abdelhamid
Abdel-Fatah, Noura Anwar
author_sort Ramadan, Ahmed
collection PubMed
description BACKGROUND: To understand the impact and volume of coronavirus (COVID-19) crisis, univariate analysis is tedious for describing the datasets reported daily. However, to capture the full picture and be able to compare situations and consequences for different countries, multivariate analytical models are suggested in order to visualize and compare the situation of different countries more accurately and precisely. AIMS: We aimed to utilize data analysis tools that display the relative positions of data points in fewer dimensions while keeping the variation of the original data set as much as possible, and cluster countries according to their scores on the formed dimensions. METHODS: Principal component analysis (PCA) and Partitioning around medoids (PAM) clustering algorithms were used to analyze data of 56 countries, 82 countries and 91 countries with COVID-19 at three time points, eligible countries included in the analysis are those with total cases of 500 or more with no missing data. RESULTS: After performing PCA, we generated two scores: Disease Magnitude score that represents total cases, total deaths, total actives cases, and critically ill cases, and Mortality Recovery Ratio score that represents the ratio between total deaths to total recoveries in any given country. CONCLUSION: Accurate multivariate analyses can be of great value as they can simplify difficult concepts, explore and communicate findings from health datasets, and support the decision-making process.
format Online
Article
Text
id pubmed-7685045
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-76850452020-11-25 A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic Ramadan, Ahmed Kamel, Ahmed Taha, Alaa El-Shabrawy, Abdelhamid Abdel-Fatah, Noura Anwar Heliyon Research Article BACKGROUND: To understand the impact and volume of coronavirus (COVID-19) crisis, univariate analysis is tedious for describing the datasets reported daily. However, to capture the full picture and be able to compare situations and consequences for different countries, multivariate analytical models are suggested in order to visualize and compare the situation of different countries more accurately and precisely. AIMS: We aimed to utilize data analysis tools that display the relative positions of data points in fewer dimensions while keeping the variation of the original data set as much as possible, and cluster countries according to their scores on the formed dimensions. METHODS: Principal component analysis (PCA) and Partitioning around medoids (PAM) clustering algorithms were used to analyze data of 56 countries, 82 countries and 91 countries with COVID-19 at three time points, eligible countries included in the analysis are those with total cases of 500 or more with no missing data. RESULTS: After performing PCA, we generated two scores: Disease Magnitude score that represents total cases, total deaths, total actives cases, and critically ill cases, and Mortality Recovery Ratio score that represents the ratio between total deaths to total recoveries in any given country. CONCLUSION: Accurate multivariate analyses can be of great value as they can simplify difficult concepts, explore and communicate findings from health datasets, and support the decision-making process. Elsevier 2020-11-24 /pmc/articles/PMC7685045/ /pubmed/33251372 http://dx.doi.org/10.1016/j.heliyon.2020.e05575 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Ramadan, Ahmed
Kamel, Ahmed
Taha, Alaa
El-Shabrawy, Abdelhamid
Abdel-Fatah, Noura Anwar
A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic
title A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic
title_full A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic
title_fullStr A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic
title_full_unstemmed A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic
title_short A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic
title_sort multivariate data analysis approach for investigating daily statistics of countries affected with covid-19 pandemic
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7685045/
https://www.ncbi.nlm.nih.gov/pubmed/33251372
http://dx.doi.org/10.1016/j.heliyon.2020.e05575
work_keys_str_mv AT ramadanahmed amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT kamelahmed amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT tahaalaa amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT elshabrawyabdelhamid amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT abdelfatahnouraanwar amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT ramadanahmed multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT kamelahmed multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT tahaalaa multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT elshabrawyabdelhamid multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic
AT abdelfatahnouraanwar multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic