Cargando…
A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic
BACKGROUND: To understand the impact and volume of coronavirus (COVID-19) crisis, univariate analysis is tedious for describing the datasets reported daily. However, to capture the full picture and be able to compare situations and consequences for different countries, multivariate analytical models...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7685045/ https://www.ncbi.nlm.nih.gov/pubmed/33251372 http://dx.doi.org/10.1016/j.heliyon.2020.e05575 |
_version_ | 1783613119766986752 |
---|---|
author | Ramadan, Ahmed Kamel, Ahmed Taha, Alaa El-Shabrawy, Abdelhamid Abdel-Fatah, Noura Anwar |
author_facet | Ramadan, Ahmed Kamel, Ahmed Taha, Alaa El-Shabrawy, Abdelhamid Abdel-Fatah, Noura Anwar |
author_sort | Ramadan, Ahmed |
collection | PubMed |
description | BACKGROUND: To understand the impact and volume of coronavirus (COVID-19) crisis, univariate analysis is tedious for describing the datasets reported daily. However, to capture the full picture and be able to compare situations and consequences for different countries, multivariate analytical models are suggested in order to visualize and compare the situation of different countries more accurately and precisely. AIMS: We aimed to utilize data analysis tools that display the relative positions of data points in fewer dimensions while keeping the variation of the original data set as much as possible, and cluster countries according to their scores on the formed dimensions. METHODS: Principal component analysis (PCA) and Partitioning around medoids (PAM) clustering algorithms were used to analyze data of 56 countries, 82 countries and 91 countries with COVID-19 at three time points, eligible countries included in the analysis are those with total cases of 500 or more with no missing data. RESULTS: After performing PCA, we generated two scores: Disease Magnitude score that represents total cases, total deaths, total actives cases, and critically ill cases, and Mortality Recovery Ratio score that represents the ratio between total deaths to total recoveries in any given country. CONCLUSION: Accurate multivariate analyses can be of great value as they can simplify difficult concepts, explore and communicate findings from health datasets, and support the decision-making process. |
format | Online Article Text |
id | pubmed-7685045 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-76850452020-11-25 A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic Ramadan, Ahmed Kamel, Ahmed Taha, Alaa El-Shabrawy, Abdelhamid Abdel-Fatah, Noura Anwar Heliyon Research Article BACKGROUND: To understand the impact and volume of coronavirus (COVID-19) crisis, univariate analysis is tedious for describing the datasets reported daily. However, to capture the full picture and be able to compare situations and consequences for different countries, multivariate analytical models are suggested in order to visualize and compare the situation of different countries more accurately and precisely. AIMS: We aimed to utilize data analysis tools that display the relative positions of data points in fewer dimensions while keeping the variation of the original data set as much as possible, and cluster countries according to their scores on the formed dimensions. METHODS: Principal component analysis (PCA) and Partitioning around medoids (PAM) clustering algorithms were used to analyze data of 56 countries, 82 countries and 91 countries with COVID-19 at three time points, eligible countries included in the analysis are those with total cases of 500 or more with no missing data. RESULTS: After performing PCA, we generated two scores: Disease Magnitude score that represents total cases, total deaths, total actives cases, and critically ill cases, and Mortality Recovery Ratio score that represents the ratio between total deaths to total recoveries in any given country. CONCLUSION: Accurate multivariate analyses can be of great value as they can simplify difficult concepts, explore and communicate findings from health datasets, and support the decision-making process. Elsevier 2020-11-24 /pmc/articles/PMC7685045/ /pubmed/33251372 http://dx.doi.org/10.1016/j.heliyon.2020.e05575 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Ramadan, Ahmed Kamel, Ahmed Taha, Alaa El-Shabrawy, Abdelhamid Abdel-Fatah, Noura Anwar A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic |
title | A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic |
title_full | A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic |
title_fullStr | A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic |
title_full_unstemmed | A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic |
title_short | A multivariate data analysis approach for investigating daily statistics of countries affected with COVID-19 pandemic |
title_sort | multivariate data analysis approach for investigating daily statistics of countries affected with covid-19 pandemic |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7685045/ https://www.ncbi.nlm.nih.gov/pubmed/33251372 http://dx.doi.org/10.1016/j.heliyon.2020.e05575 |
work_keys_str_mv | AT ramadanahmed amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT kamelahmed amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT tahaalaa amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT elshabrawyabdelhamid amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT abdelfatahnouraanwar amultivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT ramadanahmed multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT kamelahmed multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT tahaalaa multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT elshabrawyabdelhamid multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic AT abdelfatahnouraanwar multivariatedataanalysisapproachforinvestigatingdailystatisticsofcountriesaffectedwithcovid19pandemic |