Cargando…

COVID-19 in Italy and extreme data mining

In this article we want to show the potential of an evolutionary algorithm called Topological Weighted Centroid (TWC). This algorithm can obtain new and relevant information from extremely limited and poor datasets. In a world dominated by the concept of big (fat?) data we want to show that it is po...

Descripción completa

Detalles Bibliográficos
Autores principales: Buscema, Paolo Massimo, Della Torre, Francesca, Breda, Marco, Massini, Giulia, Grossi, Enzo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382358/
https://www.ncbi.nlm.nih.gov/pubmed/32834435
http://dx.doi.org/10.1016/j.physa.2020.124991
_version_ 1783563229628203008
author Buscema, Paolo Massimo
Della Torre, Francesca
Breda, Marco
Massini, Giulia
Grossi, Enzo
author_facet Buscema, Paolo Massimo
Della Torre, Francesca
Breda, Marco
Massini, Giulia
Grossi, Enzo
author_sort Buscema, Paolo Massimo
collection PubMed
description In this article we want to show the potential of an evolutionary algorithm called Topological Weighted Centroid (TWC). This algorithm can obtain new and relevant information from extremely limited and poor datasets. In a world dominated by the concept of big (fat?) data we want to show that it is possible, by necessity or choice, to work profitably even on small data. This peculiarity of the algorithm means that even in the early stages of an epidemic process, when the data are too few to have sufficient statistics, it is possible to obtain important information. To prove our theory, we addressed one of the most central issues at the moment: the COVID-19 epidemic. In particular, the cases recorded in Italy have been selected. Italy seems to have a central role in this epidemic because of the high number of measured infections. Through this innovative artificial intelligence algorithm, we have tried to analyze the evolution of the phenomenon and to predict its future steps using a dataset that contained only geospatial coordinates (longitude and latitude) of the first recorded cases. Once the coordinates of the places where at least one case of contagion had been officially diagnosed until February 26th, 2020 had been collected, research and analysis was carried out on: outbreak point and related heat map (TWC alpha); probability distribution of the contagion on February 26th (TWC beta); possible spread of the phenomenon in the immediate future and then in the future of the future (TWC gamma and TWC theta); how this passage occurred in terms of paths and mutual influence (Theta paths and Markov Machine). Finally, a heat map of the possible situation towards the end of the epidemic in terms of infectiousness of the areas was drawn up. The analyses with TWC confirm the assumptions made at the beginning.
format Online
Article
Text
id pubmed-7382358
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-73823582020-07-28 COVID-19 in Italy and extreme data mining Buscema, Paolo Massimo Della Torre, Francesca Breda, Marco Massini, Giulia Grossi, Enzo Physica A Article In this article we want to show the potential of an evolutionary algorithm called Topological Weighted Centroid (TWC). This algorithm can obtain new and relevant information from extremely limited and poor datasets. In a world dominated by the concept of big (fat?) data we want to show that it is possible, by necessity or choice, to work profitably even on small data. This peculiarity of the algorithm means that even in the early stages of an epidemic process, when the data are too few to have sufficient statistics, it is possible to obtain important information. To prove our theory, we addressed one of the most central issues at the moment: the COVID-19 epidemic. In particular, the cases recorded in Italy have been selected. Italy seems to have a central role in this epidemic because of the high number of measured infections. Through this innovative artificial intelligence algorithm, we have tried to analyze the evolution of the phenomenon and to predict its future steps using a dataset that contained only geospatial coordinates (longitude and latitude) of the first recorded cases. Once the coordinates of the places where at least one case of contagion had been officially diagnosed until February 26th, 2020 had been collected, research and analysis was carried out on: outbreak point and related heat map (TWC alpha); probability distribution of the contagion on February 26th (TWC beta); possible spread of the phenomenon in the immediate future and then in the future of the future (TWC gamma and TWC theta); how this passage occurred in terms of paths and mutual influence (Theta paths and Markov Machine). Finally, a heat map of the possible situation towards the end of the epidemic in terms of infectiousness of the areas was drawn up. The analyses with TWC confirm the assumptions made at the beginning. Elsevier B.V. 2020-11-01 2020-07-25 /pmc/articles/PMC7382358/ /pubmed/32834435 http://dx.doi.org/10.1016/j.physa.2020.124991 Text en © 2020 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Buscema, Paolo Massimo
Della Torre, Francesca
Breda, Marco
Massini, Giulia
Grossi, Enzo
COVID-19 in Italy and extreme data mining
title COVID-19 in Italy and extreme data mining
title_full COVID-19 in Italy and extreme data mining
title_fullStr COVID-19 in Italy and extreme data mining
title_full_unstemmed COVID-19 in Italy and extreme data mining
title_short COVID-19 in Italy and extreme data mining
title_sort covid-19 in italy and extreme data mining
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382358/
https://www.ncbi.nlm.nih.gov/pubmed/32834435
http://dx.doi.org/10.1016/j.physa.2020.124991
work_keys_str_mv AT buscemapaolomassimo covid19initalyandextremedatamining
AT dellatorrefrancesca covid19initalyandextremedatamining
AT bredamarco covid19initalyandextremedatamining
AT massinigiulia covid19initalyandextremedatamining
AT grossienzo covid19initalyandextremedatamining