Cargando…
COVID-19 in Italy and extreme data mining
In this article we want to show the potential of an evolutionary algorithm called Topological Weighted Centroid (TWC). This algorithm can obtain new and relevant information from extremely limited and poor datasets. In a world dominated by the concept of big (fat?) data we want to show that it is po...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier B.V.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382358/ https://www.ncbi.nlm.nih.gov/pubmed/32834435 http://dx.doi.org/10.1016/j.physa.2020.124991 |
_version_ | 1783563229628203008 |
---|---|
author | Buscema, Paolo Massimo Della Torre, Francesca Breda, Marco Massini, Giulia Grossi, Enzo |
author_facet | Buscema, Paolo Massimo Della Torre, Francesca Breda, Marco Massini, Giulia Grossi, Enzo |
author_sort | Buscema, Paolo Massimo |
collection | PubMed |
description | In this article we want to show the potential of an evolutionary algorithm called Topological Weighted Centroid (TWC). This algorithm can obtain new and relevant information from extremely limited and poor datasets. In a world dominated by the concept of big (fat?) data we want to show that it is possible, by necessity or choice, to work profitably even on small data. This peculiarity of the algorithm means that even in the early stages of an epidemic process, when the data are too few to have sufficient statistics, it is possible to obtain important information. To prove our theory, we addressed one of the most central issues at the moment: the COVID-19 epidemic. In particular, the cases recorded in Italy have been selected. Italy seems to have a central role in this epidemic because of the high number of measured infections. Through this innovative artificial intelligence algorithm, we have tried to analyze the evolution of the phenomenon and to predict its future steps using a dataset that contained only geospatial coordinates (longitude and latitude) of the first recorded cases. Once the coordinates of the places where at least one case of contagion had been officially diagnosed until February 26th, 2020 had been collected, research and analysis was carried out on: outbreak point and related heat map (TWC alpha); probability distribution of the contagion on February 26th (TWC beta); possible spread of the phenomenon in the immediate future and then in the future of the future (TWC gamma and TWC theta); how this passage occurred in terms of paths and mutual influence (Theta paths and Markov Machine). Finally, a heat map of the possible situation towards the end of the epidemic in terms of infectiousness of the areas was drawn up. The analyses with TWC confirm the assumptions made at the beginning. |
format | Online Article Text |
id | pubmed-7382358 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-73823582020-07-28 COVID-19 in Italy and extreme data mining Buscema, Paolo Massimo Della Torre, Francesca Breda, Marco Massini, Giulia Grossi, Enzo Physica A Article In this article we want to show the potential of an evolutionary algorithm called Topological Weighted Centroid (TWC). This algorithm can obtain new and relevant information from extremely limited and poor datasets. In a world dominated by the concept of big (fat?) data we want to show that it is possible, by necessity or choice, to work profitably even on small data. This peculiarity of the algorithm means that even in the early stages of an epidemic process, when the data are too few to have sufficient statistics, it is possible to obtain important information. To prove our theory, we addressed one of the most central issues at the moment: the COVID-19 epidemic. In particular, the cases recorded in Italy have been selected. Italy seems to have a central role in this epidemic because of the high number of measured infections. Through this innovative artificial intelligence algorithm, we have tried to analyze the evolution of the phenomenon and to predict its future steps using a dataset that contained only geospatial coordinates (longitude and latitude) of the first recorded cases. Once the coordinates of the places where at least one case of contagion had been officially diagnosed until February 26th, 2020 had been collected, research and analysis was carried out on: outbreak point and related heat map (TWC alpha); probability distribution of the contagion on February 26th (TWC beta); possible spread of the phenomenon in the immediate future and then in the future of the future (TWC gamma and TWC theta); how this passage occurred in terms of paths and mutual influence (Theta paths and Markov Machine). Finally, a heat map of the possible situation towards the end of the epidemic in terms of infectiousness of the areas was drawn up. The analyses with TWC confirm the assumptions made at the beginning. Elsevier B.V. 2020-11-01 2020-07-25 /pmc/articles/PMC7382358/ /pubmed/32834435 http://dx.doi.org/10.1016/j.physa.2020.124991 Text en © 2020 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Buscema, Paolo Massimo Della Torre, Francesca Breda, Marco Massini, Giulia Grossi, Enzo COVID-19 in Italy and extreme data mining |
title | COVID-19 in Italy and extreme data mining |
title_full | COVID-19 in Italy and extreme data mining |
title_fullStr | COVID-19 in Italy and extreme data mining |
title_full_unstemmed | COVID-19 in Italy and extreme data mining |
title_short | COVID-19 in Italy and extreme data mining |
title_sort | covid-19 in italy and extreme data mining |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382358/ https://www.ncbi.nlm.nih.gov/pubmed/32834435 http://dx.doi.org/10.1016/j.physa.2020.124991 |
work_keys_str_mv | AT buscemapaolomassimo covid19initalyandextremedatamining AT dellatorrefrancesca covid19initalyandextremedatamining AT bredamarco covid19initalyandextremedatamining AT massinigiulia covid19initalyandextremedatamining AT grossienzo covid19initalyandextremedatamining |