Cargando…

Data mining methodology for obtaining epidemiological data in the context of road transport systems

Millions of people use public transport systems daily, hence their interest for the epidemiology of respiratory infectious diseases, both from a scientific and a health control point of view. This article presents a methodology for obtaining epidemiological information on these types of diseases in...

Descripción completa

Detalles Bibliográficos
Autores principales: Cristóbal, Teresa, Quesada-Arencibia, Alexis, de Blasio, Gabriele Salvatore, Padrón, Gabino, Alayón, Francisco, García, Carmelo R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525233/
https://www.ncbi.nlm.nih.gov/pubmed/36212894
http://dx.doi.org/10.1007/s12652-022-04427-2
_version_ 1784800663863033856
author Cristóbal, Teresa
Quesada-Arencibia, Alexis
de Blasio, Gabriele Salvatore
Padrón, Gabino
Alayón, Francisco
García, Carmelo R.
author_facet Cristóbal, Teresa
Quesada-Arencibia, Alexis
de Blasio, Gabriele Salvatore
Padrón, Gabino
Alayón, Francisco
García, Carmelo R.
author_sort Cristóbal, Teresa
collection PubMed
description Millions of people use public transport systems daily, hence their interest for the epidemiology of respiratory infectious diseases, both from a scientific and a health control point of view. This article presents a methodology for obtaining epidemiological information on these types of diseases in the context of a public road transport system. This epidemiological information is based on an estimation of interactions with risk of infection between users of the public transport system. The methodology is novel in its aim since, to the best of our knowledge, there is no previous study in the context of epidemiology and public transport systems that addresses this challenge. The information is obtained by mining the data generated from trips made by transport users who use contactless cards as a means of payment. Data mining therefore underpins the methodology. One achievement of the methodology is that it is a comprehensive approach, since, starting from a formalisation of the problem based on epidemiological concepts and the transport activity itself, all the necessary steps to obtain the required epidemiological knowledge are described and implemented. This includes the estimation of data that are generally unknown in the context of public transport systems, but that are required to generate the desired results. The outcome is useful epidemiological data based on a complete and reliable description of all estimated potentially infectious interactions between users of the transport system. The methodology can be implemented using a variety of initial specifications: epidemiological, temporal, geographic, inter alia. Another feature of the methodology is that with the information it provides, epidemiological studies can be carried out involving a large number of people, producing large samples of interactions obtained over long periods of time, thereby making it possible to carry out comparative studies. Moreover, a real use case is described, in which the methodology is applied to a road transport system that annually moves around 20 million passengers, in a period that predates the COVID-19 pandemic. The results have made it possible to identify the group of users most exposed to infection, although they are not the largest group. Finally, it is estimated that the application of a seat allocation strategy that minimises the risk of infection reduces the risk by 50%.
format Online
Article
Text
id pubmed-9525233
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-95252332022-10-03 Data mining methodology for obtaining epidemiological data in the context of road transport systems Cristóbal, Teresa Quesada-Arencibia, Alexis de Blasio, Gabriele Salvatore Padrón, Gabino Alayón, Francisco García, Carmelo R. J Ambient Intell Humaniz Comput Original Research Millions of people use public transport systems daily, hence their interest for the epidemiology of respiratory infectious diseases, both from a scientific and a health control point of view. This article presents a methodology for obtaining epidemiological information on these types of diseases in the context of a public road transport system. This epidemiological information is based on an estimation of interactions with risk of infection between users of the public transport system. The methodology is novel in its aim since, to the best of our knowledge, there is no previous study in the context of epidemiology and public transport systems that addresses this challenge. The information is obtained by mining the data generated from trips made by transport users who use contactless cards as a means of payment. Data mining therefore underpins the methodology. One achievement of the methodology is that it is a comprehensive approach, since, starting from a formalisation of the problem based on epidemiological concepts and the transport activity itself, all the necessary steps to obtain the required epidemiological knowledge are described and implemented. This includes the estimation of data that are generally unknown in the context of public transport systems, but that are required to generate the desired results. The outcome is useful epidemiological data based on a complete and reliable description of all estimated potentially infectious interactions between users of the transport system. The methodology can be implemented using a variety of initial specifications: epidemiological, temporal, geographic, inter alia. Another feature of the methodology is that with the information it provides, epidemiological studies can be carried out involving a large number of people, producing large samples of interactions obtained over long periods of time, thereby making it possible to carry out comparative studies. Moreover, a real use case is described, in which the methodology is applied to a road transport system that annually moves around 20 million passengers, in a period that predates the COVID-19 pandemic. The results have made it possible to identify the group of users most exposed to infection, although they are not the largest group. Finally, it is estimated that the application of a seat allocation strategy that minimises the risk of infection reduces the risk by 50%. Springer Berlin Heidelberg 2022-10-01 2023 /pmc/articles/PMC9525233/ /pubmed/36212894 http://dx.doi.org/10.1007/s12652-022-04427-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Research
Cristóbal, Teresa
Quesada-Arencibia, Alexis
de Blasio, Gabriele Salvatore
Padrón, Gabino
Alayón, Francisco
García, Carmelo R.
Data mining methodology for obtaining epidemiological data in the context of road transport systems
title Data mining methodology for obtaining epidemiological data in the context of road transport systems
title_full Data mining methodology for obtaining epidemiological data in the context of road transport systems
title_fullStr Data mining methodology for obtaining epidemiological data in the context of road transport systems
title_full_unstemmed Data mining methodology for obtaining epidemiological data in the context of road transport systems
title_short Data mining methodology for obtaining epidemiological data in the context of road transport systems
title_sort data mining methodology for obtaining epidemiological data in the context of road transport systems
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525233/
https://www.ncbi.nlm.nih.gov/pubmed/36212894
http://dx.doi.org/10.1007/s12652-022-04427-2
work_keys_str_mv AT cristobalteresa dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems
AT quesadaarencibiaalexis dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems
AT deblasiogabrielesalvatore dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems
AT padrongabino dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems
AT alayonfrancisco dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems
AT garciacarmelor dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems