Cargando…
Data mining methodology for obtaining epidemiological data in the context of road transport systems
Millions of people use public transport systems daily, hence their interest for the epidemiology of respiratory infectious diseases, both from a scientific and a health control point of view. This article presents a methodology for obtaining epidemiological information on these types of diseases in...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525233/ https://www.ncbi.nlm.nih.gov/pubmed/36212894 http://dx.doi.org/10.1007/s12652-022-04427-2 |
_version_ | 1784800663863033856 |
---|---|
author | Cristóbal, Teresa Quesada-Arencibia, Alexis de Blasio, Gabriele Salvatore Padrón, Gabino Alayón, Francisco García, Carmelo R. |
author_facet | Cristóbal, Teresa Quesada-Arencibia, Alexis de Blasio, Gabriele Salvatore Padrón, Gabino Alayón, Francisco García, Carmelo R. |
author_sort | Cristóbal, Teresa |
collection | PubMed |
description | Millions of people use public transport systems daily, hence their interest for the epidemiology of respiratory infectious diseases, both from a scientific and a health control point of view. This article presents a methodology for obtaining epidemiological information on these types of diseases in the context of a public road transport system. This epidemiological information is based on an estimation of interactions with risk of infection between users of the public transport system. The methodology is novel in its aim since, to the best of our knowledge, there is no previous study in the context of epidemiology and public transport systems that addresses this challenge. The information is obtained by mining the data generated from trips made by transport users who use contactless cards as a means of payment. Data mining therefore underpins the methodology. One achievement of the methodology is that it is a comprehensive approach, since, starting from a formalisation of the problem based on epidemiological concepts and the transport activity itself, all the necessary steps to obtain the required epidemiological knowledge are described and implemented. This includes the estimation of data that are generally unknown in the context of public transport systems, but that are required to generate the desired results. The outcome is useful epidemiological data based on a complete and reliable description of all estimated potentially infectious interactions between users of the transport system. The methodology can be implemented using a variety of initial specifications: epidemiological, temporal, geographic, inter alia. Another feature of the methodology is that with the information it provides, epidemiological studies can be carried out involving a large number of people, producing large samples of interactions obtained over long periods of time, thereby making it possible to carry out comparative studies. Moreover, a real use case is described, in which the methodology is applied to a road transport system that annually moves around 20 million passengers, in a period that predates the COVID-19 pandemic. The results have made it possible to identify the group of users most exposed to infection, although they are not the largest group. Finally, it is estimated that the application of a seat allocation strategy that minimises the risk of infection reduces the risk by 50%. |
format | Online Article Text |
id | pubmed-9525233 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-95252332022-10-03 Data mining methodology for obtaining epidemiological data in the context of road transport systems Cristóbal, Teresa Quesada-Arencibia, Alexis de Blasio, Gabriele Salvatore Padrón, Gabino Alayón, Francisco García, Carmelo R. J Ambient Intell Humaniz Comput Original Research Millions of people use public transport systems daily, hence their interest for the epidemiology of respiratory infectious diseases, both from a scientific and a health control point of view. This article presents a methodology for obtaining epidemiological information on these types of diseases in the context of a public road transport system. This epidemiological information is based on an estimation of interactions with risk of infection between users of the public transport system. The methodology is novel in its aim since, to the best of our knowledge, there is no previous study in the context of epidemiology and public transport systems that addresses this challenge. The information is obtained by mining the data generated from trips made by transport users who use contactless cards as a means of payment. Data mining therefore underpins the methodology. One achievement of the methodology is that it is a comprehensive approach, since, starting from a formalisation of the problem based on epidemiological concepts and the transport activity itself, all the necessary steps to obtain the required epidemiological knowledge are described and implemented. This includes the estimation of data that are generally unknown in the context of public transport systems, but that are required to generate the desired results. The outcome is useful epidemiological data based on a complete and reliable description of all estimated potentially infectious interactions between users of the transport system. The methodology can be implemented using a variety of initial specifications: epidemiological, temporal, geographic, inter alia. Another feature of the methodology is that with the information it provides, epidemiological studies can be carried out involving a large number of people, producing large samples of interactions obtained over long periods of time, thereby making it possible to carry out comparative studies. Moreover, a real use case is described, in which the methodology is applied to a road transport system that annually moves around 20 million passengers, in a period that predates the COVID-19 pandemic. The results have made it possible to identify the group of users most exposed to infection, although they are not the largest group. Finally, it is estimated that the application of a seat allocation strategy that minimises the risk of infection reduces the risk by 50%. Springer Berlin Heidelberg 2022-10-01 2023 /pmc/articles/PMC9525233/ /pubmed/36212894 http://dx.doi.org/10.1007/s12652-022-04427-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Research Cristóbal, Teresa Quesada-Arencibia, Alexis de Blasio, Gabriele Salvatore Padrón, Gabino Alayón, Francisco García, Carmelo R. Data mining methodology for obtaining epidemiological data in the context of road transport systems |
title | Data mining methodology for obtaining epidemiological data in the context of road transport systems |
title_full | Data mining methodology for obtaining epidemiological data in the context of road transport systems |
title_fullStr | Data mining methodology for obtaining epidemiological data in the context of road transport systems |
title_full_unstemmed | Data mining methodology for obtaining epidemiological data in the context of road transport systems |
title_short | Data mining methodology for obtaining epidemiological data in the context of road transport systems |
title_sort | data mining methodology for obtaining epidemiological data in the context of road transport systems |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525233/ https://www.ncbi.nlm.nih.gov/pubmed/36212894 http://dx.doi.org/10.1007/s12652-022-04427-2 |
work_keys_str_mv | AT cristobalteresa dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems AT quesadaarencibiaalexis dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems AT deblasiogabrielesalvatore dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems AT padrongabino dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems AT alayonfrancisco dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems AT garciacarmelor dataminingmethodologyforobtainingepidemiologicaldatainthecontextofroadtransportsystems |