Cargando…

Modelling tick bite risk by combining random forests and count data regression models

The socio-economic and demographic changes that occurred over the past 50 years have dramatically expanded urban areas around the globe, thus bringing urban settlers in closer contact with nature. Ticks have trespassed the limits of forests and grasslands to start inhabiting green spaces within metr...

Descripción completa

Detalles Bibliográficos
Autores principales: Garcia-Marti, Irene, Zurita-Milla, Raul, Swart, Arno
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6903726/
https://www.ncbi.nlm.nih.gov/pubmed/31821325
http://dx.doi.org/10.1371/journal.pone.0216511
_version_ 1783477898436411392
author Garcia-Marti, Irene
Zurita-Milla, Raul
Swart, Arno
author_facet Garcia-Marti, Irene
Zurita-Milla, Raul
Swart, Arno
author_sort Garcia-Marti, Irene
collection PubMed
description The socio-economic and demographic changes that occurred over the past 50 years have dramatically expanded urban areas around the globe, thus bringing urban settlers in closer contact with nature. Ticks have trespassed the limits of forests and grasslands to start inhabiting green spaces within metropolitan areas. Hence, the transmission of pathogens causing tick-borne diseases is an important threat to public health. Using volunteered tick bite reports collected by two Dutch initiatives, here we present a method to model tick bite risk using human exposure and tick hazard predictors. Our method represents a step forward in risk modelling, since we combine a well-known ensemble learning method, Random Forest, with four count data models of the (zero-inflated) Poisson family. This combination allows us to better model the disproportions inherent in the volunteered tick bite reports. Unlike canonical machine learning models, our method can capture the overdispersion or zero-inflation inherent in data, thus yielding tick bite risk predictions that resemble the original signal captured by volunteers. Mapping model predictions enables a visual inspection of the spatial patterns of tick bite risk in the Netherlands. The Veluwe national park and the Utrechtse Heuvelrug forest, which are large forest-urban interfaces with several cities, are areas with high tick bite risk. This is expected, since these are popular places for recreation and tick activity is high in forests. However, our model can also predict high risk in less-intensively visited recreational areas, such as the patchy forests in the northeast of the country, the natural areas along the coastline, or some of the Frisian Islands. Our model could help public health specialists to design mitigation strategies for tick-borne diseases, and to target risky areas with awareness and prevention campaigns.
format Online
Article
Text
id pubmed-6903726
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69037262019-12-20 Modelling tick bite risk by combining random forests and count data regression models Garcia-Marti, Irene Zurita-Milla, Raul Swart, Arno PLoS One Research Article The socio-economic and demographic changes that occurred over the past 50 years have dramatically expanded urban areas around the globe, thus bringing urban settlers in closer contact with nature. Ticks have trespassed the limits of forests and grasslands to start inhabiting green spaces within metropolitan areas. Hence, the transmission of pathogens causing tick-borne diseases is an important threat to public health. Using volunteered tick bite reports collected by two Dutch initiatives, here we present a method to model tick bite risk using human exposure and tick hazard predictors. Our method represents a step forward in risk modelling, since we combine a well-known ensemble learning method, Random Forest, with four count data models of the (zero-inflated) Poisson family. This combination allows us to better model the disproportions inherent in the volunteered tick bite reports. Unlike canonical machine learning models, our method can capture the overdispersion or zero-inflation inherent in data, thus yielding tick bite risk predictions that resemble the original signal captured by volunteers. Mapping model predictions enables a visual inspection of the spatial patterns of tick bite risk in the Netherlands. The Veluwe national park and the Utrechtse Heuvelrug forest, which are large forest-urban interfaces with several cities, are areas with high tick bite risk. This is expected, since these are popular places for recreation and tick activity is high in forests. However, our model can also predict high risk in less-intensively visited recreational areas, such as the patchy forests in the northeast of the country, the natural areas along the coastline, or some of the Frisian Islands. Our model could help public health specialists to design mitigation strategies for tick-borne diseases, and to target risky areas with awareness and prevention campaigns. Public Library of Science 2019-12-10 /pmc/articles/PMC6903726/ /pubmed/31821325 http://dx.doi.org/10.1371/journal.pone.0216511 Text en © 2019 Garcia-Marti et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Garcia-Marti, Irene
Zurita-Milla, Raul
Swart, Arno
Modelling tick bite risk by combining random forests and count data regression models
title Modelling tick bite risk by combining random forests and count data regression models
title_full Modelling tick bite risk by combining random forests and count data regression models
title_fullStr Modelling tick bite risk by combining random forests and count data regression models
title_full_unstemmed Modelling tick bite risk by combining random forests and count data regression models
title_short Modelling tick bite risk by combining random forests and count data regression models
title_sort modelling tick bite risk by combining random forests and count data regression models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6903726/
https://www.ncbi.nlm.nih.gov/pubmed/31821325
http://dx.doi.org/10.1371/journal.pone.0216511
work_keys_str_mv AT garciamartiirene modellingtickbiteriskbycombiningrandomforestsandcountdataregressionmodels
AT zuritamillaraul modellingtickbiteriskbycombiningrandomforestsandcountdataregressionmodels
AT swartarno modellingtickbiteriskbycombiningrandomforestsandcountdataregressionmodels