Cargando…
Modelling tick bite risk by combining random forests and count data regression models
The socio-economic and demographic changes that occurred over the past 50 years have dramatically expanded urban areas around the globe, thus bringing urban settlers in closer contact with nature. Ticks have trespassed the limits of forests and grasslands to start inhabiting green spaces within metr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6903726/ https://www.ncbi.nlm.nih.gov/pubmed/31821325 http://dx.doi.org/10.1371/journal.pone.0216511 |
_version_ | 1783477898436411392 |
---|---|
author | Garcia-Marti, Irene Zurita-Milla, Raul Swart, Arno |
author_facet | Garcia-Marti, Irene Zurita-Milla, Raul Swart, Arno |
author_sort | Garcia-Marti, Irene |
collection | PubMed |
description | The socio-economic and demographic changes that occurred over the past 50 years have dramatically expanded urban areas around the globe, thus bringing urban settlers in closer contact with nature. Ticks have trespassed the limits of forests and grasslands to start inhabiting green spaces within metropolitan areas. Hence, the transmission of pathogens causing tick-borne diseases is an important threat to public health. Using volunteered tick bite reports collected by two Dutch initiatives, here we present a method to model tick bite risk using human exposure and tick hazard predictors. Our method represents a step forward in risk modelling, since we combine a well-known ensemble learning method, Random Forest, with four count data models of the (zero-inflated) Poisson family. This combination allows us to better model the disproportions inherent in the volunteered tick bite reports. Unlike canonical machine learning models, our method can capture the overdispersion or zero-inflation inherent in data, thus yielding tick bite risk predictions that resemble the original signal captured by volunteers. Mapping model predictions enables a visual inspection of the spatial patterns of tick bite risk in the Netherlands. The Veluwe national park and the Utrechtse Heuvelrug forest, which are large forest-urban interfaces with several cities, are areas with high tick bite risk. This is expected, since these are popular places for recreation and tick activity is high in forests. However, our model can also predict high risk in less-intensively visited recreational areas, such as the patchy forests in the northeast of the country, the natural areas along the coastline, or some of the Frisian Islands. Our model could help public health specialists to design mitigation strategies for tick-borne diseases, and to target risky areas with awareness and prevention campaigns. |
format | Online Article Text |
id | pubmed-6903726 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-69037262019-12-20 Modelling tick bite risk by combining random forests and count data regression models Garcia-Marti, Irene Zurita-Milla, Raul Swart, Arno PLoS One Research Article The socio-economic and demographic changes that occurred over the past 50 years have dramatically expanded urban areas around the globe, thus bringing urban settlers in closer contact with nature. Ticks have trespassed the limits of forests and grasslands to start inhabiting green spaces within metropolitan areas. Hence, the transmission of pathogens causing tick-borne diseases is an important threat to public health. Using volunteered tick bite reports collected by two Dutch initiatives, here we present a method to model tick bite risk using human exposure and tick hazard predictors. Our method represents a step forward in risk modelling, since we combine a well-known ensemble learning method, Random Forest, with four count data models of the (zero-inflated) Poisson family. This combination allows us to better model the disproportions inherent in the volunteered tick bite reports. Unlike canonical machine learning models, our method can capture the overdispersion or zero-inflation inherent in data, thus yielding tick bite risk predictions that resemble the original signal captured by volunteers. Mapping model predictions enables a visual inspection of the spatial patterns of tick bite risk in the Netherlands. The Veluwe national park and the Utrechtse Heuvelrug forest, which are large forest-urban interfaces with several cities, are areas with high tick bite risk. This is expected, since these are popular places for recreation and tick activity is high in forests. However, our model can also predict high risk in less-intensively visited recreational areas, such as the patchy forests in the northeast of the country, the natural areas along the coastline, or some of the Frisian Islands. Our model could help public health specialists to design mitigation strategies for tick-borne diseases, and to target risky areas with awareness and prevention campaigns. Public Library of Science 2019-12-10 /pmc/articles/PMC6903726/ /pubmed/31821325 http://dx.doi.org/10.1371/journal.pone.0216511 Text en © 2019 Garcia-Marti et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Garcia-Marti, Irene Zurita-Milla, Raul Swart, Arno Modelling tick bite risk by combining random forests and count data regression models |
title | Modelling tick bite risk by combining random forests and count data regression models |
title_full | Modelling tick bite risk by combining random forests and count data regression models |
title_fullStr | Modelling tick bite risk by combining random forests and count data regression models |
title_full_unstemmed | Modelling tick bite risk by combining random forests and count data regression models |
title_short | Modelling tick bite risk by combining random forests and count data regression models |
title_sort | modelling tick bite risk by combining random forests and count data regression models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6903726/ https://www.ncbi.nlm.nih.gov/pubmed/31821325 http://dx.doi.org/10.1371/journal.pone.0216511 |
work_keys_str_mv | AT garciamartiirene modellingtickbiteriskbycombiningrandomforestsandcountdataregressionmodels AT zuritamillaraul modellingtickbiteriskbycombiningrandomforestsandcountdataregressionmodels AT swartarno modellingtickbiteriskbycombiningrandomforestsandcountdataregressionmodels |