Cargando…
Using Random Forest to Improve the Downscaling of Global Livestock Census Data
Large scale, high-resolution global data on farm animal distributions are essential for spatially explicit assessments of the epidemiological, environmental and socio-economic impacts of the livestock sector. This has been the major motivation behind the development of the Gridded Livestock of the W...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792414/ https://www.ncbi.nlm.nih.gov/pubmed/26977807 http://dx.doi.org/10.1371/journal.pone.0150424 |
_version_ | 1782421238369484800 |
---|---|
author | Nicolas, Gaëlle Robinson, Timothy P. Wint, G. R. William Conchedda, Giulia Cinardi, Giuseppina Gilbert, Marius |
author_facet | Nicolas, Gaëlle Robinson, Timothy P. Wint, G. R. William Conchedda, Giulia Cinardi, Giuseppina Gilbert, Marius |
author_sort | Nicolas, Gaëlle |
collection | PubMed |
description | Large scale, high-resolution global data on farm animal distributions are essential for spatially explicit assessments of the epidemiological, environmental and socio-economic impacts of the livestock sector. This has been the major motivation behind the development of the Gridded Livestock of the World (GLW) database, which has been extensively used since its first publication in 2007. The database relies on a downscaling methodology whereby census counts of animals in sub-national administrative units are redistributed at the level of grid cells as a function of a series of spatial covariates. The recent upgrade of GLW1 to GLW2 involved automating the processing, improvement of input data, and downscaling at a spatial resolution of 1 km per cell (5 km per cell in the earlier version). The underlying statistical methodology, however, remained unchanged. In this paper, we evaluate new methods to downscale census data with a higher accuracy and increased processing efficiency. Two main factors were evaluated, based on sample census datasets of cattle in Africa and chickens in Asia. First, we implemented and evaluated Random Forest models (RF) instead of stratified regressions. Second, we investigated whether models that predicted the number of animals per rural person (per capita) could provide better downscaled estimates than the previous approach that predicted absolute densities (animals per km(2)). RF models consistently provided better predictions than the stratified regressions for both continents and species. The benefit of per capita over absolute density models varied according to the species and continent. In addition, different technical options were evaluated to reduce the processing time while maintaining their predictive power. Future GLW runs (GLW 3.0) will apply the new RF methodology with optimized modelling options. The potential benefit of per capita models will need to be further investigated with a better distinction between rural and agricultural populations. |
format | Online Article Text |
id | pubmed-4792414 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-47924142016-03-23 Using Random Forest to Improve the Downscaling of Global Livestock Census Data Nicolas, Gaëlle Robinson, Timothy P. Wint, G. R. William Conchedda, Giulia Cinardi, Giuseppina Gilbert, Marius PLoS One Research Article Large scale, high-resolution global data on farm animal distributions are essential for spatially explicit assessments of the epidemiological, environmental and socio-economic impacts of the livestock sector. This has been the major motivation behind the development of the Gridded Livestock of the World (GLW) database, which has been extensively used since its first publication in 2007. The database relies on a downscaling methodology whereby census counts of animals in sub-national administrative units are redistributed at the level of grid cells as a function of a series of spatial covariates. The recent upgrade of GLW1 to GLW2 involved automating the processing, improvement of input data, and downscaling at a spatial resolution of 1 km per cell (5 km per cell in the earlier version). The underlying statistical methodology, however, remained unchanged. In this paper, we evaluate new methods to downscale census data with a higher accuracy and increased processing efficiency. Two main factors were evaluated, based on sample census datasets of cattle in Africa and chickens in Asia. First, we implemented and evaluated Random Forest models (RF) instead of stratified regressions. Second, we investigated whether models that predicted the number of animals per rural person (per capita) could provide better downscaled estimates than the previous approach that predicted absolute densities (animals per km(2)). RF models consistently provided better predictions than the stratified regressions for both continents and species. The benefit of per capita over absolute density models varied according to the species and continent. In addition, different technical options were evaluated to reduce the processing time while maintaining their predictive power. Future GLW runs (GLW 3.0) will apply the new RF methodology with optimized modelling options. The potential benefit of per capita models will need to be further investigated with a better distinction between rural and agricultural populations. Public Library of Science 2016-03-15 /pmc/articles/PMC4792414/ /pubmed/26977807 http://dx.doi.org/10.1371/journal.pone.0150424 Text en © 2016 Nicolas et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Nicolas, Gaëlle Robinson, Timothy P. Wint, G. R. William Conchedda, Giulia Cinardi, Giuseppina Gilbert, Marius Using Random Forest to Improve the Downscaling of Global Livestock Census Data |
title | Using Random Forest to Improve the Downscaling of Global Livestock Census Data |
title_full | Using Random Forest to Improve the Downscaling of Global Livestock Census Data |
title_fullStr | Using Random Forest to Improve the Downscaling of Global Livestock Census Data |
title_full_unstemmed | Using Random Forest to Improve the Downscaling of Global Livestock Census Data |
title_short | Using Random Forest to Improve the Downscaling of Global Livestock Census Data |
title_sort | using random forest to improve the downscaling of global livestock census data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792414/ https://www.ncbi.nlm.nih.gov/pubmed/26977807 http://dx.doi.org/10.1371/journal.pone.0150424 |
work_keys_str_mv | AT nicolasgaelle usingrandomforesttoimprovethedownscalingofgloballivestockcensusdata AT robinsontimothyp usingrandomforesttoimprovethedownscalingofgloballivestockcensusdata AT wintgrwilliam usingrandomforesttoimprovethedownscalingofgloballivestockcensusdata AT concheddagiulia usingrandomforesttoimprovethedownscalingofgloballivestockcensusdata AT cinardigiuseppina usingrandomforesttoimprovethedownscalingofgloballivestockcensusdata AT gilbertmarius usingrandomforesttoimprovethedownscalingofgloballivestockcensusdata |