Cargando…

Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential

Potential natural vegetation (PNV) is the vegetation cover in equilibrium with climate, that would exist at a given location if not impacted by human activities. PNV is useful for raising public awareness about land degradation and for estimating land potential. This paper presents results of assess...

Descripción completa

Detalles Bibliográficos
Autores principales: Hengl, Tomislav, Walsh, Markus G., Sanderman, Jonathan, Wheeler, Ichsani, Harrison, Sandy P., Prentice, Iain C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6109375/
https://www.ncbi.nlm.nih.gov/pubmed/30155360
http://dx.doi.org/10.7717/peerj.5457
_version_ 1783350318840414208
author Hengl, Tomislav
Walsh, Markus G.
Sanderman, Jonathan
Wheeler, Ichsani
Harrison, Sandy P.
Prentice, Iain C.
author_facet Hengl, Tomislav
Walsh, Markus G.
Sanderman, Jonathan
Wheeler, Ichsani
Harrison, Sandy P.
Prentice, Iain C.
author_sort Hengl, Tomislav
collection PubMed
description Potential natural vegetation (PNV) is the vegetation cover in equilibrium with climate, that would exist at a given location if not impacted by human activities. PNV is useful for raising public awareness about land degradation and for estimating land potential. This paper presents results of assessing machine learning algorithms—neural networks (nnet package), random forest (ranger), gradient boosting (gbm), K-nearest neighborhood (class) and Cubist—for operational mapping of PNV. Three case studies were considered: (1) global distribution of biomes based on the BIOME 6000 data set (8,057 modern pollen-based site reconstructions), (2) distribution of forest tree taxa in Europe based on detailed occurrence records (1,546,435 ground observations), and (3) global monthly fraction of absorbed photosynthetically active radiation (FAPAR) values (30,301 randomly-sampled points). A stack of 160 global maps representing biophysical conditions over land, including atmospheric, climatic, relief, and lithologic variables, were used as explanatory variables. The overall results indicate that random forest gives the overall best performance. The highest accuracy for predicting BIOME 6000 classes (20) was estimated to be between 33% (with spatial cross-validation) and 68% (simple random sub-setting), with the most important predictors being total annual precipitation, monthly temperatures, and bioclimatic layers. Predicting forest tree species (73) resulted in mapping accuracy of 25%, with the most important predictors being monthly cloud fraction, mean annual and monthly temperatures, and elevation. Regression models for FAPAR (monthly images) gave an R-square of 90% with the most important predictors being total annual precipitation, monthly cloud fraction, CHELSA bioclimatic layers, and month of the year, respectively. Further developments of PNV mapping could include using all GBIF records to map the global distribution of plant species at different taxonomic levels. This methodology could also be extended to dynamic modeling of PNV, so that future climate scenarios can be incorporated. Global maps of biomes, FAPAR and tree species at one km spatial resolution are available for download via http://dx.doi.org/10.7910/DVN/QQHCIK.
format Online
Article
Text
id pubmed-6109375
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-61093752018-08-28 Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential Hengl, Tomislav Walsh, Markus G. Sanderman, Jonathan Wheeler, Ichsani Harrison, Sandy P. Prentice, Iain C. PeerJ Biogeography Potential natural vegetation (PNV) is the vegetation cover in equilibrium with climate, that would exist at a given location if not impacted by human activities. PNV is useful for raising public awareness about land degradation and for estimating land potential. This paper presents results of assessing machine learning algorithms—neural networks (nnet package), random forest (ranger), gradient boosting (gbm), K-nearest neighborhood (class) and Cubist—for operational mapping of PNV. Three case studies were considered: (1) global distribution of biomes based on the BIOME 6000 data set (8,057 modern pollen-based site reconstructions), (2) distribution of forest tree taxa in Europe based on detailed occurrence records (1,546,435 ground observations), and (3) global monthly fraction of absorbed photosynthetically active radiation (FAPAR) values (30,301 randomly-sampled points). A stack of 160 global maps representing biophysical conditions over land, including atmospheric, climatic, relief, and lithologic variables, were used as explanatory variables. The overall results indicate that random forest gives the overall best performance. The highest accuracy for predicting BIOME 6000 classes (20) was estimated to be between 33% (with spatial cross-validation) and 68% (simple random sub-setting), with the most important predictors being total annual precipitation, monthly temperatures, and bioclimatic layers. Predicting forest tree species (73) resulted in mapping accuracy of 25%, with the most important predictors being monthly cloud fraction, mean annual and monthly temperatures, and elevation. Regression models for FAPAR (monthly images) gave an R-square of 90% with the most important predictors being total annual precipitation, monthly cloud fraction, CHELSA bioclimatic layers, and month of the year, respectively. Further developments of PNV mapping could include using all GBIF records to map the global distribution of plant species at different taxonomic levels. This methodology could also be extended to dynamic modeling of PNV, so that future climate scenarios can be incorporated. Global maps of biomes, FAPAR and tree species at one km spatial resolution are available for download via http://dx.doi.org/10.7910/DVN/QQHCIK. PeerJ Inc. 2018-08-22 /pmc/articles/PMC6109375/ /pubmed/30155360 http://dx.doi.org/10.7717/peerj.5457 Text en © 2018 Hengl et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Biogeography
Hengl, Tomislav
Walsh, Markus G.
Sanderman, Jonathan
Wheeler, Ichsani
Harrison, Sandy P.
Prentice, Iain C.
Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential
title Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential
title_full Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential
title_fullStr Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential
title_full_unstemmed Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential
title_short Global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential
title_sort global mapping of potential natural vegetation: an assessment of machine learning algorithms for estimating land potential
topic Biogeography
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6109375/
https://www.ncbi.nlm.nih.gov/pubmed/30155360
http://dx.doi.org/10.7717/peerj.5457
work_keys_str_mv AT hengltomislav globalmappingofpotentialnaturalvegetationanassessmentofmachinelearningalgorithmsforestimatinglandpotential
AT walshmarkusg globalmappingofpotentialnaturalvegetationanassessmentofmachinelearningalgorithmsforestimatinglandpotential
AT sandermanjonathan globalmappingofpotentialnaturalvegetationanassessmentofmachinelearningalgorithmsforestimatinglandpotential
AT wheelerichsani globalmappingofpotentialnaturalvegetationanassessmentofmachinelearningalgorithmsforestimatinglandpotential
AT harrisonsandyp globalmappingofpotentialnaturalvegetationanassessmentofmachinelearningalgorithmsforestimatinglandpotential
AT prenticeiainc globalmappingofpotentialnaturalvegetationanassessmentofmachinelearningalgorithmsforestimatinglandpotential