Cargando…
A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling
Plant species diversity (PSD) has always been an essential component of biodiversity and plays an important role in ecosystem functions and services. However, it is still a huge challenge to simulate the spatial distribution of PSD due to the difficulties of data acquisition and unsatisfactory perfo...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8873049/ https://www.ncbi.nlm.nih.gov/pubmed/34674121 http://dx.doi.org/10.1007/s11356-021-16973-x |
_version_ | 1784657378675785728 |
---|---|
author | Zhao, Yapeng Yin, Xiaozhe Fu, Yan Yue, Tianxiang |
author_facet | Zhao, Yapeng Yin, Xiaozhe Fu, Yan Yue, Tianxiang |
author_sort | Zhao, Yapeng |
collection | PubMed |
description | Plant species diversity (PSD) has always been an essential component of biodiversity and plays an important role in ecosystem functions and services. However, it is still a huge challenge to simulate the spatial distribution of PSD due to the difficulties of data acquisition and unsatisfactory performance of predicting algorithms over large areas. A surge in the number of remote sensing imagery, along with the great success of machine learning, opens new opportunities for the mapping of PSD. Therefore, different machine learning algorithms combined with high-accuracy surface modeling (HASM) were firstly proposed to predict the PSD in the Xinghai, northeastern Qinghai-Tibetan Plateau, China. Spectral reflectance and vegetation indices, generated from Landsat 8 images, and environmental variables were taken as the potential explanatory factors of machine learning models including least absolute shrinkage and selection operator (Lasso), ridge regression (Ridge), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF). The prediction generated from these machine learning methods and in situ observation data were integrated by using HASM for the high-accuracy mapping of PSD including three species diversity indices. The results showed that PSD was closely associated with vegetation indices, followed by spectral reflectance and environmental factors. XGBoost combined with HASM (HASM-XGBoost) showed the best performance with the lowest MAE and RMSE. Our results suggested that the fusion of heterogeneous data and the ensemble of heterogeneous models may revolutionize our ability to predict the PSD over large areas, especially in some places limited by sparse field samples. |
format | Online Article Text |
id | pubmed-8873049 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-88730492022-03-02 A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling Zhao, Yapeng Yin, Xiaozhe Fu, Yan Yue, Tianxiang Environ Sci Pollut Res Int Research Article Plant species diversity (PSD) has always been an essential component of biodiversity and plays an important role in ecosystem functions and services. However, it is still a huge challenge to simulate the spatial distribution of PSD due to the difficulties of data acquisition and unsatisfactory performance of predicting algorithms over large areas. A surge in the number of remote sensing imagery, along with the great success of machine learning, opens new opportunities for the mapping of PSD. Therefore, different machine learning algorithms combined with high-accuracy surface modeling (HASM) were firstly proposed to predict the PSD in the Xinghai, northeastern Qinghai-Tibetan Plateau, China. Spectral reflectance and vegetation indices, generated from Landsat 8 images, and environmental variables were taken as the potential explanatory factors of machine learning models including least absolute shrinkage and selection operator (Lasso), ridge regression (Ridge), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF). The prediction generated from these machine learning methods and in situ observation data were integrated by using HASM for the high-accuracy mapping of PSD including three species diversity indices. The results showed that PSD was closely associated with vegetation indices, followed by spectral reflectance and environmental factors. XGBoost combined with HASM (HASM-XGBoost) showed the best performance with the lowest MAE and RMSE. Our results suggested that the fusion of heterogeneous data and the ensemble of heterogeneous models may revolutionize our ability to predict the PSD over large areas, especially in some places limited by sparse field samples. Springer Berlin Heidelberg 2021-10-21 2022 /pmc/articles/PMC8873049/ /pubmed/34674121 http://dx.doi.org/10.1007/s11356-021-16973-x Text en © This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Article Zhao, Yapeng Yin, Xiaozhe Fu, Yan Yue, Tianxiang A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling |
title | A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling |
title_full | A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling |
title_fullStr | A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling |
title_full_unstemmed | A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling |
title_short | A comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling |
title_sort | comparative mapping of plant species diversity using ensemble learning algorithms combined with high accuracy surface modeling |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8873049/ https://www.ncbi.nlm.nih.gov/pubmed/34674121 http://dx.doi.org/10.1007/s11356-021-16973-x |
work_keys_str_mv | AT zhaoyapeng acomparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling AT yinxiaozhe acomparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling AT fuyan acomparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling AT yuetianxiang acomparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling AT zhaoyapeng comparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling AT yinxiaozhe comparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling AT fuyan comparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling AT yuetianxiang comparativemappingofplantspeciesdiversityusingensemblelearningalgorithmscombinedwithhighaccuracysurfacemodeling |