Cargando…

Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping

Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns u...

Descripción completa

Detalles Bibliográficos
Autores principales: Macintyre, Paul D., Van Niekerk, Adriaan, Dobrowolski, Mark P., Tsakalos, James L., Mucina, Ladislav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6053567/
https://www.ncbi.nlm.nih.gov/pubmed/30038769
http://dx.doi.org/10.1002/ece3.4176
_version_ 1783340851034849280
author Macintyre, Paul D.
Van Niekerk, Adriaan
Dobrowolski, Mark P.
Tsakalos, James L.
Mucina, Ladislav
author_facet Macintyre, Paul D.
Van Niekerk, Adriaan
Dobrowolski, Mark P.
Tsakalos, James L.
Mucina, Ladislav
author_sort Macintyre, Paul D.
collection PubMed
description Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions.
format Online
Article
Text
id pubmed-6053567
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-60535672018-07-23 Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping Macintyre, Paul D. Van Niekerk, Adriaan Dobrowolski, Mark P. Tsakalos, James L. Mucina, Ladislav Ecol Evol Original Research Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions. John Wiley and Sons Inc. 2018-06-11 /pmc/articles/PMC6053567/ /pubmed/30038769 http://dx.doi.org/10.1002/ece3.4176 Text en © 2018 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Macintyre, Paul D.
Van Niekerk, Adriaan
Dobrowolski, Mark P.
Tsakalos, James L.
Mucina, Ladislav
Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_full Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_fullStr Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_full_unstemmed Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_short Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_sort impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6053567/
https://www.ncbi.nlm.nih.gov/pubmed/30038769
http://dx.doi.org/10.1002/ece3.4176
work_keys_str_mv AT macintyrepauld impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping
AT vanniekerkadriaan impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping
AT dobrowolskimarkp impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping
AT tsakalosjamesl impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping
AT mucinaladislav impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping