Cargando…

Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping

Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns u...

Descripción completa

Detalles Bibliográficos
Autores principales:	Macintyre, Paul D., Van Niekerk, Adriaan, Dobrowolski, Mark P., Tsakalos, James L., Mucina, Ladislav
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2018
Materias:	Original Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6053567/ https://www.ncbi.nlm.nih.gov/pubmed/30038769 http://dx.doi.org/10.1002/ece3.4176

_version_	1783340851034849280
author	Macintyre, Paul D. Van Niekerk, Adriaan Dobrowolski, Mark P. Tsakalos, James L. Mucina, Ladislav
author_facet	Macintyre, Paul D. Van Niekerk, Adriaan Dobrowolski, Mark P. Tsakalos, James L. Mucina, Ladislav
author_sort	Macintyre, Paul D.
collection	PubMed
description	Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions.
format	Online Article Text
id	pubmed-6053567
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-60535672018-07-23 Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping Macintyre, Paul D. Van Niekerk, Adriaan Dobrowolski, Mark P. Tsakalos, James L. Mucina, Ladislav Ecol Evol Original Research Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions. John Wiley and Sons Inc. 2018-06-11 /pmc/articles/PMC6053567/ /pubmed/30038769 http://dx.doi.org/10.1002/ece3.4176 Text en © 2018 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Research Macintyre, Paul D. Van Niekerk, Adriaan Dobrowolski, Mark P. Tsakalos, James L. Mucina, Ladislav Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title	Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_full	Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_fullStr	Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_full_unstemmed	Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_short	Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
title_sort	impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
topic	Original Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6053567/ https://www.ncbi.nlm.nih.gov/pubmed/30038769 http://dx.doi.org/10.1002/ece3.4176
work_keys_str_mv	AT macintyrepauld impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping AT vanniekerkadriaan impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping AT dobrowolskimarkp impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping AT tsakalosjamesl impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping AT mucinaladislav impactofecologicalredundancyontheperformanceofmachinelearningclassifiersinvegetationmapping

Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping

Ejemplares similares