Cargando…

Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning

Various research topics from the field of soil science or agriculture require digital maps of soil properties as input data. Such maps can be achieved by digital soil mapping (DSM) techniques which have developed consistently during the last decades. Our research focuses on the application of geosta...

Descripción completa

Detalles Bibliográficos
Autores principales: Patriche, Cristian Valeriu, Roşca, Bogdan, Pîrnău, Radu Gabriel, Vasiliniuc, Ionuţ
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446225/
https://www.ncbi.nlm.nih.gov/pubmed/37611038
http://dx.doi.org/10.1371/journal.pone.0289286
_version_ 1785094358682304512
author Patriche, Cristian Valeriu
Roşca, Bogdan
Pîrnău, Radu Gabriel
Vasiliniuc, Ionuţ
author_facet Patriche, Cristian Valeriu
Roşca, Bogdan
Pîrnău, Radu Gabriel
Vasiliniuc, Ionuţ
author_sort Patriche, Cristian Valeriu
collection PubMed
description Various research topics from the field of soil science or agriculture require digital maps of soil properties as input data. Such maps can be achieved by digital soil mapping (DSM) techniques which have developed consistently during the last decades. Our research focuses on the application of geostatistical methods (including ordinary kriging, regression-kriging and geographically weighted regression) and machine learning algorithms to produce high resolution digital maps of topsoil properties in Romania. Six continuous predictors were considered in our study (digital elevation model, topographic wetness index, normalized difference vegetation index, slope, latitude and longitude). A tolerance test was performed to ensure that all predictors can be used for the purpose of digital soil mapping. The input soil data was extracted from the LUCAS database and includes 7 chemical properties (pH, electrical conductivity, calcium carbonate, organic carbon, N, P, K) and the particle-size fractions (sand, silt, clay). The spatial autocorrelation is higher for pH, organic carbon and calcium carbonate, as indicated by the partial sill / nugget ratio of semivariograms, meaning that these properties are more predictable than the others by kriging interpolation. The optimal DSM method was selected by independent sample validation, using resampled statistics from 100 samples randomly extracted from the validation dataset. Also, an additional independent sample of soil profiles, comprising legacy soil data, and the 200k Romania soil map were used for a supplementary validation. The results show that machine learning and regression-kriging are the optimal methods in most cases. Among the machine learning tested algorithms, the best performance is associated with Support Vector Machines and Random Forests methods. The geographically weighted regression is also among the optimum methods for pH and calcium carbonates spatial prediction. Good predictions were achieved for pH (R(2) of 0.417–0.469, depending on the method), organic carbon (R(2) of 0.302–0.443), calcium carbonates (R(2) of 0.300–0.330) and moderate predictions for electric conductivity, total nitrogen, silt and sand (R(2) of 0.155–0.331), while the lowest prediction characterizes the phosphorous content (R(2) of 0.015–0.044). LUCAS proved to be a reliable and useful soil database and the achieved spatial distributions of soil properties can be further used for national and regional soil studies.
format Online
Article
Text
id pubmed-10446225
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104462252023-08-24 Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning Patriche, Cristian Valeriu Roşca, Bogdan Pîrnău, Radu Gabriel Vasiliniuc, Ionuţ PLoS One Research Article Various research topics from the field of soil science or agriculture require digital maps of soil properties as input data. Such maps can be achieved by digital soil mapping (DSM) techniques which have developed consistently during the last decades. Our research focuses on the application of geostatistical methods (including ordinary kriging, regression-kriging and geographically weighted regression) and machine learning algorithms to produce high resolution digital maps of topsoil properties in Romania. Six continuous predictors were considered in our study (digital elevation model, topographic wetness index, normalized difference vegetation index, slope, latitude and longitude). A tolerance test was performed to ensure that all predictors can be used for the purpose of digital soil mapping. The input soil data was extracted from the LUCAS database and includes 7 chemical properties (pH, electrical conductivity, calcium carbonate, organic carbon, N, P, K) and the particle-size fractions (sand, silt, clay). The spatial autocorrelation is higher for pH, organic carbon and calcium carbonate, as indicated by the partial sill / nugget ratio of semivariograms, meaning that these properties are more predictable than the others by kriging interpolation. The optimal DSM method was selected by independent sample validation, using resampled statistics from 100 samples randomly extracted from the validation dataset. Also, an additional independent sample of soil profiles, comprising legacy soil data, and the 200k Romania soil map were used for a supplementary validation. The results show that machine learning and regression-kriging are the optimal methods in most cases. Among the machine learning tested algorithms, the best performance is associated with Support Vector Machines and Random Forests methods. The geographically weighted regression is also among the optimum methods for pH and calcium carbonates spatial prediction. Good predictions were achieved for pH (R(2) of 0.417–0.469, depending on the method), organic carbon (R(2) of 0.302–0.443), calcium carbonates (R(2) of 0.300–0.330) and moderate predictions for electric conductivity, total nitrogen, silt and sand (R(2) of 0.155–0.331), while the lowest prediction characterizes the phosphorous content (R(2) of 0.015–0.044). LUCAS proved to be a reliable and useful soil database and the achieved spatial distributions of soil properties can be further used for national and regional soil studies. Public Library of Science 2023-08-23 /pmc/articles/PMC10446225/ /pubmed/37611038 http://dx.doi.org/10.1371/journal.pone.0289286 Text en © 2023 Patriche et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Patriche, Cristian Valeriu
Roşca, Bogdan
Pîrnău, Radu Gabriel
Vasiliniuc, Ionuţ
Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning
title Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning
title_full Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning
title_fullStr Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning
title_full_unstemmed Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning
title_short Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning
title_sort spatial modelling of topsoil properties in romania using geostatistical methods and machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446225/
https://www.ncbi.nlm.nih.gov/pubmed/37611038
http://dx.doi.org/10.1371/journal.pone.0289286
work_keys_str_mv AT patrichecristianvaleriu spatialmodellingoftopsoilpropertiesinromaniausinggeostatisticalmethodsandmachinelearning
AT roscabogdan spatialmodellingoftopsoilpropertiesinromaniausinggeostatisticalmethodsandmachinelearning
AT pirnauradugabriel spatialmodellingoftopsoilpropertiesinromaniausinggeostatisticalmethodsandmachinelearning
AT vasiliniucionut spatialmodellingoftopsoilpropertiesinromaniausinggeostatisticalmethodsandmachinelearning