Cargando…

Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States

The United States NRCS has a soil database that has data collected from across the country over the last several decades. This also includes soil spectral scans. This data is available, but it may not be used to its full potential. For this study, pedon, horizon and spectral data was extracted from...

Descripción completa

Detalles Bibliográficos
Autores principales: Clingensmith, Christopher M., Grunwald, Sabine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9100618/
https://www.ncbi.nlm.nih.gov/pubmed/35590876
http://dx.doi.org/10.3390/s22093187
_version_ 1784706889554067456
author Clingensmith, Christopher M.
Grunwald, Sabine
author_facet Clingensmith, Christopher M.
Grunwald, Sabine
author_sort Clingensmith, Christopher M.
collection PubMed
description The United States NRCS has a soil database that has data collected from across the country over the last several decades. This also includes soil spectral scans. This data is available, but it may not be used to its full potential. For this study, pedon, horizon and spectral data was extracted from the database for samples collected from 2011 to 2015. Only sites that had been fully described and horizons that had been analyzed for the full suite of desired properties were used. This resulted in over 14,000 samples that were used for modeling and eight soil properties: soil organic carbon (SOC); total nitrogen (TN); total sulfur (TS); clay; sand; exchangeable calcium (Ca(ex)); cation exchange capacity (CEC); and pH. Four chemometric methods were employed for soil property prediction: partial least squares (PLSR); Random Forest (RF); Cubist; and multivariable adaptive regression splines (MARS). The dataset was sufficiently large that only random subsetting was used to create calibration (70%) and validation (30%) sets. SOC, TN, and TS had the strongest prediction results, with an R(2) value of over 0.9. Ca(ex), CEC and pH were predicted moderately well. Clay and sand models had slightly lower performance. Of the four methods, Cubist produced the strongest models, while PLSR produced the weakest. This may be due to the complex relationships between soil properties and spectra that PLSR could not capture. The only drawback of Cubist is the difficult interpretability of variable importance. Future research should include the use of environmental variables to improve prediction results. Future work may also avoid the use of PLSR when dealing with large datasets that cover large areas and have high degrees of variability.
format Online
Article
Text
id pubmed-9100618
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91006182022-05-14 Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States Clingensmith, Christopher M. Grunwald, Sabine Sensors (Basel) Article The United States NRCS has a soil database that has data collected from across the country over the last several decades. This also includes soil spectral scans. This data is available, but it may not be used to its full potential. For this study, pedon, horizon and spectral data was extracted from the database for samples collected from 2011 to 2015. Only sites that had been fully described and horizons that had been analyzed for the full suite of desired properties were used. This resulted in over 14,000 samples that were used for modeling and eight soil properties: soil organic carbon (SOC); total nitrogen (TN); total sulfur (TS); clay; sand; exchangeable calcium (Ca(ex)); cation exchange capacity (CEC); and pH. Four chemometric methods were employed for soil property prediction: partial least squares (PLSR); Random Forest (RF); Cubist; and multivariable adaptive regression splines (MARS). The dataset was sufficiently large that only random subsetting was used to create calibration (70%) and validation (30%) sets. SOC, TN, and TS had the strongest prediction results, with an R(2) value of over 0.9. Ca(ex), CEC and pH were predicted moderately well. Clay and sand models had slightly lower performance. Of the four methods, Cubist produced the strongest models, while PLSR produced the weakest. This may be due to the complex relationships between soil properties and spectra that PLSR could not capture. The only drawback of Cubist is the difficult interpretability of variable importance. Future research should include the use of environmental variables to improve prediction results. Future work may also avoid the use of PLSR when dealing with large datasets that cover large areas and have high degrees of variability. MDPI 2022-04-21 /pmc/articles/PMC9100618/ /pubmed/35590876 http://dx.doi.org/10.3390/s22093187 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Clingensmith, Christopher M.
Grunwald, Sabine
Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States
title Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States
title_full Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States
title_fullStr Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States
title_full_unstemmed Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States
title_short Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States
title_sort predicting soil properties and interpreting vis-nir models from across continental united states
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9100618/
https://www.ncbi.nlm.nih.gov/pubmed/35590876
http://dx.doi.org/10.3390/s22093187
work_keys_str_mv AT clingensmithchristopherm predictingsoilpropertiesandinterpretingvisnirmodelsfromacrosscontinentalunitedstates
AT grunwaldsabine predictingsoilpropertiesandinterpretingvisnirmodelsfromacrosscontinentalunitedstates