Cargando…
Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States
The United States NRCS has a soil database that has data collected from across the country over the last several decades. This also includes soil spectral scans. This data is available, but it may not be used to its full potential. For this study, pedon, horizon and spectral data was extracted from...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9100618/ https://www.ncbi.nlm.nih.gov/pubmed/35590876 http://dx.doi.org/10.3390/s22093187 |
_version_ | 1784706889554067456 |
---|---|
author | Clingensmith, Christopher M. Grunwald, Sabine |
author_facet | Clingensmith, Christopher M. Grunwald, Sabine |
author_sort | Clingensmith, Christopher M. |
collection | PubMed |
description | The United States NRCS has a soil database that has data collected from across the country over the last several decades. This also includes soil spectral scans. This data is available, but it may not be used to its full potential. For this study, pedon, horizon and spectral data was extracted from the database for samples collected from 2011 to 2015. Only sites that had been fully described and horizons that had been analyzed for the full suite of desired properties were used. This resulted in over 14,000 samples that were used for modeling and eight soil properties: soil organic carbon (SOC); total nitrogen (TN); total sulfur (TS); clay; sand; exchangeable calcium (Ca(ex)); cation exchange capacity (CEC); and pH. Four chemometric methods were employed for soil property prediction: partial least squares (PLSR); Random Forest (RF); Cubist; and multivariable adaptive regression splines (MARS). The dataset was sufficiently large that only random subsetting was used to create calibration (70%) and validation (30%) sets. SOC, TN, and TS had the strongest prediction results, with an R(2) value of over 0.9. Ca(ex), CEC and pH were predicted moderately well. Clay and sand models had slightly lower performance. Of the four methods, Cubist produced the strongest models, while PLSR produced the weakest. This may be due to the complex relationships between soil properties and spectra that PLSR could not capture. The only drawback of Cubist is the difficult interpretability of variable importance. Future research should include the use of environmental variables to improve prediction results. Future work may also avoid the use of PLSR when dealing with large datasets that cover large areas and have high degrees of variability. |
format | Online Article Text |
id | pubmed-9100618 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91006182022-05-14 Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States Clingensmith, Christopher M. Grunwald, Sabine Sensors (Basel) Article The United States NRCS has a soil database that has data collected from across the country over the last several decades. This also includes soil spectral scans. This data is available, but it may not be used to its full potential. For this study, pedon, horizon and spectral data was extracted from the database for samples collected from 2011 to 2015. Only sites that had been fully described and horizons that had been analyzed for the full suite of desired properties were used. This resulted in over 14,000 samples that were used for modeling and eight soil properties: soil organic carbon (SOC); total nitrogen (TN); total sulfur (TS); clay; sand; exchangeable calcium (Ca(ex)); cation exchange capacity (CEC); and pH. Four chemometric methods were employed for soil property prediction: partial least squares (PLSR); Random Forest (RF); Cubist; and multivariable adaptive regression splines (MARS). The dataset was sufficiently large that only random subsetting was used to create calibration (70%) and validation (30%) sets. SOC, TN, and TS had the strongest prediction results, with an R(2) value of over 0.9. Ca(ex), CEC and pH were predicted moderately well. Clay and sand models had slightly lower performance. Of the four methods, Cubist produced the strongest models, while PLSR produced the weakest. This may be due to the complex relationships between soil properties and spectra that PLSR could not capture. The only drawback of Cubist is the difficult interpretability of variable importance. Future research should include the use of environmental variables to improve prediction results. Future work may also avoid the use of PLSR when dealing with large datasets that cover large areas and have high degrees of variability. MDPI 2022-04-21 /pmc/articles/PMC9100618/ /pubmed/35590876 http://dx.doi.org/10.3390/s22093187 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Clingensmith, Christopher M. Grunwald, Sabine Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States |
title | Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States |
title_full | Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States |
title_fullStr | Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States |
title_full_unstemmed | Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States |
title_short | Predicting Soil Properties and Interpreting Vis-NIR Models from across Continental United States |
title_sort | predicting soil properties and interpreting vis-nir models from across continental united states |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9100618/ https://www.ncbi.nlm.nih.gov/pubmed/35590876 http://dx.doi.org/10.3390/s22093187 |
work_keys_str_mv | AT clingensmithchristopherm predictingsoilpropertiesandinterpretingvisnirmodelsfromacrosscontinentalunitedstates AT grunwaldsabine predictingsoilpropertiesandinterpretingvisnirmodelsfromacrosscontinentalunitedstates |