
Machine learning based estimation of field-scale daily, high resolution, multi-depth soil moisture for the Western and Midwestern United States

Bibliographic Details
Main authors: Xia, Yushu, Watts, Jennifer D., Machmuller, Megan B., Sanderman, Jonathan
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2022
Subjects: Agricultural Science
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9639422/
https://www.ncbi.nlm.nih.gov/pubmed/36353602
http://dx.doi.org/10.7717/peerj.14275
author Xia, Yushu
Watts, Jennifer D.
Machmuller, Megan B.
Sanderman, Jonathan
collection PubMed
description BACKGROUND: High-resolution soil moisture estimates are critical for planning water management and assessing environmental quality. In-situ measurements alone are too costly to support the spatial and temporal resolutions needed for water management. Recent efforts have combined calibration data with machine learning algorithms to fill the gap where high-resolution moisture estimates are lacking at the field scale. This study aimed to provide calibrated soil moisture models and a methodology for generating gridded estimates of soil moisture at multiple depths, according to user-defined temporal periods, spatial resolution, and extent.
METHODS: We applied nearly one million national library soil moisture records from over 100 sites, spanning the U.S. Midwest and West, to build Quantile Random Forest (QRF) calibration models. The QRF models were built on covariates including soil moisture estimates from the North American Land Data Assimilation System (NLDAS), soil properties, climate variables, digital elevation models, and remote sensing-derived indices. We also explored an alternative approach that adopted a regionalized calibration dataset for the Western U.S. The broad-scale QRF models were independently validated by sampling depth, land cover type, and observation period. We then explored how model performance improved when local samples were used for spiking. Finally, the QRF models were applied to estimate soil moisture at the field scale, where the estimated temporal and spatial patterns were evaluated.
RESULTS: The broad-scale QRF model showed moderate performance (R² = 0.53, RMSE = 0.078 m³/m³) when data points from all depth layers (up to 100 cm) were considered in an independent validation. Elevation, NLDAS-derived moisture, soil properties, and sampling depth were ranked as the most important covariates. The best model performance was observed for forest and pasture sites (R² > 0.5; RMSE < 0.09 m³/m³), followed by grassland and cropland (R² > 0.4; RMSE < 0.11 m³/m³). Model performance decreased with sampling depth and was slightly lower during the winter months. Spiking the national QRF model with local samples improved model performance, reducing the RMSE to less than 0.05 m³/m³ for grassland sites. At the field scale, model estimates captured temporal trends more accurately for surface than for subsurface soil layers. Model-estimated spatial patterns need to be further improved and validated with management data.
CONCLUSIONS: The model accuracy for the top 0–20 cm of soil (R² > 0.5, RMSE < 0.08 m³/m³) showed promise for adopting the methodology for soil moisture monitoring. The success of spiking the national model with local samples showed the need to collect multi-year, high-frequency (e.g., hourly) sensor-based field measurements to improve estimates of soil moisture over a longer time period. Future work should improve model performance at greater depths with additional hydraulic properties and the use of locally selected calibration datasets.
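Note on the reported metrics: the abstract summarizes validation with R² and RMSE. The sketch below shows one common way to compute both from paired observed and predicted values. It assumes R² means the coefficient of determination (1 - SS_res / SS_tot); the record does not say which definition the authors used. The function names and the sample values are hypothetical and are not taken from the study.

```python
import math


def rmse(observed, predicted):
    """Root mean squared error, in the same units as the inputs (e.g. m³/m³)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    if ss_tot == 0:
        raise ValueError("observed values have zero variance; R² is undefined")
    return 1.0 - ss_res / ss_tot


# Made-up volumetric water contents (m³/m³), for illustration only.
observed = [0.10, 0.20, 0.30, 0.40]
predicted = [0.12, 0.18, 0.33, 0.37]

print(f"RMSE = {rmse(observed, predicted):.3f} m³/m³")
print(f"R²   = {r_squared(observed, predicted):.3f}")
```

Under this definition, an RMSE of 0.078 m³/m³ means predictions deviate from sensor readings by roughly 0.08 m³/m³ in a root-mean-square sense, and R² = 0.53 means the model accounts for about half of the variance in the held-out observations.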
format Online
Article
Text
id pubmed-9639422
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-9639422 2022-11-08 PeerJ Agricultural Science PeerJ Inc. 2022-11-04 /pmc/articles/PMC9639422/ /pubmed/36353602 http://dx.doi.org/10.7717/peerj.14275 Text en © 2022 Xia et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
title Machine learning based estimation of field-scale daily, high resolution, multi-depth soil moisture for the Western and Midwestern United States
topic Agricultural Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9639422/
https://www.ncbi.nlm.nih.gov/pubmed/36353602
http://dx.doi.org/10.7717/peerj.14275