Cargando…

Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States

Soil organic carbon (SOC) is a key component of the global carbon cycle, yet it is not well-represented in Earth system models to accurately predict global carbon dynamics in response to climate change. This novel study integrated deep learning, data assimilation, 25,444 vertical soil profiles, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Tao, Feng, Zhou, Zhenghu, Huang, Yuanyuan, Li, Qianyu, Lu, Xingjie, Ma, Shuang, Huang, Xiaomeng, Liang, Yishuang, Hugelius, Gustaf, Jiang, Lifen, Doughty, Russell, Ren, Zhehao, Luo, Yiqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931903/
https://www.ncbi.nlm.nih.gov/pubmed/33693391
http://dx.doi.org/10.3389/fdata.2020.00017
_version_ 1783660379190067200
author Tao, Feng
Zhou, Zhenghu
Huang, Yuanyuan
Li, Qianyu
Lu, Xingjie
Ma, Shuang
Huang, Xiaomeng
Liang, Yishuang
Hugelius, Gustaf
Jiang, Lifen
Doughty, Russell
Ren, Zhehao
Luo, Yiqi
author_facet Tao, Feng
Zhou, Zhenghu
Huang, Yuanyuan
Li, Qianyu
Lu, Xingjie
Ma, Shuang
Huang, Xiaomeng
Liang, Yishuang
Hugelius, Gustaf
Jiang, Lifen
Doughty, Russell
Ren, Zhehao
Luo, Yiqi
author_sort Tao, Feng
collection PubMed
description Soil organic carbon (SOC) is a key component of the global carbon cycle, yet it is not well-represented in Earth system models to accurately predict global carbon dynamics in response to climate change. This novel study integrated deep learning, data assimilation, 25,444 vertical soil profiles, and the Community Land Model version 5 (CLM5) to optimize the model representation of SOC over the conterminous United States. We firstly constrained parameters in CLM5 using observations of vertical profiles of SOC in both a batch mode (using all individual soil layers in one batch) and at individual sites (site-by-site). The estimated parameter values from the site-by-site data assimilation were then either randomly sampled (random-sampling) to generate continentally homogeneous (constant) parameter values or maximally preserved for their spatially heterogeneous distributions (varying parameter values to match the spatial patterns from the site-by-site data assimilation) so as to optimize spatial representation of SOC in CLM5 through a deep learning technique (neural networking) over the conterminous United States. Comparing modeled spatial distributions of SOC by CLM5 to observations yielded increasing predictive accuracy from default CLM5 settings (R(2) = 0.32) to randomly sampled (0.36), one-batch estimated (0.43), and deep learning optimized (0.62) parameter values. While CLM5 with parameter values derived from random-sampling and one-batch methods substantially corrected the overestimated SOC storage by that with default model parameters, there were still considerable geographical biases. CLM5 with the spatially heterogeneous parameter values optimized from the neural networking method had the least estimation error and less geographical biases across the conterminous United States. Our study indicated that deep learning in combination with data assimilation can significantly improve the representation of SOC by complex land biogeochemical models.
format Online
Article
Text
id pubmed-7931903
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79319032021-03-09 Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States Tao, Feng Zhou, Zhenghu Huang, Yuanyuan Li, Qianyu Lu, Xingjie Ma, Shuang Huang, Xiaomeng Liang, Yishuang Hugelius, Gustaf Jiang, Lifen Doughty, Russell Ren, Zhehao Luo, Yiqi Front Big Data Big Data Soil organic carbon (SOC) is a key component of the global carbon cycle, yet it is not well-represented in Earth system models to accurately predict global carbon dynamics in response to climate change. This novel study integrated deep learning, data assimilation, 25,444 vertical soil profiles, and the Community Land Model version 5 (CLM5) to optimize the model representation of SOC over the conterminous United States. We firstly constrained parameters in CLM5 using observations of vertical profiles of SOC in both a batch mode (using all individual soil layers in one batch) and at individual sites (site-by-site). The estimated parameter values from the site-by-site data assimilation were then either randomly sampled (random-sampling) to generate continentally homogeneous (constant) parameter values or maximally preserved for their spatially heterogeneous distributions (varying parameter values to match the spatial patterns from the site-by-site data assimilation) so as to optimize spatial representation of SOC in CLM5 through a deep learning technique (neural networking) over the conterminous United States. Comparing modeled spatial distributions of SOC by CLM5 to observations yielded increasing predictive accuracy from default CLM5 settings (R(2) = 0.32) to randomly sampled (0.36), one-batch estimated (0.43), and deep learning optimized (0.62) parameter values. While CLM5 with parameter values derived from random-sampling and one-batch methods substantially corrected the overestimated SOC storage by that with default model parameters, there were still considerable geographical biases. CLM5 with the spatially heterogeneous parameter values optimized from the neural networking method had the least estimation error and less geographical biases across the conterminous United States. Our study indicated that deep learning in combination with data assimilation can significantly improve the representation of SOC by complex land biogeochemical models. Frontiers Media S.A. 2020-06-03 /pmc/articles/PMC7931903/ /pubmed/33693391 http://dx.doi.org/10.3389/fdata.2020.00017 Text en Copyright © 2020 Tao, Zhou, Huang, Li, Lu, Ma, Huang, Liang, Hugelius, Jiang, Doughty, Ren and Luo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Tao, Feng
Zhou, Zhenghu
Huang, Yuanyuan
Li, Qianyu
Lu, Xingjie
Ma, Shuang
Huang, Xiaomeng
Liang, Yishuang
Hugelius, Gustaf
Jiang, Lifen
Doughty, Russell
Ren, Zhehao
Luo, Yiqi
Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States
title Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States
title_full Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States
title_fullStr Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States
title_full_unstemmed Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States
title_short Deep Learning Optimizes Data-Driven Representation of Soil Organic Carbon in Earth System Model Over the Conterminous United States
title_sort deep learning optimizes data-driven representation of soil organic carbon in earth system model over the conterminous united states
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931903/
https://www.ncbi.nlm.nih.gov/pubmed/33693391
http://dx.doi.org/10.3389/fdata.2020.00017
work_keys_str_mv AT taofeng deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT zhouzhenghu deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT huangyuanyuan deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT liqianyu deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT luxingjie deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT mashuang deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT huangxiaomeng deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT liangyishuang deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT hugeliusgustaf deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT jianglifen deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT doughtyrussell deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT renzhehao deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates
AT luoyiqi deeplearningoptimizesdatadrivenrepresentationofsoilorganiccarboninearthsystemmodelovertheconterminousunitedstates