
Knowledge-guided machine learning reveals pivotal drivers for gas-to-particle conversion of atmospheric nitrate

Bibliographic Details
Main authors: Xu, Bo; Yu, Haofei; Shi, Zongbo; Liu, Jinxing; Wei, Yuting; Zhang, Zhongcheng; Huangfu, Yanqi; Xu, Han; Li, Yue; Zhang, Linlin; Feng, Yinchang; Shi, Guoliang
Format: Online Article Text
Language: English
Published: Elsevier 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10661687/
https://www.ncbi.nlm.nih.gov/pubmed/38021366
http://dx.doi.org/10.1016/j.ese.2023.100333
Description
Summary: Particulate nitrate, a key component of fine particles, forms through the intricate gas-to-particle conversion process. This process is regulated by the gas-to-particle conversion coefficient of nitrate, ε(NO₃⁻). The relationship between ε(NO₃⁻) and its drivers is highly complex and nonlinear, and can be characterized by machine learning methods. However, conventional machine learning often yields results that lack clear physical meaning and may even contradict established physical and chemical mechanisms because of the influence of ambient factors. An alternative approach is urgently needed, one that offers transparent physical interpretations and provides deeper insights into the impact of ε(NO₃⁻). Here we introduce a supervised machine learning approach: the multilevel nested random forest guided by theory approaches. Our approach robustly identifies NH₄⁺, SO₄²⁻, and temperature as pivotal drivers of ε(NO₃⁻). Notably, substantial disparities exist between the outcomes of traditional random forest analysis and the anticipated actual results. Furthermore, our approach underscores the significance of NH₄⁺ during both daytime (30%) and nighttime (40%) periods, while appropriately downplaying the influence of some less relevant drivers relative to conventional random forest analysis. This research underscores the transformative potential of integrating domain knowledge with machine learning in atmospheric studies.