Cargando…

Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules

Low-volatile organic compounds (LVOCs) drive key atmospheric processes, such as new particle formation (NPF) and growth. Machine learning tools can accelerate studies of these phenomena, but extensive and versatile LVOC datasets relevant for the atmospheric research community are lacking. We present...

Descripción completa

Detalles Bibliográficos
Autores principales: Besel, Vitus, Todorović, Milica, Kurtén, Theo, Rinke, Patrick, Vehkamäki, Hanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10338534/
https://www.ncbi.nlm.nih.gov/pubmed/37438370
http://dx.doi.org/10.1038/s41597-023-02366-x
_version_ 1785071648686211072
author Besel, Vitus
Todorović, Milica
Kurtén, Theo
Rinke, Patrick
Vehkamäki, Hanna
author_facet Besel, Vitus
Todorović, Milica
Kurtén, Theo
Rinke, Patrick
Vehkamäki, Hanna
author_sort Besel, Vitus
collection PubMed
description Low-volatile organic compounds (LVOCs) drive key atmospheric processes, such as new particle formation (NPF) and growth. Machine learning tools can accelerate studies of these phenomena, but extensive and versatile LVOC datasets relevant for the atmospheric research community are lacking. We present the GeckoQ dataset with atomic structures of 31,637 atmospherically relevant molecules resulting from the oxidation of α-pinene, toluene and decane. For each molecule, we performed comprehensive conformer sampling with the COSMOconf program and calculated thermodynamic properties with density functional theory (DFT) using the Conductor-like Screening Model (COSMO). Our dataset contains the geometries of the 7 Mio. conformers we found and their corresponding structural and thermodynamic properties, including saturation vapor pressures (p(Sat)), chemical potentials and free energies. The p(Sat) were compared to values calculated with the group contribution method SIMPOL. To validate the dataset, we explored the relationship between structural and thermodynamic properties, and then demonstrated a first machine-learning application with Gaussian process regression.
format Online
Article
Text
id pubmed-10338534
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103385342023-07-14 Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules Besel, Vitus Todorović, Milica Kurtén, Theo Rinke, Patrick Vehkamäki, Hanna Sci Data Data Descriptor Low-volatile organic compounds (LVOCs) drive key atmospheric processes, such as new particle formation (NPF) and growth. Machine learning tools can accelerate studies of these phenomena, but extensive and versatile LVOC datasets relevant for the atmospheric research community are lacking. We present the GeckoQ dataset with atomic structures of 31,637 atmospherically relevant molecules resulting from the oxidation of α-pinene, toluene and decane. For each molecule, we performed comprehensive conformer sampling with the COSMOconf program and calculated thermodynamic properties with density functional theory (DFT) using the Conductor-like Screening Model (COSMO). Our dataset contains the geometries of the 7 Mio. conformers we found and their corresponding structural and thermodynamic properties, including saturation vapor pressures (p(Sat)), chemical potentials and free energies. The p(Sat) were compared to values calculated with the group contribution method SIMPOL. To validate the dataset, we explored the relationship between structural and thermodynamic properties, and then demonstrated a first machine-learning application with Gaussian process regression. Nature Publishing Group UK 2023-07-12 /pmc/articles/PMC10338534/ /pubmed/37438370 http://dx.doi.org/10.1038/s41597-023-02366-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Besel, Vitus
Todorović, Milica
Kurtén, Theo
Rinke, Patrick
Vehkamäki, Hanna
Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules
title Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules
title_full Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules
title_fullStr Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules
title_full_unstemmed Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules
title_short Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules
title_sort atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10338534/
https://www.ncbi.nlm.nih.gov/pubmed/37438370
http://dx.doi.org/10.1038/s41597-023-02366-x
work_keys_str_mv AT beselvitus atomicstructuresconformersandthermodynamicpropertiesof32katmosphericmolecules
AT todorovicmilica atomicstructuresconformersandthermodynamicpropertiesof32katmosphericmolecules
AT kurtentheo atomicstructuresconformersandthermodynamicpropertiesof32katmosphericmolecules
AT rinkepatrick atomicstructuresconformersandthermodynamicpropertiesof32katmosphericmolecules
AT vehkamakihanna atomicstructuresconformersandthermodynamicpropertiesof32katmosphericmolecules