Cargando…
Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules
We present two open-source datasets that provide time-dependent density-functional tight-binding (TD-DFTB) electronic excitation spectra of organic molecules. These datasets represent predictions of UV-vis absorption spectra performed on optimized geometries of the molecules in their electronic grou...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10442335/ https://www.ncbi.nlm.nih.gov/pubmed/37604820 http://dx.doi.org/10.1038/s41597-023-02408-4 |
_version_ | 1785093571178659840 |
---|---|
author | Lupo Pasini, Massimiliano Mehta, Kshitij Yoo, Pilsun Irle, Stephan |
author_facet | Lupo Pasini, Massimiliano Mehta, Kshitij Yoo, Pilsun Irle, Stephan |
author_sort | Lupo Pasini, Massimiliano |
collection | PubMed |
description | We present two open-source datasets that provide time-dependent density-functional tight-binding (TD-DFTB) electronic excitation spectra of organic molecules. These datasets represent predictions of UV-vis absorption spectra performed on optimized geometries of the molecules in their electronic ground state. The GDB-9-Ex dataset contains a subset of 96,766 organic molecules from the original open-source GDB-9 dataset. The ORNL_AISD-Ex dataset consists of 10,502,904 organic molecules that contain between 5 and 71 non-hydrogen atoms. The data reveals the close correlation between the magnitude of the gaps between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO), and the excitation energy of the lowest singlet excited state energies quantitatively. The chemical variability of the large number of molecules was examined with a topological fingerprint estimation based on extended-connectivity fingerprints (ECFPs) followed by uniform manifold approximation and projection (UMAP) for dimension reduction. Both datasets were generated using the DFTB+ software on the “Andes” cluster of the Oak Ridge Leadership Computing Facility (OLCF). |
format | Online Article Text |
id | pubmed-10442335 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-104423352023-08-23 Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules Lupo Pasini, Massimiliano Mehta, Kshitij Yoo, Pilsun Irle, Stephan Sci Data Data Descriptor We present two open-source datasets that provide time-dependent density-functional tight-binding (TD-DFTB) electronic excitation spectra of organic molecules. These datasets represent predictions of UV-vis absorption spectra performed on optimized geometries of the molecules in their electronic ground state. The GDB-9-Ex dataset contains a subset of 96,766 organic molecules from the original open-source GDB-9 dataset. The ORNL_AISD-Ex dataset consists of 10,502,904 organic molecules that contain between 5 and 71 non-hydrogen atoms. The data reveals the close correlation between the magnitude of the gaps between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO), and the excitation energy of the lowest singlet excited state energies quantitatively. The chemical variability of the large number of molecules was examined with a topological fingerprint estimation based on extended-connectivity fingerprints (ECFPs) followed by uniform manifold approximation and projection (UMAP) for dimension reduction. Both datasets were generated using the DFTB+ software on the “Andes” cluster of the Oak Ridge Leadership Computing Facility (OLCF). Nature Publishing Group UK 2023-08-21 /pmc/articles/PMC10442335/ /pubmed/37604820 http://dx.doi.org/10.1038/s41597-023-02408-4 Text en © UT-Battelle, LLC 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Lupo Pasini, Massimiliano Mehta, Kshitij Yoo, Pilsun Irle, Stephan Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules |
title | Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules |
title_full | Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules |
title_fullStr | Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules |
title_full_unstemmed | Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules |
title_short | Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules |
title_sort | two excited-state datasets for quantum chemical uv-vis spectra of organic molecules |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10442335/ https://www.ncbi.nlm.nih.gov/pubmed/37604820 http://dx.doi.org/10.1038/s41597-023-02408-4 |
work_keys_str_mv | AT lupopasinimassimiliano twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules AT mehtakshitij twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules AT yoopilsun twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules AT irlestephan twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules |