Cargando…

Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules

We present two open-source datasets that provide time-dependent density-functional tight-binding (TD-DFTB) electronic excitation spectra of organic molecules. These datasets represent predictions of UV-vis absorption spectra performed on optimized geometries of the molecules in their electronic grou...

Descripción completa

Detalles Bibliográficos
Autores principales: Lupo Pasini, Massimiliano, Mehta, Kshitij, Yoo, Pilsun, Irle, Stephan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10442335/
https://www.ncbi.nlm.nih.gov/pubmed/37604820
http://dx.doi.org/10.1038/s41597-023-02408-4
_version_ 1785093571178659840
author Lupo Pasini, Massimiliano
Mehta, Kshitij
Yoo, Pilsun
Irle, Stephan
author_facet Lupo Pasini, Massimiliano
Mehta, Kshitij
Yoo, Pilsun
Irle, Stephan
author_sort Lupo Pasini, Massimiliano
collection PubMed
description We present two open-source datasets that provide time-dependent density-functional tight-binding (TD-DFTB) electronic excitation spectra of organic molecules. These datasets represent predictions of UV-vis absorption spectra performed on optimized geometries of the molecules in their electronic ground state. The GDB-9-Ex dataset contains a subset of 96,766 organic molecules from the original open-source GDB-9 dataset. The ORNL_AISD-Ex dataset consists of 10,502,904 organic molecules that contain between 5 and 71 non-hydrogen atoms. The data reveals the close correlation between the magnitude of the gaps between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO), and the excitation energy of the lowest singlet excited state energies quantitatively. The chemical variability of the large number of molecules was examined with a topological fingerprint estimation based on extended-connectivity fingerprints (ECFPs) followed by uniform manifold approximation and projection (UMAP) for dimension reduction. Both datasets were generated using the DFTB+ software on the “Andes” cluster of the Oak Ridge Leadership Computing Facility (OLCF).
format Online
Article
Text
id pubmed-10442335
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104423352023-08-23 Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules Lupo Pasini, Massimiliano Mehta, Kshitij Yoo, Pilsun Irle, Stephan Sci Data Data Descriptor We present two open-source datasets that provide time-dependent density-functional tight-binding (TD-DFTB) electronic excitation spectra of organic molecules. These datasets represent predictions of UV-vis absorption spectra performed on optimized geometries of the molecules in their electronic ground state. The GDB-9-Ex dataset contains a subset of 96,766 organic molecules from the original open-source GDB-9 dataset. The ORNL_AISD-Ex dataset consists of 10,502,904 organic molecules that contain between 5 and 71 non-hydrogen atoms. The data reveals the close correlation between the magnitude of the gaps between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO), and the excitation energy of the lowest singlet excited state energies quantitatively. The chemical variability of the large number of molecules was examined with a topological fingerprint estimation based on extended-connectivity fingerprints (ECFPs) followed by uniform manifold approximation and projection (UMAP) for dimension reduction. Both datasets were generated using the DFTB+ software on the “Andes” cluster of the Oak Ridge Leadership Computing Facility (OLCF). Nature Publishing Group UK 2023-08-21 /pmc/articles/PMC10442335/ /pubmed/37604820 http://dx.doi.org/10.1038/s41597-023-02408-4 Text en © UT-Battelle, LLC 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Lupo Pasini, Massimiliano
Mehta, Kshitij
Yoo, Pilsun
Irle, Stephan
Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules
title Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules
title_full Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules
title_fullStr Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules
title_full_unstemmed Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules
title_short Two excited-state datasets for quantum chemical UV-vis spectra of organic molecules
title_sort two excited-state datasets for quantum chemical uv-vis spectra of organic molecules
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10442335/
https://www.ncbi.nlm.nih.gov/pubmed/37604820
http://dx.doi.org/10.1038/s41597-023-02408-4
work_keys_str_mv AT lupopasinimassimiliano twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules
AT mehtakshitij twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules
AT yoopilsun twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules
AT irlestephan twoexcitedstatedatasetsforquantumchemicaluvvisspectraoforganicmolecules