Cargando…

SOMAS: a platform for data-driven material discovery in redox flow battery development

Aqueous organic redox flow batteries offer an environmentally benign, tunable, and safe route to large-scale energy storage. The energy density is one of the key performance parameters of organic redox flow batteries, which critically depends on the solubility of the redox-active molecule in water....

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Peiyuan, Andersen, Amity, Sepulveda, Jonathan, Panapitiya, Gihan U., Hollas, Aaron, Saldanha, Emily G., Murugesan, Vijayakumar, Wang, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9715657/
https://www.ncbi.nlm.nih.gov/pubmed/36456604
http://dx.doi.org/10.1038/s41597-022-01814-4
_version_ 1784842501845155840
author Gao, Peiyuan
Andersen, Amity
Sepulveda, Jonathan
Panapitiya, Gihan U.
Hollas, Aaron
Saldanha, Emily G.
Murugesan, Vijayakumar
Wang, Wei
author_facet Gao, Peiyuan
Andersen, Amity
Sepulveda, Jonathan
Panapitiya, Gihan U.
Hollas, Aaron
Saldanha, Emily G.
Murugesan, Vijayakumar
Wang, Wei
author_sort Gao, Peiyuan
collection PubMed
description Aqueous organic redox flow batteries offer an environmentally benign, tunable, and safe route to large-scale energy storage. The energy density is one of the key performance parameters of organic redox flow batteries, which critically depends on the solubility of the redox-active molecule in water. Prediction of aqueous solubility remains a challenge in chemistry. Recently, machine learning models have been developed for molecular properties prediction in chemistry and material science. The fidelity of a machine learning model critically depends on the diversity, accuracy, and abundancy of the training datasets. We build a comprehensive open access organic molecular database “Solubility of Organic Molecules in Aqueous Solution” (SOMAS) containing about 12,000 molecules that covers wider chemical and solubility regimes suitable for aqueous organic redox flow battery development efforts. In addition to experimental solubility, we also provide eight distinctive quantum descriptors including optimized geometry derived from high-throughput density functional theory calculations along with six molecular descriptors for each molecule. SOMAS builds a critical foundation for future efforts in artificial intelligence-based solubility prediction models.
format Online
Article
Text
id pubmed-9715657
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97156572022-12-03 SOMAS: a platform for data-driven material discovery in redox flow battery development Gao, Peiyuan Andersen, Amity Sepulveda, Jonathan Panapitiya, Gihan U. Hollas, Aaron Saldanha, Emily G. Murugesan, Vijayakumar Wang, Wei Sci Data Data Descriptor Aqueous organic redox flow batteries offer an environmentally benign, tunable, and safe route to large-scale energy storage. The energy density is one of the key performance parameters of organic redox flow batteries, which critically depends on the solubility of the redox-active molecule in water. Prediction of aqueous solubility remains a challenge in chemistry. Recently, machine learning models have been developed for molecular properties prediction in chemistry and material science. The fidelity of a machine learning model critically depends on the diversity, accuracy, and abundancy of the training datasets. We build a comprehensive open access organic molecular database “Solubility of Organic Molecules in Aqueous Solution” (SOMAS) containing about 12,000 molecules that covers wider chemical and solubility regimes suitable for aqueous organic redox flow battery development efforts. In addition to experimental solubility, we also provide eight distinctive quantum descriptors including optimized geometry derived from high-throughput density functional theory calculations along with six molecular descriptors for each molecule. SOMAS builds a critical foundation for future efforts in artificial intelligence-based solubility prediction models. Nature Publishing Group UK 2022-12-01 /pmc/articles/PMC9715657/ /pubmed/36456604 http://dx.doi.org/10.1038/s41597-022-01814-4 Text en © Battelle Memorial Institute 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Gao, Peiyuan
Andersen, Amity
Sepulveda, Jonathan
Panapitiya, Gihan U.
Hollas, Aaron
Saldanha, Emily G.
Murugesan, Vijayakumar
Wang, Wei
SOMAS: a platform for data-driven material discovery in redox flow battery development
title SOMAS: a platform for data-driven material discovery in redox flow battery development
title_full SOMAS: a platform for data-driven material discovery in redox flow battery development
title_fullStr SOMAS: a platform for data-driven material discovery in redox flow battery development
title_full_unstemmed SOMAS: a platform for data-driven material discovery in redox flow battery development
title_short SOMAS: a platform for data-driven material discovery in redox flow battery development
title_sort somas: a platform for data-driven material discovery in redox flow battery development
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9715657/
https://www.ncbi.nlm.nih.gov/pubmed/36456604
http://dx.doi.org/10.1038/s41597-022-01814-4
work_keys_str_mv AT gaopeiyuan somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment
AT andersenamity somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment
AT sepulvedajonathan somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment
AT panapitiyagihanu somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment
AT hollasaaron somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment
AT saldanhaemilyg somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment
AT murugesanvijayakumar somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment
AT wangwei somasaplatformfordatadrivenmaterialdiscoveryinredoxflowbatterydevelopment