Cargando…

Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets

[Image: see text] Screening combinatorial space for novel materials, such as perovskite-like ones for photovoltaics, has resulted in a high amount of simulated high-throughput data and analysis thereof. This study proposes a comprehensive comparison of structural fingerprint-based machine learning m...

Descripción completa

Detalles Bibliográficos
Autores principales: Mayr, Felix, Gagliardi, Alessio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8154242/
https://www.ncbi.nlm.nih.gov/pubmed/34056424
http://dx.doi.org/10.1021/acsomega.1c00991
_version_ 1783698968855707648
author Mayr, Felix
Gagliardi, Alessio
author_facet Mayr, Felix
Gagliardi, Alessio
author_sort Mayr, Felix
collection PubMed
description [Image: see text] Screening combinatorial space for novel materials, such as perovskite-like ones for photovoltaics, has resulted in a high amount of simulated high-throughput data and analysis thereof. This study proposes a comprehensive comparison of structural fingerprint-based machine learning models on seven open-source databases of perovskite-like materials to predict band gaps and energies. It shows that none of the given methods, including graph neural networks, are able to capture arbitrary databases evenly, while underlining that commonly used metrics are highly database-dependent in typical workflows. In addition, the applicability of variance selection and autoencoders to significantly reduce fingerprint size indicates that models built with common fingerprints only rely on a submanifold of the available fingerprint space.
format Online
Article
Text
id pubmed-8154242
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-81542422021-05-28 Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets Mayr, Felix Gagliardi, Alessio ACS Omega [Image: see text] Screening combinatorial space for novel materials, such as perovskite-like ones for photovoltaics, has resulted in a high amount of simulated high-throughput data and analysis thereof. This study proposes a comprehensive comparison of structural fingerprint-based machine learning models on seven open-source databases of perovskite-like materials to predict band gaps and energies. It shows that none of the given methods, including graph neural networks, are able to capture arbitrary databases evenly, while underlining that commonly used metrics are highly database-dependent in typical workflows. In addition, the applicability of variance selection and autoencoders to significantly reduce fingerprint size indicates that models built with common fingerprints only rely on a submanifold of the available fingerprint space. American Chemical Society 2021-05-03 /pmc/articles/PMC8154242/ /pubmed/34056424 http://dx.doi.org/10.1021/acsomega.1c00991 Text en © 2021 The Authors. Published by American Chemical Society Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Mayr, Felix
Gagliardi, Alessio
Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets
title Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets
title_full Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets
title_fullStr Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets
title_full_unstemmed Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets
title_short Global Property Prediction: A Benchmark Study on Open-Source, Perovskite-like Datasets
title_sort global property prediction: a benchmark study on open-source, perovskite-like datasets
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8154242/
https://www.ncbi.nlm.nih.gov/pubmed/34056424
http://dx.doi.org/10.1021/acsomega.1c00991
work_keys_str_mv AT mayrfelix globalpropertypredictionabenchmarkstudyonopensourceperovskitelikedatasets
AT gagliardialessio globalpropertypredictionabenchmarkstudyonopensourceperovskitelikedatasets