Cargando…

3DSC - a dataset of superconductors including crystal structures

Data-driven methods, in particular machine learning, can help to speed up the discovery of new materials by finding hidden patterns in existing data and using them to identify promising candidate materials. In the case of superconductors, the use of data science tools is to date slowed down by a lac...

Descripción completa

Detalles Bibliográficos
Autores principales: Sommer, Timo, Willa, Roland, Schmalian, Jörg, Friederich, Pascal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10663493/
https://www.ncbi.nlm.nih.gov/pubmed/37990027
http://dx.doi.org/10.1038/s41597-023-02721-y
_version_ 1785148648297857024
author Sommer, Timo
Willa, Roland
Schmalian, Jörg
Friederich, Pascal
author_facet Sommer, Timo
Willa, Roland
Schmalian, Jörg
Friederich, Pascal
author_sort Sommer, Timo
collection PubMed
description Data-driven methods, in particular machine learning, can help to speed up the discovery of new materials by finding hidden patterns in existing data and using them to identify promising candidate materials. In the case of superconductors, the use of data science tools is to date slowed down by a lack of accessible data. In this work, we present a new and publicly available superconductivity dataset (‘3DSC’), featuring the critical temperature T(C) of superconducting materials additionally to tested non-superconductors. In contrast to existing databases such as the SuperCon database which contains information on the chemical composition, the 3DSC is augmented by approximate three-dimensional crystal structures. We perform a statistical analysis and machine learning experiments to show that access to this structural information improves the prediction of the critical temperature T(C) of materials. Furthermore, we provide ideas and directions for further research to improve the 3DSC. We are confident that this database will be useful in applying state-of-the-art machine learning methods to eventually find new superconductors.
format Online
Article
Text
id pubmed-10663493
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106634932023-11-21 3DSC - a dataset of superconductors including crystal structures Sommer, Timo Willa, Roland Schmalian, Jörg Friederich, Pascal Sci Data Data Descriptor Data-driven methods, in particular machine learning, can help to speed up the discovery of new materials by finding hidden patterns in existing data and using them to identify promising candidate materials. In the case of superconductors, the use of data science tools is to date slowed down by a lack of accessible data. In this work, we present a new and publicly available superconductivity dataset (‘3DSC’), featuring the critical temperature T(C) of superconducting materials additionally to tested non-superconductors. In contrast to existing databases such as the SuperCon database which contains information on the chemical composition, the 3DSC is augmented by approximate three-dimensional crystal structures. We perform a statistical analysis and machine learning experiments to show that access to this structural information improves the prediction of the critical temperature T(C) of materials. Furthermore, we provide ideas and directions for further research to improve the 3DSC. We are confident that this database will be useful in applying state-of-the-art machine learning methods to eventually find new superconductors. Nature Publishing Group UK 2023-11-21 /pmc/articles/PMC10663493/ /pubmed/37990027 http://dx.doi.org/10.1038/s41597-023-02721-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Sommer, Timo
Willa, Roland
Schmalian, Jörg
Friederich, Pascal
3DSC - a dataset of superconductors including crystal structures
title 3DSC - a dataset of superconductors including crystal structures
title_full 3DSC - a dataset of superconductors including crystal structures
title_fullStr 3DSC - a dataset of superconductors including crystal structures
title_full_unstemmed 3DSC - a dataset of superconductors including crystal structures
title_short 3DSC - a dataset of superconductors including crystal structures
title_sort 3dsc - a dataset of superconductors including crystal structures
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10663493/
https://www.ncbi.nlm.nih.gov/pubmed/37990027
http://dx.doi.org/10.1038/s41597-023-02721-y
work_keys_str_mv AT sommertimo 3dscadatasetofsuperconductorsincludingcrystalstructures
AT willaroland 3dscadatasetofsuperconductorsincludingcrystalstructures
AT schmalianjorg 3dscadatasetofsuperconductorsincludingcrystalstructures
AT friederichpascal 3dscadatasetofsuperconductorsincludingcrystalstructures