Cargando…

Text-mined dataset of inorganic materials synthesis recipes

Materials discovery has become significantly facilitated and accelerated by high-throughput ab-initio computations. This ability to rapidly design interesting novel compounds has displaced the materials innovation bottleneck to the development of synthesis routes for the desired material. As there i...

Descripción completa

Detalles Bibliográficos
Autores principales: Kononova, Olga, Huo, Haoyan, He, Tanjin, Rong, Ziqin, Botari, Tiago, Sun, Wenhao, Tshitoyan, Vahe, Ceder, Gerbrand
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6794279/
https://www.ncbi.nlm.nih.gov/pubmed/31615989
http://dx.doi.org/10.1038/s41597-019-0224-1
_version_ 1783459255468163072
author Kononova, Olga
Huo, Haoyan
He, Tanjin
Rong, Ziqin
Botari, Tiago
Sun, Wenhao
Tshitoyan, Vahe
Ceder, Gerbrand
author_facet Kononova, Olga
Huo, Haoyan
He, Tanjin
Rong, Ziqin
Botari, Tiago
Sun, Wenhao
Tshitoyan, Vahe
Ceder, Gerbrand
author_sort Kononova, Olga
collection PubMed
description Materials discovery has become significantly facilitated and accelerated by high-throughput ab-initio computations. This ability to rapidly design interesting novel compounds has displaced the materials innovation bottleneck to the development of synthesis routes for the desired material. As there is no a fundamental theory for materials synthesis, one might attempt a data-driven approach for predicting inorganic materials synthesis, but this is impeded by the lack of a comprehensive database containing synthesis processes. To overcome this limitation, we have generated a dataset of “codified recipes” for solid-state synthesis automatically extracted from scientific publications. The dataset consists of 19,488 synthesis entries retrieved from 53,538 solid-state synthesis paragraphs by using text mining and natural language processing approaches. Every entry contains information about target material, starting compounds, operations used and their conditions, as well as the balanced chemical equation of the synthesis reaction. The dataset is publicly available and can be used for data mining of various aspects of inorganic materials synthesis.
format Online
Article
Text
id pubmed-6794279
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-67942792019-10-18 Text-mined dataset of inorganic materials synthesis recipes Kononova, Olga Huo, Haoyan He, Tanjin Rong, Ziqin Botari, Tiago Sun, Wenhao Tshitoyan, Vahe Ceder, Gerbrand Sci Data Data Descriptor Materials discovery has become significantly facilitated and accelerated by high-throughput ab-initio computations. This ability to rapidly design interesting novel compounds has displaced the materials innovation bottleneck to the development of synthesis routes for the desired material. As there is no a fundamental theory for materials synthesis, one might attempt a data-driven approach for predicting inorganic materials synthesis, but this is impeded by the lack of a comprehensive database containing synthesis processes. To overcome this limitation, we have generated a dataset of “codified recipes” for solid-state synthesis automatically extracted from scientific publications. The dataset consists of 19,488 synthesis entries retrieved from 53,538 solid-state synthesis paragraphs by using text mining and natural language processing approaches. Every entry contains information about target material, starting compounds, operations used and their conditions, as well as the balanced chemical equation of the synthesis reaction. The dataset is publicly available and can be used for data mining of various aspects of inorganic materials synthesis. Nature Publishing Group UK 2019-10-15 /pmc/articles/PMC6794279/ /pubmed/31615989 http://dx.doi.org/10.1038/s41597-019-0224-1 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Kononova, Olga
Huo, Haoyan
He, Tanjin
Rong, Ziqin
Botari, Tiago
Sun, Wenhao
Tshitoyan, Vahe
Ceder, Gerbrand
Text-mined dataset of inorganic materials synthesis recipes
title Text-mined dataset of inorganic materials synthesis recipes
title_full Text-mined dataset of inorganic materials synthesis recipes
title_fullStr Text-mined dataset of inorganic materials synthesis recipes
title_full_unstemmed Text-mined dataset of inorganic materials synthesis recipes
title_short Text-mined dataset of inorganic materials synthesis recipes
title_sort text-mined dataset of inorganic materials synthesis recipes
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6794279/
https://www.ncbi.nlm.nih.gov/pubmed/31615989
http://dx.doi.org/10.1038/s41597-019-0224-1
work_keys_str_mv AT kononovaolga textmineddatasetofinorganicmaterialssynthesisrecipes
AT huohaoyan textmineddatasetofinorganicmaterialssynthesisrecipes
AT hetanjin textmineddatasetofinorganicmaterialssynthesisrecipes
AT rongziqin textmineddatasetofinorganicmaterialssynthesisrecipes
AT botaritiago textmineddatasetofinorganicmaterialssynthesisrecipes
AT sunwenhao textmineddatasetofinorganicmaterialssynthesisrecipes
AT tshitoyanvahe textmineddatasetofinorganicmaterialssynthesisrecipes
AT cedergerbrand textmineddatasetofinorganicmaterialssynthesisrecipes