A database of battery materials auto-generated using ChemDataExtractor

A database of battery materials is presented which comprises a total of 292,313 data records, with 214,617 unique chemical-property data relations between 17,354 unique chemicals and up to five material properties: capacity, voltage, conductivity, Coulombic efficiency and energy. 117,403 data are mu...

Bibliographic Details
Main Authors: Huang, Shu, Cole, Jacqueline M.
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411033/
https://www.ncbi.nlm.nih.gov/pubmed/32764659
http://dx.doi.org/10.1038/s41597-020-00602-2
_version_ 1783568291777740800
author Huang, Shu
Cole, Jacqueline M.
author_facet Huang, Shu
Cole, Jacqueline M.
author_sort Huang, Shu
collection PubMed
description A database of battery materials is presented which comprises a total of 292,313 data records, with 214,617 unique chemical-property data relations between 17,354 unique chemicals and up to five material properties: capacity, voltage, conductivity, Coulombic efficiency and energy. 117,403 data are multivariate on a property where it is the dependent variable in part of a data series. The database was auto-generated by mining text from 229,061 academic papers using the chemistry-aware natural language processing toolkit, ChemDataExtractor version 1.5, which was modified for the specific domain of batteries. The collected data can be used as a representative overview of battery material information that is contained within text of scientific papers. Public availability of these data will also enable battery materials design and prediction via data-science methods. To the best of our knowledge, this is the first auto-generated database of battery materials extracted from a relatively large number of scientific papers. We also provide a Graphical User Interface (GUI) to aid the use of this database.
format Online
Article
Text
id pubmed-7411033
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-74110332020-08-14 A database of battery materials auto-generated using ChemDataExtractor Huang, Shu Cole, Jacqueline M. Sci Data Data Descriptor A database of battery materials is presented which comprises a total of 292,313 data records, with 214,617 unique chemical-property data relations between 17,354 unique chemicals and up to five material properties: capacity, voltage, conductivity, Coulombic efficiency and energy. 117,403 data are multivariate on a property where it is the dependent variable in part of a data series. The database was auto-generated by mining text from 229,061 academic papers using the chemistry-aware natural language processing toolkit, ChemDataExtractor version 1.5, which was modified for the specific domain of batteries. The collected data can be used as a representative overview of battery material information that is contained within text of scientific papers. Public availability of these data will also enable battery materials design and prediction via data-science methods. To the best of our knowledge, this is the first auto-generated database of battery materials extracted from a relatively large number of scientific papers. We also provide a Graphical User Interface (GUI) to aid the use of this database. Nature Publishing Group UK 2020-08-06 /pmc/articles/PMC7411033/ /pubmed/32764659 http://dx.doi.org/10.1038/s41597-020-00602-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. 
If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Huang, Shu
Cole, Jacqueline M.
A database of battery materials auto-generated using ChemDataExtractor
title A database of battery materials auto-generated using ChemDataExtractor
title_full A database of battery materials auto-generated using ChemDataExtractor
title_fullStr A database of battery materials auto-generated using ChemDataExtractor
title_full_unstemmed A database of battery materials auto-generated using ChemDataExtractor
title_short A database of battery materials auto-generated using ChemDataExtractor
title_sort database of battery materials auto-generated using chemdataextractor
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411033/
https://www.ncbi.nlm.nih.gov/pubmed/32764659
http://dx.doi.org/10.1038/s41597-020-00602-2
work_keys_str_mv AT huangshu adatabaseofbatterymaterialsautogeneratedusingchemdataextractor
AT colejacquelinem adatabaseofbatterymaterialsautogeneratedusingchemdataextractor
AT huangshu databaseofbatterymaterialsautogeneratedusingchemdataextractor
AT colejacquelinem databaseofbatterymaterialsautogeneratedusingchemdataextractor