Cargando…

A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature

The electrocatalytic CO(2) reduction process has gained enormous attention for both environmental protection and chemicals production. Thereinto, the design of new electrocatalysts with high activity and selectivity can draw inspiration from the abundant scientific literature. An annotated and verif...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ludi, Gao, Yang, Chen, Xueqing, Cui, Wenjuan, Zhou, Yuanchun, Luo, Xinying, Xu, Shuaishuai, Du, Yi, Wang, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10060421/
https://www.ncbi.nlm.nih.gov/pubmed/36991006
http://dx.doi.org/10.1038/s41597-023-02089-z
_version_ 1785017093224136704
author Wang, Ludi
Gao, Yang
Chen, Xueqing
Cui, Wenjuan
Zhou, Yuanchun
Luo, Xinying
Xu, Shuaishuai
Du, Yi
Wang, Bin
author_facet Wang, Ludi
Gao, Yang
Chen, Xueqing
Cui, Wenjuan
Zhou, Yuanchun
Luo, Xinying
Xu, Shuaishuai
Du, Yi
Wang, Bin
author_sort Wang, Ludi
collection PubMed
description The electrocatalytic CO(2) reduction process has gained enormous attention for both environmental protection and chemicals production. Thereinto, the design of new electrocatalysts with high activity and selectivity can draw inspiration from the abundant scientific literature. An annotated and verified corpus made from massive literature can assist the development of natural language processing (NLP) models, which can offer insight to help guide the understanding of these underlying mechanisms. To facilitate data mining in this direction, we present a benchmark corpus of 6,086 records manually extracted from 835 electrocatalytic publications, along with an extended corpus with 145,179 records in this article. In this corpus, nine types of knowledge such as material, regulation method, product, faradaic efficiency, cell setup, electrolyte, synthesis method, current density, and voltage are provided by either annotating or extracting. Machine learning algorithms can be applied to the corpus to help scientists find new and effective electrocatalysts. Furthermore, researchers familiar with NLP can use this corpus to design domain-specific named entity recognition (NER) models.
format Online
Article
Text
id pubmed-10060421
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100604212023-03-31 A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature Wang, Ludi Gao, Yang Chen, Xueqing Cui, Wenjuan Zhou, Yuanchun Luo, Xinying Xu, Shuaishuai Du, Yi Wang, Bin Sci Data Data Descriptor The electrocatalytic CO(2) reduction process has gained enormous attention for both environmental protection and chemicals production. Thereinto, the design of new electrocatalysts with high activity and selectivity can draw inspiration from the abundant scientific literature. An annotated and verified corpus made from massive literature can assist the development of natural language processing (NLP) models, which can offer insight to help guide the understanding of these underlying mechanisms. To facilitate data mining in this direction, we present a benchmark corpus of 6,086 records manually extracted from 835 electrocatalytic publications, along with an extended corpus with 145,179 records in this article. In this corpus, nine types of knowledge such as material, regulation method, product, faradaic efficiency, cell setup, electrolyte, synthesis method, current density, and voltage are provided by either annotating or extracting. Machine learning algorithms can be applied to the corpus to help scientists find new and effective electrocatalysts. Furthermore, researchers familiar with NLP can use this corpus to design domain-specific named entity recognition (NER) models. Nature Publishing Group UK 2023-03-29 /pmc/articles/PMC10060421/ /pubmed/36991006 http://dx.doi.org/10.1038/s41597-023-02089-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Wang, Ludi
Gao, Yang
Chen, Xueqing
Cui, Wenjuan
Zhou, Yuanchun
Luo, Xinying
Xu, Shuaishuai
Du, Yi
Wang, Bin
A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature
title A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature
title_full A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature
title_fullStr A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature
title_full_unstemmed A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature
title_short A corpus of CO(2) electrocatalytic reduction process extracted from the scientific literature
title_sort corpus of co(2) electrocatalytic reduction process extracted from the scientific literature
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10060421/
https://www.ncbi.nlm.nih.gov/pubmed/36991006
http://dx.doi.org/10.1038/s41597-023-02089-z
work_keys_str_mv AT wangludi acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT gaoyang acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT chenxueqing acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT cuiwenjuan acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT zhouyuanchun acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT luoxinying acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT xushuaishuai acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT duyi acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT wangbin acorpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT wangludi corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT gaoyang corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT chenxueqing corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT cuiwenjuan corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT zhouyuanchun corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT luoxinying corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT xushuaishuai corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT duyi corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature
AT wangbin corpusofco2electrocatalyticreductionprocessextractedfromthescientificliterature