Cargando…

A Method of Domain Dictionary Construction for Electric Vehicles Disassembly

Currently, there is no domain dictionary in the field of electric vehicles disassembly and other domain dictionary construction algorithms do not accurately extract terminology from disassembly text, because the terminology is complex and variable. Herein, the construction of a domain dictionary for...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Wei, Zhang, Hengwei, Chen, Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947409/
https://www.ncbi.nlm.nih.gov/pubmed/35327874
http://dx.doi.org/10.3390/e24030363
_version_ 1784674432399179776
author Ren, Wei
Zhang, Hengwei
Chen, Ming
author_facet Ren, Wei
Zhang, Hengwei
Chen, Ming
author_sort Ren, Wei
collection PubMed
description Currently, there is no domain dictionary in the field of electric vehicles disassembly and other domain dictionary construction algorithms do not accurately extract terminology from disassembly text, because the terminology is complex and variable. Herein, the construction of a domain dictionary for the disassembly of electric vehicles is a research work that has important research significance. Extracting high-quality keywords from text and categorizing them widely uses information mining, which is the basis of named entity recognition, relation extraction, knowledge questions and answers and other disassembly domain information recognition and extraction. In this paper, we propose a supervised learning dictionary construction algorithm based on multi-dimensional features that combines different features of extraction candidate keywords from the text of each scientific study. Keywords recognition is regarded as a binary classification problem using the LightGBM model to filter each keyword, and then expand the domain dictionary based on the pointwise mutual information value between keywords and its category. Here, we make use of Chinese disassembly manuals, patents and papers in order to establish a general corpus about the disassembly information and then use our model to mine the disassembly parts, disassembly tools, disassembly methods, disassembly process, and other categories of disassembly keywords. The experiment evidenced that our algorithms can significantly improve extraction and category performance better than traditional algorithms in the disassembly domain. We also investigated the performance algorithms and attempts to describe them. Our work sets a benchmark for domain dictionary construction in the field of disassembly of electric vehicles that is based on the newly developed dataset using a multi-class terminology classification.
format Online
Article
Text
id pubmed-8947409
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89474092022-03-25 A Method of Domain Dictionary Construction for Electric Vehicles Disassembly Ren, Wei Zhang, Hengwei Chen, Ming Entropy (Basel) Article Currently, there is no domain dictionary in the field of electric vehicles disassembly and other domain dictionary construction algorithms do not accurately extract terminology from disassembly text, because the terminology is complex and variable. Herein, the construction of a domain dictionary for the disassembly of electric vehicles is a research work that has important research significance. Extracting high-quality keywords from text and categorizing them widely uses information mining, which is the basis of named entity recognition, relation extraction, knowledge questions and answers and other disassembly domain information recognition and extraction. In this paper, we propose a supervised learning dictionary construction algorithm based on multi-dimensional features that combines different features of extraction candidate keywords from the text of each scientific study. Keywords recognition is regarded as a binary classification problem using the LightGBM model to filter each keyword, and then expand the domain dictionary based on the pointwise mutual information value between keywords and its category. Here, we make use of Chinese disassembly manuals, patents and papers in order to establish a general corpus about the disassembly information and then use our model to mine the disassembly parts, disassembly tools, disassembly methods, disassembly process, and other categories of disassembly keywords. The experiment evidenced that our algorithms can significantly improve extraction and category performance better than traditional algorithms in the disassembly domain. We also investigated the performance algorithms and attempts to describe them. Our work sets a benchmark for domain dictionary construction in the field of disassembly of electric vehicles that is based on the newly developed dataset using a multi-class terminology classification. MDPI 2022-03-03 /pmc/articles/PMC8947409/ /pubmed/35327874 http://dx.doi.org/10.3390/e24030363 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ren, Wei
Zhang, Hengwei
Chen, Ming
A Method of Domain Dictionary Construction for Electric Vehicles Disassembly
title A Method of Domain Dictionary Construction for Electric Vehicles Disassembly
title_full A Method of Domain Dictionary Construction for Electric Vehicles Disassembly
title_fullStr A Method of Domain Dictionary Construction for Electric Vehicles Disassembly
title_full_unstemmed A Method of Domain Dictionary Construction for Electric Vehicles Disassembly
title_short A Method of Domain Dictionary Construction for Electric Vehicles Disassembly
title_sort method of domain dictionary construction for electric vehicles disassembly
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947409/
https://www.ncbi.nlm.nih.gov/pubmed/35327874
http://dx.doi.org/10.3390/e24030363
work_keys_str_mv AT renwei amethodofdomaindictionaryconstructionforelectricvehiclesdisassembly
AT zhanghengwei amethodofdomaindictionaryconstructionforelectricvehiclesdisassembly
AT chenming amethodofdomaindictionaryconstructionforelectricvehiclesdisassembly
AT renwei methodofdomaindictionaryconstructionforelectricvehiclesdisassembly
AT zhanghengwei methodofdomaindictionaryconstructionforelectricvehiclesdisassembly
AT chenming methodofdomaindictionaryconstructionforelectricvehiclesdisassembly