A Method of Domain Dictionary Construction for Electric Vehicles Disassembly
Currently, there is no domain dictionary for electric vehicle disassembly, and existing domain dictionary construction algorithms do not accurately extract terminology from disassembly text because the terminology is complex and variable. Herein, the construction of a domain dictionary for...
Main authors: | Ren, Wei; Zhang, Hengwei; Chen, Ming |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947409/ https://www.ncbi.nlm.nih.gov/pubmed/35327874 http://dx.doi.org/10.3390/e24030363 |
_version_ | 1784674432399179776 |
---|---|
author | Ren, Wei Zhang, Hengwei Chen, Ming |
author_facet | Ren, Wei Zhang, Hengwei Chen, Ming |
author_sort | Ren, Wei |
collection | PubMed |
description | Currently, there is no domain dictionary for electric vehicle disassembly, and existing domain dictionary construction algorithms do not accurately extract terminology from disassembly text because the terminology is complex and variable. Constructing a domain dictionary for the disassembly of electric vehicles is therefore of significant research value. Extracting high-quality keywords from text and categorizing them is widely used in information mining and is the basis of named entity recognition, relation extraction, knowledge question answering, and other information recognition and extraction tasks in the disassembly domain. In this paper, we propose a supervised learning dictionary construction algorithm based on multi-dimensional features, which combines different features to extract candidate keywords from the text of each scientific study. Keyword recognition is treated as a binary classification problem in which a LightGBM model filters each candidate keyword; the domain dictionary is then expanded based on the pointwise mutual information between keywords and their categories. We use Chinese disassembly manuals, patents, and papers to build a general corpus of disassembly information, and then use our model to mine keywords for disassembly parts, disassembly tools, disassembly methods, disassembly processes, and other categories. The experiments show that our algorithm significantly improves extraction and categorization performance over traditional algorithms in the disassembly domain. We also investigate the performance of the algorithms and attempt to explain it. Our work sets a benchmark for domain dictionary construction in the field of electric vehicle disassembly, based on a newly developed dataset with multi-class terminology classification. |
format | Online Article Text |
id | pubmed-8947409 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-8947409 2022-03-25 A Method of Domain Dictionary Construction for Electric Vehicles Disassembly Ren, Wei Zhang, Hengwei Chen, Ming Entropy (Basel) Article MDPI 2022-03-03 /pmc/articles/PMC8947409/ /pubmed/35327874 http://dx.doi.org/10.3390/e24030363 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ren, Wei Zhang, Hengwei Chen, Ming A Method of Domain Dictionary Construction for Electric Vehicles Disassembly |
title | A Method of Domain Dictionary Construction for Electric Vehicles Disassembly |
title_full | A Method of Domain Dictionary Construction for Electric Vehicles Disassembly |
title_fullStr | A Method of Domain Dictionary Construction for Electric Vehicles Disassembly |
title_full_unstemmed | A Method of Domain Dictionary Construction for Electric Vehicles Disassembly |
title_short | A Method of Domain Dictionary Construction for Electric Vehicles Disassembly |
title_sort | method of domain dictionary construction for electric vehicles disassembly |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947409/ https://www.ncbi.nlm.nih.gov/pubmed/35327874 http://dx.doi.org/10.3390/e24030363 |
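
The description above mentions expanding the dictionary using the pointwise mutual information (PMI) between keywords and their categories. The record does not give the authors' exact formulation, so the following is only a minimal sketch of the standard PMI definition, PMI(t, c) = log2(P(t, c) / (P(t) P(c))), estimated from document counts. The function names, the threshold value, and the toy English tokens are all hypothetical illustrations (the paper's corpus is Chinese) and are not taken from the paper.

```python
import math
from collections import Counter


def pmi_scores(docs):
    """Estimate PMI for each (term, category) pair seen together.

    docs is a list of (category, tokens) pairs; probabilities are
    estimated as document frequencies (an assumption of this sketch).
    """
    term_counts = Counter()
    cat_counts = Counter()
    joint_counts = Counter()
    n = len(docs)
    for category, tokens in docs:
        cat_counts[category] += 1
        for term in set(tokens):  # count each term once per document
            term_counts[term] += 1
            joint_counts[(term, category)] += 1

    scores = {}
    for (term, category), joint in joint_counts.items():
        p_joint = joint / n
        p_term = term_counts[term] / n
        p_cat = cat_counts[category] / n
        scores[(term, category)] = math.log2(p_joint / (p_term * p_cat))
    return scores


def expand_dictionary(docs, threshold=0.5):
    """Assign a term to a category when its PMI reaches the threshold.

    The threshold is a hypothetical tuning parameter.
    """
    expanded = {}
    for (term, category), score in pmi_scores(docs).items():
        if score >= threshold:
            expanded.setdefault(category, set()).add(term)
    return expanded


# Hypothetical toy corpus: each document is labeled with one category.
toy_docs = [
    ("tool", ["wrench", "bolt"]),
    ("tool", ["wrench", "screwdriver"]),
    ("part", ["battery", "bolt"]),
    ("part", ["battery", "module"]),
]

if __name__ == "__main__":
    for category, terms in sorted(expand_dictionary(toy_docs).items()):
        print(category, sorted(terms))
```

In this toy corpus, "bolt" appears equally often under both categories, so its PMI with each is zero and it is not added to either; terms that appear under only one category score higher and are kept.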