Cargando…

Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System

On-board system fault knowledge base (KB) is a collection of fault causes, maintenance methods, and interrelationships among on-board modules and components of high-speed railways, which plays a crucial role in knowledge-driven dynamic operation and maintenance (O&M) decisions for on-board syste...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Lu-jie, Zhao, Zhi-peng, Dang, Jian-wu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9173926/
https://www.ncbi.nlm.nih.gov/pubmed/35685160
http://dx.doi.org/10.1155/2022/9948218
_version_ 1784722125436747776
author Zhou, Lu-jie
Zhao, Zhi-peng
Dang, Jian-wu
author_facet Zhou, Lu-jie
Zhao, Zhi-peng
Dang, Jian-wu
author_sort Zhou, Lu-jie
collection PubMed
description On-board system fault knowledge base (KB) is a collection of fault causes, maintenance methods, and interrelationships among on-board modules and components of high-speed railways, which plays a crucial role in knowledge-driven dynamic operation and maintenance (O&M) decisions for on-board systems. To solve the problem of multi-source heterogeneity of on-board system O&M data, an entity matching (EM) approach using the BERT model and semi-supervised incremental learning is proposed. The heterogeneous knowledge fusion task is formulated as a pairwise binary classification task of entities in the knowledge units. Firstly, the deep semantic features of fault knowledge units are obtained by BERT. We also investigate the effectiveness of knowledge unit features extracted from different hidden layers of the model on heterogeneous knowledge fusion during model fine-tuning. To further improve the utilization of unlabeled test samples, a semi-supervised incremental learning strategy based on pseudo labels is devised. By selecting entity pairs with high confidence to generate pseudo labels, the label sample set is expanded to realize incremental learning and enhance the knowledge fusion ability of the model. Furthermore, the model's robustness is strengthened by embedding-based adversarial training in the fine-tuning stage. Based on the on-board system's O&M data, this paper constructs the fault KB and compares the model with other solutions developed for related matching tasks, which verifies the effectiveness of this model in the heterogeneous knowledge fusion task of the on-board system.
format Online
Article
Text
id pubmed-9173926
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-91739262022-06-08 Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System Zhou, Lu-jie Zhao, Zhi-peng Dang, Jian-wu Comput Intell Neurosci Research Article On-board system fault knowledge base (KB) is a collection of fault causes, maintenance methods, and interrelationships among on-board modules and components of high-speed railways, which plays a crucial role in knowledge-driven dynamic operation and maintenance (O&M) decisions for on-board systems. To solve the problem of multi-source heterogeneity of on-board system O&M data, an entity matching (EM) approach using the BERT model and semi-supervised incremental learning is proposed. The heterogeneous knowledge fusion task is formulated as a pairwise binary classification task of entities in the knowledge units. Firstly, the deep semantic features of fault knowledge units are obtained by BERT. We also investigate the effectiveness of knowledge unit features extracted from different hidden layers of the model on heterogeneous knowledge fusion during model fine-tuning. To further improve the utilization of unlabeled test samples, a semi-supervised incremental learning strategy based on pseudo labels is devised. By selecting entity pairs with high confidence to generate pseudo labels, the label sample set is expanded to realize incremental learning and enhance the knowledge fusion ability of the model. Furthermore, the model's robustness is strengthened by embedding-based adversarial training in the fine-tuning stage. Based on the on-board system's O&M data, this paper constructs the fault KB and compares the model with other solutions developed for related matching tasks, which verifies the effectiveness of this model in the heterogeneous knowledge fusion task of the on-board system. Hindawi 2022-05-31 /pmc/articles/PMC9173926/ /pubmed/35685160 http://dx.doi.org/10.1155/2022/9948218 Text en Copyright © 2022 Lu-jie Zhou et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhou, Lu-jie
Zhao, Zhi-peng
Dang, Jian-wu
Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System
title Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System
title_full Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System
title_fullStr Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System
title_full_unstemmed Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System
title_short Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System
title_sort combining bert model with semi-supervised incremental learning for heterogeneous knowledge fusion of high-speed railway on-board system
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9173926/
https://www.ncbi.nlm.nih.gov/pubmed/35685160
http://dx.doi.org/10.1155/2022/9948218
work_keys_str_mv AT zhoulujie combiningbertmodelwithsemisupervisedincrementallearningforheterogeneousknowledgefusionofhighspeedrailwayonboardsystem
AT zhaozhipeng combiningbertmodelwithsemisupervisedincrementallearningforheterogeneousknowledgefusionofhighspeedrailwayonboardsystem
AT dangjianwu combiningbertmodelwithsemisupervisedincrementallearningforheterogeneousknowledgefusionofhighspeedrailwayonboardsystem