Cargando…

Named entity recognition for Chinese based on global pointer and adversarial training

Named entity recognition aims to identify entities from unstructured text and is an important subtask for natural language processing and building knowledge graphs. Most of the existing entity recognition methods use conditional random fields as label decoders or use pointer networks for entity reco...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Hongjun, Cheng, Mingzhe, Yang, Zelin, Yang, Liqun, Chua, Yansong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9958032/
https://www.ncbi.nlm.nih.gov/pubmed/36828907
http://dx.doi.org/10.1038/s41598-023-30355-y
_version_ 1784894938507378688
author Li, Hongjun
Cheng, Mingzhe
Yang, Zelin
Yang, Liqun
Chua, Yansong
author_facet Li, Hongjun
Cheng, Mingzhe
Yang, Zelin
Yang, Liqun
Chua, Yansong
author_sort Li, Hongjun
collection PubMed
description Named entity recognition aims to identify entities from unstructured text and is an important subtask for natural language processing and building knowledge graphs. Most of the existing entity recognition methods use conditional random fields as label decoders or use pointer networks for entity recognition. However, when the number of tags is large, the computational cost of method based on conditional random fields is high and the problem of nested entities cannot be solved. The pointer network uses two modules to identify the first and the last of the entities separately, and a single module can only focus on the information of the first or the last of the entities, but cannot pay attention to the global information of the entities. In addition, the neural network model has the problem of local instability. To solve mentioned problems, a named entity recognition model based on global pointer and adversarial training is proposed. To obtain global entity information, global pointer is used to decode entity information, and rotary relative position information is considered in the model designing to improve the model’s perception of position; to solve the model’s local instability problem, adversarial training is used to improve the robustness and generalization of the model. The experimental results show that the F1 score of the model are improved on several public datasets of OntoNotes5, MSRA, Resume, and Weibo compared with the existing mainstream models.
format Online
Article
Text
id pubmed-9958032
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-99580322023-02-26 Named entity recognition for Chinese based on global pointer and adversarial training Li, Hongjun Cheng, Mingzhe Yang, Zelin Yang, Liqun Chua, Yansong Sci Rep Article Named entity recognition aims to identify entities from unstructured text and is an important subtask for natural language processing and building knowledge graphs. Most of the existing entity recognition methods use conditional random fields as label decoders or use pointer networks for entity recognition. However, when the number of tags is large, the computational cost of method based on conditional random fields is high and the problem of nested entities cannot be solved. The pointer network uses two modules to identify the first and the last of the entities separately, and a single module can only focus on the information of the first or the last of the entities, but cannot pay attention to the global information of the entities. In addition, the neural network model has the problem of local instability. To solve mentioned problems, a named entity recognition model based on global pointer and adversarial training is proposed. To obtain global entity information, global pointer is used to decode entity information, and rotary relative position information is considered in the model designing to improve the model’s perception of position; to solve the model’s local instability problem, adversarial training is used to improve the robustness and generalization of the model. The experimental results show that the F1 score of the model are improved on several public datasets of OntoNotes5, MSRA, Resume, and Weibo compared with the existing mainstream models. Nature Publishing Group UK 2023-02-24 /pmc/articles/PMC9958032/ /pubmed/36828907 http://dx.doi.org/10.1038/s41598-023-30355-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Li, Hongjun
Cheng, Mingzhe
Yang, Zelin
Yang, Liqun
Chua, Yansong
Named entity recognition for Chinese based on global pointer and adversarial training
title Named entity recognition for Chinese based on global pointer and adversarial training
title_full Named entity recognition for Chinese based on global pointer and adversarial training
title_fullStr Named entity recognition for Chinese based on global pointer and adversarial training
title_full_unstemmed Named entity recognition for Chinese based on global pointer and adversarial training
title_short Named entity recognition for Chinese based on global pointer and adversarial training
title_sort named entity recognition for chinese based on global pointer and adversarial training
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9958032/
https://www.ncbi.nlm.nih.gov/pubmed/36828907
http://dx.doi.org/10.1038/s41598-023-30355-y
work_keys_str_mv AT lihongjun namedentityrecognitionforchinesebasedonglobalpointerandadversarialtraining
AT chengmingzhe namedentityrecognitionforchinesebasedonglobalpointerandadversarialtraining
AT yangzelin namedentityrecognitionforchinesebasedonglobalpointerandadversarialtraining
AT yangliqun namedentityrecognitionforchinesebasedonglobalpointerandadversarialtraining
AT chuayansong namedentityrecognitionforchinesebasedonglobalpointerandadversarialtraining