Cargando…

FGSI: distant supervision for relation extraction method based on fine-grained semantic information

Relation extraction is one of the important steps in building a knowledge graph. Its main objective is to extract semantic relationships from identified entity pairs in sentences, playing a crucial role in semantic understanding and knowledge graph construction. Remote supervised relation extraction...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Chenghong, Ji, Weidong, Zhou, Guohui, Guo, Hui, Yin, Zengxiang, Yue, Yuqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462618/
https://www.ncbi.nlm.nih.gov/pubmed/37640843
http://dx.doi.org/10.1038/s41598-023-41354-4
_version_ 1785098070681190400
author Sun, Chenghong
Ji, Weidong
Zhou, Guohui
Guo, Hui
Yin, Zengxiang
Yue, Yuqi
author_facet Sun, Chenghong
Ji, Weidong
Zhou, Guohui
Guo, Hui
Yin, Zengxiang
Yue, Yuqi
author_sort Sun, Chenghong
collection PubMed
description Relation extraction is one of the important steps in building a knowledge graph. Its main objective is to extract semantic relationships from identified entity pairs in sentences, playing a crucial role in semantic understanding and knowledge graph construction. Remote supervised relation extraction aligns knowledge bases with natural language texts and generates labeled data, which alleviates the burden of manually annotating datasets. However, the labeled corpus obtained from remote supervision contains a large amount of noisy data, which greatly affects the training of relation extraction models. In this paper, we propose the hypothesis that key semantic information within the sentence plays a crucial role in entity relation extraction in the task of remote supervised relation extraction. Based on this hypothesis, we divide the sentence into three segments by splitting it according to the positions of entities, starting from within the sentence. Then, using intra-sentence attention mechanisms, we identify fine-grained semantic features within the sentence to reduce the interference of irrelevant noise information. We also improved the intra-bag attention mechanism by setting a threshold gate to filter out low-relevant noisy sentences, minimizing the impact of noise on the relation extraction model, and making full use of available positive semantic information. Experimental results show that the proposed relation extraction model in this paper achieves improvements in precision-recall curve, P@N value, and AUC value compared to existing methods, demonstrating the effectiveness of this model.
format Online
Article
Text
id pubmed-10462618
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104626182023-08-30 FGSI: distant supervision for relation extraction method based on fine-grained semantic information Sun, Chenghong Ji, Weidong Zhou, Guohui Guo, Hui Yin, Zengxiang Yue, Yuqi Sci Rep Article Relation extraction is one of the important steps in building a knowledge graph. Its main objective is to extract semantic relationships from identified entity pairs in sentences, playing a crucial role in semantic understanding and knowledge graph construction. Remote supervised relation extraction aligns knowledge bases with natural language texts and generates labeled data, which alleviates the burden of manually annotating datasets. However, the labeled corpus obtained from remote supervision contains a large amount of noisy data, which greatly affects the training of relation extraction models. In this paper, we propose the hypothesis that key semantic information within the sentence plays a crucial role in entity relation extraction in the task of remote supervised relation extraction. Based on this hypothesis, we divide the sentence into three segments by splitting it according to the positions of entities, starting from within the sentence. Then, using intra-sentence attention mechanisms, we identify fine-grained semantic features within the sentence to reduce the interference of irrelevant noise information. We also improved the intra-bag attention mechanism by setting a threshold gate to filter out low-relevant noisy sentences, minimizing the impact of noise on the relation extraction model, and making full use of available positive semantic information. Experimental results show that the proposed relation extraction model in this paper achieves improvements in precision-recall curve, P@N value, and AUC value compared to existing methods, demonstrating the effectiveness of this model. Nature Publishing Group UK 2023-08-28 /pmc/articles/PMC10462618/ /pubmed/37640843 http://dx.doi.org/10.1038/s41598-023-41354-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Sun, Chenghong
Ji, Weidong
Zhou, Guohui
Guo, Hui
Yin, Zengxiang
Yue, Yuqi
FGSI: distant supervision for relation extraction method based on fine-grained semantic information
title FGSI: distant supervision for relation extraction method based on fine-grained semantic information
title_full FGSI: distant supervision for relation extraction method based on fine-grained semantic information
title_fullStr FGSI: distant supervision for relation extraction method based on fine-grained semantic information
title_full_unstemmed FGSI: distant supervision for relation extraction method based on fine-grained semantic information
title_short FGSI: distant supervision for relation extraction method based on fine-grained semantic information
title_sort fgsi: distant supervision for relation extraction method based on fine-grained semantic information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462618/
https://www.ncbi.nlm.nih.gov/pubmed/37640843
http://dx.doi.org/10.1038/s41598-023-41354-4
work_keys_str_mv AT sunchenghong fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation
AT jiweidong fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation
AT zhouguohui fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation
AT guohui fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation
AT yinzengxiang fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation
AT yueyuqi fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation