Cargando…
FGSI: distant supervision for relation extraction method based on fine-grained semantic information
Relation extraction is one of the important steps in building a knowledge graph. Its main objective is to extract semantic relationships from identified entity pairs in sentences, playing a crucial role in semantic understanding and knowledge graph construction. Remote supervised relation extraction...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462618/ https://www.ncbi.nlm.nih.gov/pubmed/37640843 http://dx.doi.org/10.1038/s41598-023-41354-4 |
_version_ | 1785098070681190400 |
---|---|
author | Sun, Chenghong Ji, Weidong Zhou, Guohui Guo, Hui Yin, Zengxiang Yue, Yuqi |
author_facet | Sun, Chenghong Ji, Weidong Zhou, Guohui Guo, Hui Yin, Zengxiang Yue, Yuqi |
author_sort | Sun, Chenghong |
collection | PubMed |
description | Relation extraction is one of the important steps in building a knowledge graph. Its main objective is to extract semantic relationships from identified entity pairs in sentences, playing a crucial role in semantic understanding and knowledge graph construction. Remote supervised relation extraction aligns knowledge bases with natural language texts and generates labeled data, which alleviates the burden of manually annotating datasets. However, the labeled corpus obtained from remote supervision contains a large amount of noisy data, which greatly affects the training of relation extraction models. In this paper, we propose the hypothesis that key semantic information within the sentence plays a crucial role in entity relation extraction in the task of remote supervised relation extraction. Based on this hypothesis, we divide the sentence into three segments by splitting it according to the positions of entities, starting from within the sentence. Then, using intra-sentence attention mechanisms, we identify fine-grained semantic features within the sentence to reduce the interference of irrelevant noise information. We also improved the intra-bag attention mechanism by setting a threshold gate to filter out low-relevant noisy sentences, minimizing the impact of noise on the relation extraction model, and making full use of available positive semantic information. Experimental results show that the proposed relation extraction model in this paper achieves improvements in precision-recall curve, P@N value, and AUC value compared to existing methods, demonstrating the effectiveness of this model. |
format | Online Article Text |
id | pubmed-10462618 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-104626182023-08-30 FGSI: distant supervision for relation extraction method based on fine-grained semantic information Sun, Chenghong Ji, Weidong Zhou, Guohui Guo, Hui Yin, Zengxiang Yue, Yuqi Sci Rep Article Relation extraction is one of the important steps in building a knowledge graph. Its main objective is to extract semantic relationships from identified entity pairs in sentences, playing a crucial role in semantic understanding and knowledge graph construction. Remote supervised relation extraction aligns knowledge bases with natural language texts and generates labeled data, which alleviates the burden of manually annotating datasets. However, the labeled corpus obtained from remote supervision contains a large amount of noisy data, which greatly affects the training of relation extraction models. In this paper, we propose the hypothesis that key semantic information within the sentence plays a crucial role in entity relation extraction in the task of remote supervised relation extraction. Based on this hypothesis, we divide the sentence into three segments by splitting it according to the positions of entities, starting from within the sentence. Then, using intra-sentence attention mechanisms, we identify fine-grained semantic features within the sentence to reduce the interference of irrelevant noise information. We also improved the intra-bag attention mechanism by setting a threshold gate to filter out low-relevant noisy sentences, minimizing the impact of noise on the relation extraction model, and making full use of available positive semantic information. Experimental results show that the proposed relation extraction model in this paper achieves improvements in precision-recall curve, P@N value, and AUC value compared to existing methods, demonstrating the effectiveness of this model. Nature Publishing Group UK 2023-08-28 /pmc/articles/PMC10462618/ /pubmed/37640843 http://dx.doi.org/10.1038/s41598-023-41354-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Sun, Chenghong Ji, Weidong Zhou, Guohui Guo, Hui Yin, Zengxiang Yue, Yuqi FGSI: distant supervision for relation extraction method based on fine-grained semantic information |
title | FGSI: distant supervision for relation extraction method based on fine-grained semantic information |
title_full | FGSI: distant supervision for relation extraction method based on fine-grained semantic information |
title_fullStr | FGSI: distant supervision for relation extraction method based on fine-grained semantic information |
title_full_unstemmed | FGSI: distant supervision for relation extraction method based on fine-grained semantic information |
title_short | FGSI: distant supervision for relation extraction method based on fine-grained semantic information |
title_sort | fgsi: distant supervision for relation extraction method based on fine-grained semantic information |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462618/ https://www.ncbi.nlm.nih.gov/pubmed/37640843 http://dx.doi.org/10.1038/s41598-023-41354-4 |
work_keys_str_mv | AT sunchenghong fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation AT jiweidong fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation AT zhouguohui fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation AT guohui fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation AT yinzengxiang fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation AT yueyuqi fgsidistantsupervisionforrelationextractionmethodbasedonfinegrainedsemanticinformation |