Cargando…

Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction

Sentence-level relation extraction (RE) has a highly imbalanced data distribution that about 80% of data are labeled as negative, i.e., no relation; and there exist minority classes (MC) among positive labels; furthermore, some of MC instances have an incorrect label. Due to those challenges, i.e.,...

Descripción completa

Detalles Bibliográficos
Autores principales: Baek, Hyeong-Ryeol, Choi, Yong-Suk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269806/
https://www.ncbi.nlm.nih.gov/pubmed/35808404
http://dx.doi.org/10.3390/s22134911
_version_ 1784744311919738880
author Baek, Hyeong-Ryeol
Choi, Yong-Suk
author_facet Baek, Hyeong-Ryeol
Choi, Yong-Suk
author_sort Baek, Hyeong-Ryeol
collection PubMed
description Sentence-level relation extraction (RE) has a highly imbalanced data distribution that about 80% of data are labeled as negative, i.e., no relation; and there exist minority classes (MC) among positive labels; furthermore, some of MC instances have an incorrect label. Due to those challenges, i.e., label noise and low source availability, most of the models fail to learn MC and get zero or very low F1 scores on MCs. Previous studies, however, have rather focused on micro F1 scores and MCs have not been addressed adequately. To tackle high mis-classification errors for MCs, we introduce (1) a minority class attention module (MCAM), and (2) effective augmentation methods specialized in RE. MCAM calculates the confidence scores on MC instances to select reliable ones for augmentation, and aggregates MCs information in the process of training a model. Our experiments show that our methods achieve a state-of-the-art F1 scores on TACRED as well as enhancing minority class F1 score dramatically.
format Online
Article
Text
id pubmed-9269806
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-92698062022-07-09 Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction Baek, Hyeong-Ryeol Choi, Yong-Suk Sensors (Basel) Article Sentence-level relation extraction (RE) has a highly imbalanced data distribution that about 80% of data are labeled as negative, i.e., no relation; and there exist minority classes (MC) among positive labels; furthermore, some of MC instances have an incorrect label. Due to those challenges, i.e., label noise and low source availability, most of the models fail to learn MC and get zero or very low F1 scores on MCs. Previous studies, however, have rather focused on micro F1 scores and MCs have not been addressed adequately. To tackle high mis-classification errors for MCs, we introduce (1) a minority class attention module (MCAM), and (2) effective augmentation methods specialized in RE. MCAM calculates the confidence scores on MC instances to select reliable ones for augmentation, and aggregates MCs information in the process of training a model. Our experiments show that our methods achieve a state-of-the-art F1 scores on TACRED as well as enhancing minority class F1 score dramatically. MDPI 2022-06-29 /pmc/articles/PMC9269806/ /pubmed/35808404 http://dx.doi.org/10.3390/s22134911 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Baek, Hyeong-Ryeol
Choi, Yong-Suk
Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction
title Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction
title_full Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction
title_fullStr Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction
title_full_unstemmed Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction
title_short Enhancing Targeted Minority Class Prediction in Sentence-Level Relation Extraction
title_sort enhancing targeted minority class prediction in sentence-level relation extraction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269806/
https://www.ncbi.nlm.nih.gov/pubmed/35808404
http://dx.doi.org/10.3390/s22134911
work_keys_str_mv AT baekhyeongryeol enhancingtargetedminorityclasspredictioninsentencelevelrelationextraction
AT choiyongsuk enhancingtargetedminorityclasspredictioninsentencelevelrelationextraction