Cargando…

Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models

Sentiment prediction remains a challenging and unresolved task in various research fields, including psychology, neuroscience, and computer science. This stems from its high degree of subjectivity and limited input sources that can effectively capture the actual sentiment. This can be even more chal...

Descripción completa

Detalles Bibliográficos
Autores principales: Lim, JongYoon, Sa, Inkyu, Ahn, Ho Seok, Gasteiger, Norina, Lee, Sanghyub John, MacDonald, Bruce
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8068876/
https://www.ncbi.nlm.nih.gov/pubmed/33921483
http://dx.doi.org/10.3390/s21082712
_version_ 1783683105379319808
author Lim, JongYoon
Sa, Inkyu
Ahn, Ho Seok
Gasteiger, Norina
Lee, Sanghyub John
MacDonald, Bruce
author_facet Lim, JongYoon
Sa, Inkyu
Ahn, Ho Seok
Gasteiger, Norina
Lee, Sanghyub John
MacDonald, Bruce
author_sort Lim, JongYoon
collection PubMed
description Sentiment prediction remains a challenging and unresolved task in various research fields, including psychology, neuroscience, and computer science. This stems from its high degree of subjectivity and limited input sources that can effectively capture the actual sentiment. This can be even more challenging with only text-based input. Meanwhile, the rise of deep learning and an unprecedented large volume of data have paved the way for artificial intelligence to perform impressively accurate predictions or even human-level reasoning. Drawing inspiration from this, we propose a coverage-based sentiment and subsentence extraction system that estimates a span of input text and recursively feeds this information back to the networks. The predicted subsentence consists of auxiliary information expressing a sentiment. This is an important building block for enabling vivid and epic sentiment delivery (within the scope of this paper) and for other natural language processing tasks such as text summarisation and Q&A. Our approach outperforms the state-of-the-art approaches by a large margin in subsentence prediction (i.e., Average Jaccard scores from 0.72 to 0.89). For the evaluation, we designed rigorous experiments consisting of 24 ablation studies. Finally, our learned lessons are returned to the community by sharing software packages and a public dataset that can reproduce the results presented in this paper.
format Online
Article
Text
id pubmed-8068876
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80688762021-04-26 Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models Lim, JongYoon Sa, Inkyu Ahn, Ho Seok Gasteiger, Norina Lee, Sanghyub John MacDonald, Bruce Sensors (Basel) Article Sentiment prediction remains a challenging and unresolved task in various research fields, including psychology, neuroscience, and computer science. This stems from its high degree of subjectivity and limited input sources that can effectively capture the actual sentiment. This can be even more challenging with only text-based input. Meanwhile, the rise of deep learning and an unprecedented large volume of data have paved the way for artificial intelligence to perform impressively accurate predictions or even human-level reasoning. Drawing inspiration from this, we propose a coverage-based sentiment and subsentence extraction system that estimates a span of input text and recursively feeds this information back to the networks. The predicted subsentence consists of auxiliary information expressing a sentiment. This is an important building block for enabling vivid and epic sentiment delivery (within the scope of this paper) and for other natural language processing tasks such as text summarisation and Q&A. Our approach outperforms the state-of-the-art approaches by a large margin in subsentence prediction (i.e., Average Jaccard scores from 0.72 to 0.89). For the evaluation, we designed rigorous experiments consisting of 24 ablation studies. Finally, our learned lessons are returned to the community by sharing software packages and a public dataset that can reproduce the results presented in this paper. MDPI 2021-04-12 /pmc/articles/PMC8068876/ /pubmed/33921483 http://dx.doi.org/10.3390/s21082712 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lim, JongYoon
Sa, Inkyu
Ahn, Ho Seok
Gasteiger, Norina
Lee, Sanghyub John
MacDonald, Bruce
Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models
title Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models
title_full Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models
title_fullStr Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models
title_full_unstemmed Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models
title_short Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models
title_sort subsentence extraction from text using coverage-based deep learning language models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8068876/
https://www.ncbi.nlm.nih.gov/pubmed/33921483
http://dx.doi.org/10.3390/s21082712
work_keys_str_mv AT limjongyoon subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels
AT sainkyu subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels
AT ahnhoseok subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels
AT gasteigernorina subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels
AT leesanghyubjohn subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels
AT macdonaldbruce subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels