Cargando…
Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models
Sentiment prediction remains a challenging and unresolved task in various research fields, including psychology, neuroscience, and computer science. This stems from its high degree of subjectivity and limited input sources that can effectively capture the actual sentiment. This can be even more chal...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8068876/ https://www.ncbi.nlm.nih.gov/pubmed/33921483 http://dx.doi.org/10.3390/s21082712 |
_version_ | 1783683105379319808 |
---|---|
author | Lim, JongYoon Sa, Inkyu Ahn, Ho Seok Gasteiger, Norina Lee, Sanghyub John MacDonald, Bruce |
author_facet | Lim, JongYoon Sa, Inkyu Ahn, Ho Seok Gasteiger, Norina Lee, Sanghyub John MacDonald, Bruce |
author_sort | Lim, JongYoon |
collection | PubMed |
description | Sentiment prediction remains a challenging and unresolved task in various research fields, including psychology, neuroscience, and computer science. This stems from its high degree of subjectivity and limited input sources that can effectively capture the actual sentiment. This can be even more challenging with only text-based input. Meanwhile, the rise of deep learning and an unprecedented large volume of data have paved the way for artificial intelligence to perform impressively accurate predictions or even human-level reasoning. Drawing inspiration from this, we propose a coverage-based sentiment and subsentence extraction system that estimates a span of input text and recursively feeds this information back to the networks. The predicted subsentence consists of auxiliary information expressing a sentiment. This is an important building block for enabling vivid and epic sentiment delivery (within the scope of this paper) and for other natural language processing tasks such as text summarisation and Q&A. Our approach outperforms the state-of-the-art approaches by a large margin in subsentence prediction (i.e., Average Jaccard scores from 0.72 to 0.89). For the evaluation, we designed rigorous experiments consisting of 24 ablation studies. Finally, our learned lessons are returned to the community by sharing software packages and a public dataset that can reproduce the results presented in this paper. |
format | Online Article Text |
id | pubmed-8068876 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-80688762021-04-26 Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models Lim, JongYoon Sa, Inkyu Ahn, Ho Seok Gasteiger, Norina Lee, Sanghyub John MacDonald, Bruce Sensors (Basel) Article Sentiment prediction remains a challenging and unresolved task in various research fields, including psychology, neuroscience, and computer science. This stems from its high degree of subjectivity and limited input sources that can effectively capture the actual sentiment. This can be even more challenging with only text-based input. Meanwhile, the rise of deep learning and an unprecedented large volume of data have paved the way for artificial intelligence to perform impressively accurate predictions or even human-level reasoning. Drawing inspiration from this, we propose a coverage-based sentiment and subsentence extraction system that estimates a span of input text and recursively feeds this information back to the networks. The predicted subsentence consists of auxiliary information expressing a sentiment. This is an important building block for enabling vivid and epic sentiment delivery (within the scope of this paper) and for other natural language processing tasks such as text summarisation and Q&A. Our approach outperforms the state-of-the-art approaches by a large margin in subsentence prediction (i.e., Average Jaccard scores from 0.72 to 0.89). For the evaluation, we designed rigorous experiments consisting of 24 ablation studies. Finally, our learned lessons are returned to the community by sharing software packages and a public dataset that can reproduce the results presented in this paper. MDPI 2021-04-12 /pmc/articles/PMC8068876/ /pubmed/33921483 http://dx.doi.org/10.3390/s21082712 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lim, JongYoon Sa, Inkyu Ahn, Ho Seok Gasteiger, Norina Lee, Sanghyub John MacDonald, Bruce Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models |
title | Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models |
title_full | Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models |
title_fullStr | Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models |
title_full_unstemmed | Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models |
title_short | Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models |
title_sort | subsentence extraction from text using coverage-based deep learning language models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8068876/ https://www.ncbi.nlm.nih.gov/pubmed/33921483 http://dx.doi.org/10.3390/s21082712 |
work_keys_str_mv | AT limjongyoon subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels AT sainkyu subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels AT ahnhoseok subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels AT gasteigernorina subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels AT leesanghyubjohn subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels AT macdonaldbruce subsentenceextractionfromtextusingcoveragebaseddeeplearninglanguagemodels |