Cargando…

What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification

The gist can be viewed as an abstract concept that represents only the quintessential meaning derived from a single or multiple sources of information. We live in an age where vast quantities of information are widely available and easily accessible. Identifying the gist contextualises information w...

Descripción completa

Detalles Bibliográficos
Autores principales: Mar, Jaron, Liu, Jiamou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206305/
http://dx.doi.org/10.1007/978-3-030-47426-3_21
_version_ 1783530390623879168
author Mar, Jaron
Liu, Jiamou
author_facet Mar, Jaron
Liu, Jiamou
author_sort Mar, Jaron
collection PubMed
description The gist can be viewed as an abstract concept that represents only the quintessential meaning derived from a single or multiple sources of information. We live in an age where vast quantities of information are widely available and easily accessible. Identifying the gist contextualises information which facilitates the fast disambiguation and prediction of related concepts bringing about a set of natural relationships defined between information sources. In this paper, we investigate and introduce a novel unsupervised gist extraction and quantification framework that represents a computational form of the gist based on notions from fuzzy trace theory. To evaluate our purposed framework, we apply the gist to the task of semantic similarity, specifically to few-shot large document classification where documents on average have a large number of words. The results show our proposed gist representation can effectively capture the essential information from a text document while dramatically reducing the features used.
format Online
Article
Text
id pubmed-7206305
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72063052020-05-08 What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification Mar, Jaron Liu, Jiamou Advances in Knowledge Discovery and Data Mining Article The gist can be viewed as an abstract concept that represents only the quintessential meaning derived from a single or multiple sources of information. We live in an age where vast quantities of information are widely available and easily accessible. Identifying the gist contextualises information which facilitates the fast disambiguation and prediction of related concepts bringing about a set of natural relationships defined between information sources. In this paper, we investigate and introduce a novel unsupervised gist extraction and quantification framework that represents a computational form of the gist based on notions from fuzzy trace theory. To evaluate our purposed framework, we apply the gist to the task of semantic similarity, specifically to few-shot large document classification where documents on average have a large number of words. The results show our proposed gist representation can effectively capture the essential information from a text document while dramatically reducing the features used. 2020-04-17 /pmc/articles/PMC7206305/ http://dx.doi.org/10.1007/978-3-030-47426-3_21 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Mar, Jaron
Liu, Jiamou
What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification
title What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification
title_full What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification
title_fullStr What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification
title_full_unstemmed What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification
title_short What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification
title_sort what’s in a gist? towards an unsupervised gist representation for few-shot large document classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206305/
http://dx.doi.org/10.1007/978-3-030-47426-3_21
work_keys_str_mv AT marjaron whatsinagisttowardsanunsupervisedgistrepresentationforfewshotlargedocumentclassification
AT liujiamou whatsinagisttowardsanunsupervisedgistrepresentationforfewshotlargedocumentclassification