Cargando…
What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification
The gist can be viewed as an abstract concept that represents only the quintessential meaning derived from a single or multiple sources of information. We live in an age where vast quantities of information are widely available and easily accessible. Identifying the gist contextualises information w...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206305/ http://dx.doi.org/10.1007/978-3-030-47426-3_21 |
_version_ | 1783530390623879168 |
---|---|
author | Mar, Jaron Liu, Jiamou |
author_facet | Mar, Jaron Liu, Jiamou |
author_sort | Mar, Jaron |
collection | PubMed |
description | The gist can be viewed as an abstract concept that represents only the quintessential meaning derived from a single or multiple sources of information. We live in an age where vast quantities of information are widely available and easily accessible. Identifying the gist contextualises information which facilitates the fast disambiguation and prediction of related concepts bringing about a set of natural relationships defined between information sources. In this paper, we investigate and introduce a novel unsupervised gist extraction and quantification framework that represents a computational form of the gist based on notions from fuzzy trace theory. To evaluate our purposed framework, we apply the gist to the task of semantic similarity, specifically to few-shot large document classification where documents on average have a large number of words. The results show our proposed gist representation can effectively capture the essential information from a text document while dramatically reducing the features used. |
format | Online Article Text |
id | pubmed-7206305 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72063052020-05-08 What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification Mar, Jaron Liu, Jiamou Advances in Knowledge Discovery and Data Mining Article The gist can be viewed as an abstract concept that represents only the quintessential meaning derived from a single or multiple sources of information. We live in an age where vast quantities of information are widely available and easily accessible. Identifying the gist contextualises information which facilitates the fast disambiguation and prediction of related concepts bringing about a set of natural relationships defined between information sources. In this paper, we investigate and introduce a novel unsupervised gist extraction and quantification framework that represents a computational form of the gist based on notions from fuzzy trace theory. To evaluate our purposed framework, we apply the gist to the task of semantic similarity, specifically to few-shot large document classification where documents on average have a large number of words. The results show our proposed gist representation can effectively capture the essential information from a text document while dramatically reducing the features used. 2020-04-17 /pmc/articles/PMC7206305/ http://dx.doi.org/10.1007/978-3-030-47426-3_21 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Mar, Jaron Liu, Jiamou What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification |
title | What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification |
title_full | What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification |
title_fullStr | What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification |
title_full_unstemmed | What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification |
title_short | What’s in a Gist? Towards an Unsupervised Gist Representation for Few-Shot Large Document Classification |
title_sort | what’s in a gist? towards an unsupervised gist representation for few-shot large document classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206305/ http://dx.doi.org/10.1007/978-3-030-47426-3_21 |
work_keys_str_mv | AT marjaron whatsinagisttowardsanunsupervisedgistrepresentationforfewshotlargedocumentclassification AT liujiamou whatsinagisttowardsanunsupervisedgistrepresentationforfewshotlargedocumentclassification |