Cargando…

The importance of Term Weighting in semantic understanding of text: A review of techniques

In this paper we review a wide spectrum of techniques which have been proposed in literature to enable acceptable recognition of language and text by machines. We discuss many techniques which have been proposed by researchers in the field of term weighting and explore the mathematical foundations o...

Descripción completa

Detalles Bibliográficos
Autores principales: Rathi, R. N., Mustafi, A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9007265/
https://www.ncbi.nlm.nih.gov/pubmed/35437420
http://dx.doi.org/10.1007/s11042-022-12538-3
_version_ 1784686811479539712
author Rathi, R. N.
Mustafi, A.
author_facet Rathi, R. N.
Mustafi, A.
author_sort Rathi, R. N.
collection PubMed
description In this paper we review a wide spectrum of techniques which have been proposed in literature to enable acceptable recognition of language and text by machines. We discuss many techniques which have been proposed by researchers in the field of term weighting and explore the mathematical foundations of these methods. Term weighting schemes have broadly been classified as supervised and statistical methods and we present numerous examples from both categories to highlight the difference in approaches between the two broad categories. We pay particular attention to the Vector Space Model and its variants which form the basis of many of the other methods which have been discussed in the paper.
format Online
Article
Text
id pubmed-9007265
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-90072652022-04-14 The importance of Term Weighting in semantic understanding of text: A review of techniques Rathi, R. N. Mustafi, A. Multimed Tools Appl 1222: Intelligent Multimedia Data Analytics and Computing In this paper we review a wide spectrum of techniques which have been proposed in literature to enable acceptable recognition of language and text by machines. We discuss many techniques which have been proposed by researchers in the field of term weighting and explore the mathematical foundations of these methods. Term weighting schemes have broadly been classified as supervised and statistical methods and we present numerous examples from both categories to highlight the difference in approaches between the two broad categories. We pay particular attention to the Vector Space Model and its variants which form the basis of many of the other methods which have been discussed in the paper. Springer US 2022-04-13 2023 /pmc/articles/PMC9007265/ /pubmed/35437420 http://dx.doi.org/10.1007/s11042-022-12538-3 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle 1222: Intelligent Multimedia Data Analytics and Computing
Rathi, R. N.
Mustafi, A.
The importance of Term Weighting in semantic understanding of text: A review of techniques
title The importance of Term Weighting in semantic understanding of text: A review of techniques
title_full The importance of Term Weighting in semantic understanding of text: A review of techniques
title_fullStr The importance of Term Weighting in semantic understanding of text: A review of techniques
title_full_unstemmed The importance of Term Weighting in semantic understanding of text: A review of techniques
title_short The importance of Term Weighting in semantic understanding of text: A review of techniques
title_sort importance of term weighting in semantic understanding of text: a review of techniques
topic 1222: Intelligent Multimedia Data Analytics and Computing
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9007265/
https://www.ncbi.nlm.nih.gov/pubmed/35437420
http://dx.doi.org/10.1007/s11042-022-12538-3
work_keys_str_mv AT rathirn theimportanceoftermweightinginsemanticunderstandingoftextareviewoftechniques
AT mustafia theimportanceoftermweightinginsemanticunderstandingoftextareviewoftechniques
AT rathirn importanceoftermweightinginsemanticunderstandingoftextareviewoftechniques
AT mustafia importanceoftermweightinginsemanticunderstandingoftextareviewoftechniques