Cargando…
The importance of Term Weighting in semantic understanding of text: A review of techniques
In this paper we review a wide spectrum of techniques which have been proposed in literature to enable acceptable recognition of language and text by machines. We discuss many techniques which have been proposed by researchers in the field of term weighting and explore the mathematical foundations o...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9007265/ https://www.ncbi.nlm.nih.gov/pubmed/35437420 http://dx.doi.org/10.1007/s11042-022-12538-3 |
_version_ | 1784686811479539712 |
---|---|
author | Rathi, R. N. Mustafi, A. |
author_facet | Rathi, R. N. Mustafi, A. |
author_sort | Rathi, R. N. |
collection | PubMed |
description | In this paper we review a wide spectrum of techniques which have been proposed in literature to enable acceptable recognition of language and text by machines. We discuss many techniques which have been proposed by researchers in the field of term weighting and explore the mathematical foundations of these methods. Term weighting schemes have broadly been classified as supervised and statistical methods and we present numerous examples from both categories to highlight the difference in approaches between the two broad categories. We pay particular attention to the Vector Space Model and its variants which form the basis of many of the other methods which have been discussed in the paper. |
format | Online Article Text |
id | pubmed-9007265 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-90072652022-04-14 The importance of Term Weighting in semantic understanding of text: A review of techniques Rathi, R. N. Mustafi, A. Multimed Tools Appl 1222: Intelligent Multimedia Data Analytics and Computing In this paper we review a wide spectrum of techniques which have been proposed in literature to enable acceptable recognition of language and text by machines. We discuss many techniques which have been proposed by researchers in the field of term weighting and explore the mathematical foundations of these methods. Term weighting schemes have broadly been classified as supervised and statistical methods and we present numerous examples from both categories to highlight the difference in approaches between the two broad categories. We pay particular attention to the Vector Space Model and its variants which form the basis of many of the other methods which have been discussed in the paper. Springer US 2022-04-13 2023 /pmc/articles/PMC9007265/ /pubmed/35437420 http://dx.doi.org/10.1007/s11042-022-12538-3 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | 1222: Intelligent Multimedia Data Analytics and Computing Rathi, R. N. Mustafi, A. The importance of Term Weighting in semantic understanding of text: A review of techniques |
title | The importance of Term Weighting in semantic understanding of text: A review of techniques |
title_full | The importance of Term Weighting in semantic understanding of text: A review of techniques |
title_fullStr | The importance of Term Weighting in semantic understanding of text: A review of techniques |
title_full_unstemmed | The importance of Term Weighting in semantic understanding of text: A review of techniques |
title_short | The importance of Term Weighting in semantic understanding of text: A review of techniques |
title_sort | importance of term weighting in semantic understanding of text: a review of techniques |
topic | 1222: Intelligent Multimedia Data Analytics and Computing |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9007265/ https://www.ncbi.nlm.nih.gov/pubmed/35437420 http://dx.doi.org/10.1007/s11042-022-12538-3 |
work_keys_str_mv | AT rathirn theimportanceoftermweightinginsemanticunderstandingoftextareviewoftechniques AT mustafia theimportanceoftermweightinginsemanticunderstandingoftextareviewoftechniques AT rathirn importanceoftermweightinginsemanticunderstandingoftextareviewoftechniques AT mustafia importanceoftermweightinginsemanticunderstandingoftextareviewoftechniques |