Cargando…

A Semi-automated Approach for Bengali Neologism

Neologisms refer to newly coined words or phrases adopted by a language, and it is a slow but ongoing process that occurs in all languages. Sometimes, rarely used or obsolete words are also considered neologisms. Certain events, such as wars, the emergence of new diseases, or advancements like compu...

Descripción completa

Detalles Bibliográficos
Autor principal: Senapati, Apurbalal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245352/
https://www.ncbi.nlm.nih.gov/pubmed/37304837
http://dx.doi.org/10.1007/s42979-023-01866-2
_version_ 1785054847490326528
author Senapati, Apurbalal
author_facet Senapati, Apurbalal
author_sort Senapati, Apurbalal
collection PubMed
description Neologisms refer to newly coined words or phrases adopted by a language, and it is a slow but ongoing process that occurs in all languages. Sometimes, rarely used or obsolete words are also considered neologisms. Certain events, such as wars, the emergence of new diseases, or advancements like computers and the internet, can trigger the creation of new words or neologisms. The COVID-19 pandemic is one such event that has rapidly led to an explosion of neologisms in the context of the disease and several other social contexts. Even the term COVID-19 itself is a newly coined term. Studying such adaptation or change and quantifying it is essential from a linguistic perspective. However, identifying newly coined terms or extracting neologisms computationally is a challenging task. The standard tools and techniques for finding newly coined terms in English-like languages may not be suitable for Bengali and other Indic languages. This study aims to use a semi-automated approach to investigate the emergence or modification of new words in the Bengali language amidst the COVID-19 pandemic. To conduct this study, a Bengali web corpus was compiled consisting of COVID-19 related articles sourced from various web sources in Bengali. The current experiment focuses solely on COVID-19-related neologisms, but the method can be adapted for general purposes and extended to other languages as well.
format Online
Article
Text
id pubmed-10245352
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-102453522023-06-08 A Semi-automated Approach for Bengali Neologism Senapati, Apurbalal SN Comput Sci Original Research Neologisms refer to newly coined words or phrases adopted by a language, and it is a slow but ongoing process that occurs in all languages. Sometimes, rarely used or obsolete words are also considered neologisms. Certain events, such as wars, the emergence of new diseases, or advancements like computers and the internet, can trigger the creation of new words or neologisms. The COVID-19 pandemic is one such event that has rapidly led to an explosion of neologisms in the context of the disease and several other social contexts. Even the term COVID-19 itself is a newly coined term. Studying such adaptation or change and quantifying it is essential from a linguistic perspective. However, identifying newly coined terms or extracting neologisms computationally is a challenging task. The standard tools and techniques for finding newly coined terms in English-like languages may not be suitable for Bengali and other Indic languages. This study aims to use a semi-automated approach to investigate the emergence or modification of new words in the Bengali language amidst the COVID-19 pandemic. To conduct this study, a Bengali web corpus was compiled consisting of COVID-19 related articles sourced from various web sources in Bengali. The current experiment focuses solely on COVID-19-related neologisms, but the method can be adapted for general purposes and extended to other languages as well. Springer Nature Singapore 2023-06-07 2023 /pmc/articles/PMC10245352/ /pubmed/37304837 http://dx.doi.org/10.1007/s42979-023-01866-2 Text en © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research
Senapati, Apurbalal
A Semi-automated Approach for Bengali Neologism
title A Semi-automated Approach for Bengali Neologism
title_full A Semi-automated Approach for Bengali Neologism
title_fullStr A Semi-automated Approach for Bengali Neologism
title_full_unstemmed A Semi-automated Approach for Bengali Neologism
title_short A Semi-automated Approach for Bengali Neologism
title_sort semi-automated approach for bengali neologism
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245352/
https://www.ncbi.nlm.nih.gov/pubmed/37304837
http://dx.doi.org/10.1007/s42979-023-01866-2
work_keys_str_mv AT senapatiapurbalal asemiautomatedapproachforbengalineologism
AT senapatiapurbalal semiautomatedapproachforbengalineologism