Cargando…

Dataset of Karakalpak language stop words

The dataset presented in this paper aims to address the challenge of automatic extraction of stop words in Natural Language Processing (NLP) for the low-resource Karakalpak language spoken by approximately two million people in Uzbekistan. To accomplish this, we have created a corpus of 23 Karakalpa...

Descripción completa

Detalles Bibliográficos
Autores principales: Madatov, Khabibulla, Bekchanov, Shukurla, Vičič, Jernej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126844/
https://www.ncbi.nlm.nih.gov/pubmed/37113499
http://dx.doi.org/10.1016/j.dib.2023.109111