Unicode-8 based linguistics data set of annotated Sindhi text

The Sindhi Unicode-8 based linguistics data set is a multi-class and multi-featured data set. It is developed to solve the natural language processing (NLP) and linguistics problems of the Sindhi language. The data set presents information on the grammatical and morphological structure of Sindhi language text as...

Bibliographic Details
Main Authors: Dootio, Mazhar Ali; Wagan, Asim Imdad
Format: Online Article Text
Language: English
Published: Elsevier 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6139473/
https://www.ncbi.nlm.nih.gov/pubmed/30225294
http://dx.doi.org/10.1016/j.dib.2018.05.062