Unicode-8 based linguistics data set of annotated Sindhi text
The Sindhi Unicode-8 based linguistics data set is a multi-class, multi-featured data set. It was developed to address natural language processing (NLP) and linguistics problems of the Sindhi language. The data set presents information on the grammatical and morphological structure of Sindhi language text as...
Main authors: | , |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2018 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6139473/ https://www.ncbi.nlm.nih.gov/pubmed/30225294 http://dx.doi.org/10.1016/j.dib.2018.05.062 |