Morpheme Matching Based Text Tokenization for a Scarce Resourced Language
Text tokenization is a fundamental pre-processing step for almost all information processing applications. This task is nontrivial for scarce-resourced languages such as Urdu, where spaces are used inconsistently between words. In this paper a morpheme matching based approach has been propo...
Main Authors: | Rehman, Zobia; Anwar, Waqas; Bajwa, Usama Ijaz; Xuan, Wang; Chaoying, Zhou |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2013 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3749178/ https://www.ncbi.nlm.nih.gov/pubmed/23990871 http://dx.doi.org/10.1371/journal.pone.0068178 |
Similar Items
- Language and Reading: the Role of Morpheme and Phoneme Awareness
  by: Duncan, Lynne G.
  Published: (2018)
- Morpheme Ordering Across Languages Reflects Optimization for Processing Efficiency
  by: Hahn, Michael, et al.
  Published: (2022)
- Connectivity, Not Frequency, Determines the Fate of a Morpheme
  by: Keller, Daniela Barbara, et al.
  Published: (2013)
- Word Formation Is Aware of Morpheme Family Size
  by: Keller, Daniela Barbara, et al.
  Published: (2014)
- A Multifaceted Independent Performance Analysis of Facial Subspace Recognition Algorithms
  by: Bajwa, Usama Ijaz, et al.
  Published: (2013)