Cargando…

Morpheme Matching Based Text Tokenization for a Scarce Resourced Language

Text tokenization is a fundamental pre-processing step for almost all the information processing applications. This task is nontrivial for the scarce resourced languages such as Urdu, as there is inconsistent use of space between words. In this paper a morpheme matching based approach has been propo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rehman, Zobia, Anwar, Waqas, Bajwa, Usama Ijaz, Xuan, Wang, Chaoying, Zhou
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2013
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3749178/ https://www.ncbi.nlm.nih.gov/pubmed/23990871 http://dx.doi.org/10.1371/journal.pone.0068178

Ejemplares similares

Language and Reading: the Role of Morpheme and Phoneme Awareness
por: Duncan, Lynne G.
Publicado: (2018)

Morpheme Ordering Across Languages Reflects Optimization for Processing Efficiency
por: Hahn, Michael, et al.
Publicado: (2022)

Connectivity, Not Frequency, Determines the Fate of a Morpheme
por: Keller, Daniela Barbara, et al.
Publicado: (2013)

Word Formation Is Aware of Morpheme Family Size
por: Keller, Daniela Barbara, et al.
Publicado: (2014)

A Multifaceted Independent Performance Analysis of Facial Subspace Recognition Algorithms
por: Bajwa, Usama Ijaz, et al.
Publicado: (2013)

The Tokens
por: Reid, Greg
Publicado: (2019)

Can grammatical morphemes be taught? Evidence of gestures influencing second language procedural learning in middle childhood
por: Janzen Ulbricht, Natasha
Publicado: (2023)

Pandemic Dementia Scarce Resource Allocation
por: Smith, Eric E., et al.
Publicado: (2020)

Fair and diverse allocation of scarce resources
por: Anahideh, Hadis, et al.
Publicado: (2022)

Morpheme-Based Reading and Writing in Spanish Children with Dyslexia
por: Suárez-Coalla, Paz, et al.
Publicado: (2017)

Real-World Matching Performance of Deidentified Record-Linking Tokens
por: Bernstam, Elmer V., et al.
Publicado: (2022)

Length of Utterance, in Morphemes or in Words?: MLU3-w, a Reliable Measure of Language Development in Early Basque
por: Ezeizabarrena, Maria-José, et al.
Publicado: (2018)

Beautiful Token
Publicado: (1888)

Neural encoding and production of functional morphemes in the posterior temporal lobe
por: Lee, Daniel K., et al.
Publicado: (2018)

The role of syllables and morphemes in silent reading: An eye-tracking study
por: De Simone, Elisabetta, et al.
Publicado: (2023)

Medical resources are scarce, but theories about their allocation are not
por: Basson, Marc D.
Publicado: (2020)

Lightweight ResGRU: a deep learning-based prediction of SARS-CoV-2 (COVID-19) and its severity classification using multimodal chest radiography images
por: Ahmad, Mughees, et al.
Publicado: (2023)

Improving the quality of chemical language model outcomes with atom-in-SMILES tokenization
por: Ucak, Umit V., et al.
Publicado: (2023)

The Segmentation of Sub-Lexical Morphemes in English-Learning 15-Month-Olds
por: Mintz, Toben H.
Publicado: (2013)

Herniotomy in resource-scarce environment: Comparison of incisions and techniques
por: Ibrahim, Musa, et al.
Publicado: (2015)

Age, “Life-Cycles,” and the Allocation of Scarce Medical Resources
por: May, Thomas, et al.
Publicado: (2020)

Competitive Analysis of the Online Leasing Problem for Scarce Resources
por: Lu, Jiamin, et al.
Publicado: (2023)

Oral Health Workforce in Africa: A Scarce Resource
por: Gallagher, Jennifer E., et al.
Publicado: (2023)

What Is the Token Economy?
por: Voshmgir, Shermin
Publicado: (2019)

Inside the Token-Ring
por: Haugdahl, J Scott
Publicado: (1987)

Chapter VIII. Of Tokens
Publicado: (1894)

Correction: Improving the quality of chemical language model outcomes with atom-in-SMILES tokenization
por: Ucak, Umit V., et al.
Publicado: (2023)

The token economy : a review and evaluation /
por: Kazdin, Alan E.
Publicado: (1977)

An fMRI Study of Grammatical Morpheme Processing Associated with Nouns and Verbs in Chinese
por: Yu, Xi, et al.
Publicado: (2013)

The Form of Morphemes: MEG Evidence From Masked Priming of Two Hebrew Templates
por: Kastner, Itamar, et al.
Publicado: (2018)

Sensitivity to Inflectional Morphemes in the Absence of Meaning: Evidence from a Novel Task
por: Cilibrasi, Luca, et al.
Publicado: (2019)

Morpheme Position Coding in Reading Development as Explored With a Letter Search Task
por: Hasenäcker, Jana, et al.
Publicado: (2021)

The pattern of name tokens in narrative clinical text and a comparison of five systems for redacting them
por: Kayaalp, Mehmet, et al.
Publicado: (2014)

Interrogating scarcity: how to think about ‘resource-scarce settings’
por: Schrecker, Ted
Publicado: (2013)

The token economy : a motivational system for therapy and rehabilitation
por: Ayllon, Teodoro, 1929-
Publicado: (1968)

The IBM token-ring network
por: Computer Technology Research Corp. New York
Publicado: (1987)

WLCG Token Usage and Discovery
por: Dack, Tom
Publicado: (2021)

WLCG Token Usage and Discovery
por: Bockelman, Brian, et al.
Publicado: (2021)

Nonfungible Tokens in Plastic Surgery
por: Tian, William M., et al.
Publicado: (2022)

Morpheme Analysis Associated with German Noun Plural Endings among Second Language (L2) Learners Using Event-Related Potentials (ERPs)
por: Son, Guiyoung
Publicado: (2020)

Cannot write session to /tmp/vufind_sessions/sess_19h2ai7i2prisbgtg9k4pltojr