Corpus creation and language identification for code-mixed Indonesian-Javanese-English Tweets
With the massive use of social media today, mixing between languages in social media text is prevalent. In linguistics, the phenomenon of mixing languages is known as code-mixing. The prevalence of code-mixing exposes various concerns and challenges in natural language processing (NLP), including la...
Main authors: | Hidayatullah, Ahmad Fathan, Apong, Rosyzie Anna, Lai, Daphne T.C., Qazi, Atika |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | PeerJ Inc. 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10319257/ https://www.ncbi.nlm.nih.gov/pubmed/37409088 http://dx.doi.org/10.7717/peerj-cs.1312 |
Similar items
- A natural language processing based technique for sentiment analysis of college english corpus
  by: Xu, Jingjing
  Published: (2023)
- RuSentiTweet: a sentiment analysis dataset of general domain tweets in Russian
  by: Smetanin, Sergey
  Published: (2022)
- Multi-label emotion classification of Urdu tweets
  by: Ashraf, Noman, et al.
  Published: (2022)
- Stop voicing contrast in American English: Data of individual speakers in trochaic and iambic words in different prosodic structural contexts
  by: Kim, Sahyang, et al.
  Published: (2018)
- Coarticulatory vowel nasalization in American English: Data of individual differences in acoustic realization of vowel nasalization as a function of prosodic prominence and boundary
  by: Kim, Daejin, et al.
  Published: (2019)