Cargando…
Using Twitter to collect a multi-dialectal corpus of Albanian using advanced geotagging and dialect modeling
In this study, we present the acquisition and categorization of a geographically-informed, multi-dialectal Albanian National Corpus, derived from Twitter data. The primary dialects from three distinct regions—Albania, Kosovo, and North Macedonia—are considered. The assembled publicly available datas...
Autores principales: | Canhasi, Ercan, Shijaku, Rexhep |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10681245/ https://www.ncbi.nlm.nih.gov/pubmed/38011168 http://dx.doi.org/10.1371/journal.pone.0294284 |
Ejemplares similares
-
Clearing the Transcription Hurdle in Dialect Corpus Building: The Corpus of Southern Dutch Dialects as Case Study
por: Ghyselen, Anne-Sophie, et al.
Publicado: (2020) -
Crowdsourcing Dialect Characterization through Twitter
por: Gonçalves, Bruno, et al.
Publicado: (2014) -
Mapping Lexical Dialect Variation in British English Using Twitter
por: Grieve, Jack, et al.
Publicado: (2019) -
Kant's Dialectic
por: Bennett, Jonathan, 1930-
Publicado: (1974) -
English dialects/
por: Brook, G. L. (George Leslie), 1910-
Publicado: (1963)