Cargando…

Linguistic markers of moderate and absolute natural language

In social, personality and mental health research, the tendency to select absolute end-points on Likert scales has been linked to certain cultures, lower intelligence, lower income and personality/mental health disorders. It is unclear whether this response style reflects an absolutist cognitive sty...

Descripción completa

Detalles Bibliográficos
Autores principales: Al-Mosaiwi, Mohammed, Johnstone, Tom
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Pergamon Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6085512/
https://www.ncbi.nlm.nih.gov/pubmed/30393418
http://dx.doi.org/10.1016/j.paid.2018.06.004
Descripción
Sumario:In social, personality and mental health research, the tendency to select absolute end-points on Likert scales has been linked to certain cultures, lower intelligence, lower income and personality/mental health disorders. It is unclear whether this response style reflects an absolutist cognitive style or is merely an experimental artefact. In this study, we introduce an alternative, more informative, flexible and ecologically valid approach for estimating absolute responding, that uses natural language markers. We focussed on ‘function words’ (e.g. particles, conjunctions, prepositions) as they are more generalizable because they do not depend on any specific context. To identify such linguistic markers and test their generalizability, we conducted a text analysis of online reviews for films, tourist attractions and consumer products. All written reviews were accompanied by a rating scale (akin to Likert scale), which allowed us to label text samples as absolute/moderate. The data was split into independent ‘training’ and ‘test’ sets. Using the training set we identified a rank order of linguistic markers for absolute and moderate text, which were evaluated in a classifier on the test set. The top three markers alone (“but”, “!” and “seem”) produced 88% classification accuracy, which increased to 91% using 31 linguistic markers.