Cargando…
Online reading habits can reveal personality traits: towards detecting psychological microtargeting
Building on big data from Reddit, we generated two computational text models: (i) Predicting the personality of users from the text they have written and (ii) predicting the personality of users based on the text they have consumed. The second model is novel and without precedent in the literature....
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10276193/ https://www.ncbi.nlm.nih.gov/pubmed/37333766 http://dx.doi.org/10.1093/pnasnexus/pgad191 |
_version_ | 1785060024384487424 |
---|---|
author | Simchon, Almog Sutton, Adam Edwards, Matthew Lewandowsky, Stephan |
author_facet | Simchon, Almog Sutton, Adam Edwards, Matthew Lewandowsky, Stephan |
author_sort | Simchon, Almog |
collection | PubMed |
description | Building on big data from Reddit, we generated two computational text models: (i) Predicting the personality of users from the text they have written and (ii) predicting the personality of users based on the text they have consumed. The second model is novel and without precedent in the literature. We recruited active Reddit users ([Formula: see text]) of fiction-writing communities. The participants completed a Big Five personality questionnaire and consented for their Reddit activity to be scraped and used to create a machine learning model. We trained an natural language processing model [Bidirectional Encoder Representations from Transformers (BERT)], predicting personality from produced text (average performance: [Formula: see text]). We then applied this model to a new set of Reddit users ([Formula: see text]), predicted their personality based on their produced text, and trained a second BERT model to predict their predicted-personality scores based on consumed text (average performance: [Formula: see text]). By doing so, we provide the first glimpse into the linguistic markers of personality-congruent consumed content. |
format | Online Article Text |
id | pubmed-10276193 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-102761932023-06-18 Online reading habits can reveal personality traits: towards detecting psychological microtargeting Simchon, Almog Sutton, Adam Edwards, Matthew Lewandowsky, Stephan PNAS Nexus Social and Political Sciences Building on big data from Reddit, we generated two computational text models: (i) Predicting the personality of users from the text they have written and (ii) predicting the personality of users based on the text they have consumed. The second model is novel and without precedent in the literature. We recruited active Reddit users ([Formula: see text]) of fiction-writing communities. The participants completed a Big Five personality questionnaire and consented for their Reddit activity to be scraped and used to create a machine learning model. We trained an natural language processing model [Bidirectional Encoder Representations from Transformers (BERT)], predicting personality from produced text (average performance: [Formula: see text]). We then applied this model to a new set of Reddit users ([Formula: see text]), predicted their personality based on their produced text, and trained a second BERT model to predict their predicted-personality scores based on consumed text (average performance: [Formula: see text]). By doing so, we provide the first glimpse into the linguistic markers of personality-congruent consumed content. Oxford University Press 2023-06-07 /pmc/articles/PMC10276193/ /pubmed/37333766 http://dx.doi.org/10.1093/pnasnexus/pgad191 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of National Academy of Sciences. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Social and Political Sciences Simchon, Almog Sutton, Adam Edwards, Matthew Lewandowsky, Stephan Online reading habits can reveal personality traits: towards detecting psychological microtargeting |
title | Online reading habits can reveal personality traits: towards detecting psychological microtargeting |
title_full | Online reading habits can reveal personality traits: towards detecting psychological microtargeting |
title_fullStr | Online reading habits can reveal personality traits: towards detecting psychological microtargeting |
title_full_unstemmed | Online reading habits can reveal personality traits: towards detecting psychological microtargeting |
title_short | Online reading habits can reveal personality traits: towards detecting psychological microtargeting |
title_sort | online reading habits can reveal personality traits: towards detecting psychological microtargeting |
topic | Social and Political Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10276193/ https://www.ncbi.nlm.nih.gov/pubmed/37333766 http://dx.doi.org/10.1093/pnasnexus/pgad191 |
work_keys_str_mv | AT simchonalmog onlinereadinghabitscanrevealpersonalitytraitstowardsdetectingpsychologicalmicrotargeting AT suttonadam onlinereadinghabitscanrevealpersonalitytraitstowardsdetectingpsychologicalmicrotargeting AT edwardsmatthew onlinereadinghabitscanrevealpersonalitytraitstowardsdetectingpsychologicalmicrotargeting AT lewandowskystephan onlinereadinghabitscanrevealpersonalitytraitstowardsdetectingpsychologicalmicrotargeting |