Cargando…
Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features
Automatic personality recognition from source code is a scarcely explored problem. We propose personality recognition with handcrafted features, based on lexical, syntactic and semantic properties of source code. Out of 35 proposed features, 22 features are completely novel. We also show that n-gram...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302843/ http://dx.doi.org/10.1007/978-3-030-50417-5_26 |
_version_ | 1783547933680992256 |
---|---|
author | Biel, Mikołaj Kuta, Marcin Kitowski, Jacek |
author_facet | Biel, Mikołaj Kuta, Marcin Kitowski, Jacek |
author_sort | Biel, Mikołaj |
collection | PubMed |
description | Automatic personality recognition from source code is a scarcely explored problem. We propose personality recognition with handcrafted features, based on lexical, syntactic and semantic properties of source code. Out of 35 proposed features, 22 features are completely novel. We also show that n-gram features are simple but surprisingly good predictors of personality and present results arising from joint usage of both handcrafted and baseline features. Additionally we compare our results with scores obtained within the Personality Recognition in SOurce COde track during Forum for Information Retrieval Evaluation 2016 and set up state-of-the-art results for conscientiousness and neuroticism traits. |
format | Online Article Text |
id | pubmed-7302843 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73028432020-06-19 Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features Biel, Mikołaj Kuta, Marcin Kitowski, Jacek Computational Science – ICCS 2020 Article Automatic personality recognition from source code is a scarcely explored problem. We propose personality recognition with handcrafted features, based on lexical, syntactic and semantic properties of source code. Out of 35 proposed features, 22 features are completely novel. We also show that n-gram features are simple but surprisingly good predictors of personality and present results arising from joint usage of both handcrafted and baseline features. Additionally we compare our results with scores obtained within the Personality Recognition in SOurce COde track during Forum for Information Retrieval Evaluation 2016 and set up state-of-the-art results for conscientiousness and neuroticism traits. 2020-06-15 /pmc/articles/PMC7302843/ http://dx.doi.org/10.1007/978-3-030-50417-5_26 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Biel, Mikołaj Kuta, Marcin Kitowski, Jacek Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features |
title | Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features |
title_full | Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features |
title_fullStr | Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features |
title_full_unstemmed | Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features |
title_short | Personality Recognition from Source Code Based on Lexical, Syntactic and Semantic Features |
title_sort | personality recognition from source code based on lexical, syntactic and semantic features |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302843/ http://dx.doi.org/10.1007/978-3-030-50417-5_26 |
work_keys_str_mv | AT bielmikołaj personalityrecognitionfromsourcecodebasedonlexicalsyntacticandsemanticfeatures AT kutamarcin personalityrecognitionfromsourcecodebasedonlexicalsyntacticandsemanticfeatures AT kitowskijacek personalityrecognitionfromsourcecodebasedonlexicalsyntacticandsemanticfeatures |