Cargando…

The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis

In this study, we demonstrate how supervised learning can extract interpretable survey motivation measurements from a large number of responses to an open-ended question. We manually coded a subsample of 5,000 responses to an open-ended question on survey motivation from the GESIS Panel (25,000 resp...

Descripción completa

Detalles Bibliográficos
Autores principales: Haensch, Anna-Carolina, Weiß, Bernd, Steins, Patricia, Chyrva, Priscilla, Bitz, Katja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9403118/
https://www.ncbi.nlm.nih.gov/pubmed/36035509
http://dx.doi.org/10.3389/fdata.2022.880554
_version_ 1784773300827717632
author Haensch, Anna-Carolina
Weiß, Bernd
Steins, Patricia
Chyrva, Priscilla
Bitz, Katja
author_facet Haensch, Anna-Carolina
Weiß, Bernd
Steins, Patricia
Chyrva, Priscilla
Bitz, Katja
author_sort Haensch, Anna-Carolina
collection PubMed
description In this study, we demonstrate how supervised learning can extract interpretable survey motivation measurements from a large number of responses to an open-ended question. We manually coded a subsample of 5,000 responses to an open-ended question on survey motivation from the GESIS Panel (25,000 responses in total); we utilized supervised machine learning to classify the remaining responses. We can demonstrate that the responses on survey motivation in the GESIS Panel are particularly well suited for automated classification, since they are mostly one-dimensional. The evaluation of the test set also indicates very good overall performance. We present the pre-processing steps and methods we used for our data, and by discussing other popular options that might be more suitable in other cases, we also generalize beyond our use case. We also discuss various minor problems, such as a necessary spelling correction. Finally, we can showcase the analytic potential of the resulting categorization of panelists' motivation through an event history analysis of panel dropout. The analytical results allow a close look at respondents' motivations: they span a wide range, from the urge to help to interest in questions or the incentive and the wish to influence those in power through their participation. We conclude our paper by discussing the re-usability of the hand-coded responses for other surveys, including similar open questions to the GESIS Panel question.
format Online
Article
Text
id pubmed-9403118
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-94031182022-08-26 The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis Haensch, Anna-Carolina Weiß, Bernd Steins, Patricia Chyrva, Priscilla Bitz, Katja Front Big Data Big Data In this study, we demonstrate how supervised learning can extract interpretable survey motivation measurements from a large number of responses to an open-ended question. We manually coded a subsample of 5,000 responses to an open-ended question on survey motivation from the GESIS Panel (25,000 responses in total); we utilized supervised machine learning to classify the remaining responses. We can demonstrate that the responses on survey motivation in the GESIS Panel are particularly well suited for automated classification, since they are mostly one-dimensional. The evaluation of the test set also indicates very good overall performance. We present the pre-processing steps and methods we used for our data, and by discussing other popular options that might be more suitable in other cases, we also generalize beyond our use case. We also discuss various minor problems, such as a necessary spelling correction. Finally, we can showcase the analytic potential of the resulting categorization of panelists' motivation through an event history analysis of panel dropout. The analytical results allow a close look at respondents' motivations: they span a wide range, from the urge to help to interest in questions or the incentive and the wish to influence those in power through their participation. We conclude our paper by discussing the re-usability of the hand-coded responses for other surveys, including similar open questions to the GESIS Panel question. Frontiers Media S.A. 2022-08-11 /pmc/articles/PMC9403118/ /pubmed/36035509 http://dx.doi.org/10.3389/fdata.2022.880554 Text en Copyright © 2022 Haensch, Weiß, Steins, Chyrva and Bitz. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Haensch, Anna-Carolina
Weiß, Bernd
Steins, Patricia
Chyrva, Priscilla
Bitz, Katja
The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis
title The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis
title_full The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis
title_fullStr The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis
title_full_unstemmed The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis
title_short The semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis
title_sort semi-automatic classification of an open-ended question on panel survey motivation and its application in attrition analysis
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9403118/
https://www.ncbi.nlm.nih.gov/pubmed/36035509
http://dx.doi.org/10.3389/fdata.2022.880554
work_keys_str_mv AT haenschannacarolina thesemiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT weißbernd thesemiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT steinspatricia thesemiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT chyrvapriscilla thesemiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT bitzkatja thesemiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT haenschannacarolina semiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT weißbernd semiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT steinspatricia semiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT chyrvapriscilla semiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis
AT bitzkatja semiautomaticclassificationofanopenendedquestiononpanelsurveymotivationanditsapplicationinattritionanalysis