Cargando…

Fooling Automatic Short Answer Grading Systems

With the rising success of adversarial attacks on many NLP tasks, systems which actually operate in an adversarial scenario need to be reevaluated. For this purpose, we pose the following research question: How difficult is it to fool automatic short answer grading systems? In particular, we investi...

Descripción completa

Detalles Bibliográficos
Autores principales: Filighera, Anna, Steuer, Tim, Rensing, Christoph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334174/
http://dx.doi.org/10.1007/978-3-030-52237-7_15
_version_ 1783553881615106048
author Filighera, Anna
Steuer, Tim
Rensing, Christoph
author_facet Filighera, Anna
Steuer, Tim
Rensing, Christoph
author_sort Filighera, Anna
collection PubMed
description With the rising success of adversarial attacks on many NLP tasks, systems which actually operate in an adversarial scenario need to be reevaluated. For this purpose, we pose the following research question: How difficult is it to fool automatic short answer grading systems? In particular, we investigate the robustness of the state of the art automatic short answer grading system proposed by Sung et al. towards cheating in the form of universal adversarial trigger employment. These are short token sequences that can be prepended to students’ answers in an exam to artificially improve their automatically assigned grade. Such triggers are especially critical as they can easily be used by anyone once they are found. In our experiments, we discovered triggers which allow students to pass exams with passing thresholds of [Formula: see text] without answering a single question correctly. Furthermore, we show that such triggers generalize across models and datasets in this scenario, nullifying the defense strategy of keeping grading models or data secret.
format Online
Article
Text
id pubmed-7334174
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73341742020-07-06 Fooling Automatic Short Answer Grading Systems Filighera, Anna Steuer, Tim Rensing, Christoph Artificial Intelligence in Education Article With the rising success of adversarial attacks on many NLP tasks, systems which actually operate in an adversarial scenario need to be reevaluated. For this purpose, we pose the following research question: How difficult is it to fool automatic short answer grading systems? In particular, we investigate the robustness of the state of the art automatic short answer grading system proposed by Sung et al. towards cheating in the form of universal adversarial trigger employment. These are short token sequences that can be prepended to students’ answers in an exam to artificially improve their automatically assigned grade. Such triggers are especially critical as they can easily be used by anyone once they are found. In our experiments, we discovered triggers which allow students to pass exams with passing thresholds of [Formula: see text] without answering a single question correctly. Furthermore, we show that such triggers generalize across models and datasets in this scenario, nullifying the defense strategy of keeping grading models or data secret. 2020-06-09 /pmc/articles/PMC7334174/ http://dx.doi.org/10.1007/978-3-030-52237-7_15 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Filighera, Anna
Steuer, Tim
Rensing, Christoph
Fooling Automatic Short Answer Grading Systems
title Fooling Automatic Short Answer Grading Systems
title_full Fooling Automatic Short Answer Grading Systems
title_fullStr Fooling Automatic Short Answer Grading Systems
title_full_unstemmed Fooling Automatic Short Answer Grading Systems
title_short Fooling Automatic Short Answer Grading Systems
title_sort fooling automatic short answer grading systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334174/
http://dx.doi.org/10.1007/978-3-030-52237-7_15
work_keys_str_mv AT filigheraanna foolingautomaticshortanswergradingsystems
AT steuertim foolingautomaticshortanswergradingsystems
AT rensingchristoph foolingautomaticshortanswergradingsystems