Cargando…

Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams

We describe the deployment of an imperfect NLP-based automatic short answer grading system on an exam in a large-enrollment introductory college course. We characterize this deployment as both high stakes (the questions were on an mid-term exam worth 10% of students’ final grade) and high transparen...

Descripción completa

Detalles Bibliográficos
Autores principales: Azad, Sushmita, Chen, Binglin, Fowler, Maxwell, West, Matthew, Zilles, Craig
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334190/
http://dx.doi.org/10.1007/978-3-030-52237-7_2
_version_ 1783553885448699904
author Azad, Sushmita
Chen, Binglin
Fowler, Maxwell
West, Matthew
Zilles, Craig
author_facet Azad, Sushmita
Chen, Binglin
Fowler, Maxwell
West, Matthew
Zilles, Craig
author_sort Azad, Sushmita
collection PubMed
description We describe the deployment of an imperfect NLP-based automatic short answer grading system on an exam in a large-enrollment introductory college course. We characterize this deployment as both high stakes (the questions were on an mid-term exam worth 10% of students’ final grade) and high transparency (the question was graded interactively during the computer-based exam and correct solutions were shown to students that could be compared to their answer). We study two techniques designed to mitigate the potential student dissatisfaction resulting from students incorrectly not granted credit by the imperfect AI grader. We find (1) that providing multiple attempts can eliminate first-attempt false negatives at the cost of additional false positives, and (2) that students not granted credit from the algorithm cannot reliably determine if their answer was mis-scored.
format Online
Article
Text
id pubmed-7334190
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73341902020-07-06 Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams Azad, Sushmita Chen, Binglin Fowler, Maxwell West, Matthew Zilles, Craig Artificial Intelligence in Education Article We describe the deployment of an imperfect NLP-based automatic short answer grading system on an exam in a large-enrollment introductory college course. We characterize this deployment as both high stakes (the questions were on an mid-term exam worth 10% of students’ final grade) and high transparency (the question was graded interactively during the computer-based exam and correct solutions were shown to students that could be compared to their answer). We study two techniques designed to mitigate the potential student dissatisfaction resulting from students incorrectly not granted credit by the imperfect AI grader. We find (1) that providing multiple attempts can eliminate first-attempt false negatives at the cost of additional false positives, and (2) that students not granted credit from the algorithm cannot reliably determine if their answer was mis-scored. 2020-06-09 /pmc/articles/PMC7334190/ http://dx.doi.org/10.1007/978-3-030-52237-7_2 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Azad, Sushmita
Chen, Binglin
Fowler, Maxwell
West, Matthew
Zilles, Craig
Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams
title Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams
title_full Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams
title_fullStr Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams
title_full_unstemmed Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams
title_short Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams
title_sort strategies for deploying unreliable ai graders in high-transparency high-stakes exams
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334190/
http://dx.doi.org/10.1007/978-3-030-52237-7_2
work_keys_str_mv AT azadsushmita strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams
AT chenbinglin strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams
AT fowlermaxwell strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams
AT westmatthew strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams
AT zillescraig strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams