Cargando…
Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams
We describe the deployment of an imperfect NLP-based automatic short answer grading system on an exam in a large-enrollment introductory college course. We characterize this deployment as both high stakes (the questions were on an mid-term exam worth 10% of students’ final grade) and high transparen...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334190/ http://dx.doi.org/10.1007/978-3-030-52237-7_2 |
_version_ | 1783553885448699904 |
---|---|
author | Azad, Sushmita Chen, Binglin Fowler, Maxwell West, Matthew Zilles, Craig |
author_facet | Azad, Sushmita Chen, Binglin Fowler, Maxwell West, Matthew Zilles, Craig |
author_sort | Azad, Sushmita |
collection | PubMed |
description | We describe the deployment of an imperfect NLP-based automatic short answer grading system on an exam in a large-enrollment introductory college course. We characterize this deployment as both high stakes (the questions were on an mid-term exam worth 10% of students’ final grade) and high transparency (the question was graded interactively during the computer-based exam and correct solutions were shown to students that could be compared to their answer). We study two techniques designed to mitigate the potential student dissatisfaction resulting from students incorrectly not granted credit by the imperfect AI grader. We find (1) that providing multiple attempts can eliminate first-attempt false negatives at the cost of additional false positives, and (2) that students not granted credit from the algorithm cannot reliably determine if their answer was mis-scored. |
format | Online Article Text |
id | pubmed-7334190 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73341902020-07-06 Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams Azad, Sushmita Chen, Binglin Fowler, Maxwell West, Matthew Zilles, Craig Artificial Intelligence in Education Article We describe the deployment of an imperfect NLP-based automatic short answer grading system on an exam in a large-enrollment introductory college course. We characterize this deployment as both high stakes (the questions were on an mid-term exam worth 10% of students’ final grade) and high transparency (the question was graded interactively during the computer-based exam and correct solutions were shown to students that could be compared to their answer). We study two techniques designed to mitigate the potential student dissatisfaction resulting from students incorrectly not granted credit by the imperfect AI grader. We find (1) that providing multiple attempts can eliminate first-attempt false negatives at the cost of additional false positives, and (2) that students not granted credit from the algorithm cannot reliably determine if their answer was mis-scored. 2020-06-09 /pmc/articles/PMC7334190/ http://dx.doi.org/10.1007/978-3-030-52237-7_2 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Azad, Sushmita Chen, Binglin Fowler, Maxwell West, Matthew Zilles, Craig Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams |
title | Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams |
title_full | Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams |
title_fullStr | Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams |
title_full_unstemmed | Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams |
title_short | Strategies for Deploying Unreliable AI Graders in High-Transparency High-Stakes Exams |
title_sort | strategies for deploying unreliable ai graders in high-transparency high-stakes exams |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334190/ http://dx.doi.org/10.1007/978-3-030-52237-7_2 |
work_keys_str_mv | AT azadsushmita strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams AT chenbinglin strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams AT fowlermaxwell strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams AT westmatthew strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams AT zillescraig strategiesfordeployingunreliableaigradersinhightransparencyhighstakesexams |