Auditing an Automatic Grading Model with deep Reinforcement Learning

Condor, Aubrey; Pardos, Zachary

arXiv:2405.07087 (cs)

[Submitted on 11 May 2024]

Abstract: We explore the use of deep reinforcement learning to audit an automatic short answer grading (ASAG) model. Automatic grading may decrease the time burden of rating open-ended items for educators, but a lack of robust evaluation methods for these models can result in uncertainty about their quality. Current state-of-the-art ASAG models are configured to match human ratings from a training set, and researchers typically assess their quality with accuracy metrics that signify agreement between model and human scores. In this paper, we show that a high level of agreement with human ratings does not give sufficient evidence that an ASAG model is infallible. We train a reinforcement learning agent to revise student responses with the objective of achieving a high rating from an automatic grading model in the fewest revisions. By analyzing the agent's revised responses that achieve a high grade from the ASAG model but would not be considered high-scoring responses according to a scoring rubric, we discover ways in which the automated grader can be exploited, exposing shortcomings in the grading model.
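The revision objective described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: `toy_grader` is a hypothetical keyword-counting stand-in for an ASAG model, the action set only appends single words, the reward (+1 on reaching the target score, minus a small penalty per revision) is one plausible way to encode "fewest revisions", and `greedy_policy` is a one-step lookahead placeholder rather than a trained deep reinforcement learning agent.

```python
# Hypothetical stand-in for an ASAG model: it only counts rubric keywords,
# so it can be exploited by responses that make no sense.
KEYWORDS = {"energy", "heat", "transfer"}

# Hypothetical action set: each action appends one word to the response.
ACTIONS = sorted(KEYWORDS) + ["the", "because"]


def toy_grader(response: str) -> int:
    """Return a score from 0 to 3: the number of rubric keywords present."""
    return len(KEYWORDS & set(response.lower().split()))


def greedy_policy(response: str) -> str:
    """Placeholder for a learned policy: pick the word that most raises the score."""
    return max(ACTIONS, key=lambda a: toy_grader(f"{response} {a}"))


def run_episode(response, policy, target=3, max_steps=10, step_penalty=0.1):
    """Revise until the grader awards the target score or the step budget runs out.

    Returns (final_response, total_reward, steps_taken).
    """
    total = 0.0
    for step in range(1, max_steps + 1):
        response = f"{response} {policy(response)}"
        total -= step_penalty          # every revision costs a little
        if toy_grader(response) >= target:
            total += 1.0               # bonus for reaching a high grade
            return response, total, step
    return response, total, max_steps


if __name__ == "__main__":
    final, reward, steps = run_episode("it gets warm", greedy_policy)
    print(final, reward, steps)
```

In this toy setting the final response is a string of appended keywords that earns the top score from the stand-in grader, although a human rater following a rubric would not consider it a good answer. That gap between grader score and rubric quality is the kind of behavior the audit is meant to surface.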

Subjects:	Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
Cite as:	arXiv:2405.07087 [cs.AI]
	(or arXiv:2405.07087v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2405.07087

Submission history

From: Aubrey Condor
[v1] Sat, 11 May 2024 20:07:09 UTC (238 KB)
