Codehacks: A Dataset of Adversarial Tests for Competitive Programming Problems Obtained from Codeforces

Hort, Max; Moonen, Leon

Computer Science > Software Engineering

arXiv:2503.23466 (cs)

[Submitted on 30 Mar 2025]

Title:Codehacks: A Dataset of Adversarial Tests for Competitive Programming Problems Obtained from Codeforces

Authors:Max Hort, Leon Moonen

View PDF HTML (experimental)

Abstract:Software is used in critical applications in our day-to-day life and it is important to ensure its correctness. One popular approach to assess correctness is to evaluate software on tests. If a test fails, it indicates a fault in the software under test; if all tests pass correctly, one may assume that the software is correct. However, the reliability of these results depends on the test suite considered, and there is a risk of false negatives (i.e. software that passes all available tests but contains bugs because some cases are not tested). Therefore, it is important to consider error-inducing test cases when evaluating software.
To support data-driven creation of such a test-suite, which is especially of interest for testing software synthesized from large language models, we curate a dataset (Codehacks) of programming problems together with corresponding error-inducing test cases (i.e., "hacks"). This dataset is collected from the wild, in particular, from the Codeforces online judge platform. The dataset comprises 288,617 hacks for 5,578 programming problems, each with a natural language description, as well as the source code for 2,196 submitted solutions to these problems that can be broken with their corresponding hacks.
Keywords: competitive programming, language model, dataset

Comments:	Accepted for publication at the 18th IEEE International Conference on Software Testing, Verification and Validation (ICST 2025)
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2503.23466 [cs.SE]
	(or arXiv:2503.23466v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2503.23466

Submission history

From: Leon Moonen [view email]
[v1] Sun, 30 Mar 2025 14:50:03 UTC (383 KB)

Computer Science > Software Engineering

Title:Codehacks: A Dataset of Adversarial Tests for Competitive Programming Problems Obtained from Codeforces

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Codehacks: A Dataset of Adversarial Tests for Competitive Programming Problems Obtained from Codeforces

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators