Assessing Human Error Against a Benchmark of Perfection

Anderson, Ashton; Kleinberg, Jon; Mullainathan, Sendhil

doi:10.1145/2939672.2939803

Computer Science > Artificial Intelligence

arXiv:1606.04956 (cs)

[Submitted on 15 Jun 2016]

Title:Assessing Human Error Against a Benchmark of Perfection

Authors:Ashton Anderson, Jon Kleinberg, Sendhil Mullainathan

View PDF

Abstract:An increasing number of domains are providing us with detailed trace data on human decisions in settings where we can evaluate the quality of these decisions via an algorithm. Motivated by this development, an emerging line of work has begun to consider whether we can characterize and predict the kinds of decisions where people are likely to make errors.
To investigate what a general framework for human error prediction might look like, we focus on a model system with a rich history in the behavioral sciences: the decisions made by chess players as they select moves in a game. We carry out our analysis at a large scale, employing datasets with several million recorded games, and using chess tablebases to acquire a form of ground truth for a subset of chess positions that have been completely solved by computers but remain challenging even for the best players in the world.
We organize our analysis around three categories of features that we argue are present in most settings where the analysis of human error is applicable: the skill of the decision-maker, the time available to make the decision, and the inherent difficulty of the decision. We identify rich structure in all three of these categories of features, and find strong evidence that in our domain, features describing the inherent difficulty of an instance are significantly more powerful than features based on skill or time.

Comments:	KDD 2016; 10 pages
Subjects:	Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Social and Information Networks (cs.SI)
Cite as:	arXiv:1606.04956 [cs.AI]
	(or arXiv:1606.04956v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1606.04956
Related DOI:	https://doi.org/10.1145/2939672.2939803

Submission history

From: Ashton Anderson [view email]
[v1] Wed, 15 Jun 2016 20:00:32 UTC (79 KB)

Computer Science > Artificial Intelligence

Title:Assessing Human Error Against a Benchmark of Perfection

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Assessing Human Error Against a Benchmark of Perfection

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators