Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

Ben-Michael, Eli; Greiner, D. James; Imai, Kosuke; Jiang, Zhichao

Statistics > Machine Learning

arXiv:2109.11679 (stat)

[Submitted on 22 Sep 2021 (v1), last revised 31 Mar 2025 (this version, v4)]

Title:Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

Authors:Eli Ben-Michael, D. James Greiner, Kosuke Imai, Zhichao Jiang

View PDF HTML (experimental)

Abstract:Algorithmic recommendations and decisions have become ubiquitous in today's society. Many of these data-driven policies, especially in the realm of public policy, are based on known, deterministic rules to ensure their transparency and interpretability. We examine a particular case of algorithmic pre-trial risk assessments in the US criminal justice system, which provide deterministic classification scores and recommendations to help judges make release decisions. Our goal is to analyze data from a unique field experiment on an algorithmic pre-trial risk assessment to investigate whether the scores and recommendations can be improved. Unfortunately, prior methods for policy learning are not applicable because they require existing policies to be stochastic. We develop a maximin robust optimization approach that partially identifies the expected utility of a policy, and then finds a policy that maximizes the worst-case expected utility. The resulting policy has a statistical safety property, limiting the probability of producing a worse policy than the existing one, under structural assumptions about the outcomes. Our analysis of data from the field experiment shows that we can safely improve certain components of the risk assessment instrument by classifying arrestees as lower risk under a wide range of utility specifications, though the analysis is not informative about several components of the instrument.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:2109.11679 [stat.ML]
	(or arXiv:2109.11679v4 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2109.11679

Submission history

From: Eli Ben-Michael [view email]
[v1] Wed, 22 Sep 2021 00:52:03 UTC (192 KB)
[v2] Tue, 14 Dec 2021 16:45:39 UTC (215 KB)
[v3] Tue, 15 Feb 2022 21:08:06 UTC (520 KB)
[v4] Mon, 31 Mar 2025 20:43:58 UTC (415 KB)

Statistics > Machine Learning

Title:Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators