Mathematics > Probability
[Submitted on 6 Oct 2023 (v1), last revised 14 Oct 2024 (this version, v2)]
Title:Markov chain entropy games and the geometry of their Nash equilibria
View PDF HTML (experimental)Abstract:Consider the following two-person mixed strategy game of a probabilist against Nature with respect to the parameters $(f, \mathcal{B},\pi)$, where $f$ is a convex function satisfying certain regularity conditions, $\mathcal{B}$ is either the set $\{L_i\}_{i=1}^n$ or its convex hull with each $L_i$ being a Markov infinitesimal generator on a finite state space $\mathcal{X}$ and $\pi$ is a given positive discrete distribution on $\mathcal{X}$. The probabilist chooses a prior measure $\mu$ within the set of probability measures on $\mathcal{B}$ denoted by $\mathcal{P}(\mathcal{B})$ and picks a $L \in \mathcal{B}$ at random according to $\mu$, whereas Nature follows a pure strategy to select $M \in \mathcal{L}(\pi)$, the set of $\pi$-reversible Markov generators on $\mathcal{X}$. Nature pays an amount $D_f(M||L)$, the $f$-divergence from $L$ to $M$, to the probabilist. We prove that a mixed strategy Nash equilibrium always exists, and establish a minimax result on the expected payoff of the game. This also contrasts with the pure strategy version of the game where we show a Nash equilibrium may not exist. To find approximately a mixed strategy Nash equilibrium, we propose and develop a simple projected subgradient algorithm that provably converges with a rate of $\mathcal{O}(1/\sqrt{t})$, where $t$ is the number of iterations. In addition, we elucidate the relationships of Nash equilibrium with other seemingly disparate notions such as weighted information centroid, Chebyshev center and Bayes risk. This article generalizes the two-person game of a statistician against Nature developed in the literature, and highlights the powerful interplay and synergy between modern Markov chains theory and geometry, information theory, game theory, optimization and mathematical statistics.
Submission history
From: Michael Choi [view email][v1] Fri, 6 Oct 2023 09:29:42 UTC (58 KB)
[v2] Mon, 14 Oct 2024 10:54:42 UTC (58 KB)
Current browse context:
math.IT
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.