A Unifying Framework for Causal Imitation Learning with Hidden Confounders

Shao, Daqian; Buening, Thomas Kleine; Kwiatkowska, Marta

Computer Science > Machine Learning

arXiv:2502.07656 (cs)

[Submitted on 11 Feb 2025]

Title:A Unifying Framework for Causal Imitation Learning with Hidden Confounders

Authors:Daqian Shao, Thomas Kleine Buening, Marta Kwiatkowska

View PDF HTML (experimental)

Abstract:We propose a general and unifying framework for causal Imitation Learning (IL) with hidden confounders that subsumes several existing confounded IL settings from the literature. Our framework accounts for two types of hidden confounders: (a) those observed by the expert, which thus influence the expert's policy, and (b) confounding noise hidden to both the expert and the IL algorithm. For additional flexibility, we also introduce a confounding noise horizon and time-varying expert-observable hidden variables. We show that causal IL in our framework can be reduced to a set of Conditional Moment Restrictions (CMRs) by leveraging trajectory histories as instruments to learn a history-dependent policy. We propose DML-IL, a novel algorithm that uses instrumental variable regression to solve these CMRs and learn a policy. We provide a bound on the imitation gap for DML-IL, which recovers prior results as special cases. Empirical evaluation on a toy environment with continues state-action spaces and multiple Mujoco tasks demonstrate that DML-IL outperforms state-of-the-art causal IL algorithms.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.07656 [cs.LG]
	(or arXiv:2502.07656v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.07656

Submission history

From: Daqian Shao [view email]
[v1] Tue, 11 Feb 2025 15:43:49 UTC (1,356 KB)

Computer Science > Machine Learning

Title:A Unifying Framework for Causal Imitation Learning with Hidden Confounders

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Unifying Framework for Causal Imitation Learning with Hidden Confounders

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators