Planning from Pixels in Atari with Learned Symbolic Representations

Dittadi, Andrea; Drachmann, Frederik K.; Bolander, Thomas

Computer Science > Artificial Intelligence

arXiv:2012.09126 (cs)

[Submitted on 16 Dec 2020 (v1), last revised 15 Mar 2021 (this version, v2)]

Title:Planning from Pixels in Atari with Learned Symbolic Representations

Authors:Andrea Dittadi, Frederik K. Drachmann, Thomas Bolander

View PDF

Abstract:Width-based planning methods have been shown to yield state-of-the-art performance in the Atari 2600 domain using pixel input. One successful approach, RolloutIW, represents states with the B-PROST boolean feature set. An augmented version of RolloutIW, $\pi$-IW, shows that learned features can be competitive with handcrafted ones for width-based search. In this paper, we leverage variational autoencoders (VAEs) to learn features directly from pixels in a principled manner, and without supervision. The inference model of the trained VAEs extracts boolean features from pixels, and RolloutIW plans with these features. The resulting combination outperforms the original RolloutIW and human professional play on Atari 2600 and drastically reduces the size of the feature set.

Comments:	Published at AAAI 2021
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2012.09126 [cs.AI]
	(or arXiv:2012.09126v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2012.09126

Submission history

From: Andrea Dittadi [view email]
[v1] Wed, 16 Dec 2020 18:15:11 UTC (1,145 KB)
[v2] Mon, 15 Mar 2021 17:20:05 UTC (848 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2020-12

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Thomas Bolander

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Planning from Pixels in Atari with Learned Symbolic Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Planning from Pixels in Atari with Learned Symbolic Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators