Structured Object-Aware Physics Prediction for Video Modeling and Planning

Kossen, Jannik; Stelzner, Karl; Hussing, Marcel; Voelcker, Claas; Kersting, Kristian

Computer Science > Machine Learning

arXiv:1910.02425 (cs)

[Submitted on 6 Oct 2019 (v1), last revised 12 Feb 2020 (this version, v2)]

Title:Structured Object-Aware Physics Prediction for Video Modeling and Planning

Authors:Jannik Kossen, Karl Stelzner, Marcel Hussing, Claas Voelcker, Kristian Kersting

View PDF

Abstract:When humans observe a physical system, they can easily locate objects, understand their interactions, and anticipate future behavior, even in settings with complicated and previously unseen interactions. For computers, however, learning such models from videos in an unsupervised fashion is an unsolved research problem. In this paper, we present STOVE, a novel state-space model for videos, which explicitly reasons about objects and their positions, velocities, and interactions. It is constructed by combining an image model and a dynamics model in compositional manner and improves on previous work by reusing the dynamics model for inference, accelerating and regularizing training. STOVE predicts videos with convincing physical behavior over hundreds of timesteps, outperforms previous unsupervised models, and even approaches the performance of supervised baselines. We further demonstrate the strength of our model as a simulator for sample efficient model-based control in a task with heavily interacting objects.

Comments:	Published as a conference paper at 2020 International Conference for Learning Representations
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1910.02425 [cs.LG]
	(or arXiv:1910.02425v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1910.02425

Submission history

From: Jannik Kossen [view email]
[v1] Sun, 6 Oct 2019 11:48:26 UTC (526 KB)
[v2] Wed, 12 Feb 2020 09:38:20 UTC (633 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-10

Change to browse by:

cs
cs.LG
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Karl Stelzner
Kristian Kersting

export BibTeX citation

Computer Science > Machine Learning

Title:Structured Object-Aware Physics Prediction for Video Modeling and Planning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Structured Object-Aware Physics Prediction for Video Modeling and Planning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators