Learning Visual Predictive Models of Physics for Playing Billiards

Fragkiadaki, Katerina; Agrawal, Pulkit; Levine, Sergey; Malik, Jitendra

Computer Science > Computer Vision and Pattern Recognition

arXiv:1511.07404 (cs)

[Submitted on 23 Nov 2015 (v1), last revised 19 Jan 2016 (this version, v3)]

Title:Learning Visual Predictive Models of Physics for Playing Billiards

Authors:Katerina Fragkiadaki, Pulkit Agrawal, Sergey Levine, Jitendra Malik

View PDF

Abstract:The ability to plan and execute goal specific actions in varied, unexpected settings is a central requirement of intelligent agents. In this paper, we explore how an agent can be equipped with an internal model of the dynamics of the external world, and how it can use this model to plan novel actions by running multiple internal simulations ("visual imagination"). Our models directly process raw visual input, and use a novel object-centric prediction formulation based on visual glimpses centered on objects (fixations) to enforce translational invariance of the learned physical laws. The agent gathers training data through random interaction with a collection of different environments, and the resulting model can then be used to plan goal-directed actions in novel environments that the agent has not seen before. We demonstrate that our agent can accurately plan actions for playing a simulated billiards game, which requires pushing a ball into a target position or into collision with another ball.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1511.07404 [cs.CV]
	(or arXiv:1511.07404v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1511.07404

Submission history

From: Katerina Fragkiadaki [view email]
[v1] Mon, 23 Nov 2015 20:27:48 UTC (2,604 KB)
[v2] Mon, 11 Jan 2016 20:44:39 UTC (2,832 KB)
[v3] Tue, 19 Jan 2016 20:58:24 UTC (2,608 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2015-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Katerina Fragkiadaki
Pulkit Agrawal
Sergey Levine
Jitendra Malik

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Visual Predictive Models of Physics for Playing Billiards

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Visual Predictive Models of Physics for Playing Billiards

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators