VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation

Kumar, Manoj; Babaeizadeh, Mohammad; Erhan, Dumitru; Finn, Chelsea; Levine, Sergey; Dinh, Laurent; Kingma, Durk

arXiv:1903.01434 (cs)

[Submitted on 4 Mar 2019 (v1), last revised 12 Feb 2020 (this version, v3)]

Abstract: Generative models that can model and predict sequences of future events can, in principle, learn to capture complex real-world phenomena, such as physical interactions. However, a central challenge in video prediction is that the future is highly uncertain: a sequence of past observations of events can imply many possible futures. Although a number of recent works have studied probabilistic models that can represent uncertain futures, such models are either extremely expensive computationally, as in the case of pixel-level autoregressive models, or do not directly optimize the likelihood of the data. To our knowledge, our work is the first to propose multi-frame video prediction with normalizing flows, which allows for direct optimization of the data likelihood, and produces high-quality stochastic predictions. We describe an approach for modeling the latent space dynamics, and demonstrate that flow-based generative models offer a viable and competitive approach to generative modelling of video.
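
The abstract's claim that normalizing flows allow direct optimization of the data likelihood rests on the change-of-variables formula: for an invertible map z = f(x), log p(x) = log p(z) + log |det(dz/dx)|. The following is a minimal sketch of that idea only, assuming an elementwise affine map and a standard normal prior. It is not the VideoFlow architecture, and the function names (`affine_flow_forward`, `log_likelihood`, and so on) are hypothetical, chosen for illustration.

```python
import math


def standard_normal_logpdf(z):
    """Log-density of a standard normal prior, summed over dimensions."""
    return sum(-0.5 * (v * v + math.log(2.0 * math.pi)) for v in z)


def affine_flow_forward(x, shift, log_scale):
    """Map x -> z = (x - shift) * exp(-log_scale), elementwise.

    Returns z and log|det(dz/dx)|. For an elementwise map the Jacobian
    is diagonal, so the log-determinant is -sum(log_scale).
    """
    z = [(xi - b) * math.exp(-s) for xi, b, s in zip(x, shift, log_scale)]
    log_det = -sum(log_scale)
    return z, log_det


def affine_flow_inverse(z, shift, log_scale):
    """Invert the map: x = z * exp(log_scale) + shift (used for sampling)."""
    return [zi * math.exp(s) + b for zi, b, s in zip(z, shift, log_scale)]


def log_likelihood(x, shift, log_scale):
    """Exact log p(x) via the change-of-variables formula."""
    z, log_det = affine_flow_forward(x, shift, log_scale)
    return standard_normal_logpdf(z) + log_det


# Illustrative values only (not taken from the paper).
x = [0.5, -1.2]
shift = [0.1, 0.3]
log_scale = [0.2, -0.4]
print(log_likelihood(x, shift, log_scale))
```

Because the map is invertible and its Jacobian determinant is cheap to compute, the log-likelihood is exact and can be maximized directly with gradient-based training; sampling runs the inverse map on draws from the prior.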

Comments:	ICLR 2020 Camera-Ready. Previous title: VideoFlow: A Flow-Based Generative Model for Video
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1903.01434 [cs.CV]
	(or arXiv:1903.01434v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1903.01434

Submission history

From: Manoj Kumar
[v1] Mon, 4 Mar 2019 18:55:45 UTC (929 KB)
[v2] Mon, 10 Jun 2019 17:40:04 UTC (825 KB)
[v3] Wed, 12 Feb 2020 16:55:25 UTC (2,046 KB)
