Showing 1–16 of 16 results for author: Jonschkowski, R

  1. arXiv:2111.12594  [pdf, other]

    cs.CV cs.LG stat.ML

    Conditional Object-Centric Learning from Video

    Authors: Thomas Kipf, Gamaleldin F. Elsayed, Aravindh Mahendran, Austin Stone, Sara Sabour, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff

    Abstract: Object-centric representations are a promising path toward more systematic generalization by providing flexible abstractions upon which compositional world models can be built. Recent work on simple 2D and 3D datasets has shown that models with object-centric inductive biases can learn to segment and represent meaningful objects from the statistical structure of the data alone without the need for…

    Submitted 15 March, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Published at ICLR 2022. Project page at https://slot-attention-video.github.io/

  2. arXiv:2105.07014  [pdf, other]

    cs.CV

    SMURF: Self-Teaching Multi-Frame Unsupervised RAFT with Full-Image Warping

    Authors: Austin Stone, Daniel Maurer, Alper Ayvaci, Anelia Angelova, Rico Jonschkowski

    Abstract: We present SMURF, a method for unsupervised learning of optical flow that improves state of the art on all benchmarks by $36\%$ to $40\%$ (over the prior best method UFlow) and even outperforms several supervised approaches such as PWC-Net and FlowNet2. Our method integrates architecture improvements from supervised optical flow, i.e. the RAFT model, with new ideas for unsupervised learning that i…

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: Accepted at CVPR 2021, all code available at https://github.com/google-research/google-research/tree/master/smurf

  3. arXiv:2104.08212  [pdf, other]

    cs.RO cs.LG

    MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale

    Authors: Dmitry Kalashnikov, Jacob Varley, Yevgen Chebotar, Benjamin Swanson, Rico Jonschkowski, Chelsea Finn, Sergey Levine, Karol Hausman

    Abstract: General-purpose robotic systems must master a large repertoire of diverse skills to be useful in a range of daily tasks. While reinforcement learning provides a powerful framework for acquiring individual behaviors, the time needed to acquire each skill makes the prospect of a generalist robot trained with RL daunting. In this paper, we study how a large-scale collective robotic learning system ca…

    Submitted 27 April, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

  4. arXiv:2104.07135  [pdf, other]

    cs.CV

    Adaptive Intermediate Representations for Video Understanding

    Authors: Juhana Kangaspunta, AJ Piergiovanni, Rico Jonschkowski, Michael Ryoo, Anelia Angelova

    Abstract: A common strategy to video understanding is to incorporate spatial and motion information by fusing features derived from RGB frames and optical flow. In this work, we introduce a new way to leverage semantic segmentation as an intermediate representation for video understanding and use it in a way that requires no additional labeling. Second, we propose a general framework which learns the inte…

    Submitted 14 April, 2021; originally announced April 2021.

  5. arXiv:2101.02722  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    The Distracting Control Suite -- A Challenging Benchmark for Reinforcement Learning from Pixels

    Authors: Austin Stone, Oscar Ramirez, Kurt Konolige, Rico Jonschkowski

    Abstract: Robots have to face challenging perceptual settings, including changes in viewpoint, lighting, and background. Current simulated reinforcement learning (RL) benchmarks such as DM Control provide visual input without such complexity, which limits the transfer of well-performing methods to the real world. In this paper, we extend DM Control with three kinds of visual distractions (variations in back…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: Code available at https://github.com/google-research/google-research/tree/master/distracting_control

  6. arXiv:2011.10287  [pdf, other]

    cs.CV cs.LG

    Learning Object-Centric Video Models by Contrasting Sets

    Authors: Sindy Löwe, Klaus Greff, Rico Jonschkowski, Alexey Dosovitskiy, Thomas Kipf

    Abstract: Contrastive, self-supervised learning of object representations recently emerged as an attractive alternative to reconstruction-based training. Prior approaches focus on contrasting individual object representations (slots) against one another. However, a fundamental problem with this approach is that the overall contrastive loss is the same for (i) representing a different object in each slot, as…

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020 Workshop on Object Representations for Learning and Reasoning

  7. arXiv:2011.07318  [pdf, other]

    cs.LG cs.AI

    A Geometric Perspective on Self-Supervised Policy Adaptation

    Authors: Cristian Bodnar, Karol Hausman, Gabriel Dulac-Arnold, Rico Jonschkowski

    Abstract: One of the most challenging aspects of real-world reinforcement learning (RL) is the multitude of unpredictable and ever-changing distractions that could divert an agent from what was tasked to do in its training environment. While an agent could learn from reward signals to ignore them, the complexity of the real-world can make rewards hard to acquire, or, at best, extremely sparse. A recent clas…

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Contains 17 pages, 18 figures

  8. arXiv:2006.04902  [pdf, other]

    cs.CV cs.LG eess.IV

    What Matters in Unsupervised Optical Flow

    Authors: Rico Jonschkowski, Austin Stone, Jonathan T. Barron, Ariel Gordon, Kurt Konolige, Anelia Angelova

    Abstract: We systematically compare and analyze a set of key components in unsupervised optical flow to identify which photometric loss, occlusion handling, and smoothness regularization is most effective. Alongside this investigation we construct a number of novel improvements to unsupervised flow models, such as cost volume normalization, stopping the gradient at the occlusion mask, encouraging smoothness…

    Submitted 14 August, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted at ECCV 2020 (Oral). Source code is available at https://github.com/google-research/google-research/tree/master/uflow

  9. arXiv:2005.09530  [pdf, other]

    cs.CV cs.LG cs.RO

    Differentiable Mapping Networks: Learning Structured Map Representations for Sparse Visual Localization

    Authors: Peter Karkus, Anelia Angelova, Vincent Vanhoucke, Rico Jonschkowski

    Abstract: Mapping and localization, preferably from a small number of observations, are fundamental tasks in robotics. We address these tasks by combining spatial structure (differentiable mapping) and end-to-end learning in a novel neural network architecture: the Differentiable Mapping Network (DMN). The DMN constructs a spatially structured view-embedding map and uses it for subsequent visual localizatio…

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: ICRA 2020

  10. arXiv:2004.11938  [pdf, other]

    cs.LG cs.RO stat.ML

    Towards Differentiable Resampling

    Authors: Michael Zhu, Kevin Murphy, Rico Jonschkowski

    Abstract: Resampling is a key component of sample-based recursive state estimation in particle filters. Recent work explores differentiable particle filters for end-to-end learning. However, resampling remains a challenge in these works, as it is inherently non-differentiable. We address this challenge by replacing traditional resampling with a learned neural network resampler. We present a novel network ar…

    Submitted 24 April, 2020; originally announced April 2020.

  11. arXiv:1912.02805  [pdf, other]

    cs.CV cs.LG cs.RO

    KeyPose: Multi-View 3D Labeling and Keypoint Estimation for Transparent Objects

    Authors: Xingyu Liu, Rico Jonschkowski, Anelia Angelova, Kurt Konolige

    Abstract: Estimating the 3D pose of desktop objects is crucial for applications such as robotic manipulation. Many existing approaches to this problem require a depth map of the object for both training and prediction, which restricts them to opaque, lambertian objects that produce good returns in an RGBD sensor. In this paper we forgo using a depth sensor in favor of raw stereo input. We address two proble…

    Submitted 18 May, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: CVPR 2020

  12. arXiv:1909.12950  [pdf, other]

    cs.CV cs.LG cs.RO stat.ML

    Towards Object Detection from Motion

    Authors: Rico Jonschkowski, Austin Stone

    Abstract: We present a novel approach to weakly supervised object detection. Instead of annotated images, our method only requires two short videos to learn to detect a new object: 1) a video of a moving object and 2) one or more "negative" videos of the scene without the object. The key idea of our algorithm is to train the object detector to produce physically plausible object motion when applied to the f…

    Submitted 17 September, 2019; originally announced September 2019.

  13. arXiv:1904.04998  [pdf, other]

    cs.CV cs.GR cs.LG cs.RO

    Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

    Authors: Ariel Gordon, Hanhan Li, Rico Jonschkowski, Anelia Angelova

    Abstract: We present a novel method for simultaneous learning of depth, egomotion, object motion, and camera intrinsics from monocular videos, using only consistency across neighboring video frames as supervision signal. Similarly to prior work, our method learns by applying differentiable warping to frames and comparing the result to adjacent ones, but it provides several improvements: We address occlusion…

    Submitted 10 April, 2019; originally announced April 2019.

    Journal ref: The IEEE International Conference on Computer Vision (ICCV), 2019, pp. 8977-8986

  14. arXiv:1805.11122  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Differentiable Particle Filters: End-to-End Learning with Algorithmic Priors

    Authors: Rico Jonschkowski, Divyam Rastogi, Oliver Brock

    Abstract: We present differentiable particle filters (DPFs): a differentiable implementation of the particle filter algorithm with learnable motion and measurement models. Since DPFs are end-to-end differentiable, we can efficiently train their models by optimizing end-to-end state estimation performance, rather than proxy objectives such as model accuracy. DPFs encode the structure of recursive state estim…

    Submitted 29 May, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Accepted at Robotics: Science and Systems 2018 (http://www.roboticsconference.org)

  15. arXiv:1705.09805  [pdf, other]

    cs.RO cs.CV cs.LG

    PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations

    Authors: Rico Jonschkowski, Roland Hafner, Jonathan Scholz, Martin Riedmiller

    Abstract: We propose position-velocity encoders (PVEs) which learn---without supervision---to encode images to positions and velocities of task-relevant objects. PVEs encode a single image into a low-dimensional position state and compute the velocity state from finite differences in position. In contrast to autoencoders, position-velocity encoders are not trained by image reconstruction, but by making the…

    Submitted 24 July, 2017; v1 submitted 27 May, 2017; originally announced May 2017.

    Comments: Accepted at Robotics: Science and Systems (RSS 2017) Workshop -- New Frontiers for Deep Learning in Robotics http://juxi.net/workshop/deep-learning-rss-2017/

  16. arXiv:1511.06429  [pdf, other]

    cs.LG stat.ML

    Patterns for Learning with Side Information

    Authors: Rico Jonschkowski, Sebastian Höfer, Oliver Brock

    Abstract: Supervised, semi-supervised, and unsupervised learning estimate a function given input/output samples. Generalization of the learned function to unseen data can be improved by incorporating side information into learning. Side information are data that are neither from the input space nor from the output space of the function, but include useful information for learning it. In this paper we show t…

    Submitted 10 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: The first two authors contributed equally to this work