Skip to main content

Showing 1–14 of 14 results for author: Whitney, W F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19800  [pdf, other

    cs.LG cs.RO

    Modeling the Real World with High-Density Visual Particle Dynamics

    Authors: William F. Whitney, Jacob Varley, Deepali Jain, Krzysztof Choromanski, Sumeet Singh, Vikas Sindhwani

    Abstract: We present High-Density Visual Particle Dynamics (HD-VPD), a learned world model that can emulate the physical dynamics of real scenes by processing massive latent point clouds containing 100K+ particles. To enable efficiency at this scale, we introduce a novel family of Point Cloud Transformers (PCTs) called Interlacers leveraging intertwined linear-attention Performer layers and graph-based neig… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2405.14045  [pdf, other

    cs.LG cs.CV

    Learning rigid-body simulators over implicit shapes for large-scale scenes and vision

    Authors: Yulia Rubanova, Tatiana Lopez-Guevara, Kelsey R. Allen, William F. Whitney, Kimberly Stachenfeld, Tobias Pfaff

    Abstract: Simulating large scenes with many rigid objects is crucial for a variety of applications, such as robotics, engineering, film and video games. Rigid interactions are notoriously hard to model: small changes to the initial state or the simulation parameters can lead to large changes in the final state. Recently, learned simulators based on graph networks (GNNs) were developed as an alternative to h… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2401.11985  [pdf, other

    cs.LG cs.CV cs.RO

    Scaling Face Interaction Graph Networks to Real World Scenes

    Authors: Tatiana Lopez-Guevara, Yulia Rubanova, William F. Whitney, Tobias Pfaff, Kimberly Stachenfeld, Kelsey R. Allen

    Abstract: Accurately simulating real world object dynamics is essential for various applications such as robotics, engineering, graphics, and design. To better capture complex real dynamics such as contact and friction, learned simulators based on graph networks have recently shown great promise. However, applying these learned simulators to real scenes comes with two major challenges: first, scaling learne… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 16 pages, 12 figures

  4. arXiv:2312.05359  [pdf, other

    cs.LG

    Learning 3D Particle-based Simulators from RGB-D Videos

    Authors: William F. Whitney, Tatiana Lopez-Guevara, Tobias Pfaff, Yulia Rubanova, Thomas Kipf, Kimberly Stachenfeld, Kelsey R. Allen

    Abstract: Realistic simulation is critical for applications ranging from robotics to animation. Traditional analytic simulators sometimes struggle to capture sufficiently realistic simulation which can lead to problems including the well known "sim-to-real" gap in robotics. Learned simulators have emerged as an alternative for better capturing real-world physical dynamics, but require access to privileged g… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  5. arXiv:2309.07578  [pdf, other

    cs.LG cs.AI cs.RO

    Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning

    Authors: Cristina Pinneri, Sarah Bechtle, Markus Wulfmeier, Arunkumar Byravan, **gwei Zhang, William F. Whitney, Martin Riedmiller

    Abstract: We present a novel approach to address the challenge of generalization in offline reinforcement learning (RL), where the agent learns from a fixed dataset without any additional interaction with the environment. Specifically, we aim to improve the agent's ability to generalize to out-of-distribution goals. To achieve this, we propose to learn a dynamics model and check if it is equivariant with re… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  6. arXiv:2112.00950  [pdf, other

    cs.LG stat.ML

    Quantile Filtered Imitation Learning

    Authors: David Brandfonbrener, William F. Whitney, Rajesh Ranganath, Joan Bruna

    Abstract: We introduce quantile filtered imitation learning (QFIL), a novel policy improvement operator designed for offline reinforcement learning. QFIL performs policy improvement by running imitation learning on a filtered version of the offline dataset. The filtering process removes $ s,a $ pairs whose estimated Q values fall below a given quantile of the pushforward distribution over values induced by… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2021

  7. arXiv:2106.08909  [pdf, other

    cs.LG stat.ML

    Offline RL Without Off-Policy Evaluation

    Authors: David Brandfonbrener, William F. Whitney, Rajesh Ranganath, Joan Bruna

    Abstract: Most prior approaches to offline reinforcement learning (RL) have taken an iterative actor-critic approach involving off-policy evaluation. In this paper we show that simply doing one step of constrained/regularized policy improvement using an on-policy Q estimate of the behavior policy performs surprisingly well. This one-step algorithm beats the previously reported results of iterative algorithm… ▽ More

    Submitted 3 December, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Thirty-fifth Conference on Neural Information Processing Systems, 2021

  8. arXiv:2101.09458  [pdf, other

    cs.LG

    Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning

    Authors: William F. Whitney, Michael Bloesch, Jost Tobias Springenberg, Abbas Abdolmaleki, Kyunghyun Cho, Martin Riedmiller

    Abstract: Despite the close connection between exploration and sample efficiency, most state of the art reinforcement learning algorithms include no considerations for exploration beyond maximizing the entropy of the policy. In this work we address this seeming missed opportunity. We observe that the most common formulation of directed exploration in deep RL, known as bonus-based exploration (BBE), suffers… ▽ More

    Submitted 1 July, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

  9. arXiv:2009.07368  [pdf, other

    cs.LG cs.AI stat.ML

    Evaluating representations by the complexity of learning low-loss predictors

    Authors: William F. Whitney, Min Jae Song, David Brandfonbrener, Jaan Altosaar, Kyunghyun Cho

    Abstract: We consider the problem of evaluating representations of data for use in solving a downstream task. We propose to measure the quality of a representation by the complexity of learning a predictor on top of the representation that achieves low loss on a task of interest, and introduce two methods, surplus description length (SDL) and $\varepsilon$ sample complexity ($\varepsilon$SC). In contrast to… ▽ More

    Submitted 5 February, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

  10. arXiv:2006.15368  [pdf, other

    cs.LG stat.ML

    Offline Contextual Bandits with Overparameterized Models

    Authors: David Brandfonbrener, William F. Whitney, Rajesh Ranganath, Joan Bruna

    Abstract: Recent results in supervised learning suggest that while overparameterized models have the capacity to overfit, they in fact generalize quite well. We ask whether the same phenomenon occurs for offline contextual bandits. Our results are mixed. Value-based algorithms benefit from the same generalization behavior as overparameterized supervised learning, but policy-based algorithms do not. We show… ▽ More

    Submitted 16 June, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  11. arXiv:1901.05590  [pdf, other

    cs.LG cs.CV stat.ML

    Disentangling Video with Independent Prediction

    Authors: William F. Whitney, Rob Fergus

    Abstract: We propose an unsupervised variational model for disentangling video into independent factors, i.e. each factor's future can be predicted from its past without considering the others. We show that our approach often learns factors which are interpretable as objects in a scene.

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: Presented at the Learning Disentangled Representations: from Perception to Control workshop at NIPS 2017

  12. arXiv:1705.07177  [pdf, other

    cs.AI

    Model-Based Planning with Discrete and Continuous Actions

    Authors: Mikael Henaff, William F. Whitney, Yann LeCun

    Abstract: Action planning using learned and differentiable forward models of the world is a general approach which has a number of desirable properties, including improved sample complexity over model-free RL methods, reuse of learned models across different tasks, and the ability to perform efficient gradient-based optimization in continuous action spaces. However, this approach does not apply straightforw… ▽ More

    Submitted 4 April, 2018; v1 submitted 19 May, 2017; originally announced May 2017.

  13. arXiv:1702.06230  [pdf, other

    cs.LG cs.AI

    Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning

    Authors: Vlad Firoiu, William F. Whitney, Joshua B. Tenenbaum

    Abstract: There has been a recent explosion in the capabilities of game-playing artificial intelligence. Many classes of RL tasks, from Atari games to motor control to board games, are now solvable by fairly generic algorithms, based on deep learning, that learn to play from experience with minimal knowledge of the specific domain of interest. In this work, we will investigate the performance of these metho… ▽ More

    Submitted 8 May, 2017; v1 submitted 20 February, 2017; originally announced February 2017.

    MSC Class: I.2.6

  14. arXiv:1602.06822  [pdf, other

    cs.LG

    Understanding Visual Concepts with Continuation Learning

    Authors: William F. Whitney, Michael Chang, Tejas Kulkarni, Joshua B. Tenenbaum

    Abstract: We introduce a neural network architecture and a learning algorithm to produce factorized symbolic representations. We propose to learn these concepts by observing consecutive frames, letting all the components of the hidden representation except a small discrete set (gating units) be predicted from the previous frame, and let the factors of variation in the next frame be represented entirely by t… ▽ More

    Submitted 22 February, 2016; originally announced February 2016.

    Comments: Under review as a workshop paper for ICLR 2016