Showing 1–22 of 22 results for author: Amos, B

Searching in archive stat.
  1. arXiv:2406.00288

    cs.LG stat.ML

    Neural Optimal Transport with Lagrangian Costs

    Authors: Aram-Alexandre Pooladian, Carles Domingo-Enrich, Ricky T. Q. Chen, Brandon Amos

    Abstract: We investigate the optimal transport problem between probability measures when the underlying cost function is understood to satisfy a least action principle, also known as a Lagrangian cost. These generalizations are useful when connecting observations from a physical system where the transport dynamics are influenced by the geometry of the system, such as obstacles (e.g., incorporating barrier f…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: UAI 2024

  2. arXiv:2312.05250

    cs.LG cs.AI math.OC stat.ML

    TaskMet: Task-Driven Metric Learning for Model Learning

    Authors: Dishank Bansal, Ricky T. Q. Chen, Mustafa Mukadam, Brandon Amos

    Abstract: Deep learning models are often deployed in downstream tasks that the training procedure may not be aware of. For example, models solely trained to achieve accurate predictions may struggle to perform well on downstream tasks because seemingly small prediction errors may incur drastic task errors. The standard end-to-end learning approach is to make the task loss differentiable or to introduce a di…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  3. arXiv:2312.02027

    math.OC cs.LG math.NA math.PR stat.ML

    Stochastic Optimal Control Matching

    Authors: Carles Domingo-Enrich, Jiequn Han, Brandon Amos, Joan Bruna, Ricky T. Q. Chen

    Abstract: Stochastic optimal control, which has the goal of driving the behavior of noisy systems, is broadly applicable in science, engineering and artificial intelligence. Our work introduces Stochastic Optimal Control Matching (SOCM), a novel Iterative Diffusion Optimization (IDO) technique for stochastic optimal control that stems from the same philosophy as the conditional score matching loss for diffu…

    Submitted 28 June, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  4. arXiv:2210.12153

    cs.LG cs.AI stat.ML

    On amortizing convex conjugates for optimal transport

    Authors: Brandon Amos

    Abstract: This paper focuses on computing the convex conjugate operation that arises when solving Euclidean Wasserstein-2 optimal transport problems. This conjugation, which is also referred to as the Legendre-Fenchel conjugate or c-transform, is considered difficult to compute and in practice, Wasserstein-2 methods are limited by not being able to exactly conjugate the dual potentials in continuous space. To…

    Submitted 1 March, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: ICLR 2023

  5. arXiv:2207.04711

    stat.ML cs.LG

    Matching Normalizing Flows and Probability Paths on Manifolds

    Authors: Heli Ben-Hamu, Samuel Cohen, Joey Bose, Brandon Amos, Aditya Grover, Maximilian Nickel, Ricky T. Q. Chen, Yaron Lipman

    Abstract: Continuous Normalizing Flows (CNFs) are a class of generative models that transform a prior distribution to a model distribution by solving an ordinary differential equation (ODE). We propose to train CNFs on manifolds by minimizing probability path divergence (PPD), a novel family of divergences between the probability density path generated by the CNF and a target probability density path. PPD i…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ICML 2022

  6. arXiv:2206.05262

    cs.LG cs.AI stat.ML

    Meta Optimal Transport

    Authors: Brandon Amos, Samuel Cohen, Giulia Luise, Ievgen Redko

    Abstract: We study the use of amortized optimization to predict optimal transport (OT) maps from the input measures, which we call Meta OT. This helps repeatedly solve similar OT problems between different measures by leveraging the knowledge and information present from past problems to rapidly predict and solve new problems. Otherwise, standard methods ignore the knowledge of the past solutions and subopt…

    Submitted 2 June, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: ICML 2023

  7. arXiv:2203.06832

    cs.LG stat.ML

    Semi-Discrete Normalizing Flows through Differentiable Tessellation

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: Mapping between discrete and continuous distributions is a difficult task and many have had to resort to heuristical approaches. We propose a tessellation-based approach that directly learns quantization boundaries in a continuous space, complete with exact likelihood evaluations. This is done through constructing normalizing flows on convex polytopes parameterized using a simple homeomorphism wit…

    Submitted 11 December, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

    Journal ref: NeurIPS 2022

  8. arXiv:2111.12187

    cs.LG stat.ML

    Input Convex Gradient Networks

    Authors: Jack Richter-Powell, Jonathan Lorraine, Brandon Amos

    Abstract: The gradients of convex functions are expressive models of non-trivial vector fields. For example, Brenier's theorem yields that the optimal transport map between any two measures on Euclidean space under the squared distance is realized as a convex gradient, which is a key insight used in recent generative flow models. In this paper, we study how to model convex gradients by integrating a Jacobia…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: Accepted to NeurIPS 2021 Optimal Transport and Machine Learning Workshop https://otml2021.github.io

  9. arXiv:2110.03684

    cs.LG cs.AI cs.RO stat.ML

    Cross-Domain Imitation Learning via Optimal Transport

    Authors: Arnaud Fickinger, Samuel Cohen, Stuart Russell, Brandon Amos

    Abstract: Cross-domain imitation learning studies how to leverage expert demonstrations of one agent to train an imitation agent with a different embodiment or morphology. Comparing trajectories and stationary distributions between the expert and imitation agents is challenging because they live on different systems that may not even have the same dimensionality. We propose Gromov-Wasserstein Imitation Lear…

    Submitted 25 April, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  10. arXiv:2106.10272

    cs.LG stat.ML

    Riemannian Convex Potential Maps

    Authors: Samuel Cohen, Brandon Amos, Yaron Lipman

    Abstract: Modeling distributions on Riemannian manifolds is a crucial component in understanding non-Euclidean data that arises, e.g., in physics and geology. The budding approaches in this space are limited by representational and computational tradeoffs. We propose and study a class of flows that uses convex potentials from Riemannian optimal transport. These are universal and can model distributions on a…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  11. arXiv:2102.07115

    stat.ML cs.LG

    Sliced Multi-Marginal Optimal Transport

    Authors: Samuel Cohen, Alexander Terenin, Yannik Pitcan, Brandon Amos, Marc Peter Deisenroth, K S Sesh Kumar

    Abstract: Multi-marginal optimal transport enables one to compare multiple probability measures, which increasingly finds application in multi-task learning problems. One practical limitation of multi-marginal transport is computational scalability in the number of measures, samples and dimensionality. In this work, we propose a multi-marginal optimal transport paradigm based on random one-dimensional proje…

    Submitted 23 November, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Journal ref: NeurIPS Workshop on Optimal Transport and Machine Learning, 2021

  12. arXiv:2011.03902

    cs.LG stat.ML

    Learning Neural Event Functions for Ordinary Differential Equations

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: The existing Neural ODE formulation relies on an explicit knowledge of the termination time. We extend Neural ODEs to implicitly defined termination criteria modeled by neural event functions, which can be chained together and differentiated through. Neural Event ODEs are capable of modeling discrete and instantaneous changes in a continuous-time system, without prior knowledge of when these chang…

    Submitted 27 October, 2021; v1 submitted 7 November, 2020; originally announced November 2020.

    Journal ref: ICLR 2021

  13. arXiv:2008.12775

    cs.LG cs.AI cs.RO stat.ML

    On the model-based stochastic value gradient for continuous reinforcement learning

    Authors: Brandon Amos, Samuel Stanton, Denis Yarats, Andrew Gordon Wilson

    Abstract: For over a decade, model-based reinforcement learning has been seen as a way to leverage control-based domain knowledge to improve the sample-efficiency of reinforcement learning agents. While model-based agents are conceptually appealing, their policies tend to lag behind those of model-free agents in terms of final reward, especially in non-trivial environments. In response, researchers have pro…

    Submitted 27 May, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: L4DC 2021

  14. arXiv:2006.12648

    cs.LG stat.ML

    Aligning Time Series on Incomparable Spaces

    Authors: Samuel Cohen, Giulia Luise, Alexander Terenin, Brandon Amos, Marc Peter Deisenroth

    Abstract: Dynamic time warping (DTW) is a useful method for aligning, comparing and combining time series, but it requires them to live in comparable spaces. In this work, we consider a setting in which time series live on different spaces without a sensible ground metric, causing DTW to become ill-defined. To alleviate this, we propose Gromov dynamic time warping (GDTW), a distance between time series on p…

    Submitted 22 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Journal ref: Artificial Intelligence and Statistics, 2021

  15. arXiv:2002.04523

    cs.LG cs.RO stat.ML

    Objective Mismatch in Model-based Reinforcement Learning

    Authors: Nathan Lambert, Brandon Amos, Omry Yadan, Roberto Calandra

    Abstract: Model-based reinforcement learning (MBRL) has been shown to be a powerful framework for data-efficiently learning control of continuous tasks. Recent work in MBRL has mostly focused on using more advanced function approximators and planning schemes, with little development of the general framework. In this paper, we identify a fundamental issue of the standard MBRL framework -- what we call the ob…

    Submitted 18 April, 2021; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: 9 pages, 2 pages references, 5 pages appendices

    Journal ref: Proceedings of the 2nd Conference on Learning for Dynamics and Control, PMLR 120:761-770, 2020

  16. arXiv:1910.12430

    cs.LG math.OC stat.ML

    Differentiable Convex Optimization Layers

    Authors: Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, Zico Kolter

    Abstract: Recent work has shown how to embed differentiable optimization problems (that is, problems whose solutions can be backpropagated through) as layers within deep learning architectures. This method provides a useful inductive bias for certain problems, but existing software for differentiable optimization layers is rigid and difficult to apply to new settings. In this paper, we propose an approach t…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: In NeurIPS 2019. Code available at https://www.github.com/cvxgrp/cvxpylayers. Authors in alphabetical order

  17. arXiv:1910.01741

    cs.LG cs.AI cs.RO stat.ML

    Improving Sample Efficiency in Model-Free Reinforcement Learning from Images

    Authors: Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau, Rob Fergus

    Abstract: Training an agent to solve control tasks directly from high-dimensional images with model-free reinforcement learning (RL) has proven difficult. A promising approach is to learn a latent representation together with the control policy. However, fitting a high-capacity encoder using a scarce reward signal is sample inefficient and leads to poor performance. Prior work has shown that auxiliary losse…

    Submitted 9 July, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

  18. arXiv:1910.01727

    cs.LG stat.ML

    Generalized Inner Loop Meta-Learning

    Authors: Edward Grefenstette, Brandon Amos, Denis Yarats, Phu Mon Htut, Artem Molchanov, Franziska Meier, Douwe Kiela, Kyunghyun Cho, Soumith Chintala

    Abstract: Many (but not all) approaches self-qualifying as "meta-learning" in deep learning and reinforcement learning fit a common pattern of approximating the solution to a nested optimization problem. In this paper, we give a formalization of this shared pattern, which we call GIMLI, prove its general requirements, and derive a general-purpose algorithm for implementing similar approaches. Based on this…

    Submitted 7 October, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: 17 pages, 3 figures, 1 algorithm

  19. arXiv:1909.12830

    cs.LG cs.RO math.OC stat.ML

    The Differentiable Cross-Entropy Method

    Authors: Brandon Amos, Denis Yarats

    Abstract: We study the cross-entropy method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable variant that enables us to differentiate the output of CEM with respect to the objective function's parameters. In the machine learning setting this brings CEM inside of the end-to-end learning pipeline where this has otherwise been impossible.…

    Submitted 14 August, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: ICML 2020

  20. arXiv:1906.08707

    cs.LG cs.CV stat.ML

    The Limited Multi-Label Projection Layer

    Authors: Brandon Amos, Vladlen Koltun, J. Zico Kolter

    Abstract: We propose the Limited Multi-Label (LML) projection layer as a new primitive operation for end-to-end learning systems. The LML layer provides a probabilistic way of modeling multi-label predictions limited to having exactly k labels. We derive efficient forward and backward passes for this layer and show how the layer can be used to optimize the top-k recall for multi-label tasks with incomplete…

    Submitted 14 October, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

  21. arXiv:1810.13400

    cs.LG cs.AI math.OC stat.ML

    Differentiable MPC for End-to-end Planning and Control

    Authors: Brandon Amos, Ivan Dario Jimenez Rodriguez, Jacob Sacks, Byron Boots, J. Zico Kolter

    Abstract: We present foundations for using Model Predictive Control (MPC) as a differentiable policy class for reinforcement learning in continuous state and action spaces. This provides one way of leveraging and combining the advantages of model-free and model-based approaches. Specifically, we differentiate through MPC by using the KKT conditions of the convex approximation at a fixed point of the control…

    Submitted 14 October, 2019; v1 submitted 31 October, 2018; originally announced October 2018.

    Comments: NeurIPS 2018

  22. arXiv:1703.00443

    cs.LG cs.AI math.OC stat.ML

    OptNet: Differentiable Optimization as a Layer in Neural Networks

    Authors: Brandon Amos, J. Zico Kolter

    Abstract: This paper presents OptNet, a network architecture that integrates optimization problems (here, specifically in the form of quadratic programs) as individual layers in larger end-to-end trainable deep networks. These layers encode constraints and complex dependencies between the hidden states that traditional convolutional and fully-connected layers often cannot capture. We explore the foundations…

    Submitted 2 December, 2021; v1 submitted 1 March, 2017; originally announced March 2017.

    Comments: ICML 2017