Skip to main content

Showing 1–13 of 13 results for author: Vlastelica, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18917  [pdf, other

    cs.LG cs.AI cs.RO

    Causal Action Influence Aware Counterfactual Data Augmentation

    Authors: Núria Armengol Urpí, Marco Bagatella, Marin Vlastelica, Georg Martius

    Abstract: Offline data are both valuable and practical resources for teaching robots complex behaviors. Ideally, learning agents should not be constrained by the scarcity of available demonstrations, but rather generalize beyond the training distribution. However, the complexity of real-world scenarios typically requires huge amounts of data to prevent neural network policies from picking up on spurious cor… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted in 41st International Conference on Machine Learning (ICML 2024)

  2. arXiv:2310.02440  [pdf, other

    cs.RO cs.AI

    Learning Diverse Skills for Local Navigation under Multi-constraint Optimality

    Authors: ** Cheng, Marin Vlastelica, Pavel Kolev, Chenhao Li, Georg Martius

    Abstract: Despite many successful applications of data-driven control in robotics, extracting meaningful diverse behaviors remains a challenge. Typically, task performance needs to be compromised in order to achieve diversity. In many scenarios, task requirements are specified as a multitude of reward terms, each requiring a different trade-off. In this work, we take a constrained optimization viewpoint on… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 7 pages, 6 figures, in submission to ICRA 2024

  3. arXiv:2309.05582  [pdf, other

    cs.LG cs.AI cs.RO

    Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning

    Authors: Marin Vlastelica, Sebastian Blaes, Cristina Pineri, Georg Martius

    Abstract: We introduce a simple but effective method for managing risk in model-based reinforcement learning with trajectory sampling that involves probabilistic safety constraints and balancing of optimism in the face of epistemic uncertainty and pessimism in the face of aleatoric uncertainty of an ensemble of stochastic neural networks.Various experiments indicate that the separation of uncertainties is e… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  4. arXiv:2309.02040  [pdf, other

    cs.LG cs.AI

    Diffusion Generative Inverse Design

    Authors: Marin Vlastelica, Tatiana López-Guevara, Kelsey Allen, Peter Battaglia, Arnaud Doucet, Kimberley Stachenfeld

    Abstract: Inverse design refers to the problem of optimizing the input of an objective function in order to enact a target outcome. For many real-world engineering problems, the objective function takes the form of a simulator that predicts how the system state will evolve over time, and the design challenge is to optimize the initial conditions that lead to a target outcome. Recent developments in learned… ▽ More

    Submitted 18 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: ICML workshop on Structured Probabilistic Inference & Generative Modeling

  5. arXiv:2307.11373  [pdf, other

    cs.LG cs.AI cs.RO

    Offline Diversity Maximization Under Imitation Constraints

    Authors: Marin Vlastelica, ** Cheng, Georg Martius, Pavel Kolev

    Abstract: There has been significant recent progress in the area of unsupervised skill discovery, utilizing various information-theoretic objectives as measures of diversity. Despite these advances, challenges remain: current methods require significant online interaction, fail to leverage vast amounts of available task-agnostic data and typically lack a quantitative measure of skill utility. We address the… ▽ More

    Submitted 21 June, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: RLC 2024

  6. arXiv:2307.09933  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Spuriosity Didn't Kill the Classifier: Using Invariant Predictions to Harness Spurious Features

    Authors: Cian Eastwood, Shashank Singh, Andrei Liviu Nicolicioiu, Marin Vlastelica, Julius von Kügelgen, Bernhard Schölkopf

    Abstract: To avoid failures on out-of-distribution data, recent works have sought to extract features that have an invariant or stable relationship with the label across domains, discarding "spurious" or unstable features whose relationship with the label changes across domains. However, unstable features often carry complementary information that could boost performance if used correctly in the test domain… ▽ More

    Submitted 8 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 Camera-Ready

  7. arXiv:2209.07899  [pdf, other

    cs.RO cs.AI cs.LG

    Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions

    Authors: Chenhao Li, Sebastian Blaes, Pavel Kolev, Marin Vlastelica, Jonas Frey, Georg Martius

    Abstract: Learning diverse skills is one of the main challenges in robotics. To this end, imitation learning approaches have achieved impressive results. These methods require explicitly labeled datasets or assume consistent skill execution to enable learning and active control of individual behaviors, which limits their applicability. In this work, we propose a cooperative adversarial method for obtaining… ▽ More

    Submitted 11 February, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

  8. arXiv:2206.11693  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

    Authors: Chenhao Li, Marin Vlastelica, Sebastian Blaes, Jonas Frey, Felix Grimminger, Georg Martius

    Abstract: Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method fo… ▽ More

    Submitted 21 November, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

  9. arXiv:2205.15213  [pdf, other

    cs.LG

    Backpropagation through Combinatorial Algorithms: Identity with Projection Works

    Authors: Subham Sekhar Sahoo, Anselm Paulus, Marin Vlastelica, Vít Musil, Volodymyr Kuleshov, Georg Martius

    Abstract: Embedding discrete solvers as differentiable layers has given modern deep learning architectures combinatorial expressivity and discrete reasoning capabilities. The derivative of these solvers is zero or undefined, therefore a meaningful replacement is crucial for effective gradient-based learning. Prior works rely on smoothing the solver with input perturbations, relaxing the solver to continuous… ▽ More

    Submitted 17 March, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: ICLR 2023 conference paper. The first two authors contributed equally

  10. arXiv:2205.07633  [pdf, other

    cs.CL cs.AI cs.LG

    Taming Continuous Posteriors for Latent Variational Dialogue Policies

    Authors: Marin Vlastelica, Patrick Ernst, György Szarvas

    Abstract: Utilizing amortized variational inference for latent-action reinforcement learning (RL) has been shown to be an effective approach in Task-oriented Dialogue (ToD) systems for optimizing dialogue success. Until now, categorical posteriors have been argued to be one of the main drivers of performance. In this work we revisit Gaussian variational posteriors for latent-action RL and show that they can… ▽ More

    Submitted 1 June, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

  11. arXiv:2102.07456  [pdf, other

    cs.LG cs.AI cs.DM

    Neuro-algorithmic Policies enable Fast Combinatorial Generalization

    Authors: Marin Vlastelica, Michal Rolínek, Georg Martius

    Abstract: Although model-based and model-free approaches to learning the control of systems have achieved impressive results on standard benchmarks, generalization to task variations is still lacking. Recent results suggest that generalization for standard architectures improves only after obtaining exhaustive amounts of data. We give evidence that generalization capabilities are in many cases bottlenecked… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 15 pages

  12. arXiv:1912.03500  [pdf, other

    cs.LG stat.ML

    Optimizing Rank-based Metrics with Blackbox Differentiation

    Authors: Michal Rolínek, Vít Musil, Anselm Paulus, Marin Vlastelica, Claudio Michaelis, Georg Martius

    Abstract: Rank-based metrics are some of the most widely used criteria for performance evaluation of computer vision models. Despite years of effort, direct optimization for these metrics remains a challenge due to their non-differentiable and non-decomposable nature. We present an efficient, theoretically sound, and general method for differentiating rank-based metrics with mini-batch gradient descent. In… ▽ More

    Submitted 18 March, 2020; v1 submitted 7 December, 2019; originally announced December 2019.

    Comments: CVPR 2020 conference paper (oral). The first two authors contributed equally

  13. arXiv:1912.02175  [pdf, other

    cs.LG stat.ML

    Differentiation of Blackbox Combinatorial Solvers

    Authors: Marin Vlastelica, Anselm Paulus, Vít Musil, Georg Martius, Michal Rolínek

    Abstract: Achieving fusion of deep learning with combinatorial algorithms promises transformative changes to artificial intelligence. One possible approach is to introduce combinatorial building blocks into neural networks. Such end-to-end architectures have the potential to tackle combinatorial problems on raw input data such as ensuring global consistency in multi-object tracking or route planning on maps… ▽ More

    Submitted 16 February, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: ICLR 2020 conference paper (spotlight). The first two authors contributed equally