Showing 1–15 of 15 results for author: Bousmalis, K

Searching in archive cs.
  1. arXiv:2306.11706  [pdf, other]

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de…

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  2. arXiv:2205.03353  [pdf, other]

    cs.RO cs.LG

    How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation

    Authors: Alex X. Lee, Coline Devin, Jost Tobias Springenberg, Yuxiang Zhou, Thomas Lampe, Abbas Abdolmaleki, Konstantinos Bousmalis

    Abstract: Reinforcement learning (RL) has been shown to be effective at learning control from experience. However, RL typically requires a large amount of online interaction with the environment. This limits its applicability to real-world settings, such as in robotics, where such interaction is expensive. In this work we investigate ways to minimize online interactions in a target task, by reusing a subopt…

    Submitted 6 May, 2022; originally announced May 2022.

  3. arXiv:2111.00262  [pdf, other]

    cs.RO cs.LG

    Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner

    Authors: Philemon Brakel, Steven Bohez, Leonard Hasenclever, Nicolas Heess, Konstantinos Bousmalis

    Abstract: Dynamic quadruped locomotion over challenging terrains with precise foot placements is a hard problem for both optimal control methods and Reinforcement Learning (RL). Non-linear solvers can produce coordinated constraint satisfying motions, but often take too long to converge for online application. RL methods can learn dynamic reactive controllers but require carefully tuned shaping rewards to p…

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: A shorter version without appendix was submitted to ICRA 2022

  4. arXiv:2110.06192  [pdf, other]

    cs.RO cs.LG

    Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

    Authors: Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, Jose Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin Riedmiller, Raia Hadsell, Francesco Nori

    Abstract: We study the problem of robotic stacking with objects of complex geometry. We propose a challenging and diverse set of such objects that was carefully designed to require strategies beyond a simple "pick-and-place" solution. Our method is a reinforcement learning (RL) approach combined with vision-based interactive policy distillation and simulation-to-reality transfer. Our learned policies can ef…

    Submitted 3 November, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: CoRL 2021. Video: https://dpmd.ai/robotics-stacking-YT . Blog: https://dpmd.ai/robotics-stacking . Code: https://github.com/deepmind/rgb_stacking

  5. arXiv:2106.08199  [pdf, other]

    cs.LG cs.RO

    On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning

    Authors: Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, Andras Gyorgy, Csaba Szepesvari, Raia Hadsell, Nicolas Heess, Martin Riedmiller

    Abstract: Many advances that have improved the robustness and efficiency of deep reinforcement learning (RL) algorithms can, in one way or another, be understood as introducing additional objectives or constraints in the policy optimization step. This includes ideas as far ranging as exploration bonuses, entropy regularization, and regularization toward teachers or data priors. Often, the task reward and au…

    Submitted 1 August, 2023; v1 submitted 15 June, 2021; originally announced June 2021.

  6. arXiv:2103.09016  [pdf, other]

    cs.RO

    Manipulator-Independent Representations for Visual Imitation

    Authors: Yuxiang Zhou, Yusuf Aytar, Konstantinos Bousmalis

    Abstract: Imitation learning is an effective tool for robotic learning tasks where specifying a reinforcement learning (RL) reward is not feasible or where the exploration problem is particularly difficult. Imitation, typically behavior cloning or inverse RL, derive a policy from a collection of first-person action-state trajectories. This is contrary to how humans and other animals imitate: we observe a be…

    Submitted 18 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  7. arXiv:1910.09470  [pdf, other]

    cs.RO cs.CV

    Self-Supervised Sim-to-Real Adaptation for Visual Robotic Manipulation

    Authors: Rae Jeong, Yusuf Aytar, David Khosid, Yuxiang Zhou, Jackie Kay, Thomas Lampe, Konstantinos Bousmalis, Francesco Nori

    Abstract: Collecting and automatically obtaining reward signals from real robotic visual data for the purposes of training reinforcement learning algorithms can be quite challenging and time-consuming. Methods for utilizing unlabeled data can have a huge potential to further accelerate robotic learning. We consider here the problem of performing manipulation tasks from pixels. In such tasks, choosing an app…

    Submitted 21 October, 2019; originally announced October 2019.

  8. arXiv:1906.01624  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Off-Policy Evaluation via Off-Policy Classification

    Authors: Alex Irpan, Kanishka Rao, Konstantinos Bousmalis, Chris Harris, Julian Ibarz, Sergey Levine

    Abstract: In this work, we consider the problem of model selection for deep reinforcement learning (RL) in real-world environments. Typically, the performance of deep RL algorithms is evaluated via on-policy interactions with the target environment. However, comparing models in a real-world environment for the purposes of early stopping or hyperparameter tuning is costly and often practically infeasible. Th…

    Submitted 22 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted to NeurIPS 2019. Camera ready version

  9. arXiv:1812.07252  [pdf, other]

    cs.RO cs.CV cs.LG

    Sim-to-Real via Sim-to-Sim: Data-efficient Robotic Grasping via Randomized-to-Canonical Adaptation Networks

    Authors: Stephen James, Paul Wohlhart, Mrinal Kalakrishnan, Dmitry Kalashnikov, Alex Irpan, Julian Ibarz, Sergey Levine, Raia Hadsell, Konstantinos Bousmalis

    Abstract: Real world data, especially in the domain of robotics, is notoriously costly to collect. One way to circumvent this can be to leverage the power of simulation to produce large amounts of labelled data. However, training models on simulated images does not readily transfer to real-world ones. Using domain adaptation methods to cross this "reality gap" requires a large amount of unlabelled real-worl…

    Submitted 21 July, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019)

  10. arXiv:1810.11547  [pdf, other]

    cs.CV

    Unsupervised Multi-Target Domain Adaptation: An Information Theoretic Approach

    Authors: Behnam Gholami, Pritish Sahu, Ognjen Rudovic, Konstantinos Bousmalis, Vladimir Pavlovic

    Abstract: Unsupervised domain adaptation (uDA) models focus on pairwise adaptation settings where there is a single, labeled, source and a single target domain. However, in many real-world settings one seeks to adapt to multiple, but somewhat similar, target domains. Applying pairwise adaptation approaches to this setting may be suboptimal, as they fail to leverage shared information among multiple domains.…

    Submitted 26 October, 2018; originally announced October 2018.

    Comments: 19 pages, 5 Figures, 5 Tables

  11. arXiv:1711.05139  [pdf, other]

    cs.CV

    XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings

    Authors: Amélie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch, Inbar Mosseri, Forrester Cole, Kevin Murphy

    Abstract: Style transfer usually refers to the task of applying color and texture information from a specific style image to a given content image while preserving the structure of the latter. Here we tackle the more generic problem of semantic style transfer: given two unpaired collections of images, we aim to learn a mapping between the corpus-level style of each collection, while preserving semantic cont…

    Submitted 10 July, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: Domain Adaptation for Visual Understanding at ICML'18

  12. arXiv:1709.07857  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping

    Authors: Konstantinos Bousmalis, Alex Irpan, Paul Wohlhart, Yunfei Bai, Matthew Kelcey, Mrinal Kalakrishnan, Laura Downs, Julian Ibarz, Peter Pastor, Kurt Konolige, Sergey Levine, Vincent Vanhoucke

    Abstract: Instrumenting and collecting annotated visual grasping datasets to train modern machine learning algorithms can be extremely time-consuming and expensive. An appealing alternative is to use off-the-shelf simulators to render synthetic data for which ground-truth annotations are generated automatically. Unfortunately, models trained purely on simulated data often fail to generalize to the real worl…

    Submitted 25 September, 2017; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: 9 pages, 5 figures, 3 tables

  13. arXiv:1612.05424  [pdf, other]

    cs.CV

    Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

    Authors: Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan

    Abstract: Collecting well-annotated image datasets to train modern machine learning algorithms is prohibitively expensive for many tasks. One appealing alternative is rendering synthetic data where ground-truth annotations are generated automatically. Unfortunately, models trained purely on rendered images often fail to generalize to real images. To address this shortcoming, prior work introduced unsupervis…

    Submitted 23 August, 2017; v1 submitted 16 December, 2016; originally announced December 2016.

    Comments: Final CVPR 2017 paper and supplementary material

  14. arXiv:1608.06019  [pdf, other]

    cs.CV

    Domain Separation Networks

    Authors: Konstantinos Bousmalis, George Trigeorgis, Nathan Silberman, Dilip Krishnan, Dumitru Erhan

    Abstract: The cost of large scale data collection and annotation often makes the application of machine learning algorithms to new tasks or datasets prohibitively expensive. One approach circumventing this cost is training models on synthetic data where annotations are provided automatically. Despite their appeal, such models often fail to generalize from synthetic to real images, necessitating domain adapt…

    Submitted 21 August, 2016; originally announced August 2016.

    Comments: This work will be presented at NIPS 2016

  15. arXiv:1509.03248  [pdf, other]

    cs.CV cs.LG stat.ML

    A deep matrix factorization method for learning attribute representations

    Authors: George Trigeorgis, Konstantinos Bousmalis, Stefanos Zafeiriou, Bjoern W. Schuller

    Abstract: Semi-Non-negative Matrix Factorization is a technique that learns a low-dimensional representation of a dataset that lends itself to a clustering interpretation. It is possible that the mapping between this new representation and our original data matrix contains rather complex hierarchical information with implicit lower-level hidden attributes, that classical one level clustering methodologies c…

    Submitted 10 September, 2015; originally announced September 2015.

    Comments: Submitted to TPAMI (16-Mar-2015)