Skip to main content

Showing 1–10 of 10 results for author: Ghasemipour, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.06114  [pdf, other

    cs.AI

    Learning Interactive Real-World Simulators

    Authors: Mengjiao Yang, Yilun Du, Kamyar Ghasemipour, Jonathan Tompson, Leslie Kaelbling, Dale Schuurmans, Pieter Abbeel

    Abstract: Generative models trained on internet data have revolutionized how text, image, and video content can be created. Perhaps the next milestone for generative models is to simulate realistic experience in response to actions taken by humans, robots, and other interactive agents. Applications of a real-world simulator range from controllable content creation in games and movies, to training embodied a… ▽ More

    Submitted 12 January, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: https://universal-simulator.github.io

  2. arXiv:2303.14870  [pdf, other

    cs.RO cs.AI cs.LG

    Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning

    Authors: Satoshi Kataoka, Youngseog Chung, Seyed Kamyar Seyed Ghasemipour, Pannag Sanketi, Shixiang Shane Gu, Igor Mordatch

    Abstract: Most successes in robotic manipulation have been restricted to single-arm gripper robots, whose low dexterity limits the range of solvable tasks to pick-and-place, inser-tion, and object rearrangement. More complex tasks such as assembly require dual and multi-arm platforms, but entail a suite of unique challenges such as bi-arm coordination and collision avoidance, robust gras**, and long-horiz… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Our accompanying project webpage can be found at: https://sites.google.com/view/u-shape-block-assembly. arXiv admin note: substantial text overlap with arXiv:2203.08277

  3. arXiv:2205.13703  [pdf, other

    cs.LG

    Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters

    Authors: Seyed Kamyar Seyed Ghasemipour, Shixiang Shane Gu, Ofir Nachum

    Abstract: Motivated by the success of ensembles for uncertainty estimation in supervised learning, we take a renewed look at how ensembles of $Q$-functions can be leveraged as the primary source of pessimism for offline reinforcement learning (RL). We begin by identifying a critical flaw in a popular algorithmic choice used by many ensemble-based RL algorithms, namely the use of shared pessimistic target va… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: Our codebase can be found at https://github.com/google-research/google-research/tree/master/jrl

  4. arXiv:2205.11487  [pdf, other

    cs.CV cs.LG

    Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

    Authors: Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi

    Abstract: We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation. Our key discovery is that generic large language models (e.g. T5), pretrained on text-only c… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  5. arXiv:2203.13733  [pdf, other

    cs.RO cs.LG

    Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

    Authors: Seyed Kamyar Seyed Ghasemipour, Daniel Freeman, Byron David, Shixiang Shane Gu, Satoshi Kataoka, Igor Mordatch

    Abstract: Assembly of multi-part physical structures is both a valuable end product for autonomous robotics, as well as a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Desp… ▽ More

    Submitted 12 April, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Accompanying project webpage can be found at: https://sites.google.com/view/learning-direct-assembly

  6. arXiv:2203.08277  [pdf, other

    cs.RO cs.AI cs.LG

    Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning

    Authors: Satoshi Kataoka, Seyed Kamyar Seyed Ghasemipour, Daniel Freeman, Igor Mordatch

    Abstract: Most successes in robotic manipulation have been restricted to single-arm robots, which limits the range of solvable tasks to pick-and-place, insertion, and objects rearrangement. In contrast, dual and multi arm robot platforms unlock a rich diversity of problems that can be tackled, such as laundry folding and executing cooking skills. However, develo** controllers for multi-arm robots is compl… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: Our accompanying project webpage can be found at: https://sites.google.com/view/bimanual-attachment

  7. arXiv:2110.04686  [pdf, other

    cs.LG cs.AI

    Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization

    Authors: Shixiang Shane Gu, Manfred Diaz, Daniel C. Freeman, Hiroki Furuta, Seyed Kamyar Seyed Ghasemipour, Anton Raichuk, Byron David, Erik Frey, Erwin Coumans, Olivier Bachem

    Abstract: The goal of continuous control is to synthesize desired behaviors. In reinforcement learning (RL)-driven approaches, this is often accomplished through careful task reward engineering for efficient exploration and running an off-the-shelf RL algorithm. While reward maximization is at the core of RL, reward engineering is not the only -- sometimes nor the easiest -- way for specifying complex behav… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

  8. arXiv:2007.11091  [pdf, other

    cs.LG stat.ML

    EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL

    Authors: Seyed Kamyar Seyed Ghasemipour, Dale Schuurmans, Shixiang Shane Gu

    Abstract: Off-policy reinforcement learning holds the promise of sample-efficient learning of decision-making policies by leveraging past experience. However, in the offline RL setting -- where a fixed collection of interactions are provided and no further interactions are allowed -- it has been shown that standard off-policy RL methods can significantly underperform. Recently proposed methods often aim to… ▽ More

    Submitted 13 January, 2021; v1 submitted 21 July, 2020; originally announced July 2020.

  9. arXiv:2006.00979  [pdf, other

    cs.LG cs.AI

    Acme: A Research Framework for Distributed Reinforcement Learning

    Authors: Matthew W. Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Nikola Momchev, Danila Sinopalnikov, Piotr Stańczyk, Sabela Ramos, Anton Raichuk, Damien Vincent, Léonard Hussenot, Robert Dadashi, Gabriel Dulac-Arnold, Manu Orsini, Alexis Jacq, Johan Ferret, Nino Vieillard, Seyed Kamyar Seyed Ghasemipour, Sertan Girgin, Olivier Pietquin, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang , et al. (14 additional authors not shown)

    Abstract: Deep reinforcement learning (RL) has led to many recent and groundbreaking advances. However, these advances have often come at the cost of both increased scale in the underlying architectures being trained as well as increased complexity of the RL algorithms used to train them. These increases have in turn made it more difficult for researchers to rapidly prototype new ideas or reproduce publishe… ▽ More

    Submitted 20 September, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: This work presents a second version of the paper which coincides with an increase in modularity, additional emphasis on offline, imitation and learning from demonstrations algorithms, as well as various new agents implemented as part of Acme

  10. arXiv:1911.02256  [pdf, other

    cs.LG stat.ML

    A Divergence Minimization Perspective on Imitation Learning Methods

    Authors: Seyed Kamyar Seyed Ghasemipour, Richard Zemel, Shixiang Gu

    Abstract: In many settings, it is desirable to learn decision-making and control policies through learning or bootstrap** from expert demonstrations. The most common approaches under this Imitation Learning (IL) framework are Behavioural Cloning (BC), and Inverse Reinforcement Learning (IRL). Recent methods for IRL have demonstrated the capacity to learn effective policies with access to a very limited se… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Published at Conference on Robot Learning (CoRL) 2019. For datasets and reproducing results please refer to https://github.com/KamyarGh/rl_swiss/blob/master/reproducing/fmax_paper.md