Skip to main content

Showing 1–10 of 10 results for author: Hartikainen, K

.
  1. Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

    Authors: Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Jan Humplik, Markus Wulfmeier, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley , et al. (3 additional authors not shown)

    Abstract: We investigate whether Deep Reinforcement Learning (Deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies in dynamic environments. We used Deep RL to train a humanoid robot with 20 actuated joints to play a simplified one-versus-one (1v1) soccer game. The resulting agent exhibits robust… ▽ More

    Submitted 11 April, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Project website: https://sites.google.com/view/op3-soccer

  2. arXiv:2201.08115  [pdf, other

    cs.AI cs.LG cs.RO stat.ML

    Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning

    Authors: Sasha Salter, Kristian Hartikainen, Walter Goodwin, Ingmar Posner

    Abstract: The ability to discover behaviours from past experience and transfer them to new tasks is a hallmark of intelligent agents acting sample-efficiently in the real world. Equip** embodied reinforcement learners with the same ability may be crucial for their successful deployment in robotics. While hierarchical and KL-regularized reinforcement learning individually hold promise here, arguably a hybr… ▽ More

    Submitted 24 April, 2023; v1 submitted 20 January, 2022; originally announced January 2022.

    Journal ref: Published at the International Conference on Learning Representations, 2023

  3. arXiv:2106.05012  [pdf, other

    cs.LG

    Bayesian Bellman Operators

    Authors: Matthew Fellows, Kristian Hartikainen, Shimon Whiteson

    Abstract: We introduce a novel perspective on Bayesian reinforcement learning (RL); whereas existing approaches infer a posterior over the transition distribution or Q-function, we characterise the uncertainty in the Bellman operator. Our Bayesian Bellman operator (BBO) framework is motivated by the insight that when bootstrap** is introduced, model-free approaches actually infer a posterior over Bellman… ▽ More

    Submitted 15 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

  4. arXiv:2010.01062  [pdf, other

    cs.LG cs.AI stat.ML

    Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

    Authors: Luisa Zintgraf, Leo Feng, Cong Lu, Maximilian Igl, Kristian Hartikainen, Katja Hofmann, Shimon Whiteson

    Abstract: To rapidly learn a new task, it is often essential for agents to explore efficiently -- especially when performance matters from the first timestep. One way to learn such behaviour is via meta-learning. Many existing methods however rely on dense rewards for meta-training, and can fail catastrophically if the rewards are sparse. Without a suitable reward signal, the need for exploration during met… ▽ More

    Submitted 9 June, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Published at the International Conference on Machine Learning (ICML) 2021

  5. arXiv:2004.12570  [pdf, other

    cs.LG cs.RO stat.ML

    The Ingredients of Real-World Robotic Reinforcement Learning

    Authors: Henry Zhu, Justin Yu, Abhishek Gupta, Dhruv Shah, Kristian Hartikainen, Avi Singh, Vikash Kumar, Sergey Levine

    Abstract: The success of reinforcement learning for real world robotics has been, in many cases limited to instrumented laboratory scenarios, often requiring arduous human effort and oversight to enable continuous learning. In this work, we discuss the elements that are needed for a robotic learning system that can continually and autonomously improve with data collected in the real world. We propose a part… ▽ More

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: First three authors contributed equally. Accepted as a spotlight presentation at ICLR 2020

  6. arXiv:1909.11639  [pdf, other

    cs.RO cs.LG stat.ML

    ROBEL: Robotics Benchmarks for Learning with Low-Cost Robots

    Authors: Michael Ahn, Henry Zhu, Kristian Hartikainen, Hugo Ponte, Abhishek Gupta, Sergey Levine, Vikash Kumar

    Abstract: ROBEL is an open-source platform of cost-effective robots designed for reinforcement learning in the real world. ROBEL introduces two robots, each aimed to accelerate reinforcement learning research in different task domains: D'Claw is a three-fingered hand robot that facilitates learning dexterous manipulation tasks, and D'Kitty is a four-legged robot that facilitates learning agile legged locomo… ▽ More

    Submitted 15 December, 2019; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: Published @ CoRL2019. For details visit - http://www.roboticsbenchmarks.org

    Journal ref: Conference on Robot Learning, 2019

  7. arXiv:1907.08225  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery

    Authors: Kristian Hartikainen, Xinyang Geng, Tuomas Haarnoja, Sergey Levine

    Abstract: Reinforcement learning requires manual specification of a reward function to learn a task. While in principle this reward function only needs to specify the task goal, in practice reinforcement learning can be very time-consuming or even infeasible unless the reward function is shaped so as to provide a smooth gradient towards a successful outcome. This sha** is difficult to specify by hand, par… ▽ More

    Submitted 14 February, 2020; v1 submitted 18 July, 2019; originally announced July 2019.

    Comments: 11+6 pages, 6+2 figures, last two authors (Tuomas Haarnoja, Sergey Levine) advised equally

  8. arXiv:1904.07854  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    End-to-End Robotic Reinforcement Learning without Reward Engineering

    Authors: Avi Singh, Larry Yang, Kristian Hartikainen, Chelsea Finn, Sergey Levine

    Abstract: The combination of deep neural network models and reinforcement learning algorithms can make it possible to learn policies for robotic behaviors that directly read in raw sensory inputs, such as camera images, effectively subsuming both estimation and control into one model. However, real-world applications of reinforcement learning must specify the goal of the task by means of a manually programm… ▽ More

    Submitted 15 May, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: Accepted to RSS 2019. 14 pages and 13 figures including references and appendix. Website: https://sites.google.com/view/reward-learning-rl/home

  9. arXiv:1812.05905  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Soft Actor-Critic Algorithms and Applications

    Authors: Tuomas Haarnoja, Aurick Zhou, Kristian Hartikainen, George Tucker, Sehoon Ha, Jie Tan, Vikash Kumar, Henry Zhu, Abhishek Gupta, Pieter Abbeel, Sergey Levine

    Abstract: Model-free deep reinforcement learning (RL) algorithms have been successfully applied to a range of challenging sequential decision making and control tasks. However, these methods typically suffer from two major challenges: high sample complexity and brittleness to hyperparameters. Both of these challenges limit the applicability of such methods to real-world domains. In this paper, we describe S… ▽ More

    Submitted 29 January, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1801.01290

  10. arXiv:1804.02808  [pdf, other

    cs.LG cs.AI stat.ML

    Latent Space Policies for Hierarchical Reinforcement Learning

    Authors: Tuomas Haarnoja, Kristian Hartikainen, Pieter Abbeel, Sergey Levine

    Abstract: We address the problem of learning hierarchical deep neural network policies for reinforcement learning. In contrast to methods that explicitly restrict or cripple lower layers of a hierarchy to force them to use higher-level modulating signals, each layer in our framework is trained to directly solve the task, but acquires a range of diverse strategies via a maximum entropy reinforcement learning… ▽ More

    Submitted 3 September, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: ICML 2018; Videos: https://sites.google.com/view/latent-space-deep-rl Code: https://github.com/haarnoja/sac