Skip to main content

Showing 1–3 of 3 results for author: Oiki, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2011.07193  [pdf, other

    cs.LG cs.AI cs.RO

    Data-Efficient Learning for Complex and Real-Time Physical Problem Solving using Augmented Simulation

    Authors: Kei Ota, Devesh K. Jha, Diego Romeres, Jeroen van Baar, Kevin A. Smith, Takayuki Semitsu, Tomoaki Oiki, Alan Sullivan, Daniel Nikovski, Joshua B. Tenenbaum

    Abstract: Humans quickly solve tasks in novel systems with complex dynamics, without requiring much interaction. While deep reinforcement learning algorithms have achieved tremendous success in many complex tasks, these algorithms need a large number of samples to learn meaningful policies. In this paper, we present a task for navigating a marble to the center of a circular maze. While this system is very i… ▽ More

    Submitted 15 February, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: Under submission

  2. arXiv:2003.01629  [pdf, other

    cs.LG cs.RO stat.ML

    Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?

    Authors: Kei Ota, Tomoaki Oiki, Devesh K. Jha, Toshisada Mariyama, Daniel Nikovski

    Abstract: Deep reinforcement learning (RL) algorithms have recently achieved remarkable successes in various sequential decision making tasks, leveraging advances in methods for training large deep networks. However, these methods usually require large amounts of training data, which is often a big problem for real-world applications. One natural question to ask is whether learning good representations for… ▽ More

    Submitted 26 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 11 pages, 10 figures. Accepted to ICML 2020

  3. arXiv:1903.05751  [pdf, other

    stat.ML cs.LG cs.RO

    Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning

    Authors: Kei Ota, Devesh K. Jha, Tomoaki Oiki, Mamoru Miura, Takashi Nammoto, Daniel Nikovski, Toshisada Mariyama

    Abstract: In this paper, we propose a reinforcement learning-based algorithm for trajectory optimization for constrained dynamical systems. This problem is motivated by the fact that for most robotic systems, the dynamics may not always be known. Generating smooth, dynamically feasible trajectories could be difficult for such systems. Using sampling-based algorithms for motion planning may result in traject… ▽ More

    Submitted 3 March, 2020; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: 8 pages, 6 figures, Accepted to IROS 2019