Showing 1–10 of 10 results for author: Jha, D K

  1. arXiv:2103.11238  [pdf, ps, other]

    stat.ML cs.LG

    Markov Modeling of Time-Series Data using Symbolic Analysis

    Authors: Devesh K. Jha

    Abstract: Markov models are often used to capture the temporal patterns of sequential data for statistical learning applications. While the Hidden Markov modeling-based learning mechanisms are well studied in literature, we analyze a symbolic-dynamics inspired approach. Under this umbrella, Markov modeling of time-series data consists of two major steps -- discretization of continuous attributes followed by…

    Submitted 23 March, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

  2. arXiv:2003.11696  [pdf, other]

    cs.LG cs.RO stat.ML

    CAZSL: Zero-Shot Regression for Pushing Models by Generalizing Through Context

    Authors: Wenyu Zhang, Skyler Seto, Devesh K. Jha

    Abstract: Learning accurate models of the physical world is required for a lot of robotic manipulation tasks. However, during manipulation, robots are expected to interact with unknown workpieces so that building predictive models which can generalize over a number of these objects is highly desirable. In this paper, we study the problem of designing deep learning agents which can generalize their models of…

    Submitted 1 November, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: Accepted at IROS 2020

  3. arXiv:2003.01641  [pdf, other]

    cs.LG cs.RO stat.ML

    Efficient Exploration in Constrained Environments with Goal-Oriented Reference Path

    Authors: Kei Ota, Yoko Sasaki, Devesh K. Jha, Yusuke Yoshiyasu, Asako Kanezaki

    Abstract: In this paper, we consider the problem of building learning agents that can efficiently learn to navigate in constrained environments. The main goal is to design agents that can efficiently learn to understand and generalize to different environments using high-dimensional inputs (a 2D map), while following feasible paths that avoid obstacles in obstacle-cluttered environment. To achieve this, we…

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 8 pages, 10 figures

  4. arXiv:2003.01629  [pdf, other]

    cs.LG cs.RO stat.ML

    Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?

    Authors: Kei Ota, Tomoaki Oiki, Devesh K. Jha, Toshisada Mariyama, Daniel Nikovski

    Abstract: Deep reinforcement learning (RL) algorithms have recently achieved remarkable successes in various sequential decision making tasks, leveraging advances in methods for training large deep networks. However, these methods usually require large amounts of training data, which is often a big problem for real-world applications. One natural question to ask is whether learning good representations for…

    Submitted 26 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 11 pages, 10 figures. Accepted to ICML 2020

  5. arXiv:2002.10621  [pdf, other]

    cs.LG cs.RO eess.SP eess.SY stat.ML

    Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements

    Authors: Alberto Dalla Libera, Diego Romeres, Devesh K. Jha, Bill Yerazunis, Daniel Nikovski

    Abstract: In this paper, we propose a derivative-free model learning framework for Reinforcement Learning (RL) algorithms based on Gaussian Process Regression (GPR). In many mechanical systems, only positions can be measured by the sensing instruments. Then, instead of representing the system state as suggested by the physics with a collection of positions, velocities, and accelerations, we define the state…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted at RA-L

  6. arXiv:2001.10098  [pdf, other]

    cs.LG eess.SP stat.ML

    Multi-label Prediction in Time Series Data using Deep Neural Networks

    Authors: Wenyu Zhang, Devesh K. Jha, Emil Laftchiev, Daniel Nikovski

    Abstract: This paper addresses a multi-label predictive fault classification problem for multidimensional time-series data. While fault (event) detection problems have been thoroughly studied in literature, most of the state-of-the-art techniques can't reliably predict faults (events) over a desired future horizon. In the most general setting of these types of problems, one or more samples of data across mu…

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted by IJPHM. Presented at PHM19

  7. arXiv:2001.08092  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Local Policy Optimization for Trajectory-Centric Reinforcement Learning

    Authors: Patrik Kolaric, Devesh K. Jha, Arvind U. Raghunathan, Frank L. Lewis, Mouhacine Benosman, Diego Romeres, Daniel Nikovski

    Abstract: The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems could be a very challenging problem both algorithmically and numerically. However, a lot of robotic manipu…

    Submitted 22 January, 2020; originally announced January 2020.

    Journal ref: ICRA 2020

  8. arXiv:1905.05927  [pdf, ps, other]

    cs.LG cs.CV math.OC stat.ML

    Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function

    Authors: Arvind U. Raghunathan, Anoop Cherian, Devesh K. Jha

    Abstract: Computing Nash equilibrium (NE) of multi-player games has witnessed renewed interest due to recent advances in generative adversarial networks. However, computing equilibrium efficiently is challenging. To this end, we introduce the Gradient-based Nikaido-Isoda (GNI) function which serves: (i) as a merit function, vanishing only at the first-order stationary points of each player's optimization pr…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted at International Conference on Machine Learning (ICML), 2019

  9. arXiv:1903.05751  [pdf, other]

    stat.ML cs.LG cs.RO

    Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning

    Authors: Kei Ota, Devesh K. Jha, Tomoaki Oiki, Mamoru Miura, Takashi Nammoto, Daniel Nikovski, Toshisada Mariyama

    Abstract: In this paper, we propose a reinforcement learning-based algorithm for trajectory optimization for constrained dynamical systems. This problem is motivated by the fact that for most robotic systems, the dynamics may not always be known. Generating smooth, dynamically feasible trajectories could be difficult for such systems. Using sampling-based algorithms for motion planning may result in traject…

    Submitted 3 March, 2020; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: 8 pages, 6 figures, Accepted to IROS 2019

  10. arXiv:1709.09274  [pdf, ps, other]

    stat.ML

    Symbolic Analysis-based Reduced Order Markov Modeling of Time Series Data

    Authors: Devesh K. Jha, Nurali Virani, Jan Reimann, Abhishek Srivastav, Asok Ray

    Abstract: This paper presents a technique for reduced-order Markov modeling for compact representation of time-series data. In this work, symbolic dynamics-based tools have been used to infer an approximate generative Markov model. The time-series data are first symbolized by partitioning the continuous measurement space of the signal and then, the discrete sequential data are modeled using symbolic dynamic…

    Submitted 26 September, 2017; originally announced September 2017.

    Comments: 21 pages, 12 figures