Skip to main content

Showing 1–9 of 9 results for author: Menda, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.02753  [pdf, other

    cs.LG

    Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Forecasting with Application to Electricity Demand

    Authors: Arec Jamgochian, Di Wu, Kunal Menda, Soyeon Jung, Mykel J. Kochenderfer

    Abstract: Some real-world decision-making problems require making probabilistic forecasts over multiple steps at once. However, methods for probabilistic forecasting may fail to capture correlations in the underlying time-series that exist over long time horizons as errors accumulate. One such application is with resource scheduling under uncertainty in a grid environment, which requires forecasting electri… ▽ More

    Submitted 14 January, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  2. arXiv:2201.02718  [pdf, other

    cs.RO

    Multi-Vehicle Control in Roundabouts using Decentralized Game-Theoretic Planning

    Authors: Arec Jamgochian, Kunal Menda, Mykel J. Kochenderfer

    Abstract: Safe navigation in dense, urban driving environments remains an open problem and an active area of research. Unlike typical predict-then-plan approaches, game-theoretic planning considers how one vehicle's plan will affect the actions of another. Recent work has demonstrated significant improvements in the time required to find local Nash equilibria in general-sum games with nonlinear objectives a… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: Presented in the Artificial Intelligence for Autonomous Driving (AI4AD) Workshop at the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)

  3. arXiv:2105.01811  [pdf, other

    cs.RO cs.LG

    Training Structured Mechanical Models by Minimizing Discrete Euler-Lagrange Residual

    Authors: Kunal Menda, Jayesh K. Gupta, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Model-based paradigms for decision-making and control are becoming ubiquitous in robotics. They rely on the ability to efficiently learn a model of the system from data. Structured Mechanical Models (SMMs) are a data-efficient black-box parameterization of mechanical systems, typically fit to data by minimizing the error between predicted and observed accelerations or next states. In this work, we… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

  4. arXiv:2006.11615  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM

    Authors: Kunal Menda, Jean de Becdelièvre, Jayesh K. Gupta, Ilan Kroo, Mykel J. Kochenderfer, Zachary Manchester

    Abstract: System identification is a key step for model-based control, estimator design, and output prediction. This work considers the offline identification of partially observed nonlinear systems. We empirically show that the certainty-equivalent approximation to expectation-maximization can be a reliable and scalable approach for high-dimensional deterministic systems, which are common in robotics. We f… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

    Comments: First three authors contributed equally. Accepted at ICML 2020. Website: https://sites.google.com/stanford.edu/ceem/

  5. arXiv:2004.10301  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Structured Mechanical Models for Robot Learning and Control

    Authors: Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Model-based methods are the dominant paradigm for controlling robotic systems, though their efficacy depends heavily on the accuracy of the model used. Deep neural networks have been used to learn models of robot dynamics from data, but they suffer from data-inefficiency and the difficulty to incorporate prior knowledge. We introduce Structured Mechanical Models, a flexible model class for mechani… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: First two authors contributed equally. Accepted at L4DC2020. Source code and videos at https://sites.google.com/stanford.edu/smm/

  6. arXiv:1902.08705  [pdf, ps, other

    cs.RO cs.AI cs.LG eess.SY

    A General Framework for Structured Learning of Mechanical Systems

    Authors: Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Learning accurate dynamics models is necessary for optimal, compliant control of robotic systems. Current approaches to white-box modeling using analytic parameterizations, or black-box modeling using neural networks, can suffer from high bias or high variance. We address the need for a flexible, gray-box model of mechanical systems that can seamlessly incorporate prior knowledge where it is avail… ▽ More

    Submitted 1 March, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: 10 pages, 7 figures. First two authors contributed equally. Submitted to IROS/RA-L. Code at https://github.com/sisl/mechamodlearn/

  7. arXiv:1807.08364  [pdf, other

    cs.LG cs.AI

    EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning

    Authors: Kunal Menda, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: While imitation learning is often used in robotics, the approach frequently suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which attempts to quantify the confidence of the… ▽ More

    Submitted 19 July, 2019; v1 submitted 22 July, 2018; originally announced July 2018.

    Comments: Accepted to the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)

  8. Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes

    Authors: Kunal Menda, Yi-Chun Chen, Justin Grana, James W. Bono, Brendan D. Tracey, Mykel J. Kochenderfer, David Wolpert

    Abstract: The incorporation of macro-actions (temporally extended actions) into multi-agent decision problems has the potential to address the curse of dimensionality associated with such decision problems. Since macro-actions last for stochastic durations, multiple agents executing decentralized policies in cooperative environments must act asynchronously. We present an algorithm that modifies generalized… ▽ More

    Submitted 29 May, 2019; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: Published in IEEE Transactions on Intelligent Transportation Systems (Volume: 20, Issue: 4, April 2019). https://ieeexplore.ieee.org/document/8419722

    Journal ref: IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 4, pp. 1259-1268, April 2019

  9. arXiv:1709.06166  [pdf, ps, other

    cs.AI cs.RO

    DropoutDAgger: A Bayesian Approach to Safe Imitation Learning

    Authors: Kunal Menda, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: While imitation learning is becoming common practice in robotics, this approach often suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by continually aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which uses the distribution… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.