Skip to main content

Showing 1–5 of 5 results for author: Ganai, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14058  [pdf, other

    cs.AI cs.LG eess.SY

    Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates

    Authors: Udayan Mandal, Guy Amir, Haoze Wu, Ieva Daukantas, Fletcher Lee Newell, Umberto J. Ravaioli, Baoluo Meng, Michael Durling, Milan Ganai, Tobey Shim, Guy Katz, Clark Barrett

    Abstract: Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the "black box" nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certificates, which are learned functions… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2309.13528  [pdf, other

    cs.LG cs.AI cs.RO

    Iterative Reachability Estimation for Safe Reinforcement Learning

    Authors: Milan Ganai, Zheng Gong, Chenning Yu, Sylvia Herbert, Sicun Gao

    Abstract: Ensuring safety is important for the practical deployment of reinforcement learning (RL). Various challenges must be addressed, such as handling stochasticity in the environments, providing rigorous guarantees of persistent state-wise safety satisfaction, and avoiding overly conservative behaviors that sacrifice performance. We propose a new framework, Reachability Estimation for Safe Policy Optim… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted in NeurIPS 2023

  3. arXiv:2308.14364  [pdf, other

    cs.LG cs.AI

    Target-independent XLA optimization using Reinforcement Learning

    Authors: Milan Ganai, Haichen Li, Theodore Enns, Yida Wang, Randy Huang

    Abstract: An important challenge in Machine Learning compilers like XLA is multi-pass optimization and analysis. There has been recent interest chiefly in XLA target-dependent optimization on the graph-level, subgraph-level, and kernel-level phases. We specifically focus on target-independent optimization XLA HLO pass ordering: our approach aims at finding the optimal sequence of compiler optimization passe… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Workshop on ML for Systems @ NeurIPS 2022

  4. arXiv:2303.02215  [pdf, other

    cs.RO

    Learning Stabilization Control from Observations by Learning Lyapunov-like Proxy Models

    Authors: Milan Ganai, Chiaki Hirayama, Ya-Chien Chang, Sicun Gao

    Abstract: The deployment of Reinforcement Learning to robotics applications faces the difficulty of reward engineering. Therefore, approaches have focused on creating reward functions by Learning from Observations (LfO) which is the task of learning policies from expert trajectories that only contain state sequences. We propose new methods for LfO for the important class of continuous control problems of le… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: ICRA 2023

  5. arXiv:0710.4666  [pdf

    cs.LO

    Verification of Embedded Memory Systems using Efficient Memory Modeling

    Authors: Malay K. Ganai, Aarti Gupta, Pranav Ashar

    Abstract: We describe verification techniques for embedded memory systems using efficient memory modeling (EMM), without explicitly modeling each memory bit. We extend our previously proposed approach of EMM in Bounded Model Checking (BMC) for a single read/write port single memory system, to more commonly occurring systems with multiple memories, having multiple read and write ports. More importantly, we… ▽ More

    Submitted 25 October, 2007; originally announced October 2007.

    Comments: Submitted on behalf of EDAA (http://www.edaa.com/)

    Journal ref: Dans Design, Automation and Test in Europe - DATE'05, Munich : Allemagne (2005)