Skip to main content

Showing 1–8 of 8 results for author: Danesh, M H

.
  1. arXiv:2402.17065  [pdf, other

    cs.CV cs.AI cs.LG

    Taming the Tail in Class-Conditional GANs: Knowledge Sharing via Unconditional Training at Lower Resolutions

    Authors: Saeed Khorram, Mingqi Jiang, Mohamad Shahbazi, Mohamad H. Danesh, Li Fuxin

    Abstract: Despite extensive research on training generative adversarial networks (GANs) with limited training data, learning to generate images from long-tailed training distributions remains fairly unexplored. In the presence of imbalanced multi-class training data, GANs tend to favor classes with more samples, leading to the generation of low-quality and less diverse samples in tail classes. In this study… ▽ More

    Submitted 16 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  2. arXiv:2402.03046  [pdf, other

    cs.LG

    Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

    Authors: Shengyi Huang, Quentin Gallouédec, Florian Felten, Antonin Raffin, Rousslan Fernand Julien Dossa, Yanxiao Zhao, Ryan Sullivan, Viktor Makoviychuk, Denys Makoviichuk, Mohamad H. Danesh, Cyril Roumégous, Jiayi Weng, Chufan Chen, Md Masudur Rahman, João G. M. Araújo, Guorui Quan, Daniel Tan, Timo Klein, Rujikorn Charakorn, Mark Towers, Yann Berthelot, Kinal Mehta, Dipam Chakraborty, Arjun KG, Valentin Charraut , et al. (8 additional authors not shown)

    Abstract: In many Reinforcement Learning (RL) papers, learning curves are useful indicators to measure the effectiveness of RL algorithms. However, the complete raw data of the learning curves are rarely available. As a result, it is usually necessary to reproduce the experiments from scratch, which can be time-consuming and error-prone. We present Open RL Benchmark, a set of fully tracked RL experiments, i… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Under review

  3. arXiv:2307.05209  [pdf, other

    cs.AI cs.LG

    Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

    Authors: Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren

    Abstract: Recent studies show that deep reinforcement learning (DRL) agents tend to overfit to the task on which they were trained and fail to adapt to minor environment changes. To expedite learning when transferring to unseen tasks, we propose a novel approach to representing the current task using reward machines (RMs), state machine abstractions that induce subtasks based on the current task's rewards a… ▽ More

    Submitted 20 February, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI), 2024

  4. arXiv:2209.11422  [pdf, other

    cs.LG cs.RO

    LEADER: Learning Attention over Driving Behaviors for Planning under Uncertainty

    Authors: Mohamad H. Danesh, Panpan Cai, David Hsu

    Abstract: Uncertainty on human behaviors poses a significant challenge to autonomous driving in crowded urban environments. The partially observable Markov decision processes (POMDPs) offer a principled framework for planning under uncertainty, often leveraging Monte Carlo sampling to achieve online performance for complex tasks. However, sampling also raises safety concerns by potentially missing critical… ▽ More

    Submitted 29 October, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: CoRL 2022 (oral)

  5. arXiv:2107.04982  [pdf, other

    cs.LG cs.AI

    Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results

    Authors: Mohamad H Danesh, Alan Fern

    Abstract: We study the problem of out-of-distribution dynamics (OODD) detection, which involves detecting when the dynamics of a temporal process change compared to the training-distribution dynamics. This is relevant to applications in control, reinforcement learning (RL), and multi-variate time-series, where changes to test time dynamics can impact the performance of learning controllers/predictors in unk… ▽ More

    Submitted 24 May, 2022; v1 submitted 11 July, 2021; originally announced July 2021.

    Comments: ICML 2021 Workshop on Uncertainty and Robustness in Deep Learning

  6. arXiv:2105.00339  [pdf, other

    cs.LG cs.AI cs.DC

    Stochastic Block-ADMM for Training Deep Networks

    Authors: Saeed Khorram, Xiao Fu, Mohamad H. Danesh, Zhongang Qi, Li Fuxin

    Abstract: In this paper, we propose Stochastic Block-ADMM as an approach to train deep neural networks in batch and online settings. Our method works by splitting neural networks into an arbitrary number of blocks and utilizes auxiliary variables to connect these blocks while optimizing with stochastic gradient descent. This allows training deep networks with non-differentiable constraints where conventiona… ▽ More

    Submitted 1 May, 2021; originally announced May 2021.

  7. arXiv:2011.01191  [pdf, other

    cs.LG cs.CC stat.ML

    Reducing Neural Network Parameter Initialization Into an SMT Problem

    Authors: Mohamad H. Danesh

    Abstract: Training a neural network (NN) depends on multiple factors, including but not limited to the initial weights. In this paper, we focus on initializing deep NN parameters such that it performs better, comparing to random or zero initialization. We do this by reducing the process of initialization into an SMT solver. Previous works consider certain activation functions on small NNs, however the studi… ▽ More

    Submitted 9 November, 2020; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: AAAI-21 SA Program

  8. arXiv:2006.03745  [pdf, other

    cs.LG stat.ML

    Re-understanding Finite-State Representations of Recurrent Policy Networks

    Authors: Mohamad H. Danesh, Anurag Koul, Alan Fern, Saeed Khorram

    Abstract: We introduce an approach for understanding control policies represented as recurrent neural networks. Recent work has approached this problem by transforming such recurrent policy networks into finite-state machines (FSM) and then analyzing the equivalent minimized FSM. While this led to interesting insights, the minimization process can obscure a deeper understanding of a machine's operation by m… ▽ More

    Submitted 11 July, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: ICML 2021