Skip to main content

Showing 1–4 of 4 results for author: Hamadanian, P

.
  1. arXiv:2302.02182  [pdf, other

    cs.LG cs.AI

    Online Reinforcement Learning in Non-Stationary Context-Driven Environments

    Authors: Pouya Hamadanian, Arash Nasr-Esfahany, Malte Schwarzkopf, Siddartha Sen, Mohammad Alizadeh

    Abstract: We study online reinforcement learning (RL) in non-stationary environments, where a time-varying exogenous context process affects the environment dynamics. Online RL is challenging in such environments due to "catastrophic forgetting" (CF). The agent tends to forget prior knowledge as it trains on new experiences. Prior approaches to mitigate this issue assume task labels (which are often not ava… ▽ More

    Submitted 10 February, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: 9 pages + 6 pages in the appendix, 10 Figures and 8 Tables

  2. arXiv:2201.05560  [pdf, other

    cs.LG cs.AI cs.NI

    Demystifying Reinforcement Learning in Time-Varying Systems

    Authors: Pouya Hamadanian, Malte Schwarzkopf, Siddartha Sen, Mohammad Alizadeh

    Abstract: Recent research has turned to Reinforcement Learning (RL) to solve challenging decision problems, as an alternative to hand-tuned heuristics. RL can learn good policies without the need for modeling the environment's dynamics. Despite this promise, RL remains an impractical solution for many real-world systems problems. A particularly challenging case occurs when the environment changes over time,… ▽ More

    Submitted 26 January, 2023; v1 submitted 14 January, 2022; originally announced January 2022.

  3. arXiv:2201.01811  [pdf, other

    cs.LG cs.AI cs.NI stat.ML

    CausalSim: A Causal Framework for Unbiased Trace-Driven Simulation

    Authors: Abdullah Alomar, Pouya Hamadanian, Arash Nasr-Esfahany, Anish Agarwal, Mohammad Alizadeh, Devavrat Shah

    Abstract: We present CausalSim, a causal framework for unbiased trace-driven simulation. Current trace-driven simulators assume that the interventions being simulated (e.g., a new algorithm) would not affect the validity of the traces. However, real-world traces are often biased by the choices algorithms make during trace collection, and hence replaying traces under an intervention may lead to incorrect res… ▽ More

    Submitted 5 May, 2023; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: NSDI'23 Best Paper Award

    Journal ref: 20th USENIX Symposium on Networked Systems Design and Implementation (2023) 1115--1147

  4. arXiv:2006.06628  [pdf, other

    cs.LG cs.CV cs.NI eess.IV stat.ML

    Real-Time Video Inference on Edge Devices via Adaptive Model Streaming

    Authors: Mehrdad Khani, Pouya Hamadanian, Arash Nasr-Esfahany, Mohammad Alizadeh

    Abstract: Real-time video inference on edge devices like mobile phones and drones is challenging due to the high computation cost of Deep Neural Networks. We present Adaptive Model Streaming (AMS), a new approach to improving performance of efficient lightweight models for video inference on edge devices. AMS uses a remote server to continually train and adapt a small model running on the edge device, boost… ▽ More

    Submitted 5 April, 2021; v1 submitted 11 June, 2020; originally announced June 2020.