Skip to main content

Showing 1–5 of 5 results for author: Karamzade, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.12309  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning from Delayed Observations via World Models

    Authors: Armin Karamzade, Kyungmin Kim, Montek Kalsi, Roy Fox

    Abstract: In standard reinforcement learning settings, agents typically assume immediate feedback about the effects of their actions after taking them. However, in practice, this assumption may not hold true due to physical constraints and can significantly impact the performance of learning algorithms. In this paper, we address observation delays in partially observable environments. We propose leveraging… ▽ More

    Submitted 25 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  2. arXiv:2402.14212  [pdf, other

    cs.LG cs.AI

    Moonwalk: Inverse-Forward Differentiation

    Authors: Dmitrii Krylov, Armin Karamzade, Roy Fox

    Abstract: Backpropagation, while effective for gradient computation, falls short in addressing memory consumption, limiting scalability. This work explores forward-mode gradient computation as an alternative in invertible networks, showing its potential to reduce the memory footprint without substantial drawbacks. We introduce a novel technique based on a vector-inverse-Jacobian product that accelerates the… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  3. arXiv:2212.02304  [pdf, other

    cs.LG cs.NI

    Matching DNN Compression and Cooperative Training with Resources and Data Availability

    Authors: Francesco Malandrino, Giuseppe Di Giacomo, Armin Karamzade, Marco Levorato, Carla Fabiana Chiasserini

    Abstract: To make machine learning (ML) sustainable and apt to run on the diverse devices where relevant data is, it is essential to compress ML models as needed, while still meeting the required learning quality and time performance. However, how much and when an ML model should be compressed, and {\em where} its training should be executed, are hard decisions to make, as they depend on the model itself, t… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Journal ref: IEEE INFOCOM 2023

  4. arXiv:2012.07527  [pdf, other

    cs.CL cs.LG stat.ML

    Regularizing Recurrent Neural Networks via Sequence Mixup

    Authors: Armin Karamzade, Amir Najafi, Seyed Abolfazl Motahari

    Abstract: In this paper, we extend a class of celebrated regularization techniques originally proposed for feed-forward neural networks, namely Input Mixup (Zhang et al., 2017) and Manifold Mixup (Verma et al., 2018), to the realm of Recurrent Neural Networks (RNN). Our proposed methods are easy to implement and have a low computational complexity, while leverage the performance of simple neural architectur… ▽ More

    Submitted 27 November, 2020; originally announced December 2020.

    Comments: 17 pages

  5. arXiv:1812.10437  [pdf, other

    cs.LG cs.DC cs.IT stat.ML

    Structure Learning of Sparse GGMs over Multiple Access Networks

    Authors: Mostafa Tavassolipour, Armin Karamzade, Reza Mirzaeifard, Seyed Abolfazl Motahari, Mohammad-Taghi Manzuri Shalmani

    Abstract: A central machine is interested in estimating the underlying structure of a sparse Gaussian Graphical Model (GGM) from datasets distributed across multiple local machines. The local machines can communicate with the central machine through a wireless multiple access channel. In this paper, we are interested in designing effective strategies where reliable learning is feasible under power and bandw… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.