Skip to main content

Showing 1–5 of 5 results for author: Mehrjou, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2201.05830  [pdf, other

    cs.RO math.DS stat.ML

    Physical Derivatives: Computing policy gradients by physical forward-propagation

    Authors: Arash Mehrjou, Ashkan Soleymani, Stefan Bauer, Bernhard Schölkopf

    Abstract: Model-free and model-based reinforcement learning are two ends of a spectrum. Learning a good policy without a dynamic model can be prohibitively expensive. Learning the dynamic model of a system can reduce the cost of learning the policy, but it can also introduce bias if it is not accurate. We propose a middle ground where instead of the transition model, the sensitivity of the trajectories with… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

  2. arXiv:2107.03770  [pdf, other

    stat.ML cs.LG math.DS math.OC math.PR

    Federated Learning as a Mean-Field Game

    Authors: Arash Mehrjou

    Abstract: We establish a connection between federated learning, a concept from machine learning, and mean-field games, a concept from game theory and control theory. In this analogy, the local federated learners are considered as the players and the aggregation of the gradients in a central server is the mean-field effect. We present federated learning as a differential game and discuss the properties of th… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  3. arXiv:1910.14428   

    stat.ML cs.LG math.DS

    Kernel-Guided Training of Implicit Generative Models with Stability Guarantees

    Authors: Arash Mehrjou, Wittawat Jitkrittum, Krikamol Muandet, Bernhard Schölkopf

    Abstract: Modern implicit generative models such as generative adversarial networks (GANs) are generally known to suffer from issues such as instability, uninterpretability, and difficulty in assessing their performance. If we see these implicit models as dynamical systems, some of these issues are caused by being unable to control their behavior in a meaningful way during the course of training. In this wo… ▽ More

    Submitted 3 November, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: There was a misunderstanding in how an article should be updated on arXiv. We have withdrawn this article from this link. The same article can be found at arXiv:1901.09206

  4. arXiv:1901.08403  [pdf, other

    math.DS

    Deep Lyapunov Function: Automatic Stability Analysis for Dynamical Systems

    Authors: Arash Mehrjou, Bernhard Schölkopf

    Abstract: Stability analysis plays a crucial role in studying the behavior of dynamical systems with theoretical and engineering applications. Among various kinds of stability, the stability of equilibrium points is of the greatest importance which is mainly studied by Lyapunov's stability theory. This theory requires finding a function with specified properties. Except for a few simple examples, there is n… ▽ More

    Submitted 24 January, 2019; originally announced January 2019.

  5. arXiv:1805.10615  [pdf, other

    stat.ML cs.LG math.DS

    A Local Information Criterion for Dynamical Systems

    Authors: Arash Mehrjou, Friedrich Solowjow, Sebastian Trimpe, Bernhard Schölkopf

    Abstract: Encoding a sequence of observations is an essential task with many applications. The encoding can become highly efficient when the observations are generated by a dynamical system. A dynamical system imposes regularities on the observations that can be leveraged to achieve a more efficient code. We propose a method to encode a given or learned dynamical system. Apart from its application for encod… ▽ More

    Submitted 27 May, 2018; originally announced May 2018.