Skip to main content

Showing 1–2 of 2 results for author: Soleymani, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2201.05830  [pdf, other

    cs.RO math.DS stat.ML

    Physical Derivatives: Computing policy gradients by physical forward-propagation

    Authors: Arash Mehrjou, Ashkan Soleymani, Stefan Bauer, Bernhard Schölkopf

    Abstract: Model-free and model-based reinforcement learning are two ends of a spectrum. Learning a good policy without a dynamic model can be prohibitively expensive. Learning the dynamic model of a system can reduce the cost of learning the policy, but it can also introduce bias if it is not accurate. We propose a middle ground where instead of the transition model, the sensitivity of the trajectories with… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

  2. arXiv:2007.02938  [pdf, other

    stat.ML cs.LG math.ST

    Causal Feature Selection via Orthogonal Search

    Authors: Ashkan Soleymani, Anant Raj, Stefan Bauer, Bernhard Schölkopf, Michel Besserve

    Abstract: The problem of inferring the direct causal parents of a response variable among a large set of explanatory variables is of high practical importance in many disciplines. However, established approaches often scale at least exponentially with the number of explanatory variables, are difficult to extend to nonlinear relationships, and are difficult to extend to cyclic data. Inspired by {\em Debiased… ▽ More

    Submitted 16 September, 2022; v1 submitted 6 July, 2020; originally announced July 2020.