Showing 1–3 of 3 results for author: Babadi, A

  1. arXiv:2009.10337  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Learning Task-Agnostic Action Spaces for Movement Optimization

    Authors: Amin Babadi, Michiel van de Panne, C. Karen Liu, Perttu Hämäläinen

    Abstract: We propose a novel method for exploring the dynamics of physically based animated characters, and learning a task-agnostic action space that makes movement optimization easier. Like several previous papers, we parameterize actions as target states, and learn a short-horizon goal-conditioned low-level control policy that drives the agent's state towards the targets. Our novel contribution is that w…

    Submitted 23 July, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: Accepted as a regular paper by IEEE Transactions on Visualization and Computer Graphics (TVCG) in July 2021

  2. arXiv:1909.07869  [pdf, other]

    cs.LG stat.ML

    Visualizing Movement Control Optimization Landscapes

    Authors: Perttu Hämäläinen, Juuso Toikka, Amin Babadi, C. Karen Liu

    Abstract: A large body of animation research focuses on optimization of movement control, either as action sequences or policy parameters. However, as closed-form expressions of the objective functions are often not available, our understanding of the optimization problems is limited. Building on recent work on analyzing neural network training, we contribute novel visualizations of high-dimensional control…

    Submitted 22 August, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted to IEEE Transactions on Visualization and Computer Graphics (IEEE TVCG)

  3. arXiv:1810.02541  [pdf, other]

    cs.LG stat.ML

    PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

    Authors: Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen

    Abstract: Proximal Policy Optimization (PPO) is a highly popular model-free reinforcement learning (RL) approach. However, we observe that in a continuous action space, PPO can prematurely shrink the exploration variance, which leads to slow progress and may make the algorithm prone to getting stuck in local optima. Drawing inspiration from CMA-ES, a black-box evolutionary optimization method designed for r…

    Submitted 3 November, 2020; v1 submitted 5 October, 2018; originally announced October 2018.

    Comments: This paper has been accepted to the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2020). The arXiv version also includes an appendix that covers more results.