Skip to main content

Showing 1–1 of 1 results for author: Ward, P N

Searching in archive cs. Search in all archives.
.
  1. arXiv:1906.02771  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies

    Authors: Patrick Nadeem Ward, Ariella Smofsky, Avishek Joey Bose

    Abstract: Deep Reinforcement Learning (DRL) algorithms for continuous action spaces are known to be brittle toward hyperparameters as well as \cut{being}sample inefficient. Soft Actor Critic (SAC) proposes an off-policy deep actor critic algorithm within the maximum entropy RL framework which offers greater stability and empirical gains. The choice of policy distribution, a factored Gaussian, is motivated b… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: INNF workshop, International Conference on Machine Learning 2019, Long Beach CA, USA