Skip to main content

Showing 1–9 of 9 results for author: Hefny, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:1803.01489  [pdf, other

    stat.ML cs.AI cs.LG

    Recurrent Predictive State Policy Networks

    Authors: Ahmed Hefny, Zita Marinho, Wen Sun, Siddhartha Srinivasa, Geoffrey Gordon

    Abstract: We introduce Recurrent Predictive State Policy (RPSP) networks, a recurrent architecture that brings insights from predictive state representations to reinforcement learning in partially observable environments. Predictive state policy networks consist of a recursive filter, which keeps track of a belief about the state of the environment, and a reactive policy that directly maps beliefs to action… ▽ More

    Submitted 4 March, 2018; originally announced March 2018.

  2. arXiv:1705.09353  [pdf, other

    stat.ML

    Predictive State Recurrent Neural Networks

    Authors: Carlton Downey, Ahmed Hefny, Boyue Li, Byron Boots, Geoffrey Gordon

    Abstract: We present a new model, Predictive State Recurrent Neural Networks (PSRNNs), for filtering and prediction in dynamical systems. PSRNNs draw on insights from both Recurrent Neural Networks (RNNs) and Predictive State Representations (PSRs), and inherit advantages from both types of models. Like many successful RNN architectures, PSRNNs use (potentially deeply composed) bilinear transfer functions t… ▽ More

    Submitted 17 June, 2017; v1 submitted 25 May, 2017; originally announced May 2017.

  3. arXiv:1702.04121  [pdf, other

    stat.ML cs.LG

    Practical Learning of Predictive State Representations

    Authors: Carlton Downey, Ahmed Hefny, Geoffrey Gordon

    Abstract: Over the past decade there has been considerable interest in spectral algorithms for learning Predictive State Representations (PSRs). Spectral algorithms have appealing theoretical guarantees; however, the resulting models do not always perform well on inference tasks in practice. One reason for this behavior is the mismatch between the intended task (accurate filtering or prediction) and the los… ▽ More

    Submitted 14 February, 2017; originally announced February 2017.

  4. arXiv:1702.03537  [pdf, other

    stat.ML

    An Efficient, Expressive and Local Minima-free Method for Learning Controlled Dynamical Systems

    Authors: Ahmed Hefny, Carlton Downey, Geoffrey J. Gordon

    Abstract: We propose a framework for modeling and estimating the state of controlled dynamical systems, where an agent can affect the system through actions and receives partial observations. Based on this framework, we propose the Predictive State Representation with Random Fourier Features (RFFPSR). A key property in RFF-PSRs is that the state estimate is represented by a conditional distribution of futur… ▽ More

    Submitted 28 February, 2018; v1 submitted 12 February, 2017; originally announced February 2017.

  5. arXiv:1603.06160  [pdf, other

    math.OC cs.LG cs.NE stat.ML

    Stochastic Variance Reduction for Nonconvex Optimization

    Authors: Sashank J. Reddi, Ahmed Hefny, Suvrit Sra, Barnabas Poczos, Alex Smola

    Abstract: We study nonconvex finite-sum problems and analyze stochastic variance reduced gradient (SVRG) methods for them. SVRG and related methods have recently surged into prominence for convex optimization given their edge over stochastic gradient descent (SGD); but their theoretical analysis almost exclusively assumes convexity. In contrast, we prove non-asymptotic rates of convergence (to stationary po… ▽ More

    Submitted 4 April, 2016; v1 submitted 19 March, 2016; originally announced March 2016.

    Comments: Minor feedback changes

  6. arXiv:1506.06840  [pdf, other

    cs.LG stat.ML

    On Variance Reduction in Stochastic Gradient Descent and its Asynchronous Variants

    Authors: Sashank J. Reddi, Ahmed Hefny, Suvrit Sra, Barnabás Póczos, Alex Smola

    Abstract: We study optimization algorithms based on variance reduction for stochastic gradient descent (SGD). Remarkable recent progress has been made in this direction through development of algorithms like SAG, SVRG, SAGA. These algorithms have been shown to outperform SGD, both theoretically and empirically. However, asynchronous versions of these algorithms---a crucial requirement for modern large-scale… ▽ More

    Submitted 24 January, 2016; v1 submitted 22 June, 2015; originally announced June 2015.

  7. arXiv:1505.05310  [pdf, other

    stat.ML cs.LG

    Supervised Learning for Dynamical System Learning

    Authors: Ahmed Hefny, Carlton Downey, Geoffrey Gordon

    Abstract: Recently there has been substantial interest in spectral methods for learning dynamical systems. These methods are popular since they often offer a good tradeoff between computational and statistical efficiency. Unfortunately, they can be difficult to use and extend in practice: e.g., they can make it difficult to incorporate prior information such as sparsity or structure. To address this problem… ▽ More

    Submitted 4 November, 2015; v1 submitted 20 May, 2015; originally announced May 2015.

  8. arXiv:1409.2617  [pdf, other

    math.OC stat.ML

    Large-scale randomized-coordinate descent methods with non-separable linear constraints

    Authors: Sashank Reddi, Ahmed Hefny, Carlton Downey, Avinava Dubey, Suvrit Sra

    Abstract: We develop randomized (block) coordinate descent (CD) methods for linearly constrained convex optimization. Unlike most CD methods, we do not assume the constraints to be separable, but let them be coupled linearly. To our knowledge, ours is the first CD method that allows linear coupling constraints, without making the global iteration complexity have an exponential dependence on the number of co… ▽ More

    Submitted 10 June, 2015; v1 submitted 9 September, 2014; originally announced September 2014.

  9. arXiv:1208.4411  [pdf, other

    stat.ML

    A non-parametric mixture model for topic modeling over time

    Authors: Avinava Dubey, Ahmed Hefny, Sinead Williamson, Eric P. Xing

    Abstract: A single, stationary topic model such as latent Dirichlet allocation is inappropriate for modeling corpora that span long time periods, as the popularity of topics is likely to change over time. A number of models that incorporate time have been proposed, but in general they either exhibit limited forms of temporal variation, or require computationally expensive inference methods. In this paper we… ▽ More

    Submitted 21 August, 2012; originally announced August 2012.

    Comments: 9 pages