Skip to main content

Showing 1–3 of 3 results for author: Jui, S

Searching in archive math. Search in all archives.
.
  1. arXiv:2312.15246  [pdf, other

    cs.LG cs.AI math.NA math.PR

    A Theory of Non-Acyclic Generative Flow Networks

    Authors: Leo Maxime Brunswic, Yinchuan Li, Yushun Xu, Shangling Jui, Lizhuang Ma

    Abstract: GFlowNets is a novel flow-based method for learning a stochastic policy to generate objects via a sequence of actions and with probability proportional to a given positive reward. We contribute to relaxing hypotheses limiting the application range of GFlowNets, in particular: acyclicity (or lack thereof). To this end, we extend the theory of GFlowNets on measurable spaces which includes continuous… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: 16 pages, 8 figures, 1 table, AAAI 2024

    MSC Class: 68T07; 68T20; 60J05; 60J20; 60J22; 65F45; 65J20; 68T05; 68T20

  2. arXiv:2112.05368  [pdf, other

    math.OC

    Sample Average Approximation for Stochastic Optimization with Dependent Data: Performance Guarantees and Tractability

    Authors: Yafei Wang, Bo Pan, Wei Tu, Peng Liu, Bei Jiang, Chao Gao, Wei Lu, Shangling Jui, Linglong Kong

    Abstract: Sample average approximation (SAA), a popular method for tractably solving stochastic optimization problems, enjoys strong asymptotic performance guarantees in settings with independent training samples. However, these guarantees are not known to hold generally with dependent samples, such as in online learning with time series data or distributed computing with Markovian training samples. In this… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

  3. arXiv:2110.08896  [pdf, other

    cs.LG math.OC

    Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization

    Authors: Ke Sun, Yafei Wang, Yi Liu, Yingnan Zhao, Bo Pan, Shangling Jui, Bei Jiang, Linglong Kong

    Abstract: Anderson mixing has been heuristically applied to reinforcement learning (RL) algorithms for accelerating convergence and improving the sampling efficiency of deep RL. Despite its heuristic improvement of convergence, a rigorous mathematical justification for the benefits of Anderson mixing in RL has not yet been put forward. In this paper, we provide deeper insights into a class of acceleration s… ▽ More

    Submitted 20 October, 2021; v1 submitted 17 October, 2021; originally announced October 2021.