Skip to main content

Showing 1–5 of 5 results for author: Yang-Zhao, S

.
  1. arXiv:2406.17649  [pdf, other

    cs.LG cs.CR

    Privacy Preserving Reinforcement Learning for Population Processes

    Authors: Samuel Yang-Zhao, Kee Siong Ng

    Abstract: We consider the problem of privacy protection in Reinforcement Learning (RL) algorithms that operate over population processes, a practical but understudied setting that includes, for example, the control of epidemics in large populations of dynamically interacting individuals. In this setting, the RL algorithm interacts with the population over $T$ time steps by receiving population-level statist… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2312.16184  [pdf, other

    cs.AI cs.LG

    Dynamic Knowledge Injection for AIXI Agents

    Authors: Samuel Yang-Zhao, Kee Siong Ng, Marcus Hutter

    Abstract: Prior approximations of AIXI, a Bayesian optimality notion for general reinforcement learning, can only approximate AIXI's Bayesian environment model using an a-priori defined set of models. This is a fundamental source of epistemic uncertainty for the agent in settings where the existence of systematic bias in the predefined model class cannot be resolved by simply collecting more data from the e… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 16 pages, 2 figures, extended length version of paper to be published in AAAI2024

  3. arXiv:2210.06917  [pdf, other

    cs.AI cs.LG

    A Direct Approximation of AIXI Using Logical State Abstractions

    Authors: Samuel Yang-Zhao, Tianyu Wang, Kee Siong Ng

    Abstract: We propose a practical integration of logical state abstraction with AIXI, a Bayesian optimality notion for reinforcement learning agents, to significantly expand the model class that AIXI agents can be approximated over to complex history-dependent and structured environments. The state representation and reasoning framework is based on higher-order logic, which can be used to define and enumerat… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

  4. arXiv:2206.02178  [pdf, other

    cs.AI cs.LG

    Factored Conditional Filtering: Tracking States and Estimating Parameters in High-Dimensional Spaces

    Authors: Dawei Chen, Samuel Yang-Zhao, John Lloyd, Kee Siong Ng

    Abstract: This paper introduces factored conditional filters, new filtering algorithms for simultaneously tracking states and estimating parameters in high-dimensional state spaces. The conditional nature of the algorithms is used to estimate parameters and the factored nature is used to decompose the state space into low-dimensional subspaces in such a way that filtering on these subspaces gives distributi… ▽ More

    Submitted 9 July, 2024; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: 66 pages

    MSC Class: 68T37 ACM Class: I.2.6

  5. arXiv:1905.11702  [pdf, ps, other

    cs.LG stat.ML

    Conditions on Features for Temporal Difference-Like Methods to Converge

    Authors: Marcus Hutter, Samuel Yang-Zhao, Sultan J. Majeed

    Abstract: The convergence of many reinforcement learning (RL) algorithms with linear function approximation has been investigated extensively but most proofs assume that these methods converge to a unique solution. In this paper, we provide a complete characterization of non-uniqueness issues for a large class of reinforcement learning algorithms, simultaneously unifying many counter-examples to convergence… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: 13 pages, 6 figures