Showing 1–7 of 7 results for author: Fan, F X

  1. arXiv:2406.01631  [pdf, other]

    cs.IR cs.LG

    An LLM-based Recommender System Environment

    Authors: Nathan Corecco, Giorgio Piatti, Luca A. Lanzendörfer, Flint Xiaofeng Fan, Roger Wattenhofer

    Abstract: Reinforcement learning (RL) has gained popularity in the realm of recommender systems due to its ability to optimize long-term rewards and guide users in discovering relevant content. However, the successful implementation of RL in recommender systems is challenging because of several factors, including the limited availability of online data for training on-policy methods. This scarcity requires…

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2403.20156  [pdf, other]

    cs.LG cs.AI

    CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

    Authors: Hei Yi Mak, Flint Xiaofeng Fan, Luca A. Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer

    Abstract: In this study, we delve into Federated Reinforcement Learning (FedRL) in the context of value-based agents operating across diverse Markov Decision Processes (MDPs). Existing FedRL methods typically aggregate agents' learning by averaging the value functions across them to improve their performance. However, this aggregation strategy is suboptimal in heterogeneous environments where agents converg…

    Submitted 16 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  3. arXiv:2401.03489  [pdf, other]

    cs.LG cs.AI cs.DC cs.MA

    Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence

    Authors: Philip Jordan, Florian Grötschla, Flint Xiaofeng Fan, Roger Wattenhofer

    Abstract: In Federated Reinforcement Learning (FRL), agents aim to collaboratively learn a common task, while each agent is acting in its local environment without exchanging raw trajectories. Existing approaches for FRL either (a) do not provide any fault-tolerance guarantees (against misbehaving agents), or (b) rely on a trusted central agent (a single point of failure) for aggregating updates. We provide…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted at AAMAS'24

  4. arXiv:2306.15968  [pdf, other]

    cs.RO cs.LG

    Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning

    Authors: Xinyang Lu, Flint Xiaofeng Fan, Tianying Wang

    Abstract: Reinforcement Learning (RL) has made promising progress in planning and decision-making for Autonomous Vehicles (AVs) in simple driving scenarios. However, existing RL algorithms for AVs fail to learn critical driving skills in complex urban scenarios. First, urban driving scenarios require AVs to handle multiple driving tasks of which conventional RL algorithms are incapable. Second, the presence…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems

  5. arXiv:2301.11135  [pdf, other]

    cs.LG cs.DC

    FedHQL: Federated Heterogeneous Q-Learning

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Cheston Tan, Bryan Kian Hsiang Low, Roger Wattenhofer

    Abstract: Federated Reinforcement Learning (FedRL) encourages distributed agents to learn collectively from each other's experience to improve their performance without exchanging their raw trajectories. The existing work on FedRL assumes that all participating agents are homogeneous, which requires all agents to share the same policy parameterization (e.g., network architectures and training configurations…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: Preprint. Under review

  6. arXiv:2205.14309  [pdf, other]

    cs.LG cs.AI

    Federated Neural Bandits

    Authors: Zhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Recent works on neural contextual bandits have achieved compelling performances due to their ability to leverage the strong representation power of neural networks (NNs) for reward prediction. Many applications of contextual bandits involve multiple agents who collaborate without sharing raw observations, thus giving rise to the setting of federated contextual bandits. Existing works on federated…

    Submitted 28 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: ICLR 2023. Code: https://github.com/daizhongxiang/Federated-Neural-Bandits

  7. arXiv:2110.14074  [pdf, other]

    cs.LG cs.AI

    Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Wei Jing, Cheston Tan, Bryan Kian Hsiang Low

    Abstract: The growing literature of Federated Learning (FL) has recently inspired Federated Reinforcement Learning (FRL) to encourage multiple agents to federatively build a better decision-making policy without sharing raw trajectories. Despite its promising applications, existing works on FRL fail to I) provide theoretical analysis on its convergence, and II) account for random system failures and adversa…

    Submitted 3 November, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021. Extended version with proofs and additional experimental details and results. New version changes: reduced file size of figures; added a diagram illustrating the problem setting; added link to code on GitHub; modified proof for Theorem 6 (highlighted in red)