Showing 1–9 of 9 results for author: Kuang, N L

  1. arXiv:2407.00610  [pdf, other]

    cs.LG

    Diff-BBO: Diffusion-Based Inverse Modeling for Black-Box Optimization

    Authors: Dongxia Wu, Nikki Lijing Kuang, Ruijia Niu, Yi-An Ma, Rose Yu

    Abstract: Black-box optimization (BBO) aims to optimize an objective function by iteratively querying a black-box oracle. This process demands sample-efficient optimization due to the high computational cost of function evaluations. While prior studies focus on forward approaches to learn surrogates for the unknown objective function, they struggle with high-dimensional inputs where valid inputs form a smal…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2310.18919  [pdf, other]

    cs.LG cs.AI stat.ML

    Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation

    Authors: Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yi-An Ma

    Abstract: Recent studies in reinforcement learning (RL) have made significant progress by leveraging function approximation to alleviate the sample complexity hurdle for better performance. Despite the success, existing provably efficient algorithms typically rely on the accessibility of immediate feedback upon taking actions. The failure to account for the impact of delay in observations can significantly…

    Submitted 3 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

  3. arXiv:2306.08803  [pdf, other]

    cs.LG cs.AI stat.ML

    Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning

    Authors: Amin Karbasi, Nikki Lijing Kuang, Yi-An Ma, Siddharth Mitra

    Abstract: Thompson sampling (TS) is widely used in sequential decision making due to its ease of use and appealing empirical performance. However, many existing analytical and empirical results for TS rely on restrictive assumptions on reward distributions, such as belonging to conjugate families, which limits their applicability in realistic scenarios. Moreover, sequential decision making problems are ofte…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: ICML 2023

    ACM Class: G.3; I.2.0

  4. arXiv:2207.11208  [pdf, other]

    stat.ML cs.LG

    Statistical and Computational Trade-offs in Variational Inference: A Case Study in Inferential Model Selection

    Authors: Kush Bhatia, Nikki Lijing Kuang, Yi-An Ma, Yixin Wang

    Abstract: Variational inference has recently emerged as a popular alternative to the classical Markov chain Monte Carlo (MCMC) in large-scale Bayesian inference. The core idea is to trade statistical accuracy for computational efficiency. In this work, we study these statistical and computational trade-offs in variational inference via a case study in inferential model selection. Focusing on Gaussian infere…

    Submitted 6 August, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: 57 pages, 8 figures

  5. arXiv:1911.09891  [pdf, other]

    cs.IR cs.LG cs.MM

    Performance Effectiveness of Multimedia Information Search Using the Epsilon-Greedy Algorithm

    Authors: Nikki Lijing Kuang, Clement H. C. Leung

    Abstract: In the search and retrieval of multimedia objects, it is impractical to either manually or automatically extract the contents for indexing since most of the multimedia contents are not machine extractable, while manual extraction tends to be highly laborious and time-consuming. However, by systematically capturing and analyzing the feedback patterns of human users, vital information concerning the…

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: 8 pages, 10 figures. IEEE ICMLA 2019

  6. arXiv:1911.09882  [pdf, other]

    cs.AI cs.IR cs.LG cs.MM

    Analysis of Evolutionary Behavior in Self-Learning Media Search Engines

    Authors: Nikki Lijing Kuang, Clement H. C. Leung

    Abstract: The diversity of intrinsic qualities of multimedia entities tends to impede their effective retrieval. In a Self-Learning Search Engine architecture, the subtle nuances of human perceptions and deep knowledge are taught and captured through unsupervised reinforcement learning, where the degree of reinforcement may be suitably calibrated. Such architectural paradigm enables indexes to evolve natural…

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: IEEE BigData 2019

  7. arXiv:1906.09340  [pdf]

    cs.LG cs.AI

    Leveraging Reinforcement Learning Techniques for Effective Policy Adoption and Validation

    Authors: Nikki Lijing Kuang, Clement H. C. Leung

    Abstract: Rewards and punishments in different forms are pervasive and present in a wide variety of decision-making scenarios. By observing the outcome of a sufficient number of repeated trials, one would gradually learn the value and usefulness of a particular policy or strategy. However, in a given environment, the outcomes resulting from different trials are subject to chance influence and variations. In…

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: 12 pages; ICCSA 2019

  8. arXiv:1902.04179  [pdf]

    cs.LG cs.AI stat.ML

    Performance Dynamics and Termination Errors in Reinforcement Learning: A Unifying Perspective

    Authors: Nikki Lijing Kuang, Clement H. C. Leung

    Abstract: In reinforcement learning, a decision needs to be made at some point as to whether it is worthwhile to carry on with the learning process or to terminate it. In many such situations, stochastic elements are often present which govern the occurrence of rewards, with the sequential occurrences of positive rewards randomly interleaved with negative rewards. For most practical learners, the learning i…

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: Short Paper in AIKE 2018

  9. arXiv:1902.04178  [pdf]

    cs.LG cs.AI stat.ML

    Stochastic Reinforcement Learning

    Authors: Nikki Lijing Kuang, Clement H. C. Leung, Vienne W. K. Sung

    Abstract: In reinforcement learning episodes, the rewards and punishments are often non-deterministic, and there are invariably stochastic elements governing the underlying situation. Such stochastic elements are often numerous and cannot be known in advance, and they have a tendency to obscure the underlying rewards and punishments patterns. Indeed, if stochastic elements were absent, the same outcome woul…

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: AIKE 2018