Showing 1–50 of 118 results for author: Low, K

  1. arXiv:2407.04981  [pdf, other]

    cs.CL cs.LG

    TRACE: TRansformer-based Attribution using Contrastive Embeddings in LLMs

    Authors: Cheng Wang, Xinyang Lu, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The rapid evolution of large language models (LLMs) represents a substantial leap forward in natural language understanding and generation. However, alongside these advancements come significant challenges related to the accountability and transparency of LLM responses. Reliable source attribution is essential to adhering to stringent legal and regulatory standards, including those set forth by th…

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2407.04411  [pdf, other]

    cs.CR cs.AI cs.CL

    Waterfall: Framework for Robust and Scalable Text Watermarking

    Authors: Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: Protecting intellectual property (IP) of text such as articles and code is increasingly important, especially as sophisticated attacks become possible, such as paraphrasing by large language models (LLMs) or even unauthorized training of LLMs on copyrighted text to infringe such IP. However, existing text watermarking methods are not robust enough against such attacks nor scalable to millions of u…

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2406.14507  [pdf, other]

    cs.LG cs.AI

    On Newton's Method to Unlearn Neural Networks

    Authors: Nhung Bui, Xinyang Lu, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Machine unlearning facilitates personal data ownership, including the "right to be forgotten". The proliferation of applications of neural networks (NNs) trained on users' personal data calls for the need to develop algorithms to unlearn an NN. Since retraining is costly, efficiency is often achieved through approximate unlearning which aims to unlearn a trained NN to be close to the retr…

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.14473  [pdf, other]

    cs.LG cs.CL

    Data-Centric AI in the Age of Large Language Models

    Authors: Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, Gregory Kang Ruey Lau, Hieu Dao, Lucas Agussurja, Rachael Hwee Ling Sim, Xiaoqiang Lin, Wenyang Hu, Zhongxiang Dai, Pang Wei Koh, Bryan Kian Hsiang Low

    Abstract: This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs). We start by making the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs, and yet it receives disproportionally low attention from the research community. We identify four specific…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preprint

  5. arXiv:2406.09348  [pdf, other]

    physics.soc-ph cond-mat.stat-mech

    Emergence of Fluctuation Relations in UNO

    Authors: Peter Sidajaya, Jovan Hsuen Khai Low, Clive Cenxin Aw, Valerio Scarani

    Abstract: Fluctuation theorems are generalisations of the second law that describe the relations between work, temperature, and free energy in thermodynamic processes and are used extensively in studies of irreversibility and entropy. Many experiments have verified these relations for different physical systems in the setting of thermodynamics. In this study, we observe the same behavior away from physical…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 10 pages, 8 figures

  6. arXiv:2406.04606  [pdf, other]

    cs.LG cs.AI

    Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions

    Authors: Jingtan Wang, Xiaoqiang Lin, Rui Qiao, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: The increasing complexity of foundational models underscores the necessity for explainability, particularly for fine-tuning, the most widely used training method for adapting models to downstream tasks. Instance attribution, one type of explanation, attributes the model prediction to each training example by an instance score. However, the robustness of instance scores, specifically towards datase…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  7. arXiv:2405.17346  [pdf, other]

    cs.LG cs.AI

    Prompt Optimization with Human Feedback

    Authors: Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have demonstrated remarkable performances in various tasks. However, the performance of LLMs heavily depends on the input prompt, which has given rise to a number of recent works on prompt optimization. However, previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Preprint, 18 pages

  8. arXiv:2405.16122  [pdf, other]

    cs.AI cs.CL cs.LG stat.ML

    Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars

    Authors: Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown impressive capabilities in real-world applications. The capability of in-context learning (ICL) allows us to adapt an LLM to downstream tasks by including input-label exemplars in the prompt without model fine-tuning. However, the quality of these exemplars in the prompt greatly impacts performance, highlighting the need for an effective automated exemplar s…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 23 pages, 1 figure, 23 tables

  9. arXiv:2405.14899  [pdf, other]

    cs.CL cs.AI cs.LG

    DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

    Authors: Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

    Abstract: In-context learning (ICL) allows transformer-based language models that are pre-trained on general text to quickly learn a specific task with a few "task demonstrations" without updating their parameters, significantly boosting their flexibility and generality. ICL possesses many distinct characteristics from conventional machine learning, thereby requiring new approaches to interpret this learnin…

    Submitted 22 May, 2024; originally announced May 2024.

  10. arXiv:2404.07662  [pdf, other]

    cs.LG cs.AI physics.comp-ph physics.data-an stat.ML

    PINNACLE: PINN Adaptive ColLocation and Experimental points selection

    Authors: Gregory Kang Ruey Lau, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Physics-Informed Neural Networks (PINNs), which incorporate PDEs as soft constraints, train with a composite loss function that contains multiple training point types: different types of collocation points chosen during training to enforce each PDE and initial/boundary conditions, and experimental points which are usually costly to obtain via experiments or simulations. Training PINNs using this l…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted to 12th International Conference on Learning Representations (ICLR 2024), 36 pages

  11. arXiv:2404.06939  [pdf, other]

    cs.ET cs.AI

    Fast System Technology Co-Optimization Framework for Emerging Technology Based on Graph Neural Networks

    Authors: Tianliang Ma, Guangxi Fan, Xuguang Sun, Zhihui Deng, Kainlu Low, Leilai Shao

    Abstract: This paper proposes a fast system technology co-optimization (STCO) framework that optimizes power, performance, and area (PPA) for next-generation IC design, addressing the challenges and opportunities presented by novel materials and device architectures. We focus on accelerating the technology level of STCO using AI techniques, by employing graph neural network (GNN)-based approaches for both T…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted by the 61st Design Automation Conference (DAC)

  12. arXiv:2404.01676  [pdf, other]

    cs.LG

    Incentives in Private Collaborative Machine Learning

    Authors: Rachael Hwee Ling Sim, Yehong Zhang, Trong Nghia Hoang, Xinyi Xu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Collaborative machine learning involves training models on data from multiple parties but must incentivize their participation. Existing data valuation methods fairly value and reward each party based on shared data or model parameters but neglect the privacy risks involved. To address this, we introduce differential privacy (DP) as an incentive. Each party can select its required DP guarantee and…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to NeurIPS 2023

  13. arXiv:2403.07591  [pdf, other]

    cs.LG

    Robustifying and Boosting Training-Free Neural Architecture Search

    Authors: Zhenfeng He, Yao Shu, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Neural architecture search (NAS) has become a key component of AutoML and a standard tool to automate the design of deep neural networks. Recently, training-free NAS as an emerging paradigm has successfully reduced the search costs of standard training-based NAS by estimating the true architecture performance with only training-free metrics. Nevertheless, the estimation ability of these metrics ty…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR 2024. Code available at https://github.com/hzf1174/RoBoT

  14. arXiv:2403.02993  [pdf, other]

    cs.AI

    Localized Zeroth-Order Prompt Optimization

    Authors: Wenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiangqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The efficacy of large language models (LLMs) in understanding and generating natural language has aroused a wide interest in developing prompt-based methods to harness the power of black-box LLMs. Existing methodologies usually prioritize a global optimization for finding the global optimum, which however will perform poorly in certain tasks. This thus motivates us to re-think the necessity of fin…

    Submitted 5 March, 2024; originally announced March 2024.

  15. arXiv:2402.02359  [pdf, other]

    math.OC cs.LG

    Incremental Quasi-Newton Methods with Faster Superlinear Convergence Rates

    Authors: Zhuanghua Liu, Luo Luo, Bryan Kian Hsiang Low

    Abstract: We consider the finite-sum optimization problem, where each component function is strongly convex and has Lipschitz continuous gradient and Hessian. The recently proposed incremental quasi-Newton method is based on BFGS update and achieves a local superlinear convergence rate that is dependent on the condition number of the problem. This paper proposes a more efficient quasi-Newton method by incor…

    Submitted 4 February, 2024; originally announced February 2024.

  16. arXiv:2402.02356  [pdf, other]

    math.OC cs.LG

    Decentralized Sum-of-Nonconvex Optimization

    Authors: Zhuanghua Liu, Bryan Kian Hsiang Low

    Abstract: We consider the optimization problem of minimizing the sum-of-nonconvex function, i.e., a convex function that is the average of nonconvex components. The existing stochastic algorithms for such a problem only focus on a single machine and the centralized scenario. In this paper, we study the sum-of-nonconvex optimization in the decentralized setting. We present a new theoretical analysis of the P…

    Submitted 4 February, 2024; originally announced February 2024.

  17. arXiv:2401.14846  [pdf, other]

    cs.LG cs.CV

    Understanding Domain Generalization: A Noise Robustness Perspective

    Authors: Rui Qiao, Bryan Kian Hsiang Low

    Abstract: Despite the rapid development of machine learning algorithms for domain generalization (DG), there is no clear empirical evidence that the existing DG algorithms outperform the classic empirical risk minimization (ERM) across standard benchmarks. To better understand this phenomenon, we investigate whether there are benefits of DG algorithms over ERM through the lens of label noise. Specifically,…

    Submitted 17 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted to the 12th International Conference on Learning Representations (ICLR 2024). Code is available at https://github.com/qiaoruiyt/NoiseRobustDG

  18. arXiv:2312.12784  [pdf, other]

    cs.LG

    Fast Cell Library Characterization for Design Technology Co-Optimization Based on Graph Neural Networks

    Authors: Tianliang Ma, Guangxi Fan, Zhihui Deng, Xuguang Sun, Kainlu Low, Leilai Shao

    Abstract: Design technology co-optimization (DTCO) plays a critical role in achieving optimal power, performance, and area (PPA) for advanced semiconductor process development. Cell library characterization is essential in DTCO flow, but traditional methods are time-consuming and costly. To overcome these challenges, we propose a graph neural network (GNN)-based machine learning model for rapid and accurate…

    Submitted 19 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  19. arXiv:2312.11413  [pdf, other]

    cs.LG cs.AI

    DeRDaVa: Deletion-Robust Data Valuation for Machine Learning

    Authors: Xiao Tian, Rachael Hwee Ling Sim, Jue Fan, Bryan Kian Hsiang Low

    Abstract: Data valuation is concerned with determining a fair valuation of data from data sources to compensate them or to identify training examples that are the most or least useful for predictions. With the rising interest in personal data ownership and data protection regulations, model owners will likely have to fulfil more data deletion requests. This raises issues that have not been addressed by exis…

    Submitted 21 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  20. arXiv:2311.02715  [pdf, other]

    cs.LG stat.ML

    Exploiting Correlated Auxiliary Feedback in Parameterized Bandits

    Authors: Arun Verma, Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low

    Abstract: We study a novel variant of the parameterized bandits problem in which the learner can observe additional auxiliary feedback that is correlated with the observed reward. The auxiliary feedback is readily available in many real-life applications, e.g., an online platform that wants to recommend the best-rated services to its users can observe the user's rating of service (rewards) and collect addit…

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  21. arXiv:2311.01195  [pdf, other]

    cs.LG cs.AI

    Batch Bayesian Optimization for Replicable Experimental Design

    Authors: Zhongxiang Dai, Quoc Phong Nguyen, Sebastian Shenghong Tay, Daisuke Urano, Richalynn Leong, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Many real-world experimental design problems (a) evaluate multiple experimental conditions in parallel and (b) replicate each condition multiple times due to large and heteroscedastic observation noise. Given a fixed total budget, this naturally induces a trade-off between evaluating more unique conditions while replicating each of them fewer times vs. evaluating fewer unique conditions and replic…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  22. arXiv:2310.05373  [pdf, other]

    cs.LG cs.AI

    Quantum Bayesian Optimization

    Authors: Zhongxiang Dai, Gregory Kang Ruey Lau, Arun Verma, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Kernelized bandits, also known as Bayesian optimization (BO), has been a prevalent method for optimizing complicated black-box reward functions. Various BO algorithms have been theoretically shown to enjoy upper bounds on their cumulative regret which are sub-linear in the number T of iterations, and a regret lower bound of Omega(sqrt(T)) has been derived which represents the unavoidable regrets f…

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  23. arXiv:2310.02905  [pdf, other]

    cs.LG cs.AI cs.CL

    Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers

    Authors: Xiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown remarkable instruction-following capabilities and achieved impressive performances in various applications. However, the performances of LLMs depend heavily on the instructions given to them, which are typically manually tuned with substantial human efforts. Recent work has used the query-efficient Bayesian optimization (BO) algorithm to automatically optimi…

    Submitted 23 June, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: Accepted to ICML 2024

  24. arXiv:2310.00646  [pdf, other]

    cs.LG cs.AI stat.ML

    WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data

    Authors: Jingtan Wang, Xinyang Lu, Zitong Zhao, Zhongxiang Dai, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The impressive performances of large language models (LLMs) and their immense potential for commercialization have given rise to serious concerns over the intellectual property (IP) of their training data. In particular, the synthetic texts generated by LLMs may infringe the IP of the data being used to train the LLMs. To this end, it is imperative to be able to (a) identify the data provider who…

    Submitted 1 October, 2023; originally announced October 2023.

  25. arXiv:2308.11624  [pdf]

    cs.LG cs.AI cs.AR

    Revolutionizing TCAD Simulations with Universal Device Encoding and Graph Attention Networks

    Authors: Guangxi Fan, Leilai Shao, Kain Lu Low

    Abstract: An innovative methodology that leverages artificial intelligence (AI) and graph representation for semiconductor device encoding in TCAD device simulation is proposed. A graph-based universal encoding scheme is presented that not only considers material-level and device-level embeddings, but also introduces a novel spatial relationship embedding inspired by interpolation operations typically used…

    Submitted 23 January, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: 32 pages, 13 figures and 4 tables

  26. arXiv:2308.04077  [pdf, other]

    cs.LG cs.AI

    Federated Zeroth-Order Optimization using Trajectory-Informed Surrogate Gradients

    Authors: Yao Shu, Xiaoqiang Lin, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Federated optimization, an emerging paradigm which finds wide real-world applications such as federated learning, enables multiple clients (e.g., edge devices) to collaboratively optimize a global function. The clients do not share their local datasets and typically only share their local gradients. However, the gradient information is not available in many applications of federated optimization,…

    Submitted 8 August, 2023; originally announced August 2023.

  27. arXiv:2308.00629  [pdf, other]

    cs.LG cs.AI

    Hessian-Aware Bayesian Optimization for Decision Making Systems

    Authors: Mohit Rajpal, Lac Gia Tran, Yehong Zhang, Bryan Kian Hsiang Low

    Abstract: Many approaches for optimizing decision making systems rely on gradient based methods requiring informative feedback from the environment. However, in the case where such feedback is sparse or uninformative, such approaches may result in poor performance. Derivative-free approaches such as Bayesian Optimization mitigate the dependency on the quality of gradient feedback, but are known to scale poo…

    Submitted 1 December, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: Fixed a typo

  28. arXiv:2306.05764  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA

    Fair yet Asymptotically Equal Collaborative Learning

    Authors: Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng, Chuan-Sheng Foo, Bryan Kian Hsiang Low

    Abstract: In collaborative learning with streaming data, nodes (e.g., organizations) jointly and continuously learn a machine learning (ML) model by sharing the latest model updates computed from their latest streaming data. For the more resourceful nodes to be willing to share their model updates, they need to be fairly incentivized. This paper explores an incentive design that guarantees fairness so that…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted to 40th International Conference on Machine Learning (ICML 2023), 37 pages

  29. arXiv:2306.04454  [pdf, other]

    cs.LG cs.AI

    Training-Free Neural Active Learning with Initialization-Robustness Guarantees

    Authors: Apivich Hemachandra, Zhongxiang Dai, Jasraj Singh, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: Existing neural active learning algorithms have aimed to optimize the predictive performance of neural networks (NNs) by selecting data for labelling. However, other than a good predictive performance, being robust against random parameter initializations is also a crucial requirement in safety-critical applications. To this end, we introduce our expected variance with Gaussian processes (EV-GP) c…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to 40th International Conference on Machine Learning (ICML 2023), 41 pages

  30. arXiv:2305.14201  [pdf, other]

    cs.LG cs.AI cs.CL

    Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks

    Authors: Tiedong Liu, Bryan Kian Hsiang Low

    Abstract: We introduce Goat, a fine-tuned LLaMA model that significantly outperforms GPT-4 on a range of arithmetic tasks. Fine-tuned on a synthetically generated dataset, Goat achieves state-of-the-art performance on BIG-bench arithmetic sub-task. In particular, the zero-shot Goat-7B matches or even surpasses the accuracy achieved by the few-shot PaLM-540B. Surprisingly, Goat can achieve near-perfect accur…

    Submitted 23 May, 2023; originally announced May 2023.

  31. arXiv:2305.06176  [pdf]

    cs.CL cs.AI cs.LG

    Fine-tuning Language Models with Generative Adversarial Reward Modelling

    Authors: Zhang Ze Yu, Lau Jia Jaw, Zhang Hui, Bryan Kian Hsiang Low

    Abstract: Reinforcement Learning with Human Feedback (RLHF) has been demonstrated to significantly enhance the performance of large language models (LLMs) by aligning their outputs with desired human values through instruction tuning. However, RLHF is constrained by the expertise and productivity limitations of human evaluators. A response to this downside is to fall back to supervised fine-tuning (SFT) wit…

    Submitted 5 March, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 22 pages, 9 figures, 12 tables

  32. arXiv:2301.11135  [pdf, other]

    cs.LG cs.DC

    FedHQL: Federated Heterogeneous Q-Learning

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Cheston Tan, Bryan Kian Hsiang Low, Roger Wattenhofer

    Abstract: Federated Reinforcement Learning (FedRL) encourages distributed agents to learn collectively from each other's experience to improve their performance without exchanging their raw trajectories. The existing work on FedRL assumes that all participating agents are homogeneous, which requires all agents to share the same policy parameterization (e.g., network architectures and training configurations…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: Preprint. Under review

  33. arXiv:2212.00630  [pdf, other]

    cs.LG cs.CY

    Probably Approximate Shapley Fairness with Applications in Machine Learning

    Authors: Zijian Zhou, Xinyi Xu, Rachael Hwee Ling Sim, Chuan Sheng Foo, Kian Hsiang Low

    Abstract: The Shapley value (SV) is adopted in various scenarios in machine learning (ML), including data valuation, agent valuation, and feature attribution, as it satisfies their fairness requirements. However, as exact SVs are infeasible to compute in practice, SV estimates are approximated instead. This approximation step raises an important question: do the SV estimates preserve the fairness guarantees…

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 37th AAAI Conference on Artificial Intelligence (AAAI 2023)

  34. arXiv:2211.08281  [pdf, other]

    q-fin.TR cs.AI cs.LG q-fin.CP q-fin.PM

    Forecasting Bitcoin volatility spikes from whale transactions and CryptoQuant data using Synthesizer Transformer models

    Authors: Dorien Herremans, Kah Wee Low

    Abstract: The cryptocurrency market is highly volatile compared to traditional financial markets. Hence, forecasting its volatility is crucial for risk management. In this paper, we investigate CryptoQuant data (e.g. on-chain analytics, exchange and miner data) and whale-alert tweets, and explore their relationship to Bitcoin's next-day volatility, with a focus on extreme volatility spikes. We propose a dee…

    Submitted 6 October, 2022; originally announced November 2022.

    Comments: Co-first authors

  35. arXiv:2210.06850  [pdf, other]

    cs.LG cs.AI

    Sample-Then-Optimize Batch Neural Thompson Sampling

    Authors: Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO), which uses a Gaussian process (GP) as a surrogate to model its objective function, is popular for black-box optimization. However, due to the limitations of GPs, BO underperforms in some problems such as those with categorical, high-dimensional or image inputs. To this end, recent works have used the highly expressive neural networks (NNs) as the surrogate model and der…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Extended version with proofs and additional experimental details and results, 30 pages

  36. Vacillating about media bias: changing one's mind intermittently within a network of political allies and opponents

    Authors: Nicholas Kah Yean Low, Andrew Melatos

    Abstract: One form of long-term behavior revealed by opinion dynamics simulations is intermittency, where an individual cycles between eras of stable, constant beliefs and turbulent, fluctuating beliefs, for example when inferring the political bias of a media organization. We explore this phenomenon by building an idealized network of Bayesian learners, who infer the bias of a coin from observations of coi…

    Submitted 30 July, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: 29 pages, 18 figures. Published in Physica A

  37. arXiv:2206.09341  [pdf, other]

    cs.LG cs.AI

    Bayesian Optimization under Stochastic Delayed Feedback

    Authors: Arun Verma, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Bayesian optimization (BO) is a widely-used sequential method for zeroth-order optimization of complex and expensive-to-compute black-box functions. The existing BO methods assume that the function evaluation (feedback) is available to the learner immediately or after a fixed delay. Such assumptions may not be practical in many real-life problems like online recommendations, clinical trials, and h…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022

  38. arXiv:2206.06872  [pdf, other]

    cs.LG cs.AI

    On Provably Robust Meta-Bayesian Optimization

    Authors: Zhongxiang Dai, Yizhou Chen, Haibin Yu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO) has become popular for sequential optimization of black-box functions. When BO is used to optimize a target function, we often have access to previous evaluations of potentially related functions. This begs the question as to whether we can leverage these previous experiences to accelerate the current BO task through meta-learning (meta-BO), while ensuring robustness aga…

    Submitted 15 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted to 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), Extended version with proofs and additional experimental details and results, 31 pages

  39. arXiv:2205.14309  [pdf, other]

    cs.LG cs.AI

    Federated Neural Bandits

    Authors: Zhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Recent works on neural contextual bandits have achieved compelling performances due to their ability to leverage the strong representation power of neural networks (NNs) for reward prediction. Many applications of contextual bandits involve multiple agents who collaborate without sharing raw observations, thus giving rise to the setting of federated contextual bandits. Existing works on federated…

    Submitted 28 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: ICLR 2023. Code: https://github.com/daizhongxiang/Federated-Neural-Bandits

  40. arXiv:2205.10997  [pdf, other]

    cs.LG cs.AI

    Data-Efficient Modeling for Precise Power Consumption Estimation of Quadrotor Operations Using Ensemble Learning

    Authors: Wei Dai, Mingcheng Zhang, Kin Huat Low

    Abstract: Electric Take-Off and Landing (eVTOL) aircraft is considered as the major aircraft type in the emerging urban air mobility. Accurate power consumption estimation is crucial to eVTOL, supporting advanced power management strategies and improving the efficiency and safety performance of flight operations. In this study, a framework for power consumption modeling of eVTOL aircraft was established. We…

    Submitted 22 May, 2022; originally announced May 2022.

  41. arXiv:2205.07428  [pdf, other]

    cs.LG cs.GT stat.ML

    On the Convergence of the Shapley Value in Parametric Bayesian Learning Games

    Authors: Lucas Agussurja, Xinyi Xu, Bryan Kian Hsiang Low

    Abstract: Measuring contributions is a classical problem in cooperative game theory where the Shapley value is the most well-known solution concept. In this paper, we establish the convergence property of the Shapley value in parametric Bayesian learning games where players perform a Bayesian inference using their combined data, and the posterior-prior KL divergence is used as the characteristic function. W…

    Submitted 14 June, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: Accepted to the 39th International Conference on Machine Learning (ICML 2022). Extended version with derivations

  42. arXiv:2205.04901  [pdf, other]

    cs.LG math.ST

    Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization

    Authors: Shouri Hu, Haowei Wang, Zhongxiang Dai, Bryan Kian Hsiang Low, Szu Hui Ng

    Abstract: The expected improvement (EI) is one of the most popular acquisition functions for Bayesian optimization (BO) and has demonstrated good empirical performances in many applications for the minimization of simple regret. However, under the evaluation metric of cumulative regret, the performance of EI may not be competitive, and its existing theoretical regret upper bound still has room for improveme…

    Submitted 24 May, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

  43. arXiv:2202.13597  [pdf, other]

    cs.LG stat.ML

    Rectified Max-Value Entropy Search for Bayesian Optimization

    Authors: Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Although the existing max-value entropy search (MES) is based on the widely celebrated notion of mutual information, its empirical performance can suffer due to two misconceptions whose implications on the exploration-exploitation trade-off are investigated in this paper. These issues are essential in the development of future acquisition functions and the improvement of the existing ones as they…

    Submitted 28 February, 2022; originally announced February 2022.

  44. Markov Chain Monte Carlo-Based Machine Unlearning: Unlearning What Needs to be Forgotten

    Authors: Quoc Phong Nguyen, Ryutaro Oikawa, Dinil Mon Divakaran, Mun Choon Chan, Bryan Kian Hsiang Low

    Abstract: As the use of machine learning (ML) models is becoming increasingly popular in many real-world applications, there are practical challenges that need to be addressed for model maintenance. One such challenge is to 'undo' the effect of a specific subset of dataset used for training a model. This specific subset may contain malicious or adversarial data injected by an attacker, which affects the mod…

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: Proceedings of the 2022 ACM Asia Conference on Computer and Communications Security (ASIA CCS '22), May 30-June 3, 2022, Nagasaki, Japan

  45. arXiv:2201.09785  [pdf, other]

    cs.LG cs.AI

    Unifying and Boosting Gradient-Based Training-Free Neural Architecture Search

    Authors: Yao Shu, Zhongxiang Dai, Zhaoxuan Wu, Bryan Kian Hsiang Low

    Abstract: Neural architecture search (NAS) has gained immense popularity owing to its ability to automate neural architecture design. A number of training-free metrics are recently proposed to realize NAS without training, hence making NAS more scalable. Despite their competitive empirical performances, a unified theoretical understanding of these training-free metrics is lacking. As a consequence, (a) the…

    Submitted 12 October, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: Published as a conference paper at NeurIPS 2022

  46. Discerning media bias within a network of political allies and opponents: the idealized example of a biased coin

    Authors: Nicholas Kah Yean Low, Andrew Melatos

    Abstract: Perceptions of political bias in the media are formed directly, through the independent consumption of the published outputs of a media organization, and indirectly, through observing the collective responses of political allies and opponents to the same published outputs. A network of Bayesian learners is constructed to model this system, in which the bias perceived by each agent obeys a probabil…

    Submitted 26 June, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: 38 pages, 30 figures, 8 tables. Published in Physica A

  47. arXiv:2112.09327  [pdf, other]

    cs.LG

    Incentivizing Collaboration in Machine Learning via Synthetic Data Rewards

    Authors: Sebastian Shenghong Tay, Xinyi Xu, Chuan Sheng Foo, Bryan Kian Hsiang Low

    Abstract: This paper presents a novel collaborative generative modeling (CGM) framework that incentivizes collaboration among self-interested parties to contribute data to a pool for training a generative model (e.g., GAN), from which synthetic data are drawn and distributed to the parties as rewards commensurate to their contributions. Distributing synthetic data as rewards (instead of trained models or mo…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 36th AAAI Conference on Artificial Intelligence (AAAI 2022), Extended version with derivations, 42 pages

  48. arXiv:2110.14153  [pdf, other]

    cs.LG cs.CR

    Differentially Private Federated Bayesian Optimization with Distributed Exploration

    Authors: Zhongxiang Dai, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Bayesian optimization (BO) has recently been extended to the federated learning (FL) setting by the federated Thompson sampling (FTS) algorithm, which has promising applications such as federated hyperparameter tuning. However, FTS is not equipped with a rigorous privacy guarantee which is an important consideration in FL. Recent works have incorporated differential privacy (DP) into the training…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted to 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Extended version with proofs and additional experimental details and results, 29 pages

  49. arXiv:2110.14074  [pdf, other]

    cs.LG cs.AI

    Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Wei Jing, Cheston Tan, Bryan Kian Hsiang Low

    Abstract: The growing literature of Federated Learning (FL) has recently inspired Federated Reinforcement Learning (FRL) to encourage multiple agents to federatively build a better decision-making policy without sharing raw trajectories. Despite its promising applications, existing works on FRL fail to I) provide theoretical analysis on its convergence, and II) account for random system failures and adversa…

    Submitted 3 November, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021. Extended version with proofs and additional experimental details and results. New version changes: reduced file size of figures; added a diagram illustrating the problem setting; added link to code on GitHub; modified proof for Theorem 6 (highlighted in red)

  50. arXiv:2109.02533  [pdf, other]

    cs.LG

    Neural Ensemble Search via Bayesian Sampling

    Authors: Yao Shu, Yizhou Chen, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Recently, neural architecture search (NAS) has been applied to automate the design of neural networks in real-world applications. A large number of algorithms have been developed to improve the search cost or the performance of the final selected architectures in NAS. Unfortunately, these NAS algorithms aim to select only one single well-performing architecture from their search spaces and thus ha…

    Submitted 17 June, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: Published as a conference paper at UAI 2022