Showing 1–50 of 223 results for author: Dai, Z

  1. arXiv:2407.00051  [pdf, other]

    cs.DC

    SAGIPS: A Scalable Asynchronous Generative Inverse Problem Solver

    Authors: Daniel Lersch, Malachi Schram, Zhenyu Dai, Kishansingh Rajput, Xingfu Wu, N. Sato, J. Taylor Childers

    Abstract: Large scale, inverse problem solving deep learning algorithms have become an essential part of modern research and industrial applications. The complexity of the underlying inverse problem often poses challenges to the algorithm and requires the proper utilization of high-performance computing systems. Most deep learning algorithms require, due to their design, custom parallelization techniques in… ▽ More

    Submitted 11 June, 2024; originally announced July 2024.

  2. arXiv:2406.18202  [pdf, other]

    physics.geo-ph cs.DB

    GlobalTomo: A global dataset for physics-ML seismic wavefield modeling and FWI

    Authors: Shiqian Li, Zhi Li, Zhancun Mu, Shiji Xin, Zhixiang Dai, Kuangdai Leng, Ruihua Zhang, Xiaodong Song, Yixin Zhu

    Abstract: Global seismic tomography, taking advantage of seismic waves from natural earthquakes, provides essential insights into the earth's internal dynamics. Advanced Full-waveform Inversion (FWI) techniques, whose aim is to meticulously interpret every detail in seismograms, confront formidable computational demands in forward modeling and adjoint simulations on a global scale. Recent advancements in Ma… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 36 pages

  3. arXiv:2406.17425  [pdf, other]

    cs.LG cs.AI cs.CR cs.MA

    CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems

    Authors: Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao

    Abstract: Cooperative Multi-Agent Reinforcement Learning (CMARL) strategies are well known to be vulnerable to adversarial perturbations. Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks. However, gaining complete access to victim agents in real-world environment… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.17255  [pdf, other]

    cs.CL

    MPCODER: Multi-user Personalized Code Generator with Explicit and Implicit Style Representation Learning

    Authors: Zhenlong Dai, Chang Yao, WenKang Han, Ying Yuan, Zhipeng Gao, Jingyuan Chen

    Abstract: Large Language Models (LLMs) have demonstrated great potential for assisting developers in their daily development. However, most research focuses on generating correct code; how to use LLMs to generate personalized code has seldom been investigated. To bridge this gap, we proposed MPCoder (Multi-user Personalized Code Generator) to generate personalized code for multiple users. To better learn co… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024, Main Conference

  5. arXiv:2406.14473  [pdf, other]

    cs.LG cs.CL

    Data-Centric AI in the Age of Large Language Models

    Authors: Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, Gregory Kang Ruey Lau, Hieu Dao, Lucas Agussurja, Rachael Hwee Ling Sim, Xiaoqiang Lin, Wenyang Hu, Zhongxiang Dai, Pang Wei Koh, Bryan Kian Hsiang Low

    Abstract: This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs). We start by making the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs, and yet it receives disproportionally low attention from the research community. We identify four specific… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preprint

  6. arXiv:2406.13121  [pdf, other]

    cs.CL cs.AI cs.IR

    Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

    Authors: Jinhyuk Lee, Anthony Chen, Zhuyun Dai, Dheeru Dua, Devendra Singh Sachan, Michael Boratko, Yi Luan, Sébastien M. R. Arnold, Vincent Perot, Siddharth Dalmia, Hexiang Hu, Xudong Lin, Panupong Pasupat, Aida Amini, Jeremy R. Cole, Sebastian Riedel, Iftekhar Naim, Ming-Wei Chang, Kelvin Guu

    Abstract: Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging LCLMs' ability to natively ingest and process entire corpora of information offers numerous advantages. It enhances user-friendliness by eliminating the need for specialized knowledge of tools, provides robust end-to-… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 29 pages. Dataset available at https://github.com/google-deepmind/loft

  7. arXiv:2406.11462  [pdf, other]

    cs.MM cs.GR cs.SD eess.AS

    MusicScore: A Dataset for Music Score Modeling and Generation

    Authors: Yuheng Lin, Zheqi Dai, Qiuqiang Kong

    Abstract: Music scores are written representations of music and contain rich information about musical components. The visual information on music scores includes notes, rests, staff lines, clefs, dynamics, and articulations. This visual information in music scores contains more semantic information than audio and symbolic representations of music. Previous music score datasets have limited sizes and are ma… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Dataset paper, dataset link: https://huggingface.co/datasets/ZheqiDAI/MusicScore

  8. arXiv:2406.07908  [pdf, other]

    cs.LG cs.AI stat.ML

    Ablation Based Counterfactuals

    Authors: Zheng Dai, David K Gifford

    Abstract: Diffusion models are a class of generative models that generate high-quality samples, but at present it is difficult to characterize how they depend upon their training data. This difficulty raises scientific and regulatory questions, and is a consequence of the complexity of diffusion models and their sampling process. To analyze this dependence, we introduce Ablation Based Counterfactuals (ABC),… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures, appendix included

  9. arXiv:2406.04681  [pdf, other]

    math.OC cs.SC math.AG

    Whitney Stratification of Algebraic Boundaries of Convex Semi-algebraic Sets

    Authors: Zihao Dai, Zijia Li, Zhi-Hong Yang, Lihong Zhi

    Abstract: Algebraic boundaries of convex semi-algebraic sets are closely related to polynomial optimization problems. Building upon Rainer Sinn's work, we refine the stratification of iterated singular loci to a Whitney (a) stratification, which gives a list of candidates of varieties whose dual is an irreducible component of the algebraic boundary of the dual convex body. We also present an algorithm based… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    MSC Class: 52A99; 14N05; 14P10; 51N35; 14Q15

  10. arXiv:2406.01922  [pdf, ps, other]

    eess.SP cs.IT

    Performance Analysis of Hybrid Cellular and Cell-free MIMO Network

    Authors: Zhuoyin Dai, Jingran Xu, Xiaoli Xu, Ruoguang Li, Yong Zeng

    Abstract: Cell-free wireless communication is envisioned as one of the most promising network architectures, which can achieve stable and uniform communication performance while improving the system energy and spectrum efficiency. The deployment of cell-free networks is envisioned to be a long-term evolutionary process, in which cell-free access points (APs) will be gradually introduced into the communicatio… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  11. arXiv:2405.18251  [pdf, other]

    cs.RO eess.SY math.OC

    Sensor-Based Distributionally Robust Control for Safe Robot Navigation in Dynamic Environments

    Authors: Kehan Long, Yinzhuang Yi, Zhirui Dai, Sylvia Herbert, Jorge Cortés, Nikolay Atanasov

    Abstract: We introduce a novel method for safe mobile robot navigation in dynamic, unknown environments, utilizing onboard sensing to impose safety constraints without the need for accurate map reconstruction. Traditional methods typically rely on detailed map information to synthesize safe stabilizing controls for mobile robots, which can be computationally demanding and less effective, particularly in dyn… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Submitted to the International Journal of Robotics Research (IJRR). Project page: https://existentialrobotics.org/DR_Safe_Navigation_Webpage

  12. arXiv:2405.17346  [pdf, other]

    cs.LG cs.AI

    Prompt Optimization with Human Feedback

    Authors: Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have demonstrated remarkable performances in various tasks. However, the performance of LLMs heavily depends on the input prompt, which has given rise to a number of recent works on prompt optimization. However, previous works often require the availability of a numeric score to assess the quality of every prompt. Unfortunately, when a human user interacts with a black… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Preprint, 18 pages

  13. arXiv:2405.16122  [pdf, other]

    cs.AI cs.CL cs.LG stat.ML

    Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars

    Authors: Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown impressive capabilities in real-world applications. The capability of in-context learning (ICL) allows us to adapt an LLM to downstream tasks by including input-label exemplars in the prompt without model fine-tuning. However, the quality of these exemplars in the prompt greatly impacts performance, highlighting the need for an effective automated exemplar s… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 23 pages, 1 figure, 23 tables

  14. arXiv:2405.15316  [pdf, other]

    cs.LG cs.CR

    Decaf: Data Distribution Decompose Attack against Federated Learning

    Authors: Zhiyang Dai, Chunyi Zhou, Anmin Fu

    Abstract: In contrast to prevalent Federated Learning (FL) privacy inference techniques such as generative adversarial networks attacks, membership inference attacks, property inference attacks, and model inversion attacks, we devise an innovative privacy threat: the Data Distribution Decompose Attack on FL, termed Decaf. This attack enables an honest-but-curious FL server to meticulously profile the propor… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  15. arXiv:2405.02639  [pdf, other]

    cs.RO

    Wall-Climbing Performance of Gecko-inspired Robot with Soft Feet and Digits enhanced by Gravity Compensation

    Authors: Bingcheng Wang, Zhiyuan Weng, Haoyu Wang, Shuangjie Wang, Zhouyi Wang, Zhendong Dai, Ardian Jusufi

    Abstract: Gravitational forces can induce deviations in body posture from desired configurations in multi-legged arboreal robot locomotion with low leg stiffness, affecting the contact angle between the swing leg's end-effector and the climbing surface during the gait cycle. The relationship between desired and actual foot positions is investigated here in a leg-stiffness-enhanced model under external force… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  16. arXiv:2404.09625  [pdf, other]

    cs.CR cs.AI cs.LG

    Privacy-Preserving Intrusion Detection using Convolutional Neural Networks

    Authors: Martin Kodys, Zhongmin Dai, Vrizlynn L. L. Thing

    Abstract: Privacy-preserving analytics is designed to protect valuable assets. A common service provision involves the input data from the client and the model on the analyst's side. The importance of the privacy preservation is fuelled by legal obligations and intellectual property concerns. We explore the use case of a model owner providing an analytic service on customer's private data. No information ab… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted at IEEE Conference on Artificial Intelligence (CAI) 2024

  17. arXiv:2403.20327  [pdf, other]

    cs.CL cs.AI

    Gecko: Versatile Text Embeddings Distilled from Large Language Models

    Authors: Jinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, Blair Chen, Daniel Cer, Jeremy R. Cole, Kai Hui, Michael Boratko, Rajvi Kapadia, Wen Ding, Yi Luan, Sai Meher Karthik Duddu, Gustavo Hernandez Abrego, Weiqiang Shi, Nithi Gupta, Aditya Kusupati, Prateek Jain, Siddhartha Reddy Jonnalagadda, Ming-Wei Chang, Iftekhar Naim

    Abstract: We present Gecko, a compact and versatile text embedding model. Gecko achieves strong retrieval performance by leveraging a key idea: distilling knowledge from large language models (LLMs) into a retriever. Our two-step distillation process begins with generating diverse, synthetic paired data using an LLM. Next, we further refine the data quality by retrieving a set of candidate passages for each… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 18 pages

  18. arXiv:2403.14781  [pdf, other]

    cs.CV

    Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

    Authors: Shenhao Zhu, Junming Leo Chen, Zuozhuo Dai, Qingkun Su, Yinghui Xu, Xun Cao, Yao Yao, Hao Zhu, Siyu Zhu

    Abstract: In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in current human generative techniques. The methodology utilizes the SMPL (Skinned Multi-Person Linear) model as the 3D human parametric model to establish a unified representation of body shape and pose. Thi… ▽ More

    Submitted 1 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  19. arXiv:2403.11568  [pdf, other]

    cs.CV

    EffiVED: Efficient Video Editing via Text-instruction Diffusion Models

    Authors: Zhenghao Zhang, Zuozhuo Dai, Long Qin, Weizhi Wang

    Abstract: Large-scale text-to-video models have shown remarkable abilities, but their direct application in video editing remains challenging due to limited available datasets. Current video editing methods commonly require per-video fine-tuning of diffusion models or specific inversion optimization to ensure high-fidelity edits. In this paper, we introduce EffiVED, an efficient diffusion-based model that d… ▽ More

    Submitted 5 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  20. arXiv:2403.07591  [pdf, other]

    cs.LG

    Robustifying and Boosting Training-Free Neural Architecture Search

    Authors: Zhenfeng He, Yao Shu, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Neural architecture search (NAS) has become a key component of AutoML and a standard tool to automate the design of deep neural networks. Recently, training-free NAS as an emerging paradigm has successfully reduced the search costs of standard training-based NAS by estimating the true architecture performance with only training-free metrics. Nevertheless, the estimation ability of these metrics ty… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR 2024. Code available at https://github.com/hzf1174/RoBoT

  21. arXiv:2403.06826  [pdf, other]

    cs.LG cs.AI stat.ML

    In-context Exploration-Exploitation for Reinforcement Learning

    Authors: Zhenwen Dai, Federico Tomasi, Sina Ghiassian

    Abstract: In-context learning is a promising approach for online policy learning of offline reinforcement learning (RL) methods, which can be achieved at inference time without gradient optimization. However, this method is hindered by significant computational costs resulting from the gathering of large training trajectory sets and the need to train large Transformer models. We address this challenge by in… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Published at ICLR 2024

  22. arXiv:2403.04652  [pdf, other]

    cs.CL cs.AI

    Yi: Open Foundation Models by 01.AI

    Authors: 01. AI, :, Alex Young, Bei Chen, Chao Li, Chengen Huang, Ge Zhang, Guanwei Zhang, Heng Li, Jiangcheng Zhu, Jianqun Chen, Jing Chang, Kaidong Yu, Peng Liu, Qiang Liu, Shawn Yue, Senbin Yang, Shiming Yang, Tao Yu, Wen Xie, Wenhao Huang, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Pengcheng Nie, et al. (7 additional authors not shown)

    Abstract: We introduce the Yi model family, a series of language and multimodal models that demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models, 200K long context models, depth-upscaled models, and vision-language models. Our base models achieve strong performance on a wide range of benchmarks like MMLU,… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  23. arXiv:2403.02993  [pdf, other]

    cs.AI

    Localized Zeroth-Order Prompt Optimization

    Authors: Wenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiangqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The efficacy of large language models (LLMs) in understanding and generating natural language has aroused a wide interest in developing prompt-based methods to harness the power of black-box LLMs. Existing methodologies usually prioritize a global optimization for finding the global optimum, which however will perform poorly in certain tasks. This thus motivates us to re-think the necessity of fin… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  24. arXiv:2402.18682  [pdf, other]

    cs.RO

    Acoustic tactile sensing for mobile robot wheels

    Authors: Wilfred Mason, David Brenken, Falcon Z. Dai, Ricardo Gonzalo Cruz Castillo, Olivier St-Martin Cormier, Audrey Sedal

    Abstract: Tactile sensing in mobile robots remains under-explored, mainly due to challenges related to sensor integration and the complexities of distributed sensing. In this work, we present a tactile sensing architecture for mobile robots based on wheel-mounted acoustic waveguides. Our sensor architecture enables tactile sensing along the entire circumference of a wheel with a single active component: an… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 12 pages, 12 figures

  25. arXiv:2312.17503  [pdf, other]

    cs.LG cs.GT

    HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

    Authors: Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang

    Abstract: Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests across multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single chan… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Report number: 23-NX-HOIX

  26. arXiv:2312.16149  [pdf, other]

    cs.SD eess.AS

    SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network

    Authors: Yuhang He, Zhuangzhuang Dai, Long Chen, Niki Trigoni, Andrew Markham

    Abstract: In this paper, we study an underexplored, yet important and challenging problem: counting the number of distinct sounds in raw audio characterized by a high degree of polyphonicity. We do so by systematically proposing a novel end-to-end trainable neural network (which we call DyDecNet, consisting of a dyadic decomposition front-end and backbone network), and quantifying the difficulty level of co… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: AAAI2024 Paper

  27. arXiv:2312.15567  [pdf, other]

    cs.HC

    Conversational Co-Speech Gesture Generation via Modeling Dialog Intention, Emotion, and Context with Diffusion Models

    Authors: Haiwei Xue, Sicheng Yang, Zhensong Zhang, Zhiyong Wu, Minglei Li, Zonghong Dai, Helen Meng

    Abstract: Audio-driven co-speech human gesture generation has made remarkable advancements recently. However, most previous works only focus on single person audio-driven gesture generation. We aim at solving the problem of conversational co-speech gesture generation that considers multiple participants in a conversation, which is a novel and challenging task due to the difficulty of simultaneously incorpor… ▽ More

    Submitted 10 January, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures, Accepted for publication at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  28. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  29. arXiv:2312.03431  [pdf, other]

    cs.CV

    Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle

    Authors: Youtian Lin, Zuozhuo Dai, Siyu Zhu, Yao Yao

    Abstract: We introduce Gaussian-Flow, a novel point-based approach for fast dynamic scene reconstruction and real-time rendering from both multi-view and monocular videos. In contrast to the prevalent NeRF-based approaches hampered by slow training and rendering speeds, our approach harnesses recent advancements in point-based 3D Gaussian Splatting (3DGS). Specifically, a novel Dual-Domain Deformation Model… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  30. arXiv:2312.01135  [pdf, other]

    cs.RO

    Vision-based FDM Printing for Fabricating Airtight Soft Actuators

    Authors: Yijia Wu, Zilin Dai, Haotian Liu, Lehong Wang, Markus P. Nemitz

    Abstract: Pneumatic soft robots are typically fabricated by molding, a manual fabrication process that requires skilled labor. Additive manufacturing has the potential to break this limitation and speed up the fabrication process but struggles with consistently producing high-quality prints. We propose a low-cost approach to improve the print quality of desktop fused deposition modeling by adding a webcam t… ▽ More

    Submitted 10 April, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: 6 pages, 7 figures, Accepted at IEEE RoboSoft conference 2024, San Diego CA

  31. arXiv:2311.12886  [pdf, other]

    cs.CV

    AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance

    Authors: Zuozhuo Dai, Zhenghao Zhang, Yao Yao, Bingxue Qiu, Siyu Zhu, Long Qin, Weizhi Wang

    Abstract: Image animation is a key task in computer vision which aims to generate dynamic visual content from a static image. Recent image animation methods employ neural-based rendering techniques to generate realistic animations. Despite these advancements, achieving fine-grained and controllable image animation guided by text remains challenging, particularly for open-domain images captured in diverse real… ▽ More

    Submitted 4 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

  32. arXiv:2311.02715  [pdf, other]

    cs.LG stat.ML

    Exploiting Correlated Auxiliary Feedback in Parameterized Bandits

    Authors: Arun Verma, Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low

    Abstract: We study a novel variant of the parameterized bandits problem in which the learner can observe additional auxiliary feedback that is correlated with the observed reward. The auxiliary feedback is readily available in many real-life applications, e.g., an online platform that wants to recommend the best-rated services to its users can observe the user's rating of service (rewards) and collect addit… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  33. arXiv:2311.01195  [pdf, other]

    cs.LG cs.AI

    Batch Bayesian Optimization for Replicable Experimental Design

    Authors: Zhongxiang Dai, Quoc Phong Nguyen, Sebastian Shenghong Tay, Daisuke Urano, Richalynn Leong, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Many real-world experimental design problems (a) evaluate multiple experimental conditions in parallel and (b) replicate each condition multiple times due to large and heteroscedastic observation noise. Given a fixed total budget, this naturally induces a trade-off between evaluating more unique conditions while replicating each of them fewer times vs. evaluating fewer unique conditions and replic… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  34. arXiv:2310.19861  [pdf, ps, other]

    cs.LG cs.GT stat.ML

    Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

    Authors: Shuang Qiu, Ziyu Dai, Han Zhong, Zhaoran Wang, Zhuoran Yang, Tong Zhang

    Abstract: This paper investigates posterior sampling algorithms for competitive reinforcement learning (RL) in the context of general function approximations. Focusing on zero-sum Markov games (MGs) under two critical settings, namely self-play and adversarial learning, we first propose the self-play and adversarial generalized eluder coefficient (GEC) as complexity measures for function approximation, capt… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  35. arXiv:2310.14408  [pdf, other]

    cs.IR

    PaRaDe: Passage Ranking using Demonstrations with Large Language Models

    Authors: Andrew Drozdov, Honglei Zhuang, Zhuyun Dai, Zhen Qin, Razieh Rahimi, Xuanhui Wang, Dana Alon, Mohit Iyyer, Andrew McCallum, Donald Metzler, Kai Hui

    Abstract: Recent studies show that large language models (LLMs) can be instructed to effectively perform zero-shot passage re-ranking, in which the results of a first stage retrieval method, such as BM25, are rated and reordered to improve relevance. In this work, we improve LLM-based re-ranking by algorithmically selecting few-shot demonstrations to include in the prompt. Our analysis investigates the cond… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  36. Automatic Music Playlist Generation via Simulation-based Reinforcement Learning

    Authors: Federico Tomasi, Joseph Cauteruccio, Surya Kanoria, Kamil Ciosek, Matteo Rinaldi, Zhenwen Dai

    Abstract: Personalization of playlists is a common feature in music streaming services, but conventional techniques, such as collaborative filtering, rely on explicit assumptions regarding content quality to learn how to make recommendations. Such assumptions often result in misalignment between offline model objectives and online user satisfaction metrics. In this paper, we present a reinforcement learning… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 10 pages. KDD 23

  37. arXiv:2310.05373  [pdf, other]

    cs.LG cs.AI

    Quantum Bayesian Optimization

    Authors: Zhongxiang Dai, Gregory Kang Ruey Lau, Arun Verma, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Kernelized bandits, also known as Bayesian optimization (BO), has been a prevalent method for optimizing complicated black-box reward functions. Various BO algorithms have been theoretically shown to enjoy upper bounds on their cumulative regret which are sub-linear in the number $T$ of iterations, and a regret lower bound of $\Omega(\sqrt{T})$ has been derived which represents the unavoidable regrets f… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  38. arXiv:2310.04440  [pdf, other]

    eess.SY cs.AI

    Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction

    Authors: Linyu Liu, Zhen Dai, Shiji Song, Xiaocheng Li, Guanting Chen

    Abstract: Electrifying heavy-duty trucks offers a substantial opportunity to curtail carbon emissions, advancing toward a carbon-neutral future. However, the inherent challenges of limited battery energy and the sheer weight of heavy-duty trucks lead to reduced mileage and prolonged charging durations. Consequently, battery-swapping services emerge as an attractive solution for these trucks. This paper empl… ▽ More

    Submitted 23 May, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: 9 pages, 6 figures

    MSC Class: 90B06; 68T07

  39. arXiv:2310.02905  [pdf, other]

    cs.LG cs.AI cs.CL

    Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers

    Authors: Xiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) have shown remarkable instruction-following capabilities and achieved impressive performances in various applications. However, the performances of LLMs depend heavily on the instructions given to them, which are typically manually tuned with substantial human efforts. Recent work has used the query-efficient Bayesian optimization (BO) algorithm to automatically optimi… ▽ More

    Submitted 23 June, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: Accepted to ICML 2024

  40. arXiv:2310.00646  [pdf, other]

    cs.LG cs.AI stat.ML

    WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data

    Authors: Jingtan Wang, Xinyang Lu, Zitong Zhao, Zhongxiang Dai, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The impressive performances of large language models (LLMs) and their immense potential for commercialization have given rise to serious concerns over the intellectual property (IP) of their training data. In particular, the synthetic texts generated by LLMs may infringe the IP of the data being used to train the LLMs. To this end, it is imperative to be able to (a) identify the data provider who… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  41. arXiv:2309.09182  [pdf, other]

    cs.RO

    Optimal Scene Graph Planning with Large Language Model Guidance

    Authors: Zhirui Dai, Arash Asgharivaskasi, Thai Duong, Shusen Lin, Maria-Elizabeth Tzes, George Pappas, Nikolay Atanasov

    Abstract: Recent advances in metric, semantic, and topological mapping have equipped autonomous robots with semantic concept grounding capabilities to interpret natural language tasks. This work aims to leverage these new capabilities with an efficient task planning algorithm for hierarchical metric-semantic models. We consider a scene graph representation of the environment and utilize a large language mod… ▽ More

    Submitted 10 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  42. arXiv:2309.07051  [pdf, other

    cs.HC cs.AI cs.MM

    UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons

    Authors: Sicheng Yang, Zilin Wang, Zhiyong Wu, Minglei Li, Zhensong Zhang, Qiaochu Huang, Lei Hao, Songcen Xu, Xiaofei Wu, changpeng yang, Zonghong Dai

    Abstract: The automatic co-speech gesture generation draws much attention in computer animation. Previous works designed network structures on individual datasets, which resulted in a lack of data volume and generalizability across different motion capture standards. In addition, it is a challenging task due to the weak correlation between speech and gestures. To address these problems, we present UnifiedGe…

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 16 pages, 11 figures, ACM MM 2023

  43. arXiv:2309.03214  [pdf, other

    cs.AR cs.SE

    Test Primitive: A Straightforward Method To Decouple March

    Authors: Yindong Xiao, Shanshan Lu, Ensheng Wang, Ruiqi Zhu, Zhijian Dai

    Abstract: The academic community has made outstanding achievements in researching the March algorithm. However, the current fault modeling method, which centers on fault primitives, cannot be directly applied to analyzing the March algorithm. This paper proposes a new test primitive. The test primitives, which decouple the cell states from sensitization and detection operations, describe the common features…

    Submitted 29 August, 2023; originally announced September 2023.

  44. arXiv:2308.14919  [pdf, other

    cs.LG cs.AI

    On Reward Structures of Markov Decision Processes

    Authors: Falcon Z. Dai

    Abstract: A Markov decision process can be parameterized by a transition kernel and a reward function. Both play essential roles in the study of reinforcement learning as evidenced by their presence in the Bellman equations. In our inquiry of various kinds of "costs" associated with reinforcement learning inspired by the demands in robotic applications, rewards are central to understanding the structure of…

    Submitted 31 August, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: This PhD thesis draws heavily from arXiv:1907.02114 and arXiv:2002.06299; minor edits

  45. arXiv:2308.13879  [pdf, other

    cs.HC cs.AI cs.MM

    The DiffuseStyleGesture+ entry to the GENEA Challenge 2023

    Authors: Sicheng Yang, Haiwei Xue, Zhensong Zhang, Minglei Li, Zhiyong Wu, Xiaofei Wu, Songcen Xu, Zonghong Dai

    Abstract: In this paper, we introduce the DiffuseStyleGesture+, our solution for the Generation and Evaluation of Non-verbal Behavior for Embodied Agents (GENEA) Challenge 2023, which aims to foster the development of realistic, automated systems for generating conversational gestures. Participants are provided with a pre-processed dataset and their systems are evaluated through crowdsourced scoring. Our pr…

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 7 pages, 8 figures, ICMI 2023

  46. arXiv:2308.04719  [pdf, other

    cs.AI

    JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

    Authors: Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen Mcaleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang

    Abstract: This paper presents an empirical exploration of non-transitivity in perfect-information games, specifically focusing on Xiangqi, a traditional Chinese board game comparable in game-tree complexity to chess and shogi. By analyzing over 10,000 records of human Xiangqi play, we highlight the existence of both transitive and non-transitive elements within the game's strategic structure. To address non…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 28 pages, accepted by Transactions on Machine Learning Research (TMLR)

  47. arXiv:2308.04077  [pdf, other

    cs.LG cs.AI

    Federated Zeroth-Order Optimization using Trajectory-Informed Surrogate Gradients

    Authors: Yao Shu, Xiaoqiang Lin, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Federated optimization, an emerging paradigm which finds wide real-world applications such as federated learning, enables multiple clients (e.g., edge devices) to collaboratively optimize a global function. The clients do not share their local datasets and typically only share their local gradients. However, the gradient information is not available in many applications of federated optimization,…

    Submitted 8 August, 2023; originally announced August 2023.

  48. arXiv:2307.16115  [pdf, other

    cs.DB

    IWEK: An Interpretable What-If Estimator for Database Knobs

    Authors: Yu Yan, Hongzhi Wang, Jian Geng, Jian Ma, Geng Li, Zixuan Wang, Zhiyu Dai, Tianqing Wang

    Abstract: The knobs of modern database management systems have significant impact on the performance of the systems. With the development of cloud databases, an estimation service for knobs is urgently needed to improve the performance of database. Unfortunately, few attentions have been paid to estimate the performance of certain knob configurations. To fill this gap, we propose IWEK, an interpretable & tr…

    Submitted 29 July, 2023; originally announced July 2023.

  49. arXiv:2307.09972  [pdf, other

    cs.CV

    Fine-grained Text-Video Retrieval with Frozen Image Encoders

    Authors: Zuozhuo Dai, Fangtao Shao, Qingkun Su, Zilong Dong, Siyu Zhu

    Abstract: State-of-the-art text-video retrieval (TVR) methods typically utilize CLIP and cosine similarity for efficient retrieval. Meanwhile, cross attention methods, which employ a transformer decoder to compute attention between each text query and all frames in a video, offer a more comprehensive interaction between text and videos. However, these methods lack important fine-grained spatial information…

    Submitted 13 July, 2023; originally announced July 2023.

  50. arXiv:2307.04028  [pdf, other

    cs.CV cs.AI cs.LG

    Measuring the Success of Diffusion Models at Imitating Human Artists

    Authors: Stephen Casper, Zifan Guo, Shreya Mogulothu, Zachary Marinov, Chinmay Deshpande, Rui-Jie Yew, Zheng Dai, Dylan Hadfield-Menell

    Abstract: Modern diffusion models have set the state-of-the-art in AI image generation. Their success is due, in part, to training on Internet-scale data which often includes copyrighted work. This prompts questions about the extent to which these models learn from, imitate, or copy the work of human artists. This work suggests that tying copyright liability to the capabilities of the model may be useful gi…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted to the 1st Workshop on Generative AI and Law