Showing 1–50 of 126 results for author: An, B

  1. arXiv:2406.15575  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Sketch-GNN: Scalable Graph Neural Networks with Sublinear Training Complexity

    Authors: Mucong Ding, Tahseen Rabbani, Bang An, Evan Z Wang, Furong Huang

    Abstract: Graph Neural Networks (GNNs) are widely applied to graph learning problems such as node classification. When scaling up the underlying graphs of GNNs to a larger size, we are forced to either train on the complete graph and keep the full graph adjacency and node embeddings in memory (which is often infeasible) or mini-batch sample the graph (which results in exponentially growing computational com…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2022

  2. arXiv:2406.14537  [pdf, other]

    cs.LG q-fin.TR

    MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

    Authors: Chuqiao Zong, Chaojie Wang, Molei Qin, Lei Feng, Xinrun Wang, Bo An

    Abstract: High-frequency trading (HFT) that executes algorithmic trading in short time scales, has recently occupied the majority of cryptocurrency market. Besides traditional quantitative trading methods, reinforcement learning (RL) has become another appealing approach for HFT due to its terrific ability of handling high-dimensional financial data and solving sophisticated sequential decision-making probl…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted to KDD 2024

  3. arXiv:2406.01124  [pdf, other]

    cs.LG cs.CL

    Latent Logic Tree Extraction for Event Sequence Explanation from LLMs

    Authors: Zitao Song, Chao Yang, Chaojie Wang, Bo An, Shuang Li

    Abstract: Modern high-stakes systems, such as healthcare or robotics, often generate vast streaming event sequences. Our goal is to design an efficient, plug-and-play tool to elicit logic tree-based explanations from Large Language Models (LLMs) to provide customized insights into each observed event sequence. Built on the temporal point process model for events, our method employs the likelihood function a…

    Submitted 28 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.18979  [pdf, other]

    cs.LG stat.ML

    MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts

    Authors: Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An

    Abstract: Leveraging the models' outputs, specifically the logits, is a common approach to estimating the test accuracy of a pre-trained neural network on out-of-distribution (OOD) samples without requiring access to the corresponding ground truth labels. Despite their ease of implementation and computational efficiency, current logit-based methods are vulnerable to overconfidence issues, leading to predict…

    Submitted 24 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: The three first authors contributed equally

  5. arXiv:2405.17879  [pdf, other]

    cs.LG cs.AI

    Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree

    Authors: Lang Feng, Pengjie Gu, Bo An, Gang Pan

    Abstract: Diffusion planners have shown promise in handling long-horizon and sparse-reward tasks due to the non-autoregressive plan generation. However, their inherent stochastic risk of generating infeasible trajectories presents significant challenges to their reliability and stability. We introduce a novel approach, the Trajectory Aggregation Tree (TAT), to address this issue in diffusion planners. Compa…

    Submitted 7 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICML 2024 (Spotlight)

  6. arXiv:2405.15413  [pdf, other]

    eess.IV cs.CV cs.IT

    MambaVC: Learned Visual Compression with Selective State Spaces

    Authors: Shiyu Qin, Jinpeng Wang, Yimin Zhou, Bin Chen, Tianci Luo, Baoyi An, Tao Dai, Shutao Xia, Yaowei Wang

    Abstract: Learned visual compression is an important and active task in multimedia. Existing approaches have explored various CNN- and Transformer-based designs to model content distribution and eliminate redundancy, where balancing efficacy (i.e., rate-distortion trade-off) and efficiency remains a challenge. Recently, state-space models (SSMs) have shown promise due to their long-range modeling capacity a…

    Submitted 28 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 17 pages, 15 figures

  7. arXiv:2405.11746  [pdf, other]

    cs.AI cs.GT cs.LG cs.MA

    Configurable Mirror Descent: Towards a Unification of Decision Making

    Authors: Pengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Shuyue Hu, Xiao Huang, Hau Chan, Bo An

    Abstract: Decision-making problems, categorized as single-agent, e.g., Atari, cooperative multi-agent, e.g., Hanabi, competitive multi-agent, e.g., Hold'em poker, and mixed cooperative and competitive, e.g., football, are ubiquitous in the real world. Various methods are proposed to address the specific decision-making problems. Despite the successes in specific categories, these methods typically evolve in…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted to The Forty-first International Conference on Machine Learning (ICML 2024)

  8. arXiv:2405.08638  [pdf, other]

    cs.LG

    vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement

    Authors: Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan

    Abstract: Reinforcement Learning (RL) is a widely employed technique in decision-making problems, encompassing two fundamental operations -- policy evaluation and policy improvement. Enhancing learning efficiency remains a key challenge in RL, with many efforts focused on using ensemble critics to boost policy evaluation efficiency. However, when using multiple critics, the actor in the policy improvement p…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024, with appendix

  9. arXiv:2405.03518  [pdf, other]

    cs.GT

    Reinforcement Nash Equilibrium Solver

    Authors: Xinrun Wang, Chang Yang, Shuxin Li, Pengdeng Li, Xiao Huang, Hau Chan, Bo An

    Abstract: Nash Equilibrium (NE) is the canonical solution concept of game theory, which provides an elegant tool to understand the rationalities. Though mixed strategy NE exists in any game with finite players and actions, computing NE in two- or multi-player general-sum games is PPAD-Complete. Various alternative solutions, e.g., Correlated Equilibrium (CE), and learning methods, e.g., fictitious play (FP)…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024

  10. arXiv:2405.00987  [pdf, other]

    cs.LG

    S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic

    Authors: Safa Messaoud, Billel Mokeddem, Zhenghai Xue, Linsey Pang, Bo An, Haipeng Chen, Sanjay Chawla

    Abstract: Learning expressive stochastic policies instead of deterministic ones has been proposed to achieve better stability, sample complexity, and robustness. Notably, in Maximum Entropy Reinforcement Learning (MaxEnt RL), the policy is modeled as an expressive Energy-Based Model (EBM) over the Q-values. However, this formulation requires the estimation of the entropy of such EBMs, which is an open probl…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted for publication at ICLR 2024

  11. arXiv:2404.12626  [pdf, other]

    cs.AI cs.GT cs.MA

    Grasper: A Generalist Pursuer for Pursuit-Evasion Problems

    Authors: Pengdeng Li, Shuxin Li, Xinrun Wang, Jakub Cerny, Youzhi Zhang, Stephen McAleer, Hau Chan, Bo An

    Abstract: Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in PSRO to improve scalability in solving large-scale PEGs. However, these methods primarily focus on specific PEGs with fixed initial conditions that…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: To appear in the 23rd International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS 2024)

  12. arXiv:2404.11144  [pdf, other]

    cs.AI cs.GT cs.MA

    Self-adaptive PSRO: Towards an Automatic Population-based Game Solver

    Authors: Pengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Xiao Huang, Hau Chan, Bo An

    Abstract: Policy-Space Response Oracles (PSRO) as a general algorithmic framework has achieved state-of-the-art performance in learning equilibrium policies of two-player zero-sum games. However, the hand-crafted hyperparameter value selection in most of the existing works requires extensive domain knowledge, forming the main barrier to applying PSRO to different games. In this work, we make the first attem…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted to 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

  13. arXiv:2403.17918  [pdf, other]

    cs.AI

    AgentStudio: A Toolkit for Building General Virtual Agents

    Authors: Longtao Zheng, Zhiyuan Huang, Zhenghai Xue, Xinrun Wang, Bo An, Shuicheng Yan

    Abstract: Creating autonomous virtual agents capable of using arbitrary software on any digital device remains a major challenge for artificial intelligence. Two key obstacles hinder progress: insufficient infrastructure for building virtual agents in real-world environments, and the need for in-the-wild evaluation of fundamental agent abilities. To address this, we introduce AgentStudio, an online, realist…

    Submitted 26 March, 2024; originally announced March 2024.

  14. arXiv:2403.11270  [pdf, other]

    cs.CV

    Bilateral Propagation Network for Depth Completion

    Authors: Jie Tang, Fei-Peng Tian, Boshi An, Jian Li, Ping Tan

    Abstract: Depth completion aims to derive a dense depth map from sparse depth measurements with a synchronized color image. Current state-of-the-art (SOTA) methods are predominantly propagation-based, which work as an iterative refinement on the initial estimated dense depth. However, the initial depth estimations mostly result from direct applications of convolutional layers on the sparse depth map. In thi…

    Submitted 1 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  15. arXiv:2403.05861  [pdf, ps, other]

    cs.DC

    DeepVM: Integrating Spot and On-Demand VMs for Cost-Efficient Deep Learning Clusters in the Cloud

    Authors: Yoochan Kim, Kihyun Kim, Yonghyeon Cho, Jinwoo Kim, Awais Khan, Ki-Dong Kang, Baik-Song An, Myung-Hoon Cha, Hong-Yeon Kim, Youngjae Kim

    Abstract: Distributed Deep Learning (DDL), as a paradigm, dictates the use of GPU-based clusters as the optimal infrastructure for training large-scale Deep Neural Networks (DNNs). However, the high cost of such resources makes them inaccessible to many users. Public cloud services, particularly Spot Virtual Machines (VMs), offer a cost-effective alternative, but their unpredictable availability poses a sig…

    Submitted 14 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

  16. arXiv:2403.03186  [pdf, other]

    cs.AI

    Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

    Authors: Weihao Tan, Ziluo Ding, Wentao Zhang, Boyu Li, Bohan Zhou, Junpeng Yue, Haochong Xia, Jiechuan Jiang, Longtao Zheng, Xinrun Xu, Yifei Bi, Pengjie Gu, Xinrun Wang, Börje F. Karlsson, Bo An, Zongqing Lu

    Abstract: Despite the success in specific tasks and scenarios, existing foundation agents, empowered by large models (LMs) and advanced tools, still cannot generalize to different scenarios, mainly due to dramatic differences in the observations and actions across scenarios. In this work, we propose the General Computer Control (GCC) setting: building foundation agents that can master any computer task by t…

    Submitted 7 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  17. arXiv:2403.00255  [pdf, other]

    cs.GT cs.MA

    Leveraging Team Correlation for Approximating Equilibrium in Two-Team Zero-Sum Games

    Authors: Naming Liu, Mingzhi Wang, Youzhi Zhang, Yaodong Yang, Bo An, Ying Wen

    Abstract: Two-team zero-sum games are one of the most important paradigms in game theory. In this paper, we focus on finding an unexploitable equilibrium in large team games. An unexploitable equilibrium is a worst-case policy, where members in the opponent team cannot increase their team reward by taking any policy, e.g., cooperatively changing to other joint policies. As an optimal unexploitable equilibri…

    Submitted 29 February, 2024; originally announced March 2024.

  18. arXiv:2402.18485  [pdf, other]

    q-fin.TR cs.AI

    A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist

    Authors: Wentao Zhang, Lingxuan Zhao, Haochong Xia, Shuo Sun, Jiaze Sun, Molei Qin, Xinyi Li, Yuqing Zhao, Yilei Zhao, Xinyu Cai, Longtao Zheng, Xinrun Wang, Bo An

    Abstract: Financial trading is a crucial component of the markets, informed by a multimodal information landscape encompassing news, prices, and Kline charts, and encompasses diverse tasks such as quantitative trading and high-frequency trading with various assets. While advanced AI techniques like deep learning and reinforcement learning are extensively utilized in finance, their application in financial t…

    Submitted 28 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  19. arXiv:2402.16855  [pdf, other]

    cs.CV

    MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network

    Authors: Yujun Huang, Bin Chen, Naiqi Li, Baoyi An, Shu-Tao Xia, Yaowei Wang

    Abstract: Conventional compressed sensing (CS) algorithms typically apply a uniform sampling rate to different image blocks. A more strategic approach could be to allocate the number of measurements adaptively, based on each image block's complexity. In this paper, we propose a Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network (MB-RACS) framework, which aims to adaptively determine the…

    Submitted 18 January, 2024; originally announced February 2024.

  20. arXiv:2402.13379  [pdf, other]

    cs.LG cs.CY

    Referee-Meta-Learning for Fast Adaptation of Locational Fairness

    Authors: Weiye Chen, Yiqun Xie, Xiaowei Jia, Erhu He, Han Bao, Bang An, Xun Zhou

    Abstract: When dealing with data from distinct locations, machine learning algorithms tend to demonstrate an implicit preference of some locations over the others, which constitutes biases that sabotage the spatial fairness of the algorithm. This unfairness can easily introduce biases in subsequent decision-making given broad adoptions of learning-based solutions in practice. However, locational biases in A…

    Submitted 20 February, 2024; originally announced February 2024.

  21. arXiv:2401.14151  [pdf, other]

    cs.LG cs.AI cs.CL

    True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning

    Authors: Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An

    Abstract: Despite the impressive performance across numerous tasks, large language models (LLMs) often fail in solving simple decision-making tasks due to the misalignment of the knowledge in LLMs with environments. On the contrary, reinforcement learning (RL) agents learn policies from scratch, which makes them always align with environments but difficult to incorporate prior knowledge for efficient explor…

    Submitted 10 March, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR 2024

  22. arXiv:2401.13360  [pdf, other]

    cs.LG

    Debiased Sample Selection for Combating Noisy Labels

    Authors: Qi Wei, Lei Feng, Haobo Wang, Bo An

    Abstract: Learning with noisy labels aims to ensure model generalization given a label-corrupted training set. The sample selection strategy achieves promising performance by selecting a label-reliable subset for model training. In this paper, we empirically reveal that existing sample selection methods suffer from both data and training bias that are represented as imbalanced selected sets and accumulation…

    Submitted 24 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  23. arXiv:2401.12587  [pdf, other]

    eess.IV cs.CV

    An Efficient Implicit Neural Representation Image Codec Based on Mixed Autoregressive Model for Low-Complexity Decoding

    Authors: Xiang Liu, Jiahong Chen, Bin Chen, Zimo Liu, Baoyi An, Shu-Tao Xia, Zhi Wang

    Abstract: Displaying high-quality images on edge devices, such as augmented reality devices, is essential for enhancing the user experience. However, these devices often face power consumption and computing resource limitations, making it challenging to apply many deep learning-based image compression algorithms in this field. Implicit Neural Representation (INR) for image compression is an emerging technol…

    Submitted 7 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  24. arXiv:2401.08909  [pdf, other]

    cs.LG

    Leveraging Gradients for Unsupervised Accuracy Estimation under Distribution Shift

    Authors: Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Ievgen Redko, Jianfeng Zhang, Bo An

    Abstract: Estimating test accuracy without access to the ground-truth test labels under varying test environments is a challenging, yet extremely important problem in the safe deployment of machine learning algorithms. Existing works rely on the information from either the outputs or the extracted features of neural networks to formulate an estimation score correlating with the ground-truth test accuracy. I…

    Submitted 1 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  25. arXiv:2401.08573  [pdf, other]

    cs.CV cs.CR cs.LG

    WAVES: Benchmarking the Robustness of Image Watermarks

    Authors: Bang An, Mucong Ding, Tahseen Rabbani, Aakriti Agrawal, Yuancheng Xu, Chenghao Deng, Sicheng Zhu, Abdirisak Mohamed, Yuxin Wen, Tom Goldstein, Furong Huang

    Abstract: In the burgeoning age of generative AI, watermarks act as identifiers of provenance and artificial content. We present WAVES (Watermark Analysis Via Enhanced Stress-testing), a benchmark for assessing image watermark robustness, overcoming the limitations of current evaluation methods. WAVES integrates detection and identification tasks and establishes a standardized evaluation protocol comprised…

    Submitted 6 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by ICML 2024

  26. arXiv:2401.02081  [pdf, ps, other]

    cs.IT eess.SP

    Performance Trade-off and Joint Waveform Design for MIMO-OFDM DFRC Systems

    Authors: Tianchen Liu, Liang Wu, Bo An, Zaichen Zhang, Jian Dang, Jiangzhou Wang

    Abstract: Dual-functional radar-communication (DFRC) has attracted considerable attention. This paper considers the frequency-selective multipath fading environment and proposes DFRC waveform design strategies based on multiple-input and multiple-output (MIMO) and orthogonal frequency division multiplexing (OFDM) techniques. In the proposed waveform design strategies, the Cramer-Rao bound (CRB) of the radar…

    Submitted 4 January, 2024; originally announced January 2024.

  27. arXiv:2401.00426  [pdf, other]

    cs.CL cs.AI

    keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM

    Authors: Chaojie Wang, Yishi Xu, Zhong Peng, Chenxi Zhang, Bo Chen, Xinrun Wang, Lei Feng, Bo An

    Abstract: Large language models (LLMs) have exhibited remarkable performance on various natural language processing (NLP) tasks, especially for question answering. However, in the face of problems beyond the scope of knowledge, these LLMs tend to talk nonsense with a straight face, where the potential solution could be incorporating an Information Retrieval (IR) module and generating response based on these…

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: 12 pages, 6 figures

  28. arXiv:2312.06443  [pdf, ps, other]

    cs.GT cs.MA

    Exploring Leximin Principle for Fair Core-Selecting Combinatorial Auctions: Payment Rule Design and Implementation

    Authors: Hao Cheng, Shufeng Kong, Yanchen Deng, Caihua Liu, Xiaohu Wu, Bo An, Chongjun Wang

    Abstract: Core-selecting combinatorial auctions (CAs) restrict the auction result in the core such that no coalitions could improve their utilities by engaging in collusion. The minimum-revenue-core (MRC) rule is a widely used core-selecting payment rule to maximize the total utilities of all bidders. However, the MRC rule can suffer from severe unfairness since it ignores individuals' utilities. To address…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 8 pages, IJCAI 2023

  29. arXiv:2311.13847  [pdf, other]

    cs.CV cs.IT eess.IV

    Perceptual Image Compression with Cooperative Cross-Modal Side Information

    Authors: Shiyu Qin, Bin Chen, Yujun Huang, Baoyi An, Tao Dai, Shu-Tao Xia

    Abstract: The explosion of data has resulted in more and more associated text being transmitted along with images. Inspired by from distributed source coding, many works utilize image side information to enhance image compression. However, existing methods generally do not consider using text as side information to enhance perceptual compression of images, even though the benefits of multimodal synergy have…

    Submitted 28 November, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

  30. arXiv:2311.13846  [pdf, other]

    cs.CV cs.IT

    Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression

    Authors: Shiyu Qin, Yimin Zhou, Jinpeng Wang, Bin Chen, Baoyi An, Tao Dai, Shu-Tao Xia

    Abstract: In this paper, we propose a progressive learning paradigm for transformer-based variable-rate image compression. Our approach covers a wide range of compression rates with the assistance of the Layer-adaptive Prompt Module (LPM). Inspired by visual prompt tuning, we use LPM to extract prompts for input images and hidden features at the encoder side and decoder side, respectively, which are fed as…

    Submitted 28 November, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

  31. arXiv:2311.10801  [pdf, other]

    q-fin.PM cs.AI cs.CE cs.LG

    Reinforcement Learning with Maskable Stock Representation for Portfolio Management in Customizable Stock Pools

    Authors: Wentao Zhang, Yilei Zhao, Shuo Sun, Jie Ying, Yonggang Xie, Zitao Song, Xinrun Wang, Bo An

    Abstract: Portfolio management (PM) is a fundamental financial trading task, which explores the optimal periodical reallocation of capitals into different stocks to pursue long-term profits. Reinforcement learning (RL) has recently shown its potential to train profitable agents for PM through interacting with financial markets. However, existing work mostly focuses on fixed stock pools, which is inconsisten…

    Submitted 27 February, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  32. arXiv:2311.08648  [pdf, other]

    cs.CL cs.AI

    Explore Spurious Correlations at the Concept Level in Language Models for Text Classification

    Authors: Yuhang Zhou, Paiheng Xu, Xiaoyu Liu, Bang An, Wei Ai, Furong Huang

    Abstract: Language models (LMs) have achieved notable success in numerous NLP tasks, employing both fine-tuning and in-context learning (ICL) methods. While language models demonstrate exceptional performance, they face robustness challenges due to spurious correlations arising from imbalanced label distributions in training data or ICL exemplars. Previous research has primarily concentrated on word, phrase…

    Submitted 15 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 14 pages, 4 page appendix, Accepted by ACL 2024 Main

  33. arXiv:2311.04550  [pdf, other]

    cs.LG stat.ML

    Regression with Cost-based Rejection

    Authors: Xin Cheng, Yuzhou Cao, Haobo Wang, Hongxin Wei, Bo An, Lei Feng

    Abstract: Learning with rejection is an important framework that can refrain from making predictions to avoid critical mispredictions by balancing between prediction and rejection. Previous studies on cost-based rejection only focused on the classification setting, which cannot handle the continuous and infinite target space in the regression setting. In this paper, we investigate a novel regression problem…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  34. arXiv:2311.01106  [pdf, other]

    cs.LG

    In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer

    Authors: Yuzhou Cao, Hussein Mozannar, Lei Feng, Hongxin Wei, Bo An

    Abstract: Enabling machine learning classifiers to defer their decision to a downstream expert when the expert is more accurate will ensure improved safety and performance. This objective can be achieved with the learning-to-defer framework which aims to jointly learn how to classify and how to defer to the expert. In recent studies, it has been theoretically shown that popular estimators for learning to de…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  35. arXiv:2310.17325  [pdf, other]

    cs.LG cs.AI cs.CV

    C-Disentanglement: Discovering Causally-Independent Generative Factors under an Inductive Bias of Confounder

    Authors: Xiaoyu Liu, Jiaxin Yuan, Bang An, Yuancheng Xu, Yifan Yang, Furong Huang

    Abstract: Representation learning assumes that real-world data is generated by a few semantically meaningful generative factors (i.e., sources of variation) and aims to discover them in the latent space. These factors are expected to be causally disentangled, meaning that distinct factors are encoded into separate latent variables, and changes in one factor will not affect the values of the others. Compared…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  36. arXiv:2310.15140  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models

    Authors: Sicheng Zhu, Ruiyi Zhang, Bang An, Gang Wu, Joe Barrow, Zichao Wang, Furong Huang, Ani Nenkova, Tong Sun

    Abstract: Safety alignment of Large Language Models (LLMs) can be compromised with manual jailbreak attacks and (automatic) adversarial attacks. Recent studies suggest that defending against these attacks is possible: adversarial attacks generate unlimited but unreadable gibberish prompts, detectable by perplexity-based filters; manual jailbreak attacks craft readable prompts, but their limited number due t…

    Submitted 14 December, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Version 2 updates: Added comparison of three more evaluation methods and their reliability check using human labeling. Added results for jailbreaking Llama2 (individual behavior) and included complexity and hyperparameter analysis. Revised objectives for prompt leaking. Other minor changes made

  37. arXiv:2310.09069  [pdf, other]

    cs.RO cs.AI

    ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection

    Authors: Xiaoqi Li, Yanzi Wang, Yan Shen, Ponomarenko Iaroslav, Haoran Lu, Qianxu Wang, Boshi An, Jiaming Liu, Hao Dong

    Abstract: In the realm of future home-assistant robots, 3D articulated object manipulation is essential for enabling robots to interact with their environment. Many existing studies make use of 3D point clouds as the primary input for manipulation policies. However, this approach encounters challenges due to data sparsity and the significant cost associated with acquiring point cloud data, which can limit i…

    Submitted 13 October, 2023; originally announced October 2023.

  38. arXiv:2310.03984  [pdf, other]

    cs.IR cs.LG

    AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement

    Authors: Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An

    Abstract: Growing attention has been paid to Reinforcement Learning (RL) algorithms when optimizing long-term user engagement in sequential recommendation tasks. One challenge in large-scale online recommendation systems is the constant and complicated changes in users' behavior patterns, such as interaction rates and retention tendencies. When formulated as a Markov Decision Process (MDP), the dynamics and…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: Preprint. Under Review

  39. arXiv:2310.03478  [pdf, other]

    cs.RO

    RGBManip: Monocular Image-based Robotic Manipulation through Active Object Pose Estimation

    Authors: Boshi An, Yiran Geng, Kai Chen, Xiaoqi Li, Qi Dou, Hao Dong

    Abstract: Robotic manipulation requires accurate perception of the environment, which poses a significant challenge due to its inherent complexity and constantly changing nature. In this context, RGB image and point-cloud observations are two commonly used modalities in visual-based robotic manipulation, but each of these modalities have their own limitations. Commercial point-cloud observations often suffe…

    Submitted 5 October, 2023; originally announced October 2023.

  40. arXiv:2310.03188  [pdf, other]

    cs.AI

    Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication

    Authors: Zhe Zhao, Qingyun Liu, Huan Gui, Bang An, Lichan Hong, Ed H. Chi

    Abstract: Many recent breakthroughs in machine learning have been enabled by the pre-trained foundation models. By scaling up model parameters, training data, and computation resources, foundation models have significantly advanced the state-of-the-art in many applications. However, it is still an open question of how to use these models to perform downstream tasks efficiently. Knowledge distillation (KD) h…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 19 pages, 3 figures

  41. arXiv:2310.00270  [pdf, other]

    cs.LG

    SpatialRank: Urban Event Ranking with NDCG Optimization on Spatiotemporal Data

    Authors: Bang An, Xun Zhou, Yongjian Zhong, Tianbao Yang

    Abstract: The problem of urban event ranking aims at predicting the top-k most risky locations of future events such as traffic accidents and crimes. This problem is of fundamental importance to public safety and urban administration especially when limited resources are available. The problem is, however, challenging due to complex and dynamic spatio-temporal correlations between locations, uneven distribu…

    Submitted 26 October, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  42. arXiv:2309.12053  [pdf, other]

    cs.CL

    AceGPT, Localizing Large Language Models in Arabic

    Authors: Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Juncai He, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu

    Abstract: This paper is devoted to the development of a localized Large Language Model (LLM) specifically for Arabic, a language imbued with unique cultural characteristics inadequately addressed by current mainstream models. Significant concerns emerge when addressing cultural sensitivity and local values. To address this, the paper proposes a comprehensive solution that includes further pre-training with…

    Submitted 2 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted to NAACL main conference. https://github.com/FreedomIntelligence/AceGPT

  43. arXiv:2309.07708  [pdf, other]

    cs.LG q-fin.TR

    Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context

    Authors: Haochong Xia, Shuo Sun, Xinrun Wang, Bo An

    Abstract: Financial simulators play an important role in enhancing forecasting accuracy, managing risks, and fostering strategic financial decision-making. Despite the development of financial market simulation methodologies, existing frameworks often struggle with adapting to specialized simulation context. We pinpoint the challenges as i) current financial datasets do not contain context labels; ii) curre…

    Submitted 10 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted to the 38th Annual AAAI Conference on Artificial Intelligence (AAAI24), Vancouver, British Columbia, 2024

  44. arXiv:2308.11256  [pdf, other]

    cs.GT cs.AI cs.LG

    Efficient Last-iterate Convergence Algorithms in Solving Games

    Authors: Linjian Meng, Zhenxing Ge, Wenbin Li, Bo An, Yang Gao

    Abstract: No-regret algorithms are popular for learning Nash equilibrium (NE) in two-player zero-sum normal-form games (NFGs) and extensive-form games (EFGs). Many recent works consider the last-iterate convergence no-regret algorithms. Among them, the two most famous algorithms are Optimistic Gradient Descent Ascent (OGDA) and Optimistic Multiplicative Weight Update (OMWU). However, OGDA has high per-itera…

    Submitted 22 August, 2023; originally announced August 2023.

  45. arXiv:2308.08918  [pdf, other]

    cs.LG cs.AI q-fin.TR

    IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making

    Authors: Hui Niu, Siyuan Li, Jiahao Zheng, Zhouchi Lin, Jian Li, Jian Guo, Bo An

    Abstract: Market making (MM) has attracted significant attention in financial trading owing to its essential function in ensuring market liquidity. With strong capabilities in sequential decision-making, Reinforcement Learning (RL) technology has achieved remarkable success in quantitative trading. Nonetheless, most existing RL-based MM methods focus on optimizing single-price level strategies which fail at…

    Submitted 17 August, 2023; originally announced August 2023.

  46. arXiv:2308.01313  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts

    Authors: Bang An, Sicheng Zhu, Michael-Andrei Panaitescu-Liess, Chaithanya Kumar Mummadi, Furong Huang

    Abstract: Vision-language models like CLIP are widely used in zero-shot image classification due to their ability to understand various visual concepts and natural language descriptions. However, how to fully leverage CLIP's unprecedented human-like understanding capabilities to achieve better performance is still an open question. This paper draws inspiration from the human visual perception process: when…

    Submitted 18 March, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted by ICLR 2024

  47. arXiv:2306.10944  [pdf, other]

    cs.MA

    Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification

    Authors: Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan

    Abstract: Ad hoc teamwork requires an agent to cooperate with unknown teammates without prior coordination. Many works propose to abstract teammate instances into high-level representation of types and then pre-train the best response for each type. However, most of them do not consider the distribution of teammate instances within a type. This could expose the agent to the hidden risk of \emph{type confoun…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2023

  48. arXiv:2306.10458  [pdf, other]

    cs.LG

    Weakly Supervised Regression with Interval Targets

    Authors: Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, Lei Feng

    Abstract: This paper investigates an interesting weakly supervised regression setting called regression with interval targets (RIT). Although some of the previous methods on relevant regression settings can be adapted to RIT, they are not statistically consistent, and thus their empirical performance is not guaranteed. In this paper, we provide a thorough study on RIT. First, we propose a novel statistical…

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2023

  49. arXiv:2306.08968  [pdf, other]

    cs.LG

    Partial-Label Regression

    Authors: Xin Cheng, Deng-Bao Wang, Lei Feng, Min-Ling Zhang, Bo An

    Abstract: Partial-label learning is a popular weakly supervised learning setting that allows each training example to be annotated with a set of candidate labels. Previous studies on partial-label learning only focused on the classification setting where candidate labels are all discrete, which cannot handle continuous labels with real values. In this paper, we provide the first attempt to investigate parti…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted by AAAI 2023

  50. arXiv:2306.07863  [pdf, other]

    cs.AI

    Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control

    Authors: Longtao Zheng, Rundong Wang, Xinrun Wang, Bo An

    Abstract: Building agents with large language models (LLMs) for computer control is a burgeoning research area, where the agent receives computer states and performs actions to complete complex tasks. Previous computer agents have demonstrated the benefits of in-context learning (ICL); however, their performance is hindered by several issues. First, the limited context length of LLMs and complex computer st…

    Submitted 19 January, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: ICLR 2024. Project page: https://ltzheng.github.io/Synapse