Skip to main content

Showing 1–50 of 83 results for author: Pang, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01492  [pdf, other

    cs.CL cs.AI

    RegMix: Data Mixture as Regression for Language Model Pre-training

    Authors: Qian Liu, Xiaosen Zheng, Niklas Muennighoff, Guangtao Zeng, Longxu Dou, Tianyu Pang, **g Jiang, Min Lin

    Abstract: The data mixture for large language model pre-training significantly impacts performance, yet how to determine an effective mixture remains unclear. We propose RegMix to automatically identify a high-performing data mixture by formulating it as a regression task. RegMix involves training a set of small models with diverse data mixtures and fitting a regression model to predict their performance gi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.18844  [pdf, other

    cs.CV

    Revisiting Backdoor Attacks against Large Vision-Language Models

    Authors: Siyuan Liang, Jiawei Liang, Tianyu Pang, Chao Du, Aishan Liu, Ee-Chien Chang, Xiaochun Cao

    Abstract: Instruction tuning enhances large vision-language models (LVLMs) but raises security risks through potential backdoor attacks due to their openness. Previous backdoor studies focus on enclosed scenarios with consistent training and testing instructions, neglecting the practical domain gaps that could affect attack effectiveness. This paper empirically examines the generalizability of backdoor atta… ▽ More

    Submitted 1 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 24 pages, 8 figures

  3. arXiv:2406.09760  [pdf, other

    cs.CL cs.LG

    Bootstrap** Language Models with DPO Implicit Rewards

    Authors: Changyu Chen, Zichen Liu, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha, Pradeep Varakantham, Min Lin

    Abstract: Human alignment in large language models (LLMs) is an active area of research. A recent groundbreaking work, direct preference optimization (DPO), has greatly simplified the process from past work in reinforcement learning from human feedback (RLHF) by bypassing the reward learning stage in RLHF. DPO, after training, provides an implicit reward model. In this work, we make a novel observation that… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.09136  [pdf, other

    cs.CL cs.LG

    Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

    Authors: Xuan Zhang, Chao Du, Tianyu Pang, Qian Liu, Wei Gao, Min Lin

    Abstract: The recent development of chain-of-thought (CoT) decoding has enabled large language models (LLMs) to generate explicit logical reasoning paths for complex problem-solving. However, research indicates that these paths are not always deliberate and optimal. The tree-of-thought (ToT) method employs tree-searching to extensively explore the reasoning space and find better reasoning paths that CoT dec… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.04657  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Crafting Heavy-Tails in Weight Matrix Spectrum without Gradient Noise

    Authors: Vignesh Kothapalli, Tianyu Pang, Shenyang Deng, Zongmin Liu, Yaoqing Yang

    Abstract: Modern training strategies of deep neural networks (NNs) tend to induce a heavy-tailed (HT) spectra of layer weights. Extensive efforts to study this phenomenon have found that NNs with HT weight spectra tend to generalize well. A prevailing notion for the occurrence of such HT spectra attributes gradient noise during training as a key contributing factor. Our work shows that gradient noise is unn… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 31 pages, 37 figures

  6. arXiv:2406.01288  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

    Authors: Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, **g Jiang, Min Lin

    Abstract: Recently, Anil et al. (2024) show that many-shot (up to hundreds of) demonstrations can jailbreak state-of-the-art LLMs by exploiting their long-context capability. Nevertheless, is it possible to use few-shot demonstrations to efficiently jailbreak LLMs within limited context sizes? While the vanilla few-shot jailbreaking may be inefficient, we propose improved techniques such as injecting specia… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.21018  [pdf, other

    cs.LG cs.CL cs.CR

    Improved Techniques for Optimization-Based Jailbreaking on Large Language Models

    Authors: Xiaojun Jia, Tianyu Pang, Chao Du, Yihao Huang, **dong Gu, Yang Liu, Xiaochun Cao, Min Lin

    Abstract: Large language models (LLMs) are being rapidly developed, and a key component of their widespread deployment is their safety-related alignment. Many red-teaming efforts aim to jailbreak LLMs, where among these efforts, the Greedy Coordinate Gradient (GCG) attack's success has led to a growing interest in the study of optimization-based jailbreaking techniques. Although GCG is a significant milesto… ▽ More

    Submitted 5 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  8. arXiv:2405.14129  [pdf, other

    cs.CL cs.AI cs.CV

    AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability

    Authors: Fei Zhao, Taotian Pang, Chunhui Li, Zhen Wu, Junjie Guo, Shangyu Xing, Xinyu Dai

    Abstract: Multimodal Large Language Models (MLLMs) are widely regarded as crucial in the exploration of Artificial General Intelligence (AGI). The core of MLLMs lies in their capability to achieve cross-modal alignment. To attain this goal, current MLLMs typically follow a two-phase training paradigm: the pre-training phase and the instruction-tuning phase. Despite their success, there are shortcomings in t… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Code and models are available at $\href{https://aligngpt-vl.github.io/}{\textit{this https URL}}$

  9. arXiv:2405.09024  [pdf, other

    cs.CV

    Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels

    Authors: Guozhang Liu, Ting Liu, Mengke Yuan, Tao Pang, Guangxing Yang, Hao Fu, Tao Wang, Tongkui Liao

    Abstract: The ambiguous appearance, tiny scale, and fine-grained classes of objects in remote sensing imagery inevitably lead to the noisy annotations in category labels of detection dataset. However, the effects and treatments of the label noises are underexplored in modern oriented remote sensing object detectors. To address this issue, we propose a robust oriented remote sensing object detection method t… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  10. arXiv:2405.03953  [pdf, other

    cs.SD eess.AS

    Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation

    Authors: Zixing Zhang, Tao Pang, **g Han, Björn W. Schuller

    Abstract: Heart murmurs are a common manifestation of cardiovascular diseases and can provide crucial clues to early cardiac abnormalities. While most current research methods primarily focus on the accuracy of models, they often overlook other important aspects such as the interpretability of machine learning algorithms and the uncertainty of predictions. This paper introduces a heart murmur detection meth… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Journal ref: published at ICASSP 2024

  11. arXiv:2403.16037  [pdf, other

    cs.IR

    Knowledge-aware Dual-side Attribute-enhanced Recommendation

    Authors: Taotian Pang, Xingyu Lou, Fei Zhao, Zhen Wu, Kuiyao Dong, Qiuying Peng, Yue Qi, Xinyu Dai

    Abstract: \textit{Knowledge-aware} recommendation methods (KGR) based on \textit{graph neural networks} (GNNs) and \textit{contrastive learning} (CL) have achieved promising performance. However, they fall short in modeling fine-grained user preferences and further fail to leverage the \textit{preference-attribute connection} to make predictions, leading to sub-optimal performance. To address the issue, we… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  12. arXiv:2402.16302  [pdf, other

    cs.LG cs.AI cs.CE

    Graph Diffusion Policy Optimization

    Authors: Yi**g Liu, Chao Du, Tianyu Pang, Chongxuan Li, Wei Chen, Min Lin

    Abstract: Recent research has made significant progress in optimizing diffusion models for specific downstream objectives, which is an important pursuit in fields such as graph generation for drug design. However, directly applying these models to graph diffusion presents challenges, resulting in suboptimal performance. This paper introduces graph diffusion policy optimization (GDPO), a novel approach to op… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  13. arXiv:2402.14845  [pdf, other

    cs.CL cs.AI cs.LG

    Purifying Large Language Models by Ensembling a Small Language Model

    Authors: Tianlin Li, Qian Liu, Tianyu Pang, Chao Du, Qing Guo, Yang Liu, Min Lin

    Abstract: The emerging success of large language models (LLMs) heavily relies on collecting abundant training data from external (untrusted) sources. Despite substantial efforts devoted to data cleaning and curation, well-constructed LLMs have been reported to suffer from copyright infringement, data poisoning, and/or privacy violations, which would impede practical deployment of LLMs. In this study, we pro… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    ACM Class: I.2

  14. arXiv:2402.13669  [pdf, other

    cs.CL

    Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning

    Authors: Zhaorui Yang, Tianyu Pang, Haozhe Feng, Han Wang, Wei Chen, Minfeng Zhu, Qian Liu

    Abstract: The surge in Large Language Models (LLMs) has revolutionized natural language processing, but fine-tuning them for specific tasks often encounters challenges in balancing performance and preserving general instruction-following abilities. In this paper, we posit that the distribution gap between task datasets and the LLMs serves as the primary underlying cause. To address the problem, we introduce… ▽ More

    Submitted 28 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  15. arXiv:2402.12150  [pdf, other

    cs.CL cs.AI

    Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One

    Authors: Tianlin Li, Xiaoyu Zhang, Chao Du, Tianyu Pang, Qian Liu, Qing Guo, Chao Shen, Yang Liu

    Abstract: The widespread adoption of large language models (LLMs) underscores the urgent need to ensure their fairness. However, LLMs frequently present dominant viewpoints while ignoring alternative perspectives from minority parties, resulting in potential biases. We hypothesize that these fairness-violating behaviors occur because LLMs express their viewpoints using a human personality that represents th… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    ACM Class: I.2; J.4

  16. arXiv:2402.08577  [pdf, other

    cs.CL cs.CR cs.CV cs.LG cs.MM

    Test-Time Backdoor Attacks on Multimodal Large Language Models

    Authors: Dong Lu, Tianyu Pang, Chao Du, Qian Liu, Xianjun Yang, Min Lin

    Abstract: Backdoor attacks are commonly executed by contaminating training data, such that a trigger can activate predetermined harmful effects during the test phase. In this work, we present AnyDoor, a test-time backdoor attack against multimodal large language models (MLLMs), which involves injecting the backdoor into the textual modality using adversarial test images (sharing the same universal perturbat… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  17. arXiv:2402.08567  [pdf, other

    cs.CL cs.CR cs.CV cs.LG cs.MA

    Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

    Authors: Xiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang, **g Jiang, Min Lin

    Abstract: A multimodal large language model (MLLM) agent can receive instructions, capture images, retrieve histories from memory, and decide which tools to use. Nonetheless, red-teaming efforts have revealed that adversarial images/prompts can jailbreak an MLLM and cause unaligned behaviors. In this work, we report an even more severe safety issue in multi-agent environments, referred to as infectious jail… ▽ More

    Submitted 3 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  18. arXiv:2401.17256  [pdf, other

    cs.CL

    Weak-to-Strong Jailbreaking on Large Language Models

    Authors: Xuandong Zhao, Xianjun Yang, Tianyu Pang, Chao Du, Lei Li, Yu-Xiang Wang, William Yang Wang

    Abstract: Large language models (LLMs) are vulnerable to jailbreak attacks - resulting in harmful, unethical, or biased text generations. However, existing jailbreaking methods are computationally costly. In this paper, we propose the weak-to-strong jailbreaking attack, an efficient method to attack aligned LLMs to produce harmful text. Our key intuition is based on the observation that jailbroken and align… ▽ More

    Submitted 5 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  19. arXiv:2401.11943  [pdf, other

    cs.LG cs.CL cs.CR cs.CV cs.MM

    Benchmarking Large Multimodal Models against Common Corruptions

    Authors: Jiawei Zhang, Tianyu Pang, Chao Du, Yi Ren, Bo Li, Min Lin

    Abstract: This technical report aims to fill a deficiency in the assessment of large multimodal models (LMMs) by specifically examining the self-consistency of their outputs when subjected to common corruptions. We investigate the cross-modal interactions between text, image, and speech, encompassing four essential generation tasks: text-to-image, image-to-text, text-to-speech, and speech-to-text. We create… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Technical report

  20. arXiv:2312.06348  [pdf, other

    cs.LG

    DiffAIL: Diffusion Adversarial Imitation Learning

    Authors: Bingzheng Wang, Guoqiang Wu, Teng Pang, Yan Zhang, Yilong Yin

    Abstract: Imitation learning aims to solve the problem of defining reward functions in real-world decision-making tasks. The current popular approach is the Adversarial Imitation Learning (AIL) framework, which matches expert state-action occupancy measures to obtain a surrogate reward for forward reinforcement learning. However, the traditional discriminator is a simple binary classifier and doesn't learn… ▽ More

    Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  21. arXiv:2312.00359  [pdf, other

    cs.LG stat.ML

    Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training

    Authors: Yefan Zhou, Tianyu Pang, Keqin Liu, Charles H. Martin, Michael W. Mahoney, Yaoqing Yang

    Abstract: Regularization in modern machine learning is crucial, and it can take various forms in algorithmic design: training set, model family, error function, regularization terms, and optimizations. In particular, the learning rate, which can be interpreted as a temperature-like parameter within the statistical mechanics of learning, plays a crucial role in neural network training. Indeed, many widely ad… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 Spotlight, first two authors contributed equally

  22. arXiv:2311.07604  [pdf, other

    cs.LG cs.AI cs.CV cs.CY

    Finetuning Text-to-Image Diffusion Models for Fairness

    Authors: Xudong Shen, Chao Du, Tianyu Pang, Min Lin, Yongkang Wong, Mohan Kankanhalli

    Abstract: The rapid adoption of text-to-image diffusion models in society underscores an urgent need to address their biases. Without interventions, these biases could propagate a skewed worldview and restrict opportunities for minority groups. In this work, we frame fairness as a distributional alignment problem. Our solution consists of two main technical contributions: (1) a distributional alignment loss… ▽ More

    Submitted 15 March, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: ICLR 2024 oral presentation

  23. arXiv:2311.00941  [pdf, other

    cs.LG cs.AI cs.CV

    Gaussian Mixture Solvers for Diffusion Models

    Authors: Hanzhong Guo, Cheng Lu, Fan Bao, Tianyu Pang, Shuicheng Yan, Chao Du, Chongxuan Li

    Abstract: Recently, diffusion models have achieved great success in generative tasks. Sampling from diffusion models is equivalent to solving the reverse diffusion stochastic differential equations (SDEs) or the corresponding probability flow ordinary differential equations (ODEs). In comparison, SDE-based solvers can generate samples of higher quality and are suited for image translation tasks like stroke-… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  24. arXiv:2311.00500  [pdf, other

    cs.LG cs.AI cs.CV

    Intriguing Properties of Data Attribution on Diffusion Models

    Authors: Xiaosen Zheng, Tianyu Pang, Chao Du, **g Jiang, Min Lin

    Abstract: Data attribution seeks to trace model outputs back to training data. With the recent development of diffusion models, data attribution has become a desired module to properly assign valuations for high-quality or copyrighted training samples, ensuring that data contributors are fairly compensated or credited. Several theoretically motivated methods have been proposed to implement data attribution,… ▽ More

    Submitted 15 March, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: ICLR 2024

  25. arXiv:2310.14225  [pdf, other

    cs.CL

    Customising General Large Language Models for Specialised Emotion Recognition Tasks

    Authors: Liyizhe Peng, Zixing Zhang, Tao Pang, **g Han, Huan Zhao, Hao Chen, Björn W. Schuller

    Abstract: The advent of large language models (LLMs) has gained tremendous attention over the past year. Previous studies have shown the astonishing performance of LLMs not only in other tasks but also in emotion recognition in terms of accuracy, universality, explanation, robustness, few/zero-shot learning, and others. Leveraging the capability of LLMs inevitably becomes an essential solution for emotion r… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  26. arXiv:2310.02664  [pdf, other

    cs.LG cs.AI cs.CV

    On Memorization in Diffusion Models

    Authors: Xiangming Gu, Chao Du, Tianyu Pang, Chongxuan Li, Min Lin, Ye Wang

    Abstract: Due to their capacity to generate novel and high-quality samples, diffusion models have attracted significant research interest in recent years. Notably, the typical training objective of diffusion models, i.e., denoising score matching, has a closed-form optimal solution that can only generate training data replicating samples. This indicates that a memorization behavior is theoretically expected… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  27. arXiv:2310.00385  [pdf, other

    cs.CL cs.AI

    Dynamic Demonstrations Controller for In-Context Learning

    Authors: Fei Zhao, Taotian Pang, Zhen Wu, Zheng Ma, Shujian Huang, Xinyu Dai

    Abstract: In-Context Learning (ICL) is a new paradigm for natural language processing (NLP), where a large language model (LLM) observes a small number of demonstrations and a test instance as its input, and directly makes predictions without updating model parameters. Previous studies have revealed that ICL is sensitive to the selection and the ordering of demonstrations. However, there are few studies reg… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: Under review

  28. arXiv:2308.11578  [pdf, other

    cs.CL cs.AI cs.LG

    Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models

    Authors: Zixing Zhang, Liyizhe Peng, Tao Pang, **g Han, Huan Zhao, Bjorn W. Schuller

    Abstract: After the inception of emotion recognition or affective computing, it has increasingly become an active research topic due to its broad applications. Over the past couple of decades, emotion recognition models have gradually migrated from statistically shallow models to neural network-based deep models, which can significantly boost the performance of emotion recognition models and consistently ac… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  29. arXiv:2307.13269  [pdf, other

    cs.CL cs.AI

    LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Authors: Chengsong Huang, Qian Liu, Bill Yuchen Lin, Tianyu Pang, Chao Du, Min Lin

    Abstract: Low-rank adaptations (LoRA) are often employed to fine-tune large language models (LLMs) for new tasks. This paper investigates LoRA composability for cross-task generalization and introduces LoraHub, a simple framework devised for the purposive assembly of LoRA modules trained on diverse given tasks, with the objective of achieving adaptable performance on unseen tasks. With just a few examples f… ▽ More

    Submitted 18 January, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Add more related work and experimental results

  30. arXiv:2307.02263  [pdf, other

    cs.LG cs.CY

    Dynamical Isometry based Rigorous Fair Neural Architecture Search

    Authors: Jianxiang Luo, Junyi Hu, Tianji Pang, Weihao Huang, Chuang Liu

    Abstract: Recently, the weight-sharing technique has significantly speeded up the training and evaluation procedure of neural architecture search. However, most existing weight-sharing strategies are solely based on experience or observation, which makes the searching results lack interpretability and rationality. In addition, due to the negligence of fairness, current methods are prone to make misjudgments… ▽ More

    Submitted 6 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  31. arXiv:2307.01465  [pdf, other

    cs.CV

    AdAM: Few-Shot Image Generation via Adaptation-Aware Kernel Modulation

    Authors: Yunqing Zhao, Keshigeyan Chandrasegaran, Milad Abdollahzadeh, Chao Du, Tianyu Pang, Ruoteng Li, Henghui Ding, Ngai-Man Cheung

    Abstract: Few-shot image generation (FSIG) aims to learn to generate new and diverse images given few (e.g., 10) training samples. Recent work has addressed FSIG by leveraging a GAN pre-trained on a large-scale source domain and adapting it to the target domain with few target samples. Central to recent FSIG methods are knowledge preservation criteria, which select and preserve a subset of source knowledge… ▽ More

    Submitted 10 November, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 32 pages, update additional information, discussion, and experimental sections compared to NeurIPS-22 version: arXiv:2210.16559. arXiv admin note: substantial text overlap with arXiv:2210.16559

  32. arXiv:2306.01435  [pdf, other

    cs.LG stat.ML

    Improving Adversarial Robustness of DEQs with Explicit Regulations Along the Neural Dynamics

    Authors: Zonghan Yang, Peng Li, Tianyu Pang, Yang Liu

    Abstract: Deep equilibrium (DEQ) models replace the multiple-layer stacking of conventional deep networks with a fixed-point iteration of a single-layer transformation. Having been demonstrated to be competitive in a variety of real-world scenarios, the adversarial robustness of general DEQs becomes increasingly crucial for their reliable deployment. Existing works improve the robustness of general DEQ mode… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023. Our code is available at https://github.com/minicheshire/DEQ-Regulating-Neural-Dynamics

  33. arXiv:2306.01429  [pdf, other

    cs.LG stat.ML

    A Closer Look at the Adversarial Robustness of Deep Equilibrium Models

    Authors: Zonghan Yang, Tianyu Pang, Yang Liu

    Abstract: Deep equilibrium models (DEQs) refrain from the traditional layer-stacking paradigm and turn to find the fixed point of a single layer. DEQs have achieved promising performance on different applications with featured memory efficiency. At the same time, the adversarial vulnerability of DEQs raises concerns. Several works propose to certify robustness for monotone DEQs. However, limited efforts are… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2022. Our code is available at https://github.com/minicheshire/DEQ-White-Box-Robustness

  34. arXiv:2305.20081  [pdf, other

    cs.LG cs.AI

    Efficient Diffusion Policies for Offline Reinforcement Learning

    Authors: Bingyi Kang, Xiao Ma, Chao Du, Tianyu Pang, Shuicheng Yan

    Abstract: Offline reinforcement learning (RL) aims to learn optimal policies from offline datasets, where the parameterization of policies is crucial but often overlooked. Recently, Diffsuion-QL significantly boosts the performance of offline RL by representing a policy with a diffusion model, whose success relies on a parametrized Markov Chain with hundreds of steps for sampling. However, Diffusion-QL suff… ▽ More

    Submitted 26 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted by NeurIPS 2023

  35. arXiv:2305.16934  [pdf, other

    cs.CV cs.CL cs.CR cs.LG cs.MM

    On Evaluating Adversarial Robustness of Large Vision-Language Models

    Authors: Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-Man Cheung, Min Lin

    Abstract: Large vision-language models (VLMs) such as GPT-4 have achieved unprecedented performance in response generation, especially with visual inputs, enabling more creative and adaptable interaction than large language models such as ChatGPT. Nonetheless, multimodal generation exacerbates safety concerns, since adversaries may successfully evade the entire system by subtly manipulating the most vulnera… ▽ More

    Submitted 29 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  36. arXiv:2305.03433  [pdf, other

    cs.AI cs.CY

    Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects

    Authors: Kehui Tan, Tianqi Pang, Chenyou Fan, Song Yu

    Abstract: This perspective paper proposes a series of interactive scenarios that utilize Artificial Intelligence (AI) to enhance classroom teaching, such as dialogue auto-completion, knowledge and style transfer, and assessment of AI-generated content. By leveraging recent developments in Large Language Models (LLMs), we explore the potential of AI to augment and enrich teacher-student dialogues and improve… ▽ More

    Submitted 12 June, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 16 pages, 2 figures

  37. arXiv:2305.03224  [pdf

    cs.LG q-fin.ST

    Carbon Price Forecasting with Quantile Regression and Feature Selection

    Authors: Tianqi Pang, Kehui Tan, Chenyou Fan

    Abstract: Carbon futures has recently emerged as a novel financial asset in the trading markets such as the European Union and China. Monitoring the trend of the carbon price has become critical for both national policy-making as well as industrial manufacturing planning. However, various geopolitical, social, and economic factors can impose substantial influence on the carbon price. Due to its volatility a… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  38. arXiv:2305.02164  [pdf, other

    cs.LG

    Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows

    Authors: Chao Du, Tianbo Li, Tianyu Pang, Shuicheng Yan, Min Lin

    Abstract: Sliced-Wasserstein Flow (SWF) is a promising approach to nonparametric generative modeling but has not been widely adopted due to its suboptimal generative quality and lack of conditional modeling capabilities. In this work, we make two major contributions to bridging this gap. First, based on a pleasant observation that (under certain conditions) the SWF of joint distributions coincides with thos… ▽ More

    Submitted 25 July, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  39. arXiv:2304.13911  [pdf, other

    cs.AI

    Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering

    Authors: Xiangyang Liu, Tianqi Pang, Chenyou Fan

    Abstract: We investigate how to enhance answer precision in frequently asked questions posed by distributed users using cloud-based Large Language Models (LLMs). Our study focuses on a typical situations where users ask similar queries that involve identical mathematical reasoning steps and problem-solving procedures. Due to the unsatisfactory accuracy of LLMs' zero-shot prompting with standalone questions,… ▽ More

    Submitted 30 June, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

  40. arXiv:2304.07574  [pdf, other

    cs.CV

    Exploring Incompatible Knowledge Transfer in Few-shot Image Generation

    Authors: Yunqing Zhao, Chao Du, Milad Abdollahzadeh, Tianyu Pang, Min Lin, Shuicheng Yan, Ngai-Man Cheung

    Abstract: Few-shot image generation (FSIG) learns to generate diverse and high-fidelity images from a target domain using a few (e.g., 10) reference samples. Existing FSIG methods select, preserve and transfer prior knowledge from a source generator (pretrained on a related domain) to learn the target generator. In this work, we investigate an underexplored issue in FSIG, dubbed as incompatible knowledge tr… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: 25 pages, 16 figures, 10 tables. The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  41. arXiv:2304.06627  [pdf, other

    cs.LG cs.CV

    CoSDA: Continual Source-Free Domain Adaptation

    Authors: Haozhe Feng, Zhaorui Yang, Hesun Chen, Tianyu Pang, Chao Du, Minfeng Zhu, Wei Chen, Shuicheng Yan

    Abstract: Without access to the source data, source-free domain adaptation (SFDA) transfers knowledge from a source-domain trained model to target domains. Recently, SFDA has gained popularity due to the need to protect the data privacy of the source domain, but it suffers from catastrophic forgetting on the source domain due to the lack of data. To systematically investigate the mechanism of catastrophic f… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 15 pages, 6 figures

  42. arXiv:2303.10137  [pdf, other

    cs.CV cs.CR cs.LG

    A Recipe for Watermarking Diffusion Models

    Authors: Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Ngai-Man Cheung, Min Lin

    Abstract: Diffusion models (DMs) have demonstrated advantageous potential on generative tasks. Widespread interest exists in incorporating DMs into downstream applications, such as producing or editing photorealistic images. However, practical deployment and unprecedented power of DMs raise legal issues, including copyright protection and monitoring of generated content. In this regard, watermarking has bee… ▽ More

    Submitted 15 October, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

  43. arXiv:2302.10688  [pdf, other

    cs.LG cs.CV stat.ML

    On Calibrating Diffusion Probabilistic Models

    Authors: Tianyu Pang, Cheng Lu, Chao Du, Min Lin, Shuicheng Yan, Zhijie Deng

    Abstract: Recently, diffusion probabilistic models (DPMs) have achieved promising results in diverse generative tasks. A typical DPM framework includes a forward process that gradually diffuses the data distribution and a reverse process that recovers the data distribution from time-dependent data scores. In this work, we observe that the stochastic reverse process of data scores is a martingale, from which… ▽ More

    Submitted 29 October, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023

  44. arXiv:2302.04638  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Better Diffusion Models Further Improve Adversarial Training

    Authors: Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng Yan

    Abstract: It has been recognized that the data generated by the denoising diffusion probabilistic model (DDPM) improves adversarial training. After two years of rapid development in diffusion models, a question naturally arises: can better diffusion models further improve adversarial training? This paper gives an affirmative answer by employing the most recent diffusion model which has higher efficiency (… ▽ More

    Submitted 1 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  45. arXiv:2302.04460  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Bag of Tricks for Training Data Extraction from Language Models

    Authors: Weichen Yu, Tianyu Pang, Qian Liu, Chao Du, Bingyi Kang, Yan Huang, Min Lin, Shuicheng Yan

    Abstract: With the advance of language models, privacy protection is receiving more attention. Training data extraction is therefore of great importance, as it can serve as a potential tool to assess privacy leakage. However, due to the difficulty of this task, most of the existing methods are proof-of-concept and still not effective enough. In this paper, we investigate and benchmark tricks for improving t… ▽ More

    Submitted 1 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  46. arXiv:2301.12195  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    Does Federated Learning Really Need Backpropagation?

    Authors: Haozhe Feng, Tianyu Pang, Chao Du, Wei Chen, Shuicheng Yan, Min Lin

    Abstract: Federated learning (FL) is a general principle for decentralized clients to train a server model collectively without sharing local data. FL is a promising framework with practical applications, but its standard training paradigm requires the clients to backpropagate through the model to compute gradients. Since these clients are typically edge devices and not fully trusted, executing backpropagat… ▽ More

    Submitted 26 May, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

  47. arXiv:2211.16544  [pdf, other

    cs.RO cs.HC

    Towards Transcervical Ultrasound Image Guidance for Transoral Robotic Surgery

    Authors: Wanwen Chen, Megha Kalia, Qi Zeng, Emily H. T. Pang, Razeyeh Bagherinasab, Thomas D. Milner, Farahna Sabiq, Eitan Prisman, Septimiu E. Salcudean

    Abstract: Purpose: Trans-oral robotic surgery (TORS) using the da Vinci surgical robot is a new minimally-invasive surgery method to treat oropharyngeal tumors, but it is a challenging operation. Augmented reality (AR) based on intra-operative ultrasound (US) has the potential to enhance the visualization of the anatomy and cancerous tumors to provide additional tools for decision-making in surgery. Methods… ▽ More

    Submitted 31 March, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 12 pages, 8 figures. Accepted by Information Processing for Computer Assisted Interventions (IPCAI 2023)

  48. arXiv:2206.10787  [pdf, other

    cs.RO

    Global Planning for Contact-Rich Manipulation via Local Smoothing of Quasi-dynamic Contact Models

    Authors: Tao Pang, H. J. Terry Suh, Lujie Yang, Russ Tedrake

    Abstract: The empirical success of Reinforcement Learning (RL) in the setting of contact-rich manipulation leaves much to be understood from a model-based perspective, where the key difficulties are often attributed to (i) the explosion of contact modes, (ii) stiff, non-smooth contact dynamics and the resulting exploding / discontinuous gradients, and (iii) the non-convexity of the planning problem. The sto… ▽ More

    Submitted 27 February, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: The first two authors contributed equally to this work

  49. arXiv:2205.13205  [pdf, ps, other

    cs.LG physics.chem-ph physics.comp-ph quant-ph

    $O(N^2)$ Universal Antisymmetry in Fermionic Neural Networks

    Authors: Tianyu Pang, Shuicheng Yan, Min Lin

    Abstract: Fermionic neural network (FermiNet) is a recently proposed wavefunction Ansatz, which is used in variational Monte Carlo (VMC) methods to solve the many-electron Schrödinger equation. FermiNet proposes permutation-equivariant architectures, on which a Slater determinant is applied to induce antisymmetry. FermiNet is proved to have universal approximation capability with a single determinant, namel… ▽ More

    Submitted 16 June, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: ICML 2022 AI for Science Workshop

  50. arXiv:2203.14101   

    cs.LG cs.AI cs.CL

    A Roadmap for Big Model

    Authors: Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, **g Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui , et al. (75 additional authors not shown)

    Abstract: With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm. Researchers have achieved various outcomes in the construction of BMs and the BM application in many fields. At present, there is a lack of research work that sorts out the overall progress of BMs and guides the follow-up research. In this paper, we cover not only the BM… ▽ More

    Submitted 20 April, 2022; v1 submitted 26 March, 2022; originally announced March 2022.

    Comments: This report has been withdrawn by the authors due to critical issues in Section 2.3.1 of Article 2