Skip to main content

Showing 1–50 of 322 results for author: Xie, T

.
  1. arXiv:2406.17565  [pdf, other

    cs.DC

    MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool

    Authors: Cunchen Hu, Heyang Huang, Junhao Hu, Jiang Xu, Xusheng Chen, Tao Xie, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan

    Abstract: Large language model (LLM) serving has transformed from stateless to stateful systems, utilizing techniques like context caching and disaggregated inference. These optimizations extend the lifespan and domain of the KV cache, necessitating a new architectural approach. We present MemServe, a unified system that integrates both inter-request and intra-request optimizations. MemServe introduces MemP… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.17305  [pdf, other

    cs.CL

    Retrieval Augmented Instruction Tuning for Open NER with Large Language Models

    Authors: Tingyu Xie, Jian Zhang, Yan Zhang, Yuanyuan Liang, Qi Li, Hongwei Wang

    Abstract: The strong capability of large language models (LLMs) has been applied to information extraction (IE) through either retrieval augmented prompting or instruction tuning (IT). However, the best way to incorporate information with LLMs for IE remains an open question. In this paper, we explore Retrieval Augmented Instruction Tuning (RA-IT) for IE, focusing on the task of open named entity recognitio… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.16756  [pdf, other

    cs.LG cs.AI cs.CY

    Addressing Polarization and Unfairness in Performative Prediction

    Authors: Kun **, Tian Xie, Yang Liu, Xueru Zhang

    Abstract: When machine learning (ML) models are used in applications that involve humans (e.g., online recommendation, school admission, hiring, lending), the model itself may trigger changes in the distribution of targeted data it aims to predict. Performative prediction (PP) is a framework that explicitly considers such model-dependent distribution shifts when learning ML models. While significant efforts… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.16495  [pdf, other

    cs.CL cs.AI

    OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser

    Authors: **gze Shi, Ting Xie, Bingheng Wu, Chunjun Zheng, Kai Wang

    Abstract: Recent research has shown that combining Mamba with Transformer architecture, which has selective state space and quadratic self-attention mechanism, outperforms using Mamba or Transformer architecture alone in language modeling tasks. The quadratic self-attention mechanism effectively alleviates the shortcomings of selective state space in handling long-term dependencies of any element in the seq… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  5. arXiv:2406.16017  [pdf, other

    quant-ph physics.atom-ph

    Competing excitation quenching and charge exchange in ultracold Li-Ba$^+$ collisions

    Authors: Xiaodong Xing, Pascal Weckesser, Fabian Thielemann, Tibor Jónás, Romain Vexiau, Nadia Bouloufa-Maafa, Eliane Luc-Koenig, Kirk W. Madison, Andrea Orbán, Ting Xie, Tobias Schaetz, Olivier Dulieu

    Abstract: Hybrid atom-ion systems are a rich and powerful platform for studying chemical reactions, as they feature both excellent control over the electronic state preparation and readout as well as a versatile tunability over the scattering energy, ranging from the few-partial wave regime to the quantum regime. In this work, we make use of these excellent control knobs, and present a joint experimental an… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 17 pages, 15 figures, 4 tables

  6. arXiv:2406.14598  [pdf, other

    cs.AI

    SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

    Authors: Tinghao Xie, Xiangyu Qi, Yi Zeng, Yangsibo Huang, Udari Madhushani Sehwag, Kaixuan Huang, Luxi He, Boyi Wei, Dacheng Li, Ying Sheng, Ruoxi Jia, Bo Li, Kai Li, Danqi Chen, Peter Henderson, Prateek Mittal

    Abstract: Evaluating aligned large language models' (LLMs) ability to recognize and reject unsafe user requests is crucial for safe, policy-compliant deployments. Existing evaluation efforts, however, face three limitations that we address with SORRY-Bench, our proposed benchmark. First, existing methods often use coarse-grained taxonomies of unsafe topics, and are over-representing some fine-grained topics… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2406.14526  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    Fantastic Copyrighted Beasts and How (Not) to Generate Them

    Authors: Luxi He, Yangsibo Huang, Weijia Shi, Tinghao Xie, Haotian Liu, Yue Wang, Luke Zettlemoyer, Chiyuan Zhang, Danqi Chen, Peter Henderson

    Abstract: Recent studies show that image and video generation models can be prompted to reproduce copyrighted content from their training data, raising serious legal concerns around copyright infringement. Copyrighted characters, in particular, pose a difficult challenge for image generation services, with at least one lawsuit already awarding damages based on the generation of these characters. Yet, little… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  8. arXiv:2406.12845  [pdf, other

    cs.LG cs.CL

    Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts

    Authors: Haoxiang Wang, Wei Xiong, Tengyang Xie, Han Zhao, Tong Zhang

    Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the primary method for aligning large language models (LLMs) with human preferences. The RLHF process typically starts by training a reward model (RM) using human preference data. Conventional RMs are trained on pairwise responses to the same user request, with relative ratings indicating which response humans prefer. The trained RM… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Technical report v1. Code and model are released at https://github.com/RLHFlow/RLHF-Reward-Modeling/

  9. arXiv:2406.04274  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models

    Authors: Xiang Ji, Sanjeev Kulkarni, Mengdi Wang, Tengyang Xie

    Abstract: This work studies the challenge of aligning large language models (LLMs) with offline preference data. We focus on alignment by Reinforcement Learning from Human Feedback (RLHF) in particular. While popular preference optimization methods exhibit good empirical performance in practice, they are not theoretically guaranteed to converge to the optimal policy and can provably fail when the data cover… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  10. arXiv:2406.03520  [pdf, other

    cs.CV cs.AI cs.LG

    VideoPhy: Evaluating Physical Commonsense for Video Generation

    Authors: Hritik Bansal, Zongyu Lin, Tianyi Xie, Zeshun Zong, Michal Yarom, Yonatan Bitton, Chenfanfu Jiang, Yizhou Sun, Kai-Wei Chang, Aditya Grover

    Abstract: Recent advances in internet-scale video data pretraining have led to the development of text-to-video generative models that can create high-quality videos across a broad range of visual concepts and styles. Due to their ability to synthesize realistic motions and render complex objects, these generative models have the potential to become general-purpose simulators of the physical world. However,… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 36 pages, 26 figures, 8 tables

  11. arXiv:2406.01304  [pdf, other

    cs.CL cs.AI cs.SE

    CodeR: Issue Resolving with Multi-Agent and Task Graphs

    Authors: Dong Chen, Shaoxin Lin, Muhan Zeng, Daoguang Zan, Jian-Gang Wang, Anton Cheshkov, Jun Sun, Hao Yu, Guoliang Dong, Artem Aliev, Jie Wang, Xiao Cheng, Guangtai Liang, Yuchi Ma, Pan Bian, Tao Xie, Qianxiang Wang

    Abstract: GitHub issue resolving recently has attracted significant attention from academia and industry. SWE-bench is proposed to measure the performance in resolving issues. In this paper, we propose CodeR, which adopts a multi-agent framework and pre-defined task graphs to Repair & Resolve reported bugs and add new features within code Repository. On SWE-bench lite, CodeR is able to solve 28.33% of issue… ▽ More

    Submitted 10 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: https://github.com/NL2Code/CodeR

  12. arXiv:2405.21046  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

    Authors: Tengyang Xie, Dylan J. Foster, Akshay Krishnamurthy, Corby Rosset, Ahmed Awadallah, Alexander Rakhlin

    Abstract: Reinforcement learning from human feedback (RLHF) has emerged as a central tool for language model alignment. We consider online exploration in RLHF, which exploits interactive access to human or AI feedback by deliberately encouraging the model to produce diverse, maximally informative responses. By allowing RLHF to confidently stray from the pre-trained model, online exploration offers the possi… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  13. arXiv:2405.19524  [pdf, other

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Gei**, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  14. arXiv:2405.18997  [pdf, other

    stat.ML cs.LG

    Kernel Semi-Implicit Variational Inference

    Authors: Ziheng Cheng, Longlin Yu, Tianyu Xie, Shiyue Zhang, Cheng Zhang

    Abstract: Semi-implicit variational inference (SIVI) extends traditional variational families with semi-implicit distributions defined in a hierarchical manner. Due to the intractable densities of semi-implicit distributions, classical SIVI often resorts to surrogates of evidence lower bound (ELBO) that would introduce biases for training. A recent advancement in SIVI, named SIVI-SM, utilizes an alternative… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICML 2024 camera ready

  15. arXiv:2405.18515  [pdf, other

    cs.LG

    Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

    Authors: Yunuo Chen, Tianyi Xie, Zeshun Zong, Xuan Li, Feng Gao, Yin Yang, Ying Nian Wu, Chenfanfu Jiang

    Abstract: Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to maintain balance when placed in physics-based simulations or 3D printed. This balance is crucial for satisfying user design intentions in interactive gaming, embod… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  16. arXiv:2405.16577  [pdf, other

    stat.ML cs.LG

    Reflected Flow Matching

    Authors: Tianyu Xie, Yu Zhu, Longlin Yu, Tong Yang, Ziheng Cheng, Shiyue Zhang, Xiangyu Zhang, Cheng Zhang

    Abstract: Continuous normalizing flows (CNFs) learn an ordinary differential equation to transform prior samples into data. Flow matching (FM) has recently emerged as a simulation-free approach for training CNFs by regressing a velocity model towards the conditional velocity field. However, on constrained domains, the learned velocity model may lead to undesirable flows that result in highly unnatural sampl… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: ICML 2024 camera-ready

  17. arXiv:2405.13803  [pdf, other

    cs.HC cs.CL

    Sunnie: An Anthropomorphic LLM-Based Conversational Agent for Mental Well-Being Activity Recommendation

    Authors: Siyi Wu, Feixue Han, Bingsheng Yao, Tianyi Xie, Xuan Zhao, Dakuo Wang

    Abstract: A longstanding challenge in mental well-being support is the reluctance of people to adopt psychologically beneficial activities, often due to lack of motivation, low perceived trustworthiness, and limited personalization of recommendations. Chatbots have shown promise in promoting positive mental health practices, yet their rigid interaction flows and less human-like conversational experiences pr… ▽ More

    Submitted 13 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: In Submission

  18. arXiv:2405.12420  [pdf, other

    cs.CV

    GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details

    Authors: Boqian Li, Xuan Li, Ying Jiang, Tianyi Xie, Feng Gao, Huamin Wang, Yin Yang, Chenfanfu Jiang

    Abstract: Traditional 3D garment creation is labor-intensive, involving sketching, modeling, UV map**, and texturing, which are time-consuming and costly. Recent advances in diffusion-based generative models have enabled new possibilities for 3D garment generation from text prompts, images, and videos. However, existing methods either suffer from inconsistencies among multi-view images or require addition… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  19. arXiv:2405.09939  [pdf, other

    cs.CL cs.AI

    SciQAG: A Framework for Auto-Generated Scientific Question Answering Dataset with Fine-grained Evaluation

    Authors: Yuwei Wan, Aswathy Ajith, Yixuan Liu, Ke Lu, Clara Grazian, Bram Hoex, Wenjie Zhang, Chunyu Kit, Tong Xie, Ian Foster

    Abstract: The use of question-answer (QA) pairs for training and evaluating large language models (LLMs) has attracted considerable attention. Yet few available QA datasets are based on knowledge from the scientific literature. Here we bridge this gap by presenting Automatic Generation of Scientific Question Answers (SciQAG), a framework for automatic generation and evaluation of scientific QA pairs sourced… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  20. arXiv:2405.08074  [pdf

    cond-mat.mes-hall cond-mat.str-el physics.optics

    Optical Imaging of Flavor Order in Flat Band Graphene

    Authors: Tian Xie, Tobias M. Wolf, Siyuan Xu, Zhiyuan Cui, Richen Xiong, Yunbo Ou, Patrick Hays, Ludwig F Holleis, Yi Guo, Owen I Sheekey, Caitlin Patterson, Trevor Arp, Kenji Watanabe, Takashi Taniguchi, Seth Ariel Tongay, Andrea F Young, Allan H. MacDonald, Chenhao **

    Abstract: Spin and valley flavor polarization plays a central role in the many-body physics of flat band graphene, with fermi surface reconstructions often accompanied by quantized anomalous Hall and superconducting state observed in a variety of experimental systems. Here we describe an optical technique that sensitively and selectively detects flavor textures via the exciton response of a proximal transit… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 29 pages, 4 figures, with supplementary materials

  21. arXiv:2405.08027  [pdf, other

    cs.LG cs.AI

    Automating Data Annotation under Strategic Human Agents: Risks and Potential Solutions

    Authors: Tian Xie, Xueru Zhang

    Abstract: As machine learning (ML) models are increasingly used in social domains to make consequential decisions about humans, they often have the power to reshape data distributions. Humans, as strategic agents, continuously adapt their behaviors in response to the learning system. As populations change dynamically, ML systems may need frequent updates to ensure high performance. However, acquiring high-q… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  22. arXiv:2405.04967  [pdf, other

    cond-mat.mtrl-sci

    MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures and Pressures

    Authors: Han Yang, Chenxi Hu, Yichi Zhou, Xixian Liu, Yu Shi, Jielan Li, Guanzhi Li, Zekun Chen, Shuizhou Chen, Claudio Zeni, Matthew Horton, Robert Pinsler, Andrew Fowler, Daniel Zügner, Tian Xie, Jake Smith, Lixin Sun, Qian Wang, Lingyu Kong, Chang Liu, Hongxia Hao, Ziheng Lu

    Abstract: Accurate and fast prediction of materials properties is central to the digital transformation of materials design. However, the vast design space and diverse operating conditions pose significant challenges for accurately modeling arbitrary material candidates and forecasting their properties. We present MatterSim, a deep learning model actively learned from large-scale first-principles computatio… ▽ More

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  23. arXiv:2405.04108  [pdf, other

    cs.CR cs.AI

    A2-DIDM: Privacy-preserving Accumulator-enabled Auditing for Distributed Identity of DNN Model

    Authors: Tianxiu Xie, Keke Gai, **g Yu, Liehuang Zhu, Kim-Kwang Raymond Choo

    Abstract: Recent booming development of Generative Artificial Intelligence (GenAI) has facilitated an emerging model commercialization for the purpose of reinforcement on model performance, such as licensing or trading Deep Neural Network (DNN) models. However, DNN model trading may trigger concerns of the unauthorized replications or misuses over the model, so that the benefit of the model ownership will b… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  24. arXiv:2405.01810  [pdf, other

    cs.AI cs.LG

    Non-linear Welfare-Aware Strategic Learning

    Authors: Tian Xie, Xueru Zhang

    Abstract: This paper studies algorithmic decision-making in the presence of strategic individual behaviors, where an ML model is used to make decisions about human agents and the latter can adapt their behavior strategically to improve their future data. Existing results on strategic learning have largely focused on the linear setting where agents with linear labeling functions best respond to a (noisy) lin… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  25. arXiv:2405.01807  [pdf, other

    cs.GT cs.AI

    Algorithmic Decision-Making under Agents with Persistent Improvement

    Authors: Tian Xie, Xuwei Tan, Xueru Zhang

    Abstract: This paper studies algorithmic decision-making under human's strategic behavior, where a decision maker uses an algorithm to make decisions about human agents, and the latter with information about the algorithm may exert effort strategically and improve to receive favorable decisions. Unlike prior works that assume agents benefit from their efforts immediately, we consider realistic scenarios whe… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  26. arXiv:2405.01797  [pdf, other

    cs.AI

    Learning under Imitative Strategic Behavior with Unforeseeable Outcomes

    Authors: Tian Xie, Zhiqun Zuo, Mohammad Mahdi Khalili, Xueru Zhang

    Abstract: Machine learning systems have been widely used to make decisions about individuals who may best respond and behave strategically to receive favorable outcomes, e.g., they may genuinely improve the true labels or manipulate observable features directly to game the system without changing labels. Although both behaviors have been studied (often as two separate problems) in the literature, most works… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  27. arXiv:2404.18598  [pdf, other

    cs.CV cs.GR

    Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting

    Authors: Tianyidan Xie, Rui Ma, Qian Wang, Xiaoqian Ye, Feixuan Liu, Ying Tai, Zhenyu Zhang, Zili Yi

    Abstract: Recent advancements in image inpainting, particularly through diffusion modeling, have yielded promising outcomes. However, when tested in scenarios involving the completion of images based on the foreground objects, current methods that aim to inpaint an image in an end-to-end manner encounter challenges such as "over-imagination", inconsistency between foreground and background, and limited dive… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 16 pages, 9 figures, project page: https://anywheremultiagent.github.io

  28. arXiv:2404.17561  [pdf, other

    stat.ME stat.ML

    Structured Conformal Inference for Matrix Completion with Applications to Group Recommender Systems

    Authors: Ziyi Liang, Tianmin Xie, Xin Tong, Matteo Sesia

    Abstract: We develop a conformal inference method to construct joint confidence regions for structured groups of missing entries within a sparsely observed matrix. This method is useful to provide reliable uncertainty estimation for group-level collaborative filtering; for example, it can be applied to help suggest a movie for a group of friends to watch together. Unlike standard conformal techniques, which… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  29. arXiv:2404.14367  [pdf, other

    cs.LG

    Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

    Authors: Fahim Tajwar, Anikait Singh, Archit Sharma, Rafael Rafailov, Jeff Schneider, Tengyang Xie, Stefano Ermon, Chelsea Finn, Aviral Kumar

    Abstract: Learning from preference labels plays a crucial role in fine-tuning large language models. There are several distinct approaches for preference fine-tuning, including supervised learning, on-policy reinforcement learning (RL), and contrastive learning. Different methods come with different implementation tradeoffs and performance differences, and existing empirical findings present different concl… ▽ More

    Submitted 2 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: International Conference on Machine Learning (ICML), 2024

  30. arXiv:2404.11369  [pdf

    cond-mat.supr-con cond-mat.str-el

    Pressure-driven right-triangle shape superconductivity in bilayer nickelate La$_3$Ni$_2$O$_7$

    Authors: **gyuan Li, Peiyue Ma, Hengyuan Zhang, Xing Huang, Chaoxin Huang, Mengwu Huo, Deyuan Hu, Zixian Dong, Chengliang He, Jiahui Liao, Xiang Chen, Tao Xie, Hualei Sun, Meng Wang

    Abstract: Here we report a comprehensive study of the crystal structure, resistivity, and alternating-current magnetic susceptibility in single crystals of La$_3$Ni$_2$O$_7$, with a hydrostatic pressure up to 104 GPa. X-ray diffraction measurements reveal a bilayer orthorhombic structure (space group $Amam$) at ambient pressure and a transformation into a tetragonal phase (space group $I4/mmm$) at ad critic… ▽ More

    Submitted 24 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 4 figures in the article and 4 figures in the supplementary. Comments and suggestions are welcome!

  31. arXiv:2404.07972  [pdf, other

    cs.AI cs.CL

    OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

    Authors: Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh **g Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu

    Abstract: Autonomous agents that accomplish complex computer tasks with minimal human interventions have the potential to transform human-computer interaction, significantly enhancing accessibility and productivity. However, existing benchmarks either lack an interactive environment or are limited to environments specific to certain applications or domains, failing to reflect the diverse and complex nature… ▽ More

    Submitted 30 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 51 pages, 21 figures

  32. arXiv:2404.07940  [pdf, other

    cs.SE cs.LG

    InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models

    Authors: Linyi Li, Shijie Geng, Zhenwen Li, Yibo He, Hao Yu, Ziyue Hua, Guanghan Ning, Siwei Wang, Tao Xie, Hongxia Yang

    Abstract: Large Language Models for code (code LLMs) have witnessed tremendous progress in recent years. With the rapid development of code LLMs, many popular evaluation benchmarks, such as HumanEval, DS-1000, and MBPP, have emerged to measure the performance of code LLMs with a particular focus on code generation tasks. However, they are insufficient to cover the full range of expected capabilities of code… ▽ More

    Submitted 27 June, 2024; v1 submitted 10 March, 2024; originally announced April 2024.

    Comments: 30 pages, 10 pages for main content, work in progress

  33. arXiv:2404.03715  [pdf, other

    cs.LG cs.AI cs.CL

    Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

    Authors: Corby Rosset, Ching-An Cheng, Arindam Mitra, Michael Santacroce, Ahmed Awadallah, Tengyang Xie

    Abstract: This paper studies post-training large language models (LLMs) using preference feedback from a powerful oracle to help a model iteratively improve over itself. The typical approach for post-training LLMs involves Reinforcement Learning from Human Feedback (RLHF), which traditionally separates reward learning and subsequent policy optimization. However, such a reward maximization approach is limite… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  34. arXiv:2404.03080  [pdf, other

    cs.CL cs.AI

    Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model

    Authors: Yanpeng Ye, Jie Ren, Shaozhou Wang, Yuwei Wan, Haofen Wang, Imran Razzak, Tong Xie, Wenjie Zhang

    Abstract: Knowledge in materials science is widely dispersed across extensive scientific literature, posing significant challenges for efficient discovery and integration of new materials. Traditional methods, often reliant on costly and time-consuming experimental approaches, further complicate rapid innovation. Addressing these challenges, the integration of artificial intelligence with materials science… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures, 3 tables

  35. arXiv:2403.17357  [pdf, other

    cs.SE cs.AI

    MESIA: Understanding and Leveraging Supplementary Nature of Method-level Comments for Automatic Comment Generation

    Authors: Xinglu Pan, Chenxiao Liu, Yanzhen Zou, Tao Xie, Bing Xie

    Abstract: Code comments are important for developers in program comprehension. In scenarios of comprehending and reusing a method, developers expect code comments to provide supplementary information beyond the method signature. However, the extent of such supplementary information varies a lot in different code comments. In this paper, we raise the awareness of the supplementary nature of method-level comm… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: In 32nd IEEE/ACM International Conference on Program Comprehension (ICPC'24)

  36. arXiv:2403.13765  [pdf, other

    cs.LG cs.AI cs.CV

    Towards Principled Representation Learning from Videos for Reinforcement Learning

    Authors: Dipendra Misra, Akanksha Saran, Tengyang Xie, Alex Lamb, John Langford

    Abstract: We study pre-training representations for decision-making using video data, which is abundantly available for tasks such as game agents and software testing. Even though significant empirical advances have been made on this problem, a theoretical understanding remains absent. We initiate the theoretical investigation into principled approaches for representation learning and focus on learning the… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 Spotlight Conference Paper

  37. arXiv:2403.10585  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint

    Authors: Haoyue Tang, Tian Xie, Aosong Feng, Hanyu Wang, Chenyang Zhang, Yang Bai

    Abstract: Solving image inverse problems (e.g., super-resolution and inpainting) requires generating a high fidelity image that matches the given input (the low-resolution image or the masked image). By using the input image as guidance, we can leverage a pretrained diffusion generative model to solve a wide range of image inverse tasks without task specific model fine-tuning. To precisely estimate the guid… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted and to Appear, AISTATS 2024

  38. arXiv:2403.03186  [pdf, other

    cs.AI

    Cradle: Empowering Foundation Agents Towards General Computer Control

    Authors: Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, Yujie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson , et al. (3 additional authors not shown)

    Abstract: Despite the success in specific scenarios, existing foundation agents still struggle to generalize across various virtual scenarios, mainly due to the dramatically different encapsulations of environments with manually designed observation and action spaces. To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through t… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  39. arXiv:2402.16567  [pdf, other

    cs.CL cs.AI cs.DB

    Aligning Large Language Models to a Domain-specific Graph Database

    Authors: Yuanyuan Liang, Keren Tan, Tingyu Xie, Wenbiao Tao, Siyuan Wang, Yunshi Lan, Weining Qian

    Abstract: Graph Databases (Graph DB) are widely applied in various fields, including finance, social networks, and medicine. However, translating Natural Language (NL) into the Graph Query Language (GQL), commonly known as NL2GQL, proves to be challenging due to its inherent complexity and specialized nature. Some approaches have sought to utilize Large Language Models (LLMs) to address analogous tasks like… ▽ More

    Submitted 28 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 13 pages,2 figures

  40. arXiv:2402.15069  [pdf, other

    astro-ph.HE

    Investigation of profile shifting and subpulse movement in PSR J0344-0901 with FAST

    Authors: H. M. Tedila, R. Yuen, N. Wang, D. Li, Z. G. Wen, W. M. Yan, J. P. Yuan, X. H. Han, P. Wang, W. W. Zhu, S. J. Dang, S. Q. Wang, J. T. Xie, Q. D. Wu, Sh. Khasanov, FAST Collaboration

    Abstract: We report two phenomena detected in PSR J0344$-$0901 from two observations conducted at frequency centered at 1.25 GHz using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The first phenomenon manifests as shifting in the pulse emission to later longitudinal phases and then gradually returns to its original location. The event lasts for about 216 pulse periods, with an average s… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  41. arXiv:2402.13254  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

    Authors: Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee

    Abstract: We propose CounterCurate, a framework to comprehensively improve the visio-linguistic compositional reasoning capability for both contrastive and generative multimodal models. In particular, we identify two critical under-explored problems: the neglect of the physically grounded reasoning (counting and position understanding) and the potential of using highly capable text and image generation mode… ▽ More

    Submitted 12 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 15 pages, 6 figures, 12 tables, Project Page: https://countercurate.github.io/

  42. arXiv:2402.12820  [pdf, other

    eess.SY

    ASCEND: Accurate yet Efficient End-to-End Stochastic Computing Acceleration of Vision Transformer

    Authors: Tong Xie, Yixuan Hu, Renjie Wei, Meng Li, Yuan Wang, Runsheng Wang, Ru Huang

    Abstract: Stochastic computing (SC) has emerged as a promising computing paradigm for neural acceleration. However, how to accelerate the state-of-the-art Vision Transformer (ViT) with SC remains unclear. Unlike convolutional neural networks, ViTs introduce notable compatibility and efficiency challenges because of their nonlinear functions, e.g., softmax and Gaussian Error Linear Units (GELU). In this pape… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted in DATE 2024

  43. arXiv:2402.11428  [pdf, other

    astro-ph.HE hep-th

    Modelling The Radial Distribution of Pulsars in the Galaxy

    Authors: J. T. Xie, J. B. Wang, N. Wang, R. Manchester, G. Hobbs

    Abstract: The Parkes 20 cm Multibeam pulsar surveys have discovered nearly half of the known pulsars and revealed many distant pulsars with high dispersion measures. Using a sample of 1,301 pulsars from these surveys, we have explored the spatial distribution and birth rate of normal pulsars. The pulsar distances used to calculate the pulsar surface density are estimated from the YMW16 electron-density mode… ▽ More

    Submitted 22 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  44. arXiv:2402.10671  [pdf, other

    cs.CL

    Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm

    Authors: Yuanzhen Xie, Xinzhou **, Tao Xie, MingXiong Lin, Liang Chen, Chenyun Yu, Lei Cheng, ChengXiang Zhuo, Bo Hu, Zang Li

    Abstract: In-context learning of large-language models (LLMs) has achieved remarkable success in the field of natural language processing, while extensive case studies reveal that the single-step chain-of-thought prompting approach faces challenges such as attention diffusion and inadequate performance in complex tasks like text-to-SQL. To improve the contextual learning capabilities of LLMs in text-to-SQL,… ▽ More

    Submitted 3 July, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  45. arXiv:2402.05162  [pdf, other

    cs.LG cs.AI cs.CL

    Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

    Authors: Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson

    Abstract: Large language models (LLMs) show inherent brittleness in their safety mechanisms, as evidenced by their susceptibility to jailbreaking and even non-malicious fine-tuning. This study explores this brittleness of safety alignment by leveraging pruning and low-rank modifications. We develop methods to identify critical regions that are vital for safety guardrails, and that are disentangled from util… ▽ More

    Submitted 1 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 22 pages, 9 figures. Project page is available at https://boyiwei.com/alignment-attribution/

  46. arXiv:2402.04623  [pdf, other

    cs.SE

    Validity-Preserving Delta Debugging via Generator

    Authors: Luyao Ren, Xing Zhang, Ziyue Hua, Yanyan Jiang, Xiao He, Tao Xie

    Abstract: Reducing test inputs that trigger bugs is crucial for efficient debugging. Delta debugging is the most popular approach for this purpose. When test inputs need to conform to certain specifications, existing delta debugging practice encounters a validity problem: it blindly applies reduction rules, producing a large number of invalid test inputs that do not satisfy the required specifications. This… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  47. A narrow-band parameterization for the stochastic gravitational wave background

    Authors: Tianyi Xie, Dongdong Zhang, Jie Jiang, Jia-Rui Li, Bo Wang, Yi-Fu Cai

    Abstract: In light of the non-perturbative resonance effects that may occur during inflation, we introduce a parametrization for the power spectrum of the stochastic gravitational wave background (SGWB) characterized by narrow-band amplification. We utilize the universal $Ω_\text{GW}\propto k^3$ infrared limit, applicable to a wide array of gravitational wave sources, to devise a robust yet straightforward… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  48. arXiv:2401.16663  [pdf, other

    cs.HC cs.CV

    VR-GS: A Physical Dynamics-Aware Interactive Gaussian Splatting System in Virtual Reality

    Authors: Ying Jiang, Chang Yu, Tianyi Xie, Xuan Li, Yutao Feng, Huamin Wang, Minchen Li, Henry Lau, Feng Gao, Yin Yang, Chenfanfu Jiang

    Abstract: As consumer Virtual Reality (VR) and Mixed Reality (MR) technologies gain momentum, there's a growing focus on the development of engagements with 3D virtual content. Unfortunately, traditional techniques for content creation, editing, and interaction within these virtual spaces are fraught with difficulties. They tend to be not only engineering-intensive but also require extensive expertise, whic… ▽ More

    Submitted 4 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  49. arXiv:2401.12635  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Neutron Scattering Studies on the High-$T_c$ Superconductor La$_3$Ni$_2$O$_{7-δ}$ at Ambient Pressure

    Authors: Tao Xie, Mengwu Huo, Xiaosheng Ni, Feiran Shen, Xing Huang, Hualei Sun, Helen C. Walker, Devashibhai Adroja, Dehong Yu, Bing Shen, Lunhua He, Kun Cao, Meng Wang

    Abstract: After several decades of studies of high-temperature superconductivity, there is no compelling theory for the mechanism yet; however, the spin fluctuations have been widely believed to play a crucial role in forming the superconducting Cooper pairs. The recent discovery of high-temperature superconductivity near 80 K in the bilayer nickelate La$_3$Ni$_2$O$_7$ under pressure provides a new platform… ▽ More

    Submitted 4 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 10 pages, 9 figures with supplementary information

  50. arXiv:2401.11525  [pdf, ps, other

    math.CO

    Weak rainbow saturation numbers of graphs

    Authors: Xihe Li, Jie Ma, Tianying Xie

    Abstract: For a fixed graph $H$, we say that an edge-colored graph $G$ is \emph{weakly $H$-rainbow saturated} if there exists an ordering $e_1, e_2, \ldots, e_m$ of $E\left(\overline{G}\right)$ such that, for any list $c_1, c_2, \ldots, c_m$ of pairwise distinct colors from $\mathbb{N}$, the non-edges $e_i$ in color $c_i$ can be added to $G$, one at a time, so that every added edge creates a new rainbow cop… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 13 pages, 1 figure