Showing 1–50 of 478 results for author: Feng, S

Searching in archive cs.
  1. arXiv:2406.17797  [pdf, other]

    physics.chem-ph cs.AI cs.LG

    MoleculeCLA: Rethinking Molecular Benchmark via Computational Ligand-Target Binding Analysis

    Authors: Shikun Feng, Jiaxin Zheng, Yinjun Jia, Yanwen Huang, Fengfeng Zhou, Wei-Ying Ma, Yanyan Lan

    Abstract: Molecular representation learning is pivotal for various molecular property prediction tasks related to drug discovery. Robust and accurate benchmarks are essential for refining and validating current methods. Existing molecular property benchmarks derived from wet experiments, however, face limitations such as data volume constraints, unbalanced label distribution, and noisy labels. To address th…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.16588  [pdf, other]

    eess.SY cs.FL

    Switching Controller Synthesis for Hybrid Systems Against STL Formulas

    Authors: Han Su, Shenghua Feng, Sinong Zhan, Naijun Zhan

    Abstract: Switching controllers play a pivotal role in directing hybrid systems (HSs) towards the desired objective, embodying a "correct-by-construction" approach to HS design. Identifying these objectives is thus crucial for the synthesis of effective switching controllers. While most existing works focus on safety and liveness, few of them consider timing constraints. In this paper, we delve into t…

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.15992  [pdf, other]

    cs.CL

    Can LLM Graph Reasoning Generalize beyond Pattern Memorization?

    Authors: Yizhuo Zhang, Heng Wang, Shangbin Feng, Zhaoxuan Tan, Xiaochuang Han, Tianxing He, Yulia Tsvetkov

    Abstract: Large language models (LLMs) demonstrate great potential for problems with implicit graphical structures, while recent works seek to enhance the graph reasoning capabilities of LLMs through specialized instruction tuning. The resulting 'graph LLMs' are evaluated with in-distribution settings only, thus it remains underexplored whether LLMs are learning generalizable graph reasoning skills or merel…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 16 pages, 6 figures, Code and data will be publicly available at https://github.com/MatthewYZhang/NLGift

    ACM Class: I.2.7

  4. arXiv:2406.15951  [pdf, other]

    cs.CL

    Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

    Authors: Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov

    Abstract: While existing alignment paradigms have been integral in developing large language models (LLMs), LLMs often learn an averaged human preference and struggle to model diverse preferences across cultures, demographics, and communities. We propose Modular Pluralism, a modular framework based on multi-LLM collaboration for pluralistic alignment: it "plugs into" a base LLM a pool of smaller but special…

    Submitted 22 June, 2024; originally announced June 2024.

  5. arXiv:2406.15948  [pdf, other]

    cs.CL

    Teaching LLMs to Abstain across Languages via Multilingual Feedback

    Authors: Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov

    Abstract: Multilingual LLMs often have knowledge disparities across languages, with larger gaps in under-resourced languages. Teaching LLMs to abstain in the face of knowledge gaps is thus a promising strategy to mitigate hallucinations in multilingual settings. However, previous studies on LLM abstention primarily focus on English; we find that directly applying existing solutions beyond English results in…

    Submitted 22 June, 2024; originally announced June 2024.

  6. arXiv:2406.15352  [pdf, other]

    cs.CL

    A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick

    Authors: Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Boyd-Graber

    Abstract: Keyword mnemonics are memorable explanations that link new terms to simpler keywords. Prior works generate mnemonics for students, but they do not guide models toward mnemonics students prefer and aid learning. We build SMART, a mnemonic generator trained on feedback from real students learning new terms. To train SMART, we first fine-tune LLaMA-2 on a curated set of user-written mnemonics. We the…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: In-Progress Preprint

  7. arXiv:2406.14103  [pdf, other]

    cs.AI

    Two-Stage Depth Enhanced Learning with Obstacle Map For Object Navigation

    Authors: Yanwei Zheng, Shaopu Feng, Bowen Huang, Changrui Li, Xiao Zhang, Dongxiao Yu

    Abstract: The task that requires an agent to navigate to a given object through only visual observation is called visual object navigation (VON). The main bottlenecks of VON are strategies exploration and prior knowledge exploitation. Traditional strategies exploration ignores the differences of searching and navigating stages, using the same reward in two stages, which reduces navigation performance and tr…

    Submitted 20 June, 2024; originally announced June 2024.

  8. arXiv:2406.11633  [pdf, other]

    cs.CV

    DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models

    Authors: Renqiu Xia, Song Mao, Xiangchao Yan, Hongbin Zhou, Bo Zhang, Haoyang Peng, Jiahao Pi, Daocheng Fu, Wenjie Wu, Hancheng Ye, Shiyang Feng, Bin Wang, Chao Xu, Conghui He, Pinlong Cai, Min Dou, Botian Shi, Sheng Zhou, Yongwei Wang, Bin Wang, Junchi Yan, Fei Wu, Yu Qiao

    Abstract: Scientific documents record research findings and valuable human knowledge, comprising a vast corpus of high-quality data. Leveraging multi-modality data extracted from these documents and assessing large models' abilities to handle scientific document-oriented tasks is therefore meaningful. Despite promising advancements, large models still perform poorly on multi-page scientific document extract…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Homepage of DocGenome: https://unimodal4reasoning.github.io/DocGenome_page; 22 pages, 11 figures

  9. arXiv:2406.11568  [pdf, other]

    cs.CL cs.SD eess.AS q-bio.NC

    Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

    Authors: Sheng Feng, Heyang Liu, Yu Wang, Yanfeng Wang

    Abstract: In this paper, we introduce a groundbreaking end-to-end (E2E) framework for decoding invasive brain signals, marking a significant advancement in the field of speech neuroprosthesis. Our methodology leverages the comprehensive reasoning abilities of large language models (LLMs) to facilitate direct decoding. By fully integrating LLMs, we achieve results comparable to the state-of-the-art cascade m…

    Submitted 17 June, 2024; originally announced June 2024.

  10. arXiv:2406.10447  [pdf, other]

    cs.CV

    The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences

    Authors: Bria Long, Violet Xiang, Stefan Stojanov, Robert Z. Sparks, Zi Yin, Grace E. Keene, Alvin W. M. Tan, Steven Y. Feng, Chengxu Zhuang, Virginia A. Marchman, Daniel L. K. Yamins, Michael C. Frank

    Abstract: Human children far exceed modern machine learning algorithms in their sample efficiency, achieving high performance in key domains with much less data than current models. This "data gap" is a key challenge both for building intelligent artificial systems and for understanding human development. Egocentric video capturing children's experience -- their "training data" -- is a key ingredient fo…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures, 4 tables and SI. Submitted to NeurIPS Datasets and Benchmarks

  11. arXiv:2406.09881  [pdf, other]

    cs.CL

    A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation

    Authors: Yongkang Liu, Ercong Nie, Shi Feng, Zheng Hua, Zifeng Ding, Daling Wang, Yifei Zhang, Hinrich Schütze

    Abstract: Current state-of-the-art dialogue systems heavily rely on extensive training datasets. However, challenges arise in domains where domain-specific training datasets are insufficient or entirely absent. To tackle this challenge, we propose a novel data Augmentation framework for Multi-Domain Dialogue Generation, referred to as AMD²G. The AMD…

    Submitted 28 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 17 pages, ECML-PKDD

    Journal ref: 2024 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

  12. arXiv:2406.09486  [pdf, other]

    cs.CV cs.AI

    SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets

    Authors: Shenghua Wan, Ziyuan Chen, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based offline reinforcement Learning (RL) is a promising approach that leverages existing data effectively in many real-world applications, especially those involving high-dimensional inputs like images and videos. To alleviate the distribution shift issue in offline RL, existing model-based methods heavily rely on the uncertainty of learned dynamics. However, the model uncertainty estimatio…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 23 pages, 10 figures

  13. arXiv:2406.07850  [pdf, other]

    cs.CL cs.AI

    Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation

    Authors: Yiwei Li, Fei Mi, Yitong Li, Yasheng Wang, Bin Sun, Shaoxiong Feng, Kan Li

    Abstract: Stochastic sampling strategies such as top-k and top-p have been widely used in dialogue generation task. However, as an open-domain chatting system, there will be two different conversation scenarios, i.e. chit-chat and knowledge-based question answering. In the former situation, responses diversity is essential due to the one-to-many nature in dialogue. The latter, on the other hand, requires le…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Findings

  14. arXiv:2406.05135  [pdf]

    cs.RO math.OC

    Smart Navigation System for Parking Assignment at Large Events: Incorporating Heterogeneous Driver Characteristics

    Authors: Xi Cheng, Gaofeng Su, Siyuan Feng, Ke Liu, Chen Zhu, Hui Lin, Jilin Song, Jianan Chen

    Abstract: Parking challenges escalate significantly during large events such as concerts or sports games, yet few studies address dynamic parking lot assignments for such occasions. This paper introduces a smart navigation system designed to optimize parking assignments swiftly during large events, utilizing a mixed search algorithm that accounts for the heterogeneous characteristics of drivers. We conducte…

    Submitted 14 May, 2024; originally announced June 2024.

  15. arXiv:2406.00922  [pdf, other]

    cs.CL cs.AI

    MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning

    Authors: Shuyue Stella Li, Vidhisha Balachandran, Shangbin Feng, Jonathan Ilgen, Emma Pierson, Pang Wei Koh, Yulia Tsvetkov

    Abstract: In high-stakes domains like clinical reasoning, AI assistants powered by large language models (LLMs) are yet to be reliable and safe. We identify a key obstacle towards reliability: existing LLMs are trained to answer any question, even with incomplete context in the prompt or insufficient parametric knowledge. We propose to change this paradigm to develop more careful LLMs that ask follow-up que…

    Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 29 pages, 12 figures

  16. arXiv:2405.19062  [pdf, other]

    cs.LG cs.AI

    SIG: Efficient Self-Interpretable Graph Neural Network for Continuous-time Dynamic Graphs

    Authors: Lanting Fang, Yulian Yang, Kai Wang, Shanshan Feng, Kaiyu Feng, Jie Gui, Shuliang Wang, Yew-Soon Ong

    Abstract: While dynamic graph neural networks have shown promise in various applications, explaining their predictions on continuous-time dynamic graphs (CTDGs) is difficult. This paper investigates a new research task: self-interpretable GNNs for CTDGs. We aim to predict future links within the dynamic graph while simultaneously providing causal explanations for these predictions. There are two key challen…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 19 pages

  17. arXiv:2405.18549  [pdf, other]

    cs.LG cs.DB cs.SC

    Learning from Uncertain Data: From Possible Worlds to Possible Models

    Authors: Jiongli Zhu, Su Feng, Boris Glavic, Babak Salimi

    Abstract: We introduce an efficient method for learning linear models from uncertain data, where uncertainty is represented as a set of possible variations in the data, leading to predictive multiplicity. Our approach leverages abstract interpretation and zonotopes, a type of convex polytope, to compactly represent these dataset variations, enabling the symbolic execution of gradient descent on all possible…

    Submitted 28 May, 2024; originally announced May 2024.

  18. arXiv:2405.10558  [pdf, other]

    cs.SI

    CACL: Community-Aware Heterogeneous Graph Contrastive Learning for Social Media Bot Detection

    Authors: Sirry Chen, Shuo Feng, Songsong Liang, Chen-Chen Zong, Jing Li, Piji Li

    Abstract: Social media bot detection is increasingly crucial with the rise of social media platforms. Existing methods predominantly construct social networks as graph and utilize graph neural networks (GNNs) for bot detection. However, most of these methods focus on how to improve the performance of GNNs while neglecting the community structure within social networks. Moreover, GNNs based methods still fac…

    Submitted 3 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024 findings

  19. arXiv:2405.10343  [pdf, other]

    q-bio.BM cs.AI cs.LG

    UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning

    Authors: Shikun Feng, Yuyan Ni, Minghao Li, Yanwen Huang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan

    Abstract: Recently, a noticeable trend has emerged in developing pre-trained foundation models in the domains of CV and NLP. However, for molecular pre-training, there lacks a universal model capable of effectively applying to various categories of molecular tasks, since existing prevalent pre-training methods exhibit effectiveness for specific types of downstream tasks. Furthermore, the lack of profound un…

    Submitted 15 May, 2024; originally announced May 2024.

  20. arXiv:2405.09220  [pdf, other]

    cs.LG cs.AI cs.CL

    ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models

    Authors: Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen

    Abstract: In this paper, we present the findings of our Project ALPINE which stands for "Autoregressive Learning for Planning In NEtworks." Project ALPINE initiates a theoretical investigation into the development of planning capabilities in Transformer-based language models through their autoregressive learning mechanisms, aiming to identify any potential limitations in their planning abilities. We abstra…

    Submitted 27 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  21. arXiv:2405.08298  [pdf, other]

    cs.LG

    Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments

    Authors: Ke Liu, Fan Hu, Hui Lin, Xi Cheng, Jianan Chen, Jilin Song, Siyuan Feng, Gaofeng Su, Chen Zhu

    Abstract: This paper explores the optimization of Ground Delay Programs (GDP), a prevalent Traffic Management Initiative used in Air Traffic Management (ATM) to reconcile capacity and demand discrepancies at airports. Employing Reinforcement Learning (RL) to manage the inherent uncertainties in the national airspace system (such as weather variability, fluctuating flight demands, and airport arrival rates), we…

    Submitted 13 May, 2024; originally announced May 2024.

  22. arXiv:2405.08293  [pdf, other]

    cs.LG

    Airport Delay Prediction with Temporal Fusion Transformers

    Authors: Ke Liu, Kaijing Ding, Xi Cheng, Jianan Chen, Siyuan Feng, Hui Lin, Jilin Song, Chen Zhu

    Abstract: Since flight delay hurts passengers, airlines, and airports, its prediction becomes crucial for the decision-making of all stakeholders in the aviation industry and thus has been attempted by various previous research. However, previous delay predictions are often categorical and at a highly aggregated level. To improve that, this study proposes to apply the novel Temporal Fusion Transformer model…

    Submitted 13 May, 2024; originally announced May 2024.

  23. arXiv:2405.07229  [pdf, other]

    cs.MM

    MM-InstructEval: Zero-Shot Evaluation of (Multimodal) Large Language Models on Multimodal Reasoning Tasks

    Authors: Xiaocui Yang, Wenfang Wu, Shi Feng, Ming Wang, Daling Wang, Yang Li, Qi Sun, Yifei Zhang, Xiaoming Fu, Soujanya Poria

    Abstract: The rising popularity of multimodal large language models (MLLMs) has sparked a significant increase in research dedicated to evaluating these models. However, current evaluation studies predominantly concentrate on the ability of models to comprehend and reason within a unimodal (vision-only) context, overlooking critical performance evaluations in complex multimodal reasoning tasks that integrat…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Under review, the new version of MM-BigBench: arXiv:2310.09036

  24. arXiv:2405.07090  [pdf, other]

    cs.HC

    MUD: Towards a Large-Scale and Noise-Filtered UI Dataset for Modern Style UI Modeling

    Authors: Sidong Feng, Suyu Ma, Han Wang, David Kong, Chunyang Chen

    Abstract: The importance of computational modeling of mobile user interfaces (UIs) is undeniable. However, these require a high-quality UI dataset. Existing datasets are often outdated, collected years ago, and are frequently noisy with mismatches in their visual representation. This presents challenges in modeling UI understanding in the wild. This paper introduces a novel approach to automatically mine UI…

    Submitted 11 May, 2024; originally announced May 2024.

  25. arXiv:2405.06705  [pdf, other]

    cs.CL cs.AI

    LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought

    Authors: Zhuoxuan Jiang, Haoyuan Peng, Shanshan Feng, Fan Li, Dongsheng Li

    Abstract: Self-correction is emerging as a promising approach to mitigate the issue of hallucination in Large Language Models (LLMs). To facilitate effective self-correction, recent research has proposed mistake detection as its initial step. However, current literature suggests that LLMs often struggle with reliably identifying reasoning mistakes when using simplistic prompting strategies. To address this…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: To appear at IJCAI 2024

  26. arXiv:2404.14701  [pdf, other]

    cs.LG

    Deep neural networks for choice analysis: Enhancing behavioral regularity with gradient regularization

    Authors: Siqi Feng, Rui Yao, Stephane Hess, Ricardo A. Daziano, Timothy Brathwaite, Joan Walker, Shenhao Wang

    Abstract: Deep neural networks (DNNs) frequently present behaviorally irregular patterns, significantly limiting their practical potentials and theoretical validity in travel behavior modeling. This study proposes strong and weak behavioral regularities as novel metrics to evaluate the monotonicity of individual demand functions (a.k.a. law of demand), and further designs a constrained optimization framewor…

    Submitted 22 April, 2024; originally announced April 2024.

  27. arXiv:2404.13076  [pdf, other]

    cs.CL cs.AI

    LLM Evaluators Recognize and Favor Their Own Generations

    Authors: Arjun Panickssery, Samuel R. Bowman, Shi Feng

    Abstract: Self-evaluation using large language models (LLMs) has proven valuable not only in benchmarking but also methods like reward modeling, constitutional AI, and self-refinement. But new biases are introduced due to the same LLM acting as both the evaluator and the evaluatee. One such bias is self-preference, where an LLM evaluator scores its own outputs higher than others' while human annotators cons…

    Submitted 15 April, 2024; originally announced April 2024.

  28. arXiv:2404.09151  [pdf, other]

    cs.SE cs.LG

    Emerging Platforms Meet Emerging LLMs: A Year-Long Journey of Top-Down Development

    Authors: Siyuan Feng, Jiawei Liu, Ruihang Lai, Charlie F. Ruan, Yong Yu, Lingming Zhang, Tianqi Chen

    Abstract: Deploying machine learning (ML) on diverse computing platforms is crucial to accelerate and broaden their applications. However, it presents significant software engineering challenges due to the fast evolution of models, especially the recent Large Language Models (LLMs), and the emergence of new computing platforms. Current ML frameworks are primarily engineered for CPU and CUDA platforms, leavi…

    Submitted 16 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  29. arXiv:2404.03386  [pdf, other]

    cs.RO cs.AI cs.LG

    SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring

    Authors: Kaichen Huang, Minghao Shao, Shenghua Wan, Hai-Hang Sun, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: In many real-world visual Imitation Learning (IL) scenarios, there is a misalignment between the agent's and the expert's perspectives, which might lead to the failure of imitation. Previous methods have generally solved this problem by domain alignment, which incurs extra computation and storage costs, and these methods fail to handle the hard cases where the viewpoint gap is too large.…

    Submitted 4 April, 2024; originally announced April 2024.

  30. arXiv:2404.03382  [pdf, other]

    cs.LG cs.AI

    DIDA: Denoised Imitation Learning based on Domain Adaptation

    Authors: Kaichen Huang, Hai-Hang Sun, Shenghua Wan, Minghao Shao, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: Imitating skills from low-quality datasets, such as sub-optimal demonstrations and observations with distractors, is common in real-world applications. In this work, we focus on the problem of Learning from Noisy Demonstrations (LND), where the imitator is required to learn from data with noise that often occurs during the processes of data collection or transmission. Previous IL methods improve t…

    Submitted 4 April, 2024; originally announced April 2024.

  31. arXiv:2404.01855  [pdf, other]

    cs.IR cs.AI

    Where to Move Next: Zero-shot Generalization of LLMs for Next POI Recommendation

    Authors: Shanshan Feng, Haoming Lyu, Caishun Chen, Yew-Soon Ong

    Abstract: Next Point-of-interest (POI) recommendation provides valuable suggestions for users to explore their surrounding environment. Existing studies rely on building recommendation models from large-scale users' check-in data, which is task-specific and needs extensive computational resources. Recently, the pretrained large language models (LLMs) have achieved significant advancements in various NLP tas…

    Submitted 22 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  32. arXiv:2404.00924  [pdf, other]

    cs.CV

    BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks

    Authors: Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo, Shiwei Feng, Dongfang Liu, Mingjie Tang, Xiangyu Zhang

    Abstract: Pixel-wise regression tasks (e.g., monocular depth estimation (MDE) and optical flow estimation (OFE)) have been widely involved in our daily life in applications like autonomous driving, augmented reality and video composition. Although certain applications are security-critical or bear societal significance, the adversarial robustness of such models are not sufficiently studied, especially in th…

    Submitted 24 May, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Paper accepted at ICML 2024

  33. arXiv:2404.00473  [pdf, other]

    cs.CR cs.LG

    Privacy Backdoors: Stealing Data with Corrupted Pretrained Models

    Authors: Shanglun Feng, Florian Tramèr

    Abstract: Practitioners commonly download pretrained machine learning models from open repositories and finetune them to fit specific applications. We show that this practice introduces a new risk of privacy backdoors. By tampering with a pretrained model's weights, an attacker can fully compromise the privacy of the finetuning data. We show how to build privacy backdoors for a variety of models, including…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Code at https://github.com/ShanglunFengatETHZ/PrivacyBackdoor

  34. arXiv:2403.18127  [pdf, ps, other]

    cs.LG math.ST stat.ML

    A Correction of Pseudo Log-Likelihood Method

    Authors: Shi Feng, Nuoya Xiong, Zhijie Zhang, Wei Chen

    Abstract: Pseudo log-likelihood is a type of maximum likelihood estimation (MLE) method used in various fields including contextual bandits, influence maximization of social networks, and causal bandits. However, in previous literature \citep{li2017provably, zhang2022online, xiong2022combinatorial, feng2023combinatorial1, feng2023combinatorial2}, the log-likelihood function may not be bounded, which may res…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 7 pages

  35. arXiv:2403.17188  [pdf, other]

    cs.CV cs.CR

    LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning

    Authors: Siyuan Cheng, Guanhong Tao, Yingqi Liu, Guangyu Shen, Shengwei An, Shiwei Feng, Xiangzhe Xu, Kaiyuan Zhang, Shiqing Ma, Xiangyu Zhang

    Abstract: Backdoor attack poses a significant security threat to Deep Learning applications. Existing attacks are often not evasive to established backdoor detection techniques. This susceptibility primarily stems from the fact that these attacks typically leverage a universal trigger pattern or transformation function, such that the trigger can cause misclassification for any input. In response to this, re…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

  36. arXiv:2403.16645  [pdf]

    cs.HC

    Virtual Co-Pilot: Multimodal Large Language Model-enabled Quick-access Procedures for Single Pilot Operations

    Authors: Fan Li, Shanshan Feng, Yuqi Yan, Ching-Hung Lee, Yew Soon Ong

    Abstract: Advancements in technology, pilot shortages, and cost pressures are driving a trend towards single-pilot and even remote operations in aviation. Considering the extensive workload and huge risks associated with single-pilot operations, the development of a Virtual Co-Pilot (V-CoP) is expected to be a potential way to ensure aviation safety. This study proposes a V-CoP concept and explores how huma…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 10 pages, 7 figures

  37. arXiv:2403.15699  [pdf]

    cs.CL

    FEEL: A Framework for Evaluating Emotional Support Capability with Large Language Models

    Authors: Huaiwen Zhang, Yu Chen, Ming Wang, Shi Feng

    Abstract: Emotional Support Conversation (ESC) is a typical dialogue that can effectively assist the user in mitigating emotional pressures. However, owing to the inherent subjectivity involved in analyzing emotions, current non-artificial methodologies face challenges in effectively appraising the emotional support capability. These metrics exhibit a low correlation with human judgments. Concurrently, manu…

    Submitted 15 May, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 14 pages, 3 figures and 4 tables

  38. arXiv:2403.13869  [pdf, other]

    cs.LG cs.AI

    Accurately Predicting Probabilities of Safety-Critical Rare Events for Intelligent Systems

    Authors: Ruoxuan Bai, Jingxuan Yang, Weiduo Gong, Yi Zhang, Qiujing Lu, Shuo Feng

    Abstract: Intelligent systems are increasingly integral to our daily lives, yet rare safety-critical events present significant latent threats to their practical deployment. Addressing this challenge hinges on accurately predicting the probability of safety-critical events occurring within a given time step from the current state, a metric we define as 'criticality'. The complexity of predicting criticality…

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  39. arXiv:2403.13714  [pdf, other]

    cs.RO cs.CV

    DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping

    Authors: Yuxuan Zhou, Xingxing Li, Shengyu Li, Xuanbin Wang, Shaoquan Feng, Yuxuan Tan

    Abstract: Visual simultaneous localization and mapping (VSLAM) has broad applications, with state-of-the-art methods leveraging deep neural networks for better robustness and applicability. However, there is a lack of research in fusing these learning-based methods with multi-sensor information, which could be indispensable to push related applications to large-scale and complex scenarios. In this paper, we…

    Submitted 20 March, 2024; originally announced March 2024.

  40. arXiv:2403.11574  [pdf, ps, other]

    cs.LG

    Offline Multitask Representation Learning for Reinforcement Learning

    Authors: Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup

    Abstract: We study offline multitask representation learning in reinforcement learning (RL), where a learner is provided with an offline dataset from different tasks that share a common representation and is asked to learn the shared representation. We theoretically investigate offline multitask low-rank RL, and propose a new algorithm called MORL for offline multitask representation learning. Furthermore,…

    Submitted 18 March, 2024; originally announced March 2024.

  41. arXiv:2403.11144  [pdf, other]

    cs.LG

    Is Mamba Effective for Time Series Forecasting?

    Authors: Zihan Wang, Fanheng Kong, Shi Feng, Ming Wang, Xiaocui Yang, Han Zhao, Daling Wang, Yifei Zhang

    Abstract: In the realm of time series forecasting (TSF), it is imperative for models to adeptly discern and distill hidden patterns within historical time series data to forecast future states. Transformer-based models exhibit formidable efficacy in TSF, primarily attributed to their advantage in apprehending these patterns. However, the quadratic complexity of the Transformer leads to low computational eff…

    Submitted 27 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  42. arXiv:2403.09976  [pdf, other]

    cs.LG cs.CV

    AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors

    Authors: Yucen Wang, Shenghua Wan, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based methods have significantly contributed to distinguishing task-irrelevant distractors for visual control. However, prior research has primarily focused on heterogeneous distractors like noisy background videos, leaving homogeneous distractors that closely resemble controllable agents largely unexplored, which poses significant challenges to existing methods. To tackle this problem, we p…

    Submitted 5 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  43. arXiv:2403.07331  [pdf, other]

    cs.IR cs.DB

    LIST: Learning to Index Spatio-Textual Data for Embedding based Spatial Keyword Queries

    Authors: Ziqi Yin, Shanshan Feng, Shang Liu, Gao Cong, Yew Soon Ong, Bin Cui

    Abstract: With the proliferation of spatio-textual data, Top-k KNN spatial keyword queries (TkQs), which return a list of objects based on a ranking function that evaluates both spatial and textual relevance, have found many real-life applications. Existing geo-textual indexes for TkQs use traditional retrieval models like BM25 to compute text relevance and usually exploit a simple linear function to comput…

    Submitted 18 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  44. arXiv:2402.19275  [pdf, other]

    eess.SY cs.LG

    Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning

    Authors: Jingxuan Yang, Ruoxuan Bai, Haoyuan Ji, Yi Zhang, Jianming Hu, Shuo Feng

    Abstract: The assessment of safety performance plays a pivotal role in the development and deployment of connected and automated vehicles (CAVs). A common approach involves designing testing scenarios based on prior knowledge of CAVs (e.g., surrogate models), conducting tests in these scenarios, and subsequently evaluating CAVs' safety performances. However, substantial differences between CAVs and the prio…

    Submitted 29 February, 2024; originally announced February 2024.

  45. arXiv:2402.16929  [pdf, other]

    cs.SE cs.AI cs.CL cs.PL

    LangGPT: Rethinking Structured Reusable Prompt Design Framework for LLMs from the Programming Language

    Authors: Ming Wang, Yuanzhong Liu, Xiaoyu Liang, Songlian Li, Yijie Huang, Xiaoming Zhang, Sijia Shen, Chaofeng Guan, Daling Wang, Shi Feng, Huaiwen Zhang, Yifei Zhang, Minghui Zheng, Chi Zhang

    Abstract: LLMs have demonstrated commendable performance across diverse domains. Nevertheless, formulating high-quality prompts to instruct LLMs proficiently poses a challenge for non-AI experts. Existing research in prompt engineering suggests somewhat scattered optimization principles and designs empirically dependent prompt optimizers. Unfortunately, these endeavors lack a structured design template, inc…

    Submitted 29 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  46. arXiv:2402.13779  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Contextual Molecule Representation Learning from Chemical Reaction Knowledge

    Authors: Han Tang, Shikun Feng, Bicheng Lin, Yuyan Ni, JIngjing Liu, Wei-Ying Ma, Yanyan Lan

    Abstract: In recent years, self-supervised learning has emerged as a powerful tool to harness abundant unlabelled data for representation learning and has been broadly adopted in diverse areas. However, when applied to molecular representation learning (MRL), prevailing techniques such as masked sub-unit reconstruction often fall short, due to the high degree of freedom in the possible combinations of atoms…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint. Under Review

  47. arXiv:2402.13249  [pdf, other]

    cs.CL cs.AI

    TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

    Authors: Liyan Tang, Igor Shalyminov, Amy Wing-mei Wong, Jon Burnsky, Jake W. Vincent, Yu'an Yang, Siffi Singh, Song Feng, Hwanjun Song, Hang Su, Lijia Sun, Yi Zhang, Saab Mansour, Kathleen McKeown

    Abstract: Single document news summarization has seen substantial progress on faithfulness in recent years, driven by research on the evaluation of factual consistency, or hallucinations. We ask whether these advances carry over to other text summarization domains. We propose a new evaluation benchmark on topic-focused dialogue summarization, generated by LLMs of varying sizes. We provide binary sentence-le…

    Submitted 31 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: NAACL 2024; Linguistic annotations available at https://github.com/amazon-science/tofueval

  48. arXiv:2402.12291  [pdf, other]

    cs.CL

    KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students

    Authors: Matthew Shu, Nishant Balepur, Shi Feng, Jordan Boyd-Graber

    Abstract: Flashcard schedulers are tools that rely on 1) student models to predict the flashcards a student knows; and 2) teaching policies to schedule cards based on these predictions. Existing student models, however, only use flashcard-level features, like the student's past responses, ignoring the semantic ties of flashcards. Deep Knowledge Tracing (DKT) models can capture semantic relations with langua…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: In-progress preprint

  49. arXiv:2402.11638  [pdf, other]

    cs.CL

    Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

    Authors: Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, Tianxing He

    Abstract: The widespread use of large language models (LLMs) is increasing the demand for methods that detect machine-generated text to prevent misuse. The goal of our study is to stress test the detectors' robustness to malicious attacks under realistic scenarios. We comprehensively study the robustness of popular machine-generated text detectors under attacks from diverse categories: editing, paraphrasing…

    Submitted 18 February, 2024; originally announced February 2024.

  50. arXiv:2402.11057  [pdf, other]

    cs.CV

    Are you Struggling? Dataset and Baselines for Struggle Determination in Assembly Videos

    Authors: Shijia Feng, Michael Wray, Brian Sullivan, Youngkyoon Jang, Casimir Ludwig, Iain Gilchrist, Walterio Mayol-Cuevas

    Abstract: Determining when people are struggling from video enables a finer-grained understanding of actions and opens opportunities for building intelligent support visual interfaces. In this paper, we present a new dataset with three assembly activities and corresponding performance baselines for the determination of struggle from video. Three real-world problem-solving activities including assembling plu…

    Submitted 28 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.