Skip to main content

Showing 1–50 of 646 results for author: Yu, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19532  [pdf, other

    cs.DM cs.LG

    Dataless Quadratic Neural Networks for the Maximum Independent Set Problem

    Authors: Ismail Alkhouri, Cedric Le Denmat, Yingjie Li, Cunxi Yu, Jia Liu, Rongrong Wang, Alvaro Velasquez

    Abstract: Combinatorial Optimization (CO) plays a crucial role in addressing various significant problems, among them the challenging Maximum Independent Set (MIS) problem. In light of recent advancements in deep learning methods, efforts have been directed towards leveraging data-driven learning approaches, typically rooted in supervised learning and reinforcement learning, to tackle the NP-hard MIS proble… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.16995  [pdf, other

    q-bio.QM cs.AI

    A large language model for predicting T cell receptor-antigen binding specificity

    Authors: Xing Fang, Chenpeng Yu, Shiye Tian, Hui Liu

    Abstract: The human immune response depends on the binding of T-cell receptors (TCRs) to antigens (pTCR), which elicits the T cells to eliminate viruses, tumor cells, and other pathogens. The ability of human immunity system responding to unknown viruses and bacteria stems from the TCR diversity. However, this vast diversity poses challenges on the TCR-antigen binding prediction methods. In this study, we p… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.15282  [pdf, other

    cs.GT cs.CR econ.TH

    Computing Optimal Manipulations in Cryptographic Self-Selection Proof-of-Stake Protocols

    Authors: Matheus V. X. Ferreira, Aadityan Ganesh, Jack Hourigan, Hannah Huh, S. Matthew Weinberg, Catherine Yu

    Abstract: Cryptographic Self-Selection is a paradigm employed by modern Proof-of-Stake consensus protocols to select a block-proposing "leader." Algorand [Chen and Micali, 2019] proposes a canonical protocol, and Ferreira et al. [2022] establish bounds $f(α,β)$ on the maximum fraction of rounds a strategic player can lead as a function of their stake $α$ and a network connectivity parameter $β$. While both… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Appeared in the 25th ACM Conference on Economics and Computation (EC '24)

    ACM Class: G.3

  4. arXiv:2406.14422  [pdf, other

    cs.CV cs.AI

    FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding

    Authors: Mingkun Wang, Xiaoguang Ren, Ruochun **, Minglong Li, Xiaochuan Zhang, Changqian Yu, Mingxu Wang, Wen**g Yang

    Abstract: Most prior motion prediction endeavors in autonomous driving have inadequately encoded future scenarios, leading to predictions that may fail to accurately capture the diverse movements of agents (e.g., vehicles or pedestrians). To address this, we propose FutureNet, which explicitly integrates initially predicted trajectories into the future scenario and further encodes these future contexts to e… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 10 pages

  5. arXiv:2406.13626  [pdf, other

    cs.CL cs.AI

    Fine-Tuning Gemma-7B for Enhanced Sentiment Analysis of Financial News Headlines

    Authors: Kangtong Mo, Wenyan Liu, Xuanzhen Xu, Chang Yu, Yuelin Zou, Fangqing Xia

    Abstract: In this study, we explore the application of sentiment analysis on financial news headlines to understand investor sentiment. By leveraging Natural Language Processing (NLP) and Large Language Models (LLM), we analyze sentiment from the perspective of retail investors. The FinancialPhraseBank dataset, which contains categorized sentiments of financial news headlines, serves as the basis for our an… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  6. arXiv:2406.12095  [pdf, other

    cs.CV cs.AI cs.RO

    DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

    Authors: Letian Wang, Seung Wook Kim, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang, Sanja Fidler, Marco Pavone, Peter Karkus

    Abstract: We propose DistillNeRF, a self-supervised learning framework addressing the challenge of understanding 3D environments from limited 2D observations in autonomous driving. Our method is a generalizable feedforward model that predicts a rich neural scene representation from sparse, single-frame multi-view camera inputs, and is trained self-supervised with differentiable rendering to reconstruct RGB,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2406.11645  [pdf, other

    cs.HC cs.CV

    SeamPose: Repurposing Seams as Capacitive Sensors in a Shirt for Upper-Body Pose Tracking

    Authors: Tianhong Catherine Yu, Manru, Zhang, Peter He, Chi-Jung Lee, Cassidy Cheesman, Saif Mahmud, Ruidong Zhang, François Guimbretière, Cheng Zhang

    Abstract: Seams are areas of overlap** fabric formed by stitching two or more pieces of fabric together in the cut-and-sew apparel manufacturing process. In SeamPose, we repurposed seams as capacitive sensors in a shirt for continuous upper-body pose estimation. Compared to previous all-textile motion-capturing garments that place the electrodes on the surface of clothing, our solution leverages existing… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.11619  [pdf, other

    eess.AS cs.LG

    AV-CrossNet: an Audiovisual Complex Spectral Map** Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling

    Authors: Vahid Ahmadi Kalkhorani, Cheng Yu, Anurag Kumar, Ke Tan, Buye Xu, DeLiang Wang

    Abstract: Adding visual cues to audio-based speech separation can improve separation performance. This paper introduces AV-CrossNet, an audiovisual (AV) system for speech enhancement, target speaker extraction, and multi-talker speaker separation. AV-CrossNet is extended from the CrossNet architecture, which is a recently proposed network that performs complex spectral map** for speech separation by lever… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 Figures, and 4 Tables

  9. arXiv:2406.06593  [pdf, other

    cs.LG cs.AI cs.AR

    Differentiable Combinatorial Scheduling at Scale

    Authors: Mingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu

    Abstract: This paper addresses the complex issue of resource-constrained scheduling, an NP-hard problem that spans critical areas including chip design and high-performance computing. Traditional scheduling methods often stumble over scalability and applicability challenges. We propose a novel approach using a differentiable combinatorial scheduling framework, utilizing Gumbel-Softmax differentiable samplin… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 13 pages; International Conference on Machine Learning (ICML'24)

  10. arXiv:2406.06252  [pdf, other

    eess.SP cs.CR

    Random Time-hop** Secure Ranging Strategy Against Distance-Reduction Attacks in UWB

    Authors: Wenlong Gou, Chuanhang Yu, Gang Wu

    Abstract: In order to mitigate the distance reduction attack in Ultra-Wide Band (UWB) ranging, this paper proposes a secure ranging scheme based on a random time-hop** mechanism without redundant signaling overhead. Additionally, a secure ranging strategy is designed for backward compatibility with existing standards such as IEEE 802.15.4a/z, combined with an attack detection scheme. The effectiveness and… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    ACM Class: H.1.1

  11. arXiv:2406.05687  [pdf, other

    cs.RO

    FlightBench: A Comprehensive Benchmark of Spatial Planning Methods for Quadrotors

    Authors: Shu-Ang Yu, Chao Yu, Feng Gao, Yi Wu, Yu Wang

    Abstract: Spatial planning in cluttered environments is crucial for mobile systems, particularly agile quadrotors. Existing methods, both optimization-based and learning-based, often focus only on success rates in specific environments and lack a unified platform with tasks of varying difficulty. To address this, we introduce FlightBench, the first comprehensive open-source benchmark for 3D spatial planning… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: The first three authors contribute equally

  12. arXiv:2406.05488  [pdf, other

    cs.LG cs.AI

    Online Policy Distillation with Decision-Attention

    Authors: Xinqiang Yu, Chuanguang Yang, Chengqing Yu, Libo Huang, Zhulin An, Yongjun Xu

    Abstract: Policy Distillation (PD) has become an effective method to improve deep reinforcement learning tasks. The core idea of PD is to distill policy knowledge from a teacher agent to a student agent. However, the teacher-student framework requires a well-trained teacher model which is computationally expensive.In the light of online knowledge distillation, we study the knowledge transfer between differe… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  13. arXiv:2406.05128  [pdf, other

    eess.AS cs.SD

    Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis

    Authors: Chin-Yun Yu, György Fazekas

    Abstract: Training the linear prediction (LP) operator end-to-end for audio synthesis in modern deep learning frameworks is slow due to its recursive formulation. In addition, frame-wise approximation as an acceleration method cannot generalise well to test time conditions where the LP is computed sample-wise. Efficient differentiable sample-wise LP for end-to-end training is the key to removing this barrie… ▽ More

    Submitted 18 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  14. arXiv:2406.04979  [pdf, other

    cs.CV

    Semantic Segmentation on VSPW Dataset through Masked Video Consistency

    Authors: Chen Liang, Qiang Guo, Chongkai Yu, Cheng**g Wu, Ting Liu, Luoqi Liu

    Abstract: Pixel-level Video Understanding requires effectively integrating three-dimensional data in both spatial and temporal dimensions to learn accurate and stable semantic information from continuous frames. However, existing advanced models on the VSPW dataset have not fully modeled spatiotemporal relationships. In this paper, we present our solution for the PVUW competition, where we introduce masked… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  15. arXiv:2406.04658  [pdf, other

    cs.CR cs.AI cs.LG

    Advanced Payment Security System:XGBoost, CatBoost and SMOTE Integrated

    Authors: Qi Zheng, Chang Yu, ** Cao, Yongshun Xu, Qianwen Xing, Yinxin **

    Abstract: With the rise of various online and mobile payment systems, transaction fraud has become a significant threat to financial security. This study explores the application of advanced machine learning models, specifically XGBoost and LightGBM, for develo** a more accurate and robust Payment Security Protection Model.To enhance data reliability, we meticulously processed the data sources and used SM… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: This paper is received by https://ieee-metacom.org

  16. arXiv:2406.04466  [pdf, other

    cs.HC

    Dog Heart Rate and Blood Oxygen Metaverse Interaction System

    Authors: Yanhui Jiang, ** Cao, Chang Yu

    Abstract: This study developed an improved dog heart rate and blood oxygen sensor system using Arduino. Traditional methods face accuracy and reliability issues. Our system integrates advanced computational techniques with hardware-based sensing to enhance measurement precision. An Arduino microcontroller connected to a heart rate and blood oxygen sensor collects raw data, which is preprocessed and filtered… ▽ More

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures, conference for IEEE metacom accepted (https://ieee-metacom.org/)

  17. arXiv:2406.04465  [pdf, other

    cs.HC

    Rough Set improved Therapy-Based Metaverse Assisting System

    Authors: ** Cao, Yanhui Jiang, Chang Yu, Feiwei Qin, Zekun Jiang

    Abstract: Chronic neck and shoulder pain (CNSP) is a major global public health issue. Traditional treatments like physiotherapy and rehabilitation have drawbacks, including high costs, low precision, and user discomfort. This paper presents an interactive system based on Cognitive Therapy Theory (CBT) for CNSP treatment. The system includes a pain detection module using EMG and IMU to monitor pain and opti… ▽ More

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures, conference for IEEE metacom accepted (https://ieee-metacom.org/)

  18. arXiv:2406.03733  [pdf, other

    cs.LG cs.AI

    Credit Card Fraud Detection Using Advanced Transformer Model

    Authors: Chang Yu, Yongshun Xu, ** Cao, Ye Zhang, Yinxin **, Mengran Zhu

    Abstract: With the proliferation of various online and mobile payment systems, credit card fraud has emerged as a significant threat to financial security. This study focuses on innovative applications of the latest Transformer models for more robust and precise fraud detection. To ensure the reliability of the data, we meticulously processed the data sources, balancing the dataset to address the issue of d… ▽ More

    Submitted 21 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: This paper have been received by https://ieee-metacom.org/

  19. arXiv:2406.02126  [pdf, other

    eess.SY cs.AI cs.LG cs.MA

    CityLight: A Universal Model Towards Real-world City-scale Traffic Signal Control Coordination

    Authors: **wei Zeng, Chao Yu, Xinyi Yang, Wenxuan Ao, Jian Yuan, Yong Li, Yu Wang, Huazhong Yang

    Abstract: Traffic signal control (TSC) is a promising low-cost measure to enhance transportation efficiency without affecting existing road infrastructure. While various reinforcement learning-based TSC methods have been proposed and experimentally outperform conventional rule-based methods, none of them has been deployed in the real world. An essential gap lies in the oversimplification of the scenarios in… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  20. arXiv:2406.02037  [pdf

    cs.CV

    Multi-Scale Direction-Aware Network for Infrared Small Target Detection

    Authors: **miao Zhao, Zelin Shi, Chuang Yu, Yunpeng Liu

    Abstract: Infrared small target detection faces the problem that it is difficult to effectively separate the background and the target. Existing deep learning-based methods focus on appearance features and ignore high-frequency directional features. Therefore, we propose a multi-scale direction-aware network (MSDA-Net), which is the first attempt to integrate the high-frequency directional features of infra… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  21. arXiv:2406.01355  [pdf, other

    cs.CV cs.AI cs.CR

    Differentially Private Fine-Tuning of Diffusion Models

    Authors: Yu-Lin Tsai, Yizhe Li, Zekai Chen, Po-Yu Chen, Chia-Mu Yu, Xuebin Ren, Francois Buet-Golfouse

    Abstract: The integration of Differential Privacy (DP) with diffusion models (DMs) presents a promising yet challenging frontier, particularly due to the substantial memorization capabilities of DMs that pose significant privacy risks. Differential privacy offers a rigorous framework for safeguarding individual data points during model training, with Differential Privacy Stochastic Gradient Descent (DP-SGD)… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures, 11 tables

  22. arXiv:2406.01159  [pdf, other

    cs.CV

    Dimba: Transformer-Mamba Diffusion Models

    Authors: Zhengcong Fei, Mingyuan Fan, Changqian Yu, Debang Li, Youqiang Zhang, Junshi Huang

    Abstract: This paper unveils Dimba, a new text-to-image diffusion model that employs a distinctive hybrid architecture combining Transformer and Mamba elements. Specifically, Dimba sequentially stacked blocks alternate between Transformer and Mamba layers, and integrate conditional information through the cross-attention layer, thus capitalizing on the advantages of both architectural paradigms. We investig… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  23. arXiv:2406.00938  [pdf, other

    cs.CR cs.AI cs.SC

    A Synergistic Approach In Network Intrusion Detection By Neurosymbolic AI

    Authors: Alice Bizzarri, Chung-En Yu, Brian Jalaian, Fabrizio Riguzzi, Nathaniel D. Bastian

    Abstract: The prevailing approaches in Network Intrusion Detection Systems (NIDS) are often hampered by issues such as high resource consumption, significant computational demands, and poor interpretability. Furthermore, these systems generally struggle to identify novel, rapidly changing cyber threats. This paper delves into the potential of incorporating Neurosymbolic Artificial Intelligence (NSAI) into N… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  24. arXiv:2406.00721  [pdf, other

    cs.CV

    Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks

    Authors: Cong Wang, Wei Wang, Cheng** Yu, Jie Mu

    Abstract: Patch-level non-local self-similarity is an important property of natural images. However, most existing methods do not consider this property into neural networks for image deraining, thus affecting recovery performance. Motivated by this property, we find that there exists significant patch recurrence property of a rainy image, that is, similar patches tend to recur many times in one image and i… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: IJCAI-24; Project Page: https://github.com/supersupercong/MSGNN

  25. arXiv:2406.00415  [pdf, other

    cs.AI

    Neural Combinatorial Optimization Algorithms for Solving Vehicle Routing Problems: A Comprehensive Survey with Perspectives

    Authors: Xuan Wu, Di Wang, Lijie Wen, Yubin Xiao, Chunguo Wu, Yuesong Wu, Chaoyu Yu, Douglas L. Maskell, You Zhou

    Abstract: Although several surveys on Neural Combinatorial Optimization (NCO) solvers specifically designed to solve Vehicle Routing Problems (VRPs) have been conducted. These existing surveys did not cover the state-of-the-art (SOTA) NCO solvers emerged recently. More importantly, to provide a comprehensive taxonomy of NCO solvers with up-to-date coverage, based on our thorough review of relevant publicati… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  26. arXiv:2405.19327  [pdf, other

    cs.CL cs.AI cs.LG

    MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

    Authors: Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kai**g Ma, Minghao Liu, Morry Niu , et al. (20 additional authors not shown)

    Abstract: Large Language Models (LLMs) have made great strides in recent years to achieve unprecedented performance across different tasks. However, due to commercial interest, the most competitive models like GPT, Gemini, and Claude have been gated behind proprietary interfaces without disclosing the training details. Recently, many institutions have open-sourced several strong LLMs like LLaMA-3, comparabl… ▽ More

    Submitted 2 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: https://map-neo.github.io/

  27. arXiv:2405.19326  [pdf, other

    cs.CV cs.GR cs.HC

    Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models

    Authors: Tianrun Chen, Chunan Yu, **g Li, Jianqi Zhang, Lanyun Zhu, Deyi Ji, Yong Zhang, Ying Zang, Zejian Li, Lingyun Sun

    Abstract: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that transcends limitations for previous category-specific 3D semantic segmentation, 3D instance segmentation, and open-vocabulary 3D segmentation. We design a simple baseline method, Reasoning3D, with the capability to understand… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  28. arXiv:2405.19133  [pdf, other

    cs.NI

    Preamble Design and Burst-Mode DSP for Upstream Reception of 200G Coherent TDM-PON

    Authors: Haide Wang, Ji Zhou, **yang Yang, Zhiyang Liu, Cheng Li, Wei** Liu, Changyuan Yu

    Abstract: Burst-mode DSP based on 10ns preamble is proposed for upstream reception of 200G coherent TDM-PON. The 128-symbol tone preamble is used for SOP, frequency offset, and sampling phase estimation, while the 192-symbol CAZAC preamble is used for frame synchronization and channel estimation.

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: This papaer has been submitted to the ECOC 2024

  29. arXiv:2405.18255  [pdf, other

    cs.CR cs.SI eess.SP

    Channel Reciprocity Based Attack Detection for Securing UWB Ranging by Autoencoder

    Authors: Wenlong Gou, Chuanhang Yu, Juntao Ma, Gang Wu, Vladimir Mordachev

    Abstract: A variety of ranging threats represented by Ghost Peak attack have raised concerns regarding the security performance of Ultra-Wide Band (UWB) systems with the finalization of the IEEE 802.15.4z standard. Based on channel reciprocity, this paper proposes a low complexity attack detection scheme that compares Channel Impulse Response (CIR) features of both ranging sides utilizing an autoencoder wit… ▽ More

    Submitted 10 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    ACM Class: H.1.1

  30. arXiv:2405.16833  [pdf, other

    cs.LG

    Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models

    Authors: Chia-Yi Hsu, Yu-Lin Tsai, Chih-Hsun Lin, Pin-Yu Chen, Chia-Mu Yu, Chun-Ying Huang

    Abstract: While large language models (LLMs) such as Llama-2 or GPT-4 have shown impressive zero-shot performance, fine-tuning is still necessary to enhance their performance for customized datasets, domain-specific tasks, or other private needs. However, fine-tuning all parameters of LLMs requires significant hardware resources, which can be impractical for typical users. Therefore, parameter-efficient fin… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  31. arXiv:2405.16450  [pdf, other

    cs.LG cs.AI cs.PL

    Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

    Authors: Max Liu, Chan-Hung Yu, Wei-Hsu Lee, Cheng-Wei Hung, Yen-Chun Chen, Shao-Hua Sun

    Abstract: Programmatic reinforcement learning (PRL) has been explored for representing policies through programs as a means to achieve interpretability and generalization. Despite promising outcomes, current state-of-the-art PRL methods are hindered by sample inefficiency, necessitating tens of millions of program-environment interactions. To tackle this challenge, we introduce a novel LLM-guided search fra… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  32. arXiv:2405.16262  [pdf, other

    cs.LG

    Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency

    Authors: Runqi Lin, Chaojian Yu, Bo Han, Hang Su, Tongliang Liu

    Abstract: Catastrophic overfitting (CO) presents a significant challenge in single-step adversarial training (AT), manifesting as highly distorted deep neural networks (DNNs) that are vulnerable to multi-step adversarial attacks. However, the underlying factors that lead to the distortion of decision boundaries remain unclear. In this work, we delve into the specific changes within different DNN layers and… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  33. arXiv:2405.16141  [pdf, other

    cs.LG cs.AI cs.CE

    AIGB: Generative Auto-bidding via Diffusion Modeling

    Authors: Jiayan Guo, Yusen Huo, Zhilin Zhang, Tianyu Wang, Chuan Yu, Jian Xu, Yan Zhang, Bo Zheng

    Abstract: Auto-bidding plays a crucial role in facilitating online advertising by automatically providing bids for advertisers. Reinforcement learning (RL) has gained popularity for auto-bidding. However, most current RL auto-bidding methods are modeled through the Markovian Decision Process (MDP), which assumes the Markovian state transition. This assumption restricts the ability to perform in long horizon… ▽ More

    Submitted 27 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024

  34. arXiv:2405.15984  [pdf, other

    cs.CL cs.AI

    Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models

    Authors: Simon Chi Lok Yu, Jie He, Pasquale Minervini, Jeff Z. Pan

    Abstract: With the emergence of large language models, such as LLaMA and OpenAI GPT-3, In-Context Learning (ICL) gained significant attention due to its effectiveness and efficiency. However, ICL is very sensitive to the choice, order, and verbaliser used to encode the demonstrations in the prompt. Retrieval-Augmented ICL methods try to address this problem by leveraging retrievers to extract semantically r… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 29 pages, 6 figures

  35. arXiv:2405.13659  [pdf, other

    cs.CV

    EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views

    Authors: Yuhang Yang, Wei Zhai, Chengfeng Wang, Chengjun Yu, Yang Cao, Zheng-Jun Zha

    Abstract: Understanding egocentric human-object interaction (HOI) is a fundamental aspect of human-centric perception, facilitating applications like AR/VR and embodied AI. For the egocentric HOI, in addition to perceiving semantics e.g., ''what'' interaction is occurring, capturing ''where'' the interaction specifically manifests in 3D space is also crucial, which links the perception and operation. Existi… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 23 pages,10 figures

  36. arXiv:2405.11694  [pdf, other

    cs.GR

    PBI: Position-Based Dynamics Handles Updated Lagrangian Inelasticity

    Authors: Chang Yu, Xuan Li, Lei Lan, Yin Yang, Chenfanfu Jiang

    Abstract: Position-based Dynamics (PBD) and its extension, eXtended Position-based Dynamics (XPBD), have been predominantly applied to compliant constrained dynamics, with their potential in finite strain inelasticity remaining underexplored. XPBD stands in contrast to other meshless methods, such as the Material Point Method (MPM). MPM is based on discretizing the weak form of governing partial differentia… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  37. arXiv:2405.11333  [pdf, other

    cs.LG cs.AI

    GinAR: An End-To-End Multivariate Time Series Forecasting Model Suitable for Variable Missing

    Authors: Chengqing Yu, Fei Wang, Zezhi Shao, Tangwen Qian, Zhao Zhang, Wei Wei, Yongjun Xu

    Abstract: Multivariate time series forecasting (MTSF) is crucial for decision-making to precisely forecast the future values/trends, based on the complex relationships identified from historical observations of multiple sequences. Recently, Spatial-Temporal Graph Neural Networks (STGNNs) have gradually become the theme of MTSF model as their powerful capability in mining spatial-temporal dependencies, but a… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024 (Research track)

  38. arXiv:2405.08013  [pdf, other

    cs.LG cs.AI cs.SI

    CTRL: Continuous-Time Representation Learning on Temporal Heterogeneous Information Network

    Authors: Chenglin Li, Yuanzhen Xie, Chenyun Yu, Lei Cheng, Bo Hu, Zang Li, Di Niu

    Abstract: Inductive representation learning on temporal heterogeneous graphs is crucial for scalable deep learning on heterogeneous information networks (HINs) which are time-varying, such as citation networks. However, most existing approaches are not inductive and thus cannot handle new nodes or edges. Moreover, previous temporal graph embedding methods are often trained with the temporal link prediction… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  39. arXiv:2405.07652  [pdf, other

    cs.HC cs.AI

    G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

    Authors: Zeyu Wang, Yuanchun Shi, Yuntao Wang, Yuchen Yao, Kun Yan, Yuhan Wang, Lei Ji, Xuhai Xu, Chun Yu

    Abstract: Modern information querying systems are progressively incorporating multimodal inputs like vision and audio. However, the integration of gaze -- a modality deeply linked to user intent and increasingly accessible via gaze-tracking wearables -- remains underexplored. This paper introduces a novel gaze-facilitated information querying paradigm, named G-VOILA, which synergizes users' gaze, visual fie… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 25 pages, 12 figures

  40. arXiv:2405.06804  [pdf, other

    cs.SD eess.AS eess.SP

    Time-of-arrival Estimation and Phase Unwrap** of Head-related Transfer Functions With Integer Linear Programming

    Authors: Chin-Yun Yu, Johan Pauwels, György Fazekas

    Abstract: In binaural audio synthesis, aligning head-related impulse responses (HRIRs) in time has been an important pre-processing step, enabling accurate spatial interpolation and efficient data compression. The maximum correlation time delay between spatially nearby HRIRs has previously been used to get accurate and smooth alignment by solving a matrix equation in which the solution has the minimum Eucli… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted to be presented at Audio Engineering Society 156th Convention, 2024 June, Madrid, Spain

  41. arXiv:2405.06653  [pdf, other

    q-bio.BM cs.LG

    A unified cross-attention model for predicting antigen binding specificity to both HLA and TCR molecules

    Authors: Chenpeng Yu, Xing Fang, Hui Liu

    Abstract: The immune checkpoint inhibitors have demonstrated promising clinical efficacy across various tumor types, yet the percentage of patients who benefit from them remains low. The binding affinity between antigens and HLA-I/TCR molecules plays a critical role in antigen presentation and T-cell activation. Some computational methods have been developed to predict antigen-HLA or antigen-TCR binding spe… ▽ More

    Submitted 8 April, 2024; originally announced May 2024.

  42. arXiv:2405.06373  [pdf, other

    cs.CL cs.AI

    LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play

    Authors: Li-Chun Lu, Shou-Jen Chen, Tsung-Min Pai, Chan-Hung Yu, Hung-yi Lee, Shao-Hua Sun

    Abstract: Large language models (LLMs) have shown exceptional proficiency in natural language processing but often fall short of generating creative and original responses to open-ended questions. To enhance LLM creativity, our key insight is to emulate the human process of inducing collective creativity through engaging discussions with participants from diverse backgrounds and perspectives. To this end, w… ▽ More

    Submitted 18 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures, Under review as a conference paper at COLM 2024

  43. arXiv:2405.05119  [pdf, other

    stat.ME cs.SI

    Combining Rollout Designs and Clustering for Causal Inference under Low-order Interference

    Authors: Mayleen Cortez-Rodriguez, Matthew Eichhorn, Christina Lee Yu

    Abstract: Estimating causal effects under interference is pertinent to many real-world settings. However, the true interference network may be unknown to the practitioner, precluding many existing techniques that leverage this information. A recent line of work with low-order potential outcomes models uses staggered rollout designs to obtain unbiased estimators that require no network information. However,… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 30 pages, 13 figures

    MSC Class: 62K99 (Primary); 62P30 (Secondary)

  44. arXiv:2405.03067  [pdf, other

    cs.SE

    Automated Deep Learning Optimization via DSL-Based Source Code Transformation

    Authors: Ruixin Wang, Minghai Lu, Cody Hao Yu, Yi-Hsiang Lai, Tianyi Zhang

    Abstract: As deep learning models become increasingly bigger and more complex, it is critical to improve model training and inference efficiency. Though a variety of highly optimized libraries and packages (known as DL kernels) have been developed, it is tedious and time-consuming to figure out which kernel to use, where to use, and how to use them correctly. To address this challenge, we propose an Automat… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 12 pages, 6 figures

    ACM Class: D.2.11; I.2.0

    Journal ref: In Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2024)

  45. arXiv:2405.03026  [pdf, other

    cs.RO

    Enhanced Detection Classification via Clustering SVM for Various Robot Collaboration Task

    Authors: Rui Liu, Xuanzhen Xu, Yuwei Shen, Armando Zhu, Chang Yu, Tianjian Chen, Ye Zhang

    Abstract: We introduce an advanced, swift pattern recognition strategy for various multiple robotics during curve negotiation. This method, leveraging a sophisticated k-means clustering-enhanced Support Vector Machine algorithm, distinctly categorizes robotics into flying or mobile robots. Initially, the paradigm considers robot locations and features as quintessential parameters indicative of divergent rob… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: This paper has been received by CISCE 2024 Conference

  46. arXiv:2405.01044  [pdf, other

    cs.RO

    Differentiable Particles for General-Purpose Deformable Object Manipulation

    Authors: Siwei Chen, Yiqing Xu, Cunjun Yu, Linfeng Li, David Hsu

    Abstract: Deformable object manipulation is a long-standing challenge in robotics. While existing approaches often focus narrowly on a specific type of object, we seek a general-purpose algorithm, capable of manipulating many different types of objects: beans, rope, cloth, liquid, . . . . One key difficulty is a suitable representation, rich enough to capture object shape, dynamics for manipulation and yet… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  47. arXiv:2404.19171  [pdf, other

    cs.CV cs.AI

    Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection

    Authors: Cai Yu, Shan Jia, Xiaomeng Fu, ** Liu, Jiahe Tian, Jiao Dai, Xi Wang, Siwei Lyu, Jizhong Han

    Abstract: With the rising prevalence of deepfakes, there is a growing interest in develo** generalizable detection methods for various types of deepfakes. While effective in their specific modalities, traditional detection methods fall short in addressing the generalizability of detection across diverse cross-modal deepfakes. This paper aims to explicitly learn potential cross-modal correlation to enhance… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: accepted by ICME 2024

  48. arXiv:2404.15974  [pdf, other

    cs.HC

    A Human-Computer Collaborative Tool for Training a Single Large Language Model Agent into a Network through Few Examples

    Authors: Lihang Pan, Yuxuan Li, Chun Yu, Yuanchun Shi

    Abstract: The capabilities of a single large language model (LLM) agent for solving a complex task are limited. Connecting multiple LLM agents to a network can effectively improve overall performance. However, building an LLM agent network (LAN) requires a substantial amount of time and effort. In this paper, we introduce EasyLAN, a human-computer collaborative tool that helps developers construct LANs. Eas… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  49. arXiv:2404.15238  [pdf, other

    cs.CL cs.AI

    CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies

    Authors: Weiyan Shi, Ryan Li, Yutong Zhang, Caleb Ziems, Chunhua yu, Raya Horesh, Rogério Abreu de Paula, Diyi Yang

    Abstract: To enhance language models' cultural awareness, we design a generalizable pipeline to construct cultural knowledge bases from different online communities on a massive scale. With the pipeline, we construct CultureBank, a knowledge base built upon users' self-narratives with 12K cultural descriptors sourced from TikTok and 11K from Reddit. Unlike previous cultural knowledge resources, CultureBank… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 32 pages, 7 figures, preprint

  50. arXiv:2404.12980  [pdf, other

    cs.HC

    Ring-a-Pose: A Ring for Continuous Hand Pose Tracking

    Authors: Tianhong Catherine Yu, Guilin Hu, Ruidong Zhang, Hyunchul Lim, Saif Mahmud, Chi-Jung Lee, Ke Li, Devansh Agarwal, Shuyang Nie, **seok Oh, François Guimbretière, Cheng Zhang

    Abstract: We present Ring-a-Pose, a single untethered ring that tracks continuous 3D hand poses. Located in the center of the hand, the ring emits an inaudible acoustic signal that each hand pose reflects differently. Ring-a-Pose imposes minimal obtrusions on the hand, unlike multi-ring or glove systems. It is not affected by the choice of clothing that may cover wrist-worn systems. In a series of three use… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.