Skip to main content

Showing 1–50 of 883 results for author: Xie, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18966  [pdf, other

    cs.MS

    svds-C: A Multi-Thread C Code for Computing Truncated Singular Value Decomposition

    Authors: Xu Feng, Wenjian Yu, Yuyang Xie

    Abstract: This article presents svds-C, an open-source and high-performance C program for accurately and robustly computing truncated SVD, e.g. computing several largest singular values and corresponding singular vectors. We have re-implemented the algorithm of svds in Matlab in C based on MKL or OpenBLAS and multi-thread computing to obtain the parallel program named svds-C. svds-C running on shared-memory… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 20 pages, accepted by SoftwareX

  2. arXiv:2405.18653  [pdf, other

    cs.CL

    Recent Advances of Foundation Language Models-based Continual Learning: A Survey

    Authors: Yutao Yang, Jie Zhou, Xuanwen Ding, Tianyu Huai, Shunyu Liu, Qin Chen, Liang He, Yuan Xie

    Abstract: Recently, foundation language models (LMs) have marked significant achievements in the domains of natural language processing (NLP) and computer vision (CV). Unlike traditional neural network models, foundation LMs obtain a great ability for transfer learning by acquiring rich commonsense knowledge through pre-training on extensive unsupervised datasets with a vast number of parameters. However, t… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.16828  [pdf, other

    cs.LG math.ST stat.ML

    Kernel-based optimally weighted conformal prediction intervals

    Authors: Jonghyeok Lee, Chen Xu, Yao Xie

    Abstract: Conformal prediction has been a popular distribution-free framework for uncertainty quantification. In this paper, we present a novel conformal prediction method for time-series, which we call Kernel-based Optimally Weighted Conformal Prediction Intervals (KOWCPI). Specifically, KOWCPI adapts the classic Reweighted Nadaraya-Watson (RNW) estimator for quantile regression on dependent data and learn… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.16599  [pdf, other

    cs.RO

    MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups

    Authors: Yusen Xie, Zhenmin Huang, Kai Chen, Lei Zhu, Jun Ma

    Abstract: Structure from Motion (SfM) and visual localization in indoor texture-less scenes and industrial scenarios present prevalent yet challenging research topics. Existing SfM methods designed for natural scenes typically yield low accuracy or map-building failures due to insufficient robust feature extraction in such settings. Visual markers, with their artificially designed features, can effectively… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 8 pages,8 figures

  5. arXiv:2405.16328  [pdf, other

    cs.CV

    A Classifier-Free Incremental Learning Framework for Scalable Medical Image Segmentation

    Authors: Xiaoyang Chen, Hao Zheng, Yifang Xie, Yuncong Ma, Tengfei Li

    Abstract: Current methods for develo** foundation models in medical image segmentation rely on two primary assumptions: a fixed set of classes and the immediate availability of a substantial and diverse training dataset. However, this can be impractical due to the evolving nature of imaging technology and patient demographics, as well as labor-intensive data curation, limiting their practical applicabilit… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  6. arXiv:2405.15441  [pdf, other

    stat.ML cs.CC cs.LG

    Statistical and Computational Guarantees of Kernel Max-Sliced Wasserstein Distances

    Authors: Jie Wang, March Boedihardjo, Yao Xie

    Abstract: Optimal transport has been very successful for various machine learning tasks; however, it is known to suffer from the curse of dimensionality. Hence, dimensionality reduction is desirable when applied to high-dimensional data with low-dimensional structures. The kernel max-sliced (KMS) Wasserstein distance is developed for this purpose by finding an optimal nonlinear map** that reduces data int… ▽ More

    Submitted 29 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 34 pages, 7 figures, 4 tables

  7. arXiv:2405.15165  [pdf, other

    cs.CL cs.AI cs.SE

    A Solution-based LLM API-using Methodology for Academic Information Seeking

    Authors: Yuanchun Wang, Jifan Yu, Zijun Yao, **g Zhang, Yuyang Xie, Shangqing Tu, Yiyang Fu, Youhe Feng, **kai Zhang, **gyao Zhang, Bowen Huang, Yuanyao Li, Huihui Yuan, Lei Hou, Juanzi Li, Jie Tang

    Abstract: Applying large language models (LLMs) for academic API usage shows promise in reducing researchers' academic information seeking efforts. However, current LLM API-using methods struggle with complex API coupling commonly encountered in academic queries. To address this, we introduce SoAy, a solution-based LLM API-using methodology for academic information seeking. It uses code with a solution as t… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 22 pages, 13 figures

  8. arXiv:2405.14597  [pdf, other

    cs.LG cs.AI

    Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs

    Authors: Qingyuan Li, Ran Meng, Yiduo Li, Bo Zhang, Yifan Lu, Yerui Sun, Lin Ma, Yuchen Xie

    Abstract: We introduce Integer Scale, a novel post-training quantization scheme for large language models that effectively resolves the inference bottleneck in current fine-grained quantization approaches while maintaining similar accuracies. Integer Scale is a free lunch as it requires no extra calibration or fine-tuning which will otherwise incur additional costs. It can be used plug-and-play for most fin… ▽ More

    Submitted 28 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  9. arXiv:2405.13383  [pdf, other

    cs.LG

    Gradient Projection For Parameter-Efficient Continual Learning

    Authors: **gyang Qiao, Zhizhong Zhang, Xin Tan, Yanyun Qu, Wensheng Zhang, Yuan Xie

    Abstract: Catastrophic forgetting poses the primary challenge in the continual learning. Nowadays, methods based on parameter-efficient tuning (PET) have demonstrated impressive performance in continual learning. However, these methods are still confronted with a common problem: fine-tuning on consecutive distinct tasks can disrupt the existing parameter distribution and lead to forgetting. Recent progress… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  10. arXiv:2405.11730  [pdf

    cs.LG q-fin.GN

    Degree of Irrationality: Sentiment and Implied Volatility Surface

    Authors: Jiahao Weng, Yan Xie

    Abstract: In this study, we constructed daily high-frequency sentiment data and used the VAR method to attempt to predict the next day's implied volatility surface. We utilized 630,000 text data entries from the East Money Stock Forum from 2014 to 2023 and employed deep learning methods such as BERT and LSTM to build daily market sentiment indicators. By applying FFT and EMD methods for sentiment decomposit… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 21 pages, 8 figures

  11. arXiv:2405.10591  [pdf, other

    cs.CV

    GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision

    Authors: Xin Tan, Wenbin Wu, Zhiwei Zhang, Chaojie Fan, Yong Peng, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

    Abstract: 3D occupancy perception holds a pivotal role in recent vision-centric autonomous driving systems by converting surround-view images into integrated geometric and semantic representations within dense 3D grids. Nevertheless, current models still encounter two main challenges: modeling depth accurately in the 2D-3D view transformation stage, and overcoming the lack of generalizability issues due to… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  12. arXiv:2405.09965  [pdf, other

    cs.SE

    Leveraging Large Language Models for Automated Web-Form-Test Generation: An Empirical Study

    Authors: Tao Li, Chenhui Cui, Lei Ma, Dave Towey, Yujie Xie, Rubing Huang

    Abstract: The testing of web forms is an essential activity for ensuring the quality of web applications, which mainly involves evaluating the interactions between users and forms. Automated test-case generation remains a challenge for web-form testing: Due to the complex, multi-level structure of web pages, it can be difficult to automatically capture their inherent contextual information for inclusion in… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  13. arXiv:2405.09021  [pdf

    cs.LG

    Deep Learning in Earthquake Engineering: A Comprehensive Review

    Authors: Yazhou Xie

    Abstract: This article surveys the growing interest in utilizing Deep Learning (DL) as a powerful tool to address challenging problems in earthquake engineering. Despite decades of advancement in domain knowledge, issues such as uncertainty in earthquake occurrence, unpredictable seismic loads, nonlinear structural responses, and community engagement remain difficult to tackle using domain-specific methods.… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  14. arXiv:2405.08013  [pdf, other

    cs.LG cs.AI cs.SI

    CTRL: Continuous-Time Representation Learning on Temporal Heterogeneous Information Network

    Authors: Chenglin Li, Yuanzhen Xie, Chenyun Yu, Lei Cheng, Bo Hu, Zang Li, Di Niu

    Abstract: Inductive representation learning on temporal heterogeneous graphs is crucial for scalable deep learning on heterogeneous information networks (HINs) which are time-varying, such as citation networks. However, most existing approaches are not inductive and thus cannot handle new nodes or edges. Moreover, previous temporal graph embedding methods are often trained with the temporal link prediction… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  15. arXiv:2405.07547  [pdf, other

    cs.IT eess.SP

    Channel Coding Toward 6G: Technical Overview and Outlook

    Authors: Mohammad Rowshan, Min Qiu, Yixuan Xie, Xinyi Gu, **hong Yuan

    Abstract: Channel coding plays a pivotal role in ensuring reliable communication over wireless channels. With the growing need for ultra-reliable communication in emerging wireless use cases, the significance of channel coding has amplified. Furthermore, minimizing decoding latency is crucial for critical-mission applications, while optimizing energy efficiency is paramount for mobile and the Internet of Th… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 102 pages, 87 figures, IEEE Open Journal of the Communications Society (invited paper)

  16. arXiv:2405.07500  [pdf, other

    cs.IR cs.AI cs.CL

    PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking

    Authors: Yuzhang Xie, Jiaying Lu, Joyce Ho, Fadi Nahab, Xiao Hu, Carl Yang

    Abstract: Linking (aligning) biomedical concepts across diverse data sources enables various integrative analyses, but it is challenging due to the discrepancies in concept naming conventions. Various strategies have been developed to overcome this challenge, such as those based on string-matching rules, manually crafted thesauri, and machine learning models. However, these methods are constrained by limite… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (Short-Paper Track), 2024

  17. arXiv:2405.07409  [pdf, other

    cs.NI

    ZBanner: Fast Stateless Scanning Capable of Obtaining Responses over TCP

    Authors: Chiyu Chen, Yuliang Lu, Guozheng Yang, Yi Xie, Shasha Guo

    Abstract: Fast large-scale network scanning is an important way to understand internet service configurations and security in real time, among which stateless scan is representative. Existing stateless scanners can perform single-packet scans for internet-wide network measurements but are limited to host discovery or port scanning. To obtain further information over TCP, slower stateful scanners must be use… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: The paper has been submitted and the code will be published later

  18. arXiv:2405.07201  [pdf, other

    cs.CV

    Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception

    Authors: Haoming Chen, Zhizhong Zhang, Yanyun Qu, Ruixin Zhang, Xin Tan, Yuan Xie

    Abstract: An effective pre-training framework with universal 3D representations is extremely desired in perceiving large-scale dynamic scenes. However, establishing such an ideal framework that is both task-generic and label-efficient poses a challenge in unifying the representation of the same primitive across diverse scenes. The current contrastive 3D pre-training methods typically follow a frame-level co… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024

  19. arXiv:2405.06246  [pdf

    cs.CV

    Comparative Analysis of Advanced Feature Matching Algorithms in Challenging High Spatial Resolution Optical Satellite Stereo Scenarios

    Authors: Qiyan Luo, Jidan Zhang, Yuzhen Xie, Xu Huang, Ting Han

    Abstract: Feature matching determines the orientation accuracy for the High Spatial Resolution (HSR) optical satellite stereos, subsequently impacting several significant applications such as 3D reconstruction and change detection. However, the matching of off-track HSR optical satellite stereos often encounters challenging conditions including wide-baseline observation, significant radiometric differences,… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: The manuscript is accepted as Oral Presentation in IEEE International Geoscience and Remote Sensing Symposium(IGARSS 2024)

  20. arXiv:2405.05807  [pdf, other

    cs.RO

    NeuRSS: Enhancing AUV Localization and Bathymetric Map** with Neural Rendering for Sidescan SLAM

    Authors: Jun Zhang, Nils Bore, John Folkesson

    Abstract: Implicit neural representations and neural rendering have gained increasing attention for bathymetry estimation from sidescan sonar (SSS). These methods incorporate multiple observations of the same place from SSS data to constrain the elevation estimate, converging to a globally-consistent bathymetric model. However, the quality and precision of the bathymetric estimate are limited by the positio… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  21. arXiv:2405.05613  [pdf, other

    cs.CV

    Robust Pseudo-label Learning with Neighbor Relation for Unsupervised Visible-Infrared Person Re-Identification

    Authors: Xiangbo Yin, Jiangming Shi, Yachao Zhang, Yang Lu, Zhizhong Zhang, Yuan Xie, Yanyun Qu

    Abstract: Unsupervised Visible-Infrared Person Re-identification (USVI-ReID) presents a formidable challenge, which aims to match pedestrian images across visible and infrared modalities without any annotations. Recently, clustered pseudo-label methods have become predominant in USVI-ReID, although the inherent noise in pseudo-labels presents a significant obstacle. Most existing works primarily focus on sh… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  22. arXiv:2405.04880  [pdf, other

    cs.SD cs.AI eess.AS

    The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio

    Authors: Yuankun Xie, Yi Lu, Ruibo Fu, Zhengqi Wen, Zhiyong Wang, Jianhua Tao, Xin Qi, Xiaopeng Wang, Yukun Liu, Haonan Cheng, Long Ye, Yi Sun

    Abstract: With the proliferation of Audio Language Model (ALM) based deepfake audio, there is an urgent need for generalized detection methods. ALM-based deepfake audio currently exhibits widespread, high deception, and type versatility, posing a significant challenge to current audio deepfake detection (ADD) models trained solely on vocoded data. To effectively detect ALM-based deepfake audio, we focus on… ▽ More

    Submitted 15 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  23. arXiv:2405.04753  [pdf, other

    cs.CR cs.AI

    AttacKG+:Boosting Attack Knowledge Graph Construction with Large Language Models

    Authors: Yongheng Zhang, Tingwen Du, Yunshan Ma, Xiang Wang, Yi Xie, Guozheng Yang, Yuliang Lu, Ee-Chien Chang

    Abstract: Attack knowledge graph construction seeks to convert textual cyber threat intelligence (CTI) reports into structured representations, portraying the evolutionary traces of cyber attacks. Even though previous research has proposed various methods to construct attack knowledge graphs, they generally suffer from limited generalization capability to diverse knowledge types as well as requirement of ex… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 20 pages, 5 figures

  24. arXiv:2405.01466  [pdf, other

    cs.SE

    A Systematic Literature Review on Large Language Models for Automated Program Repair

    Authors: Quanjun Zhang, Chunrong Fang, Yang Xie, YuXiang Ma, Weisong Sun, Yun Yang, Zhenyu Chen

    Abstract: Automated Program Repair (APR) attempts to patch software bugs and reduce manual debugging efforts. Very recently, with the advances in Large Language Models (LLMs), an increasing number of APR techniques have been proposed, facilitating software development and maintenance and demonstrating remarkable performance. However, due to ongoing explorations in the LLM-based APR field, it is challenging… ▽ More

    Submitted 12 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: update new papers

  25. arXiv:2405.00451  [pdf, other

    cs.AI cs.LG

    Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

    Authors: Yuxi Xie, Anirudh Goyal, Wenyue Zheng, Min-Yen Kan, Timothy P. Lillicrap, Kenji Kawaguchi, Michael Shieh

    Abstract: We introduce an approach aimed at enhancing the reasoning capabilities of Large Language Models (LLMs) through an iterative preference learning process inspired by the successful strategy employed by AlphaZero. Our work leverages Monte Carlo Tree Search (MCTS) to iteratively collect preference data, utilizing its look-ahead ability to break down instance-level rewards into more granular step-level… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  26. arXiv:2404.19417  [pdf, other

    cs.CV

    Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World

    Authors: Wen Yin, Jian Lou, Pan Zhou, Yulai Xie, Dan Feng, Yuhua Sun, Tailai Zhang, Lichao Sun

    Abstract: Backdoor attacks have been well-studied in visible light object detection (VLOD) in recent years. However, VLOD can not effectively work in dark and temperature-sensitive scenarios. Instead, thermal infrared object detection (TIOD) is the most accessible and practical in such environments. In this paper, our team is the first to investigate the security vulnerabilities associated with TIOD in the… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: To appear in CVPR 2024.11pages, 8 figures and 4 tables

  27. arXiv:2404.18539  [pdf, other

    cs.CV cs.AI

    Enhancing Boundary Segmentation for Topological Accuracy with Skeleton-based Methods

    Authors: Chuni Liu, Boyuan Ma, Xiaojuan Ban, Yujie Xie, Hao Wang, Weihua Xue, **gchao Ma, Ke Xu

    Abstract: Topological consistency plays a crucial role in the task of boundary segmentation for reticular images, such as cell membrane segmentation in neuron electron microscopic images, grain boundary segmentation in material microscopic images and road segmentation in aerial images. In these fields, topological changes in segmentation results have a serious impact on the downstream tasks, which can even… ▽ More

    Submitted 7 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  28. arXiv:2404.17287  [pdf, other

    cs.CL

    When to Trust LLMs: Aligning Confidence with Response Quality

    Authors: Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, **yang Gao, Huawei Shen, Bolin Ding

    Abstract: Despite the success of large language models (LLMs) in natural language generation, much evidence shows that LLMs may produce incorrect or nonsensical text. This limitation highlights the importance of discerning when to trust LLMs, especially in safety-critical domains. Existing methods, which rely on verbalizing confidence to tell the reliability by inducing top-k responses and sampling-aggregat… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  29. arXiv:2404.15611  [pdf, other

    cs.CR

    PoisonedFL: Model Poisoning Attacks to Federated Learning via Multi-Round Consistency

    Authors: Yueqi Xie, Minghong Fang, Neil Zhenqiang Gong

    Abstract: Model poisoning attacks are critical security threats to Federated Learning (FL). Existing model poisoning attacks suffer from two key limitations: 1) they achieve suboptimal effectiveness when defenses are deployed, and/or 2) they require knowledge of the model updates or local training data on genuine clients. In this work, we make a key observation that their suboptimal effectiveness arises fro… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  30. arXiv:2404.14819  [pdf, other

    cs.RO

    Bathymetric Surveying with Imaging Sonar Using Neural Volume Rendering

    Authors: Giancarlo Troni, Nils Bore, John Folkesson

    Abstract: This research addresses the challenge of estimating bathymetry from imaging sonars where the state-of-the-art works have primarily relied on either supervised learning with ground-truth labels or surface rendering based on the Lambertian assumption. In this letter, we propose a novel, self-supervised framework based on volume rendering for reconstructing bathymetry using forward-looking sonar (FLS… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  31. arXiv:2404.14542  [pdf, other

    cs.CV

    UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement

    Authors: Yaofeng Xie, Lingwei Kong, Kai Chen, Ziqiang Zheng, Xiao Yu, Zhibin Yu, Bing Zheng

    Abstract: Learning-based underwater image enhancement (UIE) methods have made great progress. However, the lack of large-scale and high-quality paired training samples has become the main bottleneck hindering the development of UIE. The inter-frame information in underwater videos can accelerate or optimize the UIE process. Thus, we constructed the first large-scale high-resolution underwater video enhancem… ▽ More

    Submitted 27 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages,CVPR2024 accept

    ACM Class: I.4

  32. arXiv:2404.13372  [pdf, other

    eess.IV cs.CV

    HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression

    Authors: Lei Lu, Yanyue Xie, Wei Jiang, Wei Wang, Xue Lin, Yanzhi Wang

    Abstract: This paper investigates the challenging problem of learned image compression (LIC) with extreme low bitrates. Previous LIC methods based on transmitting quantized continuous features often yield blurry and noisy reconstruction due to the severe quantization loss. While previous LIC methods based on learned codebooks that discretize visual space usually give poor-fidelity reconstruction due to the… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  33. arXiv:2404.12803  [pdf, other

    cs.CV cs.LG

    TextSquare: Scaling up Text-Centric Visual Instruction Tuning

    Authors: **gqun Tang, Chunhui Lin, Zhen Zhao, Shu Wei, Binghong Wu, Qi Liu, Hao Feng, Yang Li, Siqi Wang, Lei Liao, Wei Shi, Yuliang Liu, Hao Liu, Yuan Xie, Xiang Bai, Can Huang

    Abstract: Text-centric visual question answering (VQA) has made great strides with the development of Multimodal Large Language Models (MLLMs), yet open-source models still fall short of leading models like GPT4V and Gemini, partly due to a lack of extensive, high-quality instruction tuning data. To this end, we introduce a new approach for creating a massive, high-quality instruction-tuning dataset, Square… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  34. arXiv:2404.11797  [pdf, other

    cs.CV cs.AI cs.LG

    When are Foundation Models Effective? Understanding the Suitability for Pixel-Level Classification Using Multispectral Imagery

    Authors: Yiqun Xie, Zhihao Wang, Weiye Chen, Zhili Li, Xiaowei Jia, Yanhua Li, Ruichen Wang, Kangyang Chai, Ruohan Li, Sergii Skakun

    Abstract: Foundation models, i.e., very large deep learning models, have demonstrated impressive performances in various language and vision tasks that are otherwise difficult to reach using smaller-size models. The major success of GPT-type of language models is particularly exciting and raises expectations on the potential of foundation models in other domains including satellite remote sensing. In this c… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  35. arXiv:2404.11509  [pdf, other

    stat.ML cs.LG

    VC Theory for Inventory Policies

    Authors: Yaqi Xie, Will Ma, Linwei Xin

    Abstract: Advances in computational power and AI have increased interest in reinforcement learning approaches to inventory management. This paper provides a theoretical foundation for these approaches and investigates the benefits of restricting to policy structures that are well-established by decades of inventory theory. In particular, we prove generalization guarantees for learning several well-known cla… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  36. arXiv:2404.11294  [pdf, other

    cs.SE

    LogSD: Detecting Anomalies from System Logs through Self-supervised Learning and Frequency-based Masking

    Authors: Yongzheng Xie, Hongyu Zhang, Muhammad Ali Babar

    Abstract: Log analysis is one of the main techniques that engineers use for troubleshooting large-scale software systems. Over the years, many supervised, semi-supervised, and unsupervised log analysis methods have been proposed to detect system anomalies by analyzing system logs. Among these, semi-supervised methods have garnered increasing attention as they strike a balance between relaxed labeled data re… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 23 pages with 11 figures

  37. arXiv:2404.11121  [pdf, other

    cs.CR cs.AI

    TransLinkGuard: Safeguarding Transformer Models Against Model Stealing in Edge Deployment

    Authors: Qinfeng Li, Zhiqiang Shen, Zhenghan Qin, Yangfan Xie, Xuhong Zhang, Tianyu Du, Jianwei Yin

    Abstract: Proprietary large language models (LLMs) have been widely applied in various scenarios. Additionally, deploying LLMs on edge devices is trending for efficiency and privacy reasons. However, edge deployment of proprietary LLMs introduces new security challenges: edge-deployed models are exposed as white-box accessible to users, enabling adversaries to conduct effective model stealing (MS) attacks.… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.07152 by other authors

  38. arXiv:2404.09276  [pdf, other

    cs.MS math.NA

    Algorithm xxx: Faster Randomized SVD with Dynamic Shifts

    Authors: Xu Feng, Wenjian Yu, Yuyang Xie, Jie Tang

    Abstract: Aiming to provide a faster and convenient truncated SVD algorithm for large sparse matrices from real applications (i.e. for computing a few of largest singular values and the corresponding singular vectors), a dynamically shifted power iteration technique is applied to improve the accuracy of the randomized SVD method. This results in a dynamic shifts based randomized SVD (dashSVD) algorithm, whi… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 26 pages, accepted by ACM Transactions on Mathematical Software

  39. arXiv:2404.07821  [pdf, other

    cs.CV

    Sparse Laneformer

    Authors: Ji Liu, Zifeng Zhang, Mingjie Lu, Hongyang Wei, Dong Li, Yile Xie, **zhang Peng, Lu Tian, Ashish Sirasao, Emad Barsoum

    Abstract: Lane detection is a fundamental task in autonomous driving, and has achieved great progress as deep learning emerges. Previous anchor-based methods often design dense anchors, which highly depend on the training dataset and remain fixed during inference. We analyze that dense anchors are not necessary for lane detection, and propose a transformer-based lane detection framework based on a sparse an… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  40. arXiv:2404.07493  [pdf, other

    cs.LG cs.AI

    Characterizing the Influence of Topology on Graph Learning Tasks

    Authors: Kailong Wu, Yule Xie, Jiaxin Ding, Yuxiang Ren, Luoyi Fu, Xinbing Wang, Chenghu Zhou

    Abstract: Graph neural networks (GNN) have achieved remarkable success in a wide range of tasks by encoding features combined with topology to create effective representations. However, the fundamental problem of understanding and analyzing how graph topology influences the performance of learning models on downstream tasks has not yet been well understood. In this paper, we propose a metric, TopoInf, which… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  41. arXiv:2404.06852  [pdf, other

    cs.SE

    Research Artifacts in Software Engineering Publications: Status and Trends

    Authors: Mugeng Liu, Xiaolong Huang, Wei He, Yibing Xie, Jie M. Zhang, Xiang **g, Zhenpeng Chen, Yun Ma

    Abstract: The Software Engineering (SE) community has been embracing the open science policy and encouraging researchers to disclose artifacts in their publications. However, the status and trends of artifact practice and quality remain unclear, lacking insights on further improvement. In this paper, we present an empirical study to characterize the research artifacts in SE publications. Specifically, we ma… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted by Journal of Systems and Software (JSS 2024). Please include JSS in any citations

  42. arXiv:2404.06605  [pdf, other

    cs.CV

    RoadBEV: Road Surface Reconstruction in Bird's Eye View

    Authors: Tong Zhao, Lei Yang, Yichen Xie, Mingyu Ding, Masayoshi Tomizuka, Yintao Wei

    Abstract: Road surface conditions, especially geometry profiles, enormously affect driving performance of autonomous vehicles. Vision-based online road reconstruction promisingly captures road information in advance. Existing solutions like monocular depth estimation and stereo matching suffer from modest performance. The recent technique of Bird's-Eye-View (BEV) perception provides immense potential to mor… ▽ More

    Submitted 20 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Dataset page: https://thu-rsxd.com/rsrd Code: https://github.com/ztsrxh/RoadBEV

  43. arXiv:2404.06365  [pdf, other

    cs.CV cs.MM

    Dynamic Resolution Guidance for Facial Expression Recognition

    Authors: Jie Ou, Xu Li, Tianxiang Jiang, Yuanlun Xie

    Abstract: Facial expression recognition (FER) is vital for human-computer interaction and emotion analysis, yet recognizing expressions in low-resolution images remains challenging. This paper introduces a practical method called Dynamic Resolution Guidance for Facial Expression Recognition (DRGFER) to effectively recognize facial expressions in images with varying resolutions without compromising FER model… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  44. arXiv:2404.05242  [pdf, other

    cs.RO

    Collision-Free Trajectory Optimization in Cluttered Environments with Sums-of-Squares Programming

    Authors: Yulin Li, Chunxin Zheng, Kai Chen, Yusen Xie, Xindong Tang, Michael Yu Wang, Jun Ma

    Abstract: In this work, we propose a trajectory optimization approach for robot navigation in cluttered 3D environments. We represent the robot's geometry as a semialgebraic set defined by polynomial inequalities such that robots with general shapes can be suitably characterized. To address the robot navigation task in obstacle-dense environments, we exploit the free space directly to construct a sequence o… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  45. arXiv:2404.05231  [pdf, other

    cs.CV

    PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection

    Authors: Xiaofan Li, Zhizhong Zhang, Xin Tan, Chengwei Chen, Yanyun Qu, Yuan Xie, Lizhuang Ma

    Abstract: The vision-language model has brought great improvement to few-shot industrial anomaly detection, which usually needs to design of hundreds of prompts through prompt engineering. For automated scenarios, we first use conventional prompt learning with many-class paradigm as the baseline to automatically learn prompts but found that it can not work well in one-class anomaly detection. To address the… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024

  46. arXiv:2404.04960  [pdf, other

    cs.CV

    PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

    Authors: Yutong Xie, Qi Chen, Sinuo Wang, Minh-Son To, Iris Lee, Ee Win Khoo, Kerolos Hendy, Daniel Koh, Yong Xia, Qi Wu

    Abstract: Current vision-language pre-training (VLP) methodologies predominantly depend on paired image-text datasets, a resource that is challenging to acquire in radiology due to privacy considerations and labelling complexities. Data augmentation provides a practical solution to overcome the issue of data scarcity, however, most augmentation methods exhibit a limited focus, prioritising either image or t… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024

  47. arXiv:2404.04879  [pdf, other

    cs.RO eess.SY

    Multi-Type Map Construction via Semantics-Aware Autonomous Exploration in Unknown Indoor Environments

    Authors: Jianfang Mao, Yuheng Xie, Si Chen, Zhixiong Nan, Xiao Wang

    Abstract: This paper proposes a novel semantics-aware autonomous exploration model to handle the long-standing issue: the mainstream RRT (Rapid-exploration Random Tree) based exploration models usually make the mobile robot switch frequently between different regions, leading to the excessively-repeated explorations for the same region. Our proposed semantics-aware model encourages a mobile robot to fully e… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  48. arXiv:2404.04256  [pdf, other

    cs.CV

    Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

    Authors: Zifu Wan, Yuhao Wang, Silong Yong, **** Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie

    Abstract: Multi-modal semantic segmentation significantly enhances AI agents' perception and scene understanding, especially under adverse conditions like low-light or overexposed environments. Leveraging additional modalities (X-modality) like thermal and depth alongside traditional RGB provides complementary information, enabling more robust and reliable segmentation. In this work, we introduce Sigma, a S… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  49. arXiv:2404.03572  [pdf, other

    cs.CV cs.CG

    Terrain Point Cloud Inpainting via Signal Decomposition

    Authors: Yizhou Xie, Xiangning Xie, Yuran Wang, Yanci Zhang, Zejun Lv

    Abstract: The rapid development of 3D acquisition technology has made it possible to obtain point clouds of real-world terrains. However, due to limitations in sensor acquisition technology or specific requirements, point clouds often contain defects such as holes with missing data. Inpainting algorithms are widely used to patch these holes. However, existing traditional inpainting algorithms rely on precis… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  50. arXiv:2404.03329  [pdf

    cs.LG eess.SP stat.ML

    DeepFunction: Deep Metric Learning-based Imbalanced Classification for Diagnosing Threaded Pipe Connection Defects using Functional Data

    Authors: Yukun Xie, Juan Du, Chen Zhang

    Abstract: In modern manufacturing, most of the product lines are conforming. Few products are nonconforming but with different defect types. The identification of defect types can help further root cause diagnosis of production lines. With the sensing development, signals of process variables can be collected in high resolution, which can be regarded as multichannel functional data. They have abundant infor… ▽ More

    Submitted 24 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Revised version for submission to IISE Transactions