Showing 1–50 of 316 results for author: Pan, S

  1. arXiv:2406.17431 [pdf, other]

    cs.SE

    A Large-scale Investigation of Semantically Incompatible APIs behind Compatibility Issues in Android Apps

    Authors: Shidong Pan, Tianchen Guo, Lihong Zhang, Pei Liu, Zhenchang Xing, Xiaoyu Sun

    Abstract: Application Programming Interface (API) incompatibility is a long-standing issue in Android application development. The rapid evolution of Android APIs results in a significant number of API additions, removals, and changes between adjacent versions. Unfortunately, this high frequency of alterations may lead to compatibility issues, often without adequate notification to developers regarding thes…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.16615 [pdf, other]

    cs.CV

    The Championship-Winning Solution for the 5th CLVISION Challenge 2024

    Authors: Sishun Pan, Tingmin Li, Yang Yang

    Abstract: In this paper, we introduce our approach to the 5th CLVision Challenge, which presents distinctive challenges beyond traditional class incremental learning. Unlike standard settings, this competition features the recurrence of previously encountered classes and includes unlabeled data that may contain Out-of-Distribution (OOD) categories. Our approach is based on Winning Subnetworks to allocate in…

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.15859 [pdf, other]

    cs.IR cs.AI

    LLM-Powered Explanations: Unraveling Recommendations Through Subgraph Reasoning

    Authors: Guangsi Shi, Xiaofeng Deng, Linhao Luo, Lijuan Xia, Lei Bao, Bei Ye, Fei Du, Shirui Pan, Yuxiao Li

    Abstract: Recommender systems are pivotal in enhancing user experiences across various web applications by analyzing the complicated relationships between users and items. Knowledge graphs (KGs) have been widely used to enhance the performance of recommender systems. However, KGs are known to be noisy and incomplete, which makes it hard to provide reliable explanations for recommendation results. An explainable r…

    Submitted 29 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

  4. arXiv:2406.15656 [pdf, other]

    eess.IV cs.CV

    Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction

    Authors: Mojtaba Safari, Zach Eidex, Shaoyan Pan, Richard L. J. Qiu, Xiaofeng Yang

    Abstract: Purpose: To propose a self-supervised deep learning-based compressed sensing MRI (DL-based CS-MRI) method named "Adaptive Self-Supervised Consistency Guided Diffusion Model (ASSCGD)" to accelerate data acquisition without requiring fully sampled datasets. Materials and Methods: We used the fastMRI multi-coil brain axial T2-weighted (T2-w) dataset from 1,376 cases and single-coil brain quantitative…

    Submitted 21 June, 2024; originally announced June 2024.

  5. arXiv:2406.15523 [pdf, other]

    cs.LG stat.ML

    Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark

    Authors: Yili Wang, Yixin Liu, Xu Shen, Chenyu Li, Kaize Ding, Rui Miao, Ying Wang, Shirui Pan, Xin Wang

    Abstract: To build safe and reliable graph machine learning systems, unsupervised graph-level anomaly detection (GLAD) and unsupervised graph-level out-of-distribution (OOD) detection (GLOD) have received significant attention in recent years. Though those two lines of research indeed share the same objective, they have been studied independently in the community due to distinct evaluation setups, creating…

    Submitted 21 June, 2024; originally announced June 2024.

  6. arXiv:2406.13925 [pdf, other]

    cs.CL cs.AI

    GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models

    Authors: Tao Zhang, Ziqian Zeng, Yuxiang Xiao, Huiping Zhuang, Cen Chen, James Foulds, Shimei Pan

    Abstract: Large Language Models (LLMs) are prone to generating content that exhibits gender biases, raising significant ethical concerns. Alignment, the process of fine-tuning LLMs to better align with desired behaviors, is recognized as an effective approach to mitigate gender biases. Although proprietary LLMs have made significant strides in mitigating gender bias, their alignment datasets are not publicl…

    Submitted 19 June, 2024; originally announced June 2024.

  7. arXiv:2406.12539 [pdf, other]

    cs.LG cs.AI

    The Heterophilic Snowflake Hypothesis: Training and Empowering GNNs for Heterophilic Graphs

    Authors: Kun Wang, Guibin Zhang, Xinnan Zhang, Junfeng Fang, Xun Wu, Guohao Li, Shirui Pan, Wei Huang, Yuxuan Liang

    Abstract: Graph Neural Networks (GNNs) have become pivotal tools for a range of graph-based learning tasks. Notably, most current GNN architectures operate under the assumption of homophily, whether explicitly or implicitly. While this underlying assumption is frequently adopted, it is not universally applicable, which can result in potential shortcomings in learning effectiveness. In this paper, \textbf{fo…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: KDD 2024

  8. arXiv:2406.05261 [pdf, other]

    cs.CV cs.GR

    Split-and-Fit: Learning B-Reps via Structure-Aware Voronoi Partitioning

    Authors: Yilin Liu, Jiale Chen, Shanshan Pan, Daniel Cohen-Or, Hao Zhang, Hui Huang

    Abstract: We introduce a novel method for acquiring boundary representations (B-Reps) of 3D CAD models which involves a two-step process: it first applies a spatial partitioning, referred to as the "split", followed by a "fit" operation to derive a single primitive within each partition. Specifically, our partitioning aims to produce the classical Voronoi diagram of the set of ground-truth (GT) B-Rep pr…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACM Transactions on Graphics (SIGGRAPH 2024); Project page: https://vcc.tech/research/2024/BRepVP; Code: https://github.com/yilinliu77/NVDNet

  9. arXiv:2406.03647 [pdf, other]

    cs.LG cs.AI

    Decision-focused Graph Neural Networks for Combinatorial Optimization

    Authors: Yang Liu, Chuan Zhou, Peng Zhang, Shirui Pan, Zhao Li, Hongyang Chen

    Abstract: In recent years, there has been notable interest in investigating combinatorial optimization (CO) problems by neural-based framework. An emerging strategy to tackle these challenging problems involves the adoption of graph neural networks (GNNs) as an alternative to traditional algorithms, a subject that has attracted considerable attention. Despite the growing popularity of GNNs and traditional a…

    Submitted 9 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages

  10. arXiv:2406.03511 [pdf, other]

    cs.LG cs.AI

    MagiNet: Mask-Aware Graph Imputation Network for Incomplete Traffic Data

    Authors: Jianping Zhou, Bin Lu, Zhanyu Liu, Siyu Pan, Xuejun Feng, Hua Wei, Guanjie Zheng, Xinbing Wang, Chenghu Zhou

    Abstract: Due to detector malfunctions and communication failures, missing data is ubiquitous during the collection of traffic data. Therefore, it is of vital importance to impute the missing values to facilitate data analysis and decision-making for Intelligent Transportation System (ITS). However, existing imputation methods generally perform zero pre-filling techniques to initialize missing values, intro…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 19 pages, 7 figures

  11. arXiv:2406.01256 [pdf, other]

    cs.CV cs.AI

    Augmented Commonsense Knowledge for Remote Object Grounding

    Authors: Bahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen Qinfeng Shi

    Abstract: The vision-and-language navigation (VLN) task necessitates an agent to perceive the surroundings, follow natural language instructions, and act in photo-realistic unseen environments. Most of the existing methods employ the entire image or object features to represent navigable viewpoints. However, these representations are insufficient for proper action prediction, especially for the REVERIE task…

    Submitted 3 June, 2024; originally announced June 2024.

  12. arXiv:2406.00210 [pdf, other]

    cs.CV

    A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inheritance Strategies

    Authors: Jinchao Zhu, Yuxuan Wang, Siyuan Pan, Pengfei Wan, Di Zhang, Gao Huang

    Abstract: The Stable Diffusion Model (SDM) is a prevalent and effective model for text-to-image (T2I) and image-to-image (I2I) generation. Despite various attempts at sampler optimization, model distillation, and network quantification, these approaches typically maintain the original network architecture. The extensive parameter scale and substantial computational demands have limited research into adjusti…

    Submitted 17 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

    Comments: 19 pages, 16 figures, submitted to IEEE Transactions on Neural Networks and Learning Systems

  13. arXiv:2405.20652 [pdf, other]

    cs.LG

    Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs

    Authors: Langzhang Liang, Sunwoo Kim, Kijung Shin, Zenglin Xu, Shirui Pan, Yuan Qi

    Abstract: Graph Neural Networks (GNNs) have gained significant attention as a powerful modeling and inference method, especially for homophilic graph-structured data. To empower GNNs in heterophilic graphs, where adjacent nodes exhibit dissimilar labels or features, Signed Message Passing (SMP) has been widely adopted. However, there is a lack of theoretical and empirical analysis regarding the limitations…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Published as a conference paper at ICML 2024

  14. arXiv:2405.19677 [pdf, other]

    cs.CR cs.AI

    Large Language Model Watermark Stealing With Mixed Integer Programming

    Authors: Zhaoxi Zhang, Xiaomei Zhang, Yanjun Zhang, Leo Yu Zhang, Chao Chen, Shengshan Hu, Asif Gill, Shirui Pan

    Abstract: The Large Language Model (LLM) watermark is a newly emerging technique that shows promise in addressing concerns surrounding LLM copyright, monitoring AI-generated text, and preventing its misuse. The LLM watermark scheme commonly includes generating secret keys to partition the vocabulary into green and red lists, applying a perturbation to the logits of tokens in the green list to increase their…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 12 pages

  15. arXiv:2405.17493 [pdf, other]

    cs.LG

    Overcoming Negative Transfer by Online Selection: Distant Domain Adaptation for Fault Diagnosis

    Authors: Ziyan Wang, Mohamed Ragab, Wenmian Yang, Min Wu, Sinno Jialin Pan, Jie Zhang, Zhenghua Chen

    Abstract: Unsupervised domain adaptation (UDA) has achieved remarkable success in fault diagnosis, bringing significant benefits to diverse industrial applications. While most UDA methods focus on cross-working condition scenarios where the source and target domains are notably similar, real-world applications often grapple with severe domain shifts. We coin the term 'distant domain adaptation problem' to d…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures

  16. arXiv:2405.16772 [pdf, other]

    cs.SI cs.LG

    Balancing User Preferences by Social Networks: A Condition-Guided Social Recommendation Model for Mitigating Popularity Bias

    Authors: Xin He, Wenqi Fan, Ruobing Wang, Yili Wang, Ying Wang, Shirui Pan, Xin Wang

    Abstract: Social recommendation models weave social interactions into their design to provide uniquely personalized recommendation results for users. However, social networks not only amplify the popularity bias in recommendation models, resulting in more frequent recommendation of hot items and fewer long-tail items, but also include a substantial amount of redundant information that is essentially meaning…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures

  17. arXiv:2405.16771 [pdf, other]

    cs.LG

    ARC: A Generalist Graph Anomaly Detector with In-Context Learning

    Authors: Yixin Liu, Shiyuan Li, Yu Zheng, Qingfeng Chen, Chengqi Zhang, Shirui Pan

    Abstract: Graph anomaly detection (GAD), which aims to identify abnormal nodes that differ from the majority within a graph, has garnered significant attention. However, current GAD methods necessitate training specific to each dataset, resulting in high training costs, substantial data requirements, and limited generalizability when being applied to new datasets and domains. To address these limitations, t…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 25 pages, 10 figures

  18. arXiv:2405.14545 [pdf, other]

    q-bio.BM cs.LG

    A Cross-Field Fusion Strategy for Drug-Target Interaction Prediction

    Authors: Hongzhi Zhang, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug-target interaction (DTI) prediction is a critical component of the drug discovery process. In the drug development engineering field, predicting novel drug-target interactions is extremely crucial. However, although existing methods have achieved high accuracy levels in predicting known drugs and drug targets, they fail to utilize global protein information during DTI prediction. This leads to…

    Submitted 23 May, 2024; originally announced May 2024.

  19. arXiv:2405.14536 [pdf, other]

    q-bio.MN cs.AI cs.LG

    Regressor-free Molecule Generation to Support Drug Response Prediction

    Authors: Kun Li, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug response prediction (DRP) is a crucial phase in drug discovery, and the most important metric for its evaluation is the IC50 score. DRP results are heavily dependent on the quality of the generated molecules. Existing molecule generation methods typically employ classifier-based guidance, enabling sampling within the IC50 classification range. However, these methods fail to ensure the samplin…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 22 pages, 7 figures, 9 tables

  20. arXiv:2405.14407 [pdf, other]

    cs.LG

    Gradient Transformation: Towards Efficient and Model-Agnostic Unlearning for Dynamic Graph Neural Networks

    Authors: He Zhang, Bang Wu, Xiangwen Yang, Xingliang Yuan, Chengqi Zhang, Shirui Pan

    Abstract: Graph unlearning has emerged as an essential tool for safeguarding user privacy and mitigating the negative impacts of undesirable data. Meanwhile, the advent of dynamic graph neural networks (DGNNs) marks a significant advancement due to their superior capability in learning from dynamic graphs, which encapsulate spatial-temporal variations in diverse real-world applications (e.g., traffic foreca…

    Submitted 23 May, 2024; originally announced May 2024.

  21. arXiv:2405.14260 [pdf, other]

    cs.LG cs.AI

    Graph Sparsification via Mixture of Graphs

    Authors: Guibin Zhang, Xiangguo Sun, Yanwei Yue, Kun Wang, Tianlong Chen, Shirui Pan

    Abstract: Graph Neural Networks (GNNs) have demonstrated superior performance across various graph learning tasks but face significant computational challenges when applied to large-scale graphs. One effective approach to mitigate these challenges is graph sparsification, which involves removing non-essential edges to reduce computational overhead. However, previous graph sparsification methods often rely o…

    Submitted 23 May, 2024; originally announced May 2024.

  22. arXiv:2405.14170 [pdf, other]

    cs.AI cs.CL

    Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph Reasoning

    Authors: Jiapu Wang, Kai Sun, Linhao Luo, Wei Wei, Yongli Hu, Alan Wee-Chung Liew, Shirui Pan, Baocai Yin

    Abstract: Temporal Knowledge Graph Reasoning (TKGR) is the process of utilizing temporal information to capture complex relations within a Temporal Knowledge Graph (TKG) to infer new knowledge. Conventional methods in TKGR typically depend on deep learning algorithms or temporal logical rules. However, deep learning-based TKGRs often lack interpretability, whereas rule-based TKGRs struggle to effectively le…

    Submitted 23 May, 2024; originally announced May 2024.

  23. arXiv:2405.10642 [pdf, other]

    cs.LG

    Hi-GMAE: Hierarchical Graph Masked Autoencoders

    Authors: Chuang Liu, Zelin Yao, Yibing Zhan, Xueqi Ma, Dapeng Tao, Jia Wu, Wenbin Hu, Shirui Pan, Bo Du

    Abstract: Graph Masked Autoencoders (GMAEs) have emerged as a notable self-supervised learning approach for graph-structured data. Existing GMAE models primarily focus on reconstructing node-level information, categorizing them as single-scale GMAEs. This methodology, while effective in certain contexts, tends to overlook the complex hierarchical structures inherent in many real-world graphs. For instance,…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures, 3 tables

  24. arXiv:2405.07892 [pdf, other]

    cs.LG

    All Nodes are created Not Equal: Node-Specific Layer Aggregation and Filtration for GNN

    Authors: Shilong Wang, Hao Wu, Yifan Duan, Guibin Zhang, Guohao Li, Yuxuan Liang, Shirui Pan, Kun Wang, Yang Wang

    Abstract: The ever-designed Graph Neural Networks, though opening a promising path for the modeling of the graph-structure data, unfortunately introduce two daunting obstacles to their deployment on devices. (I) Most of existing GNNs are shallow, due mostly to the over-smoothing and gradient-vanish problem as they go deeper as convolutional architectures. (II) The vast majority of GNNs adhere to the homophi…

    Submitted 13 May, 2024; originally announced May 2024.

  25. arXiv:2405.07430 [pdf, other]

    cs.SE cs.CR

    Don't Chase Your Tail! Missing Key Aspects Augmentation in Textual Vulnerability Descriptions of Long-tail Software through Feature Inference

    Authors: Linyi Han, Shidong Pan, Zhenchang Xing, Jiamou Sun, Sofonias Yitagesu, Xiaowang Zhang, Zhiyong Feng

    Abstract: Augmenting missing key aspects in Textual Vulnerability Descriptions (TVDs) for software with a large user base (referred to as non-long-tail software) has greatly advanced vulnerability analysis and software security research. However, these methods often overlook software instances that have a limited user base (referred to as long-tail software) due to limited TVDs, variations in software featu…

    Submitted 12 May, 2024; originally announced May 2024.

  26. arXiv:2405.03615 [pdf, other]

    cs.LG

    Nonnegative Matrix Factorization in Dimensionality Reduction: A Survey

    Authors: Farid Saberi-Movahed, Kamal Berahman, Razieh Sheikhpour, Yuefeng Li, Shirui Pan

    Abstract: Dimensionality Reduction plays a pivotal role in improving feature learning accuracy and reducing training time by eliminating redundant features, noise, and irrelevant data. Nonnegative Matrix Factorization (NMF) has emerged as a popular and powerful method for dimensionality reduction. Despite its extensive use, there remains a need for a comprehensive analysis of NMF in the context of dimension…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 10 pages, 2 figures; to appear in ACM Computing Surveys

  27. arXiv:2404.18886 [pdf, other]

    cs.LG cs.AI

    A Survey on Diffusion Models for Time Series and Spatio-Temporal Data

    Authors: Yiyuan Yang, Ming Jin, Haomin Wen, Chaoli Zhang, Yuxuan Liang, Lintao Ma, Yi Wang, Chenghao Liu, Bin Yang, Zenglin Xu, Jiang Bian, Shirui Pan, Qingsong Wen

    Abstract: The study of time series is crucial for understanding trends and anomalies over time, enabling predictive insights across various sectors. Spatio-temporal data, on the other hand, is vital for analyzing phenomena in both space and time, providing a dynamic perspective on complex system interactions. Recently, diffusion models have seen widespread application in time series and spatio-temporal data…

    Submitted 11 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Ongoing work & Under review; 27 pages, 8 figures, 2 tables; Github Repo: https://github.com/yyysjz1997/Awesome-TimeSeries-SpatioTemporal-Diffusion-Model

  28. arXiv:2404.17964 [pdf, other]

    cs.SE

    Automating Zero-Shot Patch Porting for Hard Forks

    Authors: Shengyi Pan, You Wang, Zhongxin Liu, Xing Hu, Xin Xia, Shanping Li

    Abstract: Forking is a typical way of code reuse, which provides a simple way for developers to create a variant software (denoted as hard fork) by copying and modifying an existing codebase. Despite the benefits, forking also leads to duplicate efforts in software maintenance. Developers need to port patches across the hard forks to address similar bugs or implement similar features. Due to the divergen…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted by ISSTA 2024

  29. arXiv:2404.15729 [pdf, other]

    cs.LG

    Gradformer: Graph Transformer with Exponential Decay

    Authors: Chuang Liu, Zelin Yao, Yibing Zhan, Xueqi Ma, Shirui Pan, Wenbin Hu

    Abstract: Graph Transformers (GTs) have demonstrated their advantages across a wide range of tasks. However, the self-attention mechanism in GTs overlooks the graph's inductive biases, particularly biases related to structure, which are crucial for the graph tasks. Although some methods utilize positional encoding and attention bias to model inductive biases, their effectiveness is still suboptimal analytic…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 9 pages, 7 figures. Accepted by IJCAI 2024

  30. arXiv:2404.15625 [pdf, other]

    cs.LG

    Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models

    Authors: Xu Shen, Yili Wang, Kaixiong Zhou, Shirui Pan, Xin Wang

    Abstract: The open-world test dataset is often mixed with out-of-distribution (OOD) samples, where the deployed models will struggle to make accurate predictions. Traditional detection methods need to trade off OOD detection and in-distribution (ID) classification performance since they share the same representation learning model. In this work, we propose to detect OOD molecules by adopting an auxiliary di…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 11 pages, 10 figures

  31. arXiv:2404.11536 [pdf, other]

    cs.LG cs.AI

    FedPFT: Federated Proxy Fine-Tuning of Foundation Models

    Authors: Zhaopeng Peng, Xiaoliang Fan, Yufan Chen, Zheng Wang, Shirui Pan, Chenglu Wen, Ruisheng Zhang, Cheng Wang

    Abstract: Adapting Foundation Models (FMs) for downstream tasks through Federated Learning (FL) emerges as a promising strategy for protecting data privacy and valuable FMs. Existing methods fine-tune FM by allocating sub-FM to clients in FL, however, leading to suboptimal performance due to insufficient tuning and inevitable error accumulations of gradients. In this paper, we propose Federated Proxy Fine-Tuni…

    Submitted 28 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI'24

  32. arXiv:2404.07546 [pdf, other]

    cs.CL

    Decomposing Label Space, Format and Discrimination: Rethinking How LLMs Respond and Solve Tasks via In-Context Learning

    Authors: Quanyu Long, Yin Wu, Wenya Wang, Sinno Jialin Pan

    Abstract: In-context Learning (ICL) has emerged as a powerful capability alongside the development of scaled-up large language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without updating millions of parameters. However, the precise contributions of demonstrations towards improving end-task performance have not been thoroughly i…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 36 pages, 8 figures

  33. arXiv:2403.19881 [pdf, other]

    cs.AI

    IME: Integrating Multi-curvature Shared and Specific Embedding for Temporal Knowledge Graph Completion

    Authors: Jiapu Wang, Zheng Cui, Boyue Wang, Shirui Pan, Junbin Gao, Baocai Yin, Wen Gao

    Abstract: Temporal Knowledge Graphs (TKGs) incorporate a temporal dimension, allowing for a precise capture of the evolution of knowledge and reflecting the dynamic nature of the real world. Typically, TKGs contain complex geometric structures, with various geometric structures interwoven. However, existing Temporal Knowledge Graph Completion (TKGC) methods either model TKGs in a single space or neglect the…

    Submitted 28 March, 2024; originally announced March 2024.

  34. arXiv:2403.17702 [pdf, other]

    cs.CV

    The Solution for the CVPR 2023 1st foundation model challenge-Track2

    Authors: Haonan Xu, Yurui Huang, Sishun Pan, Zhihao Guan, Yi Xu, Yang Yang

    Abstract: In this paper, we propose a solution for cross-modal transportation retrieval. Due to the cross-domain problem of traffic images, we divide the problem into two sub-tasks of pedestrian retrieval and vehicle retrieval through a simple strategy. In pedestrian retrieval tasks, we use IRRA as the base model and specifically design an Attribute Classification to mine the knowledge implied by attribute…

    Submitted 2 April, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  35. arXiv:2403.16803 [pdf, other]

    cs.RO cs.CV

    Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning

    Authors: Sicong Pan, Liren Jin, Xuying Huang, Cyrill Stachniss, Marija Popović, Maren Bennewitz

    Abstract: Object reconstruction is relevant for many autonomous robotic tasks that require interaction with the environment. A key challenge in such scenarios is planning view configurations to collect informative measurements for reconstructing an initially unknown object. One-shot view planning enables efficient data collection by predicting view configurations and planning the globally shortest path conn…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Sicong Pan and Liren Jin contributed equally. Submitted to IROS 2024

  36. Foundation Models for Time Series Analysis: A Tutorial and Survey

    Authors: Yuxuan Liang, Haomin Wen, Yuqi Nie, Yushan Jiang, Ming Jin, Dongjin Song, Shirui Pan, Qingsong Wen

    Abstract: Time series analysis stands as a focal point within the data mining community, serving as a cornerstone for extracting valuable insights crucial to a myriad of real-world applications. Recent advances in Foundation Models (FMs) have fundamentally reshaped the paradigm of model design for time series analysis, boosting various downstream tasks in practice. These innovative approaches often leverage…

    Submitted 18 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'24)

  37. arXiv:2403.09973 [pdf, other]

    cs.CV

    Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience

    Authors: Xiaohang Yu, Zhengxian Yang, Shi Pan, Yuqi Han, Haoxiang Wang, Jun Zhang, Shi Yan, Borong Lin, Lei Yang, Tao Yu, Lu Fang

    Abstract: We have built a custom mobile multi-camera large-space dense light field capture system, which provides a series of high-quality and sufficiently dense light field images for various scenarios. Our aim is to contribute to the development of popular 3D scene reconstruction algorithms such as IBRNet, NeRF, and 3D Gaussian splatting. More importantly, the collected dataset, which is much denser than…

    Submitted 14 March, 2024; originally announced March 2024.

  38. arXiv:2403.09953 [pdf, other]

    cs.LG

    Online GNN Evaluation Under Test-time Graph Distribution Shifts

    Authors: Xin Zheng, Dongjin Song, Qingsong Wen, Bo Du, Shirui Pan

    Abstract: Evaluating the performance of a well-trained GNN model on real-world graphs is a pivotal step for reliable GNN online deployment and serving. Due to a lack of test node labels and unknown potential training-test graph data distribution shifts, conventional model evaluation encounters limitations in calculating performance metrics (e.g., test error) and measuring graph data-level discrepancies, par…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR-2024

  39. arXiv:2403.08758 [pdf]

    eess.IV cs.CV

    Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI

    Authors: Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun

    Abstract: Current deep learning reconstruction for accelerated cardiac cine MRI suffers from spatial and temporal blurring. We aim to improve image sharpness and motion delineation for cine MRI under high undersampling rates. A spatiotemporal diffusion enhancement model conditional on an existing deep learning reconstruction along with a novel paired sampling strategy was developed. The diffusion model prov…

    Submitted 13 March, 2024; originally announced March 2024.

  40. arXiv:2403.08749 [pdf]

    eess.IV cs.CV

    Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI

    Authors: Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun

    Abstract: The currently limited quality of accelerated cardiac cine reconstruction may potentially be improved by the emerging diffusion models, but the clinically unacceptable long processing time poses a challenge. We aim to develop a clinically feasible diffusion-model-based reconstruction pipeline to improve the image quality of cine MRI. A multi-in multi-out diffusion enhancement model together with fa…

    Submitted 13 March, 2024; originally announced March 2024.

  41. arXiv:2403.07943 [pdf, other]

    cs.LG cs.CR

    Revisiting Edge Perturbation for Graph Neural Network in Graph Data Augmentation and Attack

    Authors: Xin Liu, Yuxiang Zhang, Meng Wu, Mingyu Yan, Kun He, Wei Yan, Shirui Pan, Xiaochun Ye, Dongrui Fan

    Abstract: Edge perturbation is a basic method to modify graph structures. It can be categorized into two veins based on their effects on the performance of graph neural networks (GNNs), i.e., graph data augmentation and attack. Surprisingly, both veins of edge perturbation methods employ the same operations, yet yield opposite effects on GNNs' accuracy. A distinct boundary between these methods in using edg…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 14P

  42. arXiv:2403.07673 [pdf, other]

    cs.CR

    Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation

    Authors: Di Mi, Yanjun Zhang, Leo Yu Zhang, Shengshan Hu, Qi Zhong, Haizhuan Yuan, Shirui Pan

    Abstract: Model extraction attacks (MEAs) enable an attacker to replicate the functionality of a victim deep neural network (DNN) model by only querying its API service remotely, posing a severe threat to the security and integrity of pay-per-query DNN-based services. Although the majority of current research on MEAs has primarily concentrated on neural classifiers, there is a growing prevalence of image-to…

    Submitted 19 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI 2024

  43. arXiv:2403.01193  [pdf, other]

    cs.CL cs.AI

    RAGged Edges: The Double-Edged Sword of Retrieval-Augmented Chatbots

    Authors: Philip Feldman, James R. Foulds, Shimei Pan

    Abstract: Large language models (LLMs) like ChatGPT demonstrate the remarkable progress of artificial intelligence. However, their tendency to hallucinate -- generate plausible but false information -- poses a significant challenge. This issue is critical, as seen in recent court cases where ChatGPT's use led to citations of non-existent legal rulings. This paper explores how Retrieval-Augmented Generation…

    Submitted 12 June, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: 7 Pages, 1 Figure, 1 Table

    ACM Class: H.3.3; I.2.7

  44. arXiv:2402.18495  [pdf, other]

    cs.LG

    ROG$_{PL}$: Robust Open-Set Graph Learning via Region-Based Prototype Learning

    Authors: Qin Zhang, Xiaowei Li, Jiexin Lu, Liping Qiu, Shirui Pan, Xiaojun Chen, Junyang Chen

    Abstract: Open-set graph learning is a practical task that aims to classify the known class nodes and to identify unknown class samples as unknowns. Conventional node classification methods usually perform unsatisfactorily in open-set scenarios due to the complex data they encounter, such as out-of-distribution (OOD) data and in-distribution (IND) noise. OOD data are samples that do not belong to any known…

    Submitted 29 February, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures

  45. arXiv:2402.16374  [pdf, other]

    cs.LG cs.SI

    Graph Learning under Distribution Shifts: A Comprehensive Survey on Domain Adaptation, Out-of-distribution, and Continual Learning

    Authors: Man Wu, Xin Zheng, Qin Zhang, Xiao Shen, Xiong Luo, Xingquan Zhu, Shirui Pan

    Abstract: Graph learning plays a pivotal role and has gained significant attention in various application scenarios, from social network analysis to recommendation systems, for its effectiveness in modeling complex data relations represented by graph structural data. In reality, the real-world graph data typically show dynamics over time, with changing node attributes and edge structure, leading to the seve…

    Submitted 7 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  46. arXiv:2402.14544  [pdf, other]

    cs.CR cs.SE

    A New Hope: Contextual Privacy Policies for Mobile Applications and An Approach Toward Automated Generation

    Authors: Shidong Pan, Zhen Tao, Thong Hoang, Dawen Zhang, Tianshi Li, Zhenchang Xing, Sherry Xu, Mark Staples, Thierry Rakotoarivelo, David Lo

    Abstract: Privacy policies have emerged as the predominant approach to conveying privacy notices to mobile application users. In an effort to enhance both readability and user engagement, the concept of contextual privacy policies (CPPs) has been proposed by researchers. The aim of CPPs is to fragment privacy policies into concise snippets, displaying them only within the corresponding contexts within the a…

    Submitted 10 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: USENIX Security 2024. arXiv admin note: text overlap with arXiv:2307.01691

  47. arXiv:2402.13532  [pdf, other]

    cs.CL

    Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation

    Authors: Quanyu Long, Yue Deng, LeiLei Gan, Wenya Wang, Sinno Jialin Pan

    Abstract: Dense retrievers and retrieval-augmented language models have been widely used in various NLP applications. Despite being designed to deliver reliable and secure outcomes, the vulnerability of retrievers to potential attacks remains unclear, raising concerns about their security. In this paper, we introduce a novel scenario where the attackers aim to covertly disseminate targeted misinformation, s…

    Submitted 21 February, 2024; originally announced February 2024.

  48. arXiv:2402.11463  [pdf, other]

    cs.LG cs.AI nlin.CD

    Attractor Memory for Long-Term Time Series Forecasting: A Chaos Perspective

    Authors: Jiaxi Hu, Yuehong Hu, Wei Chen, Ming Jin, Shirui Pan, Qingsong Wen, Yuxuan Liang

    Abstract: In long-term time series forecasting (LTSF) tasks, an increasing number of models have acknowledged that discrete time series originate from continuous dynamic systems and have attempted to model their dynamical structures. Recognizing the chaotic nature of real-world data, our model, Attraos, incorporates chaos theory into LTSF, perceiving real-world time series as observations…

    Submitted 20 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:nlin/0307015 by other authors

  49. arXiv:2402.02713  [pdf, other]

    cs.LG cs.AI

    Position: What Can Large Language Models Tell Us about Time Series Analysis

    Authors: Ming Jin, Yifan Zhang, Wei Chen, Kexin Zhang, Yuxuan Liang, Bin Yang, Jindong Wang, Shirui Pan, Qingsong Wen

    Abstract: Time series analysis is essential for comprehending the complexities inherent in various real-world systems and applications. Although large language models (LLMs) have recently made significant strides, the development of artificial general intelligence (AGI) equipped with time series analysis capabilities remains in its nascent phase. Most existing time series models heavily rely on domain knowle…

    Submitted 1 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by the 41st International Conference on Machine Learning (ICML 2024)

  50. arXiv:2402.01364  [pdf, other]

    cs.CL cs.LG

    Continual Learning for Large Language Models: A Survey

    Authors: Tongtong Wu, Linhao Luo, Yuan-Fang Li, Shirui Pan, Thuy-Trang Vu, Gholamreza Haffari

    Abstract: Large language models (LLMs) are not amenable to frequent re-training, due to high training costs arising from their massive scale. However, updates are necessary to endow LLMs with new skills and keep them up-to-date with rapidly evolving human knowledge. This paper surveys recent works on continual learning for LLMs. Due to the unique nature of LLMs, we catalog continual learning techniques in a…

    Submitted 7 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.