Skip to main content

Showing 1–50 of 306 results for author: Xiali

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02098  [pdf, other

    cs.CV

    DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection

    Authors: Kaixin Xu, Qingtian Feng, Hao Chen, Zhe Wang, Xue Geng, Xulei Yang, Min Wu, Xiaoli Li, Weisi Lin

    Abstract: Applying deep neural networks to 3D point cloud processing has attracted increasing attention due to its advanced performance in many areas, such as AR/VR, autonomous driving, and robotics. However, as neural network models and 3D point clouds expand in size, it becomes a crucial challenge to reduce the computational and memory overhead to meet latency and energy constraints in real-world applicat… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.02068  [pdf, other

    cs.CV

    LPViT: Low-Power Semi-structured Pruning for Vision Transformers

    Authors: Kaixin Xu, Zhe Wang, Chunyun Chen, Xue Geng, Jie Lin, Xulei Yang, Min Wu, Xiaoli Li, Weisi Lin

    Abstract: Vision transformers have emerged as a promising alternative to convolutional neural networks for various image analysis tasks, offering comparable or superior performance. However, one significant drawback of ViTs is their resource-intensive nature, leading to increased memory footprint, computation complexity, and power consumption. To democratize this high-performance technology and make it more… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2406.08765  [pdf, other

    cs.LG

    LLM-based Knowledge Pruning for Time Series Data Analytics on Edge-computing Devices

    Authors: Ruibing **, Qing Xu, Min Wu, Yuecong Xu, Dan Li, Xiaoli Li, Zhenghua Chen

    Abstract: Limited by the scale and diversity of time series data, the neural networks trained on time series data often overfit and show unsatisfacotry performances. In comparison, large language models (LLMs) recently exhibit impressive generalization in diverse fields. Although massive LLM based approaches are proposed for time series tasks, these methods require to load the whole LLM in both training and… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  4. arXiv:2406.05485  [pdf, other

    cs.CV

    Training-Free Robust Interactive Video Object Segmentation

    Authors: Xiaoli Wei, Zhaoqing Wang, Yandong Guo, Chunxia Zhang, Tongliang Liu, Mingming Gong

    Abstract: Interactive video object segmentation is a crucial video task, having various applications from video editing to data annotating. However, current approaches struggle to accurately segment objects across diverse domains. Recently, Segment Anything Model (SAM) introduces interactive visual prompts and demonstrates impressive performance across different domains. In this paper, we propose a training… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  5. arXiv:2406.02635  [pdf, other

    cs.LG cs.AI

    Evidentially Calibrated Source-Free Time-Series Domain Adaptation with Temporal Imputation

    Authors: Mohamed Ragab, Peiliang Gong, Emadeldeen Eldele, Wenyu Zhang, Min Wu, Chuan-Sheng Foo, Daoqiang Zhang, Xiaoli Li, Zhenghua Chen

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a model pre-trained on a labeled source domain to an unlabeled target domain without access to source data, preserving the source domain's privacy. While SFDA is prevalent in computer vision, it remains largely unexplored in time series analysis. Existing SFDA methods, designed for visual data, struggle to capture the inherent temporal dynamics of… ▽ More

    Submitted 12 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2406.01922  [pdf, ps, other

    eess.SP cs.IT

    Performance Analysis of Hybrid Cellular and Cell-free MIMO Network

    Authors: Zhuoyin Dai, **gran Xu, Xiaoli Xu, Ruoguang Li, Yong Zeng

    Abstract: Cell-free wireless communication is envisioned as one of the most promising network architectures, which can achieve stable and uniform communication performance while improving the system energy and spectrum efficiency. The deployment of cell-free networks is envisioned to be a longterm evolutionary process, in which cell-free access points (APs) will be gradually introduced into the communicatio… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.19623  [pdf, other

    cs.SE

    A Novel Approach for Automated Design Information Mining from Issue Logs

    Authors: Jiuang Zhao, Zitian Yang, Li Zhang, Xiaoli Lian, Donghao Yang

    Abstract: Software architectures are usually meticulously designed to address multiple quality concerns and support long-term maintenance. However, due to the imbalance between the cost and value for developers to document design rationales (i.e., the design alternatives and the underlying arguments for making or rejecting decisions), these rationales are often obsolete or even missing. The lack of design k… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  8. arXiv:2405.15458  [pdf, other

    cs.LG cs.DC

    FedCal: Achieving Local and Global Calibration in Federated Learning via Aggregated Parameterized Scaler

    Authors: Hongyi Peng, Han Yu, Xiaoli Tang, Xiaoxiao Li

    Abstract: Federated learning (FL) enables collaborative machine learning across distributed data owners, but data heterogeneity poses a challenge for model calibration. While prior work focused on improving accuracy for non-iid data, calibration remains under-explored. This study reveals existing FL aggregation approaches lead to sub-optimal calibration, and theoretical analysis shows despite constraining v… ▽ More

    Submitted 3 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted by ICML'24

  9. arXiv:2405.14767  [pdf, other

    q-fin.ST cs.CL cs.LG q-fin.TR

    FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

    Authors: Hongyang Yang, Boyu Zhang, Neng Wang, Cheng Guo, Xiaoli Zhang, Likun Lin, Junlin Wang, Tianyu Zhou, Mao Guan, Runjia Zhang, Christina Dan Wang

    Abstract: As financial institutions and professionals increasingly incorporate Large Language Models (LLMs) into their workflows, substantial barriers, including proprietary data and specialized knowledge, persist between the finance sector and the AI community. These challenges impede the AI community's ability to enhance financial tasks effectively. Acknowledging financial analysis's critical role, we aim… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: FinRobot Whitepaper V1.0

  10. arXiv:2405.06038  [pdf, other

    cs.LG cs.AI

    From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks

    Authors: Xue Geng, Zhe Wang, Chunyun Chen, Qing Xu, Kaixin Xu, Chao **, Manas Gupta, Xulei Yang, Zhenghua Chen, Mohamed M. Sabry Aly, Jie Lin, Min Wu, Xiaoli Li

    Abstract: Deep neural networks (DNNs) have been widely used in many artificial intelligence (AI) tasks. However, deploying them brings significant challenges due to the huge cost of memory, energy, and computation. To address these challenges, researchers have developed various model compression techniques such as model quantization and model pruning. Recently, there has been a surge in research of compress… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: This manuscript is the accepted version for TNNLS(IEEE Transactions on Neural Networks and Learning Systems)

  11. arXiv:2405.05991  [pdf, other

    cs.LG cs.AI cs.GT

    Agent-oriented Joint Decision Support for Data Owners in Auction-based Federated Learning

    Authors: Xiaoli Tang, Han Yu, Xiaoxiao Li

    Abstract: Auction-based Federated Learning (AFL) has attracted extensive research interest due to its ability to motivate data owners (DOs) to join FL through economic means. While many existing AFL methods focus on providing decision support to model users (MUs) and the AFL auctioneer, decision support for data owners remains open. To bridge this gap, we propose a first-of-its-kind agent-oriented joint Pri… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  12. arXiv:2405.00718  [pdf, other

    cs.CL cs.AI

    Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language Models

    Authors: Xu Ji, Jianyi Zhang, Ziyin Zhou, Zhangchi Zhao, Qianqian Qiao, Kaiying Han, Md Imran Hossen, Xiali Hei

    Abstract: Ensuring the resilience of Large Language Models (LLMs) against malicious exploitation is paramount, with recent focus on mitigating offensive responses. Yet, the understanding of cant or dark jargon remains unexplored. This paper introduces a domain-specific Cant dataset and CantCounter evaluation framework, employing Fine-Tuning, Co-Tuning, Data-Diffusion, and Data-Analysis stages. Experiments r… ▽ More

    Submitted 25 April, 2024; originally announced May 2024.

  13. arXiv:2404.18567  [pdf, other

    cs.CR

    Assessing Cybersecurity Vulnerabilities in Code Large Language Models

    Authors: Md Imran Hossen, Jianyi Zhang, Yinzhi Cao, Xiali Hei

    Abstract: Instruction-tuned Code Large Language Models (Code LLMs) are increasingly utilized as AI coding assistants and integrated into various applications. However, the cybersecurity vulnerabilities and implications arising from the widespread integration of these models are not yet fully understood due to limited research in this domain. To bridge this gap, this paper presents EvilInstructCoder, a frame… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  14. arXiv:2404.15381  [pdf, other

    cs.LG cs.AI

    Advances and Open Challenges in Federated Learning with Foundation Models

    Authors: Chao Ren, Han Yu, Hongyi Peng, Xiaoli Tang, Anran Li, Yulan Gao, Alysa Ziying Tan, Bo Zhao, Xiaoxiao Li, Zengxiang Li, Qiang Yang

    Abstract: The integration of Foundation Models (FMs) with Federated Learning (FL) presents a transformative paradigm in Artificial Intelligence (AI), offering enhanced capabilities while addressing concerns of privacy, data decentralization, and computational efficiency. This paper provides a comprehensive survey of the emerging field of Federated Foundation Models (FedFM), elucidating their synergistic rel… ▽ More

    Submitted 29 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Survey of Federated Foundation Models (FedFM)

  15. arXiv:2404.13244  [pdf, other

    cs.LG cs.GT

    Intelligent Agents for Auction-based Federated Learning: A Survey

    Authors: Xiaoli Tang, Han Yu, Xiaoxiao Li, Sarit Kraus

    Abstract: Auction-based federated learning (AFL) is an important emerging category of FL incentive mechanism design, due to its ability to fairly and efficiently motivate high-quality data owners to join data consumers' (i.e., servers') FL training tasks. To enhance the efficiency in AFL decision support for stakeholders (i.e., data consumers, data owners, and the auctioneer), intelligent agent-based techni… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  16. arXiv:2404.08472  [pdf, other

    cs.LG stat.ML

    TSLANet: Rethinking Transformers for Time Series Representation Learning

    Authors: Emadeldeen Eldele, Mohamed Ragab, Zhenghua Chen, Min Wu, Xiaoli Li

    Abstract: Time series data, characterized by its intrinsic long and short-range dependencies, poses a unique challenge across analytical applications. While Transformer-based models excel at capturing long-range dependencies, they face limitations in noise sensitivity, computational efficiency, and overfitting with smaller datasets. In response, we introduce a novel Time Series Lightweight Adaptive Network… ▽ More

    Submitted 6 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted in ICML 2024

  17. arXiv:2404.08408  [pdf, other

    cs.LG cs.AI eess.SP physics.geo-ph

    Seismic First Break Picking in a Higher Dimension Using Deep Graph Learning

    Authors: Hongtao Wang, Li Long, Jiangshe Zhang, Xiaoli Wei, Chunxia Zhang, Zhenbo Guo

    Abstract: Contemporary automatic first break (FB) picking methods typically analyze 1D signals, 2D source gathers, or 3D source-receiver gathers. Utilizing higher-dimensional data, such as 2D or 3D, incorporates global features, improving the stability of local picking. Despite the benefits, high-dimensional data requires structured input and increases computational demands. Addressing this, we propose a no… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  18. arXiv:2404.04887  [pdf, other

    cs.CV

    A Clinical-oriented Multi-level Contrastive Learning Method for Disease Diagnosis in Low-quality Medical Images

    Authors: Qingshan Hou, Shuai Cheng, Peng Cao, **zhu Yang, Xiaoli Liu, Osmar R. Zaiane, Yih Chung Tham

    Abstract: Representation learning offers a conduit to elucidate distinctive features within the latent space and interpret the deep models. However, the randomness of lesion distribution and the complexity of low-quality factors in medical images pose great challenges for models to extract key lesion features. Disease diagnosis methods guided by contrastive learning (CL) have shown significant advantages in… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  19. arXiv:2403.16424  [pdf

    cs.AI cs.DL cs.IR

    An Experiment with the Use of ChatGPT for LCSH Subject Assignment on Electronic Theses and Dissertations

    Authors: Eric H. C. Chow, TJ Kao, Xiaoli Li

    Abstract: This study delves into the potential use of Large Language Models (LLMs) for generating Library of Congress Subject Headings (LCSH). The authors employed ChatGPT to generate subject headings for electronic theses and dissertations (ETDs) based on their titles and summaries. The results revealed that although some generated subject headings were valid, there were issues regarding specificity and ex… ▽ More

    Submitted 3 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 20 pages

  20. From Hardware Fingerprint to Access Token: Enhancing the Authentication on IoT Devices

    Authors: Yue Xiao, Yi He, Xiaoli Zhang, Qian Wang, Renjie Xie, Kun Sun, Ke Xu, Qi Li

    Abstract: The proliferation of consumer IoT products in our daily lives has raised the need for secure device authentication and access control. Unfortunately, these resource-constrained devices typically use token-based authentication, which is vulnerable to token compromise attacks that allow attackers to impersonate the devices and perform malicious operations by stealing the access token. Using hardware… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  21. arXiv:2403.14734  [pdf, other

    cs.SE cs.AI cs.CL cs.PL

    A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

    Authors: Qiushi Sun, Zhirui Chen, Fangzhi Xu, Kanzhi Cheng, Chang Ma, Zhangyue Yin, Jianing Wang, Chengcheng Han, Renyu Zhu, Shuai Yuan, Qipeng Guo, Xipeng Qiu, Pengcheng Yin, Xiaoli Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu

    Abstract: Neural Code Intelligence -- leveraging deep learning to understand, generate, and optimize code -- holds immense potential for transformative impacts on the whole society. Bridging the gap between Natural Language and Programming Language, this domain has drawn significant attention from researchers in both research communities over the past few years. This survey presents a systematic and chronol… ▽ More

    Submitted 23 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 64 pages, 6 figures, 10 tables, 692 references

  22. arXiv:2403.14097  [pdf, other

    cs.DC

    Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances

    Authors: Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi, Dahua Lin, Harry Xu, Minjia Zhang, Zhihao Jia

    Abstract: Deep neural networks (DNNs) are becoming progressively large and costly to train. This paper aims to reduce DNN training costs by leveraging preemptible instances on modern clouds, which can be allocated at a much lower price when idle but may be preempted by the cloud provider at any time. Prior work that supports DNN training on preemptive instances employs a reactive approach to handling instan… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: NSDI '24

  23. arXiv:2403.10897  [pdf, other

    cs.CV cs.MM

    Rethinking Multi-view Representation Learning via Distilled Disentangling

    Authors: Guanzhou Ke, Bo Wang, Xiaoli Wang, Shengfeng He

    Abstract: Multi-view representation learning aims to derive robust representations that are both view-consistent and view-specific from diverse data sources. This paper presents an in-depth analysis of existing approaches in this domain, highlighting a commonly overlooked aspect: the redundancy between view-consistent and view-specific representations. To this end, we propose an innovative framework for mul… ▽ More

    Submitted 29 March, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  24. arXiv:2403.06737  [pdf, other

    cs.IR

    Post-Training Attribute Unlearning in Recommender Systems

    Authors: Chaochao Chen, Yizhao Zhang, Yuyuan Li, Dan Meng, Jun Wang, Xiaoli Zheng, Jianwei Yin

    Abstract: With the growing privacy concerns in recommender systems, recommendation unlearning is getting increasing attention. Existing studies predominantly use training data, i.e., model inputs, as unlearning target. However, attackers can extract private information from the model even if it has not been explicitly encountered during training. We name this unseen information as \textit{attribute} and tre… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.05847

  25. arXiv:2403.05102  [pdf, other

    cs.CV

    Enhancing Texture Generation with High-Fidelity Using Advanced Texture Priors

    Authors: Kuo Xu, Maoyu Wang, Muyu Wang, Lincong Feng, Tianhui Zhang, Xiaoli Liu

    Abstract: The recent advancements in 2D generation technology have sparked a widespread discussion on using 2D priors for 3D shape and texture content generation. However, these methods often overlook the subsequent user operations, such as texture aliasing and blurring that occur when the user acquires the 3D model and simplifies its structure. Traditional graphics methods partially alleviate this issue, b… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  26. arXiv:2403.03645  [pdf, other

    cs.AI

    K-Link: Knowledge-Link Graph from LLMs for Enhanced Representation Learning in Multivariate Time-Series Data

    Authors: Yucheng Wang, Ruibing **, Min Wu, Xiaoli Li, Lihua Xie, Zhenghua Chen

    Abstract: Sourced from various sensors and organized chronologically, Multivariate Time-Series (MTS) data involves crucial spatial-temporal dependencies, e.g., correlations among sensors. To capture these dependencies, Graph Neural Networks (GNNs) have emerged as powerful tools, yet their effectiveness is restricted by the quality of graph construction from MTS data. Typically, existing approaches construct… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 12 pages,7 figures

  27. arXiv:2402.18933  [pdf, other

    cs.CV

    Modality-Agnostic Structural Image Representation Learning for Deformable Multi-Modality Medical Image Registration

    Authors: Tony C. W. Mok, Zi Li, Yunhao Bai, Jianpeng Zhang, Wei Liu, Yan-Jie Zhou, Ke Yan, Dakai **, Yu Shi, Xiaoli Yin, Le Lu, Ling Zhang

    Abstract: Establishing dense anatomical correspondence across distinct imaging modalities is a foundational yet challenging procedure for numerous medical image analysis studies and image-guided radiotherapy. Existing multi-modality image registration algorithms rely on statistical-based similarity measures or local structural image representations. However, the former is sensitive to locally varying noise,… ▽ More

    Submitted 31 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR2024

  28. arXiv:2402.15061  [pdf, other

    cs.CL cs.LG

    Fine-tuning Large Language Models for Domain-specific Machine Translation

    Authors: Jiawei Zheng, Hanghai Hong, Xiaoli Wang, **gsong Su, Yonggui Liang, Shikai Wu

    Abstract: Large language models (LLMs) have made significant progress in machine translation (MT). However, their potential in domain-specific MT remains under-explored. Current LLM-based MT systems still face several challenges. First, for LLMs with in-context learning, their effectiveness is highly sensitive to input translation examples, and processing them can increase inference costs. They often requir… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 9 pages, 6 figures, 6tables

  29. arXiv:2402.11887  [pdf, other

    cs.LG

    Generative Semi-supervised Graph Anomaly Detection

    Authors: Hezhe Qiao, Qingsong Wen, Xiaoli Li, Ee-Peng Lim, Guansong Pang

    Abstract: This work considers a practical semi-supervised graph anomaly detection (GAD) scenario, where part of the nodes in a graph are known to be normal, contrasting to the extensively explored unsupervised setting with a fully unlabeled graph. We reveal that having access to the normal nodes, even just a small percentage of normal nodes, helps enhance the detection performance of existing unsupervised G… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 20 pages, 11 figures

  30. arXiv:2402.11604  [pdf, other

    cs.LG

    Self-evolving Autoencoder Embedded Q-Network

    Authors: J. Senthilnath, Bangjian Zhou, Zhen Wei Ng, Deeksha Aggarwal, Rajdeep Dutta, Ji Wei Yoon, Aye Phyu Phyu Aung, Keyu Wu, Min Wu, Xiaoli Li

    Abstract: In the realm of sequential decision-making tasks, the exploration capability of a reinforcement learning (RL) agent is paramount for achieving high rewards through interactions with the environment. To enhance this crucial ability, we propose SAQN, a novel approach wherein a self-evolving autoencoder (SA) is embedded with a Q-Network (QN). In SAQN, the self-evolving autoencoder architecture adapts… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 11 pages, 9 figures, 3 tables

  31. arXiv:2402.09167  [pdf, other

    cs.LG

    Evolving Restricted Boltzmann Machine-Kohonen Network for Online Clustering

    Authors: J. Senthilnath, Adithya Bhattiprolu, Ankur Singh, Bangjian Zhou, Min Wu, Jón Atli Benediktsson, Xiaoli Li

    Abstract: A novel online clustering algorithm is presented where an Evolving Restricted Boltzmann Machine (ERBM) is embedded with a Kohonen Network called ERBM-KNet. The proposed ERBM-KNet efficiently handles streaming data in a single-pass mode using the ERBM, employing a bias-variance strategy for neuron growing and pruning, as well as online clustering based on a cluster update strategy for cluster predi… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 9 pages, 11 figures, 3 tables

  32. arXiv:2402.04362  [pdf, other

    cs.LG

    Neural Networks Learn Statistics of Increasing Complexity

    Authors: Nora Belrose, Quintin Pope, Lucia Quirke, Alex Mallen, Xiaoli Fern

    Abstract: The distributional simplicity bias (DSB) posits that neural networks learn low-order moments of the data distribution first, before moving on to higher-order correlations. In this work, we present compelling new evidence for the DSB by showing that networks automatically learn to perform well on maximum-entropy distributions whose low-order statistics match those of the training set early in train… ▽ More

    Submitted 13 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  33. arXiv:2402.02526  [pdf, other

    cs.LG

    CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition

    Authors: Quang Pham, Giang Do, Huy Nguyen, TrungTin Nguyen, Chenghao Liu, Mina Sartipi, Binh T. Nguyen, Savitha Ramasamy, Xiaoli Li, Steven Hoi, Nhat Ho

    Abstract: Sparse mixture of experts (SMoE) offers an appealing solution to scale up the model complexity beyond the mean of increasing the network's depth or width. However, effective training of SMoE has proven to be challenging due to the representation collapse issue, which causes parameter redundancy and limited representation potentials. In this work, we propose a competition mechanism to address this… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  34. arXiv:2401.17786  [pdf, other

    cs.DB cs.PF

    A Graph-Native Query Optimization Framework

    Authors: Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, **gren Zhou

    Abstract: Graph queries that combine pattern matching with relational operations, referred as PatRelQuery, are widely used in many real-world applications. It allows users to identify arbitrary patterns in a graph and further perform in-depth relational analysis on the results. To effectively support PatRelQuery, two key challenges need to be addressed: (1) how to optimize PatRelQuery in a unified framework… ▽ More

    Submitted 5 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

  35. arXiv:2401.08940  [pdf

    cs.LG cs.AI

    CEL: A Continual Learning Model for Disease Outbreak Prediction by Leveraging Domain Adaptation via Elastic Weight Consolidation

    Authors: Saba Aslam, Abdur Rasool, Hongyan Wu, Xiaoli Li

    Abstract: Continual learning, the ability of a model to learn over time without forgetting previous knowledge and, therefore, be adaptive to new data, is paramount in dynamic fields such as disease outbreak prediction. Deep neural networks, i.e., LSTM, are prone to error due to catastrophic forgetting. This study introduces a novel CEL model for continual learning by leveraging domain adaptation via Elastic… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  36. arXiv:2401.05391  [pdf

    cs.AR cs.AI

    Efficient LLM inference solution on Intel GPU

    Authors: Hui Wu, Yi Gan, Feng Yuan, **g Ma, Wei Zhu, Yutao Xu, Hong Zhu, Yuhua Zhu, Xiaoli Liu, **ghui Gu, Peng Zhao

    Abstract: Transformer based Large Language Models (LLMs) have been widely used in many fields, and the efficiency of LLM inference becomes hot topic in real applications. However, LLMs are usually complicatedly designed in model structure with massive operations and perform inference in the auto-regressive mode, making it a challenging task to design a system with high efficiency. In this paper, we propos… ▽ More

    Submitted 23 June, 2024; v1 submitted 19 December, 2023; originally announced January 2024.

  37. A Change Point Detection Integrated Remaining Useful Life Estimation Model under Variable Operating Conditions

    Authors: Anushiya Arunan, Yan Qin, Xiaoli Li, Chau Yuen

    Abstract: By informing the onset of the degradation process, health status evaluation serves as a significant preliminary step for reliable remaining useful life (RUL) estimation of complex equipment. This paper proposes a novel temporal dynamics learning-based model for detecting change points of individual devices, even under variable operating conditions, and utilises the learnt change points to improve… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted in Control Engineering Practice Journal with DOI: https://doi.org/10.1016/j.conengprac.2023.105840

  38. arXiv:2401.04057  [pdf

    cs.IR cs.AI cs.SE

    Unveiling Bias in Fairness Evaluations of Large Language Models: A Critical Literature Review of Music and Movie Recommendation Systems

    Authors: Chandan Kumar Sah, Dr. Lian Xiaoli, Muhammad Mirajul Islam

    Abstract: The rise of generative artificial intelligence, particularly Large Language Models (LLMs), has intensified the imperative to scrutinize fairness alongside accuracy. Recent studies have begun to investigate fairness evaluations for LLMs within domains such as recommendations. Given that personalization is an intrinsic aspect of recommendation systems, its incorporation into fairness assessments is… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 10 pages

  39. arXiv:2401.00973  [pdf, other

    cs.LG cs.CR

    Facebook Report on Privacy of fNIRS data

    Authors: Md Imran Hossen, Sai Venkatesh Chilukoti, Liqun Shan, Vijay Srinivas Tida, Xiali Hei

    Abstract: The primary goal of this project is to develop privacy-preserving machine learning model training techniques for fNIRS data. This project will build a local model in a centralized setting with both differential privacy (DP) and certified robustness. It will also explore collaborative federated learning to train a shared model between multiple clients without sharing local fNIRS datasets. To preven… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 15 pages, 5 figures, 3 tables

    MSC Class: I.2.0

  40. arXiv:2312.15182  [pdf, other

    eess.IV cs.CV cs.LG

    Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation

    Authors: Haonan Wang, Peng Cao, Xiaoli Liu, **zhu Yang, Osmar Zaiane

    Abstract: Most state-of-the-art methods for medical image segmentation adopt the encoder-decoder architecture. However, this U-shaped framework still has limitations in capturing the non-local multi-scale information with a simple skip connection. To solve the problem, we firstly explore the potential weakness of skip connections in U-Net on multiple segmentation tasks, and find that i) not all skip connect… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  41. arXiv:2312.12107  [pdf, other

    cs.DC cs.DB

    GraphScope Flex: LEGO-like Graph Computing Stack

    Authors: Tao He, Shuxian Hu, Longbin Lai, Dongze Li, Neng Li, Xue Li, Lexiao Liu, Xiaojian Luo, Binqing Lyu, Ke Meng, Sijie Shen, Li Su, Lei Wang, **gbo Xu, Wenyuan Yu, Weibin Zeng, Lei Zhang, Siyuan Zhang, **gren Zhou, Xiaoli Zhou, Diwen Zhu

    Abstract: Graph computing has become increasingly crucial in processing large-scale graph data, with numerous systems developed for this purpose. Two years ago, we introduced GraphScope as a system addressing a wide array of graph computing needs, including graph traversal, analytics, and learning in one system. Since its inception, GraphScope has achieved significant technological advancements and gained w… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  42. arXiv:2312.07867  [pdf, other

    cs.AI cs.CL

    BESTMVQA: A Benchmark Evaluation System for Medical Visual Question Answering

    Authors: Xiaojie Hong, Zixin Song, Liangzhi Li, Xiaoli Wang, Feiyan Liu

    Abstract: Medical Visual Question Answering (Med-VQA) is a very important task in healthcare industry, which answers a natural language question with a medical image. Existing VQA techniques in information systems can be directly applied to solving the task. However, they often suffer from (i) the data insufficient problem, which makes it difficult to train the state of the arts (SOTAs) for the domain-speci… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  43. arXiv:2312.07035  [pdf, other

    cs.LG cs.AI

    HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts

    Authors: Giang Do, Khiem Le, Quang Pham, TrungTin Nguyen, Thanh-Nam Doan, Bint T. Nguyen, Chenghao Liu, Savitha Ramasamy, Xiaoli Li, Steven Hoi

    Abstract: By routing input tokens to only a few split experts, Sparse Mixture-of-Experts has enabled efficient training of large language models. Recent findings suggest that fixing the routers can achieve competitive performance by alleviating the collapsing problem, where all experts eventually learn similar representations. However, this strategy has two key limitations: (i) the policy derived from rando… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  44. arXiv:2312.06966  [pdf, other

    cs.IT eess.SP

    How Much Data is Needed for Channel Knowledge Map Construction?

    Authors: Xiaoli Xu, Yong Zeng

    Abstract: Channel knowledge map (CKM) has been recently proposed to enable environment-aware communications by utilizing historical or simulation generated wireless channel data. This paper studies the construction of one particular type of CKM, namely channel gain map (CGM), by using a finite number of measurements or simulation-generated data, with model-based spatial channel prediction. We try to answer… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  45. arXiv:2311.15566  [pdf, other

    cs.DC cs.CL cs.LG

    SpotServe: Serving Generative Large Language Models on Preemptible Instances

    Authors: Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi, Dahua Lin, Bin Cui, Zhihao Jia

    Abstract: The high computational and memory requirements of generative large language models (LLMs) make it challenging to serve them cheaply. This paper aims to reduce the monetary cost for serving LLMs by leveraging preemptible GPU instances on modern clouds, which offer accesses to spare GPUs at a much cheaper price than regular instances but may be preempted by the cloud at any time. Serving LLMs on pre… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: ASPLOS 2024

  46. arXiv:2311.12548  [pdf, other

    cs.AI

    Multi-Session Budget Optimization for Forward Auction-based Federated Learning

    Authors: Xiaoli Tang, Han Yu

    Abstract: Auction-based Federated Learning (AFL) has emerged as an important research field in recent years. The prevailing strategies for FL model users (MUs) assume that the entire team of the required data owners (DOs) for an FL task must be assembled before training can commence. In practice, an MU can trigger the FL training process multiple times. DOs can thus be gradually recruited over multiple FL m… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  47. arXiv:2311.10806  [pdf, other

    cs.LG cs.AI

    SEA++: Multi-Graph-based High-Order Sensor Alignment for Multivariate Time-Series Unsupervised Domain Adaptation

    Authors: Yucheng Wang, Yuecong Xu, Jianfei Yang, Min Wu, Xiaoli Li, Lihua Xie, Zhenghua Chen

    Abstract: Unsupervised Domain Adaptation (UDA) methods have been successful in reducing label dependency by minimizing the domain discrepancy between a labeled source domain and an unlabeled target domain. However, these methods face challenges when dealing with Multivariate Time-Series (MTS) data. MTS data typically consist of multiple sensors, each with its own unique distribution. This characteristic mak… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  48. arXiv:2311.10123  [pdf, other

    cs.CV

    MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture

    Authors: Lincong Feng, Muyu Wang, Maoyu Wang, Kuo Xu, Xiaoli Liu

    Abstract: Generative models for 3D object synthesis have seen significant advancements with the incorporation of prior knowledge distilled from 2D diffusion models. Nevertheless, challenges persist in the form of multi-view geometric inconsistencies and slow generation speeds within the existing 3D synthesis frameworks. This can be attributed to two factors: firstly, the deficiency of abundant geometric a p… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.17843, arXiv:2209.14988 by other authors

  49. arXiv:2311.09574  [pdf, other

    cs.LG cs.AI cs.CV

    LymphoML: An interpretable artificial intelligence-based method identifies morphologic features that correlate with lymphoma subtype

    Authors: Vivek Shankar, Xiaoli Yang, Vrishab Krishna, Brent Tan, Oscar Silva, Rebecca Rojansky, Andrew Ng, Fabiola Valvert, Edward Briercheck, David Weinstock, Yasodha Natkunam, Sebastian Fernandez-Pol, Pranav Rajpurkar

    Abstract: The accurate classification of lymphoma subtypes using hematoxylin and eosin (H&E)-stained tissue is complicated by the wide range of morphological features these cancers can exhibit. We present LymphoML - an interpretable machine learning method that identifies morphologic features that correlate with lymphoma subtypes. Our method applies steps to process H&E-stained tissue microarray cores, segm… ▽ More

    Submitted 19 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: To be published in Proceedings of the 3rd Machine Learning for Health symposium, Proceedings of Machine Learning Research (PMLR)

    ACM Class: I.5.1; I.5.2; I.5.4; J.3

  50. arXiv:2311.07843  [pdf, ps, other

    cs.IT eess.SP

    On the IRS Deployment in Smart Factories Considering Blockage Effects: Collocated or Distributed?

    Authors: Yixin Zhang, Saeed R. Khosravirad, Xiaoli Chu, Mikko A. Uusitalo

    Abstract: In this article, we study the collocated and distributed deployment of intelligent reflecting surfaces (IRS) for a fixed total number of IRS elements to support enhanced mobile broadband (eMBB) and ultra-reliable low-latency communication (URLLC) services inside a factory. We build a channel model that incorporates the line-of-sight (LOS) probability and power loss of each transmission path, and p… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.