Skip to main content

Showing 1–50 of 322 results for author: wei, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01178  [pdf, other

    cs.CL cs.AI cs.LG

    $\text{Memory}^3$: Language Modeling with Explicit Memory

    Authors: Hongkang Yang, Zehao Lin, Wen** Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, **bo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong, Linpeng Tang, Weinan E

    Abstract: The training and inference of large language models (LLMs) are together a costly process that transports knowledge from raw data to meaningful computation. Inspired by the memory hierarchy of the human brain, we reduce this cost by equip** LLMs with explicit memory, a memory format cheaper than model parameters and text retrieval-augmented generation (RAG). Conceptually, with most of its knowled… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2407.01067  [pdf, other

    cs.AI cs.CL cs.CV cs.HC cs.LG

    Human-like object concept representations emerge naturally in multimodal large language models

    Authors: Changde Du, Kaicheng Fu, Bincheng Wen, Yi Sun, Jie Peng, Wei Wei, Ying Gao, Shengpei Wang, Chuncheng Zhang, **peng Li, Shuang Qiu, Le Chang, Huiguang He

    Abstract: The conceptualization and categorization of natural objects in the human mind have long intrigued cognitive scientists and neuroscientists, offering crucial insights into human perception and cognition. Recently, the rapid development of Large Language Models (LLMs) has raised the attractive question of whether these models can also develop human-like object representations through exposure to vas… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.15480  [pdf, other

    cs.CL cs.AI cs.LG

    On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion

    Authors: Chenghao Fan, Zhenyi Lu, Wei Wei, Jie Tian, Xiaoye Qu, Dangyang Chen, Yu Cheng

    Abstract: Efficient fine-tuning of large language models for task-specific applications is imperative, yet the vast number of parameters in these models makes their training increasingly challenging. Despite numerous proposals for effective methods, a substantial memory overhead remains for gradient computations during updates. \thm{Can we fine-tune a series of task-specific small models and transfer their… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: submit under review

  4. arXiv:2406.15479  [pdf, other

    cs.CL cs.AI cs.LG

    Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

    Authors: Zhenyi Lu, Chenghao Fan, Wei Wei, Xiaoye Qu, Dangyang Chen, Yu Cheng

    Abstract: In the era of large language models, model merging is a promising way to combine multiple task-specific models into a single multitask model without extra training. However, two challenges remain: (a) interference between different models and (b) heterogeneous data during testing. Traditional model merging methods often show significant performance gaps compared to fine-tuned models due to these i… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: submit in review

  5. arXiv:2406.13672  [pdf, other

    cs.CV

    Q-SNNs: Quantized Spiking Neural Networks

    Authors: Wenjie Wei, Yu Liang, Ammar Belatreche, Yichen Xiao, Honglin Cao, Zhenbang Ren, Guoqing Wang, Malu Zhang, Yang Yang

    Abstract: Brain-inspired Spiking Neural Networks (SNNs) leverage sparse spikes to represent information and process them in an asynchronous event-driven manner, offering an energy-efficient paradigm for the next generation of machine intelligence. However, the current focus within the SNN community prioritizes accuracy optimization through the development of large-scale models, limiting their viability in r… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 8 pages, 5 figures

  6. arXiv:2406.13179  [pdf, other

    cs.SD cs.AI cs.NE eess.AS

    Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting

    Authors: Shuai Wang, Dehao Zhang, Kexin Shi, Yuchen Wang, Wenjie Wei, Jibin Wu, Malu Zhang

    Abstract: Thanks to Deep Neural Networks (DNNs), the accuracy of Keyword Spotting (KWS) has made substantial progress. However, as KWS systems are usually implemented on edge devices, energy efficiency becomes a critical requirement besides performance. Here, we take advantage of spiking neural networks' energy efficiency and propose an end-to-end lightweight KWS model. The model consists of two innovative… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.12189  [pdf, other

    cs.DC

    Energy-aware Incremental OTA Update for Flash-based Batteryless IoT Devices

    Authors: Wei Wei, Jishnu Banerjee, Sahidul Islam, Chen Pan, Mimi Xie

    Abstract: Over-the-air (OTA) firmware updates are essential for updating and maintaining IoT devices, especially those batteryless devices reliant on energy harvesting power sources. Flash memory, favored for its low cost and high density, is extensively used for data storage in many IoT devices. However, due to its high energy demands for update operations, there is often insufficient energy for code updat… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages

  8. arXiv:2406.11781  [pdf, other

    cs.IR

    DiffMM: Multi-Modal Diffusion Model for Recommendation

    Authors: Yangqin Jiang, Lianghao Xia, Wei Wei, Da Luo, Kangyi Lin, Chao Huang

    Abstract: The rise of online multi-modal sharing platforms like TikTok and YouTube has enabled personalized recommender systems to incorporate multiple modalities (such as visual, textual, and acoustic) into user representations. However, addressing the challenge of data sparsity in these systems remains a key issue. To address this limitation, recent research has introduced self-supervised learning techniq… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  9. arXiv:2406.09860  [pdf, other

    cs.LG cs.AI cs.CV

    Dataset Condensation with Latent Quantile Matching

    Authors: Wei Wei, Tom De Schepper, Kevin Mets

    Abstract: Dataset condensation (DC) methods aim to learn a smaller synthesized dataset with informative data records to accelerate the training of machine learning models. Current distribution matching (DM) based DC methods learn a synthesized dataset by matching the mean of the latent embeddings between the synthetic and the real dataset. However two distributions with the same mean can still be vastly dif… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR Workshop 2024: 1st Workshop on Dataset Distillation for Computer Vision

  10. arXiv:2406.09126  [pdf, other

    cs.CV

    Auto-Vocabulary Segmentation for LiDAR Points

    Authors: Weijie Wei, Osman Ülger, Fatemeh Karimi Najadasl, Theo Gevers, Martin R. Oswald

    Abstract: Existing perception methods for autonomous driving fall short of recognizing unknown entities not covered in the training data. Open-vocabulary methods offer promising capabilities in detecting any object but are limited by user-specified queries representing target classes. We propose AutoVoc3D, a framework for automatic object class recognition and open-ended segmentation. Evaluation on nuScenes… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR 2024 OpenSun3D Workshop

  11. arXiv:2406.07001  [pdf, other

    cs.CL cs.AI

    Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models

    Authors: Zhenyi Lu, Jie Tian, Wei Wei, Xiaoye Qu, Yu Cheng, Wenfeng xie, Dangyang Chen

    Abstract: Text classification is a crucial task encountered frequently in practical scenarios, yet it is still under-explored in the era of large language models (LLMs). This study shows that LLMs are vulnerable to changes in the number and arrangement of options in text classification. Our extensive empirical analyses reveal that the key bottleneck arises from ambiguous decision boundaries and inherent bia… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL2024 findings

  12. arXiv:2406.06559  [pdf, other

    cs.CL cs.AI cs.LG

    Harnessing Business and Media Insights with Large Language Models

    Authors: Yujia Bao, Ankit Parag Shah, Neeru Narang, Jonathan Rivers, Rajeev Maksey, Lan Guan, Louise N. Barrere, Shelley Evenson, Rahul Basole, Connie Miao, Ankit Mehta, Fabien Boulay, Su Min Park, Natalie E. Pearson, Eldhose Joy, Tiger He, Sumiran Thakur, Koustav Ghosal, Josh On, Phoebe Morrison, Tim Major, Eva Siqi Wang, Gina Escobar, Jiaheng Wei, Tharindu Cyril Weerasooriya , et al. (8 additional authors not shown)

    Abstract: This paper introduces Fortune Analytics Language Model (FALM). FALM empowers users with direct access to comprehensive business analysis, including market trends, company performance metrics, and expert insights. Unlike generic LLMs, FALM leverages a curated knowledge base built from professional journalism, enabling it to deliver precise and in-depth answers to intricate business questions. Users… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  13. arXiv:2406.03805  [pdf, other

    cs.CR

    AutoJailbreak: Exploring Jailbreak Attacks and Defenses through a Dependency Lens

    Authors: Lin Lu, Hai Yan, Zenghui Yuan, Jiawen Shi, Wenqi Wei, Pin-Yu Chen, Pan Zhou

    Abstract: Jailbreak attacks in large language models (LLMs) entail inducing the models to generate content that breaches ethical and legal norm through the use of malicious prompts, posing a substantial threat to LLM security. Current strategies for jailbreak attack and defense often focus on optimizing locally within specific algorithmic frameworks, resulting in ineffective optimization and limited scalabi… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 32 pages, 2 figures

  14. arXiv:2406.02002  [pdf, other

    cs.CL cs.AI

    Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue

    Authors: Shixuan Fan, Wei Wei, Wendi Li, Xian-Ling Mao, Wenfeng Xie, Dangyang Chen

    Abstract: The core of the dialogue system is to generate relevant, informative, and human-like responses based on extensive dialogue history. Recently, dialogue generation domain has seen mainstream adoption of large language models (LLMs), due to its powerful capability in generating utterances. However, there is a natural deficiency for such models, that is, inherent position bias, which may lead them to… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to IJCAI 2024

  15. arXiv:2406.01988  [pdf, other

    cs.CL cs.AI

    Personalized Topic Selection Model for Topic-Grounded Dialogue

    Authors: Shixuan Fan, Wei Wei, Xiaofei Wen, Xianling Mao, Jixiong Chen, Dangyang Chen

    Abstract: Recently, the topic-grounded dialogue (TGD) system has become increasingly popular as its powerful capability to actively guide users to accomplish specific tasks through topic-guided conversations. Most existing works utilize side information (\eg topics or personas) in isolation to enhance the topic selection ability. However, due to disregarding the noise within these auxiliary information sour… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  16. arXiv:2406.01425  [pdf, other

    cs.CV

    Sensitivity-Informed Augmentation for Robust Segmentation

    Authors: Laura Zheng, Wenjie Wei, Tony Wu, Jacob Clements, Shreelekha Revankar, Andre Harrison, Yu Shen, Ming C. Lin

    Abstract: Segmentation is an integral module in many visual computing applications such as virtual try-on, medical imaging, autonomous driving, and agricultural automation. These applications often involve either widespread consumer use or highly variable environments, both of which can degrade the quality of visual sensor data, whether from a common mobile phone or an expensive satellite imaging camera. In… ▽ More

    Submitted 16 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages

  17. arXiv:2406.01213  [pdf, other

    cs.CL cs.AI

    Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition

    Authors: Zhuojun Ding, Wei Wei, Xiaoye Qu, Dangyang Chen

    Abstract: Cross-lingual named entity recognition (NER) aims to train an NER model for the target language leveraging only labeled source language data and unlabeled target language data. Prior approaches either perform label projection on translated source language data or employ a source model to assign pseudo labels for target language data and train a target model on these pseudo-labeled data to generali… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IJCAI 2024

  18. arXiv:2406.01027  [pdf, other

    cs.DB cs.LG

    PRICE: A Pretrained Model for Cross-Database Cardinality Estimation

    Authors: Tian**g Zeng, Junwei Lan, Jiahong Ma, Wenqing Wei, Rong Zhu, Pengfei Li, Bolin Ding, Defu Lian, Zhewei Wei, **gren Zhou

    Abstract: Cardinality estimation (CardEst) is essential for optimizing query execution plans. Recent ML-based CardEst methods achieve high accuracy but face deployment challenges due to high preparation costs and lack of transferability across databases. In this paper, we propose PRICE, a PRetrained multI-table CardEst model, which addresses these limitations. PRICE takes low-level but transferable features… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  19. Knowledge Enhanced Multi-intent Transformer Network for Recommendation

    Authors: Ding Zou, Wei Wei, Feida Zhu, Chuanyu Xu, Tao Zhang, Chengfu Huo

    Abstract: Incorporating Knowledge Graphs into Recommendation has attracted growing attention in industry, due to the great potential of KG in providing abundant supplementary information and interpretability for the underlying models. However, simply integrating KG into recommendation usually brings in negative feedback in industry, due to the ignorance of the following two factors: i) users' multiple inten… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accept By The Web Conf 2024 (WWW 2024) Industry Track. arXiv admin note: text overlap with arXiv:2204.08807

  20. arXiv:2405.18757  [pdf, other

    cs.RO

    Multi-objective Cross-task Learning via Goal-conditioned GPT-based Decision Transformers for Surgical Robot Task Automation

    Authors: Jiawei Fu, Yonghao Long, Kai Chen, Wang Wei, Qi Dou

    Abstract: Surgical robot task automation has been a promising research topic for improving surgical efficiency and quality. Learning-based methods have been recognized as an interesting paradigm and been increasingly investigated. However, existing approaches encounter difficulties in long-horizon goal-conditioned tasks due to the intricate compositional structure, which requires decision-making for a seque… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  21. arXiv:2405.16707  [pdf, other

    cs.CR

    Visualizing the Shadows: Unveiling Data Poisoning Behaviors in Federated Learning

    Authors: Xueqing Zhang, Junkai Zhang, Ka-Ho Chow, Juntao Chen, Ying Mao, Mohamed Rahouti, Xiang Li, Yuchen Liu, Wenqi Wei

    Abstract: This demo paper examines the susceptibility of Federated Learning (FL) systems to targeted data poisoning attacks, presenting a novel system for visualizing and mitigating such threats. We simulate targeted data poisoning attacks via label flip** and analyze the impact on model performance, employing a five-component system that includes Simulation and Data Generation, Data Collection and Upload… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  22. arXiv:2405.14170  [pdf, other

    cs.AI cs.CL

    Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph Reasoning

    Authors: Jiapu Wang, Kai Sun, Linhao Luo, Wei Wei, Yongli Hu, Alan Wee-Chung Liew, Shirui Pan, Baocai Yin

    Abstract: Temporal Knowledge Graph Reasoning (TKGR) is the process of utilizing temporal information to capture complex relations within a Temporal Knowledge Graph (TKG) to infer new knowledge. Conventional methods in TKGR typically depend on deep learning algorithms or temporal logical rules. However, deep learning-based TKGRs often lack interpretability, whereas rule-based TKGRs struggle to effectively le… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  23. arXiv:2405.11333  [pdf, other

    cs.LG cs.AI

    GinAR: An End-To-End Multivariate Time Series Forecasting Model Suitable for Variable Missing

    Authors: Chengqing Yu, Fei Wang, Zezhi Shao, Tangwen Qian, Zhao Zhang, Wei Wei, Yongjun Xu

    Abstract: Multivariate time series forecasting (MTSF) is crucial for decision-making to precisely forecast the future values/trends, based on the complex relationships identified from historical observations of multiple sequences. Recently, Spatial-Temporal Graph Neural Networks (STGNNs) have gradually become the theme of MTSF model as their powerful capability in mining spatial-temporal dependencies, but a… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024 (Research track)

  24. arXiv:2405.08638  [pdf, other

    cs.LG

    vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement

    Authors: Yiwen Zhu, **yi Liu, Wenya Wei, Qianyi Fu, Yu**g Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan

    Abstract: Reinforcement Learning (RL) is a widely employed technique in decision-making problems, encompassing two fundamental operations -- policy evaluation and policy improvement. Enhancing learning efficiency remains a key challenge in RL, with many efforts focused on using ensemble critics to boost policy evaluation efficiency. However, when using multiple critics, the actor in the policy improvement p… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024, with appendix

  25. arXiv:2405.04514  [pdf, other

    quant-ph cs.DC

    Scalable Circuit Cutting and Scheduling in a Resource-constrained and Distributed Quantum System

    Authors: Shuwen Kan, Zefan Du, Miguel Palma, Samuel A Stein, Chenxu Liu, Wenqi Wei, Juntao Chen, Ang Li, Ying Mao

    Abstract: Despite quantum computing's rapid development, current systems remain limited in practical applications due to their limited qubit count and quality. Various technologies, such as superconducting, trapped ions, and neutral atom quantum computing technologies are progressing towards a fault tolerant era, however they all face a diverse set of challenges in scalability and control. Recent efforts ha… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  26. arXiv:2404.17876  [pdf, other

    cs.CV

    DF-SLAM: Dictionary Factors Representation for High-Fidelity Neural Implicit Dense Visual SLAM System

    Authors: Weifeng Wei, Jie Wang, Shuqi Deng, Jie Liu

    Abstract: We introduce a high-fidelity neural implicit dense visual Simultaneous Localization and Map** (SLAM) system, termed DF-SLAM. In our work, we employ dictionary factors for scene representation, encoding the geometry and appearance information of the scene as a combination of basis and coefficient factors. Compared to neural implicit dense visual SLAM methods that directly encode scene information… ▽ More

    Submitted 25 June, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

  27. arXiv:2404.17136  [pdf, other

    cs.DB cs.AI cs.CL

    Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study

    Authors: Yang Wu, Yao Wan, Hongyu Zhang, Yulei Sui, Wucai Wei, Wei Zhao, Guandong Xu, Hai **

    Abstract: The Natural Language to Visualization (NL2Vis) task aims to transform natural-language descriptions into visual representations for a grounded table, enabling users to gain insights from vast amounts of data. Recently, many deep learning-based approaches have been developed for NL2Vis. Despite the considerable efforts made by these approaches, challenges persist in visualizing data sourced from un… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  28. arXiv:2404.03354  [pdf, other

    cs.IR cs.AI

    A Comprehensive Survey on Self-Supervised Learning for Recommendation

    Authors: Xubin Ren, Wei Wei, Lianghao Xia, Chao Huang

    Abstract: Recommender systems play a crucial role in tackling the challenge of information overload by delivering personalized recommendations based on individual user preferences. Deep learning techniques, such as RNNs, GNNs, and Transformer architectures, have significantly propelled the advancement of recommender systems by enhancing their comprehension of user behaviors and preferences. However, supervi… ▽ More

    Submitted 7 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  29. arXiv:2404.00684  [pdf, other

    cs.IR cs.AI

    Generative Retrieval as Multi-Vector Dense Retrieval

    Authors: Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren

    Abstract: Generative retrieval generates identifiers of relevant documents in an end-to-end manner using a sequence-to-sequence architecture for a given query. The relation between generative retrieval and other retrieval methods, especially those based on matching within dense retrieval models, is not yet fully comprehended. Prior work has demonstrated that generative retrieval with atomic identifiers is e… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 12 pages, 5 figures, 8 tables, accepted at SIGIR 2024

  30. arXiv:2403.13101  [pdf, other

    cs.LG cs.AI cs.DC

    AdaptSFL: Adaptive Split Federated Learning in Resource-constrained Edge Networks

    Authors: Zheng Lin, Guanqiao Qu, Wei Wei, Xianhao Chen, Kin K. Leung

    Abstract: The increasing complexity of deep neural networks poses significant barriers to democratizing them to resource-limited edge devices. To address this challenge, split federated learning (SFL) has emerged as a promising solution by of floading the primary training workload to a server via model partitioning while enabling parallel training among edge devices. However, although system optimization su… ▽ More

    Submitted 22 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 15 pages, 10 figures

  31. arXiv:2403.11558  [pdf, other

    cs.CL cs.AI

    Reinforcement Learning with Token-level Feedback for Controllable Text Generation

    Authors: Wendi Li, Wei Wei, Kaihe Xu, Wenfeng Xie, Dangyang Chen, Yu Cheng

    Abstract: To meet the requirements of real-world applications, it is essential to control generations of large language models (LLMs). Prior research has tried to introduce reinforcement learning (RL) into controllable text generation while most existing methods suffer from overfitting issues (finetuning-based methods) or semantic collapse (post-processing methods). However, current RL methods are generally… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024 Findings

  32. A Novel Mutual Insurance Model for Hedging Against Cyber Risks in Power Systems Deploying Smart Technologies

    Authors: Pikkin Lau, Lingfeng Wang, Wei Wei, Zhaoxi Liu, Chee-Wooi Ten

    Abstract: In this paper, a novel cyber-insurance model design is proposed based on system risk evaluation with smart technology applications. The cyber insurance policy for power systems is tailored via cyber risk modeling, reliability impact analysis, and insurance premium calculation. A stochastic Epidemic Network Model is developed to evaluate the cyber risk by propagating cyberattacks among graphical vu… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Power system reliability, cyber-insurance, power system security, cyber-physical systems, cyber risk modeling, actuarial design, tail risk

    Journal ref: in IEEE Transactions on Power Systems, vol. 38, no. 1, pp. 630-642, Jan. 2023

  33. arXiv:2403.05868  [pdf, other

    cs.RO

    Toward Understanding Key Estimation in Learning Robust Humanoid Locomotion

    Authors: Zhicheng Wang, Wandi Wei, Ruiqi Yu, Jun Wu, Qiuguo Zhu

    Abstract: Accurate state estimation plays a critical role in ensuring the robust control of humanoid robots, particularly in the context of learning-based control policies for legged robots. However, there is a notable gap in analytical research concerning estimations. Therefore, we endeavor to further understand how various types of estimations influence the decision-making processes of policies. In this p… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  34. arXiv:2403.05606  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data

    Authors: Yifan Wu, Yang Liu, Yue Yang, Michael S. Yao, Wenli Yang, Xuehui Shi, Lihong Yang, Dongjun Li, Yueming Liu, James C. Gee, Xuan Yang, Wenbin Wei, Shi Gu

    Abstract: Diagnosing rare diseases presents a common challenge in clinical practice, necessitating the expertise of specialists for accurate identification. The advent of machine learning offers a promising solution, while the development of such technologies is hindered by the scarcity of data on rare conditions and the demand for models that are both interpretable and trustworthy in a clinical context. In… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  35. arXiv:2403.04283  [pdf, other

    cs.CL cs.AI cs.LG

    Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

    Authors: Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, Mingchuan yang

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is the prevailing approach to ensure Large Language Models (LLMs) align with human values. However, existing RLHF methods require a high computational cost, one main reason being that RLHF assigns both the generation and alignment tasks to the LLM simultaneously. In this paper, we introduce Proxy-RLHF, which decouples the generation and alignment p… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  36. arXiv:2403.00270  [pdf, other

    cs.NE cs.CV

    Event-Driven Learning for Spiking Neural Networks

    Authors: Wenjie Wei, Malu Zhang, Jilin Zhang, Ammar Belatreche, Jibin Wu, Zi**g Xu, Xuerui Qiu, Hong Chen, Yang Yang, Haizhou Li

    Abstract: Brain-inspired spiking neural networks (SNNs) have gained prominence in the field of neuromorphic computing owing to their low energy consumption during feedforward inference on neuromorphic hardware. However, it remains an open challenge how to effectively benefit from the sparse event-driven property of SNNs to minimize backpropagation learning costs. In this paper, we conduct a comprehensive ex… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  37. arXiv:2402.17188  [pdf, other

    cs.IR

    PromptMM: Multi-Modal Knowledge Distillation for Recommendation with Prompt-Tuning

    Authors: Wei Wei, Jiabin Tang, Yangqin Jiang, Lianghao Xia, Chao Huang

    Abstract: Multimedia online platforms (e.g., Amazon, TikTok) have greatly benefited from the incorporation of multimedia (e.g., visual, textual, and acoustic) content into their personal recommender systems. These modalities provide intuitive semantics that facilitate modality-aware user preference modeling. However, two key challenges in multi-modal recommenders remain unresolved: i) The introduction of mu… ▽ More

    Submitted 10 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: WWW 2024

  38. arXiv:2402.16024  [pdf, other

    cs.CL cs.LG

    HiGPT: Heterogeneous Graph Language Model

    Authors: Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Long Xia, Dawei Yin, Chao Huang

    Abstract: Heterogeneous graph learning aims to capture complex relationships and diverse relational semantics among entities in a heterogeneous graph to obtain meaningful representations for nodes and edges. Recent advancements in heterogeneous graph neural networks (HGNNs) have achieved state-of-the-art performance by considering relation heterogeneity and using specialized message functions and aggregatio… ▽ More

    Submitted 18 May, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted by KDD'2024, full paper

  39. arXiv:2402.15183  [pdf, other

    cs.LG cs.AI

    GraphEdit: Large Language Models for Graph Structure Learning

    Authors: Zirui Guo, Lianghao Xia, Yanhua Yu, Yuling Wang, Zixuan Yang, Wei Wei, Liang Pang, Tat-Seng Chua, Chao Huang

    Abstract: Graph Structure Learning (GSL) focuses on capturing intrinsic dependencies and interactions among nodes in graph-structured data by generating novel graph structures. Graph Neural Networks (GNNs) have emerged as promising GSL solutions, utilizing recursive message passing to encode node-wise inter-dependencies. However, many existing GSL methods heavily depend on explicit graph structural informat… ▽ More

    Submitted 5 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  40. arXiv:2402.13537  [pdf, other

    cs.CV

    EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization

    Authors: Zhendong Xiao, Changhao Chen, Shan Yang, Wu Wei

    Abstract: Camera relocalization is pivotal in computer vision, with applications in AR, drones, robotics, and autonomous driving. It estimates 3D camera position and orientation (6-DoF) from images. Unlike traditional methods like SLAM, recent strides use deep learning for direct end-to-end pose estimation. We propose EffLoc, a novel efficient Vision Transformer for single-image camera relocalization. EffLo… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 8 pages, 6 figures, ICRA 2024 accepted

  41. arXiv:2402.11196  [pdf, other

    cs.LG cs.AI

    Maintaining Adversarial Robustness in Continuous Learning

    Authors: Xiaolei Ru, Xiaowei Cao, Zijia Liu, Jack Murdoch Moore, Xin-Ya Zhang, Xia Zhu, Wenjia Wei, Gang Yan

    Abstract: Adversarial robustness is essential for security and reliability of machine learning systems. However, the adversarial robustness gained by sophisticated defense algorithms is easily erased as the neural network evolves to learn new tasks. This vulnerability can be addressed by fostering a novel capability for neural networks, termed continual robust learning, which focuses on both the (classifica… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  42. arXiv:2402.10085  [pdf, other

    cs.NI cs.LG

    Develop End-to-End Anomaly Detection System

    Authors: Emanuele Mengoli, Zhiyuan Yao, Wutao Wei

    Abstract: Anomaly detection plays a crucial role in ensuring network robustness. However, implementing intelligent alerting systems becomes a challenge when considering scenarios in which anomalies can be caused by both malicious and non-malicious events, leading to the difficulty of determining anomaly patterns. The lack of labeled data in the computer networking domain further exacerbates this issue, impe… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  43. arXiv:2402.08714  [pdf, other

    cs.LG cs.AI

    PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models

    Authors: Fei Deng, Qifei Wang, Wei Wei, Matthias Grundmann, Tingbo Hou

    Abstract: Reward finetuning has emerged as a promising approach to aligning foundation models with downstream objectives. Remarkable success has been achieved in the language domain by using reinforcement learning (RL) to maximize rewards that reflect human preference. However, in the vision domain, existing RL-based reward finetuning methods are limited by their instability in large-scale training, renderi… ▽ More

    Submitted 27 March, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: CVPR 2024. Project page: https://fdeng18.github.io/prdp

  44. arXiv:2402.08219  [pdf, other

    cs.CL cs.AI cs.LG

    BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models

    Authors: Haotian Sun, Yuchen Zhuang, Wei Wei, Chao Zhang, Bo Dai

    Abstract: Adapting state-of-the-art Large Language Models (LLMs) like GPT-4 and Gemini for specific tasks is challenging. Due to the opacity in their parameters, embeddings, and even output probabilities, existing fine-tuning adaptation methods are inapplicable. Consequently, adapting these black-box LLMs is only possible through their API services, raising concerns about transparency, privacy, and cost. To… ▽ More

    Submitted 28 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 25 pages, 10 figures

  45. arXiv:2402.01181  [pdf, other

    cs.RO cs.GR

    Efficient Physically-based Simulation of Soft Bodies in Embodied Environment for Surgical Robot

    Authors: Zhenya Yang, Yonghao Long, Kai Chen, Wang Wei, Qi Dou

    Abstract: Surgical robot simulation platform plays a crucial role in enhancing training efficiency and advancing research on robot learning. Much effort have been made by scholars on develo** open-sourced surgical robot simulators to facilitate research. We also developed SurRoL formerly, an open-source, da Vinci Research Kit (dVRK) compatible and interactive embodied environment for robot learning. Despi… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 8 pages

  46. arXiv:2402.01096  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance

    Authors: Wenqi Wei, Ling Liu

    Abstract: Emerging Distributed AI systems are revolutionizing big data computing and data processing capabilities with growing economic and societal impact. However, recent studies have identified new attack surfaces and risks caused by security, privacy, and fairness issues in AI systems. In this paper, we review representative techniques, algorithms, and theoretical foundations for trustworthy distributed… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Manuscript accepted to ACM Computing Surveys

  47. Benchmarking Sensitivity of Continual Graph Learning for Skeleton-Based Action Recognition

    Authors: Wei Wei, Tom De Schepper, Kevin Mets

    Abstract: Continual learning (CL) is the research field that aims to build machine learning models that can accumulate knowledge continuously over different tasks without retraining from scratch. Previous studies have shown that pre-training graph neural networks (GNN) may lead to negative transfer (Hu et al., 2020) after fine-tuning, a setting which is closely related to CL. Thus, we focus on studying GNN… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: This work is accepted at VISAPP 2024 as a short paper

  48. arXiv:2401.15098  [pdf, other

    cs.LG cs.AI

    Hierarchical Continual Reinforcement Learning via Large Language Model

    Authors: Chaofan Pan, Xin Yang, Hao Wang, Wei Wei, Tianrui Li

    Abstract: The ability to learn continuously in dynamic environments is a crucial requirement for reinforcement learning (RL) agents applying in the real world. Despite the progress in continual reinforcement learning (CRL), existing methods often suffer from insufficient knowledge transfer, particularly when the tasks are diverse. To address this challenge, we propose a new framework, Hierarchical Continual… ▽ More

    Submitted 1 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  49. arXiv:2401.10744  [pdf, other

    cs.AI

    FinLLMs: A Framework for Financial Reasoning Dataset Generation with Large Language Models

    Authors: Ziqiang Yuan, Kaiyuan Wang, Shoutai Zhu, Ye Yuan, **gya Zhou, Yanlin Zhu, Wenqi Wei

    Abstract: Large Language models (LLMs) usually rely on extensive training datasets. In the financial domain, creating numerical reasoning datasets that include a mix of tables and long text often involves substantial manual annotation expenses. To address the limited data resources and reduce the annotation cost, we introduce FinLLMs, a method for generating financial question-answering data based on common… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: Under submission of IEEE Transactions

  50. arXiv:2401.06340  [pdf, other

    cs.HC cs.AI

    A Temporal-Spectral Fusion Transformer with Subject-Specific Adapter for Enhancing RSVP-BCI Decoding

    Authors: Xu** Li, Wei Wei, Shuang Qiu, Huiguang He

    Abstract: The Rapid Serial Visual Presentation (RSVP)-based Brain-Computer Interface (BCI) is an efficient technology for target retrieval using electroencephalography (EEG) signals. The performance improvement of traditional decoding methods relies on a substantial amount of training data from new test subjects, which increases preparation time for BCI systems. Several studies introduce data from existing… ▽ More

    Submitted 11 July, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 19 pages, 10 figures

    MSC Class: 68T07 ACM Class: I.5.4