Skip to main content

Showing 1–50 of 99 results for author: Xiang, L

Searching in archive cs. Search in all archives.
.
  1. Generative Iris Prior Embedded Transformer for Iris Restoration

    Authors: Yubo Huang, Jia Wang, Peipei Li, Liuyu Xiang, Peigang Li, Zhaofeng He

    Abstract: Iris restoration from complexly degraded iris images, aiming to improve iris recognition performance, is a challenging problem. Due to the complex degradation, directly training a convolutional neural network (CNN) without prior cannot yield satisfactory results. In this work, we propose a generative iris prior embedded Transformer model (Gformer), in which we build a hierarchical encoder-decoder… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Our code is available at https://github.com/sawyercharlton/Gformer

    Journal ref: 2023 IEEE International Conference on Multimedia and Expo (ICME), Brisbane, Australia, 2023, pp. 510-515

  2. arXiv:2406.14903  [pdf, other

    cs.AI

    GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models

    Authors: Leyan Wang, Yonggang **, Tianhao Shen, Tianyu Zheng, Xinrun Du, Chenchen Zhang, Wenhao Huang, Jiaheng Liu, Shi Wang, Ge Zhang, Liuyu Xiang, Zhaofeng He

    Abstract: As large language models (LLMs) continue to develop and gain widespread application, the ability of LLMs to exhibit empathy towards diverse group identities and understand their perspectives is increasingly recognized as critical. Most existing benchmarks for empathy evaluation of LLMs focus primarily on universal human emotions, such as sadness and pain, often overlooking the context of individua… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.10305  [pdf

    cs.SE cs.AI cs.LG

    Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models

    Authors: Jie Chen, Xintian Han, Yu Ma, Xun Zhou, Liang Xiang

    Abstract: Automatic code generation has been a longstanding research topic. With the advancement of general-purpose large language models (LLMs), the ability to code stands out as one important measure to the model's reasoning performance. Usually, a two-stage training paradigm is implemented to obtain a Code LLM, namely the pretraining and the fine-tuning. Within the fine-tuning, supervised fine-tuning (SF… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.04721  [pdf, other

    cs.IT eess.SP

    End-to-End Design of Polar Coded Integrated Data and Energy Networking

    Authors: Jie Hu, **gwen Cui, Kun Yang

    Abstract: In order to transmit data and transfer energy to the low-power Internet of Things (IoT) devices, integrated data and energy networking (IDEN) system may be harnessed. In this context, we propose a bitwise end-to-end design for polar coded IDEN systems, where the conventional encoding/decoding, modulation/demodulation, and energy harvesting (EH) modules are replaced by the neural networks (NNs). In… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  5. arXiv:2405.20234  [pdf, other

    cs.AI

    Context Injection Attacks on Large Language Models

    Authors: Cheng'an Wei, Kai Chen, Yue Zhao, Yujia Gong, Lu Xiang, Shenchen Zhu

    Abstract: Large Language Models (LLMs) such as ChatGPT and Llama-2 have become prevalent in real-world applications, exhibiting impressive text generation performance. LLMs are fundamentally developed from a scenario where the input data remains static and lacks a clear structure. To behave interactively over time, LLM-based chat systems must integrate additional contextual information (i.e., chat history)… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  6. arXiv:2405.18551  [pdf, other

    cs.RO

    Photorealistic Robotic Simulation using Unreal Engine 5 for Agricultural Applications

    Authors: Xingjian Li, Lirong Xiang

    Abstract: This work presents a new robotics simulation environment built upon Unreal Engine 5 (UE5) for agricultural image data generation. The simulation utilizes the state-of-the-art real-time rendering engine to provide realistic plant images which are often used in agricultural applications. This study showcases the rendering accuracy of UE5 in comparison to existing tools and assesses its positional ac… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 3 pages, 4 figures, extended abstract accepted at IROS 2023 Workshop on Agricultural Robotics for a Sustainable Future (WARS_1)

  7. arXiv:2405.16930  [pdf, other

    cs.CV

    From Obstacle to Opportunity: Enhancing Semi-supervised Learning with Synthetic Data

    Authors: Zerun Wang, Jiafeng Mao, Liuyu Xiang, Toshihiko Yamasaki

    Abstract: Semi-supervised learning (SSL) can utilize unlabeled data to enhance model performance. In recent years, with increasingly powerful generative models becoming available, a large number of synthetic images have been uploaded to public image sets. Therefore, when collecting unlabeled data from these sources, the inclusion of synthetic images is inevitable. This prompts us to consider the impact of u… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2405.15157  [pdf, other

    cs.CV

    Rethinking Class-Incremental Learning from a Dynamic Imbalanced Learning Perspective

    Authors: Leyuan Wang, Liuyu Xiang, Yunlong Wang, Huijia Wu, Zhaofeng He

    Abstract: Deep neural networks suffer from catastrophic forgetting when continually learning new concepts. In this paper, we analyze this problem from a data imbalance point of view. We argue that the imbalance between old task and new task data contributes to forgetting of the old tasks. Moreover, the increasing imbalance ratio during incremental learning further aggravates the problem. To address the dyna… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  9. arXiv:2405.15155  [pdf, other

    cs.CV

    CLIP model is an Efficient Online Lifelong Learner

    Authors: Leyuan Wang, Liuyu Xiang, Yujie Wei, Yunlong Wang, Zhaofeng He

    Abstract: Online Lifelong Learning (OLL) addresses the challenge of learning from continuous and non-stationary data streams. Existing online lifelong learning methods based on image classification models often require preset conditions such as the total number of classes or maximum memory capacity, which hinders the realization of real never-ending learning and renders them impractical for real-world scena… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2404.07181  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    BAMBOO: a predictive and transferable machine learning force field framework for liquid electrolyte development

    Authors: Sheng Gong, Yumin Zhang, Zhenliang Mu, Zhichen Pu, Hongyi Wang, Zhiao Yu, Mengyi Chen, Tianze Zheng, Zhi Wang, Lifei Chen, Xiaojie Wu, Shaochen Shi, Weihao Gao, Wen Yan, Liang Xiang

    Abstract: Despite the widespread applications of machine learning force field (MLFF) on solids and small molecules, there is a notable gap in applying MLFF to complex liquid electrolytes. In this work, we introduce BAMBOO (ByteDance AI Molecular Simulation Booster), a novel framework for molecular dynamics (MD) simulations, with a demonstration of its capabilities in the context of liquid electrolytes for l… ▽ More

    Submitted 22 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  11. arXiv:2404.07084  [pdf, other

    cs.CL cs.AI

    Dynamic Generation of Personalities with Large Language Models

    Authors: Jianzhi Liu, Hexiang Gu, Tianyu Zheng, Liuyu Xiang, Huijia Wu, Jie Fu, Zhaofeng He

    Abstract: In the realm of mimicking human deliberation, large language models (LLMs) show promising performance, thereby amplifying the importance of this research area. Deliberation is influenced by both logic and personality. However, previous studies predominantly focused on the logic of LLMs, neglecting the exploration of personality aspects. In this work, we introduce Dynamic Personality Generation (DP… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  12. arXiv:2403.10978  [pdf, other

    cs.CL cs.IR

    Entity Alignment with Unlabeled Dangling Cases

    Authors: Hang Yin, Dong Ding, Liyao Xiang, Yuheng He, Yihan Wu, Xinbing Wang, Chenghu Zhou

    Abstract: We investigate the entity alignment problem with unlabeled dangling cases, meaning that there are entities in the source or target graph having no counterparts in the other, and those entities remain unlabeled. The problem arises when the source and target graphs are of different scales, and it is much cheaper to label the matchable pairs than the dangling entities. To solve the issue, we propose… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 14 pages

    ACM Class: I.2.4; H.3.3

  13. arXiv:2403.05842  [pdf, other

    cs.CR cs.AI

    Hufu: A Modality-Agnositc Watermarking System for Pre-Trained Transformers via Permutation Equivariance

    Authors: Hengyuan Xu, Liyao Xiang, Xingjun Ma, Borui Yang, Baochun Li

    Abstract: With the blossom of deep learning models and services, it has become an imperative concern to safeguard the valuable model parameters from being stolen. Watermarking is considered an important tool for ownership verification. However, current watermarking schemes are customized for different models and tasks, hard to be integrated as an integrated intellectual protection service. We propose Hufu,… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  14. arXiv:2403.02576  [pdf, other

    cs.DL cs.LG cs.SI

    AceMap: Knowledge Discovery through Academic Graph

    Authors: Xinbing Wang, Luoyi Fu, Xiaoying Gan, Ying Wen, Guanjie Zheng, Jiaxin Ding, Liyao Xiang, Nanyang Ye, Meng **, Shiyu Liang, Bin Lu, Haiwen Wang, Yi Xu, Cheng Deng, Shao Zhang, Huquan Kang, Xingli Wang, Qi Li, Zhixin Guo, Jiexing Qi, Pan Liu, Yuyang Ren, Lyuwen Wu, Jungang Yang, Jian** Zhou , et al. (1 additional authors not shown)

    Abstract: The exponential growth of scientific literature requires effective management and extraction of valuable insights. While existing scientific search engines excel at delivering search results based on relational databases, they often neglect the analysis of collaborations between scientific entities and the evolution of ideas, as well as the in-depth analysis of content within scientific publicatio… ▽ More

    Submitted 14 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Technical Report for AceMap (https://www.acemap.info)

  15. arXiv:2403.00839  [pdf, other

    cs.AI cs.CL

    ToolNet: Connecting Large Language Models with Massive Tools via Tool Graph

    Authors: Xukun Liu, Zhiyuan Peng, Xiaoyuan Yi, Xing Xie, Lirong Xiang, Yuchen Liu, Dongkuan Xu

    Abstract: While achieving remarkable progress in a broad range of tasks, large language models (LLMs) remain significantly limited in properly using massive external tools. Existing in-context learning approaches simply format tools into a list of plain text descriptions and input them to LLMs, from which, LLMs generate a sequence of tool calls to solve problems step by step. Such a paradigm ignores the int… ▽ More

    Submitted 28 February, 2024; originally announced March 2024.

  16. arXiv:2402.18821  [pdf, other

    cs.CV

    Debiased Novel Category Discovering and Localization

    Authors: Juexiao Feng, Yuhong Yang, Yanchun Xie, Yaqian Li, Yandong Guo, Yuchen Guo, Yuwei He, Liuyu Xiang, Guiguang Ding

    Abstract: In recent years, object detection in deep learning has experienced rapid development. However, most existing object detection models perform well only on closed-set datasets, ignoring a large number of potential objects whose categories are not defined in the training set. These objects are often identified as background or incorrectly classified as pre-defined categories by the detectors. In this… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI 2024

  17. arXiv:2402.15627  [pdf, other

    cs.LG cs.DC

    MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

    Authors: Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao , et al. (7 additional authors not shown)

    Abstract: We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs. Training LLMs at this scale brings unprecedented challenges to training efficiency and stability. We take a full-stack approach that co-designs the algorithmic and system components across model bl… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  18. arXiv:2402.04154  [pdf, other

    cs.AI cs.LG

    Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction

    Authors: Yonggang **, Ge Zhang, Hao Zhao, Tianyu Zheng, Jarvi Guo, Liuyu Xiang, Shawn Yue, Stephen W. Huang, Zhaofeng He, Jie Fu

    Abstract: Develo** a generalist agent is a longstanding objective in artificial intelligence. Previous efforts utilizing extensive offline datasets from various tasks demonstrate remarkable performance in multitasking scenarios within Reinforcement Learning. However, these works encounter challenges in extending their capabilities to new tasks. Recent approaches integrate textual guidance or visual trajec… ▽ More

    Submitted 5 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  19. arXiv:2401.13154  [pdf, other

    cs.OS

    Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration

    Authors: Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao, Yifan Yuan, Ren Wang

    Abstract: With the advent of byte-addressable memory devices, such as CXL memory, persistent memory, and storage-class memory, tiered memory systems have become a reality. Page migration is the de facto method within operating systems for managing tiered memory. It aims to bring hot data whenever possible into fast memory to optimize the performance of data accesses while using slow memory to accommodate da… ▽ More

    Submitted 17 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  20. arXiv:2401.07205  [pdf, other

    cs.CR cs.CV cs.LG

    Crafter: Facial Feature Crafting against Inversion-based Identity Theft on Deep Models

    Authors: Shiming Wang, Zhe Ji, Liyao Xiang, Hao Zhang, Xinbing Wang, Chenghu Zhou, Bo Li

    Abstract: With the increased capabilities at the edge (e.g., mobile device) and more stringent privacy requirement, it becomes a recent trend for deep learning-enabled applications to pre-process sensitive raw data at the edge and transmit the features to the backend cloud for further processing. A typical application is to run machine learning (ML) services on facial images collected from different individ… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  21. arXiv:2312.15742  [pdf, other

    cs.CV cs.AI

    DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection

    Authors: Li Xiang, Junbo Yin, Wei Li, Cheng-Zhong Xu, Ruigang Yang, Jianbing Shen

    Abstract: Vehicle-to-Everything (V2X) collaborative perception has recently gained significant attention due to its capability to enhance scene understanding by integrating information from various agents, e.g., vehicles, and infrastructure. However, current works often treat the information from each agent equally, ignoring the inherent domain gap caused by the utilization of different LiDAR sensors of eac… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: aaai2024

  22. arXiv:2312.08594  [pdf, other

    cs.CV

    CT-MVSNet: Efficient Multi-View Stereo with Cross-scale Transformer

    Authors: Sicheng Wang, Hao Jiang, Lei Xiang

    Abstract: Recent deep multi-view stereo (MVS) methods have widely incorporated transformers into cascade network for high-resolution depth estimation, achieving impressive results. However, existing transformer-based methods are constrained by their computational costs, preventing their extension to finer stages. In this paper, we propose a novel cross-scale transformer (CT) that processes feature represent… ▽ More

    Submitted 1 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at the 30th International Conference on Multimedia Modeling(MMM'24 Oral)

  23. arXiv:2311.08024  [pdf, other

    eess.IV cs.CV cs.LG

    MD-IQA: Learning Multi-scale Distributed Image Quality Assessment with Semi Supervised Learning for Low Dose CT

    Authors: Tao Song, Ruizhi Hou, Lisong Dai, Lei Xiang

    Abstract: Image quality assessment (IQA) plays a critical role in optimizing radiation dose and develo** novel medical imaging techniques in computed tomography (CT). Traditional IQA methods relying on hand-crafted features have limitations in summarizing the subjective perceptual experience of image quality. Recent deep learning-based approaches have demonstrated strong modeling capabilities and potentia… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  24. arXiv:2311.07106  [pdf, other

    cs.IT cs.ET

    A Tutorial on Coding Methods for DNA-based Molecular Communications and Storage

    Authors: Qiang Liu, Sirong Chen, Kang Yan, Wenfeng Wu, Kun Yang

    Abstract: Exponential increase of data has motivated advances of data storage technologies. As a promising storage media, DeoxyriboNucleic Acid (DNA) storage provides a much higher data density and superior durability, compared with state-of-the-art media. In this paper, we provide a tutorial on DNA storage and its role in molecular communications. Firstly, we introduce fundamentals of DNA-based molecular c… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  25. arXiv:2311.01122  [pdf, other

    cs.ET

    Deep Joint Source-Channel Coding for DNA Image Storage: A Novel Approach with Enhanced Error Resilience and Biological Constraint Optimization

    Authors: Wenfeng Wu, Qiang Liu, Kun Yang

    Abstract: In the current era, DeoxyriboNucleic Acid (DNA) based data storage emerges as an intriguing approach, garnering substantial academic interest and investigation. This paper introduces a novel deep joint source-channel coding (DJSCC) scheme for DNA image storage, designated as DJSCC-DNA. This paradigm distinguishes itself from conventional DNA storage techniques through three key modifications: 1) i… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  26. arXiv:2310.20200  [pdf, other

    cs.IT cs.CR

    Multi-Domain Polarization for Enhancing the Physical Layer Security of MIMO Systems

    Authors: Yao Zeng, Jie Hu, Kun Yang, Lajos Hanzo

    Abstract: A novel Physical Layer Security (PLS) framework is conceived for enhancing the security of the wireless communication systems by exploiting multi-domain polarization in Multiple-Input Multiple-Output (MIMO) systems. We design a sophisticated key generation scheme based on multi-domain polarization, and the corresponding receivers. An in-depth analysis of the system's secrecy rate is provided, demo… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  27. Reconfigurable Intelligent Sensing Surface aided Wireless Powered Communication Networks: A Sensing-Then-Reflecting Approach

    Authors: Cheng Luo, Jie Hu, Kun Yang

    Abstract: This paper presents a reconfigurable intelligent sensing surface (RISS) that combines passive and active elements to achieve simultaneous reflection and direction of arrival (DOA) estimation tasks. By utilizing DOA information from the RISS instead of conventional channel estimation, the pilot overhead is reduced and the RISS becomes independent of the hybrid access point (HAP), enabling efficient… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  28. arXiv:2309.13833  [pdf, other

    cs.CV cs.AI

    Dual Feature Augmentation Network for Generalized Zero-shot Learning

    Authors: Lei Xiang, Yuan Zhou, Haoran Duan, Yang Long

    Abstract: Zero-shot learning (ZSL) aims to infer novel classes without training samples by transferring knowledge from seen classes. Existing embedding-based approaches for ZSL typically employ attention mechanisms to locate attributes on an image. However, these methods often ignore the complex entanglement among different attributes' visual features in the embedding space. Additionally, these methods empl… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted to BMVC2023

  29. arXiv:2309.00860  [pdf, other

    cs.CR

    Towards Code Watermarking with Dual-Channel Transformations

    Authors: Borui Yang, Wei Li, Liyao Xiang, Bo Li

    Abstract: The expansion of the open source community and the rise of large language models have raised ethical and security concerns on the distribution of source code, such as misconduct on copyrighted code, distributions without proper licenses, or misuse of the code for malicious purposes. Hence it is important to track the ownership of source code, in which watermarking is a major technique. Yet, drasti… ▽ More

    Submitted 1 January, 2024; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: 16 pages, accepted by IEEE S&P 2024

  30. arXiv:2308.15143  [pdf, other

    cs.RO cs.AI

    Lifelike Agility and Play on Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models

    Authors: Lei Han, Qingxu Zhu, Jiapeng Sheng, Chong Zhang, Tingguang Li, Yizheng Zhang, He Zhang, Yuzhen Liu, Cheng Zhou, Rui Zhao, Jie Li, Yufeng Zhang, Rui Wang, Wanchao Chi, Xiong Li, Yonghui Zhu, Lingzhu Xiang, Xiao Teng, Zhengyou Zhang

    Abstract: Summarizing knowledge from animals and human beings inspires robotic innovations. In this work, we propose a framework for driving legged robots act like real animals with lifelike agility and strategy in complex environments. Inspired by large pre-trained models witnessed with impressive performance in language and image understanding, we introduce the power of advanced deep generative models to… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  31. arXiv:2308.06668  [pdf, other

    cs.LG cs.CV

    Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges

    Authors: Jiajia Li, Mingle Xu, Lirong Xiang, Dong Chen, Weichao Zhuang, Xunyuan Yin, Zhaojian Li

    Abstract: The past decade has witnessed the rapid development and adoption of ML & DL methodologies in agricultural systems, showcased by great successes in agricultural applications. However, these conventional ML/DL models have certain limitations: they heavily rely on large, costly-to-acquire labeled datasets for training, require specialized expertise for development and maintenance, and are mostly tail… ▽ More

    Submitted 17 March, 2024; v1 submitted 12 August, 2023; originally announced August 2023.

    Comments: 18 pages, 3 figures

  32. arXiv:2307.14487  [pdf, other

    cs.CV cs.AI

    Technical note: ShinyAnimalCV: open-source cloud-based web application for object detection, segmentation, and three-dimensional visualization of animals using computer vision

    Authors: ** Wang, Yu Hu, Lirong Xiang, Gota Morota, Samantha A. Brooks, Carissa L. Wickens, Emily K. Miller-Cushon, Haipeng Yu

    Abstract: Computer vision (CV), a non-intrusive and cost-effective technology, has furthered the development of precision livestock farming by enabling optimized decision-making through timely and individualized animal care. The availability of affordable two- and three-dimensional camera sensors, combined with various machine learning and deep learning algorithms, has provided a valuable opportunity to imp… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  33. arXiv:2307.02988  [pdf, other

    cs.NI eess.SP

    UAV Swarms for Joint Data Ferrying and Dynamic Cell Coverage via Optimal Transport Descent and Quadratic Assignment

    Authors: Kai Cui, Lars Baumgärtner, Burak Yilmaz, Mengguang Li, Christian Fabian, Benjamin Becker, Lin Xiang, Maximilian Bauer, Heinz Koeppl

    Abstract: Both data ferrying with disruption-tolerant networking (DTN) and mobile cellular base stations constitute important techniques for UAV-aided communication in situations of crises where standard communication infrastructure is unavailable. For optimal use of a limited number of UAVs, we propose providing both DTN and a cellular base station on each UAV. Here, DTN is used for large amounts of low-pr… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted to IEEE LCN 2023 as full paper, pre-final version

  34. arXiv:2306.16077  [pdf, other

    cs.LG cs.AI cs.DC

    Secure and Fast Asynchronous Vertical Federated Learning via Cascaded Hybrid Optimization

    Authors: Ganyu Wang, Qingsong Zhang, Li Xiang, Boyu Wang, Bin Gu, Charles Ling

    Abstract: Vertical Federated Learning (VFL) attracts increasing attention because it empowers multiple parties to jointly train a privacy-preserving model over vertically partitioned data. Recent research has shown that applying zeroth-order optimization (ZOO) has many advantages in building a practical VFL algorithm. However, a vital problem with the ZOO-based VFL is its slow convergence rate, which limits… ▽ More

    Submitted 29 June, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Under Review

  35. arXiv:2306.10698  [pdf, other

    cs.LG cs.AI

    Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork

    Authors: Yonggang **, Chenxu Wang, Tianyu Zheng, Liuyu Xiang, Yaodong Yang, Junge Zhang, Jie Fu, Zhaofeng He

    Abstract: Deep reinforcement learning algorithms are usually impeded by sampling inefficiency, heavily depending on multiple interactions with the environment to acquire accurate decision-making capabilities. In contrast, humans rely on their hippocampus to retrieve relevant information from past experiences of relevant tasks, which guides their decision-making when learning a new task, rather than exclusiv… ▽ More

    Submitted 6 March, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

  36. arXiv:2305.13802  [pdf, other

    cs.CV

    Online Open-set Semi-supervised Object Detection with Dual Competing Head

    Authors: Zerun Wang, Ling Xiao, Liuyu Xiang, Zhaotian Weng, Toshihiko Yamasaki

    Abstract: Open-set semi-supervised object detection (OSSOD) task leverages practical open-set unlabeled datasets that comprise both in-distribution (ID) and out-of-distribution (OOD) instances for conducting semi-supervised object detection (SSOD). The main challenge in OSSOD is distinguishing and filtering the OOD instances (i.e., outliers) during pseudo-labeling since OODs will affect the performance. The… ▽ More

    Submitted 21 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  37. arXiv:2305.12461  [pdf, other

    cs.CR cs.SE

    Towards Tracing Code Provenance with Code Watermarking

    Authors: Wei Li, Borui Yang, Yujie Sun, Suyu Chen, Ziyun Song, Liyao Xiang, Xinbing Wang, Chenghu Zhou

    Abstract: Recent advances in large language models have raised wide concern in generating abundant plausible source code without scrutiny, and thus tracing the provenance of code emerges as a critical issue. To solve the issue, we propose CodeMark, a watermarking system that hides bit strings into variables respecting the natural and operational semantics of the code. For naturalness, we novelly introduce a… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 12 pages

    MSC Class: 68T01 ACM Class: I.2.5

  38. arXiv:2304.07735  [pdf, other

    cs.CR

    Permutation Equivariance of Transformers and Its Applications

    Authors: Hengyuan Xu, Liyao Xiang, Hangyu Ye, Dixi Yao, Pengzhi Chu, Baochun Li

    Abstract: Revolutionizing the field of deep learning, Transformer-based models have achieved remarkable performance in many tasks. Recent research has recognized these models are robust to shuffling but are limited to inter-token permutation in the forward propagation. In this work, we propose our definition of permutation equivariance, a broader concept covering both inter- and intra- token permutation in… ▽ More

    Submitted 31 March, 2024; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2024

  39. arXiv:2304.01106  [pdf, ps, other

    cs.CL cs.IT

    Crossword: A Semantic Approach to Data Compression via Masking

    Authors: Mingxiao Li, Rui **, Liyao Xiang, Kaiming Shen, Shuguang Cui

    Abstract: The traditional methods for data compression are typically based on the symbol-level statistics, with the information source modeled as a long sequence of i.i.d. random variables or a stochastic process, thus establishing the fundamental limit as entropy for lossless compression and as mutual information for lossy compression. However, the source (including text, music, and speech) in the real wor… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 6 pages, 8 figures

  40. arXiv:2303.16038  [pdf, other

    cs.IT eess.SP

    Polar Coded Integrated Data and Energy Networking: A Deep Neural Network Assisted End-to-End Design

    Authors: **gwen Cui, Jie Hu, Kun Yang, Lajos Hanzo

    Abstract: Wireless sensors are everywhere. To address their energy supply, we proposed an end-to-end design for polar-coded integrated data and energy networking (IDEN), where the conventional signal processing modules, such as modulation/demodulation and channel decoding, are replaced by deep neural networks (DNNs). Moreover, the input-output relationship of an energy harvester (EH) is also modelled by a D… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  41. arXiv:2303.13089  [pdf, other

    cs.CV cs.LG

    Box-Level Active Detection

    Authors: Mengyao Lyu, Jundong Zhou, Hui Chen, Yijie Huang, Dongdong Yu, Yaqian Li, Yandong Guo, Yuchen Guo, Liuyu Xiang, Guiguang Ding

    Abstract: Active learning selects informative samples for annotation within budget, which has proven efficient recently on object detection. However, the widely used active detection benchmarks conduct image-level evaluation, which is unrealistic in human workload estimation and biased towards crowded images. Furthermore, existing methods still perform image-level annotation, but equally scoring all targets… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 highlight

  42. arXiv:2301.09422  [pdf, other

    cs.LG cs.AI cs.CV

    HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks

    Authors: **qi Xiao, Chengming Zhang, Yu Gong, Miao Yin, Yang Sui, Lizhi Xiang, Dingwen Tao, Bo Yuan

    Abstract: Low-rank compression is an important model compression strategy for obtaining compact neural network models. In general, because the rank values directly determine the model complexity and model accuracy, proper selection of layer-wise rank is very critical and desired. To date, though many low-rank compression approaches, either selecting the ranks in a manual or automatic way, have been proposed… ▽ More

    Submitted 1 February, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: AAAI-23

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence. 37, 9 (Jun. 2023), 10464-10472

  43. arXiv:2301.05565  [pdf, ps, other

    cs.CV

    DINF: Dynamic Instance Noise Filter for Occluded Pedestrian Detection

    Authors: Li Xiang, He Miao, Luo Haibo, Xiao Jiajie

    Abstract: Occlusion issue is the biggest challenge in pedestrian detection. RCNN-based detectors extract instance features by crop** rectangle regions of interest in the feature maps. However, the visible pixels of the occluded objects are limited, making the rectangle instance feature mixed with a lot of instance-irrelevant noise information. Besides, by counting the number of instances with different de… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 15 pages, 8 figures

  44. arXiv:2212.02800  [pdf, other

    cs.CL

    Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation

    Authors: Yang Zhao, Junnan Zhu, Lu Xiang, Jiajun Zhang, Yu Zhou, Feifei Zhai, Chengqing Zong

    Abstract: A common scenario of Multilingual Neural Machine Translation (MNMT) is that each translation task arrives in a sequential manner, and the training data of previous tasks is unavailable. In this scenario, the current methods suffer heavily from catastrophic forgetting (CF). To alleviate the CF, we investigate knowledge distillation based life-long learning methods. Specifically, in one-tomany scena… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  45. X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications

    Authors: Junyuan Shang, Shuohuan Wang, Yu Sun, Yanjun Yu, Yue Zhou, Li Xiang, Guixiu Yang

    Abstract: This paper describes our winning system on SemEval 2022 Task 7: Identifying Plausible Clarifications of Implicit and Underspecified Phrases in Instructional Texts. A replaced token detection pre-trained model is utilized with minorly different task-specific heads for SubTask-A: Multi-class Classification and SubTask-B: Ranking. Incorporating a pattern-aware ensemble method, our system achieves a 6… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted at the 16th International Workshop on Semantic Evaluation (SemEval-2022), NAACL

  46. arXiv:2211.14429  [pdf, other

    physics.chem-ph cs.LG q-bio.BM

    Supervised Pretraining for Molecular Force Fields and Properties Prediction

    Authors: Xiang Gao, Weihao Gao, Wenzhi Xiao, Zhirui Wang, Chong Wang, Liang Xiang

    Abstract: Machine learning approaches have become popular for molecular modeling tasks, including molecular force fields and properties prediction. Traditional supervised learning methods suffer from scarcity of labeled data for particular tasks, motivating the use of large-scale dataset for other relevant tasks. We propose to pretrain neural networks on a dataset of 86 millions of molecules with atom charg… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: AI4Science Workshop at NeurIPS 2022

  47. arXiv:2211.12773  [pdf, other

    cs.LG physics.chem-ph q-bio.BM

    Learning Regularized Positional Encoding for Molecular Prediction

    Authors: Xiang Gao, Weihao Gao, Wenzhi Xiao, Zhirui Wang, Chong Wang, Liang Xiang

    Abstract: Machine learning has become a promising approach for molecular modeling. Positional quantities, such as interatomic distances and bond angles, play a crucial role in molecule physics. The existing works rely on careful manual design of their representation. To model the complex nonlinearity in predicting molecular properties in an more end-to-end approach, we propose to encode the positional quant… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: AI4Science Workshop at NeurIPS 2022

  48. TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition

    Authors: Lizhi Xiang, Miao Yin, Chengming Zhang, Aravind Sukumaran-Rajam, P. Sadayappan, Bo Yuan, Dingwen Tao

    Abstract: Tucker decomposition is one of the SOTA CNN model compression techniques. However, unlike the FLOPs reduction, we observe very limited inference time reduction with Tucker-compressed models using existing GPU software such as cuDNN. To this end, we propose an efficient end-to-end framework that can generate highly accurate and compact CNN models via Tucker decomposition and optimized inference cod… ▽ More

    Submitted 4 January, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 14 pages, 9 figures, 3 tables, accepted by PPoPP '23

  49. arXiv:2211.00826  [pdf, ps, other

    cs.CV cs.AI

    TSAA: A Two-Stage Anchor Assignment Method towards Anchor Drift in Crowded Object Detection

    Authors: Li Xiang, He Miao, Luo Haibo, Yang Huiyuan, Xiao Jiajie

    Abstract: Among current anchor-based detectors, a positive anchor box will be intuitively assigned to the object that overlaps it the most. The assigned label to each anchor will directly determine the optimization direction of the corresponding prediction box, including the direction of box regression and category prediction. In our practice of crowded object detection, however, the results show that a pos… ▽ More

    Submitted 11 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: 11 pages, 8 figures

  50. arXiv:2210.08068  [pdf, other

    eess.IV cs.CV cs.LG

    Whole-body tumor segmentation of 18F -FDG PET/CT using a cascaded and ensembled convolutional neural networks

    Authors: Ludovic Sibille, Xinrui Zhan, Lei Xiang

    Abstract: Background: A crucial initial processing step for quantitative PET/CT analysis is the segmentation of tumor lesions enabling accurate feature ex-traction, tumor characterization, oncologic staging, and image-based therapy response assessment. Manual lesion segmentation is however associated with enormous effort and cost and is thus infeasible in clinical routine. Goal: The goal of this study was t… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.