Showing 1–50 of 78 results for author: An, Z

  1. MARLP: Time-series Forecasting Control for Agricultural Managed Aquifer Recharge

    Authors: Yuning Chen, Kang Yang, Zhiyu An, Brady Holder, Luke Paloutzian, Khaled Bali, Wan Du

    Abstract: The rapid decline in groundwater around the world poses a significant challenge to sustainable agriculture. To address this issue, agricultural managed aquifer recharge (Ag-MAR) is proposed to recharge the aquifer by artificially flooding agricultural lands using surface water. Ag-MAR requires a carefully selected flooding schedule to avoid affecting the oxygen absorption of crop roots. However, c…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted by KDD 2024

  2. Unified Dual-Intent Translation for Joint Modeling of Search and Recommendation

    Authors: Yuting Zhang, Yiqing Wu, Ruidong Han, Ying Sun, Yongchun Zhu, Xiang Li, Wei Lin, Fuzhen Zhuang, Zhulin An, Yongjun Xu

    Abstract: Recommendation systems, which assist users in discovering their preferred items among numerous options, have served billions of users across various online platforms. Intuitively, users' interactions with items are highly driven by their unchanging inherent intents (e.g., always preferring high-quality items) and changing demand intents (e.g., wanting a T-shirt in summer but a down jacket in winte…

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.10744   

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Jose Alvarez, Coert van Gemeren, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, et al. (77 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a…

    Submitted 27 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: The author list and contents need to be verified by all authors

  4. arXiv:2406.05488  [pdf, other]

    cs.LG cs.AI

    Online Policy Distillation with Decision-Attention

    Authors: Xinqiang Yu, Chuanguang Yang, Chengqing Yu, Libo Huang, Zhulin An, Yongjun Xu

    Abstract: Policy Distillation (PD) has become an effective method to improve deep reinforcement learning tasks. The core idea of PD is to distill policy knowledge from a teacher agent to a student agent. However, the teacher-student framework requires a well-trained teacher model which is computationally expensive. In the light of online knowledge distillation, we study the knowledge transfer between differe…

    Submitted 8 June, 2024; originally announced June 2024.

  5. arXiv:2406.04829  [pdf, other]

    cs.CV

    EGOR: Efficient Generated Objects Replay for incremental object detection

    Authors: Zijia An, Boyu Diao, Libo Huang, Ruiqi Liu, Zhulin An, Yongjun Xu

    Abstract: Incremental object detection aims to simultaneously maintain old-class accuracy and detect emerging new-class objects in incremental data. Most existing distillation-based methods underperform when unlabeled old-class objects are absent in the incremental dataset. While the absence can be mitigated by generating old-class samples, it also incurs high computational costs. In this paper, we argue th…

    Submitted 7 June, 2024; originally announced June 2024.

  6. arXiv:2404.09447  [pdf, other]

    cs.CV cs.LG

    kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

    Authors: Zhongrui Gui, Shuyang Sun, Runjia Li, Jianhao Yuan, Zhaochong An, Karsten Roth, Ameya Prabhu, Philip Torr

    Abstract: Rapid advancements in continual segmentation have yet to bridge the gap of scaling to large continually expanding vocabularies under compute-constrained scenarios. We discover that traditional continual training leads to catastrophic forgetting under compute constraints, unable to outperform zero-shot segmentation methods. We introduce a novel strategy for semantic and panoptic segmentation with z…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 figures

  7. arXiv:2404.04807  [pdf, other]

    cs.CV cs.MM

    D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation

    Authors: Xuan Sun, Zhanfu An, Yuyu Liu

    Abstract: We investigated domain adaptive semantic segmentation in foggy weather scenarios, which aims to enhance the utilization of unlabeled foggy data and improve the model's adaptability to foggy conditions. Current methods rely on clear images as references, jointly learning defogging and segmentation for foggy images. Despite making some progress, there are still two main drawbacks: (1) the coupling o…

    Submitted 7 April, 2024; originally announced April 2024.

  8. arXiv:2403.16221  [pdf, other]

    cs.CV

    Exemplar-Free Class Incremental Learning via Incremental Representation

    Authors: Libo Huang, Zhulin An, Yan Zeng, Chuanguang Yang, Xinqiang Yu, Yongjun Xu

    Abstract: Exemplar-Free Class Incremental Learning (efCIL) aims to continuously incorporate the knowledge from new classes while retaining previously learned information, without storing any old-class exemplars (i.e., samples). For this purpose, various efCIL methods have been proposed over the past few years, generally with elaborately constructed old pseudo-features, increasing the difficulty of model dev…

    Submitted 24 March, 2024; originally announced March 2024.

  9. arXiv:2403.07652  [pdf, other]

    cs.LG cs.CL

    Harder Tasks Need More Experts: Dynamic Routing in MoE Models

    Authors: Quzhe Huang, Zhenwei An, Nan Zhuang, Mingxu Tao, Chen Zhang, Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Songfang Huang, Yansong Feng

    Abstract: In this paper, we introduce a novel dynamic expert selection framework for Mixture of Experts (MoE) models, aiming to enhance computational efficiency and model performance by adjusting the number of activated experts based on input difficulty. Unlike traditional MoE approaches that rely on fixed Top-K routing, which activates a predetermined number of experts regardless of the input's complexity,…

    Submitted 12 March, 2024; originally announced March 2024.

  10. arXiv:2403.00592  [pdf, other]

    cs.CV

    Rethinking Few-shot 3D Point Cloud Semantic Segmentation

    Authors: Zhaochong An, Guolei Sun, Yun Liu, Fayao Liu, Zongwei Wu, Dan Wang, Luc Van Gool, Serge Belongie

    Abstract: This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS), with a focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution. The former arises from non-uniform point sampling, allowing models to distinguish the density disparities between foreground and background for easier segmentation. The latter results from sampling only 2,048 p…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  11. arXiv:2403.00172  [pdf, other]

    eess.SY cs.AI cs.LG

    Go Beyond Black-box Policies: Rethinking the Design of Learning Agent for Interpretable and Verifiable HVAC Control

    Authors: Zhiyu An, Xianzhong Ding, Wan Du

    Abstract: Recent research has shown the potential of Model-based Reinforcement Learning (MBRL) to enhance energy efficiency of Heating, Ventilation, and Air Conditioning (HVAC) systems. However, existing methods rely on black-box thermal dynamics models and stochastic optimizers, lacking reliability guarantees and posing risks to occupant health. In this work, we overcome the reliability bottleneck by redes…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted for the 61st Design Automation Conference (DAC)

  12. arXiv:2402.13419  [pdf, ps, other]

    cs.AI

    Reward Bound for Behavioral Guarantee of Model-based Planning Agents

    Authors: Zhiyu An, Xianzhong Ding, Wan Du

    Abstract: Recent years have seen an emerging interest in the trustworthiness of machine learning-based agents in the wild, especially in robotics, to provide safety assurance for the industry. Obtaining behavioral guarantees for these agents remains an important problem. In this work, we focus on guaranteeing a model-based planning agent reaches a goal state within a specific future time step. We show that…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: To be published in ICLR 24 tiny paper track

  13. arXiv:2401.07448  [pdf, other]

    cs.AI cs.LG

    Formal Logic Enabled Personalized Federated Learning Through Property Inference

    Authors: Ziyan An, Taylor T. Johnson, Meiyi Ma

    Abstract: Recent advancements in federated learning (FL) have greatly facilitated the development of decentralized collaborative applications, particularly in the domain of Artificial Intelligence of Things (AIoT). However, a critical aspect missing from the current research landscape is the ability to enable data-driven client models with symbolic reasoning capabilities. Specifically, the inherent heteroge…

    Submitted 23 January, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  14. arXiv:2401.05960  [pdf, other]

    cs.AI

    Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

    Authors: Xijun Li, Fangzhou Zhu, Hui-Ling Zhen, Weilin Luo, Meng Lu, Yimin Huang, Zhenan Fan, Zirui Zhou, Yufei Kuang, Zhihai Wang, Zijie Geng, Yang Li, Haoyang Liu, Zhiwu An, Muming Yang, Jianshu Li, Jie Wang, Junchi Yan, Defeng Sun, Tao Zhong, Yong Zhang, Jia Zeng, Mingxuan Yuan, Jianye Hao, Jun Yao, et al. (1 additional author not shown)

    Abstract: In an era of digital ubiquity, efficient resource management and decision-making are paramount across numerous industries. To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional opt…

    Submitted 17 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  15. arXiv:2312.11791  [pdf, other]

    cs.RO

    Double Oracle Algorithm for Game-Theoretic Robot Allocation on Graphs

    Authors: Zijian An, Lifeng Zhou

    Abstract: We study the problem of game-theoretic robot allocation where two players strategically allocate robots to compete for multiple sites of interest. Robots possess offensive or defensive capabilities to interfere and weaken their opponents to take over a competing site. This problem belongs to the conventional Colonel Blotto Game. Considering the robots' heterogeneous capabilities and environmental…

    Submitted 18 December, 2023; originally announced December 2023.

  16. arXiv:2312.04780  [pdf, other]

    cs.CV cs.AI

    Fine-Tuning InstructPix2Pix for Advanced Image Colorization

    Authors: Zifeng An, Zijing Xu, Eric Fan, Qi Cao

    Abstract: This paper presents a novel approach to human image colorization by fine-tuning the InstructPix2Pix model, which integrates a language model (GPT-3) with a text-to-image model (Stable Diffusion). Despite the original InstructPix2Pix model's proficiency in editing images based on textual instructions, it exhibits limitations in the focused domain of colorization. To address this, we fine-tuned the…

    Submitted 7 December, 2023; originally announced December 2023.

  17. arXiv:2311.07491  [pdf, other]

    cs.CL

    A Step Closer to Comprehensive Answers: Constrained Multi-Stage Question Decomposition with Large Language Models

    Authors: Hejing Cao, Zhenwei An, Jiazhan Feng, Kun Xu, Liwei Chen, Dongyan Zhao

    Abstract: While large language models exhibit remarkable performance in the Question Answering task, they are susceptible to hallucinations. Challenges arise when these models grapple with understanding multi-hop relations in complex questions or lack the necessary knowledge for a comprehensive response. To address this issue, we introduce the "Decompose-and-Query" framework (D&Q). This framework guides the…

    Submitted 13 November, 2023; originally announced November 2023.

  18. arXiv:2309.16117  [pdf, other]

    cs.LG cs.AI

    E2Net: Resource-Efficient Continual Learning with Elastic Expansion Network

    Authors: RuiQi Liu, Boyu Diao, Libo Huang, Zhulin An, Yongjun Xu

    Abstract: Continual Learning methods are designed to learn new tasks without erasing previous knowledge. However, Continual Learning often requires massive computational power and storage capacity for satisfactory performance. In this paper, we propose a resource-efficient continual learning method called the Elastic Expansion Network (E2Net). Leveraging core subnet distillation and precise replay sample se…

    Submitted 27 September, 2023; originally announced September 2023.

  19. arXiv:2309.08020  [pdf, other]

    cs.CV

    Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation

    Authors: Zhaochong An, Guolei Sun, Zongwei Wu, Hao Tang, Luc Van Gool

    Abstract: Modern approaches have proved the huge potential of addressing semantic segmentation as a mask classification task which is widely used in instance-level segmentation. This paradigm trains models by assigning part of object queries to ground truths via conventional one-to-one matching. However, we observe that the popular video semantic segmentation (VSS) dataset has limited categories per video,…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: BMVC 2023

  20. arXiv:2308.07890  [pdf, other]

    cs.AI cs.FL cs.LO

    EduSAT: A Pedagogical Tool for Theory and Applications of Boolean Satisfiability

    Authors: Yiqi Zhao, Ziyan An, Meiyi Ma, Taylor Johnson

    Abstract: Boolean Satisfiability (SAT) and Satisfiability Modulo Theories (SMT) are widely used in automated verification, but there is a lack of interactive tools designed for educational purposes in this field. To address this gap, we present EduSAT, a pedagogical tool specifically developed to support learning and understanding of SAT and SMT solving. EduSAT offers implementations of key algorithms such…

    Submitted 15 August, 2023; originally announced August 2023.

  21. arXiv:2307.12732  [pdf, other]

    cs.CV

    CLIP-KD: An Empirical Study of CLIP Model Distillation

    Authors: Chuanguang Yang, Zhulin An, Libo Huang, Junyu Bi, Xinqiang Yu, Han Yang, Boyu Diao, Yongjun Xu

    Abstract: Contrastive Language-Image Pre-training (CLIP) has become a promising language-supervised visual pre-training framework. This paper aims to distill small CLIP models supervised by a large teacher CLIP model. We propose several distillation strategies, including relation, feature, gradient and contrastive paradigms, to examine the effectiveness of CLIP-Knowledge Distillation (KD). We show that a si…

    Submitted 7 May, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: CVPR-2024

  22. arXiv:2306.10687  [pdf, other]

    cs.CV

    Categories of Response-Based, Feature-Based, and Relation-Based Knowledge Distillation

    Authors: Chuanguang Yang, Xinqiang Yu, Zhulin An, Yongjun Xu

    Abstract: Deep neural networks have achieved remarkable performance for artificial intelligence tasks. The success behind intelligent systems often relies on large-scale models with high computational complexity and storage costs. The over-parameterized networks are often easy to optimize and can achieve better performance. However, it is challenging to deploy them over resource-limited edge-devices. Knowle…

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: Published at Springer book "Advancements in Knowledge Distillation: Towards New Horizons of Intelligent Systems"

  23. arXiv:2306.08998  [pdf, other]

    cs.SD cs.CV eess.AS

    Team AcieLee: Technical Report for EPIC-SOUNDS Audio-Based Interaction Recognition Challenge 2023

    Authors: Yuqi Li, Yizhi Luo, Xiaoshuai Hao, Chuanguang Yang, Zhulin An, Dantong Song, Wei Yi

    Abstract: In this report, we describe the technical details of our submission to the EPIC-SOUNDS Audio-Based Interaction Recognition Challenge 2023, by Team "AcieLee" (username: Yuqi_Li). The task is to classify the audio caused by interactions between objects, or from events of the camera wearer. We conducted exhaustive experiments and found learning rate step decay, backbone frozen, label smoothing and f…

    Submitted 15 June, 2023; originally announced June 2023.

  24. arXiv:2306.06808  [pdf, other]

    cs.AI

    Multi-Agent Reinforcement Learning Guided by Signal Temporal Logic Specifications

    Authors: Jiangwei Wang, Shuo Yang, Ziyan An, Songyang Han, Zhili Zhang, Rahul Mangharam, Meiyi Ma, Fei Miao

    Abstract: Reward design is a key component of deep reinforcement learning, yet some tasks and designer's objectives may be unnatural to define as a scalar cost function. Among the various techniques, formal methods integrated with DRL have garnered considerable attention due to their expressiveness and flexibility to define the reward and requirements for different states and actions of the agent. However,…

    Submitted 22 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

  25. Modeling Dual Period-Varying Preferences for Takeaway Recommendation

    Authors: Yuting Zhang, Yiqing Wu, Ran Le, Yongchun Zhu, Fuzhen Zhuang, Ruidong Han, Xiang Li, Wei Lin, Zhulin An, Yongjun Xu

    Abstract: Takeaway recommender systems, which aim to accurately provide stores that offer foods meeting users' interests, have served billions of users in our daily life. Different from traditional recommendation, takeaway recommendation faces two main challenges: (1) Dual Interaction-Aware Preference Modeling. Traditional recommendation commonly focuses on users' single preferences for items while takeaway…

    Submitted 16 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: accepted by KDD (The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining) 2023 Applied Data Science (ADS) track

  26. arXiv:2306.01974  [pdf, other]

    cs.SD eess.AS

    BEDRF: Bidirectional Edge Diffraction Response Function for Interactive Sound Propagation

    Authors: Chunxiao Cao, Zili An, Zhong Ren, Dinesh Manocha, Kun Zhou

    Abstract: We introduce bidirectional edge diffraction response function (BEDRF), a new approach to model wave diffraction around edges with path tracing. The diffraction part of the wave is expressed as an integration on path space, and the wave-edge interaction is expressed using only the localized information around points on the edge similar to a bidirectional scattering distribution function (BSDF) for…

    Submitted 2 June, 2023; originally announced June 2023.

  27. arXiv:2305.17499  [pdf, other]

    cs.CL cs.MM eess.AS

    CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training

    Authors: Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma

    Abstract: Speech or text representation generated by pre-trained models contains modal-specific information that could be combined for benefiting spoken language understanding (SLU) tasks. In this work, we propose a novel pre-training paradigm termed Continuous Integrate-and-Fire Pre-Training (CIF-PT). It relies on a simple but effective frame-to-token alignment: continuous integrate-and-fire (CIF) to bridg…

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023 Findings

  28. arXiv:2305.15062  [pdf, other]

    cs.CL cs.AI

    Lawyer LLaMA Technical Report

    Authors: Quzhe Huang, Mingxu Tao, Chen Zhang, Zhenwei An, Cong Jiang, Zhibin Chen, Zirui Wu, Yansong Feng

    Abstract: Large Language Models (LLMs), like LLaMA, have exhibited remarkable performance across various tasks. Nevertheless, when deployed to specific domains such as law or medicine, the models still confront the challenge of a deficiency in domain-specific knowledge and an inadequate capability to leverage that knowledge to resolve domain-related problems. In this paper, we propose a new framework to ada…

    Submitted 13 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  29. arXiv:2305.10469  [pdf, other]

    cs.CV

    Object Segmentation by Mining Cross-Modal Semantics

    Authors: Zongwei Wu, Jingjing Wang, Zhuyun Zhou, Zhaochong An, Qiuping Jiang, Cédric Demonceaux, Guolei Sun, Radu Timofte

    Abstract: Multi-sensor clues have shown promise for object segmentation, but inherent noise in each sensor, as well as the calibration error in practice, may bias the segmentation accuracy. In this paper, we propose a novel approach by mining the Cross-Modal Semantics to guide the fusion and decoding of multimodal features, with the aim of controlling the modal contribution based on relative entropy. We exp…

    Submitted 4 August, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: ACM MM 2023

  30. arXiv:2305.08585  [pdf, other]

    cs.CV eess.IV

    Toward Moiré-Free and Detail-Preserving Demosaicking

    Authors: Xuanchen Li, Yan Niu, Bo Zhao, Haoyuan Shi, Zitong An

    Abstract: 3D convolutions are commonly employed by demosaicking neural models, in the same way as solving other image restoration problems. Counter-intuitively, we show that 3D convolutions implicitly impede the RGB color spectra from exchanging complementary information, resulting in spectral-inconsistent inference of the local spatial high frequency components. As a consequence, shallow 3D convolution net…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 11 pages, 5 figures, 5 tables

  31. NeRF2: Neural Radio-Frequency Radiance Fields

    Authors: Xiaopeng Zhao, Zhenlin An, Qingrui Pan, Lei Yang

    Abstract: Although Maxwell discovered the physical laws of electromagnetic waves 160 years ago, how to precisely model the propagation of an RF signal in an electrically large and complex environment remains a long-standing problem. The difficulty is in the complex interactions between the RF signal and the obstacles (e.g., reflection, diffraction, etc.). Inspired by the great success of using a neural netw…

    Submitted 12 October, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

  32. arXiv:2304.11677  [pdf, other]

    cs.CV

    Indiscernible Object Counting in Underwater Scenes

    Authors: Guolei Sun, Zhaochong An, Yun Liu, Ce Liu, Christos Sakaridis, Deng-Ping Fan, Luc Van Gool

    Abstract: Recently, indiscernible scene understanding has attracted a lot of attention in the vision community. We further advance the frontier of this field by systematically studying a new challenge named indiscernible object counting (IOC), the goal of which is to count objects that are blended with respect to their surroundings. Due to a lack of appropriate IOC datasets, we present a large-scale dataset…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: To appear in CVPR 2023. The resources are available at https://github.com/GuoleiSun/Indiscernible-Object-Counting

  33. arXiv:2304.10103  [pdf, other]

    cs.CV

    eTag: Class-Incremental Learning with Embedding Distillation and Task-Oriented Generation

    Authors: Libo Huang, Yan Zeng, Chuanguang Yang, Zhulin An, Boyu Diao, Yongjun Xu

    Abstract: Class-Incremental Learning (CIL) aims to solve the neural networks' catastrophic forgetting problem, which refers to the fact that once the network updates on a new task, its performance on previously-learned tasks drops dramatically. Most successful CIL methods incrementally train a feature extractor with the aid of stored exemplars, or estimate the feature distribution with the stored prototypes…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 12 pages, 12 figures

  34. arXiv:2303.08416  [pdf, other]

    eess.IV cs.CV

    Lung Nodule Segmentation and Uncertain Region Prediction with an Uncertainty-Aware Attention Mechanism

    Authors: Han Yang, Qiuli Wang, Yue Zhang, Zhulin An, Chen Liu, Xiaohong Zhang, S. Kevin Zhou

    Abstract: Radiologists possess diverse training and clinical experiences, leading to variations in the segmentation annotations of lung nodules and resulting in segmentation uncertainty. Conventional methods typically select a single annotation as the learning target or attempt to learn a latent space comprising multiple annotations. However, these approaches fail to leverage the valuable information inheren…

    Submitted 11 September, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 10 pages, 10 figures. We have reported a preliminary version of this work in MICCAI 2022

  35. arXiv:2302.11137  [pdf, other]

    cs.AI

    Fairguard: Harness Logic-based Fairness Rules in Smart Cities

    Authors: Yiqi Zhao, Ziyan An, Xuqing Gao, Ayan Mukhopadhyay, Meiyi Ma

    Abstract: Smart cities operate on computational predictive frameworks that collect, aggregate, and utilize data from large-scale sensor networks. However, these frameworks are prone to multiple sources of data and algorithmic bias, which often lead to unfair prediction results. In this work, we first demonstrate that bias persists at a micro-level both temporally and spatially by studying real city data fro…

    Submitted 8 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  36. arXiv:2212.06486  [pdf, other]

    cs.CV

    Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning

    Authors: Kaiyou Song, Shan Zhang, Zihao An, Zimeng Luo, Tong Wang, Jin Xie

    Abstract: In contrastive self-supervised learning, the common way to learn discriminative representation is to pull different augmented "views" of the same image closer while pushing all other images further apart, which has been proven to be effective. However, it is unavoidable to construct undesirable views containing different semantic concepts during the augmentation procedure. It would damage the sema…

    Submitted 13 December, 2022; originally announced December 2022.

  37. arXiv:2211.11617  [pdf, other]

    cs.CL

    CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation

    Authors: Yinpei Dai, Wanwei He, Bowen Li, Yuchuan Wu, Zheng Cao, Zhongqi An, Jian Sun, Yongbin Li

    Abstract: Practical dialog systems need to deal with various knowledge sources, noisy user expressions, and the shortage of annotated data. To better solve the above problems, we propose CGoDial, a new challenging and comprehensive Chinese benchmark for multi-domain Goal-oriented Dialog evaluation. It contains 96,763 dialog sessions and 574,949 dialog turns in total, covering three datasets with different know…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  38. arXiv:2210.17108  [pdf, other]

    cs.CL

    Do Charge Prediction Models Learn Legal Theory?

    Authors: Zhenwei An, Quzhe Huang, Cong Jiang, Yansong Feng, Dongyan Zhao

    Abstract: The charge prediction task aims to predict the charge for a case given its fact description. Recent models have already achieved impressive accuracy in this task; however, little is understood about the mechanisms they use to perform the judgment. For practical applications, a charge prediction model should conform to the certain legal theory in civil law countries, as under the framework of civil…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  39. arXiv:2208.05768  [pdf, other]

    cs.CV

    MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition

    Authors: Chuanguang Yang, Zhulin An, Helong Zhou, Linhang Cai, Xiang Zhi, Jiwen Wu, Yongjun Xu, Qian Zhang

    Abstract: Unlike the conventional Knowledge Distillation (KD), Self-KD allows a network to learn knowledge from itself without any guidance from extra networks. This paper proposes to perform Self-KD from image Mixture (MixSKD), which integrates these two techniques into a unified framework. MixSKD mutually distills feature maps and probability distributions between the random pair of original images and th…

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: 22 pages, ECCV-2022

  40. arXiv:2207.11518  [pdf, other]

    cs.CV cs.AI

    Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition

    Authors: Chuanguang Yang, Zhulin An, Helong Zhou, Fuzhen Zhuang, Yongjun Xu, Qian Zhan

    Abstract: The teacher-free online Knowledge Distillation (KD) aims to train an ensemble of multiple student models collaboratively and distill knowledge from each other. Although existing online KD methods achieve desirable performance, they often focus on class probabilities as the core knowledge type, ignoring the valuable feature representational information. We present a Mutual Contrastive Learning (MCL…

    Submitted 27 March, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

    Comments: 18 pages, accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI-2023)

  41. arXiv:2206.03367  [pdf, other]

    cs.CV

    Localizing Semantic Patches for Accelerating Image Classification

    Authors: Chuanguang Yang, Zhulin An, Yongjun Xu

    Abstract: Existing works often focus on reducing the architecture redundancy for accelerating image classification but ignore the spatial redundancy of the input image. This paper proposes an efficient image classification pipeline to solve this problem. We first pinpoint task-aware regions over the input image by a lightweight patch proposal network called AnchorNet. We then feed these localized semantic p…

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: Accepted by ICME-2022

  42. arXiv:2205.06603  [pdf, other]

    cs.CL cs.AI cs.LG

    Improving Contextual Representation with Gloss Regularized Pre-training

    Authors: Yu Lin, Zhecheng An, Peihao Wu, Zejun Ma

    Abstract: Though achieving impressive results on many NLP tasks, the BERT-like masked language models (MLM) encounter the discrepancy between pre-training and inference. In light of this gap, we investigate the contextual representation of pre-training and inference from the perspective of word probability distribution. We discover that BERT risks neglecting the contextual word similarity in pre-training. T…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Accepted to Findings of NAACL 2022

  43. arXiv:2204.06986  [pdf, other]

    cs.CV

    Cross-Image Relational Knowledge Distillation for Semantic Segmentation

    Authors: Chuanguang Yang, Helong Zhou, Zhulin An, Xue Jiang, Yongjun Xu, Qian Zhang

    Abstract: Current Knowledge Distillation (KD) methods for semantic segmentation often guide the student to mimic the teacher's structured information generated from individual data samples. However, they ignore the global semantic relations among pixels across various images that are valuable for KD. This paper proposes a novel Cross-Image Relational KD (CIRKD), which focuses on transferring structured pixe…

    Submitted 10 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR-2022

  44. arXiv:2204.05045  [pdf, other]

    cs.CV

    SAL-CNN: Estimate the Remaining Useful Life of Bearings Using Time-frequency Information

    Authors: Bingguo Liu, Zhuo Gao, Binghui Lu, Hangcheng Dong, Zeru An

    Abstract: In modern industrial production, the prediction ability of the remaining useful life (RUL) of bearings directly affects the safety and stability of the system. Traditional methods require rigorous physical modeling and perform poorly for complex systems. In this paper, an end-to-end RUL prediction method is proposed, which uses short-time Fourier transform (STFT) as preprocessing. Considering the…

    Submitted 11 April, 2022; originally announced April 2022.

  45. arXiv:2203.00934  [pdf, ps, other]

    cs.IT

    Cross-Layer Optimization: Joint User Scheduling and Beamforming Design With QoS Support in Joint Transmission Networks

    Authors: Shiwen He, Zhenyu An, Jianyue Zhu, Min Zhang, Yongming Huang, Yaoxue Zhang

    Abstract: User scheduling and beamforming design are two crucial yet coupled topics for wireless communication systems. They are usually optimized separately with conventional optimization methods. In this paper, a novel cross-layer optimization problem is considered, namely, the user scheduling and beamforming are jointly discussed subjecting to the requirement of per-user quality of service (QoS) and the…

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 28 pages, 6 figures, submitted to IEEE Transactions on Communications, 2022

  46. arXiv:2202.08449  [pdf, other]

    cs.CV

    V2X-Sim: Multi-Agent Collaborative Perception Dataset and Benchmark for Autonomous Driving

    Authors: Yiming Li, Dekun Ma, Ziyan An, Zixun Wang, Yiqi Zhong, Siheng Chen, Chen Feng

    Abstract: Vehicle-to-everything (V2X) communication techniques enable the collaboration between vehicles and many other entities in the neighboring environment, which could fundamentally improve the perception system for autonomous driving. However, the lack of a public dataset significantly restricts the research progress of collaborative perception. To fill this gap, we present V2X-Sim, a comprehensive si…

    Submitted 15 July, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 2022 IEEE Robotics and Automation Letters (RA-L) (The extended abstract is presented at 2021 IEEE International Conference on Computer Vision (ICCV) Simulation Technology for Embodied AI Workshop)

  47. arXiv:2202.01582  [pdf, other]

    cs.SD cs.GR eess.AS

    A Psychoacoustic Quality Criterion for Path-Traced Sound Propagation

    Authors: Chunxiao Cao, Zili An, Zhong Ren, Dinesh Manocha, Kun Zhou

    Abstract: In developing virtual acoustic environments, it is important to understand the relationship between the computation cost and the perceptual significance of the resultant numerical error. In this paper, we propose a quality criterion that evaluates the error significance of path-tracing-based sound propagation simulators. We present an analytical formula that estimates the error signal power spectr…

    Submitted 8 October, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: 12 pages, 10 figures. To be published in IEEE TVCG

  48. arXiv:2201.08994  [pdf, ps, other]

    cs.IT

    An Unsupervised Deep Unrolling Framework for Constrained Optimization Problems in Wireless Networks

    Authors: Shiwen He, Shaowen Xiong, Zhenyu An, Wei Zhang, Yongming Huang, Yaoxue Zhang

    Abstract: In wireless network, the optimization problems generally have complex constraints, and are usually solved via utilizing the traditional optimization methods that have high computational complexity and need to be executed repeatedly with the change of network environments. In this paper, to overcome these shortcomings, an unsupervised deep unrolling framework based on projection gradient descent, i…

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: 27 pages, 8 figures, submitted to IEEE Transactions on Wireless Communications, Jan. 2022

  49. arXiv:2201.06418  [pdf, other]

    cs.LG cs.AI

    Lifelong Generative Learning via Knowledge Reconstruction

    Authors: Libo Huang, Zhulin An, Xiang Zhi, Yongjun Xu

    Abstract: Generative models often incur the catastrophic forgetting problem when they are used to sequentially learning multiple tasks, i.e., lifelong generative learning. Although there are some endeavors to tackle this problem, they suffer from high time-consumptions or error accumulation. In this work, we develop an efficient and effective lifelong generative model based on variational autoencoder (VAE).…

    Submitted 17 January, 2022; originally announced January 2022.

  50. arXiv:2112.01738  [pdf, ps, other]

    cs.IT eess.SP

    Joint User Scheduling and Beamforming Design for Multiuser MISO Downlink Systems

    Authors: S. He, J. Yuan, Z. An, W. Huang, Y. Huang, Y. Zhang

    Abstract: In multiuser communication systems, user scheduling and beamforming (US-BF) design are two fundamental problems that are usually studied separately in the existing literature. In this work, we focus on the joint US-BF design with the goal of maximizing the set cardinality of scheduled users, which is computationally challenging due to the non-convex objective function and the coupled constraints w…

    Submitted 4 July, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 31 pages, 9 figures, submit to IEEE Transactions on Wireless Communications