Showing 1–25 of 25 results for author: Zhuang, L

Searching in archive cs.
  1. arXiv:2404.19403  [pdf, other]

    cs.RO cs.AI

    Transformer-Enhanced Motion Planner: Attention-Guided Sampling for State-Specific Decision Making

    Authors: Lei Zhuang, Jingdong Zhao, Yuntao Li, Zichun Xu, Liangliang Zhao, Hong Liu

    Abstract: Sampling-based motion planning (SBMP) algorithms are renowned for their robust global search capabilities. However, the inherent randomness in their sampling mechanisms often results in inconsistent path quality and limited search efficiency. In response to these challenges, this work proposes a novel deep learning-based motion planning framework, named Transformer-Enhanced Motion Planner (TEMP), w…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. arXiv:2403.06289  [pdf, other]

    cs.CV cs.AI cs.LG

    Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning

    Authors: Zijun Long, Lipeng Zhuang, George Killick, Richard McCreadie, Gerardo Aragon Camarasa, Paul Henderson

    Abstract: Human-annotated vision datasets inevitably contain a fraction of human mislabelled examples. While the detrimental effects of such mislabelling on supervised learning are well-researched, their influence on Supervised Contrastive Learning (SCL) remains largely unexplored. In this paper, we show that human-labelling errors not only differ significantly from synthetic label errors, but also pose uni…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.16481

  3. arXiv:2402.14551  [pdf, other]

    cs.CV cs.AI cs.LG

    CLCE: An Approach to Refining Cross-Entropy and Contrastive Learning for Optimized Learning Fusion

    Authors: Zijun Long, George Killick, Lipeng Zhuang, Gerardo Aragon-Camarasa, Zaiqiao Meng, Richard McCreadie

    Abstract: State-of-the-art pre-trained image models predominantly adopt a two-stage approach: initial unsupervised pre-training on large-scale datasets followed by task-specific fine-tuning using Cross-Entropy loss (CE). However, it has been demonstrated that CE can compromise model generalization and stability. While recent works employing contrastive learning address some of these limitations by enhancing…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2308.14893

  4. arXiv:2401.11544  [pdf, other]

    cs.CV

    Hierarchical Prompts for Rehearsal-free Continual Learning

    Authors: Yukun Zuo, Hantao Yao, Lu Yu, Liansheng Zhuang, Changsheng Xu

    Abstract: Continual learning endeavors to equip the model with the capability to integrate current task knowledge while mitigating the forgetting of past task knowledge. Inspired by prompt tuning, prompt-based methods maintain a frozen backbone and train with slight learnable prompts to minimize the catastrophic forgetting that arises due to updating a large number of backbone parameters. Nonetheless, these…

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Submitted to TPAMI

  5. arXiv:2401.06287  [pdf, other]

    cs.CV

    Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition

    Authors: Yukun Zuo, Hantao Yao, Liansheng Zhuang, Changsheng Xu

    Abstract: Audio-visual video recognition (AVVR) aims to integrate audio and visual clues to categorize videos accurately. While existing methods train AVVR models using provided datasets and achieve satisfactory results, they struggle to retain historical class knowledge when confronted with new classes in real-world situations. Currently, there are no dedicated methods for addressing this problem, so this…

    Submitted 6 June, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted by TPAMI

  6. arXiv:2312.13788  [pdf, other]

    cs.RO

    Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator

    Authors: Zichun Xu, Yuntao Li, Xiaohang Yang, Zhiyuan Zhao, Lei Zhuang, Jingdong Zhao

    Abstract: This paper presents three open-source reinforcement learning environments developed on the MuJoCo physics engine with the Franka Emika Panda arm in MuJoCo Menagerie. Three representative tasks, push, slide, and pick-and-place, are implemented through the Gymnasium Robotics API, which inherits from the core of Gymnasium. Both the sparse binary and dense rewards are supported, and the observation sp…

    Submitted 11 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  7. arXiv:2311.16481  [pdf, other]

    cs.CV

    Elucidating and Overcoming the Challenges of Label Noise in Supervised Contrastive Learning

    Authors: Zijun Long, George Killick, Lipeng Zhuang, Richard McCreadie, Gerardo Aragon Camarasa, Paul Henderson

    Abstract: Image classification datasets exhibit a non-negligible fraction of mislabeled examples, often due to human error when one class superficially resembles another. This issue poses challenges in supervised contrastive learning (SCL), where the goal is to cluster together data points of the same class in the embedding space while distancing those of disparate classes. While such methods outperform tho…

    Submitted 25 November, 2023; originally announced November 2023.

  8. arXiv:2310.15931  [pdf, other]

    cs.RO

    GO-FEAP: Global Optimal UAV Planner Using Frontier-Omission-Aware Exploration and Altitude-Stratified Planning

    Authors: Weiye Zhang, Wenshuai Yu, Licong Zhuang, Xiaoyi Zhang, Zhi Zeng, Jiasong Zhu

    Abstract: Autonomous exploration is a fundamental problem for various applications of unmanned aerial vehicles (UAVs). Existing methods, however, have been shown to suffer from static local optima and to be limited to two-dimensional exploration. To address these challenges, this paper introduces GO-FEAP (Global Optimal UAV Planner Using Frontier-Omission-Aware Exploration and Altitude-Stratified Planning), aiming to achieve efficient…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 7 pages, 29 figures

  9. arXiv:2305.10662  [pdf, other]

    cs.CV cs.CR

    Learning Differentially Private Probabilistic Models for Privacy-Preserving Image Generation

    Authors: Bochao Liu, Shiming Ge, Pengju Wang, Liansheng Zhuang, Tongliang Liu

    Abstract: A number of deep models trained on high-quality and valuable images have been deployed in practical applications, which may pose a leakage risk of data privacy. Learning differentially private generative models can sidestep this challenge through indirect data access. However, such differentially private generative models learned by existing approaches can only generate images with a low-resolutio…

    Submitted 17 May, 2023; originally announced May 2023.

  10. arXiv:2305.02567  [pdf, other]

    cs.CV

    LayoutDM: Transformer-based Diffusion Model for Layout Generation

    Authors: Shang Chai, Liansheng Zhuang, Fengying Yan

    Abstract: Automatic layout generation that can synthesize high-quality layouts is an important tool for graphic design in many applications. Though existing methods based on generative models such as Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) have progressed, they still leave much room for improving the quality and diversity of the results. Inspired by the recent success of…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted by CVPR 2023

  11. arXiv:2204.11695  [pdf, other]

    cs.CV

    Estimation of Reliable Proposal Quality for Temporal Action Detection

    Authors: Junshan Hu, Chaoxu Guo, Liansheng Zhuang, Biao Wang, Tiezheng Ge, Yuning Jiang, Houqiang Li

    Abstract: Temporal action detection (TAD) aims to locate and recognize the actions in an untrimmed video. Anchor-free methods have made remarkable progress which mainly formulate TAD into two tasks: classification and localization using two separate branches. This paper reveals the temporal misalignment between the two tasks hindering further progress. To address this, we propose a new method that gives ins…

    Submitted 21 November, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to ACM Multimedia 2022

  12. arXiv:2109.13967  [pdf, other]

    cs.CV

    One-shot Key Information Extraction from Document with Deep Partial Graph Matching

    Authors: Minghong Yao, Zhiguang Liu, Liangwei Wang, Houqiang Li, Liansheng Zhuang

    Abstract: Automating the Key Information Extraction (KIE) from documents improves efficiency, productivity, and security in many industrial scenarios such as rapid indexing and archiving. Many existing supervised learning methods for the KIE task need to feed a large number of labeled samples and learn separate models for different types of documents. However, collecting and labeling a large dataset is time…

    Submitted 26 September, 2021; originally announced September 2021.

  13. arXiv:2109.08879  [pdf, other]

    eess.IV cs.CV

    FastHyMix: Fast and Parameter-free Hyperspectral Image Mixed Noise Removal

    Authors: Lina Zhuang, Michael K. Ng

    Abstract: Hyperspectral imaging with high spectral resolution plays an important role in finding objects, identifying materials, or detecting processes. The decrease of the widths of spectral bands leads to a decrease in the signal-to-noise ratio (SNR) of measurements. The decreased SNR reduces the reliability of measured features or information extracted from HSIs. Furthermore, the image degradations linke…

    Submitted 18 September, 2021; originally announced September 2021.

  14. Using Low-rank Representation of Abundance Maps and Nonnegative Tensor Factorization for Hyperspectral Nonlinear Unmixing

    Authors: Lianru Gao, Zhicheng Wang, Lina Zhuang, Haoyang Yu, Bing Zhang, Jocelyn Chanussot

    Abstract: Tensor-based methods have been widely studied to attack inverse problems in hyperspectral imaging since a hyperspectral image (HSI) cube can be naturally represented as a third-order tensor, which can perfectly retain the spatial information in the image. In this article, we extend the linear tensor method to the nonlinear tensor method and propose a nonlinear low-rank tensor unmixing algorithm to…

    Submitted 30 March, 2021; originally announced March 2021.

  15. Hyperspectral Image Denoising and Anomaly Detection Based on Low-rank and Sparse Representations

    Authors: Lina Zhuang, Lianru Gao, Bing Zhang, Xiyou Fu, Jose M. Bioucas-Dias

    Abstract: Hyperspectral imaging measures the amount of electromagnetic energy across the instantaneous field of view at a very high resolution in hundreds or thousands of spectral channels. This enables objects to be detected and the identification of materials that have subtle differences between them. However, the increase in spectral resolution often means that there is a decrease in the number of photon…

    Submitted 12 March, 2021; originally announced March 2021.

  16. Fast Hyperspectral Image Denoising and Inpainting Based on Low-Rank and Sparse Representations

    Authors: Lina Zhuang, Jose M. Bioucas-Dias

    Abstract: This paper introduces two very fast and competitive hyperspectral image (HSI) restoration algorithms: fast hyperspectral denoising (FastHyDe), a denoising algorithm able to cope with Gaussian and Poissonian noise, and fast hyperspectral inpainting (FastHyIn), an inpainting algorithm to restore HSIs where some observations from known pixels in some known bands are missing. FastHyDe and FastHyIn ful…

    Submitted 11 March, 2021; originally announced March 2021.

  17. arXiv:2002.10616  [pdf]

    physics.soc-ph cs.CY

    How many infections of COVID-19 there will be in the "Diamond Princess"-Predicted by a virus transmission model based on the simulation of crowd flow

    Authors: Zhiming Fang, Zhongyi Huang, Xiaolian Li, Jun Zhang, Wei Lv, Lei Zhuang, Xingpeng Xu, Nan Huang

    Abstract: Objectives: Simulate the transmission process of COVID-19 in a cruise ship, and then to judge how many infections there will be in the 3711 people in the "Diamond Princess" and analyze measures that could have prevented mass transmission. Methods: Based on the crowd flow model, the virus transmission rule between pedestrians is established, to simulate the spread of the virus caused by the close…

    Submitted 24 February, 2020; originally announced February 2020.

  18. arXiv:2002.02089  [pdf, other]

    cs.AI cs.RO

    Soft Hindsight Experience Replay

    Authors: Qiwei He, Liansheng Zhuang, Houqiang Li

    Abstract: Efficient learning in the environment with sparse rewards is one of the most important challenges in Deep Reinforcement Learning (DRL). In continuous DRL environments such as robotic arms control, Hindsight Experience Replay (HER) has been shown to be an effective solution. However, due to the brittleness of deterministic methods, HER and its variants typically suffer from a major challenge for stabilit…

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: 7 pages, 5 figures, 1 table, submitted to IJCAI2020

  19. arXiv:1909.11252  [pdf, other]

    cs.IR cs.LG

    Neighborhood-Enhanced and Time-Aware Model for Session-based Recommendation

    Authors: Yang Lv, Liangsheng Zhuang, Pengyu Luo

    Abstract: Session based recommendation has become one of the research hotspots in the field of recommendation systems due to its highly practical value. Previous deep learning methods mostly focus on the sequential characteristics within the current session, and neglect the context similarity and temporal similarity between sessions which contain abundant collaborative information. In this paper, we propose a no…

    Submitted 1 October, 2019; v1 submitted 24 September, 2019; originally announced September 2019.

  20. arXiv:1807.08048  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Baidu Apollo EM Motion Planner

    Authors: Haoyang Fan, Fan Zhu, Changchun Liu, Liangliang Zhang, Li Zhuang, Dong Li, Weicheng Zhu, Jiangtao Hu, Hongye Li, Qi Kong

    Abstract: In this manuscript, we introduce a real-time motion planning system based on the Baidu Apollo (open source) autonomous driving platform. The developed system aims to address the industrial level-4 motion planning problem while considering safety, comfort and scalability. The system covers multilane and single-lane autonomous driving in a hierarchical manner: (1) The top layer of the system is a mu…

    Submitted 20 July, 2018; originally announced July 2018.

  21. arXiv:1709.04344   

    cs.CV

    Flexible Network Binarization with Layer-wise Priority

    Authors: Lixue Zhuang, Yi Xu, Bingbing Ni, Hongteng Xu

    Abstract: How to effectively approximate real-valued parameters with binary codes plays a central role in neural network binarization. In this work, we reveal an important fact that binarizing different layers has a widely-varied effect on the compression ratio of network and the loss of performance. Based on this fact, we propose a novel and flexible neural network binarization method by introducing the co…

    Submitted 16 February, 2018; v1 submitted 13 September, 2017; originally announced September 2017.

    Comments: More experiments on image classification are planned

  22. arXiv:1608.05150  [pdf]

    cs.IT

    Experimental demonstration of Layered/Enhanced ACO-OFDM in short haul optical fiber transmission link

    Authors: Binhuang Song, Chen Zhu, Bill Corcoran, Qibing Wang, Leimeng Zhuang, Arthur J. Lowery

    Abstract: Asymmetrically clipped optical orthogonal frequency division multiplexing (ACO-OFDM) is theoretically more power efficient but less spectrally efficient than DC-bias OFDM (DCO-OFDM), with less power allocated to the informationless bias component by only using odd index sub-carriers. Layered/Enhanced asymmetrically clipped optical orthogonal frequency division multiplexing (L/E-ACO-OFDM) has been…

    Submitted 17 August, 2016; originally announced August 2016.

  23. arXiv:1607.02539   

    cs.CV

    Graph Construction with Label Information for Semi-Supervised Learning

    Authors: Liansheng Zhuang, Zihan Zhou, Jingwen Yin, Shenghua Gao, Zhouchen Lin, Yi Ma, Nenghai Yu

    Abstract: In the literature, most existing graph-based semi-supervised learning (SSL) methods only use the label information of observed samples in the label propagation stage, while ignoring such valuable information when learning the graph. In this paper, we argue that it is beneficial to consider the label information in the graph learning stage. Specifically, by enforcing the weight of edges between lab…

    Submitted 12 February, 2017; v1 submitted 8 July, 2016; originally announced July 2016.

    Comments: This paper is withdrawn by the authors for some errors

  24. Constructing a Non-Negative Low Rank and Sparse Graph with Data-Adaptive Features

    Authors: Liansheng Zhuang, Shenghua Gao, Jinhui Tang, Jingjing Wang, Zhouchen Lin, Yi Ma

    Abstract: This paper aims at constructing a good graph for discovering intrinsic data structures in a semi-supervised learning setting. Firstly, we propose to build a non-negative low-rank and sparse (referred to as NNLRS) graph for the given data representation. Specifically, the weights of edges in the graph are obtained by seeking a nonnegative low-rank and sparse matrix that represents each data sample…

    Submitted 3 September, 2014; originally announced September 2014.

  25. arXiv:1402.1879  [pdf, other]

    cs.CV

    Sparse Illumination Learning and Transfer for Single-Sample Face Recognition with Image Corruption and Misalignment

    Authors: Liansheng Zhuang, Tsung-Han Chan, Allen Y. Yang, S. Shankar Sastry, Yi Ma

    Abstract: Single-sample face recognition is one of the most challenging problems in face recognition. We propose a novel algorithm to address this problem based on a sparse representation based classification (SRC) framework. The new algorithm is robust to image misalignment and pixel corruption, and is able to reduce required gallery images to one sample per class. To compensate for the missing illuminatio…

    Submitted 8 February, 2014; originally announced February 2014.