Showing 1–17 of 17 results for author: Chengjie

Searching in archive eess.

  1. arXiv:2406.08782  [pdf, other]

    eess.IV cs.CV

    Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising

    Authors: Hao Liang, Chengjie, Kun Li, Xin Tian

    Abstract: Hyperspectral image (HSI) denoising is an essential procedure for HSI applications. Unfortunately, the existing Transformer-based methods mainly focus on non-local modeling, neglecting the importance of locality in image denoising. Moreover, deep learning methods employ complex spectral learning mechanisms, thus introducing large computation costs. To address these problems, we propose a hybrid…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2403.05791  [pdf, other]

    eess.AS cs.SD

    Asynchronous Microphone Array Calibration using Hybrid TDOA Information

    Authors: Chengjie Zhang, Jiang Wang, He Kong

    Abstract: Asynchronous microphone array calibration is a prerequisite for most audition robot applications. A popular solution to the above calibration problem is the batch form of Simultaneous Localisation and Mapping (SLAM), using the time difference of arrival measurements between two microphones (TDOA-M), and the robot (which serves as a moving sound source during calibration) odometry information. In t…

    Submitted 19 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2301.07925  [pdf, other]

    eess.SP

    Communication under Mixed Gaussian-Impulsive Channel: An End-to-End Framework

    Authors: Chengjie Zhao, Jun Wang, Wei Huang, Xiaonan Chen, Tianfu Qi

    Abstract: In many communication scenarios, the communication signals simultaneously suffer from white Gaussian noise (WGN) and non-Gaussian impulsive noise (IN), i.e., mixed Gaussian-impulsive noise (MGIN). Under MGIN channel, classical communication signal schemes and corresponding detection methods usually can not achieve desirable performance as they are optimized with respect to WGN. Moreover, as the wi…

    Submitted 19 January, 2023; originally announced January 2023.

  4. arXiv:2211.14448  [pdf, other]

    cs.CV cs.LG eess.IV

    How to Backpropagate through Hungarian in Your DETR?

    Authors: Lingji Chen, Alok Sharma, Chinmay Shirore, Chengjie Zhang, Balarama Raju Buddharaju

    Abstract: The DEtection TRansformer (DETR) approach, which uses a transformer encoder-decoder architecture and a set-based global loss, has become a building block in many transformer based applications. However, as originally presented, the assignment cost and the global loss are not aligned, i.e., reducing the former is likely but not guaranteed to reduce the latter. And the issue of gradient is ignored w…

    Submitted 11 November, 2022; originally announced November 2022.

  5. arXiv:2209.14435  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO eess.IV

    Out-of-Distribution Detection for LiDAR-based 3D Object Detection

    Authors: Chengjie Huang, Van Duong Nguyen, Vahdat Abdelzad, Christopher Gus Mannes, Luke Rowe, Benjamin Therien, Rick Salay, Krzysztof Czarnecki

    Abstract: 3D object detection is an essential part of automated driving, and deep neural networks (DNNs) have achieved state-of-the-art performance for this task. However, deep models are notorious for assigning high confidence scores to out-of-distribution (OOD) inputs, that is, inputs that are not drawn from the training distribution. Detecting OOD inputs is challenging and essential for the safe deployme…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: Accepted at ITSC 2022

  6. arXiv:2205.05675  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, et al. (86 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of e…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

  7. arXiv:2111.08857  [pdf, other]

    cs.LG cs.AI cs.MA cs.RO eess.SY

    SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition

    Authors: Hangyu Mao, Chao Wang, Xiaotian Hao, Yihuan Mao, Yiming Lu, Chengjie Wu, Jianye Hao, Dong Li, Pingzhong Tang

    Abstract: The MineRL competition is designed for the development of reinforcement learning and imitation learning algorithms that can efficiently leverage human demonstrations to drastically reduce the number of environment interactions needed to solve the complex ObtainDiamond task with sparse rewards. To address the challenge, in this paper, we present SEIHAI, a Sample-ef…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: The winner solution of NeurIPS 2020 MineRL competition (https://www.aicrowd.com/challenges/neurips-2020-minerl-competition/leaderboards). The paper has been accepted by DAI 2021 (the third International Conference on Distributed Artificial Intelligence)

  8. arXiv:2108.13294  [pdf, other]

    cs.LG cs.AI cs.RO cs.SE eess.SY

    The missing link: Developing a safety case for perception components in automated driving

    Authors: Rick Salay, Krzysztof Czarnecki, Hiroshi Kuwajima, Hirotoshi Yasuoka, Toshihiro Nakae, Vahdat Abdelzad, Chengjie Huang, Maximilian Kahn, Van Duong Nguyen

    Abstract: Safety assurance is a central concern for the development and societal acceptance of automated driving (AD) systems. Perception is a key aspect of AD that relies heavily on Machine Learning (ML). Despite the known challenges with the safety assurance of ML-based components, proposals have recently emerged for unit-level safety cases addressing these components. Unfortunately, AD safety cases expre…

    Submitted 6 September, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

  9. arXiv:2107.06463  [pdf, other]

    eess.IV cs.CV

    Learned Image Compression with Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules

    Authors: Haisheng Fu, Feng Liang, Jianping Lin, Bing Li, Mohammad Akbari, Jie Liang, Guohe Zhang, Dong Liu, Chengjie Tu, Jingning Han

    Abstract: Recently deep learning-based image compression methods have achieved significant achievements and gradually outperformed traditional approaches including the latest standard Versatile Video Coding (VVC) in both PSNR and MS-SSIM metrics. Two key components of learned image compression are the entropy model of the latent representations and the encoding/decoding network architectures. Various models…

    Submitted 9 February, 2024; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: IEEE Transactions On Image Processing

  10. arXiv:2012.15463  [pdf, other]

    cs.CV cs.LG eess.IV

    Learned Multi-Resolution Variable-Rate Image Compression with Octave-based Residual Blocks

    Authors: Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu

    Abstract: Recently deep learning-based image compression has shown the potential to outperform traditional codecs. However, most existing methods train multiple networks for multiple bit rates, which increase the implementation complexity. In this paper, we propose a new variable-rate image compression framework, which employs generalized octave convolutions (GoConv) and generalized octave transposed-convol…

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 10 pages, 9 figures, 1 table; accepted to IEEE Transactions on Multimedia 2020. arXiv admin note: substantial text overlap with arXiv:1912.05688

  11. arXiv:2009.13074  [pdf, other]

    eess.IV

    Learned Variable-Rate Multi-Frequency Image Compression using Modulated Generalized Octave Convolution

    Authors: Jianping Lin, Mohammad Akbari, Haisheng Fu, Qian Zhang, Shang Wang, Jie Liang, Dong Liu, Feng Liang, Guohe Zhang, Chengjie Tu

    Abstract: In this proposal, we design a learned multi-frequency image compression approach that uses generalized octave convolutions to factorize the latent representations into high-frequency (HF) and low-frequency (LF) components, and the LF components have lower resolution than HF components, which can improve the rate-distortion performance, similar to wavelet transform. Moreover, compared to the origin…

    Submitted 25 September, 2020; originally announced September 2020.

    Comments: MMSP 2020; JPEG-AI. arXiv admin note: text overlap with arXiv:2002.10032

  12. arXiv:2006.14497  [pdf, other]

    eess.SP

    Quantumized Microwave Detection Based on $Λ$-Type Three-level Superconducting System: HMM Modeling and Performance Prediction

    Authors: Junyu Zhang, Chen Gong, Shangbin Li, Shanchi Wu, Rui Ni, Chengjie Zuo, Jinkang Zhu, Ming Zhao, Zhengyuan Xu

    Abstract: We adopt artificial $Λ$-type three-level system with superconducting devices for microwave signal detection, where the signal intensity reaches the level of discrete photons instead of continuous waveform. Based on the state transition principles of the three-level system, we propose a statistical model for microwave signal detection. Moreover, we investigate the achievable transmission rate and s…

    Submitted 27 August, 2021; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 12 pages, 18 figures

  13. arXiv:2006.14471  [pdf, other]

    eess.SP

    Wireless Communication Based on Microwave Photon-Level Detection With Superconducting Devices: Achievable Rate Prediction

    Authors: Junyu Zhang, Chen Gong, Shangbin Li, Rui Ni, Chengjie Zuo, Jinkang Zhu, Ming Zhao, Zhengyuan Xu

    Abstract: Future wireless communication system embraces physical-layer signal detection with high sensitivity, especially in the microwave photon level. Currently, the receiver primarily adopts the signal detection based on semi-conductor devices for signal detection, while this paper introduces high-sensitivity photon-level microwave detection based on superconducting structure. We first overview existing…

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: 9 pages, 13 figures

  14. arXiv:2003.12933  [pdf, other]

    eess.SP

    Weak Radio Frequency Signal Detection Based on Piezo-Opto-Electro-Mechanical System: Architecture Design and Sensitivity Prediction

    Authors: Shanchi Wu, Chen Gong, Chengjie Zuo, Shangbin Li, Junyu Zhang, Zhongbin Dai, Kai Yang, Ming Zhao, Rui Ni, Zhengyuan Xu, Jinkang Zhu

    Abstract: We propose a novel radio-frequency (RF) receiving architecture based on micro-electro-mechanical system (MEMS) and optical coherent detection module. The architecture converts the received electrical signal into mechanical vibration through the piezoelectric effect and adopts an optical detection module to detect the mechanical vibration. We analyze the response function of piezoelectric film to a…

    Submitted 8 October, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

    Comments: 15 pages, 16 figures, 6 tables

  15. arXiv:2002.10032  [pdf, other]

    eess.IV cs.CV cs.LG

    Generalized Octave Convolutions for Learned Multi-Frequency Image Compression

    Authors: Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu

    Abstract: Learned image compression has recently shown the potential to outperform the standard codecs. State-of-the-art rate-distortion (R-D) performance has been achieved by context-adaptive entropy coding approaches in which hyperprior and autoregressive models are jointly utilized to effectively capture the spatial dependencies in the latent representations. However, the latents are feature maps of the…

    Submitted 31 December, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: 13 pages, 10 figures, 5 tables; Extended version of the paper accepted to AAAI 2021

  16. arXiv:1912.05688  [pdf, other]

    eess.IV cs.CV

    Learned Variable-Rate Image Compression with Residual Divisive Normalization

    Authors: Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu

    Abstract: Recently it has been shown that deep learning-based image compression has shown the potential to outperform traditional codecs. However, most existing methods train multiple networks for multiple bit rates, which increases the implementation complexity. In this paper, we propose a variable-rate image compression framework, which employs more Generalized Divisive Normalization (GDN) layers than pre…

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: 6 pages, 5 figures

  17. arXiv:1907.06566  [pdf, other]

    eess.IV cs.LG stat.ML

    Improved Hybrid Layered Image Compression using Deep Learning and Traditional Codecs

    Authors: Haisheng Fu, Feng Liang, Bo Lei, Nai Bian, Qian Zhang, Mohammad Akbari, Jie Liang, Chengjie Tu

    Abstract: Recently deep learning-based methods have been applied in image compression and achieved many promising results. In this paper, we propose an improved hybrid layered image compression framework by combining deep learning and the traditional image codecs. At the encoder, we first use a convolutional neural network (CNN) to obtain a compact representation of the input image, which is losslessly enco…

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: Submitted to Signal Processing: Image Communication

    Report number: 1907.06566

    Journal ref: Volume 82, March 2020, 115774