Skip to main content

Showing 1–23 of 23 results for author: Jia, C

Searching in archive eess. Search in all archives.
.
  1. arXiv:2309.07589  [pdf, other

    cs.MM eess.IV

    MPAI-EEV: Standardization Efforts of Artificial Intelligence based End-to-End Video Coding

    Authors: Chuanmin Jia, Feng Ye, Fanke Dong, Kai Lin, Leonardo Chiariglione, Siwei Ma, Huifang Sun, Wen Gao

    Abstract: The rapid advancement of artificial intelligence (AI) technology has led to the prioritization of standardizing the processing, coding, and transmission of video using neural networks. To address this priority area, the Moving Picture, Audio, and Data Coding by Artificial Intelligence (MPAI) group is develo** a suite of standards called MPAI-EEV for "end-to-end optimized neural video coding." Th… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  2. arXiv:2308.12508  [pdf, other

    eess.IV cs.CV cs.GR

    FFEINR: Flow Feature-Enhanced Implicit Neural Representation for Spatio-temporal Super-Resolution

    Authors: Chenyue Jiao, Chongke Bi, Lu Yang

    Abstract: Large-scale numerical simulations are capable of generating data up to terabytes or even petabytes. As a promising method of data reduction, super-resolution (SR) has been widely studied in the scientific visualization community. However, most of them are based on deep convolutional neural networks (CNNs) or generative adversarial networks (GANs) and the scale factor needs to be determined before… ▽ More

    Submitted 26 August, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted and published by ChinaVis 2023(2023.7.21-24)

  3. arXiv:2306.14108  [pdf, other

    cs.CV eess.IV

    SpikeCodec: An End-to-end Learned Compression Framework for Spiking Camera

    Authors: Kexiang Feng, Chuanmin Jia, Siwei Ma, Wen Gao

    Abstract: Recently, the bio-inspired spike camera with continuous motion recording capability has attracted tremendous attention due to its ultra high temporal resolution imaging characteristic. Such imaging feature results in huge data storage and transmission burden compared to that of traditional camera, raising severe challenge and imminent necessity in compression for spike camera captured content. Exi… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 13 pages, 11 figures and 5 tables

  4. arXiv:2304.09322  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-Modality Multi-Scale Cardiovascular Disease Subtypes Classification Using Raman Image and Medical History

    Authors: Bo Yu, Hechang Chen, Chengyou Jia, Hongren Zhou, Lele Cong, Xiankai Li, Jianhui Zhuang, Xianling Cong

    Abstract: Raman spectroscopy (RS) has been widely used for disease diagnosis, e.g., cardiovascular disease (CVD), owing to its efficiency and component-specific testing capabilities. A series of popular deep learning methods have recently been introduced to learn nuance features from RS for binary classifications and achieved outstanding performance than conventional machine learning methods. However, these… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: [J]. Expert Systems with Applications, 2023: 119965

  5. arXiv:2304.06896  [pdf, other

    eess.IV cs.AI cs.CV cs.MM

    Machine Perception-Driven Image Compression: A Layered Generative Approach

    Authors: Yuefeng Zhang, Chuanmin Jia, Jiannhui Chang, Siwei Ma

    Abstract: In this age of information, images are a critical medium for storing and transmitting information. With the rapid growth of image data amount, visual compression and visual data perception are two important research topics attracting a lot attention. However, those two topics are rarely discussed together and follow separate research path. Due to the compact compressed domain representation offere… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 12 pages, 12 figures

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology 2024

  6. arXiv:2301.06115  [pdf, other

    cs.CV eess.IV

    Learning to Compress Unmanned Aerial Vehicle (UAV) Captured Video: Benchmark and Analysis

    Authors: Chuanmin Jia, Feng Ye, Huifang Sun, Siwei Ma, Wen Gao

    Abstract: During the past decade, the Unmanned-Aerial-Vehicles (UAVs) have attracted increasing attention due to their flexible, extensive, and dynamic space-sensing capabilities. The volume of video captured by UAVs is exponentially growing along with the increased bitrate generated by the advancement of the sensors mounted on UAVs, bringing new challenges for on-device UAV storage and air-ground data tran… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

    Comments: MPAI End-to-end Video group progress report, DCC 2023

  7. arXiv:2209.02574  [pdf, other

    eess.IV cs.CV cs.MM

    Cross Modal Compression: Towards Human-comprehensible Semantic Compression

    Authors: Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma, Wen Gao

    Abstract: Traditional image/video compression aims to reduce the transmission/storage cost with signal fidelity as high as possible. However, with the increasing demand for machine analysis and semantic monitoring in recent years, semantic fidelity rather than signal fidelity is becoming another emerging concern in image/video compression. With the recent advances in cross modal translation and generation,… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: 10 pages, 4 figures

  8. arXiv:2106.14371  [pdf, other

    cs.SD cs.CL eess.AS

    Sparsely Overlapped Speech Training in the Time Domain: Joint Learning of Target Speech Separation and Personal VAD Benefits

    Authors: Qingjian Lin, Lin Yang, Xuyang Wang, Luyuan Xie, Chen Jia, Junjie Wang

    Abstract: Target speech separation is the process of filtering a certain speaker's voice out of speech mixtures according to the additional speaker identity information provided. Recent works have made considerable improvement by processing signals in the time domain directly. The majority of them take fully overlapped speech mixtures for training. However, since most real-life conversations occur randomly… ▽ More

    Submitted 26 September, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted by APSIPA 2021

  9. arXiv:2106.12954  [pdf, other

    eess.IV cs.CV cs.LG

    Rate Distortion Characteristic Modeling for Neural Image Compression

    Authors: Chuanmin Jia, Ziqing Ge, Shanshe Wang, Siwei Ma, Wen Gao

    Abstract: End-to-end optimized neural image compression (NIC) has obtained superior lossy compression performance recently. In this paper, we consider the problem of rate-distortion (R-D) characteristic analysis and modeling for NIC. We make efforts to formulate the essential mathematical functions to describe the R-D behavior of NIC using deep networks. Thus arbitrary bit-rate points could be elegantly rea… ▽ More

    Submitted 13 January, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: 10 pages, accepted by DCC 2022 as full paper

  10. arXiv:2104.10315  [pdf, ps, other

    eess.IV cs.CV

    Visual Analysis Motivated Rate-Distortion Model for Image Coding

    Authors: Zhimeng Huang, Chuanmin Jia, Shanshe Wang, Siwei Ma

    Abstract: Optimized for pixel fidelity metrics, images compressed by existing image codec are facing systematic challenges when used for visual analysis tasks, especially under low-bitrate coding. This paper proposes a visual analysis-motivated rate-distortion model for Versatile Video Coding (VVC) intra compression. The proposed model has two major contributions, a novel rate allocation strategy and a new… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

  11. arXiv:2103.07131  [pdf, other

    cs.CV eess.IV

    Thousand to One: Semantic Prior Modeling for Conceptual Coding

    Authors: Jianhui Chang, Zhenghui Zhao, Lingbo Yang, Chuanmin Jia, Jian Zhang, Siwei Ma

    Abstract: Conceptual coding has been an emerging research topic recently, which encodes natural images into disentangled conceptual representations for compression. However, the compression performance of the existing methods is still sub-optimal due to the lack of comprehensive consideration of rate constraint and reconstruction quality. To this end, we propose a novel end-to-end semantic prior modeling-ba… ▽ More

    Submitted 15 March, 2021; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: ICME 2021 ORAL accepted

  12. arXiv:2011.04976  [pdf, other

    cs.CV eess.IV

    Conceptual Compression via Deep Structure and Texture Synthesis

    Authors: Jianhui Chang, Zhenghui Zhao, Chuanmin Jia, Shiqi Wang, Lingbo Yang, Qi Mao, Jian Zhang, Siwei Ma

    Abstract: Existing compression methods typically focus on the removal of signal-level redundancies, while the potential and versatility of decomposing visual data into compact conceptual components still lack further study. To this end, we propose a novel conceptual compression framework that encodes visual data into compact structure and texture representations, then decodes in a deep synthesis fashion, ai… ▽ More

    Submitted 10 March, 2022; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 15 pages, 14 figures

  13. arXiv:2006.12696  [pdf, ps, other

    eess.SY

    When Distributed Formation Control Is Feasible under Hard Constraints on Energy and Time?

    Authors: Chunxiang Jia, Fei Chen, Linying Xiang, Weiyao Lan, Gang Feng

    Abstract: This paper studies distributed optimal formation control with hard constraints on energy levels and termination time, in which the formation error is to be minimized jointly with the energy cost. The main contributions include a globally optimal distributed formation control law and a comprehensive analysis of the resulting closed-loop system under those hard constraints. It is revealed that the e… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  14. arXiv:2006.11730  [pdf, other

    eess.SP

    High-Resolution Channel Estimation for Intelligent Reflecting Surface-Assisted MmWave Communications

    Authors: C. Jia, J. Cheng, H. Gao, W. Xu

    Abstract: In this paper, we study the high-resolution channel estimation problem for intelligent reflecting surface (IRS)-assisted millimeter wave (mmWave) multiple-input-multiple-output (MIMO) communications, which is a prerequisite to guarantee further high-rate data transmission. Considering the typical sparsity of mmWave channels, we formulate the cascaded channel estimation problem from a sparse signal… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: 6 pages, 7 figures, conference

  15. arXiv:2004.03428  [pdf, other

    eess.AS cs.CR cs.SD

    Universal Adversarial Perturbations Generative Network for Speaker Recognition

    Authors: Jiguo Li, Xinfeng Zhang, Chuanmin Jia, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao

    Abstract: Attacking deep learning based biometric systems has drawn more and more attention with the wide deployment of fingerprint/face/speaker recognition systems, given the fact that the neural networks are vulnerable to the adversarial examples, which have been intentionally perturbed to remain almost imperceptible for human. In this paper, we demonstrated the existence of the universal adversarial pert… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted by ICME2020

  16. arXiv:2004.03413  [pdf, other

    cs.MM cs.SD eess.AS

    Direct Speech-to-image Translation

    Authors: Jiguo Li, Xinfeng Zhang, Chuanmin Jia, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao

    Abstract: Direct speech-to-image translation without text is an interesting and useful topic due to the potential applications in human-computer interaction, art creation, computer-aided design. etc. Not to mention that many languages have no writing form. However, as far as we know, it has not been well-studied how to translate the speech signals into images directly and how well they can be translated. In… ▽ More

    Submitted 9 April, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted by JSTSP

  17. arXiv:2003.01400  [pdf, ps, other

    cs.IT eess.SP

    OTFS Based Receiver Scheme With Multi-Antennas in High-Mobility V2X Systems

    Authors: Junqiang Cheng, Chenglu Jia, Hui Gao, Wenjun Xu, Zhisong Bie

    Abstract: Vehicle-to-everything (V2X) is considered as one of the most important applications of future wireless communication networks. However, the Doppler effect caused by the vehicle mobility may seriously deteriorate the performance of the vehicular communication links, especially when the channels exhibit a large number of Doppler frequency offsets (DFOs). Orthogonal time frequency space (OTFS) is a n… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: Accepted in IEEE ICC'20 Workshop - V2X-NGD

  18. arXiv:2003.01306  [pdf, other

    eess.SP

    Machine Learning Empowered Beam Management for Intelligent Reflecting Surface Assisted MmWave Networks

    Authors: Chenglu Jia, Hui Gao, Na Chen, Yuan He

    Abstract: Recently, intelligent reflecting surface (IRS) assisted mmWave networks are emerging, which bear the potential to address the blockage issue of the millimeter wave (mmWave) communication in a more cost-effective way. In particular, IRS is built by passive and programmable electromagnetic elements that can manipulate the mmWave propagation channel into a more favorable condition that is free of blo… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

  19. arXiv:1909.13342  [pdf, other

    cs.IT eess.SP

    Interference-Precancelled Pilot Design for LMMSE Channel Estimation of GFDM

    Authors: Ching-Lun Tai, Borching Su, Cai Jia

    Abstract: Generalized frequency division multiplexing (GFDM) is a promising candidate waveform for next-generation wireless communication systems. However, GFDM channel estimation is still challenging due to the inherent interference. In this paper, we formulate a pilot design framework with linear minimum mean square error (LMMSE) channel estimation for GFDM, and propose a novel pilot design to achieve int… ▽ More

    Submitted 29 September, 2019; originally announced September 2019.

    Comments: 5 pages, 6 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. Learn to Sense: a Meta-learning Based Sensing and Fusion Framework for Wireless Sensor Networks

    Authors: Hui Wu, Zhaoyang Zhang, Chunxu Jiao, Chunguang Li, Tony Q. S. Quek

    Abstract: Wireless sensor networks (WSN) acts as the backbone of Internet of Things (IoT) technology. In WSN, field sensing and fusion are the most commonly seen problems, which involve collecting and processing of a huge volume of spatial samples in an unknown field to reconstruct the field or extract its features. One of the major concerns is how to reduce the communication overhead and data redundancy wi… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Paper accepted for publication in IEEE Internet of Things Journal

  21. arXiv:1903.09752  [pdf, ps, other

    eess.SP

    MmWave Communication With Active Ambient Perception

    Authors: Chunxu Jiao, Zhaoyang Zhang, Caijun Zhong, Xiaoming Chen, Zhiyong Feng

    Abstract: In existing communication systems, the channel state information of each UE (user equipment) should be repeatedly estimated when it moves to a new position or another UE takes its place. The underlying ambient information, including the specific layout of potential reflectors, which provides more detailed information about all UEs' channel structures, has not been fully explored and exploited. In… ▽ More

    Submitted 22 March, 2019; originally announced March 2019.

    Comments: Accepted for publication in IEEE Transactions on Wireless Communications

  22. Complex Scene Classification of PolSAR Imagery based on a Self-paced Learning Approach

    Authors: Wenshuai Chen, Shui** Gou, Xinlin Wang, Licheng Jiao, Changzhe Jiao, Alina Zare

    Abstract: Existing polarimetric synthetic aperture radar (PolSAR) image classification methods cannot achieve satisfactory performance on complex scenes characterized by several types of land cover with significant levels of noise or similar scattering properties across land cover types. Hence, we propose a supervised classification method aimed at constructing a classifier based on self-paced learning (SPL… ▽ More

    Submitted 17 March, 2019; originally announced March 2019.

  23. arXiv:1711.00727  [pdf, ps, other

    eess.SP cs.LG cs.NE

    Performance Evaluation of Channel Decoding With Deep Neural Networks

    Authors: Wei Lyu, Zhaoyang Zhang, Chunxu Jiao, Kangjian Qin, Huazi Zhang

    Abstract: With the demand of high data rate and low latency in fifth generation (5G), deep neural network decoder (NND) has become a promising candidate due to its capability of one-shot decoding and parallel computing. In this paper, three types of NND, i.e., multi-layer perceptron (MLP), convolution neural network (CNN) and recurrent neural network (RNN), are proposed with the same parameter magnitude. Th… ▽ More

    Submitted 31 January, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 6 pages, 11 figures, Latex; typos corrected; IEEE ICC 2018 to appear