Showing 1–50 of 65 results for author: Dong, C

  1. arXiv:2406.08038  [pdf, other]

    eess.SP

    Interference Analysis for Coexistence of UAVs and Civil Aircrafts Based on Automatic Dependent Surveillance-Broadcast

    Authors: Yiyang Liao, Ziye Jia, Chao Dong, Lei Zhang, Qihui Wu, Huiling Hu, Zhu Han

    Abstract: Due to the advantages of high mobility and easy deployment, unmanned aerial vehicles (UAVs) are widely applied in both military and civilian fields. In order to strengthen the flight surveillance of UAVs and guarantee the airspace safety, UAVs can be equipped with the automatic dependent surveillance-broadcast (ADS-B) system, which periodically sends flight information to other aircrafts and groun…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.00733  [pdf, other]

    eess.SP

    Joint ADS-B in 5G for Hierarchical Aerial Networks: Performance Analysis and Optimization

    Authors: Ziye Jia, Yiyang Liao, Chao Dong, Lijun He, Qihui Wu, Lei Zhang

    Abstract: Unmanned aerial vehicles (UAVs) are widely applied in multiple fields, which emphasizes the challenge of obtaining UAV flight information to ensure the airspace safety. UAVs equipped with automatic dependent surveillance-broadcast (ADS-B) devices are capable of sending flight information to nearby aircrafts and ground stations (GSs). However, the saturation of limited frequency bands of ADS-B lead…

    Submitted 29 April, 2024; originally announced May 2024.

  3. arXiv:2404.19500  [pdf, other]

    cs.CV cs.AI cs.MM eess.IV

    Towards Real-world Video Face Restoration: A New Benchmark

    Authors: Ziyan Chen, Jingwen He, Xinqi Lin, Yu Qiao, Chao Dong

    Abstract: Blind face restoration (BFR) on images has significantly progressed over the last several years, while real-world video face restoration (VFR), which is more challenging for more complex face motions such as moving gaze directions and facial orientations involved, remains unsolved. Typical BFR methods are evaluated on privately synthesized datasets or self-collected real-world low-quality face ima…

    Submitted 4 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Project page: https://ziyannchen.github.io/projects/VFRxBenchmark/

  4. arXiv:2404.18436  [pdf, other]

    eess.SY

    Three-Dimension Collision-Free Trajectory Planning of UAVs Based on ADS-B Information in Low-Altitude Urban Airspace

    Authors: Chao Dong, Yifan Zhang, Ziye Jia, Yiyang Liao, Lei Zhang, Qihui Wu

    Abstract: The environment of low-altitude urban airspace is complex and variable due to numerous obstacles, non-cooperative aircrafts, and birds. Unmanned aerial vehicles (UAVs) leveraging environmental information to achieve three-dimension collision-free trajectory planning is the prerequisite to ensure airspace security. However, the timely information of surrounding situation is difficult to acquire by…

    Submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2403.15130  [pdf, ps, other]

    cs.IT eess.SP

    Coexisting Passive RIS and Active Relay Assisted NOMA Systems

    Authors: Ao Huang, Li Guo, Xidong Mu, Chao Dong, Yuanwei Liu

    Abstract: A novel coexisting passive reconfigurable intelligent surface (RIS) and active decode-and-forward (DF) relay assisted non-orthogonal multiple access (NOMA) transmission framework is proposed. In particular, two communication protocols are conceived, namely Hybrid NOMA (H-NOMA) and Full NOMA (F-NOMA). Based on the proposed two protocols, both the sum rate maximization and max-min rate fairness prob…

    Submitted 22 March, 2024; originally announced March 2024.

  6. arXiv:2403.05937  [pdf, other]

    cs.CV eess.IV

    Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding

    Authors: Cunhui Dong, Haichuan Ma, Haotian Zhang, Changsheng Gao, Li Li, Dong Liu

    Abstract: Neural network-based image coding has been developing rapidly since its birth. Until 2022, its performance has surpassed that of the best-performing traditional image coding framework -- H.266/VVC. Witnessing such success, the IEEE 1857.11 working subgroup initializes a neural network-based image coding standard project and issues a corresponding call for proposals (CfP). In response to the CfP, t…

    Submitted 9 March, 2024; originally announced March 2024.

  7. arXiv:2312.16057  [pdf, other]

    cs.IT eess.SP

    Semantic Importance-Aware Based for Multi-User Communication Over MIMO Fading Channels

    Authors: Haotai Liang, Zhicheng Bao, Wannian An, Chen Dong, Xiaodong Xu

    Abstract: Semantic communication, as a novel communication paradigm, has attracted the interest of many scholars, with multi-user, multi-input multi-output (MIMO) scenarios being one of the critical contexts. This paper presents a semantic importance-aware based communication system (SIA-SC) over MIMO Rayleigh fading channels. Combining the semantic symbols' inequality and the equivalent subchannels of MIMO…

    Submitted 26 December, 2023; originally announced December 2023.

  8. arXiv:2312.15721  [pdf, ps, other]

    eess.SY

    UAV Trajectory Tracking via RNN-enhanced IMM-KF with ADS-B Data

    Authors: Yian Zhu, Ziye Jia, Qihui Wu, Chao Dong, Zirui Zhuang, Huiling Hu, Qi Cai

    Abstract: With the increasing use of autonomous unmanned aerial vehicles (UAVs), it is critical to ensure that they are continuously tracked and controlled, especially when UAVs operate beyond the communication range of ground stations (GSs). Conventional surveillance methods for UAVs, such as satellite communications, ground mobile networks and radars are subject to high costs and latency. The automatic de…

    Submitted 25 December, 2023; originally announced December 2023.

  9. arXiv:2312.10051  [pdf, other]

    eess.SP

    Semantic Synchronization for Enhanced Reliability in Communication Systems

    Authors: Xiaoyi Liu, Haotai Liang, Chen Dong, Xiaodong Xu

    Abstract: As a new communication paradigm, semantic communication has received widespread attention in communication fields. However, since the decoding of semantic signals relies on contextual knowledge, misalignment between the starting position of the semantic signal and the AI-based semantic decoder would prevent source signal recovery and reconstruction. To achieve more precise semantic communication,…

    Submitted 2 December, 2023; originally announced December 2023.

  10. arXiv:2312.08862  [pdf, other]

    cs.IT eess.SP

    Semantics-Division Duplexing: A Novel Full-Duplex Paradigm

    Authors: Kai Niu, Zijian Liang, Chao Dong, Jincheng Dai, Zhongwei Si, Ping Zhang

    Abstract: In-band full-duplex (IBFD) is a theoretically effective solution to increase the overall throughput for the future wireless communications system by enabling transmission and reception over the same time-frequency resources. However, reliable source reconstruction remains a great challenge in the practical IBFD systems due to the non-ideal elimination of the self-interference and the inherent limi…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures, submitted to IEEE Wireless Communications Magazine

  11. arXiv:2311.15683  [pdf]

    eess.AS cs.SD eess.SP

    Ultrasensitive Textile Strain Sensors Redefine Wearable Silent Speech Interfaces with High Machine Learning Efficiency

    Authors: Chenyu Tang, Muzi Xu, Wentian Yi, Zibo Zhang, Edoardo Occhipinti, Chaoqun Dong, Dafydd Ravenscroft, Sung-Min Jung, Sanghyo Lee, Shuo Gao, Jong Min Kim, Luigi G. Occhipinti

    Abstract: Our research presents a wearable Silent Speech Interface (SSI) technology that excels in device comfort, time-energy efficiency, and speech decoding accuracy for real-world use. We developed a biocompatible, durable textile choker with an embedded graphene-based strain sensor, capable of accurately detecting subtle throat movements. This sensor, surpassing other strain sensors in sensitivity by 42…

    Submitted 7 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 5 figures in the article; 11 figures and 4 tables in supplementary information

    Journal ref: npj Flexible Electronics (2024)

  12. arXiv:2311.15593  [pdf, other]

    cs.IT cs.PF eess.SP

    Performance Analysis of MDMA-Based Cooperative MRC Networks with Relays in Dissimilar Rayleigh Fading Channels

    Authors: Lei Teng, Wannian An, Chen Dong, Xiaoqi Qin, Xiaodong Xu

    Abstract: Multiple access technology is a key technology in various generations of wireless communication systems. As a potential multiple access technology for the next generation wireless communication systems, model division multiple access (MDMA) technology improves spectrum efficiency and feasibility regions. This implies that the MDMA scheme can achieve greater performance gains compared to traditiona…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures, conference

  13. arXiv:2311.06968  [pdf, other]

    cs.LG cs.AI eess.SP stat.ML

    Physics-Informed Data Denoising for Real-Life Sensing Systems

    Authors: Xiyuan Zhang, Xiaohan Fu, Diyan Teng, Chengyu Dong, Keerthivasan Vijayakumar, Jiayun Zhang, Ranak Roy Chowdhury, Junsheng Han, Dezhi Hong, Rashmi Kulkarni, Jingbo Shang, Rajesh Gupta

    Abstract: Sensors measuring real-life physical processes are ubiquitous in today's interconnected world. These sensors inherently bear noise that often adversely affects performance and reliability of the systems they support. Classic filtering-based approaches introduce strong assumptions on the time or frequency characteristics of sensory measurements, while learning-based denoising approaches typically r…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: SenSys 2023

  14. arXiv:2310.10513  [pdf, other]

    cs.CV eess.IV

    Unifying Image Processing as Visual Prompting Question Answering

    Authors: Yihao Liu, Xiangyu Chen, Xianzheng Ma, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong

    Abstract: Image processing is a fundamental task in computer vision, which aims at enhancing image quality and extracting essential features for subsequent vision applications. Traditionally, task-specific models are developed for individual tasks and designing such models requires distinct expertise. Building upon the success of large language models (LLMs) in natural language processing (NLP), there is a…

    Submitted 20 February, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 16 pages, 12 figures

  15. arXiv:2309.11992  [pdf, other]

    eess.SP cs.NI

    UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning

    Authors: Jia He, Ziye Jia, Chao Dong, Junyu Liu, Qihui Wu, Jingxian Liu

    Abstract: Unmanned aerial vehicles (UAVs) are recognized as promising technologies for area coverage due to the flexibility and adaptability. However, the ability of a single UAV is limited, and as for the large-scale three-dimensional (3D) scenario, UAV swarms can establish seamless wireless communication services. Hence, in this work, we consider a scenario of UAV swarm deployment and trajectory to satisf…

    Submitted 21 September, 2023; originally announced September 2023.

  16. Symbol Detection for Coarsely Quantized OTFS

    Authors: Junwei He, Haochuan Zhang, Chao Dong, Huimin Zhu

    Abstract: This paper explicitly models a coarse and noisy quantization in a communication system empowered by orthogonal time frequency space (OTFS) for cost and power efficiency. We first point out, with coarse quantization, the effective channel is imbalanced and thus no longer able to circularly shift the transmitted symbols along the delay-Doppler domain. Meanwhile, the effective channel is non-isotropi…

    Submitted 20 January, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  17. arXiv:2309.05929  [pdf]

    eess.IV cs.CV

    Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation

    Authors: Zhiqing Zhang, Guojia Fan, Tianyong Liu, Nan Li, Yuyang Liu, Ziyu Liu, Canwei Dong, Shoujun Zhou

    Abstract: Medical image segmentation is critical for diagnosing and treating spinal disorders. However, the presence of high noise, ambiguity, and uncertainty makes this task highly challenging. Factors such as unclear anatomical boundaries, inter-class similarities, and irrational annotations contribute to this challenge. Achieving both accurate and diverse segmentation templates is essential to support ra…

    Submitted 11 September, 2023; originally announced September 2023.

  18. arXiv:2309.04084  [pdf, other]

    cs.CV cs.MM eess.IV

    Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation

    Authors: Xiangyu Chen, Zheyuan Li, Zhengwen Zhang, Jimmy S. Ren, Yihao Liu, Jingwen He, Yu Qiao, Jiantao Zhou, Chao Dong

    Abstract: Modern displays are capable of rendering video content with high dynamic range (HDR) and wide color gamut (WCG). However, the majority of available resources are still in standard dynamic range (SDR). As a result, there is significant value in transforming existing SDR content into the HDRTV standard. In this paper, we define and analyze the SDRTV-to-HDRTV task by modeling the formation of SDRTV/H…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Extended version of HDRTVNet

  19. arXiv:2309.01615  [pdf]

    eess.SP cs.ET

    A balanced Memristor-CMOS ternary logic family and its application

    Authors: Xiao-Yuan Wang, Jia-Wei Zhou, Chuan-Tao Dong, Xin-Hui Chen, Sanjoy Kumar Nandi, Robert G. Elliman, Sung-Mo Kang, Herbert Ho-Ching Iu

    Abstract: The design of balanced ternary digital logic circuits based on memristors and conventional CMOS devices is proposed. First, balanced ternary minimum gate TMIN, maximum gate TMAX and ternary inverters are systematically designed and verified by simulation, and then logic circuits such as ternary encoders, decoders and multiplexers are designed on this basis. Two different schemes are then used to r…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 15 pages, 30 figures

  20. arXiv:2308.02332   

    eess.SP eess.SY

    Novel Online-Offline MA2C-DDPG for Efficient Spectrum Allocation and Trajectory Optimization in Dynamic Spectrum Sharing UAV Networks

    Authors: Rui Ding, Fuhui Zhou, Yuben Qu, Chao Dong, Qihui Wu, Tony Q. S. Quek

    Abstract: Unmanned aerial vehicle (UAV) communication is of crucial importance for diverse practical applications. However, it is susceptible to the severe spectrum scarcity problem and interference since it operates in the unlicensed spectrum band. In order to tackle those issues, a dynamic spectrum sharing network is considered with the anti-jamming technique. Moreover, an intelligent spectrum allocation…

    Submitted 27 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: Some technical errors occurred in the manuscript

  21. arXiv:2307.01534  [pdf, other]

    eess.SP

    Impact of UAVs Equipped with ADS-B on the Civil Aviation Monitoring System

    Authors: Yiyang Liao, Lei Zhang, Ziye Jia, Chao Dong, Yifan Zhang, Qihui Wu, Huiling Hu, Bin Wang

    Abstract: In recent years, there is an increasing demand for unmanned aerial vehicles (UAVs) to complete multiple applications. However, as unmanned equipments, UAVs lead to some security risks to general civil aviations. In order to strengthen the flight management of UAVs and guarantee the safety, UAVs can be equipped with automatic dependent surveillance-broadcast (ADS-B) devices. In addition, as an auto…

    Submitted 4 July, 2023; originally announced July 2023.

  22. arXiv:2307.00234  [pdf, ps, other]

    cs.NI eess.SP

    The Potential of LEO Satellites in 6G Space-Air-Ground Enabled Access Networks

    Authors: Ziye Jia, Chao Dong, Kun Guo, Qihui Wu

    Abstract: Space-air-ground integrated networks (SAGINs) help enhance the service performance in the sixth generation communication system. SAGIN is basically composed of satellites, aerial vehicles, ground facilities, as well as multiple terrestrial users. Therein, the low earth orbit (LEO) satellites are popular in recent years due to the low cost of development and launch, global coverage and delay-enable…

    Submitted 1 July, 2023; originally announced July 2023.

  23. arXiv:2305.18107  [pdf, other]

    cs.CV eess.IV

    Crafting Training Degradation Distribution for the Accuracy-Generalization Trade-off in Real-World Super-Resolution

    Authors: Ruofan Zhang, Jinjin Gu, Haoyu Chen, Chao Dong, Yulun Zhang, Wenming Yang

    Abstract: Super-resolution (SR) techniques designed for real-world applications commonly encounter two primary challenges: generalization performance and restoration accuracy. We demonstrate that when methods are trained using complex, large-range degradations to enhance generalization, a decline in accuracy is inevitable. However, since the degradation in a certain real-world applications typically exhibit…

    Submitted 1 June, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted to ICML 2023

  24. arXiv:2304.09420  [pdf, other]

    eess.IV

    Latent Semantic Diffusion-based Channel Adaptive De-Noising SemCom for Future 6G Systems

    Authors: Bingxuan Xu, Rui Meng, Yue Chen, Xiaodong Xu, Chen Dong, Hao Sun

    Abstract: Compared with the current Shannon's Classical Information Theory (CIT) paradigm, semantic communication (SemCom) has recently attracted more attention, since it aims to transmit the meaning of information rather than bit-by-bit transmission, thus enhancing data transmission efficiency and supporting future human-centric, data-, and resource-intensive intelligent services in 6G systems. Nevertheles…

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 6 pages, 7 figures

  25. arXiv:2303.06933  [pdf, ps, other]

    cs.NI eess.SP

    Distributionally Robust Chance-Constrained Optimization for Hierarchical UAV-based MEC

    Authors: Can Cui, Ziye Jia, Chao Dong, Zhuang Ling, Jiahao You, Qihui Wu

    Abstract: Multi-access edge computing (MEC) is regarded as a promising technology in the sixth-generation communication. However, the antenna gain is always affected by the environment when unmanned aerial vehicles (UAVs) are served as MEC platforms, resulting in unexpected channel errors. In order to deal with the problem and reduce the power consumption in the UAV-based MEC, we jointly optimize the access…

    Submitted 13 March, 2023; originally announced March 2023.

  26. Non-Orthogonal Multiple Access Enhanced Multi-User Semantic Communication

    Authors: Weizhi Li, Haotai Liang, Chen Dong, Xiaodong Xu, Ping Zhang, Kaijun Liu

    Abstract: Semantic communication serves as a novel paradigm and attracts the broad interest of researchers. One critical aspect of it is the multi-user semantic communication theory, which can further promote its application to the practical network environment. While most existing works focused on the design of end-to-end single-user semantic transmission, a novel non-orthogonal multiple access (NOMA)-base…

    Submitted 20 November, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: accepted by IEEE Transactions on Cognitive Communications and Networking

  27. arXiv:2303.01020  [pdf, other]

    cs.NI eess.SP

    SFC Deployment in Space-Air-Ground Integrated Networks Based on Matching Game

    Authors: Yilu Cao, Ziye Jia, Chao Dong, Yanting Wang, Jiahao You, Qihui Wu

    Abstract: The space-air-ground integrated network (SAGIN) is dynamic and flexible, which can support transmitting data in environments lacking ground communication facilities. However, the nodes of SAGIN are heterogeneous and it is intractable to share the resources to provide multiple services. Therefore, in this paper, we consider using network function virtualization technology to handle the problem of a…

    Submitted 2 March, 2023; originally announced March 2023.

  28. arXiv:2302.03453  [pdf, other]

    eess.IV cs.CV

    OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer

    Authors: Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong

    Abstract: Omnidirectional images (ODIs) have obtained lots of research interest for immersive experiences. Although ODIs require extremely high resolution to capture details of the entire scene, the resolutions of most ODIs are insufficient. Previous methods attempt to solve this issue by image super-resolution (SR) on equirectangular projection (ERP) images. However, they omit geometric properties of ERP i…

    Submitted 9 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: main paper + supplement

  29. arXiv:2301.03331  [pdf, other]

    cs.CV cs.AI eess.IV

    A Specific Task-oriented Semantic Image Communication System for substation patrol inspection

    Authors: Senran Fan, Haotai Liang, Chen Dong, Xiaodong Xu, Geng Liu

    Abstract: Intelligent inspection robots are widely used in substation patrol inspection, which can help check potential safety hazards by patrolling the substation and sending back scene images. However, when patrolling some marginal areas with weak signal, the scene images cannot be successfully transmitted to be used for hidden danger elimination, which greatly reduces the quality of robots' daily work. To…

    Submitted 13 April, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: 9 pages, 8 figures

    Journal ref: IEEE Transactions on Power Delivery; vol. 39; no. 2; pp. 835-844; April 2024

  30. arXiv:2210.05960  [pdf, other]

    eess.IV cs.CV

    Efficient Image Super-Resolution using Vast-Receptive-Field Attention

    Authors: Lin Zhou, Haoming Cai, Jinjin Gu, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Yu Qiao, Chao Dong

    Abstract: The attention mechanism plays a pivotal role in designing advanced super-resolution (SR) networks. In this work, we design an efficient SR network by improving the attention mechanism. We start from a simple pixel attention module and gradually modify it to achieve better super-resolution performance with reduced parameters. The specific approaches include: (1) increasing the receptive field of th…

    Submitted 12 October, 2022; originally announced October 2022.

  31. arXiv:2210.04198  [pdf, other]

    eess.IV cs.CV

    Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images

    Authors: Jinjin Gu, Haoming Cai, Chenyu Dong, Ruofan Zhang, Yulun Zhang, Wenming Yang, Chun Yuan

    Abstract: Rendering high-resolution (HR) graphics brings substantial computational costs. Efficient graphics super-resolution (SR) methods may achieve HR rendering with small computing resources and have attracted extensive research interests in industry and research communities. We present a new method for real-time SR for computer graphics, namely Super-Resolution by Predicting Offsets (SRPO). Our algorit…

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: This article has been accepted by ECCV2022

  32. arXiv:2209.01809  [pdf, other]

    eess.IV cs.CV

    UDC-UNet: Under-Display Camera Image Restoration via U-Shape Dynamic Network

    Authors: Xina Liu, Jinfan Hu, Xiangyu Chen, Chao Dong

    Abstract: Under-Display Camera (UDC) has been widely exploited to help smartphones realize full screen display. However, as the screen could inevitably affect the light propagation process, the images captured by the UDC system usually contain flare, haze, blur, and noise. Particularly, flare and blur in UDC images could severely deteriorate the user experience in high dynamic range (HDR) scenes. In this pa…

    Submitted 11 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

  33. arXiv:2209.00436  [pdf, other]

    eess.SP

    Recurrent LSTM-based UAV Trajectory Prediction with ADS-B Information

    Authors: Yifan Zhang, Ziye Jia, Chao Dong, Yuntian Liu, Lei Zhang, Qihui Wu

    Abstract: Recently, unmanned aerial vehicles (UAVs) are gathering increasing attentions from both the academia and industry. The ever-growing number of UAV brings challenges for air traffic control (ATC), and thus trajectory prediction plays a vital role in ATC, especially for avoiding collisions among UAVs. However, the dynamic flight of UAV aggravates the complexity of trajectory prediction. Different wit…

    Submitted 1 September, 2022; originally announced September 2022.

  34. arXiv:2206.11695  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on Perceptual Image Quality Assessment

    Authors: Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Radu Timofte

    Abstract: This paper reports on the NTIRE 2022 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2022. This challenge is held to address the emerging challenge of IQA by perceptual image processing algorithms. The output images of these algorithms have completely different characteristics fro…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: This report has been published in CVPR 2022 NTIRE workshop. arXiv admin note: text overlap with arXiv:2105.03072

  35. arXiv:2205.12633  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, ** Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang , et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  36. arXiv:2205.07019  [pdf, other]

    cs.CV eess.IV

    Evaluating the Generalization Ability of Super-Resolution Networks

    Authors: Yihao Liu, Hengyuan Zhao, Jinjin Gu, Yu Qiao, Chao Dong

    Abstract: Performance and generalization ability are two important aspects to evaluate the deep learning models. However, research on the generalization ability of Super-Resolution (SR) networks is currently absent. Assessing the generalization ability of deep models not only helps us to understand their intrinsic mechanisms, but also allows us to quantitatively measure their applicability boundaries, which…

    Submitted 3 September, 2023; v1 submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted by TPAMI

  37. arXiv:2205.05996  [pdf, other]

    cs.CV eess.IV

    Blueprint Separable Residual Network for Efficient Image Super-Resolution

    Authors: Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Jinjin Gu, Yu Qiao, Chao Dong

    Abstract: Recent advances in single image super-resolution (SISR) have achieved extraordinary performance, but the computational cost is too heavy to apply in edge devices. To alleviate this problem, many novel and effective solutions have been proposed. Convolutional neural network (CNN) with the attention mechanism has attracted increasing attention due to its efficiency and effectiveness. However, there…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted to CVPR Workshops

  38. arXiv:2205.05675  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang , et al. (86 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of e…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

  39. arXiv:2205.05671  [pdf, other]

    cs.CV cs.LG eess.IV

    RepSR: Training Efficient VGG-style Super-Resolution Networks with Structural Re-Parameterization and Batch Normalization

    Authors: Xintao Wang, Chao Dong, Ying Shan

    Abstract: This paper explores training efficient VGG-style super-resolution (SR) networks with the structural re-parameterization technique. The general pipeline of re-parameterization is to train networks with multi-branch topology first, and then merge them into standard 3x3 convolutions for efficient inference. In this work, we revisit those primary designs and investigate essential components for re-par…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Technical Report. Codes will be available at https://github.com/TencentARC/RepSR

  40. arXiv:2205.05065  [pdf, other]

    cs.CV eess.IV

    MM-RealSR: Metric Learning based Interactive Modulation for Real-World Super-Resolution

    Authors: Chong Mou, Yanze Wu, Xintao Wang, Chao Dong, Jian Zhang, Ying Shan

    Abstract: Interactive image restoration aims to restore images by adjusting several controlling coefficients, which determine the restoration strength. Existing methods are restricted in learning the controllable functions under the supervision of known degradation types and levels. They usually suffer from a severe performance drop when the real degradation is different from their assumptions. Such a limit…

    Submitted 27 July, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: Accepted by ECCV 2022. Code is available at: https://github.com/TencentARC/MM-RealSR

  41. arXiv:2205.04910  [pdf, other]

    eess.IV cs.CV

    A Closer Look at Blind Super-Resolution: Degradation Models, Baselines, and Performance Upper Bounds

    Authors: Wenlong Zhang, Guangyuan Shi, Yihao Liu, Chao Dong, Xiao-Ming Wu

    Abstract: Degradation models play an important role in Blind super-resolution (SR). The classical degradation model, which mainly involves blur degradation, is too simple to simulate real-world scenarios. The recently proposed practical degradation model includes a full spectrum of degradation types, but only considers complex cases that use all degradation types in the degradation process, while ignoring m…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Accepted by CVPR Workshop, NTIRE 2022

  42. arXiv:2205.04437  [pdf, other]

    eess.IV cs.CV

    Activating More Pixels in Image Super-Resolution Transformer

    Authors: Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong

    Abstract: Transformer-based methods have shown impressive performance in low-level vision tasks, such as image super-resolution. However, we find that these networks can only utilize a limited spatial range of input information through attribution analysis. This implies that the potential of Transformer is still not fully exploited in existing networks. In order to activate more input pixels for better reco…

    Submitted 18 March, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted to CVPR2023

  43. arXiv:2205.03409  [pdf, other]

    eess.IV cs.AI cs.CV cs.MM

    VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution

    Authors: Liangbin Xie, Xintao Wang, Honglun Zhang, Chao Dong, Ying Shan

    Abstract: Most of the existing video face super-resolution (VFSR) methods are trained and evaluated on VoxCeleb1, which is designed specifically for speaker identification and the frames in this dataset are of low quality. As a consequence, the VFSR models trained on this dataset can not output visual-pleasing results. In this paper, we develop an automatic and scalable pipeline to collect a high-quality vi…

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Project webpage available at https://liangbinxie.github.io/projects/vfhq

  44. arXiv:2202.09595  [pdf, other]

    eess.SP

    Innovative semantic communication system

    Authors: Chen Dong, Haotai Liang, Xiaodong Xu, Shujun Han, Bizhu Wang, ** Zhang

    Abstract: Traditional communication systems focus on the transmission process, and the context-dependent meaning has been ignored. The fact that 5G system has approached Shannon limit and the increasing amount of data will cause communication bottleneck, such as the increased delay problems. Inspired by the ability of artificial intelligence to understand semantics, we propose a new communication paradigm,…

    Submitted 19 February, 2022; originally announced February 2022.

  45. arXiv:2202.06046  [pdf, ps, other]

    cs.NI eess.SP

    Hierarchical Aerial Computing for Internet of Things via Cooperation of HAPs and UAVs

    Authors: Ziye Jia, Qihui Wu, Chao Dong, Chau Yuen, Zhu Han

    Abstract: With the explosive increment of computation requirements, the multi-access edge computing (MEC) paradigm appears as an effective mechanism. Besides, as for the Internet of Things (IoT) in disasters or remote areas requiring MEC services, unmanned aerial vehicles (UAVs) and high altitude platforms (HAPs) are available to provide aerial computing services for these IoT devices. In this paper, we dev…

    Submitted 12 February, 2022; originally announced February 2022.

  46. arXiv:2201.08517  [pdf, other]

    cs.NI eess.SP

    Unmanned Aerial Vehicle Swarm-Enabled Edge Computing: Potentials, Promising Technologies, and Challenges

    Authors: Wei Wu, Fuhui Zhou, Baoyun Wang, Qihui Wu, Chao Dong, Rose Qingyang Hu

    Abstract: Unmanned aerial vehicle (UAV) swarm enabled edge computing is envisioned to be promising in the sixth generation wireless communication networks due to their wide application sensories and flexible deployment. However, most of the existing works focus on edge computing enabled by a single or a small scale UAVs, which are very different from UAV swarm-enabled edge computing. In order to facilitate…

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 17 pages, 5 figures, to be published in IEEE Wireless Communications Magazine

  47. arXiv:2110.04562  [pdf, other]

    cs.CV eess.IV

    Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning

    Authors: Yihao Liu, Hengyuan Zhao, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Yu Qiao, Chao Dong

    Abstract: Video colorization is a challenging and highly ill-posed problem. Although recent years have witnessed remarkable progress in single image colorization, there is relatively less research effort on video colorization and existing methods always suffer from severe flickering artifacts (temporal inconsistency) or unsatisfying colorization performance. We address this problem from a new perspective, b…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: 13 pages, 10 figures

  48. arXiv:2109.14837  [pdf, other]

    eess.IV cs.CV

    End-to-End Image Compression with Probabilistic Decoding

    Authors: Haichuan Ma, Dong Liu, Cunhui Dong, Li Li, Feng Wu

    Abstract: Lossy image compression is a many-to-one process, thus one bitstream corresponds to multiple possible original images, especially at low bit rates. However, this nature was seldom considered in previous studies on image compression, which usually chose one possible image as reconstruction, e.g. the one with the maximal a posteriori probability. We propose a learned image compression framework to n…

    Submitted 30 September, 2021; originally announced September 2021.

  49. arXiv:2108.07978  [pdf, other]

    eess.IV cs.CV

    A New Journey from SDRTV to HDRTV

    Authors: Xiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao, Chao Dong

    Abstract: Nowadays modern displays are capable to render video content with high dynamic range (HDR) and wide color gamut (WCG). However, most available resources are still in standard dynamic range (SDR). Therefore, there is an urgent demand to transform existing SDR-TV contents into their HDR-TV versions. In this paper, we conduct an analysis of SDRTV-to-HDRTV task by modeling the formation of SDRTV/HDRTV…

    Submitted 25 September, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV

  50. arXiv:2107.10833  [pdf, other]

    eess.IV cs.CV

    Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

    Authors: Xintao Wang, Liangbin Xie, Chao Dong, Ying Shan

    Abstract: Though many attempts have been made in blind super-resolution to restore low-resolution images with unknown and complex degradations, they are still far from addressing general real-world degraded images. In this work, we extend the powerful ESRGAN to a practical restoration application (namely, Real-ESRGAN), which is trained with pure synthetic data. Specifically, a high-order degradation modelin…

    Submitted 17 August, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Tech Report. Training/testing codes and executable files are in https://github.com/xinntao/Real-ESRGAN