Skip to main content

Showing 1–50 of 56 results for author: Feng, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.07410  [pdf, other

    eess.AS

    Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

    Authors: Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

    Abstract: We uncover an underlying bias present in the audio recordings produced from the picture description task of the Pitt corpus, the largest publicly accessible database for Alzheimer's Disease (AD) detection research. Even by solely utilizing the silent segments of these audio recordings, we achieve nearly 100% accuracy in AD detection. However, employing the same methods to other datasets and prepro… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  2. arXiv:2405.16980  [pdf, other

    cs.CV eess.IV

    DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking

    Authors: Hongtao Wang, Rongyu Feng, Liangyi Wu, Mutian Liu, Yinuo Cui, Chunxia Zhang, Zhenbo Guo

    Abstract: In seismic exploration, identifying the first break (FB) is a critical component in establishing subsurface velocity models. Various automatic picking techniques based on deep neural networks have been developed to expedite this procedure. The most popular class is using semantic segmentation networks to pick on a shot gather called 2-dimensional (2-D) picking. Generally, 2-D segmentation-based pi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhi**g Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Hai** Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  4. arXiv:2404.02710  [pdf, other

    cs.CL eess.AS

    ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation

    Authors: Zheng Yuan, Dorina de Jong, Štefan Beňuš, Noël Nguyen, Ruitao Feng, Róbert Sabo, Luciano Fadiga, Alessandro D`Ausilio

    Abstract: We introduce the Alternating Reading Task (ART) Corpus, a collection of dyadic sentence reading for studying the entrainment and imitation behaviour in speech communication. The ART corpus features three experimental conditions - solo reading, alternating reading, and deliberate imitation - as well as three sub-corpora encompassing French-, Italian-, and Slovak-accented English. This design allows… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 15 pages, 2 figures, 7 tables, accepted at LREC-COLING 2024 conference

  5. arXiv:2403.11953  [pdf, other

    eess.IV cs.CV

    Advancing COVID-19 Detection in 3D CT Scans

    Authors: Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

    Abstract: To make a more accurate diagnosis of COVID-19, we propose a straightforward yet effective model. Firstly, we analyse the characteristics of 3D CT scans and remove the non-lung parts, facilitating the model to focus on lesion-related areas and reducing computational cost. We use ResNeSt50 as the strong feature extractor, initializing it with pretrained weights which have COVID-19-specific prior kno… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  6. arXiv:2403.11498  [pdf, other

    eess.IV cs.CV

    Domain Adaptation Using Pseudo Labels for COVID-19 Detection

    Authors: Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

    Abstract: In response to the need for rapid and accurate COVID-19 diagnosis during the global pandemic, we present a two-stage framework that leverages pseudo labels for domain adaptation to enhance the detection of COVID-19 from CT scans. By utilizing annotated data from one domain and non-annotated data from another, the model overcomes the challenge of data scarcity and variability, common in emergent he… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  7. arXiv:2402.19387  [pdf, other

    eess.IV cs.CV

    SeD: Semantic-Aware Discriminator for Image Super-Resolution

    Authors: Bingchen Li, Xin Li, Hanxin Zhu, Yeying **, Ruoyu Feng, Zhizheng Zhang, Zhibo Chen

    Abstract: Generative Adversarial Networks (GANs) have been widely used to recover vivid textures in image super-resolution (SR) tasks. In particular, one discriminator is utilized to enable the SR network to learn the distribution of real-world high-quality images in an adversarial training manner. However, the distribution learning is overly coarse-grained, which is susceptible to virtual textures and caus… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: CVPR2024

  8. arXiv:2401.13959  [pdf, other

    eess.IV cs.CV

    Conditional Neural Video Coding with Spatial-Temporal Super-Resolution

    Authors: Henan Wang, Xiaohan Pan, Runsen Feng, Zongyu Guo, Zhibo Chen

    Abstract: This document is an expanded version of a one-page abstract originally presented at the 2024 Data Compression Conference. It describes our proposed method for the video track of the Challenge on Learned Image Compression (CLIC) 2024. Our scheme follows the typical hybrid coding framework with some novel techniques. Firstly, we adopt Spynet network to produce accurate motion vectors for motion esti… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by the 2024 Data Compression Conference (DCC) for presentation as a poster

  9. arXiv:2312.00568  [pdf, ps, other

    eess.SP

    A WINNER+ Based 3-D Non-Stationary Wideband MIMO Channel Model

    Authors: Ji Bian, Jian Sun, Cheng-Xiang Wang, Rui Feng, Jie Huang, Yang Yang, Minggao Zhang

    Abstract: In this paper, a three-dimensional (3-D) non-stationary wideband multiple-input multiple-output (MIMO) channel model based on the WINNER+ channel model is proposed. The angular distributions of clusters in both the horizontal and vertical planes are jointly considered. The receiver and clusters can be moving, which makes the model more general. Parameters including number of clusters, powers, dela… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  10. arXiv:2311.12892  [pdf

    eess.IV cs.CV cs.LG physics.med-ph

    IMJENSE: Scan-specific Implicit Representation for Joint Coil Sensitivity and Image Estimation in Parallel MRI

    Authors: Ruimin Feng, Qing Wu, Jie Feng, Huajun She, Chunlei Liu, Yuyao Zhang, Hongjiang Wei

    Abstract: Parallel imaging is a commonly used technique to accelerate magnetic resonance imaging (MRI) data acquisition. Mathematically, parallel MRI reconstruction can be formulated as an inverse problem relating the sparsely sampled k-space measurements to the desired MRI image. Despite the success of many existing reconstruction algorithms, it remains a challenge to reliably reconstruct a high-quality im… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  11. arXiv:2310.09625  [pdf, other

    eess.IV cs.CV

    JSMoCo: Joint Coil Sensitivity and Motion Correction in Parallel MRI with a Self-Calibrating Score-Based Diffusion Model

    Authors: Lixuan Chen, Xuanyu Tian, Jiangjie Wu, Ruimin Feng, Guoyan Lao, Yuyao Zhang, Hongjiang Wei

    Abstract: Magnetic Resonance Imaging (MRI) stands as a powerful modality in clinical diagnosis. However, it is known that MRI faces challenges such as long acquisition time and vulnerability to motion-induced artifacts. Despite the success of many existing motion correction algorithms, there has been limited research focused on correcting motion artifacts on the estimated coil sensitivity maps for fast MRI… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: 10 pages,8 figures, journal

  12. arXiv:2306.04236  [pdf, other

    cs.CV eess.IV

    Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yihang Luo, Chen Change Loy

    Abstract: Artificial lights commonly leave strong lens flare artifacts on the images captured at night, degrading both the visual quality and performance of vision algorithms. Existing flare removal approaches mainly focus on removing daytime flares and fail in nighttime cases. Nighttime flare removal is challenging due to the unique luminance and spectrum of artificial lights, as well as the diverse patter… ▽ More

    Submitted 7 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Extension of arXiv:2210.06570; Project page at https://ykdai.github.io/projects/Flare7K

  13. arXiv:2305.16025  [pdf, other

    cs.CV eess.IV

    NVTC: Nonlinear Vector Transform Coding

    Authors: Runsen Feng, Zongyu Guo, Wei** Li, Zhibo Chen

    Abstract: In theory, vector quantization (VQ) is always better than scalar quantization (SQ) in terms of rate-distortion (R-D) performance. Recent state-of-the-art methods for neural image compression are mainly based on nonlinear transform coding (NTC) with uniform scalar quantization, overlooking the benefits of VQ due to its exponentially increased complexity. In this paper, we first investigate on some… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted by CVPR 2023

  14. arXiv:2305.13770  [pdf, other

    cs.CV eess.IV

    MIPI 2023 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, **wei Gu

    Abstract: Develo** and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2023/

  15. arXiv:2305.07678  [pdf, other

    eess.IV cs.IT cs.LG

    Exploring the Rate-Distortion-Complexity Optimization in Neural Image Compression

    Authors: Yixin Gao, Runsen Feng, Zongyu Guo, Zhibo Chen

    Abstract: Despite a short history, neural image codecs have been shown to surpass classical image codecs in terms of rate-distortion performance. However, most of them suffer from significantly longer decoding times, which hinders the practical applications of neural image codecs. This issue is especially pronounced when employing an effective yet time-consuming autoregressive context model since it would i… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  16. arXiv:2305.02586  [pdf, other

    eess.IV cs.CV

    Semantically Structured Image Compression via Irregular Group-Based Decoupling

    Authors: Ruoyu Feng, Yixin Gao, Xin **, Runsen Feng, Zhibo Chen

    Abstract: Image compression techniques typically focus on compressing rectangular images for human consumption, however, resulting in transmitting redundant content for downstream applications. To overcome this limitation, some previous works propose to semantically structure the bitstream, which can meet specific application requirements by selective transmission and reconstruction. Nevertheless, they divi… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  17. arXiv:2304.10551  [pdf, other

    eess.IV cs.CV

    MIPI 2023 Challenge on RGBW Remosaic: Methods and Results

    Authors: Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, **wei Gu

    Abstract: Develo** and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for an in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imag… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Remosaic Challenge Report. Website: https://mipi-challenge.org/MIPI2023/. arXiv admin note: substantial text overlap with arXiv:2209.08471, arXiv:2209.07060, arXiv:2209.07530, arXiv:2304.10089

  18. arXiv:2304.10089  [pdf, other

    eess.IV cs.CV

    MIPI 2023 Challenge on RGBW Fusion: Methods and Results

    Authors: Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, **wei Gu

    Abstract: Develo** and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for an in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imag… ▽ More

    Submitted 24 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Fusion Challenge Report. Website: https://mipi-challenge.org/MIPI2023/. arXiv admin note: substantial text overlap with arXiv:2209.07530, arXiv:2209.08471, arXiv:2209.07060

  19. arXiv:2304.06019  [pdf, other

    cs.CV eess.IV

    Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera

    Authors: Ruicheng Feng, Chongyi Li, Huai** Chen, Shuai Li, **wei Gu, Chen Change Loy

    Abstract: Due to the difficulty in collecting large-scale and perfectly aligned paired training data for Under-Display Camera (UDC) image restoration, previous methods resort to monitor-based image systems or simulation-based methods, sacrificing the realness of the data and introducing domain gaps. In this work, we revisit the classic stereo setup for training data collection -- capturing two images of the… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023

  20. arXiv:2304.02389  [pdf, other

    eess.IV cs.CV cs.LG

    DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images

    Authors: Bo Qian, Hao Chen, Xiangning Wang, Haoxuan Che, Gitaek Kwon, Jaeyoung Kim, Sung** Choi, Seoyoung Shin, Felix Krause, Markus Unterdechler, Junlin Hou, Rui Feng, Yihao Li, Mostafa El Habib Daho, Qiang Wu, ** Zhang, Xiaokang Yang, Yiyu Cai, Wei** Jia, Huating Li, Bin Sheng

    Abstract: Computer-assisted automatic analysis of diabetic retinopathy (DR) is of great importance in reducing the risks of vision loss and even blindness. Ultra-wide optical coherence tomography angiography (UW-OCTA) is a non-invasive and safe imaging modality in DR diagnosis system, but there is a lack of publicly available benchmarks for model development and evaluation. To promote further research and s… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  21. arXiv:2303.05338  [pdf, other

    cs.SD cs.MM eess.AS

    MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning

    Authors: Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu

    Abstract: Audio-visual learning helps to comprehensively understand the world by fusing practical information from multiple modalities. However, recent studies show that the imbalanced optimization of uni-modal encoders in a joint-learning model is a bottleneck to enhancing the model's performance. We further find that the up-to-date imbalance-mitigating methods fail on some audio-visual fine-grained tasks,… ▽ More

    Submitted 11 March, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  22. arXiv:2302.03533  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Revisiting Pre-training in Audio-Visual Learning

    Authors: Ruoxuan Feng, Wenke Xia, Di Hu

    Abstract: Pre-training technique has gained tremendous success in enhancing model performance on various tasks, but found to perform worse than training from scratch in some uni-modal situations. This inspires us to think: are the pre-trained models always effective in the more complex multi-modal scenario, especially for the heterogeneous modalities such as audio and visual ones? We find that the answer is… ▽ More

    Submitted 17 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  23. arXiv:2301.00127  [pdf, other

    eess.IV cs.CV physics.med-ph

    Spatiotemporal implicit neural representation for unsupervised dynamic MRI reconstruction

    Authors: Jie Feng, Ruimin Feng, Qing Wu, Zhiyong Zhang, Yuyao Zhang, Hongjiang Wei

    Abstract: Supervised Deep-Learning (DL)-based reconstruction algorithms have shown state-of-the-art results for highly-undersampled dynamic Magnetic Resonance Imaging (MRI) reconstruction. However, the requirement of excessive high-quality ground-truth data hinders their applications due to the generalization problem. Recently, Implicit Neural Representation (INR) has appeared as a powerful DL-based tool fo… ▽ More

    Submitted 13 January, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

    Comments: 9 pages, 5 figures; corrected the code availability description for arXiv

  24. arXiv:2211.14559  [pdf, other

    eess.IV cs.CV

    Boosting COVID-19 Severity Detection with Infection-aware Contrastive Mixup Classification

    Authors: Junlin Hou, Jilan Xu, Nan Zhang, Yuejie Zhang, Xiaobo Zhang, Rui Feng

    Abstract: This paper presents our solution for the 2nd COVID-19 Severity Detection Competition. This task aims to distinguish the Mild, Moderate, Severe, and Critical grades in COVID-19 chest CT images. In our approach, we devise a novel infection-aware 3D Contrastive Mixup Classification network for severity grading. Specifcally, we train two segmentation networks to first extract the lung region and then… ▽ More

    Submitted 1 December, 2022; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: ECCV AIMIA Workshop 2022

  25. arXiv:2211.14557  [pdf, other

    eess.IV cs.CV

    CMC v2: Towards More Accurate COVID-19 Detection with Discriminative Video Priors

    Authors: Junlin Hou, Jilan Xu, Nan Zhang, Yi Wang, Yuejie Zhang, Xiaobo Zhang, Rui Feng

    Abstract: This paper presents our solution for the 2nd COVID-19 Competition, occurring in the framework of the AIMIA Workshop at the European Conference on Computer Vision (ECCV 2022). In our approach, we employ the winning solution last year which uses a strong 3D Contrastive Mixup Classifcation network (CMC v1) as the baseline method, composed of contrastive representation learning and mixup classificatio… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: ECCV AIMIA Workshop 2022

  26. arXiv:2211.09365  [pdf, other

    cs.SD eess.AS

    Low-Resource Mongolian Speech Synthesis Based on Automatic Prosody Annotation

    Authors: Xin Yuan, Robin Feng, Mingming Ye

    Abstract: While deep learning-based text-to-speech (TTS) models such as VITS have shown excellent results, they typically require a sizable set of high-quality <text, audio> pairs to train, which is expensive to collect. So far, most languages in the world still lack the training data needed to develop TTS systems. This paper proposes two improvement methods for the two problems faced by low-resource Mongol… ▽ More

    Submitted 4 January, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by NCMMSC 2022

  27. arXiv:2210.10439  [pdf, other

    eess.IV cs.CV cs.LG

    A scan-specific unsupervised method for parallel MRI reconstruction via implicit neural representation

    Authors: Ruimin Feng, Qing Wu, Yuyao Zhang, Hongjiang Wei

    Abstract: Parallel imaging is a widely-used technique to accelerate magnetic resonance imaging (MRI). However, current methods still perform poorly in reconstructing artifact-free MRI images from highly undersampled k-space data. Recently, implicit neural representation (INR) has emerged as a new deep learning paradigm for learning the internal continuity of an object. In this study, we adopted INR to paral… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: conference

  28. arXiv:2210.06570  [pdf, other

    cs.CV eess.IV

    Flare7K: A Phenomenological Nighttime Flare Removal Dataset

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

    Abstract: Artificial lights commonly leave strong lens flare artifacts on images captured at night. Nighttime flare not only affects the visual quality but also degrades the performance of vision algorithms. Existing flare removal methods mainly focus on removing daytime flares and fail in nighttime. Nighttime flare removal is challenging because of the unique luminance and spectrum of artificial lights and… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Camera-ready version for NeurIPS 2022 Track Datasets and Benchmarks

  29. arXiv:2210.00515  [pdf, other

    eess.IV cs.CV

    Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic Retinopathy Analysis on OCTA Images

    Authors: Junlin Hou, Fan Xiao, Jilan Xu, Yuejie Zhang, Haidong Zou, Rui Feng

    Abstract: The ultra-wide optical coherence tomography angiography (OCTA) has become an important imaging modality in diabetic retinopathy (DR) diagnosis. However, there are few researches focusing on automatic DR analysis using ultra-wide OCTA. In this paper, we present novel and practical deep-learning solutions based on ultra-wide OCTA for the Diabetic Retinopathy Analysis Challenge (DRAC). In the segment… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

  30. arXiv:2209.08471  [pdf, other

    cs.CV eess.IV

    MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, **wei Gu

    Abstract: Develo** and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Re-mosaic Challenge Report. MIPI workshop website: http://mipi-challenge.org/. arXiv admin note: substantial text overlap with arXiv:2209.07060, arXiv:2209.07530, arXiv:2209.07057

  31. arXiv:2209.07530  [pdf, other

    eess.IV cs.CV

    MIPI 2022 Challenge on RGBW Sensor Fusion: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, **wei Gu

    Abstract: Develo** and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging… ▽ More

    Submitted 27 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Fusion Challenge Report. MIPI workshop website: http://mipi-challenge.org/. arXiv admin note: substantial text overlap with arXiv:2209.07060

  32. arXiv:2209.07060  [pdf, other

    eess.IV cs.CV

    MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, **wei Gu

    Abstract: Develo** and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Quad-Bayer Re-mosaic Challenge Report. MIPI workshop website: http://mipi-challenge.org/

  33. arXiv:2209.07052  [pdf, other

    eess.IV cs.CV

    MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results

    Authors: Ruicheng Feng, Chongyi Li, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Jun Jiang, Qingyu Yang, Chen Change Loy, **wei Gu

    Abstract: Develo** and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging… ▽ More

    Submitted 23 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Under-display Camera Image Restoration Challenge Report. MIPI workshop website: http://mipi-challenge.org/

  34. arXiv:2209.05483  [pdf, other

    eess.IV cs.CV cs.LG

    Self-Supervised Coordinate Projection Network for Sparse-View Computed Tomography

    Authors: Qing Wu, Ruimin Feng, Hongjiang Wei, **gyi Yu, Yuyao Zhang

    Abstract: In the present work, we propose a Self-supervised COordinate Projection nEtwork (SCOPE) to reconstruct the artifacts-free CT image from a single SV sinogram by solving the inverse tomography imaging problem. Compared with recent related works that solve similar problems using implicit neural representation network (INR), our essential contribution is an effective and simple re-projection strategy… ▽ More

    Submitted 11 August, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 12 pages

    Journal ref: IEEE Transactions on Computational Imaging 9 (2023) 517-529

  35. arXiv:2208.09885  [pdf, other

    cs.CV eess.IV

    HST: Hierarchical Swin Transformer for Compressed Image Super-resolution

    Authors: Bingchen Li, Xin Li, Yiting Lu, Sen Liu, Ruoyu Feng, Zhibo Chen

    Abstract: Compressed Image Super-resolution has achieved great attention in recent years, where images are degraded with compression artifacts and low-resolution artifacts. Since the complex hybrid distortions, it is hard to restore the distorted image with the simple cooperation of super-resolution and compression artifacts removing. In this paper, we take a step forward to propose the Hierarchical Swin Tr… ▽ More

    Submitted 1 December, 2022; v1 submitted 21 August, 2022; originally announced August 2022.

    Comments: Accepted by ECCV2022 Workshop (AIM2022)

  36. arXiv:2207.03190  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Learning Music-Dance Representations through Explicit-Implicit Rhythm Synchronization

    Authors: Jiashuo Yu, Junfu Pu, Ying Cheng, Rui Feng, Ying Shan

    Abstract: Although audio-visual representation has been proved to be applicable in many downstream tasks, the representation of dancing videos, which is more specific and always accompanied by music with complex auditory contents, remains challenging and uninvestigated. Considering the intrinsic alignment between the cadent movement of dancer and music rhythm, we introduce MuDaR, a novel Music-Dance Represe… ▽ More

    Submitted 10 August, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted for publication in IEEE Transactions on Multimedia

  37. arXiv:2207.01758  [pdf, other

    eess.IV cs.CV

    FDVTS's Solution for 2nd COV19D Competition on COVID-19 Detection and Severity Analysis

    Authors: Junlin Hou, Jilan Xu, Rui Feng, Yuejie Zhang

    Abstract: This paper presents our solution for the 2nd COVID-19 Competition, occurring in the framework of the AIMIA Workshop in the European Conference on Computer Vision (ECCV 2022). In our approach, we employ an effective 3D Contrastive Mixup Classification network for COVID-19 diagnosis on chest CT images, which is composed of contrastive representation learning and mixup classification. For the COVID-1… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  38. arXiv:2206.02308  [pdf, other

    eess.SP

    Reconfigurable intelligent surfaces: Channel characterization and modeling

    Authors: Jie Huang, Cheng-Xiang Wang, Yingzhuo Sun, Rui Feng, Jialing Huang, Bolun Guo, Zhimeng Zhong, Tie Jun Cui

    Abstract: Reconfigurable intelligent surfaces (RISs) are two dimensional (2D) metasurfaces which can intelligently manipulate electromagnetic waves by low-cost near passive reflecting elements. RIS is viewed as a potential key technology for the sixth generation (6G) wireless communication systems mainly due to its advantages in tuning wireless signals, thus smartly controlling propagation environments. In… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  39. Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression

    Authors: Zongyu Guo, Runsen Feng, Zhizheng Zhang, Xin **, Zhibo Chen

    Abstract: Neural video codecs have demonstrated great potential in video transmission and storage applications. Existing neural hybrid video coding approaches rely on optical flow or Gaussian-scale flow for prediction, which cannot support fine-grained adaptation to diverse motion content. Towards more content-adaptive prediction, we propose a novel cross-scale prediction module that achieves more effective… ▽ More

    Submitted 15 March, 2023; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: Preprint. Revised after peer-reviewimg

  40. arXiv:2111.03386  [pdf, other

    eess.IV cs.CV

    Versatile Learned Video Compression

    Authors: Runsen Feng, Zongyu Guo, Zhizheng Zhang, Zhibo Chen

    Abstract: Learned video compression methods have demonstrated great promise in catching up with traditional video codecs in their rate-distortion (R-D) performance. However, existing learned video compression schemes are limited by the binding of the prediction mode and the fixed network framework. They are unable to support various inter prediction modes and thus inapplicable for various scenarios. In this… ▽ More

    Submitted 5 January, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

  41. arXiv:2108.05930  [pdf, other

    cs.CV eess.IV

    A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis

    Authors: Mohammad Reza Hosseinzadeh Taher, Fatemeh Haghighi, Ruibin Feng, Michael B. Gotway, Jianming Liang

    Abstract: Transfer learning from supervised ImageNet models has been frequently used in medical image analysis. Yet, no large-scale evaluation has been conducted to benchmark the efficacy of newly-developed pre-training techniques for medical image analysis, leaving several important questions unanswered. As the first step in this direction, we conduct a systematic study on the transferability of models pre… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2021); Domain Adaptation and Representation Transfer (DART)

  42. arXiv:2106.15097  [pdf, other

    eess.IV cs.CV

    IREM: High-Resolution Magnetic Resonance (MR) Image Reconstruction via Implicit Neural Representation

    Authors: Qing Wu, Yuwei Li, Lan Xu, Ruiming Feng, Hongjiang Wei, Qing Yang, Boliang Yu, Xiaozhao Liu, **gyi Yu, Yuyao Zhang

    Abstract: For collecting high-quality high-resolution (HR) MR image, we propose a novel image reconstruction network named IREM, which is trained on multiple low-resolution (LR) MR images and achieve an arbitrary up-sampling rate for HR image reconstruction. In this work, we suppose the desired HR image as an implicit continuous function of the 3D image spatial coordinate and the thick-slice LR images as se… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: 8 pages, 6 figures, conference

  43. arXiv:2104.09556  [pdf, other

    cs.CV eess.IV

    Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network

    Authors: Ruicheng Feng, Chongyi Li, Huai** Chen, Shuai Li, Chen Change Loy, **wei Gu

    Abstract: Recent development of Under-Display Camera (UDC) systems provides a true bezel-less and notch-free viewing experience on smartphones (and TV, laptops, tablets), while allowing images to be captured from the selfie camera embedded underneath. In a typical UDC system, the microstructure of the semi-transparent organic light-emitting diode (OLED) pixel array attenuates and diffracts the incident ligh… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: CVPR 2021 camera-ready version

  44. arXiv:2104.05168  [pdf, other

    eess.IV

    Soft then Hard: Rethinking the Quantization in Neural Image Compression

    Authors: Zongyu Guo, Zhizheng Zhang, Runsen Feng, Zhibo Chen

    Abstract: Quantization is one of the core components in lossy image compression. For neural image compression, end-to-end optimization requires differentiable approximations of quantization, which can generally be grouped into three categories: additive uniform noise, straight-through estimator and soft-to-hard annealing. Training with additive uniform noise approximates the quantization error variationally… ▽ More

    Submitted 25 March, 2024; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: Updated with a description on the high-rate assumption

  45. Flow-Mixup: Classifying Multi-labeled Medical Images with Corrupted Labels

    Authors: **tai Chen, Hongyun Yu, Ruiwei Feng, Danny Z. Chen, Jian Wu

    Abstract: In clinical practice, medical image interpretation often involves multi-labeled classification, since the affected parts of a patient tend to present multiple symptoms or comorbidities. Recently, deep learning based frameworks have attained expert-level performance on medical image interpretation, which can be attributed partially to large amounts of accurate annotations. However, manually annotat… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: 2020 IEEE International Conference on Bioinformatics and Biomedicine

  46. arXiv:2101.00706  [pdf, other

    eess.SP

    Smart Black Box 2.0: Efficient High-bandwidth Driving Data Collection based on Video Anomalies

    Authors: Ryan Feng, Yu Yao, Ella Atkins

    Abstract: Autonomous vehicles require fleet-wide data collection for continuous algorithm development and validation. The Smart Black Box (SBB) intelligent event data recorder has been proposed as a system for prioritized high-bandwidth data capture. This paper extends the SBB by applying anomaly detection and action detection methods for generalized event-of-interest (EOI) detection. An updated SBB pipelin… ▽ More

    Submitted 8 February, 2021; v1 submitted 3 January, 2021; originally announced January 2021.

    Comments: Submitted to Algorithms

  47. Causal Contextual Prediction for Learned Image Compression

    Authors: Zongyu Guo, Zhizheng Zhang, Runsen Feng, Zhibo Chen

    Abstract: Over the past several years, we have witnessed impressive progress in the field of learned image compression. Recent learned image codecs are commonly based on autoencoders, that first encode an image into low-dimensional latent representations and then decode them for reconstruction purposes. To capture spatial dependencies in the latent space, prior works exploit hyperprior and spatial context m… ▽ More

    Submitted 31 October, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

    Comments: We add some descriptions for the improved quantization in the latest arxiv version

  48. arXiv:2008.00239  [pdf, other

    eess.IV cs.CV

    Exploring Multi-Scale Feature Propagation and Communication for Image Super Resolution

    Authors: Ruicheng Feng, Weipeng Guan, Yu Qiao, Chao Dong

    Abstract: Multi-scale techniques have achieved great success in a wide range of computer vision tasks. However, while this technique is incorporated in existing works, there still lacks a comprehensive investigation on variants of multi-scale convolution in image super resolution. In this work, we present a unified formulation over widely-used multi-scale structures. With this framework, we systematically e… ▽ More

    Submitted 14 August, 2020; v1 submitted 1 August, 2020; originally announced August 2020.

  49. arXiv:2004.08283  [pdf, other

    eess.IV

    Learned Video Compression with Feature-level Residuals

    Authors: Runsen Feng, Yaojun Wu, Zongyu Guo, Zhizheng Zhang, Xin **, Zhibo Chen

    Abstract: In this paper, we present an end-to-end video compression network for P-frame challenge on CLIC. We focus on deep neural network (DNN) based video compression, and improve the current frameworks from three aspects. First, we notice that pixel space residuals is sensitive to the prediction errors of optical flow based motion compensation. To suppress the relative influence, we propose to compress t… ▽ More

    Submitted 21 April, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: accepted to CLIC 2020, 4 pages

  50. arXiv:2004.08273  [pdf, other

    eess.IV

    3-D Context Entropy Model for Improved Practical Image Compression

    Authors: Zongyu Guo, Yaojun Wu, Runsen Feng, Zhizheng Zhang, Zhibo Chen

    Abstract: In this paper, we present our image compression framework designed for CLIC 2020 competition. Our method is based on Variational AutoEncoder (VAE) architecture which is strengthened with residual structures. In short, we make three noteworthy improvements here. First, we propose a 3-D context entropy model which can take advantage of known latent representation in current spatial locations for bet… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

    Comments: 4 pages, accepted to CLIC 2020