Showing 1–50 of 104 results for author: Suen, W

Searching in archive eess.

  1. arXiv:2406.12186  [pdf, ps, other]

    eess.IV cs.CV

    Unlocking the Potential of Early Epochs: Uncertainty-aware CT Metal Artifact Reduction

    Authors: Xinquan Yang, Guanqun Zhou, Wei Sun, Youjian Zhang, Zhongya Wang, Jiahui He, Zhicheng Zhang

    Abstract: In computed tomography (CT), the presence of metallic implants in patients often leads to disruptive artifacts in the reconstructed images, hindering accurate diagnosis. Recently, a large number of supervised deep learning-based approaches have been proposed for metal artifact reduction (MAR). However, these methods neglect the influence of initial training weights. In this paper, we have discover…

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.11810  [pdf, ps, other]

    cs.LG cs.RO eess.SY

    Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

    Authors: Runzhe Wu, Ayush Sekhari, Akshay Krishnamurthy, Wen Sun

    Abstract: We study computationally and statistically efficient Reinforcement Learning algorithms for the linear Bellman Complete setting, a setting that uses linear function approximation to capture value functions and unifies existing models like linear Markov Decision Processes (MDP) and Linear Quadratic Regulators (LQR). While it is known from the prior works that this setting is statistically tractable,…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.09989  [pdf, other]

    q-bio.NC eess.SY

    Suppressing seizure via optimal electrical stimulation to the hub of epileptic brain network

    Authors: Zhichao Liang, Guanyi Zhao, Yinuo Zhang, Weiting Sun, Jingzhe Lin, Jialin Wang, Quanying Liu

    Abstract: Electrical stimulation of the seizure onset zone (SOZ) serves as an efficient approach to seizure suppression. Recently, seizure dynamics have gained widespread attention for their network propagation mechanisms. Compared with direct stimulation of the SOZ, other brain network-level approaches that can effectively suppress epileptic seizures remain under-explored. In this study, we introduce a p…

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.09873  [pdf, other]

    eess.AS cs.AI cs.SD

    Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition

    Authors: Yicong Jiang, Tianzi Wang, Xurong Xie, Juan Liu, Wei Sun, Nan Yan, Hui Chen, Lan Wang, Xunying Liu, Feng Tian

    Abstract: Disordered speech recognition has profound implications for improving the quality of life for individuals afflicted with, for example, dysarthria. Dysarthric speech recognition encounters challenges including limited data, substantial dissimilarities between dysarthric and non-dysarthric speakers, and significant speaker variations stemming from the disorder. This paper introduces Perceiver-Prompt, a…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  5. arXiv:2406.03961  [pdf, ps, other]

    eess.IV cs.CV

    LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression

    Authors: Junhui Li, Jutao Li, Xingsong Hou, Huake Wang, Yutao Zhang, Yujie Dun, Wenke Sun

    Abstract: Deep learning-based image compression algorithms typically focus on designing encoding and decoding networks and improving the accuracy of entropy model estimation to enhance the rate-distortion (RD) performance. However, few algorithms leverage the compression distortion prior from existing compression algorithms to improve RD performance. In this paper, we propose a latent diffusion model-based…

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2405.08745  [pdf, other]

    eess.IV cs.CV cs.MM

    Enhancing Blind Video Quality Assessment with Rich Quality-aware Features

    Authors: Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai

    Abstract: In this paper, we present a simple but effective method to enhance blind video quality assessment (BVQA) models for social media videos. Motivated by previous research that leverages pre-trained features extracted from various computer vision models as the feature representation for BVQA, we further explore rich quality-aware features from pre-trained blind image quality assessment (BIQA) and BVQ…

    Submitted 14 May, 2024; originally announced May 2024.

  7. arXiv:2405.00725  [pdf, other]

    eess.SP cs.CR cs.LG

    Federated Learning and Differential Privacy Techniques on Multi-hospital Population-scale Electrocardiogram Data

    Authors: Vikhyat Agrawal, Sunil Vasu Kalmady, Venkataseetharam Manoj Malipeddi, Manisimha Varma Manthena, Weijie Sun, Saiful Islam, Abram Hindle, Padma Kaul, Russell Greiner

    Abstract: This research paper explores ways to apply Federated Learning (FL) and Differential Privacy (DP) techniques to population-scale Electrocardiogram (ECG) data. The study learns a multi-label ECG classification model using FL and DP based on 1,565,849 ECG tracings from 7 hospitals in Alberta, Canada. The FL approach allowed collaborative model training without sharing raw data between hospitals while…

    Submitted 15 May, 2024; v1 submitted 26 April, 2024; originally announced May 2024.

    Comments: Accepted for ICMHI 2024

  8. arXiv:2404.11313  [pdf, other]

    eess.IV cs.AI

    NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

    Authors: Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, et al. (43 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from a popular short-form video platform, i.e., the Kuaishou/Kwai Platform. The KVQ database is divided into three parts, including 2926 videos for training, 420 videos for validation, and 854 videos for testing. The…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024 Workshop. The challenge report for CVPR NTIRE2024 Short-form UGC Video Quality Assessment Challenge

  9. arXiv:2404.09003  [pdf, other]

    cs.CV eess.IV

    THQA: A Perceptual Quality Assessment Database for Talking Heads

    Authors: Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Zhihua Wang, Xiao-Ping Zhang, Guangtao Zhai

    Abstract: In the realm of media technology, digital humans have gained prominence due to rapid advancements in computer technology. However, the manual modeling and control required for the majority of digital humans pose significant obstacles to efficient development. The speech-driven methods offer a novel avenue for manipulating the mouth shape and expressions of digital humans. Despite the proliferation…

    Submitted 13 April, 2024; originally announced April 2024.

  10. arXiv:2403.10573  [pdf, other]

    eess.IV cs.CR cs.CV cs.LG

    Medical Unlearnable Examples: Securing Medical Data from Unauthorized Training via Sparsity-Aware Local Masking

    Authors: Weixiang Sun, Yixin Liu, Zhiling Yan, Kaidi Xu, Lichao Sun

    Abstract: The rapid expansion of AI in healthcare has led to a surge in medical data generation and storage, boosting medical AI development. However, fears of unauthorized use, like training commercial AI models, hinder researchers from sharing their valuable datasets. To encourage data sharing, one promising solution is to introduce imperceptible noise into the data. This method aims to safeguard the data…

    Submitted 7 July, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by ICML 2024 NextGenAISafety

  11. arXiv:2403.07923  [pdf]

    cs.NI cs.AI cs.LG eess.IV eess.SY

    The Fusion of Deep Reinforcement Learning and Edge Computing for Real-time Monitoring and Control Optimization in IoT Environments

    Authors: Jingyu Xu, Weixiang Wan, Linying Pan, Wenjian Sun, Yuxiang Liu

    Abstract: In response to the demand for real-time performance and control quality in industrial Internet of Things (IoT) environments, this paper proposes an optimization control system based on deep reinforcement learning and edge computing. The system leverages cloud-edge collaboration, deploys lightweight policy networks at the edge, predicts system states, and outputs controls at a high frequency, enabl…

    Submitted 28 February, 2024; originally announced March 2024.

  12. arXiv:2403.06993  [pdf]

    cs.RO cs.AI cs.LG eess.IV eess.SY

    Automatic driving lane change safety prediction model based on LSTM

    Authors: Wenjian Sun, Linying Pan, Jingyu Xu, Weixiang Wan, Yong Wang

    Abstract: Autonomous driving technology can improve traffic safety and reduce traffic accidents. In addition, it improves traffic flow, reduces congestion, saves energy and increases travel efficiency. In the relatively mature automatic driving technology, the automatic driving function is divided into several modules: perception, decision-making, planning and control, and a reasonable division of labor can…

    Submitted 28 February, 2024; originally announced March 2024.

  13. arXiv:2402.17268  [pdf, other]

    eess.SY

    Reinforcement Learning Based Robust Volt/Var Control in Active Distribution Networks With Imprecisely Known Delay

    Authors: Hong Cheng, Huan Luo, Zhi Liu, Wei Sun, Weitao Li, Qiyue Li

    Abstract: Active distribution networks (ADNs) incorporating massive photovoltaic (PV) devices encounter challenges of rapid voltage fluctuations and potential violations. Due to the fluctuation and intermittency of PV generation, the state gap, arising from time-inconsistent states and exacerbated by imprecisely known system delays, significantly impacts the accuracy of voltage control. This paper addresses…

    Submitted 27 February, 2024; originally announced February 2024.

  14. arXiv:2402.09442  [pdf]

    eess.SP cs.AI

    Progress in artificial intelligence applications based on the combination of self-driven sensors and deep learning

    Authors: Weixiang Wan, Wenjian Sun, Qiang Zeng, Linying Pan, Jingyu Xu, Bo Liu

    Abstract: In the era of Internet of Things, how to develop a smart sensor system with sustainable power supply, easy deployment and flexible use has become a difficult problem to be solved. The traditional power supply has problems such as frequent replacement or charging when in use, which limits the development of wearable devices. The contact-to-separate friction nanogenerator (TENG) was prepared by usin…

    Submitted 12 March, 2024; v1 submitted 30 January, 2024; originally announced February 2024.

    Comments: This article was accepted by an IEEE conference

  15. arXiv:2402.03413  [pdf, other]

    cs.MM cs.CV eess.IV

    Perceptual Video Quality Assessment: A Survey

    Authors: Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai

    Abstract: Perceptual video quality assessment plays a vital role in the field of video processing due to the existence of quality degradations introduced in various stages of video signal acquisition, compression, transmission and display. With the advancement of internet communication and cloud service technology, video content and traffic are growing exponentially, which further emphasizes the requirement…

    Submitted 5 February, 2024; originally announced February 2024.

  16. arXiv:2311.18216  [pdf, other]

    cs.CV cs.MM eess.IV

    FS-BAND: A Frequency-Sensitive Banding Detector

    Authors: Zijian Chen, Wei Sun, Zicheng Zhang, Ru Huang, Fangfang Lu, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

    Abstract: Banding artifact, also known as staircase-like contour, is a common quality annoyance that occurs in scenarios such as compression and transmission, and it largely affects the user's quality of experience (QoE). The banding distortion typically appears as relatively small pixel-wise variations in smooth backgrounds, which is difficult to analyze in the spatial domain but easily reflected in the frequency…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.17752

  17. arXiv:2310.17147  [pdf, other]

    cs.CV eess.IV

    Simple Baselines for Projection-based Full-reference and No-reference Point Cloud Quality Assessment

    Authors: Zicheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Guangtao Zhai

    Abstract: Point clouds are widely used in 3D content representation and have various applications in multimedia. However, compression and simplification processes inevitably result in the loss of quality-aware information under storage and bandwidth constraints. Therefore, there is an increasing need for effective methods to quantify the degree of distortion in point clouds. In this paper, we propose simple…

    Submitted 26 October, 2023; originally announced October 2023.

  18. arXiv:2310.16732  [pdf, other]

    cs.CV eess.IV

    A No-Reference Quality Assessment Method for Digital Human Head

    Authors: Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiongkuo Min, Xianghe Ma, Guangtao Zhai

    Abstract: In recent years, digital humans have been widely applied in augmented/virtual reality (A/VR), where viewers are allowed to freely observe and interact with the volumetric content. However, the digital humans may be degraded with various distortions during the procedure of generation and transmission. Moreover, little effort has been put into the perceptual quality assessment of digital humans. The…

    Submitted 25 October, 2023; originally announced October 2023.

  19. arXiv:2310.15984  [pdf, other]

    cs.CV eess.IV

    Geometry-Aware Video Quality Assessment for Dynamic Digital Human

    Authors: Zicheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Guangtao Zhai

    Abstract: Dynamic Digital Humans (DDHs) are 3D digital models that are animated using predefined motions and are inevitably bothered by noise/shift during the generation process and compression distortion during the transmission process, which needs to be perceptually evaluated. Usually, DDHs are displayed as 2D rendered animation videos and it is natural to adapt video quality assessment (VQA) methods to D…

    Submitted 24 October, 2023; originally announced October 2023.

  20. arXiv:2310.12765  [pdf, other]

    cs.SD cs.LG eess.AS

    Energy-Based Models For Speech Synthesis

    Authors: Wanli Sun, Zehai Tu, Anton Ragni

    Abstract: Recently there has been a lot of interest in non-autoregressive (non-AR) models for speech synthesis, such as FastSpeech 2 and diffusion models. Unlike AR models, these models do not have autoregressive dependencies among outputs which makes inference efficient. This paper expands the range of available non-AR models with another member called energy-based models (EBMs). The paper describes how no…

    Submitted 19 October, 2023; originally announced October 2023.

  21. arXiv:2310.08201  [pdf, other]

    eess.SP

    Fast Ray-Tracing-Based Precise Underwater Acoustic Localization without Prior Acknowledgment of Target Depth

    Authors: Wei Huang, Hao Zhang, Kaitao Meng, Fan Gao, Wenzhou Sun, Jianxu Shu, Tianhe Xu, Deshi Li

    Abstract: Underwater localization is of great importance for marine observation and building positioning, navigation, timing (PNT) systems that could be widely applied in disaster warning, underwater rescues and resources exploration. The uneven distribution of underwater sound velocity poses a great challenge for precise underwater positioning. The current soundline correction positioning method mainly aims…

    Submitted 12 October, 2023; originally announced October 2023.

  22. arXiv:2310.04964  [pdf, other]

    cs.CV eess.IV

    Learning Many-to-Many Mapping for Unpaired Real-World Image Super-resolution and Downscaling

    Authors: Wanjie Sun, Zhenzhong Chen

    Abstract: Learning based single image super-resolution (SISR) for real-world images has been an active research topic yet a challenging task, due to the lack of paired low-resolution (LR) and high-resolution (HR) training images. Most of the existing unsupervised real-world SISR methods adopt a two-stage training strategy by synthesizing realistic LR images from their HR counterparts first, then training th…

    Submitted 7 October, 2023; originally announced October 2023.

  23. arXiv:2310.00413  [pdf, other]

    cs.CV cs.LG eess.IV

    SSIF: Learning Continuous Image Representation for Spatial-Spectral Super-Resolution

    Authors: Gengchen Mai, Ni Lao, Weiwei Sun, Yuchi Ma, Jiaming Song, Chenlin Meng, Hongxu Ma, Jinmeng Rao, Ziyuan Li, Stefano Ermon

    Abstract: Existing digital sensors capture images at fixed spatial and spectral resolutions (e.g., RGB, multispectral, and hyperspectral images), and each combination requires bespoke machine learning models. Neural Implicit Functions partially overcome the spatial resolution challenge by representing an image in a resolution-independent way. However, they still operate at fixed, pre-defined spectral resolu…

    Submitted 30 September, 2023; originally announced October 2023.

    MSC Class: 68T07; 68T45 ACM Class: I.4.10; I.2.10; I.4.6

  24. StableVQA: A Deep No-Reference Quality Assessment Model for Video Stability

    Authors: Tengchuan Kou, Xiaohong Liu, Wei Sun, Jun Jia, Xiongkuo Min, Guangtao Zhai, Ning Liu

    Abstract: Video shakiness is an unpleasant distortion of User Generated Content (UGC) videos, which is usually caused by the unstable hold of cameras. In recent years, many video stabilization algorithms have been proposed, yet no specific and accurate metric enables comprehensively evaluating the stability of videos. Indeed, most existing quality assessment models evaluate video quality as a whole without…

    Submitted 27 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM'23

  25. arXiv:2307.13981  [pdf, other]

    cs.CV cs.MM eess.IV

    Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models

    Authors: Wei Sun, Wen Wen, Xiongkuo Min, Long Lan, Guangtao Zhai, Kede Ma

    Abstract: Blind video quality assessment (BVQA) plays an indispensable role in monitoring and improving the end-users' viewing experience in various real-world video-enabled media applications. As an experimental field, the improvements of BVQA models have been measured primarily on a few human-rated VQA datasets. Thus, it is crucial to gain a better understanding of existing VQA datasets in order to proper…

    Submitted 3 April, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

  26. arXiv:2307.09729  [pdf, other]

    cs.CV cs.MM eess.IV

    NTIRE 2023 Quality Assessment of Video Enhancement Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu, et al. (47 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual…

    Submitted 18 July, 2023; originally announced July 2023.

  27. arXiv:2307.09133  [pdf, ps, other]

    eess.SP

    Radar-Based Estimation of Human Body Orientation Using Respiratory Features and Hierarchical Regression Model

    Authors: Wenxu Sun, Shunsuke Iwata, Yuji Tanaka, Takuya Sakamoto

    Abstract: This study proposes an accurate method to estimate human body orientation using a millimeter-wave radar system. Body displacement is measured from the phase of the radar echo, which is analyzed to obtain features associated with the fundamental and higher-order harmonic components of the quasi-periodic respiratory motion. These features are used in body-orientation estimation invoking a novel hier…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 5 pages, 4 figures. This work is going to be submitted to the IEEE for possible publication

  28. arXiv:2307.02808  [pdf, other]

    eess.IV cs.CV cs.DB

    Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation

    Authors: Zicheng Zhang, Wei Sun, Yingjie Zhou, Haoning Wu, Chunyi Li, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

    Abstract: Digital humans have witnessed extensive applications in various domains, necessitating related quality assessment studies. However, there is a lack of comprehensive digital human quality assessment (DHQA) databases. To address this gap, we propose SJTU-H3D, a subjective quality assessment database specifically designed for full-body digital humans. It comprises 40 high-quality reference digital hu…

    Submitted 6 July, 2023; originally announced July 2023.

  29. arXiv:2306.05658  [pdf, other]

    cs.CV eess.IV

    GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment

    Authors: Zicheng Zhang, Wei Sun, Houning Wu, Yingjie Zhou, Chunyi Li, Xiongkuo Min, Guangtao Zhai, Weisi Lin

    Abstract: Nowadays, most 3D model quality assessment (3DQA) methods have been aimed at improving performance. However, little attention has been paid to the computational cost and inference time required for practical applications. Model-based 3DQA methods extract features directly from the 3D models, which are characterized by their high degree of complexity. As a result, many researchers are inclined towa…

    Submitted 31 January, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  30. arXiv:2306.04717  [pdf, other]

    cs.CV cs.AI eess.IV

    AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment

    Authors: Chunyi Li, Zicheng Zhang, Haoning Wu, Wei Sun, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

    Abstract: With the rapid advancements of the text-to-image generative model, AI-generated images (AGIs) have been widely applied to entertainment, education, social media, etc. However, considering the large quality variance among different AGIs, there is an urgent need for quality models that are consistent with human subjective ratings. To address this issue, we extensively consider various popular AGI mo…

    Submitted 12 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 12 pages, 11 figures

  31. arXiv:2305.13770  [pdf, other]

    cs.CV eess.IV

    MIPI 2023 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2023/

  32. arXiv:2304.10551  [pdf, other]

    eess.IV cs.CV

    MIPI 2023 Challenge on RGBW Remosaic: Methods and Results

    Authors: Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for an in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imag…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Remosaic Challenge Report. Website: https://mipi-challenge.org/MIPI2023/. arXiv admin note: substantial text overlap with arXiv:2209.08471, arXiv:2209.07060, arXiv:2209.07530, arXiv:2304.10089

  33. arXiv:2304.10089  [pdf, other]

    eess.IV cs.CV

    MIPI 2023 Challenge on RGBW Fusion: Methods and Results

    Authors: Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for an in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imag…

    Submitted 24 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Fusion Challenge Report. Website: https://mipi-challenge.org/MIPI2023/. arXiv admin note: substantial text overlap with arXiv:2209.07530, arXiv:2209.08471, arXiv:2209.07060

  34. arXiv:2303.12618  [pdf, other]

    cs.CV eess.IV

    A Perceptual Quality Assessment Exploration for AIGC Images

    Authors: Zicheng Zhang, Chunyi Li, Wei Sun, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: AI Generated Content (AIGC) has gained widespread attention with the increasing efficiency of deep learning in content creation. AIGC, created with the assistance of artificial intelligence technology, includes various forms of content, among which the AI-generated images (AGIs) have brought significant impact to society and have been applied to various…

    Submitted 22 March, 2023; originally announced March 2023.

  35. arXiv:2303.09290  [pdf, other]

    eess.IV

    VDPVE: VQA Dataset for Perceptual Video Enhancement

    Authors: Yixuan Gao, Yuqin Cao, Tengchuan Kou, Wei Sun, Yunlong Dong, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: Recently, many video enhancement methods have been proposed to improve video quality from different aspects such as color, brightness, contrast, and stability. Therefore, how to evaluate the quality of the enhanced video in a way consistent with human visual perception is an important research topic. However, most video quality assessment methods mainly calculate video quality by estimating the di…

    Submitted 16 March, 2023; originally announced March 2023.

  36. arXiv:2303.08050  [pdf, other]

    cs.CV eess.IV

    Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images

    Authors: Zicheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, Jing Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: Computer graphics images (CGIs) are artificially generated by means of computer programs and are widely perceived under various scenarios, such as games, streaming media, etc. In practice, the quality of CGIs consistently suffers from poor rendering during production, inevitable compression artifacts during the transmission of multimedia applications, and low aesthetic quality resulting from poor…

    Submitted 1 November, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  37. arXiv:2303.05203  [pdf, other]

    cs.RO cs.AI eess.SY

    RMMDet: Road-Side Multitype and Multigroup Sensor Detection System for Autonomous Driving

    Authors: Xiuyu Yang, Zhuangyan Zhang, Haikuo Du, Sui Yang, Fengping Sun, Yanbo Liu, Ling Pei, Wenchao Xu, Weiqi Sun, Zhengyu Li

    Abstract: Autonomous driving has now made great strides thanks to artificial intelligence, and numerous advanced methods have been proposed for vehicle end target detection, including single sensor or multi sensor detection methods. However, the complexity and diversity of real traffic situations necessitate an examination of how to use these methods in real road conditions. In this paper, we propose RMMDet…

    Submitted 9 June, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  38. Audio-Visual Quality Assessment for User Generated Content: Database and Method

    Authors: Yuqin Cao, Xiongkuo Min, Wei Sun, ** Zhang, Guangtao Zhai

    Abstract: With the explosive increase of User Generated Content (UGC), UGC video quality assessment (VQA) becomes more and more important for improving users' Quality of Experience (QoE). However, most existing UGC VQA studies only focus on the visual distortions of videos, ignoring that the user's QoE also depends on the accompanying audio signals. In this paper, we conduct the first study to address the p…

    Submitted 27 December, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  39. arXiv:2302.09332  [pdf, other]

    eess.SP

    Incipient Fault Detection in Power Distribution System: A Time-Frequency Embedded Deep Learning Based Approach

    Authors: Qiyue Li, Huan Luo, Hong Cheng, Yuxing Deng, Wei Sun, Weitao Li, Zhi Liu

    Abstract: Incipient fault detection in power distribution systems is crucial to improve the reliability of the grid. However, the non-stationary nature of the incipient fault signal and the inadequacy of the training dataset due to its self-recovery make incipient fault detection in power distribution systems a great challenge. In this paper, we focus on incipient fault detection in power distribution…

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: 15 pages

  40. arXiv:2302.08715  [pdf, other]

    cs.CV eess.IV

    EEP-3DQA: Efficient and Effective Projection-based 3D Model Quality Assessment

    Authors: Zicheng Zhang, Wei Sun, Yingjie Zhou, Wei Lu, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai

    Abstract: Currently, much effort has been put into improving the effectiveness of 3D model quality assessment (3DQA) methods. However, little attention has been paid to the computational costs and inference time, which are also important for practical applications. Unlike 2D media, 3D models are represented by more complicated and irregular digital formats, such as point cloud and mesh. Thus it…

    Submitted 27 August, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  41. arXiv:2211.10431  [pdf, other]

    eess.SP cs.LG

    Improving ECG-based COVID-19 diagnosis and mortality predictions using pre-pandemic medical records at population-scale

    Authors: Weijie Sun, Sunil Vasu Kalmady, Nariman Sepehrvand, Luan Manh Chu, Zihan Wang, Amir Salimi, Abram Hindle, Russell Greiner, Padma Kaul

    Abstract: Pandemic outbreaks such as COVID-19 occur unexpectedly, and need immediate action due to their potential devastating consequences on global health. Point-of-care routine assessments such as electrocardiogram (ECG), can be used to develop prediction models for identifying individuals at risk. However, there is often too little clinically-annotated medical data, especially in early phases of a pande…

    Submitted 11 January, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: Accepted for NeurIPS 2022 TS4H workshop

  42. arXiv:2211.10014

    cs.CR eess.SP eess.SY

    Users are Closer than they Appear: Protecting User Location from WiFi APs

    Authors: Roshan Ayyalasomayajula, Aditya Arun, Wei Sun, Dinesh Bharadia

    Abstract: WiFi-based indoor localization has now matured for over a decade. Most of the current localization algorithms rely on the WiFi access points (APs) in the enterprise network to localize the WiFi user accurately. Thus, the WiFi user's location information could be easily snooped by an attacker listening through a compromised WiFi AP. With indoor localization and navigation being the next step toward…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 6 pages, 6 figures, submitted to HotMobile 2023

  43. arXiv:2211.04894

    cs.CV cs.LG cs.MM eess.IV

    Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives

    Authors: Haoning Wu, Erli Zhang, Liang Liao, Chaofeng Chen, Jingwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: The rapid increase in user-generated-content (UGC) videos calls for the development of effective video quality assessment (VQA) algorithms. However, the objective of the UGC-VQA problem is still ambiguous and can be viewed from two perspectives: the technical perspective, measuring the perception of distortions; and the aesthetic perspective, which relates to preference and recommendation on conte…

    Submitted 7 March, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  44. arXiv:2210.07818

    eess.IV cs.CV

    ISTA-Inspired Network for Image Super-Resolution

    Authors: Yuqing Liu, Wei Zhang, Weifeng Sun, Zhikai Yu, Jianfeng Wei, Shengquan Li

    Abstract: Deep learning for image super-resolution (SR) has been investigated by numerous researchers in recent years. Most of the works concentrate on effective block designs and improve the network representation but lack interpretation. There are also iterative optimization-inspired networks for image SR, which take the solution step as a whole without giving an explicit optimization step. This paper pro…

    Submitted 14 October, 2022; originally announced October 2022.

  45. arXiv:2210.06291

    eess.SP cs.LG

    ECG for high-throughput screening of multiple diseases: Proof-of-concept using multi-diagnosis deep learning from population-based datasets

    Authors: Weijie Sun, Sunil Vasu Kalmady, Amir Salimi, Nariman Sepehrvand, Eric Ly, Abram Hindle, Russell Greiner, Padma Kaul

    Abstract: Electrocardiogram (ECG) abnormalities are linked to cardiovascular diseases, but may also occur in other non-cardiovascular conditions such as mental, neurological, metabolic and infectious conditions. However, most of the recent success of deep learning (DL) based diagnostic predictions in selected patient cohorts have been limited to a small set of cardiac diseases. In this study, we use a popul…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted in Medical Imaging meets NeurIPS 2021 https://www.cse.cuhk.edu.hk/~qdou/public/medneurips2021/88_ECG_for_high-throughput_screening_of_multiple_diseases_final_version.pdf

  46. arXiv:2209.09489

    cs.CV eess.IV

    Perceptual Quality Assessment for Digital Human Heads

    Authors: Zicheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Yuzhe Wu, Guangtao Zhai

    Abstract: Digital humans are attracting more and more research interest during the last decade, the generation, representation, rendering, and animation of which have been put into large amounts of effort. However, the quality assessment of digital humans has fallen behind. Therefore, to tackle the challenge of digital human quality assessment issues, we propose the first large-scale quality assessment data…

    Submitted 28 February, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

  47. arXiv:2209.08471

    cs.CV eess.IV

    MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Re-mosaic Challenge Report. MIPI workshop website: http://mipi-challenge.org/. arXiv admin note: substantial text overlap with arXiv:2209.07060, arXiv:2209.07530, arXiv:2209.07057

  48. arXiv:2209.07530

    eess.IV cs.CV

    MIPI 2022 Challenge on RGBW Sensor Fusion: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 27 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Fusion Challenge Report. MIPI workshop website: http://mipi-challenge.org/. arXiv admin note: substantial text overlap with arXiv:2209.07060

  49. arXiv:2209.07060

    eess.IV cs.CV

    MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Quad-Bayer Re-mosaic Challenge Report. MIPI workshop website: http://mipi-challenge.org/

  50. arXiv:2209.07052

    eess.IV cs.CV

    MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results

    Authors: Ruicheng Feng, Chongyi Li, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 23 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Under-display Camera Image Restoration Challenge Report. MIPI workshop website: http://mipi-challenge.org/