Skip to main content

Showing 1–50 of 69 results for author: Wei, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14264  [pdf, other

    eess.IV cs.CV

    Zero-Shot Image Denoising for High-Resolution Electron Microscopy

    Authors: Xuanyu Tian, Zhuoya Dong, Xiyue Lin, Yue Gao, Hongjiang Wei, Yanhang Ma, **gyi Yu, Yuyao Zhang

    Abstract: High-resolution electron microscopy (HREM) imaging technique is a powerful tool for directly visualizing a broad range of materials in real-space. However, it faces challenges in denoising due to ultra-low signal-to-noise ratio (SNR) and scarce data availability. In this work, we propose Noise2SR, a zero-shot self-supervised learning (ZS-SSL) denoising framework for HREM. Within our framework, we… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12 pages, 12 figures

  2. arXiv:2405.07717  [pdf, other

    eess.IV

    On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks

    Authors: Chenhao Wu, Qingbo Wu, Haoran Wei, Shuai Chen, Lei Wang, King Ngi Ngan, Fanman Meng, Hongliang Li

    Abstract: Despite demonstrating superior rate-distortion (RD) performance, learning-based image compression (LIC) algorithms have been found to be vulnerable to malicious perturbations in recent studies. Adversarial samples in these studies are designed to attack only one dimension of either bitrate or distortion, targeting a submodel with a specific compression ratio. However, adversaries in real-world sce… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2404.18820  [pdf, other

    eess.IV cs.CV

    Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior

    Authors: Zhiyuan Li, Yanhui Zhou, Hao Wei, Chenyang Ge, **gwen Jiang

    Abstract: Image compression at extremely low bitrates (below 0.1 bits per pixel (bpp)) is a significant challenge due to substantial information loss. In this work, we propose a novel two-stage extreme image compression framework that exploits the powerful generative capability of pre-trained diffusion models to achieve realistic image reconstruction at extremely low bitrates. In the first stage, we treat t… ▽ More

    Submitted 13 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE TCSVT

  4. arXiv:2404.17890  [pdf, other

    eess.IV cs.AI cs.CV

    DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction

    Authors: Chenhe Du, Xiyue Lin, Qing Wu, Xuanyu Tian, Ying Su, Zhe Luo, Hongjiang Wei, S. Kevin Zhou, **gyi Yu, Yuyao Zhang

    Abstract: Limited-angle and sparse-view computed tomography (LACT and SVCT) are crucial for expanding the scope of X-ray CT applications. However, they face challenges due to incomplete data acquisition, resulting in diverse artifacts in the reconstructed CT images. Emerging implicit neural representation (INR) techniques, such as NeRF, NeAT, and NeRP, have shown promise in under-determined CT imaging recon… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 15 pages, 10 figures

    ACM Class: I.2.10; I.4.5

  5. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, **shan Pan, Jiangxin Dong, **hui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi **, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  6. arXiv:2404.06620  [pdf, other

    eess.IV

    Encoder-Quantization-Motion-based Video Quality Metrics

    Authors: Yixu Chen, Zaixi Shang, Hai Wei, Yongjun Wu, Sriram Sethuraman

    Abstract: In an adaptive bitrate streaming application, the efficiency of video compression and the encoded video quality depend on both the video codec and the quality metric used to perform encoding optimization. The development of such a quality metric need large scale subjective datasets. In this work we merge several datasets into one to support the creation of a metric tailored for video compression a… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at Picture Coding Symposium 2024

  7. arXiv:2403.17694  [pdf, other

    cs.CV cs.GR eess.IV

    AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

    Authors: Huawei Wei, Zejun Yang, Zhisheng Wang

    Abstract: In this study, we propose AniPortrait, a novel framework for generating high-quality animation driven by audio and a reference portrait image. Our methodology is divided into two stages. Initially, we extract 3D intermediate representations from audio and project them into a sequence of 2D facial landmarks. Subsequently, we employ a robust diffusion model, coupled with a motion module, to convert… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  8. arXiv:2403.07337  [pdf, other

    eess.SP

    Analysis of Intelligent Reflecting Surface-Enhanced Mobility Through a Line-of-Sight State Transition Model

    Authors: Hongtao Zhang, Haoyan Wei

    Abstract: Rapid signal fluctuations due to blockage effects cause excessive handovers (HOs) and degrade mobility performance. By reconfiguring line-of-sight (LoS) Links through passive reflections, intelligent reflective surface (IRS) has the potential to address this issue. Due to the lack of introducing blocking effects, existing HO analyses cannot capture excessive HOs or exploit enhancements via IRSs. T… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 13 pages, 11 figures, submitted to IEEE

  9. arXiv:2403.07323  [pdf, other

    eess.SP cs.NI

    Discrete-Time Modeling and Handover Analysis of Intelligent Reflecting Surface-Assisted Networks

    Authors: Hongtao Zhang, Haoyan Wei

    Abstract: Owning to the reflection gain and double path loss featured by intelligent reflecting surface (IRS) channels, handover (HO) locations become irregular and the signal strength fluctuates sharply with variations in IRS connections during HO, the risk of HO failures (HOFs) is exacerbated and thus HO parameters require reconfiguration. However, existing HO models only assume monotonic negative exponen… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 13 pages, 12 figures, submitted to IEEE

  10. DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal Pitch Estimation

    Authors: Haojie Wei, Xueke Cao, Wenbo Xu, Tangpeng Dan, Yueguo Chen

    Abstract: Singing voice separation and vocal pitch estimation are pivotal tasks in music information retrieval. Existing methods for simultaneous extraction of clean vocals and vocal pitches can be classified into two categories: pipeline methods and naive joint learning methods. However, the efficacy of these methods is limited by the following problems: On the one hand, pipeline methods train models for e… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted by ICASSP 2024

  11. arXiv:2311.12892  [pdf

    eess.IV cs.CV cs.LG physics.med-ph

    IMJENSE: Scan-specific Implicit Representation for Joint Coil Sensitivity and Image Estimation in Parallel MRI

    Authors: Ruimin Feng, Qing Wu, Jie Feng, Huajun She, Chunlei Liu, Yuyao Zhang, Hongjiang Wei

    Abstract: Parallel imaging is a commonly used technique to accelerate magnetic resonance imaging (MRI) data acquisition. Mathematically, parallel MRI reconstruction can be formulated as an inverse problem relating the sparsely sampled k-space measurements to the desired MRI image. Despite the success of many existing reconstruction algorithms, it remains a challenge to reliably reconstruct a high-quality im… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  12. arXiv:2311.10331  [pdf, other

    eess.IV cs.CV

    Leveraging Multimodal Fusion for Enhanced Diagnosis of Multiple Retinal Diseases in Ultra-wide OCTA

    Authors: Hao Wei, Peilun Shi, Guitao Bai, Minqing Zhang, Shuangle Li, Wu Yuan

    Abstract: Ultra-wide optical coherence tomography angiography (UW-OCTA) is an emerging imaging technique that offers significant advantages over traditional OCTA by providing an exceptionally wide scanning range of up to 24 x 20 $mm^{2}$, covering both the anterior and posterior regions of the retina. However, the currently accessible UW-OCTA datasets suffer from limited comprehensive hierarchical informati… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  13. arXiv:2311.08829  [pdf, other

    cs.SD eess.AS

    Autoencoder with Group-based Decoder and Multi-task Optimization for Anomalous Sound Detection

    Authors: Yifan Zhou, Dongxing Xu, Haoran Wei, Yanhua Long

    Abstract: In industry, machine anomalous sound detection (ASD) is in great demand. However, collecting enough abnormal samples is difficult due to the high cost, which boosts the rapid development of unsupervised ASD algorithms. Autoencoder (AE) based methods have been widely used for unsupervised ASD, but suffer from problems including 'shortcut', poor anti-noise ability and sub-optimal quality of features… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Submitted to the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  14. arXiv:2311.05935  [pdf, other

    eess.SY

    Resilient and constrained consensus against adversarial attacks: A distributed MPC framework

    Authors: Henglai Wei, Kunwu Zhang, Hui Zhang, Yang Shi

    Abstract: There has been a growing interest in realizing the resilient consensus of the multi-agent system (MAS) under cyber-attacks, which aims to achieve the consensus of normal agents (i.e., agents without attacks) in a network, depending on the neighboring information. The literature has developed mean-subsequence-reduced (MSR) algorithms for the MAS with F adversarial attacks and has shown that the con… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  15. arXiv:2310.09625  [pdf, other

    eess.IV cs.CV

    JSMoCo: Joint Coil Sensitivity and Motion Correction in Parallel MRI with a Self-Calibrating Score-Based Diffusion Model

    Authors: Lixuan Chen, Xuanyu Tian, Jiangjie Wu, Ruimin Feng, Guoyan Lao, Yuyao Zhang, Hongjiang Wei

    Abstract: Magnetic Resonance Imaging (MRI) stands as a powerful modality in clinical diagnosis. However, it is known that MRI faces challenges such as long acquisition time and vulnerability to motion-induced artifacts. Despite the success of many existing motion correction algorithms, there has been limited research focused on correcting motion artifacts on the estimated coil sensitivity maps for fast MRI… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: 10 pages,8 figures, journal

  16. arXiv:2310.04992  [pdf, other

    eess.IV cs.CV

    VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

    Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv , et al. (17 additional authors not shown)

    Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassifi… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  17. arXiv:2308.06204  [pdf, other

    eess.SY cs.AI cs.LG

    Safety in Traffic Management Systems: A Comprehensive Survey

    Authors: Wenlu Du, Ankan Dash, **g Li, Hua Wei, Guiling Wang

    Abstract: Traffic management systems play a vital role in ensuring safe and efficient transportation on roads. However, the use of advanced technologies in traffic management systems has introduced new safety challenges. Therefore, it is important to ensure the safety of these systems to prevent accidents and minimize their impact on road users. In this survey, we provide a comprehensive review of the liter… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted by MDPI Designs journal, the Special Issue Design and Application of Intelligent Transportation Systems. 30 pages, 6 figures, published on 10 August 2023

    Journal ref: Designs 2023, 7, 100

  18. RMVPE: A Robust Model for Vocal Pitch Estimation in Polyphonic Music

    Authors: Haojie Wei, Xueke Cao, Tangpeng Dan, Yueguo Chen

    Abstract: Vocal pitch is an important high-level feature in music audio processing. However, extracting vocal pitch in polyphonic music is more challenging due to the presence of accompaniment. To eliminate the influence of the accompaniment, most previous methods adopt music source separation models to obtain clean vocals from polyphonic music before predicting vocal pitches. As a result, the performance o… ▽ More

    Submitted 27 June, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted by INTERSPEECH 2023

  19. arXiv:2306.15203  [pdf, other

    eess.IV cs.AI cs.CV

    Unsupervised Polychromatic Neural Representation for CT Metal Artifact Reduction

    Authors: Qing Wu, Lixuan Chen, Ce Wang, Hongjiang Wei, S. Kevin Zhou, **gyi Yu, Yuyao Zhang

    Abstract: Emerging neural reconstruction techniques based on tomography (e.g., NeRF, NeAT, and NeRP) have started showing unique capabilities in medical imaging. In this work, we present a novel Polychromatic neural representation (Polyner) to tackle the challenging problem of CT imaging when metallic implants exist within the human body. CT metal artifacts arise from the drastic variation of metal's attenu… ▽ More

    Submitted 1 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted by NeurIPS 2023

  20. arXiv:2306.11309  [pdf, other

    cs.SD cs.CL eess.AS eess.SP

    Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition

    Authors: Xuefei Wang, Yanhua Long, Yijie Li, Haoran Wei

    Abstract: Low-resource accented speech recognition is one of the important challenges faced by current ASR technology in practical applications. In this study, we propose a Conformer-based architecture, called Aformer, to leverage both the acoustic information from large non-accented and limited accented training data. Specifically, a general encoder and an accent encoder are designed in the Aformer to extr… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  21. arXiv:2306.01304  [pdf, other

    cs.SD cs.IR cs.MM eess.AS

    JEPOO: Highly Accurate Joint Estimation of Pitch, Onset and Offset for Music Information Retrieval

    Authors: Haojie Wei, Jun Yuan, Rui Zhang, Yueguo Chen, Gang Wang

    Abstract: Melody extraction is a core task in music information retrieval, and the estimation of pitch, onset and offset are key sub-tasks in melody extraction. Existing methods have limited accuracy, and work for only one type of data, either single-pitch or multipitch. In this paper, we propose a highly accurate method for joint estimation of pitch, onset and offset, named JEPOO. We address the challenges… ▽ More

    Submitted 7 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted by IJCAI 2023; 11 pages, 6 figures

  22. arXiv:2305.12669  [pdf, other

    cs.IT eess.SP

    Angle-based SLAM on 5G mmWave Systems: Design, Implementation, and Measurement

    Authors: Jie Yang, Chao-Kai Wen, **g Xu, Hang Que, Haikun Wei, Shi **

    Abstract: Simultaneous localization and map** (SLAM) is a key technology that provides user equipment (UE) tracking and environment map** services, enabling the deep integration of sensing and communication. The millimeter-wave (mmWave) communication, with its larger bandwidths and antenna arrays, inherently facilitates more accurate delay and angle measurements than sub-6 GHz communication, thereby pro… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted by the IEEE Internet of Things Journal

  23. arXiv:2305.01360  [pdf, other

    eess.IV cs.CV

    Self-supervised arbitrary scale super-resolution framework for anisotropic MRI

    Authors: Haonan Zhang, Yuhan Zhang, Qing Wu, Jiangjie Wu, Zhiming Zhen, Feng Shi, Jianmin Yuan, Hongjiang Wei, Chen Liu, Yuyao Zhang

    Abstract: In this paper, we propose an efficient self-supervised arbitrary-scale super-resolution (SR) framework to reconstruct isotropic magnetic resonance (MR) images from anisotropic MRI inputs without involving external training data. The proposed framework builds a training dataset using in-the-wild anisotropic MR volumes with arbitrary image resolution. We then formulate the 3D volume SR task as a SR… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 10 pages, 5 figures

  24. arXiv:2304.13162  [pdf, other

    eess.IV cs.CV cs.MM

    HDR or SDR? A Subjective and Objective Study of Scaled and Compressed Videos

    Authors: Joshua P. Ebenezer, Zaixi Shang, Yixu Chen, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

    Abstract: We conducted a large-scale study of human perceptual quality judgments of High Dynamic Range (HDR) and Standard Dynamic Range (SDR) videos subjected to scaling and compression levels and viewed on three different display devices. HDR videos are able to present wider color gamuts, better contrasts, and brighter whites and darker blacks than SDR videos. While conventional expectations are that HDR q… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  25. arXiv:2304.13156  [pdf, other

    eess.IV cs.CV

    HDR-ChipQA: No-Reference Quality Assessment on High Dynamic Range Videos

    Authors: Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

    Abstract: We present a no-reference video quality model and algorithm that delivers standout performance for High Dynamic Range (HDR) videos, which we call HDR-ChipQA. HDR videos represent wider ranges of luminances, details, and colors than Standard Dynamic Range (SDR) videos. The growing adoption of HDR in massively scaled video networks has driven the need for video quality assessment (VQA) algorithms th… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  26. Making Video Quality Assessment Models Robust to Bit Depth

    Authors: Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

    Abstract: We introduce a novel feature set, which we call HDRMAX features, that when included into Video Quality Assessment (VQA) algorithms designed for Standard Dynamic Range (SDR) videos, sensitizes them to distortions of High Dynamic Range (HDR) videos that are inadequately accounted for by these algorithms. While these features are not specific to HDR, and also augment the equality prediction performan… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Published in IEEE Signal Processing Letters 2023

  27. arXiv:2304.07976  [pdf, ps, other

    eess.SP

    Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning

    Authors: Yuchao Chang, Wen Chen, Jun Li, Jianpo Liu, Haoran Wei, Zhendong Wang, Naofal Al-Dhahir

    Abstract: Network energy efficiency is a main pillar in the design and operation of wireless communication systems. In this paper, we investigate a dense radio access network (dense-RAN) capable of radiated power management at the base station (BS). Aiming to improve the long-term network energy efficiency, an optimization problem is formulated by collaboratively managing multi-BSs radiated power levels wit… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: IEEE Transactions on Green Communicaitons and Networking

  28. arXiv:2304.03708  [pdf, other

    eess.IV cs.CV

    Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

    Authors: Gongning Luo, Kuanquan Wang, Jun Liu, Shuo Li, Xinjie Liang, Xiangyu Li, Shaowei Gan, Wei Wang, Suyu Dong, Wenyi Wang, Pengxin Yu, Enyou Liu, Hongrong Wei, Na Wang, Jia Guo, Huiqi Li, Zhao Zhang, Ziwei Zhao, Na Gao, Nan An, Ashkan Pakzad, Bojidar Rangelov, Jiaqi Dou, Song Tian, Zeyu Liu , et al. (5 additional authors not shown)

    Abstract: Efficient automatic segmentation of multi-level (i.e. main and branch) pulmonary arteries (PA) in CTPA images plays a significant role in clinical applications. However, most existing methods concentrate only on main PA or branch PA segmentation separately and ignore segmentation efficiency. Besides, there is no public large-scale dataset focused on PA segmentation, which makes it highly challengi… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  29. arXiv:2304.03697  [pdf, other

    cs.LG cs.AI eess.SY

    HumanLight: Incentivizing Ridesharing via Human-centric Deep Reinforcement Learning in Traffic Signal Control

    Authors: Dimitris M. Vlachogiannis, Hua Wei, Scott Moura, Jane Macfarlane

    Abstract: Single occupancy vehicles are the most attractive transportation alternative for many commuters, leading to increased traffic congestion and air pollution. Advancements in information technologies create opportunities for smart solutions that incentivize ridesharing and mode shift to higher occupancy vehicles (HOVs) to achieve the car lighter vision of cities. In this study, we present HumanLight,… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 29 pages, 17 figures

  30. arXiv:2304.03459  [pdf, other

    math.OC eess.SY

    Integrated motion control and energy management of series hybrid electric vehicles: A multi-objective MPC approach

    Authors: Henglai Wei, Guangyuan Li, Yang Lu, Hui Zhang

    Abstract: This paper considers the integrated motion control and energy management problems of the series hybrid electric vehicles (SHEV) with constraints. We propose a multi-objective model predictive control (MOMPC)-based energy management approach, which is embedded with the motion control to guarantee driving comfort. In addition, due to the slow response of the engine, it may cause excessive batter pow… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  31. arXiv:2303.03703  [pdf, other

    eess.IV

    Geometry-based spherical JND modeling for 360$^\circ$ display

    Authors: Hongan Wei, Jiaqi Liu, Bo Chen, Liqun Lin, Weiling Chen, Tiesong Zhao

    Abstract: 360$^\circ$ videos have received widespread attention due to its realistic and immersive experiences for users. To date, how to accurately model the user perceptions on 360$^\circ$ display is still a challenging issue. In this paper, we exploit the visual characteristics of 360$^\circ$ projection and display and extend the popular just noticeable difference (JND) model to spherical JND (SJND). Fir… ▽ More

    Submitted 4 June, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  32. arXiv:2301.00127  [pdf, other

    eess.IV cs.CV physics.med-ph

    Spatiotemporal implicit neural representation for unsupervised dynamic MRI reconstruction

    Authors: Jie Feng, Ruimin Feng, Qing Wu, Zhiyong Zhang, Yuyao Zhang, Hongjiang Wei

    Abstract: Supervised Deep-Learning (DL)-based reconstruction algorithms have shown state-of-the-art results for highly-undersampled dynamic Magnetic Resonance Imaging (MRI) reconstruction. However, the requirement of excessive high-quality ground-truth data hinders their applications due to the generalization problem. Recently, Implicit Neural Representation (INR) has appeared as a powerful DL-based tool fo… ▽ More

    Submitted 13 January, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

    Comments: 9 pages, 5 figures; corrected the code availability description for arXiv

  33. arXiv:2212.06557  [pdf, ps, other

    eess.SP

    A Data Quality Assessment Framework for AI-enabled Wireless Communication

    Authors: Hanning Tang, Liusha Yang, Rui Zhou, **g Liang, Hong Wei, Xuan Wang, Qingjiang Shi, Zhi-Quan Luo

    Abstract: Using artificial intelligent (AI) to re-design and enhance the current wireless communication system is a promising pathway for the future sixth-generation (6G) wireless network. The performance of AI-enabled wireless communication depends heavily on the quality of wireless air-interface data. Although there are various approaches to data quality assessment (DQA) for different applications, none h… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  34. arXiv:2211.01571  [pdf, other

    eess.AS cs.SD

    Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system

    Authors: Li Li, Dongxing Xu, Haoran Wei, Yanhua Long

    Abstract: Exploiting effective target modeling units is very important and has always been a concern in end-to-end automatic speech recognition (ASR). In this work, we propose a phonetic-assisted multi target units (PMU) modeling approach, to enhance the Conformer-Transducer ASR system in a progressive representation learning manner. Specifically, PMU first uses the pronunciation-assisted subword modeling (… ▽ More

    Submitted 7 July, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted by Interspeech 2023

  35. arXiv:2210.12731  [pdf, other

    eess.IV cs.CV

    Joint Rigid Motion Correction and Sparse-View CT via Self-Calibrating Neural Field

    Authors: Qing Wu, Xin Li, Hongjiang Wei, **gyi Yu, Yuyao Zhang

    Abstract: Neural Radiance Field (NeRF) has widely received attention in Sparse-View Computed Tomography (SVCT) reconstruction tasks as a self-supervised deep learning framework. NeRF-based SVCT methods represent the desired CT image as a continuous function of spatial coordinates and train a Multi-Layer Perceptron (MLP) to learn the function by minimizing loss on the SV sinogram. Benefiting from the continu… ▽ More

    Submitted 6 November, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 5 pages

  36. arXiv:2210.10439  [pdf, other

    eess.IV cs.CV cs.LG

    A scan-specific unsupervised method for parallel MRI reconstruction via implicit neural representation

    Authors: Ruimin Feng, Qing Wu, Yuyao Zhang, Hongjiang Wei

    Abstract: Parallel imaging is a widely-used technique to accelerate magnetic resonance imaging (MRI). However, current methods still perform poorly in reconstructing artifact-free MRI images from highly undersampled k-space data. Recently, implicit neural representation (INR) has emerged as a new deep learning paradigm for learning the internal continuity of an object. In this study, we adopted INR to paral… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: conference

  37. Accelerated partial separable model using dimension-reduced optimization technique for ultra-fast cardiac MRI

    Authors: Zhongsen Li, Aiqi Sun, Chuyu Liu, Haining Wei, Shuai Wang, Mingzhu Fu, Rui Li

    Abstract: Objective. Imaging dynamic object with high temporal resolution is challenging in magnetic resonance imaging (MRI). Partial separable (PS) model was proposed to improve the imaging quality by reducing the degrees of freedom of the inverse problem. However, PS model still suffers from long acquisition time and even longer reconstruction time. The main objective of this study is to accelerate the PS… ▽ More

    Submitted 1 April, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: 23 pages, 11 figures. Accepted as manuscript on Physics in Medicine & Biology

  38. arXiv:2209.10005  [pdf, other

    eess.IV cs.CV

    Subjective Assessment of High Dynamic Range Videos Under Different Ambient Conditions

    Authors: Zaixi Shang, Joshua P. Ebenezer, Alan C. Bovik, Yongjun Wu, Hai Wei, Sriram Sethuraman

    Abstract: High Dynamic Range (HDR) videos can represent a much greater range of brightness and color than Standard Dynamic Range (SDR) videos and are rapidly becoming an industry standard. HDR videos have more challenging capture, transmission, and display requirements than legacy SDR videos. With their greater bit depth, advanced electro-optical transfer functions, and wider color gamuts, comes the need fo… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  39. arXiv:2209.08785  [pdf, other

    eess.SY

    A Robust Distributed Model Predictive Control Framework for Consensus of Multi-Agent Systems with Input Constraints and Varying Delays

    Authors: Henglai Wei, Changxin Liu, Yang Shi

    Abstract: This paper studies the consensus problem of general linear discrete-time multi-agent systems (MAS) with input constraints and bounded time-varying communication delays. We propose a robust distributed model predictive control (DMPC) consensus protocol that integrates the offline consensus design with online DMPC optimization to exploit their respective advantages. More precisely, each agent is equ… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  40. arXiv:2209.06413  [pdf, other

    eess.IV cs.CV

    Continuous longitudinal fetus brain atlas construction via implicit neural representation

    Authors: Lixuan Chen, Jiangjie Wu, Qing Wu, Hongjiang Wei, Yuyao Zhang

    Abstract: Longitudinal fetal brain atlas is a powerful tool for understanding and characterizing the complex process of fetus brain development. Existing fetus brain atlases are typically constructed by averaged brain images on discrete time points independently over time. Due to the differences in onto-genetic trends among samples at different time points, the resulting atlases suffer from temporal inconsi… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: 11 pages, 4 figures

  41. arXiv:2209.06411  [pdf, other

    eess.IV cs.CV cs.LG

    Noise2SR: Learning to Denoise from Super-Resolved Single Noisy Fluorescence Image

    Authors: Xuanyu Tian, Qing Wu, Hongjiang Wei, Yuyao Zhang

    Abstract: Fluorescence microscopy is a key driver to promote discoveries of biomedical research. However, with the limitation of microscope hardware and characteristics of the observed samples, the fluorescence microscopy images are susceptible to noise. Recently, a few self-supervised deep learning (DL) denoising methods have been proposed. However, the training efficiency and denoising performance of exis… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: 12 pages, 6 figures

    Journal ref: MICCAI 2022

  42. arXiv:2209.05483  [pdf, other

    eess.IV cs.CV cs.LG

    Self-Supervised Coordinate Projection Network for Sparse-View Computed Tomography

    Authors: Qing Wu, Ruimin Feng, Hongjiang Wei, **gyi Yu, Yuyao Zhang

    Abstract: In the present work, we propose a Self-supervised COordinate Projection nEtwork (SCOPE) to reconstruct the artifacts-free CT image from a single SV sinogram by solving the inverse tomography imaging problem. Compared with recent related works that solve similar problems using implicit neural representation network (INR), our essential contribution is an effective and simple re-projection strategy… ▽ More

    Submitted 11 August, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 12 pages

    Journal ref: IEEE Transactions on Computational Imaging 9 (2023) 517-529

  43. arXiv:2207.12002  [pdf, ps, other

    cs.RO eess.SY

    An Optimal Motion Planning Framework for Quadruped Jum**

    Authors: Zhitao Song, Linzhu Yue, Guangli Sun, Yihu Ling, Hongshuo Wei, Linhai Gui, Yun-Hui Liu

    Abstract: This paper presents an optimal motion planning framework to generate versatile energy-optimal quadrupedal jum** motions automatically (e.g., flips, spin). The jum** motions via the centroidal dynamics are formulated as a 12-dimensional black-box optimization problem subject to the robot kino-dynamic constraints. Gradient-based approaches offer great success in addressing trajectory optimizatio… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Accept by IROS 2022

  44. arXiv:2207.05798  [pdf, other

    eess.IV

    Subjective and Objective Quality Assessment of High-Motion Sports Videos at Low-Bitrates

    Authors: Joshua P. Ebenezer, Yixu Chen, Yongjun Wu, Hai Wei, Sriram Sethuraman

    Abstract: Videos often have to be transmitted and stored at low bitrates due to poor network connectivity during adaptive bitrate streaming. Designing optimal bitrate ladders that would select the perceptually-optimized resolution, frame-rate, and compression level for low-bitrate videos for adaptive streaming across the internet is therefore a task of great interest. Towards that end, we conducted the firs… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted at ICIP 2022

  45. arXiv:2207.00842  [pdf, other

    eess.SY

    Safe Reinforcement Learning for a Robot Being Pursued but with Objectives Covering More Than Capture-avoidance

    Authors: Huanhui Cao, Zhiyuan Cai, Hairuo Wei, Wenjie Lu, Lin Zhang, Hao Xiong

    Abstract: Reinforcement Learning (RL) algorithms show amazing performance in recent years, but placing RL in real-world applications such as self-driven vehicles may suffer safety problems. A self-driven vehicle moving to a target position following a learned policy may suffer a vehicle with unpredictable aggressive behaviors or even being pursued by a vehicle following a Nash strategy. To address the safet… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

  46. arXiv:2206.04733  [pdf, other

    stat.AP eess.SY math.ST

    On Low-Complexity Quickest Intervention of Mutated Diffusion Processes Through Local Approximation

    Authors: Qining Zhang, Honghao Wei, Weina Wang, Lei Ying

    Abstract: We consider the problem of controlling a mutated diffusion process with an unknown mutation time. The problem is formulated as the quickest intervention problem with the mutation modeled by a change-point, which is a generalization of the quickest change-point detection (QCD). Our goal is to intervene in the mutated process as soon as possible while maintaining a low intervention cost with optimal… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  47. arXiv:2112.06091  [pdf

    eess.SP

    Continuous Human Action Detection Based on Wearable Inertial Data

    Authors: Xia Gong, Yan Lu, Haoran Wei

    Abstract: Human action detection is a hot topic, which is widely used in video surveillance, human machine interface, healthcare monitoring, gaming, dancing training and musical instrument teaching. As inertial sensors are low cost, portable, and having no operating space, it is suitable to detect human action. In real-world applications, actions that are of interest appear among actions of non interest wit… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

  48. arXiv:2111.00803  [pdf

    eess.SP

    A Modified Dynamic Time War** (MDTW) Approach and Innovative Average Non-Self Match Distance (ANSD) Method for Anomaly Detection in ECG Recordings

    Authors: Hua-Liang Wei

    Abstract: ECGs objectively reflects the working conditions of the hearts as these signals contain vast physiological and pathological information. In this work, in order to improve the efficiency and accuracy of "best so far" time series analysis-based ECG anomaly detection methods, a novel method, comprising a modified dynamic time war** (MDTW) and an innovative average non-self match distance (ANSD) mea… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 18 pages, 11 figures, 9 tables

  49. arXiv:2111.00748  [pdf

    eess.SY

    Quasi-Linear Transfer Function: A New Method for Frequency Domain Analysis of Nonlinear Systems

    Authors: Hua-Liang Wei, S. A. Billings

    Abstract: A new concept, called quasi-linear transfer functions (QLTF), which can be used to characterize the output frequency behaviour of nonlinear systems, is introduced based on the well-known Volterra series representation. By using the new concept of QLTF, it can be proved that the input and output frequency behaviour of a given system can be expressed using a number of one-dimensional functions with… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 26 pages, 9 figures, 3 tables

  50. arXiv:2110.15830  [pdf

    eess.SY nlin.PS

    Characterising Linear Spatio-Temporal Dynamical Systems in the Frequency Domain

    Authors: Hua-Liang Wei, Stephen A. Billings

    Abstract: A new concept, called the spatio-temporal transfer function (STTF), is introduced to characterise a class of linear time-invariant (LTI) spatio-temporal dynamical systems. The spatio-temporal transfer function is a natural extension of the ordinary transfer function for classical linear time-invariant control systems. As in the case of the classical transfer function, the spatio-temporal transfer… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: 11 pages, 1 figure