Skip to main content

Showing 1–13 of 13 results for author: Sheng, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14118  [pdf, other

    eess.IV cs.CV

    Prediction and Reference Quality Adaptation for Learned Video Compression

    Authors: Xihua Sheng, Li Li, Dong Liu, Houqiang Li

    Abstract: Temporal prediction is one of the most important technologies for video compression. Various prediction coding modes are designed in traditional video codecs. Traditional video codecs will adaptively to decide the optimal coding mode according to the prediction quality and reference quality. Recently, learned video codecs have made great progress. However, they ignore the prediction and reference… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2404.10312  [pdf, other

    cs.CV eess.IV

    OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

    Authors: Runyi Li, Xuhan Sheng, Weiqi Li, Jian Zhang

    Abstract: Omnidirectional images (ODIs) are commonly used in real-world visual tasks, and high-resolution ODIs help improve the performance of related visual tasks. Most existing super-resolution methods for ODIs use end-to-end learning strategies, resulting in inferior realness of generated images and a lack of effective out-of-domain generalization capabilities in training methods. Image generation method… ▽ More

    Submitted 17 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2401.15864  [pdf, other

    cs.CV eess.IV

    Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression

    Authors: Xihua Sheng, Li Li, Dong Liu, Houqiang Li

    Abstract: Video compression performance is closely related to the accuracy of inter prediction. It tends to be difficult to obtain accurate inter prediction for the local video regions with inconsistent motion and occlusion. Traditional video coding standards propose various technologies to handle motion inconsistency and occlusion, such as recursive partitions, geometric partitions, and long-term reference… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  4. arXiv:2401.03396  [pdf

    eess.SP

    A Closed-loop Brain-Machine Interface SoC Featuring a 0.2$μ$J/class Multiplexer Based Neural Network

    Authors: Chao Zhang, Yongxiang Guo, Dawid Sheng, Zhixiong Ma, Chao Sun, Yuwei Zhang, Wenxin Zhao, Fenyan Zhang, Tongfei Wang, Xing Sheng, Milin Zhang

    Abstract: This work presents the first fabricated electrophysiology-optogenetic closed-loop bidirectional brain-machine interface (CL-BBMI) system-on-chip (SoC) with electrical neural signal recording, on-chip sleep staging and optogenetic stimulation. The first multiplexer with static assignment based table lookup solution (MUXnet) for multiplier-free NN processor was proposed. A state-of-the-art average a… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 2 pages, 6 figures. Accepted by IEEE Custom Integrated Circuits Conference (CICC) 2024. The codes for the MUXnet (constructing neural networks using multiplexers instead of multipliers) will be open-sourced after the Journal version of this work is accepted

  5. arXiv:2310.04984  [pdf, other

    cs.IT cs.LG eess.SP math.PR stat.ML

    Model-adapted Fourier sampling for generative compressed sensing

    Authors: Aaron Berk, Simone Brugiapaglia, Yaniv Plan, Matthew Scott, Xia Sheng, Ozgur Yilmaz

    Abstract: We study generative compressed sensing when the measurement matrix is randomly subsampled from a unitary matrix (with the DFT as an important special case). It was recently shown that $\textit{O}(kdn\| \boldsymbolα\|_{\infty}^{2})$ uniformly random Fourier measurements are sufficient to recover signals in the range of a neural network $G:\mathbb{R}^k \to \mathbb{R}^n$ of depth $d$, where each comp… ▽ More

    Submitted 17 November, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: 12 pages, 4 figures. Submitted to the NeurIPS 2023 Workshop on Deep Learning and Inverse Problems. This revision features additional attribution of work, aknowledgmenents, and a correction in definition 1.1

  6. arXiv:2307.05092  [pdf, other

    cs.CV eess.IV

    Offline and Online Optical Flow Enhancement for Deep Video Compression

    Authors: Chuanbo Tang, Xihua Sheng, Zhuoyuan Li, Haotian Zhang, Li Li, Dong Liu

    Abstract: Video compression relies heavily on exploiting the temporal redundancy between video frames, which is usually achieved by estimating and using the motion information. The motion information is represented as optical flows in most of the existing deep video compression networks. Indeed, these networks often adopt pre-trained optical flow estimation networks for motion estimation. The optical flows,… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 9 pages, 6 figures

  7. arXiv:2306.13969  [pdf, other

    eess.IV eess.SP

    Farthest Streamline Sampling for the Uniform Distribution of Forearm Muscle Fiber Tracts from Diffusion Tensor Imaging

    Authors: Yang Li, Shihan Ma, Jiamin Zhao, Qing Li, Xinjun Sheng

    Abstract: Background: Diffusion tensor imaging (DTI) has been used to characterize forearm muscle architecture. Since only uniform sampling is performed for seed points rather than fiber tracts, the tracts may be unevenly distributed in the muscle volume. Purpose: To reconstruct uniformly distributed fiber tracts in human forearm by filtering the tracts from DTI. Assessment: Farthest streamline sampling (FS… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 20 pages, 7 figures

  8. arXiv:2306.10681  [pdf, other

    eess.IV cs.CV

    VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision

    Authors: Xihua Sheng, Li Li, Dong Liu, Houqiang Li

    Abstract: Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by machine vision algorithms. Intuitively, it is more efficient to enhance/analyze the coded representations directly without decoding them into pixels. Therefore, w… ▽ More

    Submitted 1 November, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

  9. arXiv:2306.10482  [pdf, other

    math.OC cs.CV eess.IV

    Weighted structure tensor total variation for image denoising

    Authors: Xiuhan Sheng, Lijuan Yang, **gya Chang

    Abstract: For image denoising problems, the structure tensor total variation (STV)-based models show good performances when compared with other competing regularization approaches. However, the STV regularizer does not couple the local information of the image and may not maintain the image details. Therefore, we employ the anisotropic weighted matrix introduced in the anisotropic total variation (ATV) mode… ▽ More

    Submitted 4 April, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

  10. arXiv:2304.13471  [pdf, other

    eess.IV cs.CV

    OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

    Authors: Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

    Abstract: 360° omnidirectional images have gained research attention due to their immersive and interactive experience, particularly in AR/VR applications. However, they suffer from lower angular resolution due to being captured by fisheye lenses with the same sensor size for capturing planar images. To solve the above issues, we propose a two-stage framework for 360° omnidirectional image superresolution.… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPRW 2023

  11. arXiv:2211.01856  [pdf, other

    cs.LG cs.CE eess.SP physics.bio-ph

    Conditional Generative Models for Simulation of EMG During Naturalistic Movements

    Authors: Shihan Ma, Alexander Kenneth Clarke, Kostiantyn Maksymenko, Samuel Deslauriers-Gauthier, Xinjun Sheng, Xiangyang Zhu, Dario Farina

    Abstract: Numerical models of electromyographic (EMG) signals have provided a huge contribution to our fundamental understanding of human neurophysiology and remain a central pillar of motor neuroscience and the development of human-machine interfaces. However, whilst modern biophysical simulations based on finite element methods are highly accurate, they are extremely computationally expensive and thus are… ▽ More

    Submitted 5 October, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  12. Attribute Artifacts Removal for Geometry-based Point Cloud Compression

    Authors: Xihua Sheng, Li Li, Dong Liu, Zhiwei Xiong

    Abstract: Geometry-based point cloud compression (G-PCC) can achieve remarkable compression efficiency for point clouds. However, it still leads to serious attribute compression artifacts, especially under low bitrate scenarios. In this paper, we propose a Multi-Scale Graph Attention Network (MS-GAT) to remove the artifacts of point cloud attributes compressed by G-PCC. We first construct a graph based on p… ▽ More

    Submitted 28 February, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  13. arXiv:2111.13850  [pdf, other

    cs.CV cs.LG eess.IV

    Temporal Context Mining for Learned Video Compression

    Authors: Xihua Sheng, Jiahao Li, Bin Li, Li Li, Dong Liu, Yan Lu

    Abstract: We address end-to-end learned video compression with a special focus on better learning and utilizing temporal contexts. For temporal context mining, we propose to store not only the previously reconstructed frames, but also the propagated features into the generalized decoded picture buffer. From the stored propagated features, we propose to learn multi-scale temporal contexts, and re-fill the le… ▽ More

    Submitted 30 January, 2023; v1 submitted 27 November, 2021; originally announced November 2021.