Skip to main content

Showing 1–25 of 25 results for author: Zheng, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08283  [pdf, other

    cs.RO eess.SY

    A Hybrid Task-Constrained Motion Planning for Collaborative Robots in Intelligent Remanufacturing

    Authors: Wansong Liu, Chang Liu, Xiao Liang, Minghui Zheng

    Abstract: Industrial manipulators have extensively collaborated with human operators to execute tasks, e.g., disassembly of end-of-use products, in intelligent remanufacturing. A safety task execution requires real-time path planning for the manipulator's end-effector to autonomously avoid human operators. This is even more challenging when the end-effector needs to follow a planned path while avoiding the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.02518  [pdf, other

    cs.CV eess.IV

    DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering

    Authors: Zhongpai Gao, Benjamin Planche, Meng Zheng, Xiao Chen, Terrence Chen, Ziyan Wu

    Abstract: Digitally reconstructed radiographs (DRRs) are simulated 2D X-ray images generated from 3D CT volumes, widely used in preoperative settings but limited in intraoperative applications due to computational bottlenecks, especially for accurate but heavy physics-based Monte Carlo methods. While analytical DRR renderers offer greater efficiency, they overlook anisotropic X-ray image formation phenomena… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2405.07962  [pdf, other

    cs.RO eess.SY

    KG-Planner: Knowledge-Informed Graph Neural Planning for Collaborative Manipulators

    Authors: Wansong Liu, Kareem Eltouny, Sibo Tian, Xiao Liang, Minghui Zheng

    Abstract: This paper presents a novel knowledge-informed graph neural planner (KG-Planner) to address the challenge of efficiently planning collision-free motions for robots in high-dimensional spaces, considering both static and dynamic environments involving humans. Unlike traditional motion planners that struggle with finding a balance between efficiency and optimality, the KG-Planner takes a different a… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  4. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  5. Improving Disturbance Estimation and Suppression via Learning among Systems with Mismatched Dynamics

    Authors: Harsh Modi, Zhu Chen, Xiao Liang, Minghui Zheng

    Abstract: Iterative learning control (ILC) is a method for reducing system tracking or estimation errors over multiple iterations by using information from past iterations. The disturbance observer (DOB) is used to estimate and mitigate disturbances within the system, while the system is being affected by them. ILC enhances system performance by introducing a feedforward signal in each iteration. However, i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  6. Wi-Fi-based Personnel Identity Recognition: Addressing Dataset Imbalance with C-DDPMs

    Authors: Jichen Bian, Chong Tan, Peiyao Tang, Min Zheng

    Abstract: Wireless sensing technologies become increasingly prevalent due to the ubiquitous nature of wireless signals and their inherent privacy-friendly characteristics. Device-free personnel identity recognition, a prevalent application in wireless sensing, is susceptibly challenged by imbalanced channel state information (CSI) datasets. This letter proposes a novel method for CSI dataset augmentation th… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Journal ref: IEEE Signal Processing Letters, 2024

  7. arXiv:2403.05807  [pdf, other

    cs.CV eess.IV

    A self-supervised CNN for image watermark removal

    Authors: Chunwei Tian, Menghua Zheng, Tiancai Jiao, Wangmeng Zuo, Yanning Zhang, Chia-Wen Lin

    Abstract: Popular convolutional neural networks mainly use paired images in a supervised way for image watermark removal. However, watermarked images do not have reference images in the real world, which results in poor robustness of image watermark removal techniques. In this paper, we propose a self-supervised convolutional neural network (CNN) in image watermark removal (SWCNN). SWCNN uses a self-supervi… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  8. arXiv:2402.17200  [pdf, other

    cs.CV eess.IV

    Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain

    Authors: Qunliang Xing, Mai Xu, Shengxi Li, Xin Deng, Meisong Zheng, Huaida Liu, Ying Chen

    Abstract: Existing quality enhancement methods for compressed images focus on aligning the enhancement domain with the raw domain to yield realistic images. However, these methods exhibit a pervasive enhancement bias towards the compression domain, inadvertently regarding it as more realistic than the raw domain. This bias makes enhanced images closely resemble their compressed counterparts, thus degrading… ▽ More

    Submitted 19 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024

  9. arXiv:2311.15231  [pdf, other

    cs.CV cs.LG eess.IV

    Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification

    Authors: Bo Xu, Hao Zheng, Zhigang Hu, Liu Yang, Meiguang Zheng

    Abstract: In current synthetic aperture radar (SAR) object classification, one of the major challenges is the severe overfitting issue due to the limited dataset (few-shot) and noisy data. Considering the advantages of knowledge distillation as a learned label smoothing regularization, this paper proposes a novel Double Reverse Regularization Network based on Self-Knowledge Distillation (DRRNet-SKD). Specif… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 6 pages, 8 figures

  10. arXiv:2310.10408  [pdf, other

    eess.IV cs.CV cs.LG

    A cross Transformer for image denoising

    Authors: Chunwei Tian, Menghua Zheng, Wangmeng Zuo, Shichao Zhang, Yanning Zhang, Chia-Wen Ling

    Abstract: Deep convolutional neural networks (CNNs) depend on feedforward and feedback ways to obtain good performance in image denoising. However, how to obtain effective structural information via CNNs to efficiently represent given noisy images is key for complex scenes. In this paper, we propose a cross Transformer denoising CNN (CTNet) with a serial block (SB), a parallel block (PB), and a residual blo… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  11. arXiv:2306.02634  [pdf, other

    physics.optics cs.CV cs.LG eess.IV

    Computational 3D topographic microscopy from terabytes of data per sample

    Authors: Kevin C. Zhou, Mark Harfouche, Maxwell Zheng, Joakim Jönsson, Kyung Chul Lee, Ron Appel, Paul Reamey, Thomas Doman, Veton Saliu, Gregor Horstmeyer, Roarke Horstmeyer

    Abstract: We present a large-scale computational 3D topographic microscope that enables 6-gigapixel profilometric 3D imaging at micron-scale resolution across $>$110 cm$^2$ areas over multi-millimeter axial ranges. Our computational microscope, termed STARCAM (Scanning Topographic All-in-focus Reconstruction with a Computational Array Microscope), features a parallelized, 54-camera architecture with 3-axis… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  12. arXiv:2305.12708  [pdf, other

    eess.AS cs.SD

    ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer

    Authors: Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, **zheng He, Zhou Zhao

    Abstract: Text-to-speech(TTS) has undergone remarkable improvements in performance, particularly with the advent of Denoising Diffusion Probabilistic Models (DDPMs). However, the perceived quality of audio depends not solely on its content, pitch, rhythm, and energy, but also on the physical environment. In this work, we propose ViT-TTS, the first visual TTS model with scalable diffusion transformers. ViT-T… ▽ More

    Submitted 21 April, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by EMNLP 2023

  13. arXiv:2302.05736  [pdf

    eess.SY

    Locating the Sources of Sub-synchronous Oscillations Induced by the Control of Voltage Source Converters Based on Energy Structure and Nonlinearity Detection

    Authors: Zetian Zheng, Shaowei Huang, Jun Yan, Qiangsheng Bu, Chen Shen, Mingzhong Zheng, Ye Liu

    Abstract: The oscillation phenomena associated with the control of voltage source converters (VSCs) are widely concerning, and locating the source of these oscillations is crucial to suppressing them; therefore, this paper presents a locating scheme, based on the energy structure and nonlinearity detection. On the one hand, the energy structure, which conforms with the principle of the energy-based method a… ▽ More

    Submitted 17 February, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  14. arXiv:2301.08351  [pdf, other

    physics.optics eess.IV physics.bio-ph

    Parallelized computational 3D video microscopy of freely moving organisms at multiple gigapixels per second

    Authors: Kevin C. Zhou, Mark Harfouche, Colin L. Cooke, Jaehee Park, Pavan C. Konda, Lucas Kreiss, Kanghyun Kim, Joakim Jönsson, Jed Doman, Paul Reamey, Veton Saliu, Clare B. Cook, Maxwell Zheng, Jack P. Bechtel, Aurélien Bègue, Matthew McCarroll, Jennifer Bagwell, Gregor Horstmeyer, Michel Bagnat, Roarke Horstmeyer

    Abstract: To study the behavior of freely moving model organisms such as zebrafish (Danio rerio) and fruit flies (Drosophila) across multiple spatial scales, it would be ideal to use a light microscope that can resolve 3D information over a wide field of view (FOV) at high speed and high spatial resolution. However, it is challenging to design an optical instrument to achieve all of these properties simulta… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  15. arXiv:2209.12394  [pdf, other

    eess.IV cs.CV

    Multi-stage image denoising with the wavelet transform

    Authors: Chunwei Tian, Menghua Zheng, Wangmeng Zuo, Bob Zhang, Yanning Zhang, David Zhang

    Abstract: Deep convolutional neural networks (CNNs) are used for image denoising via automatically mining accurate structure information. However, most of existing CNNs depend on enlarging depth of designed networks to obtain better denoising performance, which may cause training difficulty. In this paper, we propose a multi-stage image denoising CNN with the wavelet transform (MWDCNN) via three stages, i.e… ▽ More

    Submitted 3 October, 2022; v1 submitted 25 September, 2022; originally announced September 2022.

  16. Multi-frequency PolSAR Image Fusion Classification Based on Semantic Interactive Information and Topological Structure

    Authors: Yice Cao, Yan Wu, Ming Li, Mingjie Zheng, Peng Zhang, Jili Wang

    Abstract: Compared with the rapid development of single-frequency multi-polarization SAR image classification technology, there is less research on the land cover classification of multifrequency polarimetric SAR (MF-PolSAR) images. In addition, the current deep learning methods for MF-PolSAR classification are mainly based on convolutional neural networks (CNNs), only local spatiality is considered but the… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

  17. arXiv:2112.09310  [pdf, other

    cs.IT eess.SP

    Joint Device Detection, Channel Estimation, and Data Decoding with Collision Resolution for MIMO Massive Unsourced Random Access

    Authors: Tianya Li, Yongpeng Wu, Mengfan Zheng, Wenjun Zhang, Chengwen Xing, Jian** An, Xiang-Gen Xia, Chengshan Xiao

    Abstract: In this paper, we investigate a joint device activity detection (DAD), channel estimation (CE), and data decoding (DD) algorithm for multiple-input multiple-output (MIMO) massive unsourced random access (URA). Different from the state-of-the-art slotted transmission scheme, the data in the proposed framework is split into only two parts. A portion of the data is coded by compressed sensing (CS) an… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted by IEEE JSAC special issue on Next Generation Multiple Access

  18. arXiv:2111.02461  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Automatic ultrasound vessel segmentation with deep spatiotemporal context learning

    Authors: Baichuan Jiang, Alvin Chen, Shyam Bharat, Mingxin Zheng

    Abstract: Accurate, real-time segmentation of vessel structures in ultrasound image sequences can aid in the measurement of lumen diameters and assessment of vascular diseases. This, however, remains a challenging task, particularly for extremely small vessels that are difficult to visualize. We propose to leverage the rich spatiotemporal context available in ultrasound to improve segmentation of small-scal… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

  19. arXiv:2110.06977  [pdf, other

    eess.SY

    Cloud-Assisted Collaborative Road Information Discovery with Gaussian Process: Application to Road Profile Estimation

    Authors: Mohammad R. Hajidavalloo, Zhaojian Li, Xin Xia, Ali Louati, Minghui Zheng, Weichao Zhuang

    Abstract: There is an increasing popularity in exploiting modern vehicles as mobile sensors to obtain important road information such as potholes, black ice and road profile. Availability of such information has been identified as a key enabler for next-generation vehicles with enhanced safety, efficiency, and comfort. However, existing road information discovery approaches have been predominately performed… ▽ More

    Submitted 9 June, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems

  20. arXiv:2011.04988  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Rendering Realistic Bokeh

    Authors: Andrey Ignatov, Radu Timofte, Ming Qian, Congyu Qiao, Jiamin Lin, Zhenyu Guo, Chenghua Li, Cong Leng, Jian Cheng, Juewen Peng, Xianrui Luo, Ke Xian, Zi** Wu, Zhiguo Cao, Densen Puthussery, Jiji C V, Hrishikesh P S, Melvin Kuriakose, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan , et al. (10 additional authors not shown)

    Abstract: This paper reviews the second AIM realistic bokeh effect rendering challenge and provides the description of the proposed solutions and results. The participating teams were solving a real-world bokeh simulation problem, where the goal was to learn a realistic shallow focus technique using a large-scale EBB! bokeh dataset consisting of 5K shallow / wide depth-of-field image pairs captured using th… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Published in ECCV 2020 Workshop (Advances in Image Manipulation), https://data.vision.ee.ethz.ch/cvl/aim20/

  21. Large Intelligent Surface Aided Physical Layer Security Transmission

    Authors: Biqian Feng, Yongpeng Wu, Mengfan Zheng, Xiang-Gen Xia, Yongjian Wang, Chengshan Xiao

    Abstract: In this paper, we investigate a large intelligent surface-enhanced (LIS-enhanced) system, where a LIS is deployed to assist secure transmission. Our design aims to maximize the achievable secrecy rates in different channel models, i.e., Rician fading and (or) independent and identically distributed Gaussian fading for the legitimate and eavesdropper channels. In addition, we take into consideratio… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: Accepted by IEEE Transactions on Signal Processing

  22. arXiv:2003.02649  [pdf, other

    eess.SP

    An Audio-Based Fault Diagnosis Method for Quadrotors Using Convolutional Neural Network and Transfer Learning

    Authors: Wansong Liu, Zhu Chen, Minghui Zheng

    Abstract: Quadrotor unmanned aerial vehicles (UAVs) have been developed and applied into several types of workplaces, such as warehouses, which usually involve human workers. The co-existence of human and UAVs brings new challenges to UAVs: potential failure of UAVs may cause risk and danger to surrounding human. Effective and efficient detection of such failure may provide early warning to the surrounding… ▽ More

    Submitted 12 August, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: ACC 2020 Final Version

  23. arXiv:2003.02059  [pdf, other

    cs.CV eess.IV eess.SY

    Vehicle-Human Interactive Behaviors in Emergency: Data Extraction from Traffic Accident Videos

    Authors: Wansong Liu, Danyang Luo, Changxu Wu, Minghui Zheng

    Abstract: Currently, studying the vehicle-human interactive behavior in the emergency needs a large amount of datasets in the actual emergent situations that are almost unavailable. Existing public data sources on autonomous vehicles (AVs) mainly focus either on the normal driving scenarios or on emergency situations without human involvement. To fill this gap and facilitate related research, this paper pro… ▽ More

    Submitted 12 August, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: ACC 2020 final version

  24. arXiv:1912.09859  [pdf, ps, other

    cs.LG cs.NI eess.SP stat.ML

    Lightweight and Unobtrusive Data Obfuscation at IoT Edge for Remote Inference

    Authors: Dixing Xu, Mengyao Zheng, Linshan Jiang, Chaojie Gu, Rui Tan, Peng Cheng

    Abstract: Executing deep neural networks for inference on the server-class or cloud backend based on data generated at the edge of Internet of Things is desirable due primarily to the limited compute power of edge devices and the need to protect the confidentiality of the inference neural networks. However, such a remote inference scheme incurs concerns regarding the privacy of the inference data transmitte… ▽ More

    Submitted 25 March, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: This paper has been accepted by IEEE Internet of Things Journal, Special Issue on Artificial Intelligence Powered Edge Computing for Internet of Things

  25. arXiv:1908.03478  [pdf, other

    eess.SY

    A Preliminary Study on A Physical Model Oriented Learning Algorithm with Application to UAVs

    Authors: Minghui Zheng, Zhu Chen, Xiao Liang

    Abstract: This paper provides a preliminary study for an efficient learning algorithm by reasoning the error from first principle physics to generate learning signals in near real time. Motivated by iterative learning control (ILC), this learning algorithm is applied to the feedforward control loop of the unmanned aerial vehicles (UAVs), enabling the learning from errors made by other UAVs with different dy… ▽ More

    Submitted 9 August, 2019; originally announced August 2019.