Showing 1–50 of 83 results for author: Luo, Z

Searching in archive eess.
  1. arXiv:2406.08858  [pdf, other]

    cs.RO cs.CV cs.LG eess.SY

    OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning

    Authors: Tairan He, Zhengyi Luo, Xialin He, Wenli Xiao, Chong Zhang, Weinan Zhang, Kris Kitani, Changliu Liu, Guanya Shi

    Abstract: We present OmniH2O (Omni Human-to-Humanoid), a learning-based system for whole-body humanoid teleoperation and autonomy. Using kinematic pose as a universal control interface, OmniH2O enables various ways for a human to control a full-sized humanoid with dexterous hands, including using real-time teleoperation through VR headset, verbal instruction, and RGB camera. OmniH2O also enables full autono…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Project page: https://omni.human2humanoid.com/

  2. arXiv:2405.12496  [pdf, other]

    eess.AS cs.NI cs.SD eess.SP

    A Survey of Integrating Wireless Technology into Active Noise Control

    Authors: Xiaoyi Shen, Dongyuan Shi, Zhengding Luo, Junwei Ji, Woon-Seng Gan

    Abstract: Active Noise Control (ANC) is a widely adopted technology for reducing environmental noise across various scenarios. This paper focuses on enhancing noise reduction performance, particularly through the refinement of signal quality fed into ANC systems. We discuss the main wireless technique integrated into the ANC system, equipped with some innovative algorithms, in diverse environments. Instead…

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2405.03178  [pdf, other]

    cs.SD eess.AS

    POPDG: Popular 3D Dance Generation with PopDanceSet

    Authors: Zhenye Luo, Min Ren, Xuecai Hu, Yongzhen Huang, Li Yao

    Abstract: Generating dances that are both lifelike and well-aligned with music continues to be a challenging task in the cross-modal domain. This paper introduces PopDanceSet, the first dataset tailored to the preferences of young audiences, enabling the generation of aesthetically oriented dances. It also surpasses the AIST++ dataset in music genre diversity and the intricacy and depth of dance movements. M…

    Submitted 6 May, 2024; originally announced May 2024.

  4. arXiv:2404.17890  [pdf, other]

    eess.IV cs.AI cs.CV

    DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction

    Authors: Chenhe Du, Xiyue Lin, Qing Wu, Xuanyu Tian, Ying Su, Zhe Luo, Hongjiang Wei, S. Kevin Zhou, Jingyi Yu, Yuyao Zhang

    Abstract: Limited-angle and sparse-view computed tomography (LACT and SVCT) are crucial for expanding the scope of X-ray CT applications. However, they face challenges due to incomplete data acquisition, resulting in diverse artifacts in the reconstructed CT images. Emerging implicit neural representation (INR) techniques, such as NeRF, NeAT, and NeRP, have shown promise in under-determined CT imaging recon…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 15 pages, 10 figures

    ACM Class: I.2.10; I.4.5

  5. arXiv:2404.00549  [pdf]

    eess.IV cs.CV

    Pneumonia App: a mobile application for efficient pediatric pneumonia diagnosis using explainable convolutional neural networks (CNN)

    Authors: Jiaming Deng, Zhenglin Chen, Minjiang Chen, Lulu Xu, Jiaqi Yang, Zhendong Luo, Peiwu Qin

    Abstract: Mycoplasma pneumoniae pneumonia (MPP) poses significant diagnostic challenges in pediatric healthcare, especially in regions like China where it's prevalent. We introduce PneumoniaAPP, a mobile application leveraging deep learning techniques for rapid MPP detection. Our approach capitalizes on convolutional neural networks (CNNs) trained on a comprehensive dataset comprising 3345 chest X-ray (CXR)…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 27 pages, 7 figures

    MSC Class: 68 ACM Class: J.3

  6. arXiv:2403.04436  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation

    Authors: Tairan He, Zhengyi Luo, Wenli Xiao, Chong Zhang, Kris Kitani, Changliu Liu, Guanya Shi

    Abstract: We present Human to Humanoid (H2O), a reinforcement learning (RL) based framework that enables real-time whole-body teleoperation of a full-sized humanoid robot with only an RGB camera. To create a large-scale retargeted motion dataset of human movements for humanoid robots, we propose a scalable "sim-to-data" process to filter and pick feasible motions using a privileged motion imitator. Afterwar…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Project website: https://human2humanoid.com/

  7. arXiv:2402.16274  [pdf, other]

    eess.SP

    Radar Anti-jamming Strategy Learning via Domain-knowledge Enhanced Online Convex Optimization

    Authors: Liangqi Liu, Wenqiang Pu, Yingru Li, Bo Jiu, Zhi-Quan Luo

    Abstract: The dynamic competition between radar and jammer systems presents a significant challenge for modern Electronic Warfare (EW), as current active learning approaches still lack sample efficiency and fail to exploit the jammer's characteristics. In this paper, the competition between a frequency agile radar and a Digital Radio Frequency Memory (DRFM)-based intelligent jammer is considered. We introduce a…

    Submitted 28 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

  8. Calibration of Deep Learning Classification Models in fNIRS

    Authors: Zhihao Cao, Zizhou Luo

    Abstract: Functional near-infrared spectroscopy (fNIRS) is a valuable non-invasive tool for monitoring brain activity. The classification of fNIRS data in relation to conscious activity holds significance for advancing our understanding of the brain and facilitating the development of brain-computer interfaces (BCI). Many researchers have turned to deep learning to tackle the classification challenges inher…

    Submitted 20 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  9. Unsupervised learning based end-to-end delayless generative fixed-filter active noise control

    Authors: Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Woon-Seng Gan

    Abstract: Delayless noise control is achieved by our earlier generative fixed-filter active noise control (GFANC) framework through efficient coordination between the co-processor and real-time controller. However, the one-dimensional convolutional neural network (1D CNN) in the co-processor requires initial training using labelled noise datasets. Labelling noise data can be resource-intensive and may intro…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  10. arXiv:2402.04080  [pdf, other]

    cs.LG eess.SY

    Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning

    Authors: Ruoqi Zhang, Ziwei Luo, Jens Sjölund, Thomas B. Schön, Per Mattsson

    Abstract: This paper presents advanced techniques of training diffusion policies for offline reinforcement learning (RL). At the core is a mean-reverting stochastic differential equation (SDE) that transfers a complex action distribution into a standard Gaussian and then samples actions conditioned on the environment state with a corresponding reverse-time SDE, like a typical diffusion policy. We show that…

    Submitted 6 February, 2024; originally announced February 2024.

  11. arXiv:2401.05000  [pdf]

    eess.SP

    Mapping Information in Feature Extraction Transformation for Chirp Signal

    Authors: Shuyi Gu, Zhenghua Luo, Lin Hu, Yilin Zhang, Junxiong Guo

    Abstract: Chirp signals have found diverse applications owing to their ability to produce time-dependent linear frequencies. Most feature extraction transformation methods for chirp signals focus on enhancing the performance of transform methods but neglect the information derived from the transformation process. Consequently, they may fail to fully exploit the information from observations, resul…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 14 pages, 10 figures

  12. arXiv:2312.16918  [pdf, other]

    cs.IT eess.SP

    Intelligent Surfaces Empowered Wireless Network: Recent Advances and The Road to 6G

    Authors: Qingqing Wu, Beixiong Zheng, Changsheng You, Lipeng Zhu, Kaiming Shen, Xiaodan Shao, Weidong Mei, Boya Di, Hongliang Zhang, Ertugrul Basar, Lingyang Song, Marco Di Renzo, Zhi-Quan Luo, Rui Zhang

    Abstract: Intelligent surfaces (ISs) have emerged as a key technology to empower a wide range of appealing applications for wireless networks, due to their low cost, high energy efficiency, flexibility of deployment and capability of constructing favorable wireless channels/radio environments. Moreover, the recent advent of several new IS architectures further expanded their electromagnetic functionalities…

    Submitted 24 March, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  13. End-to-end Alternating Optimization for Real-World Blind Super Resolution

    Authors: Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

    Abstract: Blind Super-Resolution (SR) usually involves two sub-problems: 1) estimating the degradation of the given low-resolution (LR) image; 2) super-resolving the LR image to its high-resolution (HR) counterpart. Both problems are ill-posed due to the information loss in the degrading process. Most previous methods try to solve the two problems independently, but often fall into a dilemma: a good super-r…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Extension of our previous NeurIPS paper. Accepted to IJCV

    Journal ref: International Journal of Computer Vision (IJCV) 2023

  14. arXiv:2308.04683  [pdf, other]

    eess.SP eess.SY

    Real-time FPGA Implementation of CNN-based Distributed Fiber Optic Vibration Event Recognition Method

    Authors: Zhongyao Luo, Zhao Ge, Hao Wu, Ming Tang

    Abstract: Utilizing optical fibers to detect and pinpoint vibrations, Distributed Optical Fiber Vibration Sensing (DVS) technology provides real-time monitoring and surveillance of wide-reaching areas. This field has been leveraging Convolutional Neural Networks (CNN). Recently, a study has accomplished end-to-end vibration event recognition, enabling utilization of CNN-based DVS algorithms as real-time emb…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 5 pages, 6 figures

  15. arXiv:2308.00220  [pdf, other]

    eess.IV cs.CV

    Boundary Difference Over Union Loss For Medical Image Segmentation

    Authors: Fan Sun, Zhiming Luo, Shaozi Li

    Abstract: Medical image segmentation is crucial for clinical diagnosis. However, current losses for medical image segmentation mainly focus on overall segmentation results, with fewer losses proposed to guide boundary segmentation. Those that do exist often need to be used in combination with other losses and produce ineffective results. To address this issue, we have developed a simple and effective loss c…

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: MICCAI 2023

  16. arXiv:2307.09770  [pdf, other]

    eess.SP cs.AI q-bio.NC

    Perturbing a Neural Network to Infer Effective Connectivity: Evidence from Synthetic EEG Data

    Authors: Peizhen Yang, Xinke Shen, Zongsheng Li, Zixiang Luo, Kexin Lou, Quanying Liu

    Abstract: Identifying causal relationships among distinct brain areas, known as effective connectivity, holds key insights into the brain's information processing and cognitive functions. Electroencephalogram (EEG) signals exhibit intricate dynamics and inter-areal interactions within the brain. However, methods for characterizing nonlinear causal interactions among multiple brain regions remain relatively…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 7 pages, 3 figures, 1 table

  17. arXiv:2306.15247  [pdf, ps, other]

    cs.IT eess.SP math.OC

    Towards Efficient Optimal Large-Scale Network Slicing: A Decomposition Approach

    Authors: Wei-Kun Chen, Zheyu Wu, Rui-Jin Zhang, Ya-Feng Liu, Yu-Hong Dai, Zhi-Quan Luo

    Abstract: This paper considers the network slicing (NS) problem which attempts to map multiple customized virtual network requests to a common shared network infrastructure and allocate network resources to meet diverse service requirements. This paper proposes an efficient decomposition algorithm for globally solving the large-scale NP-hard NS problem. The proposed algorithm decomposes the hard NS problem…

    Submitted 14 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 11 figures, submitted for possible publication; for the conference version, see arXiv:2306.15247v1

  18. arXiv:2306.11408  [pdf, other]

    eess.AS

    A Computation-efficient Online Secondary Path Modeling Technique for Modified FXLMS Algorithm

    Authors: Junwei Ji, Dongyuan Shi, Woon-Seng Gan, Xiaoyi Shen, Zhengding Luo

    Abstract: This paper proposes an online secondary path modelling (SPM) technique to improve the performance of the modified filtered reference Least Mean Square (FXLMS) algorithm. It can effectively respond to a time-varying secondary path, which refers to the path from a secondary source to an error sensor. Unlike traditional methods, the proposed approach switches modes between adaptive ANC and online SPM…

    Submitted 20 June, 2023; originally announced June 2023.

  19. arXiv:2303.13631  [pdf, other]

    cs.SD cs.AI cs.CL eess.AS

    In-depth analysis of music structure as a text network

    Authors: Ping-Rui Tsai, Yen-Ting Chou, Nathan-Christopher Wang, Hui-Ling Chen, Hong-Yue Huang, Zih-Jia Luo, Tzay-Ming Hong

    Abstract: Music, enchanting and poetic, permeates every corner of human civilization. Although music is not unfamiliar to people, our understanding of its essence remains limited, and there is still no universally accepted scientific description. This is primarily due to music being regarded as a product of both reason and emotion, making it difficult to define. In this article, we focus on the fundamental…

    Submitted 2 January, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: 7 pages, 8 figures

  20. arXiv:2303.08411  [pdf, other]

    eess.AS

    A practical distributed active noise control algorithm overcoming communication restrictions

    Authors: Junwei Ji, Dongyuan Shi, Zhengding Luo, Xiaoyi Shen, Woon-Seng Gan

    Abstract: By assigning the massive computing tasks of the traditional multichannel active noise control (MCANC) system to several distributed control nodes, distributed multichannel active noise control (DMCANC) techniques have become effective global noise reduction solutions with low computational costs. However, existing DMCANC algorithms simply complete the distribution of traditional centralized algori…

    Submitted 15 March, 2023; originally announced March 2023.

  21. arXiv:2303.08397  [pdf, other]

    eess.AS eess.SP

    A Momentum Two-gradient Direction Algorithm with Variable Step Size Applied to Solve Practical Output Constraint Issue for Active Noise Control

    Authors: Xiaoyi Shen, Dongyuan Shi, Zhengding Luo, Junwei Ji, Woon-Seng Gan

    Abstract: Active noise control (ANC) has been widely utilized to reduce unwanted environmental noise. The primary objective of ANC is to generate an anti-noise with the same amplitude but the opposite phase of the primary noise using the secondary source. However, the effectiveness of the ANC application is impacted by the speaker's output saturation. This paper proposes a two-gradient direction ANC algorit…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: Paper is submitted and accepted by ICASSP2023

  22. arXiv:2303.06681  [pdf, other]

    eess.IV cs.CV

    Learning Deep Intensity Field for Extremely Sparse-View CBCT Reconstruction

    Authors: Yiqun Lin, Zhongjin Luo, Wei Zhao, Xiaomeng Li

    Abstract: Sparse-view cone-beam CT (CBCT) reconstruction is an important direction to reduce radiation dose and benefit clinical applications. Previous voxel-based generation methods represent the CT as discrete voxels, resulting in high memory requirements and limited spatial resolution due to the use of 3D decoders. In this paper, we formulate the CT volume as a continuous intensity field and develop a no…

    Submitted 31 August, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: MICCAI'23

  23. Deep Generative Fixed-filter Active Noise Control

    Authors: Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Junwei Ji, Woon-Seng Gan

    Abstract: Due to the slow convergence and poor tracking ability, conventional LMS-based adaptive algorithms are less capable of handling dynamic noises. Selective fixed-filter active noise control (SFANC) can significantly reduce response time by selecting appropriate pre-trained control filters for different noises. Nonetheless, the limited number of pre-trained control filters may affect noise reduction p…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023. Code will be available after publication

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  24. arXiv:2303.02308  [pdf, other]

    eess.SP

    A Physics-based and Data-driven Approach for Localized Statistical Channel Modeling

    Authors: Shutao Zhang, Xinzhi Ning, Xi Zheng, Qingjiang Shi, Tsung-Hui Chang, Zhi-Quan Luo

    Abstract: Localized channel modeling is crucial for offline performance optimization of 5G cellular networks, but the existing channel models are for general scenarios and do not capture local geographical structures. In this paper, we propose a novel physics-based and data-driven localized statistical channel modeling (LSCM), which is capable of sensing the physical geographical structures of the targeted…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: the 34th International Teletraffic Congress (ITC), Shenzhen, China, 2022

  25. Coordinating Multiple Intelligent Reflecting Surfaces without Channel Information

    Authors: Fan Xu, Jiawei Yao, Wenhai Lai, Kaiming Shen, Xin Li, Xin Chen, Zhi-Quan Luo

    Abstract: Conventional beamforming methods for intelligent reflecting surfaces (IRSs) or reconfigurable intelligent surfaces (RISs) typically entail the full channel state information (CSI). However, the computational cost of channel acquisition soars exponentially with the number of IRSs. To bypass this difficulty, we propose a novel strategy called blind beamforming that coordinates multiple IRSs by means…

    Submitted 8 January, 2024; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: 16 pages

    Journal ref: IEEE Transactions on Signal Processing 2024

  26. arXiv:2212.11396  [pdf, ps, other]

    cs.LG eess.SP

    ABODE-Net: An Attention-based Deep Learning Model for Non-intrusive Building Occupancy Detection Using Smart Meter Data

    Authors: Zhirui Luo, Ruobin Qi, Qingqing Li, Jun Zheng, Sihua Shao

    Abstract: Occupancy information is useful for efficient energy management in the building sector. The massive high-resolution electrical power consumption data collected by smart meters in the advanced metering infrastructure (AMI) network make it possible to infer buildings' occupancy status in a non-intrusive way. In this paper, we propose a deep learning model called ABODE-Net which employs a novel Parall…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: To be published in The 7th International Conference on Smart Computing and Communication (SmartCom 2022)

  27. arXiv:2212.06557  [pdf, ps, other]

    eess.SP

    A Data Quality Assessment Framework for AI-enabled Wireless Communication

    Authors: Hanning Tang, Liusha Yang, Rui Zhou, Jing Liang, Hong Wei, Xuan Wang, Qingjiang Shi, Zhi-Quan Luo

    Abstract: Using artificial intelligence (AI) to re-design and enhance the current wireless communication system is a promising pathway for the future sixth-generation (6G) wireless network. The performance of AI-enabled wireless communication depends heavily on the quality of wireless air-interface data. Although there are various approaches to data quality assessment (DQA) for different applications, none h…

    Submitted 13 December, 2022; originally announced December 2022.

  28. arXiv:2211.05910  [pdf, other]

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

  29. A Linear Time Algorithm for the Optimal Discrete IRS Beamforming

    Authors: Shuyi Ren, Kaiming Shen, Xin Li, Xin Chen, Zhi-Quan Luo

    Abstract: It remains an open problem to find the optimal configuration of phase shifts under the discrete constraint for intelligent reflecting surface (IRS) in polynomial time. The above problem is widely believed to be difficult because it is not linked to any known combinatorial problems that can be solved efficiently. The branch-and-bound algorithms and the approximation algorithms constitute the best r…

    Submitted 7 September, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 5 pages

  30. arXiv:2208.11609  [pdf, other]

    eess.IV cs.CV

    Fast Nearest Convolution for Real-Time Efficient Image Super-Resolution

    Authors: Ziwei Luo, Youwei Li, Lei Yu, Qi Wu, Zhihong Wen, Haoqiang Fan, Shuaicheng Liu

    Abstract: Deep learning-based single image super-resolution (SISR) approaches have drawn much attention and achieved remarkable success on modern advanced GPUs. However, most state-of-the-art methods require a huge number of parameters, memories, and computational resources, which usually show inferior inference times when applying them to current mobile device CPUs/NPUs. In this paper, we propose a simple…

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: AIM & Mobile AI 2022

  31. arXiv:2208.08440  [pdf, other]

    cs.LG cs.AI eess.SP

    Performance Evaluation of Selective Fixed-filter Active Noise Control based on Different Convolutional Neural Networks

    Authors: Zhengding Luo, Dongyuan Shi, Woon-Seng Gan

    Abstract: Due to its rapid response time and a high degree of robustness, the selective fixed-filter active noise control (SFANC) method appears to be a viable candidate for widespread use in a variety of practical active noise control (ANC) systems. In comparison to conventional fixed-filter ANC methods, SFANC can select the pre-trained control filters for different types of noise. Deep learning technologi…

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:2208.08082

  32. arXiv:2208.08086  [pdf, other]

    eess.SY

    Implementation of Multi-channel Active Noise Control based on Back-propagation Mechanism

    Authors: Zhengding Luo, Dongyuan Shi, Junwei Ji, Woon-Seng Gan

    Abstract: Active noise control (ANC) systems can efficiently attenuate low-frequency noises by introducing anti-noises to combine with the unwanted noises. In ANC systems, the filtered-x least mean square (FxLMS) and filtered-x normalized least-mean-square (FxNLMS) algorithms are well known for adaptively adjusting control filters. Multi-channel ANC systems are typically required to attenuate unwa…

    Submitted 17 August, 2022; originally announced August 2022.

  33. arXiv:2208.08082  [pdf, other]

    eess.SY cs.LG cs.SD eess.AS

    A Hybrid SFANC-FxNLMS Algorithm for Active Noise Control based on Deep Learning

    Authors: Zhengding Luo, Dongyuan Shi, Woon-Seng Gan

    Abstract: The selective fixed-filter active noise control (SFANC) method selecting the best pre-trained control filters for various types of noise can achieve a fast response time. However, it may lead to large steady-state errors due to inaccurate filter selection and the lack of adaptability. In comparison, the filtered-X normalized least-mean-square (FxNLMS) algorithm can obtain lower steady-state errors…

    Submitted 17 August, 2022; originally announced August 2022.

    Report number: Vol.29, p.1102-1106

    Journal ref: IEEE Signal Processing Letters, 2022

  34. Robust Adaptive Beamforming via Worst-Case SINR Maximization with Nonconvex Uncertainty Sets

    Authors: Yongwei Huang, Hao Fu, Sergiy A. Vorobyov, Zhi-Quan Luo

    Abstract: This paper considers a formulation of the robust adaptive beamforming (RAB) problem based on worst-case signal-to-interference-plus-noise ratio (SINR) maximization with a nonconvex uncertainty set for the steering vectors. The uncertainty set consists of a similarity constraint and a (nonconvex) double-sided ball constraint. The worst-case SINR maximization problem is turned into a quadratic matri…

    Submitted 13 June, 2022; originally announced June 2022.

  35. Rethinking WMMSE: Can Its Complexity Scale Linearly With the Number of BS Antennas?

    Authors: Xiaotong Zhao, Siyuan Lu, Qingjiang Shi, Zhi-Quan Luo

    Abstract: Precoding design for maximizing weighted sum-rate (WSR) is a fundamental problem for downlink of massive multi-user multiple-input multiple-output (MU-MIMO) systems. It is well-known that this problem is generally NP-hard due to the presence of multi-user interference. The weighted minimum mean-square error (WMMSE) algorithm is a popular approach for WSR maximization. However, its computational co…

    Submitted 22 May, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

  36. arXiv:2205.05675  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, et al. (86 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of e…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

  37. arXiv:2205.03947  [pdf, other]

    cs.CV eess.IV

    High-Resolution UAV Image Generation for Sorghum Panicle Detection

    Authors: Enyu Cai, Zhankun Luo, Sriram Baireddy, Jiaqi Guo, Changye Yang, Edward J. Delp

    Abstract: The number of panicles (or heads) of Sorghum plants is an important phenotypic trait for plant development and grain yield estimation. The use of Unmanned Aerial Vehicles (UAVs) enables the capability of collecting and analyzing Sorghum images on a large scale. Deep learning can provide methods for estimating phenotypic traits from UAV images but requires a large amount of labeled data. The lack o…

    Submitted 8 May, 2022; originally announced May 2022.

  38. Downlink Channel Covariance Matrix Reconstruction for FDD Massive MIMO Systems with Limited Feedback

    Authors: Kai Li, Ying Li, Lei Cheng, Qingjiang Shi, Zhi-Quan Luo

    Abstract: The downlink channel covariance matrix (CCM) acquisition is the key step for the practical performance of massive multiple-input and multiple-output (MIMO) systems, including beamforming, channel tracking, and user scheduling. However, this task is challenging in the popular frequency division duplex massive MIMO systems with Type I codebook due to the limited channel information feedback. In this…

    Submitted 12 September, 2023; v1 submitted 2 April, 2022; originally announced April 2022.

  39. arXiv:2203.04962  [pdf, other]

    eess.IV cs.CV

    Learning the Degradation Distribution for Blind Image Super-Resolution

    Authors: Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

    Abstract: Synthetic high-resolution (HR) & low-resolution (LR) pairs are widely used in existing super-resolution (SR) methods. To avoid the domain gap between synthetic and test images, most previous methods try to adaptively learn the synthesizing (degrading) process via a deterministic model. However, some degradations in real scenarios are stochastic and cannot be determined by the content of the image…

    Submitted 14 January, 2024; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  40. arXiv:2202.07508  [pdf, other]

    eess.IV cs.CV

    Deep Constrained Least Squares for Blind Image Super-Resolution

    Authors: Ziwei Luo, Haibin Huang, Lei Yu, Youwei Li, Haoqiang Fan, Shuaicheng Liu

    Abstract: In this paper, we tackle the problem of blind image super-resolution (SR) with a reformulated degradation model and two novel modules. Following the common practices of blind SR, our method proposes to improve both the kernel estimation as well as the kernel-based high-resolution image restoration. To be more specific, we first reformulate the degradation model such that the deblurring kernel estim…

    Submitted 25 March, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: CVPR 2022

  41. arXiv:2112.07415  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration

    Authors: Ziwei Luo, Jing Hu, Xin Wang, Shu Hu, Bin Kong, Youbing Yin, Qi Song, Xi Wu, Siwei Lyu

    Abstract: Large deformations of organs, caused by diverse shapes and nonlinear shape changes, pose a significant challenge for medical image registration. Traditional registration methods need to iteratively optimize an objective function via a specific deformation model along with meticulous parameter tuning, but have limited capabilities in registering images with large deformations. While deep lear…

    Submitted 30 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022

  42. arXiv:2112.05147  [pdf, other]

    eess.IV cs.CV

    Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement

    Authors: Long Ma, Risheng Liu, Jiaao Zhang, Xin Fan, Zhongxuan Luo

    Abstract: Enhancing the quality of low-light images plays a very important role in many image processing and multimedia applications. In recent years, a variety of deep learning techniques have been developed to address this challenging task. A typical framework is to simultaneously estimate the illumination and reflectance, but such methods disregard the scene-level contextual information encapsulated in feature s…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted by IEEE TNNLS. Code is available at https://github.com/KarelZhang/CSDNet-CSDGAN

  43. Configuring Intelligent Reflecting Surface with Performance Guarantees: Optimal Beamforming

    Authors: Yaowen Zhang, Kaiming Shen, Shuyi Ren, Xin Li, Xin Chen, Zhi-Quan Luo

    Abstract: This work proposes linear time strategies to optimally configure the phase shifts for the reflective elements of an intelligent reflecting surface (IRS). Specifically, we show that the binary phase beamforming can be optimally solved in linear time to maximize the received signal-to-noise ratio (SNR). For the general K-ary phase beamforming, we develop a linear time approximation algorithm that gu…

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: 9 pages, 10 figures

  44. arXiv:2111.05133  [pdf, other]

    eess.IV cs.CV

    Approaching the Limit of Image Rescaling via Flow Guidance

    Authors: Shang Li, Guixuan Zhang, Zhengxiong Luo, Jie Liu, Zhi Zeng, Shuwu Zhang

    Abstract: Image downscaling and upscaling are two basic rescaling operations. Once the image is downscaled, it is difficult to reconstruct via upscaling due to the loss of information. To make these two processes more compatible and improve the reconstruction performance, some efforts model them as a joint encoding-decoding task, with the constraint that the downscaled (i.e. encoded) low-resolution (LR…

    Submitted 8 January, 2023; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: BMVC 2021

  45. arXiv:2110.03915  [pdf, ps, other]

    cs.IT eess.SP math.OC

    Optimal QoS-Aware Network Slicing for Service-Oriented Networks with Flexible Routing

    Authors: Wei-Kun Chen, Ya-Feng Liu, Yu-Hong Dai, Zhi-Quan Luo

    Abstract: In this paper, we consider the network slicing problem which attempts to map multiple customized virtual network requests (also called services) to a common shared network infrastructure and allocate network resources to meet diverse quality of service (QoS) requirements. We first propose a mixed integer nonlinear program (MINLP) formulation for this problem that optimizes the network resource con…

    Submitted 27 February, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: 5 pages, 2 figures, accepted for publication in IEEE ICASSP 2022

  46. arXiv:2110.01164  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks

    Authors: Zhaojie Luo, Shoufeng Lin, Rui Liu, Jun Baba, Yuichiro Yoshikawa, Ishiguro Hiroshi

    Abstract: Emotional voice conversion (VC) aims to convert a neutral voice to an emotional (e.g. happy) one while retaining the linguistic information and speaker identity. We note that the decoupling of emotional features from other speech information (such as speaker, content, etc.) is the key to achieving remarkable performance. Some recent attempts at speech representation decoupling on the neutral sp…

    Submitted 3 October, 2021; originally announced October 2021.

  47. arXiv:2109.04284  [pdf, other]

    cs.CV cs.LG eess.IV

    Towards Robust Cross-domain Image Understanding with Unsupervised Noise Removal

    Authors: Lei Zhu, Zhaojing Luo, Wei Wang, Meihui Zhang, Gang Chen, Kaiping Zheng

    Abstract: Deep learning models usually require a large amount of labeled data to achieve satisfactory performance. In multimedia analysis, domain adaptation studies the problem of cross-domain knowledge transfer from a label-rich source domain to a label-scarce target domain, thus potentially alleviating the annotation requirement for deep learning models. However, we find that contemporary domain adaptation…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: 10 pages, 7 figures

  48. Efficient Estimation of Sensor Biases for the 3-Dimensional Asynchronous Multi-Sensor System

    Authors: Wenqiang Pu, Ya-Feng Liu, Zhi-Quan Luo

    Abstract: An important preliminary procedure in multi-sensor data fusion is sensor registration, and the key step in this procedure is to estimate sensor biases from their noisy measurements. There are generally two difficulties in this bias estimation problem: one is the unknown target states which serve as the nuisance variables in the estimation problem, and the other is the highly nonlinear coo…

    Submitted 24 June, 2023; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: Accepted by IEEE Transactions on Signal Processing

  49. arXiv:2108.00190  [pdf, ps, other]

    cs.SD cs.HC eess.AS

    Sequence-to-Sequence Voice Reconstruction for Silent Speech in a Tonal Language

    Authors: Huiyan Li, Haohong Lin, You Wang, Hengyang Wang, Ming Zhang, Han Gao, Qing Ai, Zhiyuan Luo, Guang Li

    Abstract: Silent Speech Decoding (SSD), based on articulatory neuromuscular activities, has become a prevalent task of Brain-Computer Interface (BCI) in recent years. Many works have been devoted to decoding surface electromyography (sEMG) from articulatory neuromuscular activities. However, restoring silent speech in tonal languages such as Mandarin Chinese is still difficult. This paper proposes an optimi…

    Submitted 1 June, 2022; v1 submitted 31 July, 2021; originally announced August 2021.

  50. arXiv:2107.14404  [pdf, ps, other]

    cs.IT eess.SP math.OC

    Towards Efficient Large-Scale Network Slicing: An LP Dynamic Rounding-and-Refinement Approach

    Authors: Wei-Kun Chen, Ya-Feng Liu, Fan Liu, Yu-Hong Dai, Zhi-Quan Luo

    Abstract: In this paper, we propose an efficient algorithm for the network slicing problem which attempts to map multiple customized virtual network requests (also called services) to a common shared network infrastructure and allocate network resources to meet diverse service requirements. The problem has been formulated as a mixed integer linear program (MILP) in the literature. We first p…

    Submitted 11 February, 2023; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: 16 pages, 8 figures, accepted for publication in the IEEE Transactions on Signal Processing. arXiv admin note: text overlap with arXiv:2102.02563