Search | arXiv e-print repository

TVCondNet: A Conditional Denoising Neural Network for NMR Spectroscopy

Authors: Zihao Zou, Shirin Shoushtari, Jiaming Liu, Jialiang Zhang, Patrick Judge, Emilia Santana, Alison Lim, Marcus Foston, Ulugbek S. Kamilov

Abstract: Nuclear Magnetic Resonance (NMR) spectroscopy is a widely-used technique in the fields of bio-medicine, chemistry, and biology for the analysis of chemicals and proteins. The signals from NMR spectroscopy often have low signal-to-noise ratio (SNR) due to acquisition noise, which poses significant challenges for subsequent analysis. Recent work has explored the potential of deep learning (DL) for N… ▽ More Nuclear Magnetic Resonance (NMR) spectroscopy is a widely-used technique in the fields of bio-medicine, chemistry, and biology for the analysis of chemicals and proteins. The signals from NMR spectroscopy often have low signal-to-noise ratio (SNR) due to acquisition noise, which poses significant challenges for subsequent analysis. Recent work has explored the potential of deep learning (DL) for NMR denoising, showing significant performance gains over traditional methods such as total variation (TV) denoising. This paper shows that the performance of DL denoising for NMR can be further improved by combining data-driven training with traditional TV denoising. The proposed TVCondNet method outperforms both traditional TV and DL methods by including the TV solution as a condition during DL training. Our validation on experimentally collected NMR data shows the superior denoising performance and faster inference speed of TVCondNet compared to existing methods. △ Less

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2401.11960 [pdf, other]

Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method

Authors: Zili Liu, Hao Chen, Lei Bai, Wenyuan Li, Keyan Chen, Zhengyi Wang, Wanli Ouyang, Zhengxia Zou, Zhenwei Shi

Abstract: Downscaling (DS) of meteorological variables involves obtaining high-resolution states from low-resolution meteorological fields and is an important task in weather forecasting. Previous methods based on deep learning treat downscaling as a super-resolution task in computer vision and utilize high-resolution gridded meteorological fields as supervision to improve resolution at specific grid scales… ▽ More Downscaling (DS) of meteorological variables involves obtaining high-resolution states from low-resolution meteorological fields and is an important task in weather forecasting. Previous methods based on deep learning treat downscaling as a super-resolution task in computer vision and utilize high-resolution gridded meteorological fields as supervision to improve resolution at specific grid scales. However, this approach has struggled to align with the continuous distribution characteristics of meteorological fields, leading to an inherent systematic bias between the downscaled results and the actual observations at meteorological stations. In this paper, we extend meteorological downscaling to arbitrary scattered station scales, establish a brand new benchmark and dataset, and retrieve meteorological states at any given station location from a coarse-resolution meteorological field. Inspired by data assimilation techniques, we integrate observational data into the downscaling process, providing multi-scale observational priors. Building on this foundation, we propose a new downscaling model based on hypernetwork architecture, namely HyperDS, which efficiently integrates different observational information into the model training, achieving continuous scale modeling of the meteorological field. Through extensive experiments, our proposed method outperforms other specially designed baseline models on multiple surface variables. Notably, the mean squared error (MSE) for wind speed and surface pressure improved by 67% and 19.5% compared to other methods. We will release the dataset and code subsequently. △ Less

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2311.15445 [pdf, other]

FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration

Authors: Zihao Zou, Jiaming Liu, Shirin Shoushtari, Yubo Wang, Weijie Gan, Ulugbek S. Kamilov

Abstract: Face video restoration (FVR) is a challenging but important problem where one seeks to recover a perceptually realistic face videos from a low-quality input. While diffusion probabilistic models (DPMs) have been shown to achieve remarkable performance for face image restoration, they often fail to preserve temporally coherent, high-quality videos, compromising the fidelity of reconstructed faces.… ▽ More Face video restoration (FVR) is a challenging but important problem where one seeks to recover a perceptually realistic face videos from a low-quality input. While diffusion probabilistic models (DPMs) have been shown to achieve remarkable performance for face image restoration, they often fail to preserve temporally coherent, high-quality videos, compromising the fidelity of reconstructed faces. We present a new conditional diffusion framework called FLAIR for FVR. FLAIR ensures temporal consistency across frames in a computationally efficient fashion by converting a traditional image DPM into a video DPM. The proposed conversion uses a recurrent video refinement layer and a temporal self-attention at different scales. FLAIR also uses a conditional iterative refinement process to balance the perceptual and distortion quality during inference. This process consists of two key components: a data-consistency module that analytically ensures that the generated video precisely matches its degraded observation and a coarse-to-fine image enhancement module specifically for facial regions. Our extensive experiments show superiority of FLAIR over the current state-of-the-art (SOTA) for video super-resolution, deblurring, JPEG restoration, and space-time frame interpolation on two high-quality face video datasets. △ Less

Submitted 26 November, 2023; originally announced November 2023.

Comments: 32 pages, 27 figures

arXiv:2311.02003 [pdf, other]

A Structured Pruning Algorithm for Model-based Deep Learning

Authors: Chicago Park, Weijie Gan, Zihao Zou, Yuyang Hu, Zhixin Sun, Ulugbek S. Kamilov

Abstract: There is a growing interest in model-based deep learning (MBDL) for solving imaging inverse problems. MBDL networks can be seen as iterative algorithms that estimate the desired image using a physical measurement model and a learned image prior specified using a convolutional neural net (CNNs). The iterative nature of MBDL networks increases the test-time computational complexity, which limits the… ▽ More There is a growing interest in model-based deep learning (MBDL) for solving imaging inverse problems. MBDL networks can be seen as iterative algorithms that estimate the desired image using a physical measurement model and a learned image prior specified using a convolutional neural net (CNNs). The iterative nature of MBDL networks increases the test-time computational complexity, which limits their applicability in certain large-scale applications. We address this issue by presenting structured pruning algorithm for model-based deep learning (SPADE) as the first structured pruning algorithm for MBDL networks. SPADE reduces the computational complexity of CNNs used within MBDL networks by pruning its non-essential weights. We propose three distinct strategies to fine-tune the pruned MBDL networks to minimize the performance loss. Each fine-tuning strategy has a unique benefit that depends on the presence of a pre-trained model and a high-quality ground truth. We validate SPADE on two distinct inverse problems, namely compressed sensing MRI and image super-resolution. Our results highlight that MBDL models pruned by SPADE can achieve substantial speed up in testing time while maintaining competitive performance. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2310.17363 [pdf, ps, other]

Controllability of networked multiagent systems based on linearized Turing's model

Authors: Tianhao Li, Ruichang Zhang, Zhixin Liu, Zhuo Zou, Xiaoming Hu

Abstract: Turing's model has been widely used to explain how simple, uniform structures can give rise to complex, patterned structures during the development of organisms. However, it is very hard to establish rigorous theoretical results for the dynamic evolution behavior of Turing's model since it is described by nonlinear partial differential equations. We focus on controllability of Turing's model by li… ▽ More Turing's model has been widely used to explain how simple, uniform structures can give rise to complex, patterned structures during the development of organisms. However, it is very hard to establish rigorous theoretical results for the dynamic evolution behavior of Turing's model since it is described by nonlinear partial differential equations. We focus on controllability of Turing's model by linearization and spatial discretization. This linearized model is a networked system whose agents are second order linear systems and these agents interact with each other by Laplacian dynamics on a graph. A control signal can be added to agents of choice. Under mild conditions on the parameters of the linearized Turing's model, we prove the equivalence between controllability of the linearized Turing's model and controllability of a Laplace dynamic system with agents of first order dynamics. When the graph is a grid graph or a cylinder grid graph, we then give precisely the minimal number of control nodes and a corresponding control node set such that the Laplace dynamic systems on these graphs with agents of first order dynamics are controllable. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: 13 pages, 4 figures, submitted to automatica

arXiv:2310.05290 [pdf, other]

MSight: An Edge-Cloud Infrastructure-based Perception System for Connected Automated Vehicles

Authors: Rusheng Zhang, Depu Meng, Shengyin Shen, Zhengxia Zou, Houqiang Li, Henry X. Liu

Abstract: As vehicular communication and networking technologies continue to advance, infrastructure-based roadside perception emerges as a pivotal tool for connected automated vehicle (CAV) applications. Due to their elevated positioning, roadside sensors, including cameras and lidars, often enjoy unobstructed views with diminished object occlusion. This provides them a distinct advantage over onboard perc… ▽ More As vehicular communication and networking technologies continue to advance, infrastructure-based roadside perception emerges as a pivotal tool for connected automated vehicle (CAV) applications. Due to their elevated positioning, roadside sensors, including cameras and lidars, often enjoy unobstructed views with diminished object occlusion. This provides them a distinct advantage over onboard perception, enabling more robust and accurate detection of road objects. This paper presents MSight, a cutting-edge roadside perception system specifically designed for CAVs. MSight offers real-time vehicle detection, localization, tracking, and short-term trajectory prediction. Evaluations underscore the system's capability to uphold lane-level accuracy with minimal latency, revealing a range of potential applications to enhance CAV safety and efficiency. Presently, MSight operates 24/7 at a two-lane roundabout in the City of Ann Arbor, Michigan. △ Less

Submitted 8 October, 2023; originally announced October 2023.

Comments: Submitted to IEEE T-ITS

arXiv:2306.17302 [pdf, other]

Robust Roadside Perception: an Automated Data Synthesis Pipeline Minimizing Human Annotation

Authors: Rusheng Zhang, Depu Meng, Lance Bassett, Shengyin Shen, Zhengxia Zou, Henry X. Liu

Abstract: Recently, advancements in vehicle-to-infrastructure communication technologies have elevated the significance of infrastructure-based roadside perception systems for cooperative driving. This paper delves into one of its most pivotal challenges: data insufficiency. The lacking of high-quality labeled roadside sensor data with high diversity leads to low robustness, and low transfer-ability of curr… ▽ More Recently, advancements in vehicle-to-infrastructure communication technologies have elevated the significance of infrastructure-based roadside perception systems for cooperative driving. This paper delves into one of its most pivotal challenges: data insufficiency. The lacking of high-quality labeled roadside sensor data with high diversity leads to low robustness, and low transfer-ability of current roadside perception systems. In this paper, a novel solution is proposed to address this problem that creates synthesized training data using Augmented Reality. A Generative Adversarial Network is then applied to enhance the reality further, that produces a photo-realistic synthesized dataset that is capable of training or fine-tuning a roadside perception detector which is robust to different weather and lighting conditions. Our approach was rigorously tested at two key intersections in Michigan, USA: the Mcity intersection and the State St./Ellsworth Rd roundabout. The Mcity intersection is located within the Mcity test field, a controlled testing environment. In contrast, the State St./Ellsworth Rd intersection is a bustling roundabout notorious for its high traffic flow and a significant number of accidents annually. Experimental results demonstrate that detectors trained solely on synthesized data exhibit commendable performance across all conditions. Furthermore, when integrated with labeled data, the synthesized data can notably bolster the performance of pre-existing detectors, especially in adverse conditions. △ Less

Submitted 8 February, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

Comments: Accepted by IEEE Transactions on Intelligent Vehicles

arXiv:2306.02245 [pdf, other]

doi 10.1007/s11432-023-3943-6

SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model

Authors: Dingyuan Zhang, Dingkang Liang, Hongcheng Yang, Zhikang Zou, Xiaoqing Ye, Zhe Liu, Xiang Bai

Abstract: With the development of large language models, many remarkable linguistic systems like ChatGPT have thrived and achieved astonishing success on many tasks, showing the incredible power of foundation models. In the spirit of unleashing the capability of foundation models on vision tasks, the Segment Anything Model (SAM), a vision foundation model for image segmentation, has been proposed recently a… ▽ More With the development of large language models, many remarkable linguistic systems like ChatGPT have thrived and achieved astonishing success on many tasks, showing the incredible power of foundation models. In the spirit of unleashing the capability of foundation models on vision tasks, the Segment Anything Model (SAM), a vision foundation model for image segmentation, has been proposed recently and presents strong zero-shot ability on many downstream 2D tasks. However, whether SAM can be adapted to 3D vision tasks has yet to be explored, especially 3D object detection. With this inspiration, we explore adapting the zero-shot ability of SAM to 3D object detection in this paper. We propose a SAM-powered BEV processing pipeline to detect objects and get promising results on the large-scale Waymo open dataset. As an early attempt, our method takes a step toward 3D object detection with vision foundation models and presents the opportunity to unleash their power on 3D vision tasks. The code is released at https://github.com/DYZhang09/SAM3D. △ Less

Submitted 29 January, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

Comments: Accepted by Science China Information Sciences (SCIS)

arXiv:2304.11697 [pdf, other]

Informative Data Selection with Uncertainty for Multi-modal Object Detection

Authors: Xinyu Zhang, Zhiwei Li, Zhenhong Zou, Xin Gao, Yi** Xiong, Dafeng **, Jun Li, Hua** Liu

Abstract: Noise has always been nonnegligible trouble in object detection by creating confusion in model reasoning, thereby reducing the informativeness of the data. It can lead to inaccurate recognition due to the shift in the observed pattern, that requires a robust generalization of the models. To implement a general vision model, we need to develop deep learning models that can adaptively select valid i… ▽ More Noise has always been nonnegligible trouble in object detection by creating confusion in model reasoning, thereby reducing the informativeness of the data. It can lead to inaccurate recognition due to the shift in the observed pattern, that requires a robust generalization of the models. To implement a general vision model, we need to develop deep learning models that can adaptively select valid information from multi-modal data. This is mainly based on two reasons. Multi-modal learning can break through the inherent defects of single-modal data, and adaptive information selection can reduce chaos in multi-modal data. To tackle this problem, we propose a universal uncertainty-aware multi-modal fusion model. It adopts a multi-pipeline loosely coupled architecture to combine the features and results from point clouds and images. To quantify the correlation in multi-modal information, we model the uncertainty, as the inverse of data information, in different modalities and embed it in the bounding box generation. In this way, our model reduces the randomness in fusion and generates reliable output. Moreover, we conducted a completed investigation on the KITTI 2D object detection dataset and its derived dirty data. Our fusion model is proven to resist severe noise interference like Gaussian, motion blur, and frost, with only slight degradation. The experiment results demonstrate the benefits of our adaptive fusion. Our analysis on the robustness of multi-modal fusion will provide further insights for future research. △ Less

Submitted 23 April, 2023; originally announced April 2023.

arXiv:2303.05386 [pdf, other]

Deep Equilibrium Learning of Explicit Regularizers for Imaging Inverse Problems

Authors: Zihao Zou, Jiaming Liu, Brendt Wohlberg, Ulugbek S. Kamilov

Abstract: There has been significant recent interest in the use of deep learning for regularizing imaging inverse problems. Most work in the area has focused on regularization imposed implicitly by convolutional neural networks (CNNs) pre-trained for image reconstruction. In this work, we follow an alternative line of work based on learning explicit regularization functionals that promote preferred solution… ▽ More There has been significant recent interest in the use of deep learning for regularizing imaging inverse problems. Most work in the area has focused on regularization imposed implicitly by convolutional neural networks (CNNs) pre-trained for image reconstruction. In this work, we follow an alternative line of work based on learning explicit regularization functionals that promote preferred solutions. We develop the Explicit Learned Deep Equilibrium Regularizer (ELDER) method for learning explicit regularizers that minimize a mean-squared error (MSE) metric. ELDER is based on a regularization functional parameterized by a CNN and a deep equilibrium learning (DEQ) method for training the functional to be MSE-optimal at the fixed points of the reconstruction algorithm. The explicit regularizer enables ELDER to directly inherit fundamental convergence results from optimization theory. On the other hand, DEQ training enables ELDER to improve over existing explicit regularizers without prohibitive memory complexity during training. We use ELDER to train several approaches to parameterizing explicit regularizers and test their performance on three distinct imaging inverse problems. Our results show that ELDER can greatly improve the quality of explicit regularizers compared to existing methods, and show that learning explicit regularizers does not compromise performance relative to methods based on implicit regularization. △ Less

Submitted 9 March, 2023; originally announced March 2023.

arXiv:2301.00454 [pdf, other]

Waveforms for xG Non-stationary Channels

Authors: Zhibin Zou, Aveek Dutta

Abstract: Waveform design for interference cancellation in next-generation wireless systems, which includes precoding and modulation, aims to achieve orthogonality among data signals/symbols across all Degrees of Freedom (DoF). Conventional methods struggle with non-stationary channel states due to high mobility, density, and time-varying multipath propagation. In this article, we review the HOGMT-Precoding… ▽ More Waveform design for interference cancellation in next-generation wireless systems, which includes precoding and modulation, aims to achieve orthogonality among data signals/symbols across all Degrees of Freedom (DoF). Conventional methods struggle with non-stationary channel states due to high mobility, density, and time-varying multipath propagation. In this article, we review the HOGMT-Precoding and MEM modulations for non-stationary channels. We also discuss practical challenges and future directions. △ Less

Submitted 30 August, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

arXiv:2211.09208 [pdf, other]

Capacity Achieving by Diagonal Permutation for MU-MIMO channels

Authors: Zhibin Zou, Aveek Dutta

Abstract: Dirty Paper Coding (DPC) is considered as the optimal precoding which achieves capacity for the Gaussian Multiple-Input Multiple-Output (MIMO) broadcast channel (BC). However, to find the optimal precoding order, it needs to repeat N! times for N users as there are N! possible precoding orders. This extremely high complexity limits its practical use in modern wireless networks. In this paper, we s… ▽ More Dirty Paper Coding (DPC) is considered as the optimal precoding which achieves capacity for the Gaussian Multiple-Input Multiple-Output (MIMO) broadcast channel (BC). However, to find the optimal precoding order, it needs to repeat N! times for N users as there are N! possible precoding orders. This extremely high complexity limits its practical use in modern wireless networks. In this paper, we show the equivalence of DPC and the recently proposed Higher Order Mercer's Theorem (HOGMT) precoding[1][2] in 2-D (spatial) case, which provides an alternate implementation for DPC. Furthermore, we show that the proposed implementation method is linear over the permutation operator when permuting over multi-user channels. Therefore, we present a low complexity algorithm that optimizes the precoding order for DPC with beamforming, eliminating repeated computation of DPC for each precoding order. Simulations show that our method can achieve the same result as conventional DPC with about 20 dB lower complexity for N = 5 users. △ Less

Submitted 10 April, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

arXiv:2211.09203 [pdf, other]

Multidimensional Eigenwave Multiplexing Modulation for Non-Stationary Channels

Authors: Zhibin Zou, Aveek Dutta

Abstract: While interference in time domain (caused by path difference) is mitigated by OFDM modulation, interference in frequency domain (due to velocity difference), can be mitigated by OTFS modulation. However, in non-stationary channels, the relative difference in acceleration will cause Inter-Doppler Interference (IDI) and a modulation method for mitigating IDI does not exist in the literature. Both me… ▽ More While interference in time domain (caused by path difference) is mitigated by OFDM modulation, interference in frequency domain (due to velocity difference), can be mitigated by OTFS modulation. However, in non-stationary channels, the relative difference in acceleration will cause Inter-Doppler Interference (IDI) and a modulation method for mitigating IDI does not exist in the literature. Both methods in the literature use carriers in a specific domain which achieve orthogonality in the target domain to mitigate interference. Moreover, those modulation cannot directly incorporate space domain, which requires additional precoding technique to mitigate inter-user interference (IUI) for MU-MIMO channels. This work presents a generalized modulation for any multidimensional channel. Recently, Higher Order Mercer's Theorem (HOGMT) [1] has been proposed to decompose multi-user non-stationary channels into independent fading subchannels (Eigenwaves). Based on HOGMT decomposition, we develop Multidimensional Eigenwaves Multiplexing (MEM) modulation which uses jointly orthogonal eigenwaves, decomposed from the multidimensional channel as subcarriers. Data symbols modulated by these eigenwaves can achieve orthogonality across each degree of freedom(\eg space (users/antennas), time-frequency and delay-Doppler). Consequently, the transmitted remain independent over the high dimensional channel, thereby avoiding interference from other symbols. △ Less

Submitted 30 August, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

Comments: This paper is accepted by IEEE Globecom 2023

arXiv:2211.06017 [pdf, other]

Joint Spatio-Temporal Precoding for Practical Non-Stationary Wireless Channels

Authors: Zhibin Zou, Maqsood Careem, Aveek Dutta, Ngwe Thawdar

Abstract: The high mobility, density and multi-path evident in modern wireless systems makes the channel highly non-stationary. This causes temporal variation in the channel distribution that leads to the existence of time-varying joint interference across multiple degrees of freedom (DoF, e.g., users, antennas, frequency and symbols), which renders conventional precoding sub-optimal in practice. In this wo… ▽ More The high mobility, density and multi-path evident in modern wireless systems makes the channel highly non-stationary. This causes temporal variation in the channel distribution that leads to the existence of time-varying joint interference across multiple degrees of freedom (DoF, e.g., users, antennas, frequency and symbols), which renders conventional precoding sub-optimal in practice. In this work, we derive a High-Order Generalization of Mercer's Theorem (HOGMT), which decomposes the multi-user non-stationary channel into two (dual) sets of jointly orthogonal subchannels (eigenfunctions), that result in the other set when one set is transmitted through the channel. This duality and joint orthogonality of eigenfuntions ensure transmission over independently flat-fading subchannels. Consequently, transmitting these eigenfunctions with optimally derived coefficients eventually mitigates any interference across its degrees of freedoms and forms the foundation of the proposed joint spatio-temporal precoding. The transferred dual eigenfuntions and coefficients directly reconstruct the data symbols at the receiver upon demodulation, thereby significantly reducing its computational burden, by alleviating the need for any complementary post-coding. Additionally, the eigenfunctions decomposed from the time-frequency delay-Doppler channel kernel are paramount to extracting the second-order channel statistics, and therefore completely characterize the underlying channel. We evaluate this using a realistic non-stationary channel framework built in Matlab and show that our precoding achieves ${\geqslant}$4 orders of reduction in BER at SNR${\geqslant}15$dB in OFDM systems for higher-order modulations and less complexity compared to the state-of-the-art precoding. △ Less

Submitted 23 January, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

Comments: This paper is accepted by IEEE Transactions on Communications. arXiv admin note: substantial text overlap with arXiv:2202.04148

arXiv:2211.00531 [pdf, other]

Robustness of Deep Equilibrium Architectures to Changes in the Measurement Model

Authors: Junhao Hu, Shirin Shoushtari, Zihao Zou, Jiaming Liu, Zhixin Sun, Ulugbek S. Kamilov

Abstract: Deep model-based architectures (DMBAs) are widely used in imaging inverse problems to integrate physical measurement models and learned image priors. Plug-and-play priors (PnP) and deep equilibrium models (DEQ) are two DMBA frameworks that have received significant attention. The key difference between the two is that the image prior in DEQ is trained by using a specific measurement model, while t… ▽ More Deep model-based architectures (DMBAs) are widely used in imaging inverse problems to integrate physical measurement models and learned image priors. Plug-and-play priors (PnP) and deep equilibrium models (DEQ) are two DMBA frameworks that have received significant attention. The key difference between the two is that the image prior in DEQ is trained by using a specific measurement model, while that in PnP is trained as a general image denoiser. This difference is behind a common assumption that PnP is more robust to changes in the measurement models compared to DEQ. This paper investigates the robustness of DEQ priors to changes in the measurement models. Our results on two imaging inverse problems suggest that DEQ priors trained under mismatched measurement models outperform image denoisers. △ Less

Submitted 1 November, 2022; originally announced November 2022.

arXiv:2209.15147 [pdf, other]

Optimizing towards the best insertion-based error-tolerating joints

Authors: Zhibin Zou

Abstract: We present an optimization-based design process that can generate the best insertion-based joints with respect to different errors, including manipulation error, manufacturing error, and sensing error. We separate the analysis into two stages, the insertion and the after-insertion stability. Each sub-process is discretized into different modes of contacts. The transitions among the contact modes f… ▽ More We present an optimization-based design process that can generate the best insertion-based joints with respect to different errors, including manipulation error, manufacturing error, and sensing error. We separate the analysis into two stages, the insertion and the after-insertion stability. Each sub-process is discretized into different modes of contacts. The transitions among the contact modes form a directed graph and the connectivity of the graph is achieved and maintained through the manipulation of the socket edge-angle and peg contact-point locations. The analysis starts in 2D with the assumption of point-edge contacts. During the optimization, the edges of the socket are rotated and the points on the peg are moved along the edges to ensure the successful insertion and the stability after insertion. We show in simulation that our proposed method can generate insertion-based joints that are tolerant to the given errors. and we present a few simple 3D projections to show that the analysis is still effective beyond 2D cases. △ Less

Submitted 11 November, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

arXiv:2202.04148 [pdf, other]

doi 10.1109/ICC45855.2022.9839118

Unified Characterization and Precoding for Non-Stationary Channels

Authors: Zhibin Zou, Maqsood Careem, Aveek Dutta, Ngwe Thawdar

Abstract: Modern wireless channels are increasingly dense and mobile making the channel highly non-stationary. The time-varying distribution and the existence of joint interference across multiple degrees of freedom (e.g., users, antennas, frequency and symbols) in such channels render conventional precoding sub-optimal in practice, and have led to historically poor characterization of their statistics. The… ▽ More Modern wireless channels are increasingly dense and mobile making the channel highly non-stationary. The time-varying distribution and the existence of joint interference across multiple degrees of freedom (e.g., users, antennas, frequency and symbols) in such channels render conventional precoding sub-optimal in practice, and have led to historically poor characterization of their statistics. The core of our work is the derivation of a high-order generalization of Mercer's Theorem to decompose the non-stationary channel into constituent fading sub-channels (2-D eigenfunctions) that are jointly orthogonal across its degrees of freedom. Consequently, transmitting these eigenfunctions with optimally derived coefficients eventually mitigates any interference across these dimensions and forms the foundation of the proposed joint spatio-temporal precoding. The precoded symbols directly reconstruct the data symbols at the receiver upon demodulation, thereby significantly reducing its computational burden, by alleviating the need for any complementary decoding. These eigenfunctions are paramount to extracting the second-order channel statistics, and therefore completely characterize the underlying channel. Theory and simulations show that such precoding leads to ${>}10^4{\times}$ BER improvement (at 20dB) over existing methods for non-stationary channels. △ Less

Submitted 11 November, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: This paper is accepted by IEEE ICC 2022 and won Best Paper Award. arXiv admin note: text overlap with arXiv:2202.01827

arXiv:2202.01827 [pdf, other]

Proofs and Supplementary Material: Unified Characterization and Precoding for Non-Stationary Channels

Authors: Zhibin Zou, Maqsood Careem, Aveek Dutta, Ngwe Thawdar

Abstract: This document provides the supplementary material including a comprehensive related work, the complete proofs and extended evaluation results that support the manuscript, "Unified Characterization and Precoding for Non-Stationary Channels", that was accepted for publication at IEEE International Conference on Communications (ICC) 2022. Equations (1)--(34) refer to the equations from the main manus… ▽ More This document provides the supplementary material including a comprehensive related work, the complete proofs and extended evaluation results that support the manuscript, "Unified Characterization and Precoding for Non-Stationary Channels", that was accepted for publication at IEEE International Conference on Communications (ICC) 2022. Equations (1)--(34) refer to the equations from the main manuscript, and the Theorem, Lemma and Corollaries correspond to those from the manuscript. △ Less

Submitted 3 February, 2022; originally announced February 2022.

arXiv:2106.12255 [pdf, other]

Harmonic Power-Flow Study of Polyphase Grids with Converter-Interfaced Distributed Energy Resources, Part II: Model Library and Validation

Authors: Johanna Kristin Maria Becker, Andreas Martin Kettner, Lorenzo Reyes-Chamorro, Zhixiang Zou, Marco Liserre, Mario Paolone

Abstract: In Part I, a method for the Harmonic Power-Flow (HPF) study of three-phase power grids with Converter-Interfaced Distributed Energy Resources (CIDERs) is proposed. The method is based on generic and modular representations of the grid and the CIDERs, and explicitly accounts for coupling between harmonics. In Part II, the HPF method is validated. First, the applicability of the modeling framework i… ▽ More In Part I, a method for the Harmonic Power-Flow (HPF) study of three-phase power grids with Converter-Interfaced Distributed Energy Resources (CIDERs) is proposed. The method is based on generic and modular representations of the grid and the CIDERs, and explicitly accounts for coupling between harmonics. In Part II, the HPF method is validated. First, the applicability of the modeling framework is demonstrated on typical grid-forming and grid-following CIDERs. Then, the HPF method is implemented in Matlab and compared against time-domain simulations with Simulink. The accuracy of the models and the performance of the solution algorithm are assessed for individual resources and a modified version of the CIGRÉ low-voltage benchmark microgrid (i.e., with additional unbalanced components). The observed maximum errors are 6.3E-5 p.u. w.r.t. voltage magnitude, 1.3E-3 p.u. w.r.t. current magnitude, and 0.9 deg w.r.t. phase. Moreover, the scalability of the method is assessed w.r.t. the number of CIDERs and the maximum harmonic order ($\leqslant$25). For the maximum problem size, the execution time of the HPF method is 6.52 sec, which is 5 times faster than the time-domain simulation. The convergence of the method is robust w.r.t. the choice of the initial point, and multiplicity of solutions has not been observed. △ Less

Submitted 1 November, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

arXiv:2106.12253 [pdf, other]

Harmonic Power-Flow Study of Polyphase Grids with Converter-Interfaced Distributed Energy Resources, Part I: Modelling Framework and Algorithm

Authors: Andreas Martin Kettner, Lorenzo Reyes-Chamorro, Johanna Kristin Maria Becker, Zhixiang Zou, Marco Liserre, Mario Paolone

Abstract: Power distribution systems are experiencing a large-scale integration of Converter-Interfaced Distributed Energy Resources (CIDERs). This complicates the analysis and mitigation of harmonics, whose creation and propagation are facilitated by the interactions of converters and their controllers through the grid. In this paper, a method for the calculation of the so-called Harmonic Power-Flow (HPF)… ▽ More Power distribution systems are experiencing a large-scale integration of Converter-Interfaced Distributed Energy Resources (CIDERs). This complicates the analysis and mitigation of harmonics, whose creation and propagation are facilitated by the interactions of converters and their controllers through the grid. In this paper, a method for the calculation of the so-called Harmonic Power-Flow (HPF) in three-phase grids with CIDERs is proposed. The distinguishing feature of this HPF method is the generic and modular representation of the system components. Notably, as opposed to most of the existing approaches, the coupling between harmonics is explicitly considered. The HPF problem is formulated by combining the hybrid nodal equations of the grid with the closed-loop transfer functions of the CIDERs, and solved using the Newton-Raphson method. The grid components are characterized by compound electrical parameters, which allow to represent both transposed or non-transposed lines. The CIDERs are represented by modular linear time-periodic systems, which allows to treat both grid-forming and grid-following control laws. The method's accuracy and computational efficiency are confirmed via time-domain simulations of the CIGRÉ low-voltage benchmark microgrid. This paper is divided in two parts, which focus on the development (Part I) and the validation (Part II) of the proposed method. △ Less

Submitted 1 November, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

arXiv:2008.12052 [pdf, other]

Compensation Tracker: Reprocessing Lost Object for Multi-Object Tracking

Authors: Zhibo Zou, Junjie Huang, ** Luo

Abstract: Tracking by detection paradigm is one of the most popular object tracking methods. However, it is very dependent on the performance of the detector. When the detector has a behavior of missing detection, the tracking result will be directly affected. In this paper, we analyze the phenomenon of the lost tracking object in real-time tracking model on MOT2020 dataset. Based on simple and traditional… ▽ More Tracking by detection paradigm is one of the most popular object tracking methods. However, it is very dependent on the performance of the detector. When the detector has a behavior of missing detection, the tracking result will be directly affected. In this paper, we analyze the phenomenon of the lost tracking object in real-time tracking model on MOT2020 dataset. Based on simple and traditional methods, we propose a compensation tracker to further alleviate the lost tracking problem caused by missing detection. It consists of a motion compensation module and an object selection module. The proposed method not only can re-track missing tracking objects from lost objects, but also does not require additional networks so as to maintain speed-accuracy trade-off of the real-time model. Our method only needs to be embedded into the tracker to work without re-training the network. Experiments show that the compensation tracker can efficaciously improve the performance of the model and reduce identity switches. With limited costs, the compensation tracker successfully enhances the baseline tracking performance by a large margin and reaches 66% of MOTA and 67% of IDF1 on MOT2020 dataset. △ Less

Submitted 5 February, 2022; v1 submitted 27 August, 2020; originally announced August 2020.

arXiv:2007.01156 [pdf, other]

Enhancing Autonomy with Blockchain and Multi-Access Edge Computing in Distributed Robotic Systems

Authors: Jorge Peña Queralta, Li Qingqing, Zhuo Zou, Tomi Westerlund

Abstract: This conceptual paper discusses how different aspects involving the autonomous operation of robots and vehicles will change when they have access to next-generation mobile networks. 5G and beyond connectivity is bringing together a myriad of technologies and industries under its umbrella. High-bandwidth, low-latency edge computing services through network slicing have the potential to support nove… ▽ More This conceptual paper discusses how different aspects involving the autonomous operation of robots and vehicles will change when they have access to next-generation mobile networks. 5G and beyond connectivity is bringing together a myriad of technologies and industries under its umbrella. High-bandwidth, low-latency edge computing services through network slicing have the potential to support novel application scenarios in different domains including robotics, autonomous vehicles, and the Internet of Things. In particular, multi-tenant applications at the edge of the network will boost the development of autonomous robots and vehicles offering computational resources and intelligence through reliable offloading services. The integration of more distributed network architectures with distributed robotic systems can increase the degree of intelligence and level of autonomy of connected units. We argue that the last piece to put together a services framework with third-party integration will be next-generation low-latency blockchain networks. Blockchains will enable a transparent and secure way of providing services and managing resources at the Multi-Access Edge Computing (MEC) layer. We overview the state-of-the-art in MEC slicing, distributed robotic systems and blockchain technology to define a framework for services the MEC layer that will enhance the autonomous operations of connected robots and vehicles. △ Less

Submitted 1 July, 2020; originally announced July 2020.

Comments: Accepted to the Fifth International Conference on Fog and Mobile Edge Computing (FMEC 2020)

arXiv:2004.08174 [pdf, other]

doi 10.1016/j.procs.2020.07.051

UWB-Based Localization for Multi-UAV Systems and Collaborative Heterogeneous Multi-Robot Systems: a Survey

Authors: Wang Shule, Carmen Martínez Almansa, Jorge Peña Queralta, Zhuo Zou, Tomi Westerlund

Abstract: Ultra-wideband technology has emerged in recent years as a robust solution for localization in GNSS denied environments. In particular, its high accuracy when compared to other wireless localization solutions is enabling a wider range of collaborative and multi-robot application scenarios, being able to replace more complex and expensive motion-capture areas for use cases where accuracy in the ord… ▽ More Ultra-wideband technology has emerged in recent years as a robust solution for localization in GNSS denied environments. In particular, its high accuracy when compared to other wireless localization solutions is enabling a wider range of collaborative and multi-robot application scenarios, being able to replace more complex and expensive motion-capture areas for use cases where accuracy in the order of tens of centimeters is sufficient. We present the first survey of UWB-based localization focused on multi-UAV systems and heterogeneous multi-robot systems. We have found that previous literature reviews do not consider in-depth the challenges in both aerial navigation and navigation with multiple robots, but also in terms of heterogeneous multi-robot systems. In particular, this is, to the best of our knowledge, the first survey to review recent advances in UWB-based (i) methods that enable ad-hoc and dynamic deployments; (ii) collaborative localization techniques; and (iii) cooperative sensing and cooperative maneuvers such as UAV docking on mobile platforms. Finally, we also review existing datasets and discuss the potential of this technology for both localization in GNSS-denied environments and collaboration in multi-robot systems. △ Less

Submitted 28 April, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

arXiv:1909.11937 [pdf, other]

Multi-grained Attention Networks for Single Image Super-Resolution

Authors: Huapeng Wu, Zhengxia Zou, Jie Gui, Wen-Jun Zeng, Jie** Ye, Jun Zhang, Hongyi Liu, Zhihui Wei

Abstract: Deep Convolutional Neural Networks (CNN) have drawn great attention in image super-resolution (SR). Recently, visual attention mechanism, which exploits both of the feature importance and contextual cues, has been introduced to image SR and proves to be effective to improve CNN-based SR performance. In this paper, we make a thorough investigation on the attention mechanisms in a SR model and shed… ▽ More Deep Convolutional Neural Networks (CNN) have drawn great attention in image super-resolution (SR). Recently, visual attention mechanism, which exploits both of the feature importance and contextual cues, has been introduced to image SR and proves to be effective to improve CNN-based SR performance. In this paper, we make a thorough investigation on the attention mechanisms in a SR model and shed light on how simple and effective improvements on these ideas improve the state-of-the-arts. We further propose a unified approach called "multi-grained attention networks (MGAN)" which fully exploits the advantages of multi-scale and attention mechanisms in SR tasks. In our method, the importance of each neuron is computed according to its surrounding regions in a multi-grained fashion and then is used to adaptively re-scale the feature responses. More importantly, the "channel attention" and "spatial attention" strategies in previous methods can be essentially considered as two special cases of our method. We also introduce multi-scale dense connections to extract the image features at multiple scales and capture the features of different layers through dense skip connections. Ablation studies on benchmark datasets demonstrate the effectiveness of our method. In comparison with other state-of-the-art SR methods, our method shows the superiority in terms of both accuracy and model size. △ Less

Submitted 29 September, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

arXiv:1204.3100 [pdf, other]

Modular design of jointly optimal controllers and forwarding policies for wireless control

Authors: Burak Demirel, Zhenhua Zou, Pablo Soldati, Mikael Johansson

Abstract: We consider the joint design of packet forwarding policies and controllers for wireless control loops where sensor measurements are sent to the controller over an unreliable and energy-constrained multi-hop wireless network. For fixed sampling rate of the sensor, the co-design problem separates into two well-defined and independent subproblems: transmission scheduling for maximizing the deadline-c… ▽ More We consider the joint design of packet forwarding policies and controllers for wireless control loops where sensor measurements are sent to the controller over an unreliable and energy-constrained multi-hop wireless network. For fixed sampling rate of the sensor, the co-design problem separates into two well-defined and independent subproblems: transmission scheduling for maximizing the deadline-constrained reliability and optimal control under packet loss. We develop optimal and implementable solutions for these subproblems and show that the optimally co-designed system can be efficiently found. Numerical examples highlight the many trade-offs involved and demonstrate the power of our approach. △ Less

Submitted 1 July, 2014; v1 submitted 13 April, 2012; originally announced April 2012.

Showing 1–25 of 25 results for author: Zou, Z