-
Active Cell Balancing for Extended Operational Time of Lithium-Ion Battery Systems in Energy Storage Applications
Authors:
Yiming Xu,
Xiaohua Ge,
Ruohan Guo,
Weixiang Shen
Abstract:
Cell inconsistency within a lithium-ion battery system poses a significant challenge in maximizing the system operational time. This study presents an optimization-driven active balancing method to minimize the effects of cell inconsistency on the system operational time while simultaneously satisfying the system output power demand and prolonging the system operational time in energy storage appl…
▽ More
Cell inconsistency within a lithium-ion battery system poses a significant challenge in maximizing the system operational time. This study presents an optimization-driven active balancing method to minimize the effects of cell inconsistency on the system operational time while simultaneously satisfying the system output power demand and prolonging the system operational time in energy storage applications. The proposed method utilizes a fractional order model to forecast the terminal voltage dynamics of each cell within a battery system, enhanced with a particle-swarm-optimisation-genetic algorithm for precise parameter identification. It is implemented under two distinct cell-level balancing topologies: independent cell balancing and differential cell balancing. Subsequently, the current distribution for each topology is determined by resolving two optimization control problems constrained by the battery's operational specifications and power demands. The effectiveness of the proposed method is validated by extensive experiments based on the two balancing topologies. The results demonstrate that the proposed method increases the operational time by 3.2%.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Physics-informed Convolutional Neural Network for Microgrid Economic Dispatch
Authors:
Xiaoyu Ge,
Javad Khazaei
Abstract:
The variability of renewable energy generation and the unpredictability of electricity demand create a need for real-time economic dispatch (ED) of assets in microgrids. However, solving numerical optimization problems in real-time can be incredibly challenging. This study proposes using a convolutional neural network (CNN) based on deep learning to address these challenges. Compared to traditiona…
▽ More
The variability of renewable energy generation and the unpredictability of electricity demand create a need for real-time economic dispatch (ED) of assets in microgrids. However, solving numerical optimization problems in real-time can be incredibly challenging. This study proposes using a convolutional neural network (CNN) based on deep learning to address these challenges. Compared to traditional methods, CNN is more efficient, delivers more dependable results, and has a shorter response time when dealing with uncertainties. While CNN has shown promising results, it does not extract explainable knowledge from the data. To address this limitation, a physics-inspired CNN model is developed by incorporating constraints of the ED problem into the CNN training to ensure that the model follows physical laws while fitting the data. The proposed method can significantly accelerate real-time economic dispatch of microgrids without compromising the accuracy of numerical optimization techniques. The effectiveness of the proposed data-driven approach for optimal allocation of microgrid resources in real-time is verified through a comprehensive comparison with conventional numerical optimization approaches.
△ Less
Submitted 1 May, 2024; v1 submitted 28 April, 2024;
originally announced April 2024.
-
Task-Aware Encoder Control for Deep Video Compression
Authors:
Xingtong Ge,
Jixiang Luo,
Xinjie Zhang,
Tongda Xu,
Guo Lu,
Dailan He,
**g Geng,
Yan Wang,
Jun Zhang,
Hongwei Qin
Abstract:
Prior research on deep video compression (DVC) for machine tasks typically necessitates training a unique codec for each specific task, mandating a dedicated decoder per task. In contrast, traditional video codecs employ a flexible encoder controller, enabling the adaptation of a single codec to different tasks through mechanisms like mode prediction. Drawing inspiration from this, we introduce an…
▽ More
Prior research on deep video compression (DVC) for machine tasks typically necessitates training a unique codec for each specific task, mandating a dedicated decoder per task. In contrast, traditional video codecs employ a flexible encoder controller, enabling the adaptation of a single codec to different tasks through mechanisms like mode prediction. Drawing inspiration from this, we introduce an innovative encoder controller for deep video compression for machines. This controller features a mode prediction and a Group of Pictures (GoP) selection module. Our approach centralizes control at the encoding stage, allowing for adaptable encoder adjustments across different tasks, such as detection and tracking, while maintaining compatibility with a standard pre-trained DVC decoder. Empirical evidence demonstrates that our method is applicable across multiple tasks with various existing pre-trained DVCs. Moreover, extensive experiments demonstrate that our method outperforms previous DVC by about 25% bitrate for different tasks, with only one pre-trained decoder.
△ Less
Submitted 20 April, 2024; v1 submitted 7 April, 2024;
originally announced April 2024.
-
GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting
Authors:
Xinjie Zhang,
Xingtong Ge,
Tongda Xu,
Dailan He,
Yan Wang,
Hongwei Qin,
Guo Lu,
**g Geng,
Jun Zhang
Abstract:
Implicit neural representations (INRs) recently achieved great success in image representation and compression, offering high visual quality and fast rendering speeds with 10-1000 FPS, assuming sufficient GPU resources are available. However, this requirement often hinders their use on low-end devices with limited memory. In response, we propose a groundbreaking paradigm of image representation an…
▽ More
Implicit neural representations (INRs) recently achieved great success in image representation and compression, offering high visual quality and fast rendering speeds with 10-1000 FPS, assuming sufficient GPU resources are available. However, this requirement often hinders their use on low-end devices with limited memory. In response, we propose a groundbreaking paradigm of image representation and compression by 2D Gaussian Splatting, named GaussianImage. We first introduce 2D Gaussian to represent the image, where each Gaussian has 8 parameters including position, covariance and color. Subsequently, we unveil a novel rendering algorithm based on accumulated summation. Remarkably, our method with a minimum of 3$\times$ lower GPU memory usage and 5$\times$ faster fitting time not only rivals INRs (e.g., WIRE, I-NGP) in representation performance, but also delivers a faster rendering speed of 1500-2000 FPS regardless of parameter size. Furthermore, we integrate existing vector quantization technique to build an image codec. Experimental results demonstrate that our codec attains rate-distortion performance comparable to compression-based INRs such as COIN and COIN++, while facilitating decoding speeds of approximately 1000 FPS. Additionally, preliminary proof of concept shows that our codec surpasses COIN and COIN++ in performance when using partial bits-back coding. Code will be available at https://github.com/Xinjie-Q/GaussianImage.
△ Less
Submitted 10 April, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Content-aware Masked Image Modeling Transformer for Stereo Image Compression
Authors:
Xinjie Zhang,
Shenyuan Gao,
Zhening Liu,
Jiawei Shao,
Xingtong Ge,
Dailan He,
Tongda Xu,
Yan Wang,
Jun Zhang
Abstract:
Existing learning-based stereo image codec adopt sophisticated transformation with simple entropy models derived from single image codecs to encode latent representations. However, those entropy models struggle to effectively capture the spatial-disparity characteristics inherent in stereo images, which leads to suboptimal rate-distortion results. In this paper, we propose a stereo image compressi…
▽ More
Existing learning-based stereo image codec adopt sophisticated transformation with simple entropy models derived from single image codecs to encode latent representations. However, those entropy models struggle to effectively capture the spatial-disparity characteristics inherent in stereo images, which leads to suboptimal rate-distortion results. In this paper, we propose a stereo image compression framework, named CAMSIC. CAMSIC independently transforms each image to latent representation and employs a powerful decoder-free Transformer entropy model to capture both spatial and disparity dependencies, by introducing a novel content-aware masked image modeling (MIM) technique. Our content-aware MIM facilitates efficient bidirectional interaction between prior information and estimated tokens, which naturally obviates the need for an extra Transformer decoder. Experiments show that our stereo image codec achieves state-of-the-art rate-distortion performance on two stereo image datasets Cityscapes and InStereo2K with fast encoding and decoding speed.
△ Less
Submitted 19 March, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Boosting Neural Representations for Videos with a Conditional Decoder
Authors:
Xinjie Zhang,
Ren Yang,
Dailan He,
Xingtong Ge,
Tongda Xu,
Yan Wang,
Hongwei Qin,
Jun Zhang
Abstract:
Implicit neural representations (INRs) have emerged as a promising approach for video storage and processing, showing remarkable versatility across various video tasks. However, existing methods often fail to fully leverage their representation capabilities, primarily due to inadequate alignment of intermediate features during target frame decoding. This paper introduces a universal boosting frame…
▽ More
Implicit neural representations (INRs) have emerged as a promising approach for video storage and processing, showing remarkable versatility across various video tasks. However, existing methods often fail to fully leverage their representation capabilities, primarily due to inadequate alignment of intermediate features during target frame decoding. This paper introduces a universal boosting framework for current implicit video representation approaches. Specifically, we utilize a conditional decoder with a temporal-aware affine transform module, which uses the frame index as a prior condition to effectively align intermediate features with target frames. Besides, we introduce a sinusoidal NeRV-like block to generate diverse intermediate features and achieve a more balanced parameter distribution, thereby enhancing the model's capacity. With a high-frequency information-preserving reconstruction loss, our approach successfully boosts multiple baseline INRs in the reconstruction quality and convergence speed for video regression, and exhibits superior inpainting and interpolation results. Further, we integrate a consistent entropy minimization technique and develop video codecs based on these boosted INRs. Experiments on the UVG dataset confirm that our enhanced codecs significantly outperform baseline INRs and offer competitive rate-distortion performance compared to traditional and learning-based codecs. Code is available at https://github.com/Xinjie-Q/Boosting-NeRV.
△ Less
Submitted 16 March, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Recent Advances in Model-Based Fault Diagnosis for Lithium-Ion Batteries: A Comprehensive Review
Authors:
Yiming Xu,
Xiaohua Ge,
Ruohan Guo,
Weixiang Shen
Abstract:
Lithium-ion batteries (LIBs) have found wide applications in a variety of fields such as electrified transportation, stationary storage and portable electronics devices. A battery management system (BMS) is critical to ensure the reliability, efficiency and longevity of LIBs. Recent research has witnessed the emergence of model-based fault diagnosis methods in advanced BMSs. This paper provides a…
▽ More
Lithium-ion batteries (LIBs) have found wide applications in a variety of fields such as electrified transportation, stationary storage and portable electronics devices. A battery management system (BMS) is critical to ensure the reliability, efficiency and longevity of LIBs. Recent research has witnessed the emergence of model-based fault diagnosis methods in advanced BMSs. This paper provides a comprehensive review on the model-based fault diagnosis methods for LIBs. First, the widely explored battery models in the existing literature are classified into physics-based electrochemical models and electrical equivalent circuit models. Second, a general state-space representation that describes electrical dynamics of a faulty battery is presented. The formulation of the state vectors and the identification of the parameter matrices are then elaborated. Third, the fault mechanisms of both battery faults (incl. overcharege/overdischarge faults, connection faults, short circuit faults) and sensor faults (incl. voltage sensor faults and current sensor faults) are discussed. Furthermore, different types of modeling uncertainties, such as modeling errors and measurement noises, aging effects, measurement outliers, are elaborated. An emphasis is then placed on the observer design (incl. online state observers and offline state observers). The algorithm implementation of typical state observers for battery fault diagnosis is also put forward. Finally, discussion and outlook are offered to envision some possible future research directions.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Learning the Dynamics of Future Marine Microgrids Using Temporal Convolutional Neural Network
Authors:
Xiaoyu Ge,
Ali Hosseinipour,
Saskia Putri,
Faegheh Moazeni,
Javad Khazaei
Abstract:
Medium-voltage direct-current (MVDC) ship-board microgrids (SMGs) are the state-of-the-art architecture for onboard power distribution in navy. These systems are considered to be highly dynamic due to high penetration of power electronic converters and volatile load patterns such as pulsed-power load (PPL) and propulsion motors demand variation. Obtaining the dynamic model of an MVDC SMG is a chal…
▽ More
Medium-voltage direct-current (MVDC) ship-board microgrids (SMGs) are the state-of-the-art architecture for onboard power distribution in navy. These systems are considered to be highly dynamic due to high penetration of power electronic converters and volatile load patterns such as pulsed-power load (PPL) and propulsion motors demand variation. Obtaining the dynamic model of an MVDC SMG is a challenging task due to the confidentiality of system components models and uncertainty in the dynamic models through time. In this paper, a dynamic identification framework based on a temporal convolutional neural network (TCN) is developed to learn the system dynamics from measurement data. Different kinds of testing scenarios are implemented, and the testing results show that this approach achieves an exceptional performance and high generalization ability, thus holding substantial promise for development of advanced data-driven control strategies and stability prediction of the system.
△ Less
Submitted 1 May, 2024; v1 submitted 6 December, 2023;
originally announced December 2023.
-
Nonlinear Model Predictive Control for Navy Microgrids with Stabilizing Terminal Ingredients
Authors:
Saskia Putri,
Xiaoyu Ge,
Faegheh Moazeni,
Javad Khazaei
Abstract:
This paper presents a novel control strategy for medium voltage DC (MVDC) naval shipboard microgrids (MGs), employing a nonlinear model predictive controller (NMPC) enhanced with stabilizing features and an intricate droop control architecture. This combination quickly regulates the output voltage and adeptly allocates supercapacitors for pulsed power loads (PPLs), while the battery energy storage…
▽ More
This paper presents a novel control strategy for medium voltage DC (MVDC) naval shipboard microgrids (MGs), employing a nonlinear model predictive controller (NMPC) enhanced with stabilizing features and an intricate droop control architecture. This combination quickly regulates the output voltage and adeptly allocates supercapacitors for pulsed power loads (PPLs), while the battery energy storage system (BESS) and auxiliary generators handle the steady state loads. A key feature of this study is the formulation of terminal cost and constraints, providing recursive feasibility and closed-loop stability in the Lyapunov sense, that offers a more robust and effective approach to naval power and energy management. By comparing the proposed Lyapunov-based NMPC with conventional PI controller under fluctuating PPLs, the control robustness is validated.
△ Less
Submitted 1 May, 2024; v1 submitted 6 December, 2023;
originally announced December 2023.
-
Voltage Restoration in MVDC Shipboard Microgrids with Economic Nonlinear Model Predictive Control
Authors:
Saskia Putri,
Ali Hosseinipour,
Xiaoyu Ge,
Faegheh Moazeni,
Javad Khazaei
Abstract:
Future Naval Microgrids (MGs) will include hybrid energy storage systems (ESS), including battery and supercapacitors to respond to emerging constant power loads (CPLs) and fluctuating pulsed power loads (PPLs). Voltage regulation of naval microgrids and power sharing among these resources become critical for success of a mission. This paper presents a novel control strategy using nonlinear model…
▽ More
Future Naval Microgrids (MGs) will include hybrid energy storage systems (ESS), including battery and supercapacitors to respond to emerging constant power loads (CPLs) and fluctuating pulsed power loads (PPLs). Voltage regulation of naval microgrids and power sharing among these resources become critical for success of a mission. This paper presents a novel control strategy using nonlinear model predictive controller embedded with a complex droop control architecture for voltage restoration and power sharing in medium voltage DC (MVDC) Naval MGs. The complex droop control ensures allocating supercapacitors (SCs) for high-frequency loads (i.e., PPLs), while battery energy storage system (BESS) and auxiliary generators share the steady-state load (i.e., CPL). Compared to state-of-the-art control of the naval ship MGs that relies on linear models, the proposed method incorporates the nonlinear behavior of the MGs in the closed-loop control framework via nonlinear model predictive control (NMPC). A reduced order representation of the MVDC dynamic is employed as the prediction model, augmented with a multi-objective, constraints-based, optimal control formulation. The results demonstrate the effectiveness of the proposed control framework for voltage restoration and power sharing of resources in naval MGs.
△ Less
Submitted 1 May, 2024; v1 submitted 6 December, 2023;
originally announced December 2023.
-
Vulnerability of Building Energy Management against Targeted False Data Injection Attacks:Model Predictive Control vs. Proportional Integral
Authors:
Xiaoyu Ge,
Kamelia Norouzi,
Faegheh Moazeni,
Mirel Sehic,
Javad Khazaei,
Parv Venkitasubramaniam,
Rick Blum
Abstract:
Cybersecurity in building energy management is crucial for protecting infrastructure, ensuring data integrity, and preventing unauthorized access or manipulation. This paper investigates the energy efficiency and cybersecurity of building energy management systems (BMS) against false data injection (FDI) attacks using proportional-integral (PI) and model predictive control (MPC) methods. Focusing…
▽ More
Cybersecurity in building energy management is crucial for protecting infrastructure, ensuring data integrity, and preventing unauthorized access or manipulation. This paper investigates the energy efficiency and cybersecurity of building energy management systems (BMS) against false data injection (FDI) attacks using proportional-integral (PI) and model predictive control (MPC) methods. Focusing on a commercial building model with five rooms, vulnerability of PI-based BMS and nonlinear MPC-based BMS against FDIs on sensors and actuators is studied. The study aims to assess the effectiveness of these control strategies in maintaining system performance and lifespan, highlighting the potential of MPC in enhancing system resilience against cyber threats. Our case studies demonstrate that even a short term FDIA can cause a 12% reduction in lifetime of a heat-pump under an MPC controller, and cause a near thirty-fold overuse of flow valves under a PI controller.
△ Less
Submitted 1 May, 2024; v1 submitted 6 December, 2023;
originally announced December 2023.
-
Toward ground-truth optical coherence tomography via three-dimensional unsupervised deep learning processing and data
Authors:
Renxiong Wu,
Fei Zheng,
Meixuan Li,
Shaoyan Huang,
Xin Ge,
Linbo Liu,
Yong Liu,
Guangming Ni
Abstract:
Optical coherence tomography (OCT) can perform non-invasive high-resolution three-dimensional (3D) imaging and has been widely used in biomedical fields, while it is inevitably affected by coherence speckle noise which degrades OCT imaging performance and restricts its applications. Here we present a novel speckle-free OCT imaging strategy, named toward-ground-truth OCT (tGT-OCT), that utilizes un…
▽ More
Optical coherence tomography (OCT) can perform non-invasive high-resolution three-dimensional (3D) imaging and has been widely used in biomedical fields, while it is inevitably affected by coherence speckle noise which degrades OCT imaging performance and restricts its applications. Here we present a novel speckle-free OCT imaging strategy, named toward-ground-truth OCT (tGT-OCT), that utilizes unsupervised 3D deep-learning processing and leverages OCT 3D imaging features to achieve speckle-free OCT imaging. Specifically, our proposed tGT-OCT utilizes an unsupervised 3D-convolution deep-learning network trained using random 3D volumetric data to distinguish and separate speckle from real structures in 3D imaging volumetric space; moreover, tGT-OCT effectively further reduces speckle noise and reveals structures that would otherwise be obscured by speckle noise while preserving spatial resolution. Results derived from different samples demonstrated the high-quality speckle-free 3D imaging performance of tGT-OCT and its advancement beyond the previous state-of-the-art.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
High-content stimulated Raman histology of human breast cancer
Authors:
Hongli Ni,
Chinmayee Prabhu Dessai,
Haonan Lin,
Wei Wang,
Shaoxiong Chen,
Yuhao Yuan,
Xiaowei Ge,
Jianpeng Ao,
Nolan Vild,
Ji-Xin Cheng
Abstract:
Histological examination is crucial for cancer diagnosis, including hematoxylin and eosin (H&E) staining for map** morphology and immunohistochemistry (IHC) staining for revealing chemical information. Recently developed two-color stimulated Raman histology could bypass the complex tissue processing to mimic H&E-like morphology. Yet, the underlying chemical features are not revealed, compromisin…
▽ More
Histological examination is crucial for cancer diagnosis, including hematoxylin and eosin (H&E) staining for map** morphology and immunohistochemistry (IHC) staining for revealing chemical information. Recently developed two-color stimulated Raman histology could bypass the complex tissue processing to mimic H&E-like morphology. Yet, the underlying chemical features are not revealed, compromising the effectiveness of prognostic stratification. Here, we present a high-content stimulated Raman histology (HC-SRH) platform that provides both morphological and chemical information for cancer diagnosis based on un-stained breast tissues. Through spectral unmixing in the C-H vibration window, HC-SRH can map unsaturated lipids, cellular protein, extracellular matrix, saturated lipid, and water in breast tissue. In this way, HC-SRH provides excellent contrast for various tissue components. Considering rapidness is important in clinical trials, we implemented spectral selective sampling to boost the speed of HC-SRH by one order. We also successfully demonstrated the HC-SRH in a clinical-compatible fiber laser-based SRS microscopy. With the widely rapid tuning capability of the advanced fiber laser, a clear chemical contrast of nucleic acid and solid-state ester is shown in the fingerprint result.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection
Authors:
Fuxiang Tao,
Wei Ma,
Xuri Ge,
Anna Esposito,
Alessandro Vinciarelli
Abstract:
This work shows that depression changes the correlation between features extracted from speech. Furthermore, it shows that using such an insight can improve the training speed and performance of depression detectors based on SVMs and LSTMs. The experiments were performed over the Androids Corpus, a publicly available dataset involving 112 speakers, including 58 people diagnosed with depression by…
▽ More
This work shows that depression changes the correlation between features extracted from speech. Furthermore, it shows that using such an insight can improve the training speed and performance of depression detectors based on SVMs and LSTMs. The experiments were performed over the Androids Corpus, a publicly available dataset involving 112 speakers, including 58 people diagnosed with depression by professional psychiatrists. The results show that the models used in the experiments improve in terms of training speed and performance when fed with feature correlation matrices rather than with feature vectors. The relative reduction of the error rate ranges between 23.1% and 26.6% depending on the model. The probable explanation is that feature correlation matrices appear to be more variable in the case of depressed speakers. Correspondingly, such a phenomenon can be thought of as a depression marker.
△ Less
Submitted 7 July, 2023; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Entropy-Based Energy Dissipation Analysis of Mobile Communication Systems
Authors:
Litao Yan,
Xiaohu Ge
Abstract:
Compared with the energy efficiency of conventional mobile communication systems, the energy efficiency of fifth generation (5G) communication systems has been improved more than 30 times. However, the energy consumption of 5G communication systems is 3 times of the energy consumption of fourth generation (4G) communication systems when the wireless traffic is increased more than 100 times in the…
▽ More
Compared with the energy efficiency of conventional mobile communication systems, the energy efficiency of fifth generation (5G) communication systems has been improved more than 30 times. However, the energy consumption of 5G communication systems is 3 times of the energy consumption of fourth generation (4G) communication systems when the wireless traffic is increased more than 100 times in the last decade. It is anticipated that the traffic of future sixth generation (6G) communication systems will keep an exponential growth in the next decade. It is a key issue how much space is left for improving of energy efficiency in mobile communication systems. To answer the question, an entropy-based energy dissipation model based on nonequilibrium thermodynamics is first proposed for mobile communication systems. Moreover, the theoretical minimal energy dissipation limits are derived for typical modulations in mobile communication systems. Simulation results show that the practical energy dissipation of information processing and information transmission is three and seven orders of magnitude away from the theoretical minimal energy dissipation limits in mobile communication systems, respectively. These results provide some guidelines for energy efficiency optimization in future mobile communication systems.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
Beamforming Design with Partial Channel Estimation and Feedback for FDD RIS-Assisted Systems
Authors:
Xiaochun Ge,
Shan** Yu,
Wenqian Shen,
Chengwen Xing,
Byonghyo Shim
Abstract:
Beamforming design with partial channel estimation and feedback for frequency-division duplexing (FDD) reconfigurable intelligent surface (RIS) assisted systems is considered in this paper. We leverage the observation that path angle information (PAI) varies more slowly than path gain information (PGI). Then, several dominant paths are selected among all the cascaded paths according to the known P…
▽ More
Beamforming design with partial channel estimation and feedback for frequency-division duplexing (FDD) reconfigurable intelligent surface (RIS) assisted systems is considered in this paper. We leverage the observation that path angle information (PAI) varies more slowly than path gain information (PGI). Then, several dominant paths are selected among all the cascaded paths according to the known PAI for maximizing the spectral efficiency of downlink data transmission. To acquire the dominating path gain information (DPGI, also regarded as the path gains of selected dominant paths) at the base station (BS), we propose a DPGI estimation and feedback scheme by jointly beamforming design at BS and RIS. Both the required number of downlink pilot signals and the length of uplink feedback vector are reduced to the number of dominant paths, and thus we achieve a great reduction of the pilot overhead and feedback overhead. Furthermore, we optimize the active BS beamformer and passive RIS beamformer by exploiting the feedback DPGI to further improve the spectral efficiency. From numerical results, we demonstrate the superiority of our proposed algorithms over the conventional schemes.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Authors:
Xiaofeng Ge,
Jiangyu Han,
Haixin Guan,
Yanhua Long
Abstract:
Recently, more and more personalized speech enhancement systems (PSE) with excellent performance have been proposed. However, two critical issues still limit the performance and generalization ability of the model: 1) Acoustic environment mismatch between the test noisy speech and target speaker enrollment speech; 2) Hard sample mining and learning. In this paper, dynamic acoustic compensation (DA…
▽ More
Recently, more and more personalized speech enhancement systems (PSE) with excellent performance have been proposed. However, two critical issues still limit the performance and generalization ability of the model: 1) Acoustic environment mismatch between the test noisy speech and target speaker enrollment speech; 2) Hard sample mining and learning. In this paper, dynamic acoustic compensation (DAC) is proposed to alleviate the environment mismatch, by intercepting the noise or environmental acoustic segments from noisy speech and mixing it with the clean enrollment speech. To well exploit the hard samples in training data, we propose an adaptive focal training (AFT) strategy by assigning adaptive loss weights to hard and non-hard samples during training. A time-frequency multi-loss training is further introduced to improve and generalize our previous work sDPCCN for PSE. The effectiveness of proposed methods are examined on the DNS4 Challenge dataset. Results show that, the DAC brings large improvements in terms of multiple evaluation metrics, and AFT reduces the hard sample rate significantly and produces obvious MOS score improvement.
△ Less
Submitted 22 November, 2022;
originally announced November 2022.
-
Preprocessing Enhanced Image Compression for Machine Vision
Authors:
Guo Lu,
Xingtong Ge,
Tianxiong Zhong,
**g Geng,
Qiang Hu
Abstract:
Recently, more and more images are compressed and sent to the back-end devices for the machine analysis tasks~(\textit{e.g.,} object detection) instead of being purely watched by humans. However, most traditional or learned image codecs are designed to minimize the distortion of the human visual system without considering the increased demand from machine vision systems. In this work, we propose a…
▽ More
Recently, more and more images are compressed and sent to the back-end devices for the machine analysis tasks~(\textit{e.g.,} object detection) instead of being purely watched by humans. However, most traditional or learned image codecs are designed to minimize the distortion of the human visual system without considering the increased demand from machine vision systems. In this work, we propose a preprocessing enhanced image compression method for machine vision tasks to address this challenge. Instead of relying on the learned image codecs for end-to-end optimization, our framework is built upon the traditional non-differential codecs, which means it is standard compatible and can be easily deployed in practical applications. Specifically, we propose a neural preprocessing module before the encoder to maintain the useful semantic information for the downstream tasks and suppress the irrelevant information for bitrate saving. Furthermore, our neural preprocessing module is quantization adaptive and can be used in different compression ratios. More importantly, to jointly optimize the preprocessing module with the downstream machine vision tasks, we introduce the proxy network for the traditional non-differential codecs in the back-propagation stage. We provide extensive experiments by evaluating our compression method for two representative downstream tasks with different backbone networks. Experimental results show our method achieves a better trade-off between the coding bitrate and the performance of the downstream machine vision tasks by saving about 20% bitrate.
△ Less
Submitted 11 June, 2022;
originally announced June 2022.
-
PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Authors:
Xiaofeng Ge,
Jiangyu Han,
Yanhua Long,
Haixin Guan
Abstract:
PercepNet, a recent extension of the RNNoise, an efficient, high-quality and real-time full-band speech enhancement technique, has shown promising performance in various public deep noise suppression tasks. This paper proposes a new approach, named PercepNet+, to further extend the PercepNet with four significant improvements. First, we introduce a phase-aware structure to leverage the phase infor…
▽ More
PercepNet, a recent extension of the RNNoise, an efficient, high-quality and real-time full-band speech enhancement technique, has shown promising performance in various public deep noise suppression tasks. This paper proposes a new approach, named PercepNet+, to further extend the PercepNet with four significant improvements. First, we introduce a phase-aware structure to leverage the phase information into PercepNet, by adding the complex features and complex subband gains as the deep network input and output respectively. Then, a signal-to-noise ratio (SNR) estimator and an SNR switched post-processing are specially designed to alleviate the over attenuation (OA) that appears in high SNR conditions of the original PercepNet. Moreover, the GRU layer is replaced by TF-GRU to model both temporal and frequency dependencies. Finally, we propose to integrate the loss of complex subband gain, SNR, pitch filtering strength, and an OA loss in a multi-objective learning manner to further improve the speech enhancement performance. Experimental results show that, the proposed PercepNet+ outperforms the original PercepNet significantly in terms of both PESQ and STOI, without increasing the model size too much.
△ Less
Submitted 4 March, 2022;
originally announced March 2022.
-
Modelling and Optimization of OAM-MIMO Communication Systems with Unaligned Antennas
Authors:
Xusheng Xiong,
Hanqiong Lou,
Xiaohu Ge
Abstract:
The orbital angular momentum (OAM) wireless communication technique is emerging as one of potential techniques for the Sixth generation (6G) wireless communication system. The most advantage of OAM wireless communication technique is the natural orthogonality among different OAM states. However, one of the most disadvantages is the crosstalk among different OAM states which is widely caused by the…
▽ More
The orbital angular momentum (OAM) wireless communication technique is emerging as one of potential techniques for the Sixth generation (6G) wireless communication system. The most advantage of OAM wireless communication technique is the natural orthogonality among different OAM states. However, one of the most disadvantages is the crosstalk among different OAM states which is widely caused by the atmospheric turbulence and misalignment between transmitting and receiving antennas. Considering the OAM-based multiple-input multiple-output (OAM-MIMO) transmission system with unaligned antennas, a new channel model is proposed for performance analysis. Moreover, a purity model of the OAM-MIMO transmission system with unaligned antennas is derived for the non-Kolmogorov turbulence. Furthermore, error probability and capacity models are derived for OAM-MIMO transmission systems with unaligned antennas. To overcome the disadvantage caused by unaligned antennas and non-Kolmogorov turbulence, a new optimization algorithm of OAM state interval is proposed to improve the capacity of OAM-MIMO transmission system. Numerical results indicate that the capacity of OAM-MIMO transmission system is improved by the optimization algorithm. Specifically, the capacity increment of OAM-MIMO transmission system adopting the optimization algorithm is up to 28.7% and 320.3% when the angle of deflection between transmitting and receiving antennas is -24 dB and -5 dB, respectively.
△ Less
Submitted 9 December, 2021;
originally announced December 2021.
-
Improving Intention Detection in Single-Trial Classification through Fusion of EEG and Eye-tracker Data
Authors:
Xianliang Ge,
Yunxian Pan,
Sujie Wang,
Linze Qian,
**gjia Yuan,
Jie Xu,
Nitish Thakor,
Yu Sun
Abstract:
Intention decoding is an indispensable procedure in hands-free human-computer interaction (HCI). Conventional eye-tracking system using single-model fixation duration possibly issues commands ignoring users' real expectation. In the current study, an eye-brain hybrid brain-computer interface (BCI) interaction system was introduced for intention detection through fusion of multi-modal eye-track and…
▽ More
Intention decoding is an indispensable procedure in hands-free human-computer interaction (HCI). Conventional eye-tracking system using single-model fixation duration possibly issues commands ignoring users' real expectation. In the current study, an eye-brain hybrid brain-computer interface (BCI) interaction system was introduced for intention detection through fusion of multi-modal eye-track and ERP (a measurement derived from EEG) features. Eye-track and EEG data were recorded from 64 healthy participants as they performed a 40-min customized free search task of a fixed target icon among 25 icons. The corresponding fixation duration of eye-tracking and ERP were extracted. Five previously-validated LDA-based classifiers (including RLDA, SWLDA, BLDA, SKLDA, and STDA) and the widely-used CNN method were adopted to verify the efficacy of feature fusion from both offline and pseudo-online analysis, and optimal approach was evaluated through modulating the training set and system response duration. Our study demonstrated that the input of multi-modal eye-track and ERP features achieved superior performance of intention detection in the single trial classification of active search task. And compared with single-model ERP feature, this new strategy also induced congruent accuracy across different classifiers. Moreover, in comparison with other classification methods, we found that the SKLDA exhibited the superior performance when fusing feature in offline test (ACC=0.8783, AUC=0.9004) and online simulation with different sample amount and duration length. In sum, the current study revealed a novel and effective approach for intention classification using eye-brain hybrid BCI, and further supported the real-life application of hands-free HCI in a more precise and stable manner.
△ Less
Submitted 5 December, 2021;
originally announced December 2021.
-
Federated Orchestration for Network Slicing of Bandwidth and Computational Resource
Authors:
Yingyu Li,
Anqi Huang,
Yong Xiao,
Xiaohu Ge,
Sumei Sun,
Han-Chieh Chao
Abstract:
Network slicing has been considered as one of the key enablers for 5G to support diversified IoT services and application scenarios. This paper studies the distributed network slicing for a massive scale IoT network supported by 5G with fog computing. Multiple services with various requirements need to be supported by both spectrum resource offered by 5G network and computational resourc of the fo…
▽ More
Network slicing has been considered as one of the key enablers for 5G to support diversified IoT services and application scenarios. This paper studies the distributed network slicing for a massive scale IoT network supported by 5G with fog computing. Multiple services with various requirements need to be supported by both spectrum resource offered by 5G network and computational resourc of the fog computing network. We propose a novel distributed framework based on a new control plane entity, federated-orchestrator , which can coordinate the spectrum and computational resources without requiring any exchange of the local data and resource information from BSs. We propose a distributed resource allocation algorithm based on Alternating Direction Method of Multipliers with Partial Variable Splitting . We prove DistADMM-PVS minimizes the average service response time of the entire network with guaranteed worst-case performance for all supported types of services when the coordination between the F-orchestrator and BSs is perfectly synchronized. Motivated by the observation that coordination synchronization may result in high coordination delay that can be intolerable when the network is large in scale, we propose a novel asynchronized ADMM algorithm. We prove that AsynADMM can converge to the global optimal solution with improved scalability and negligible coordination delay. We evaluate the performance of our proposed framework using two-month of traffic data collected in a in-campus smart transportation system supported by a 5G network. Extensive simulation has been conducted for both pedestrian and vehicular-related services during peak and non-peak hours. Our results show that the proposed framework offers significant reduction on service response time for both supported services, especially compared to network slicing with only a single resource.
△ Less
Submitted 6 February, 2020;
originally announced February 2020.
-
Capacity-Aware Edge Caching in Fog Computing Networks
Authors:
Qiang Li,
Yuanmei Zhang,
Yingyu Li,
Yong Xiao,
Xiaohu Ge
Abstract:
This paper studies edge caching in fog computing networks, where a capacity-aware edge caching framework is proposed by considering both the limited fog cache capacity and the connectivity capacity of base stations (BSs). By allowing cooperation between fog nodes and cloud data center, the average-download-time (ADT) minimization problem is formulated as a multi-class processor queuing process. We…
▽ More
This paper studies edge caching in fog computing networks, where a capacity-aware edge caching framework is proposed by considering both the limited fog cache capacity and the connectivity capacity of base stations (BSs). By allowing cooperation between fog nodes and cloud data center, the average-download-time (ADT) minimization problem is formulated as a multi-class processor queuing process. We prove the convexity of the formulated problem and propose an Alternating Direction Method of Multipliers (ADMM)-based algorithm that can achieve the minimum ADT and converge much faster than existing algorithms. Simulation results demonstrate that the allocation of fog cache capacity and connectivity capacity of BSs needs to be balanced according to the network status. While the maximization of the edge-cache-hit-ratio (ECHR) by utilizing all available fog cache capacity is helpful when the BS connectivity capacity is sufficient, it is preferable to keep a lower ECHR and allocate more traffic to the cloud when the BS connectivity capacity is deficient.
△ Less
Submitted 6 February, 2020;
originally announced February 2020.
-
An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments
Authors:
Zhiwei Chen,
Yi Zhong,
Xiaohu Ge,
Yi Ma
Abstract:
In this paper, the real-time deployment of unmanned aerial vehicles (UAVs) as flying base stations (BSs) for optimizing the throughput of mobile users is investigated for UAV networks. This problem is formulated as a time-varying mixed-integer non-convex programming (MINP) problem, which is challenging to find an optimal solution in a short time with conventional optimization techniques. Hence, we…
▽ More
In this paper, the real-time deployment of unmanned aerial vehicles (UAVs) as flying base stations (BSs) for optimizing the throughput of mobile users is investigated for UAV networks. This problem is formulated as a time-varying mixed-integer non-convex programming (MINP) problem, which is challenging to find an optimal solution in a short time with conventional optimization techniques. Hence, we propose an actor-critic-based (AC-based) deep reinforcement learning (DRL) method to find near-optimal UAV positions at every moment. In the proposed method, the process searching for the solution iteratively at a particular moment is modeled as a Markov decision process (MDP). To handle infinite state and action spaces and improve the robustness of the decision process, two powerful neural networks (NNs) are configured to evaluate the UAV position adjustments and make decisions, respectively. Compared with the heuristic algorithm, sequential least-squares programming and fixed UAVs methods, simulation results have shown that the proposed method outperforms these three benchmarks in terms of the throughput at every moment in UAV networks.
△ Less
Submitted 3 February, 2020;
originally announced February 2020.
-
The New Purity and Capacity Models for the OAM-mmWave Communication Systems under Atmospheric Turbulence
Authors:
Hanqiong Lou,
Xiaohu Ge,
Qiang Li
Abstract:
The orbital angular momentum (OAM) wireless communication technology is widely studied in recent literatures. But the atmospheric turbulence is rarely considered in analyzing the capacity of OAM-based millimeter wave (OAM-mmWave) communication systems. The OAM-mmWave propagated in the atmosphere environments is usually interfered by the atmospheric turbulence, resulting in the crosstalk among OAM…
▽ More
The orbital angular momentum (OAM) wireless communication technology is widely studied in recent literatures. But the atmospheric turbulence is rarely considered in analyzing the capacity of OAM-based millimeter wave (OAM-mmWave) communication systems. The OAM-mmWave propagated in the atmosphere environments is usually interfered by the atmospheric turbulence, resulting in the crosstalk among OAM channels,capacity degradation, etc. By taking into account the atmospheric turbulence effect, this paper proposes a new purity model and a new capacity model for the OAM-mmWave communication systems. Simulation results indicate that the OAM-mmWave propagation in the atmosphere environments is evidently interfered by atmospheric turbulence, where the capacity of the OAMmmWave communication systems decreases with the increase of the transmission frequency.
△ Less
Submitted 10 September, 2019;
originally announced September 2019.
-
A New Small-World IoT Routing Mechanism based on Cayley Graphs
Authors:
Yuna Jiang,
Xiaohu Ge,
Yi Zhong,
Guoqiang Mao,
Yonghui Li
Abstract:
An increasing number of low-power Internet of Things (IoT) devices will be widely deployed in the near future. Considering the short-range communication of low-power devices, multi-hop transmissions will become an important transmission mechanism in IoT networks. It is crucial for lowpower devices to transmit data over long distances via multihop in a low-delay and reliable way. Small-world charac…
▽ More
An increasing number of low-power Internet of Things (IoT) devices will be widely deployed in the near future. Considering the short-range communication of low-power devices, multi-hop transmissions will become an important transmission mechanism in IoT networks. It is crucial for lowpower devices to transmit data over long distances via multihop in a low-delay and reliable way. Small-world characteristics of networks indicate that the network has an advantage of a small Average Shortest-path Length (ASL) and a high Average Clustering Coefficient (ACC). In this paper, a new IoT routing mechanism considering small-world characteristics is proposed to reduce the delay and improve the reliability. The ASL and ACC are derived for performance analysis of small-world characteristics in IoT networks based on Cayley graphs. Besides, the reliability and delay models are proposed for Small-World IoT based on Cayley grapHs (SWITCH). Simulation results demonstrate that SWITCH has lower delay and better reliability than that of conventional Nearest Neighboring Routing (NNR). Moreover, the maximum delay of SWITCH is reduced by 50.6% compared with that by NNR.
△ Less
Submitted 27 August, 2019;
originally announced August 2019.
-
Complex-BP-Neural-Network-based Hybrid Precoding for Millimeter Wave Multiuser Massive MIMO Systems
Authors:
Kai Chen,
**g Yang,
Xiaohu Ge,
Yonghui Li
Abstract:
The high energy consumption of massive multi-input multi-out (MIMO) system has become a prominent problem in the millimeter wave(mm-Wave) communication scenario. The hybrid precoding technology greatly reduces the number of radio frequency(RF) chains by handing over part of the coding work to the phase shifting network, which can effectively improve energy efficiency. However, conventional hybrid…
▽ More
The high energy consumption of massive multi-input multi-out (MIMO) system has become a prominent problem in the millimeter wave(mm-Wave) communication scenario. The hybrid precoding technology greatly reduces the number of radio frequency(RF) chains by handing over part of the coding work to the phase shifting network, which can effectively improve energy efficiency. However, conventional hybrid precoding algorithms based on mathematical means often suffer from performance loss and high computational complexity. In this paper, a novel BP-neural-network-enabled hybrid precoding algorithm is proposed, in which the full-digital zero-forcing(ZF) precoding is set as the training target. Considering that signals at the base station are complex, we choose the complex neural network that has a richer representational capacity. Besides, we present the activation function of the complex neural network and the gradient derivation of the back propagation process. Simulation results demonstrate that the performance of the proposed hybrid precoding algorithm can optimally approximate the ZF precoding.
△ Less
Submitted 1 August, 2019;
originally announced August 2019.
-
Power-Consumption Outage Challenge in Next-Generation Cellular Networks
Authors:
**g Yang,
Yi Zhong,
Xiaohu Ge,
Han-Chieh Chao
Abstract:
The conventional outage in wireless communication systems is caused by the deterioration of the wireless communication link, i.e., the received signal power is less than the minimum received signal power. Is there a possibility that the outage occurs in wireless communication systems with a good channel state? Based on both communication and heat transfer theories, a power-consumption outage in th…
▽ More
The conventional outage in wireless communication systems is caused by the deterioration of the wireless communication link, i.e., the received signal power is less than the minimum received signal power. Is there a possibility that the outage occurs in wireless communication systems with a good channel state? Based on both communication and heat transfer theories, a power-consumption outage in the wireless communication between millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) base stations (BSs) and smartphones has been modeled and analyzed. Moreover, the total transmission time model with respect to the number of power-consumption outages is derived for mmWave massive MIMO communication systems. Simulation results indicate that the total transmission time is extended by the power-consumption outage, which deteriorates the average transmission rate of mmWave massive MIMO BSs.
△ Less
Submitted 19 July, 2019;
originally announced July 2019.
-
Energy Efficiency Optimization of Generalized Spatial Modulation with Sub-Connected Hybrid Precoding
Authors:
Kai Chen,
**g Yang,
Xiaohu Ge,
Yonghui Li,
Lin Tian,
**glin Shi
Abstract:
Energy efficiency (EE) optimization of millimeter wave (mm-Wave) massive multiple-input multiple-output (MIMO) systems is emerging as an important challenge for the fifth generation (5G) mobile communication systems. However, the power of radio frequency (RF) chains increases sharply due to the high carrier frequency in mm-Wave massive MIMO systems. To overcome this issue, a new energy efficiency…
▽ More
Energy efficiency (EE) optimization of millimeter wave (mm-Wave) massive multiple-input multiple-output (MIMO) systems is emerging as an important challenge for the fifth generation (5G) mobile communication systems. However, the power of radio frequency (RF) chains increases sharply due to the high carrier frequency in mm-Wave massive MIMO systems. To overcome this issue, a new energy efficiency optimization solution is proposed based on the structure of the generalized spatial modulation (GSM) and sub-connected hybrid precoding (HP). Moreover, the computation power of mm-Wave massive MIMO systems is considered for optimizing the EE. Simulation results indicate that the EE of the GSM-HP scheme outperforms the full digital precoding (FDP) scheme in the mm-Wave massive MIMO scene, and 88\% computation power can be saved by the proposed GSM-HP scheme.
△ Less
Submitted 1 January, 2019;
originally announced January 2019.
-
Performance Analysis of Non-Orthogonal Multicast in Two-tier Heterogeneous Networks
Authors:
Yong Zhang,
Bin Yang,
Xiaohu Ge,
Yonghui Li
Abstract:
With the explosive growth of mobile services, non-orthogonal broadcast/multicast transmissions can effectively improves spectrum efficiency. Nonorthogonal multiple access (NOMA) represents a paradigm shift from conventional orthogonal multiple-access (OMA) concepts and has been recognized as one of the key enabling technologies for fifth-generation (5G) mobile networks. In this paper, a two-tier h…
▽ More
With the explosive growth of mobile services, non-orthogonal broadcast/multicast transmissions can effectively improves spectrum efficiency. Nonorthogonal multiple access (NOMA) represents a paradigm shift from conventional orthogonal multiple-access (OMA) concepts and has been recognized as one of the key enabling technologies for fifth-generation (5G) mobile networks. In this paper, a two-tier heterogeneous network is studied, in which the wireless signal power is partitioned by the NOMA scheme. Moreover, the coverage probability, the average rate and the average QoE are derived to evaluate network performance. Simulation results show that compared with the OMA method, non-orthogonal broadcast/multicast method improve both the average user rate and QoE in the two-tier heterogeneous network.
△ Less
Submitted 1 January, 2019;
originally announced January 2019.
-
Towards a Simple Relationship to Estimate the Capacity of Static and Mobile Wireless Networks
Authors:
Guoqiang Mao,
Zihuai Lin,
Xiaohu Ge,
Yang Yang
Abstract:
Extensive research has been done on studying the capacity of wireless multi-hop networks. These efforts have led to many sophisticated and customized analytical studies on the capacity of particular networks. While most of the analyses are intellectually challenging, they lack universal properties that can be extended to study the capacity of a different network. In this paper, we sift through var…
▽ More
Extensive research has been done on studying the capacity of wireless multi-hop networks. These efforts have led to many sophisticated and customized analytical studies on the capacity of particular networks. While most of the analyses are intellectually challenging, they lack universal properties that can be extended to study the capacity of a different network. In this paper, we sift through various capacity-impacting parameters and present a simple relationship that can be used to estimate the capacity of both static and mobile networks. Specifically, we show that the network capacity is determined by the average number of simultaneous transmissions, the link capacity and the average number of transmissions required to deliver a packet to its destination. Our result is valid for both finite networks and asymptotically infinite networks. We then use this result to explain and better understand the insights of some existing results on the capacity of static networks, mobile networks and hybrid networks and the multicast capacity. The capacity analysis using the aforementioned relationship often becomes simpler. The relationship can be used as a powerful tool to estimate the capacity of different networks. Our work makes important contributions towards develo** a generic methodology for network capacity analysis that is applicable to a variety of different scenarios.
△ Less
Submitted 6 June, 2013;
originally announced June 2013.