Search | arXiv e-print repository

Digital Wireless Image Transmission via Distribution Matching

Authors: Pu**g Yang, Guangyi Zhang, Yunlong Cai

Abstract: Deep learning-based joint source-channel coding (JSCC) is emerging as a potential technology to meet the demand for effective data transmission, particularly for image transmission. Nevertheless, most existing advancements only consider analog transmission, where the channel symbols are continuous, making them incompatible with practical digital communication systems. In this work, we address this… ▽ More Deep learning-based joint source-channel coding (JSCC) is emerging as a potential technology to meet the demand for effective data transmission, particularly for image transmission. Nevertheless, most existing advancements only consider analog transmission, where the channel symbols are continuous, making them incompatible with practical digital communication systems. In this work, we address this by involving the modulation process and consider map** the continuous channel symbols into discrete space. Recognizing the non-uniform distribution of the output channel symbols in existing methods, we propose two effective methods to improve the performance. Firstly, we introduce a uniform modulation scheme, where the distance between two constellations is adjustable to match the non-uniform nature of the distribution. In addition, we further design a non-uniform modulation scheme according to the output distribution. To this end, we first generate the constellations by performing feature clustering on an analog image transmission system, then the generated constellations are employed to modulate the continuous channel symbols. For both schemes, we fine-tune the digital system to alleviate the performance loss caused by modulation. Here, the straight-through estimator (STE) is considered to overcome the non-differentiable nature. Our experimental results demonstrate that the proposed schemes significantly outperform existing digital image transmission systems. △ Less

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.08337 [pdf, other]

WMAdapter: Adding WaterMark Control to Latent Diffusion Models

Authors: Hai Ci, Yiren Song, Pei Yang, **heng Xie, Mike Zheng Shou

Abstract: Watermarking is crucial for protecting the copyright of AI-generated images. We propose WMAdapter, a diffusion model watermark plugin that takes user-specified watermark information and allows for seamless watermark imprinting during the diffusion generation process. WMAdapter is efficient and robust, with a strong emphasis on high generation quality. To achieve this, we make two key designs: (1)… ▽ More Watermarking is crucial for protecting the copyright of AI-generated images. We propose WMAdapter, a diffusion model watermark plugin that takes user-specified watermark information and allows for seamless watermark imprinting during the diffusion generation process. WMAdapter is efficient and robust, with a strong emphasis on high generation quality. To achieve this, we make two key designs: (1) We develop a contextual adapter structure that is lightweight and enables effective knowledge transfer from heavily pretrained post-hoc watermarking models. (2) We introduce an extra finetuning step and design a hybrid finetuning strategy to further improve image quality and eliminate tiny artifacts. Empirical results demonstrate that WMAdapter offers strong flexibility, exceptional image generation quality and competitive watermark robustness. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 20 pages, 13 figures

arXiv:2406.05437 [pdf, ps, other]

From Analog to Digital: Multi-Order Digital Joint Coding-Modulation for Semantic Communication

Authors: Guangyi Zhang, Pu**g Yang, Yunlong Cai, Qiyu Hu, Guanding Yu

Abstract: Recent studies in joint source-channel coding (JSCC) have fostered a fresh paradigm in end-to-end semantic communication. Despite notable performance achievements, present initiatives in building semantic communication systems primarily hinge on the transmission of continuous channel symbols, thus presenting challenges in compatibility with established digital systems. In this paper, we introduce… ▽ More Recent studies in joint source-channel coding (JSCC) have fostered a fresh paradigm in end-to-end semantic communication. Despite notable performance achievements, present initiatives in building semantic communication systems primarily hinge on the transmission of continuous channel symbols, thus presenting challenges in compatibility with established digital systems. In this paper, we introduce a novel approach to address this challenge by develo** a multi-order digital joint coding-modulation (MDJCM) scheme for semantic communications. Initially, we construct a digital semantic communication system by integrating a multi-order modulation/demodulation module into a nonlinear transform source-channel coding (NTSCC) framework. Recognizing the non-differentiable nature of modulation/demodulation, we propose a novel substitution training strategy. Herein, we treat modulation/demodulation as a constrained quantization process and introduce scaling operations alongside manually crafted noise to approximate this process. As a result, employing this approximation in training semantic communication systems can be deployed in practical modulation/demodulation scenarios with superior performance. Additionally, we demonstrate the equivalence by analyzing the involved probability distribution. Moreover, to further upgrade the performance, we develop a hierarchical dimension-reduction strategy to provide a gradual information extraction process. Extensive experimental evaluations demonstrate the superiority of our proposed method over existing digital and non-digital JSCC techniques. △ Less

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2405.09497 [pdf, other]

Towards the limits: Sensing Capability Measurement for ISAC Through Channel Encoder

Authors: Fei Shang, Haohua Du, Panlong Yang, Xin He, Wen Ma, Xiang-Yang Li

Abstract: Integrated Sensing and Communication (ISAC) is gradually becoming a reality due to the significant increase in frequency and bandwidth of next-generation wireless communication technologies. Therefore it becomes crucial to evaluate the communication and sensing performance using appropriate channel models to address resource competition from each other. Existing work only models the sensing capabi… ▽ More Integrated Sensing and Communication (ISAC) is gradually becoming a reality due to the significant increase in frequency and bandwidth of next-generation wireless communication technologies. Therefore it becomes crucial to evaluate the communication and sensing performance using appropriate channel models to address resource competition from each other. Existing work only models the sensing capability based on the mutual information between the channel response and the received signal, and its theoretical resolution is difficult to support the high-precision requirements of ISAC for sensing tasks, and may even affect its communication optimal. In this paper, we propose a sensing channel encoder model to measure the sensing capacity with higher resolution by discrete task mutual information. For the first time, derive upper and lower bounds on the sensing accuracy for a given channel. This model not only provides the possibility of optimizing the ISAC systems at a finer granularity and balancing communication and sensing resources, but also provides theoretical explanations for classical intuitive feelings (like more modalities more accuracy) in wireless sensing. Furthermore, we validate the effectiveness of the proposed channel model through real-case studies, including person identification, displacement detection, direction estimation, and device recognition. The evaluation results indicate a Pearson correlation coefficient exceeding 0.9 between our task mutual information and conventional experimental metrics (e.g., accuracy). △ Less

Submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.04867 [pdf, other]

MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhi**g Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Hai** Zeng, Kai Feng , et al. (24 additional authors not shown)

Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging (MIPI). Building on the achievements of the previous MIPI Workshops held at ECCV 2022 and CVPR 2023, we introduce our third MIPI challenge including three tracks focusing on novel image sensors and imaging algorithms. In this paper, we summarize and review the Nighttime Flare Removal track on MIPI 2024. In total, 170 participants were successfully registered, and 14 teams submitted results in the final testing phase. The developed solutions in this challenge achieved state-of-the-art performance on Nighttime Flare Removal. More details of this challenge and the link to the dataset can be found at https://mipi-challenge.org/MIPI2024/. △ Less

Submitted 8 May, 2024; originally announced May 2024.

Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

arXiv:2405.02563 [pdf, other]

Deep Representation Learning-Based Dynamic Trajectory Phenoty** for Acute Respiratory Failure in Medical Intensive Care Units

Authors: Alan Wu, Tilendra Choudhary, Pulakesh Upadhyaya, Ayman Ali, Philip Yang, Rishikesan Kamaleswaran

Abstract: Sepsis-induced acute respiratory failure (ARF) is a serious complication with a poor prognosis. This paper presents a deep representation learningbased phenoty** method to identify distinct groups of clinical trajectories of septic patients with ARF. For this retrospective study, we created a dataset from electronic medical records (EMR) consisting of data from sepsis patients admitted to medica… ▽ More Sepsis-induced acute respiratory failure (ARF) is a serious complication with a poor prognosis. This paper presents a deep representation learningbased phenoty** method to identify distinct groups of clinical trajectories of septic patients with ARF. For this retrospective study, we created a dataset from electronic medical records (EMR) consisting of data from sepsis patients admitted to medical intensive care units who required at least 24 hours of invasive mechanical ventilation at a quarternary care academic hospital in southeast USA for the years 2016-2021. A total of N=3349 patient encounters were included in this study. Clustering Representation Learning on Incomplete Time Series Data (CRLI) algorithm was applied to a parsimonious set of EMR variables in this data set. To validate the optimal number of clusters, the K-means algorithm was used in conjunction with dynamic time war**. Our model yielded four distinct patient phenotypes that were characterized as liver dysfunction/heterogeneous, hypercapnia, hypoxemia, and multiple organ dysfunction syndrome by a critical care expert. A Kaplan-Meier analysis to compare the 28-day mortality trends exhibited significant differences (p < 0.005) between the four phenotypes. The study demonstrates the utility of our deep representation learning-based approach in unraveling phenotypes that reflect the heterogeneity in sepsis-induced ARF in terms of different mortality outcomes and severity. These phenotypes might reveal important clinical insights into an effective prognosis and tailored treatment strategies. △ Less

Submitted 4 May, 2024; originally announced May 2024.

Comments: 9 pages

arXiv:2404.06991 [pdf, other]

Ray-driven Spectral CT Reconstruction Based on Neural Base-Material Fields

Authors: Ligen Shi, Chang Liu, ** Yang, Jun Qiu, Xing Zhao

Abstract: In spectral CT reconstruction, the basis materials decomposition involves solving a large-scale nonlinear system of integral equations, which is highly ill-posed mathematically. This paper proposes a model that parameterizes the attenuation coefficients of the object using a neural field representation, thereby avoiding the complex calculations of pixel-driven projection coefficient matrices durin… ▽ More In spectral CT reconstruction, the basis materials decomposition involves solving a large-scale nonlinear system of integral equations, which is highly ill-posed mathematically. This paper proposes a model that parameterizes the attenuation coefficients of the object using a neural field representation, thereby avoiding the complex calculations of pixel-driven projection coefficient matrices during the discretization process of line integrals. It introduces a lightweight discretization method for line integrals based on a ray-driven neural field, enhancing the accuracy of the integral approximation during the discretization process. The basis materials are represented as continuous vector-valued implicit functions to establish a neural field parameterization model for the basis materials. The auto-differentiation framework of deep learning is then used to solve the implicit continuous function of the neural base-material fields. This method is not limited by the spatial resolution of reconstructed images, and the network has compact and regular properties. Experimental validation shows that our method performs exceptionally well in addressing the spectral CT reconstruction. Additionally, it fulfils the requirements for the generation of high-resolution reconstruction images. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 14 pages,16 figures

MSC Class: 68U05; 65D18 ACM Class: I.4.5; I.4.10

arXiv:2404.06687 [pdf, other]

Fast and Accurate Relative Motion Tracking for Two Industrial Robots

Authors: Honglu He, Chen-lung Lu, Glenn Saunders, **hai Yang, Jeffrey Schoonover, John Wason, Santiago Paternain, Agung Julius, John T. Wen

Abstract: Industrial robotic applications such as spraying, welding, and additive manufacturing frequently require fast, accurate, and uniform motion along a 3D spatial curve. To increase process throughput, some manufacturers propose a dual-robot setup to overcome the speed limitation of a single robot. Industrial robot motion is programmed through waypoints connected by motion primitives (Cartesian linear… ▽ More Industrial robotic applications such as spraying, welding, and additive manufacturing frequently require fast, accurate, and uniform motion along a 3D spatial curve. To increase process throughput, some manufacturers propose a dual-robot setup to overcome the speed limitation of a single robot. Industrial robot motion is programmed through waypoints connected by motion primitives (Cartesian linear and circular paths and linear joint paths at constant Cartesian speed). The actual robot motion is affected by the blending between these motion primitives and the pose of the robot (an outstretched/close to singularity pose tends to have larger path-tracking errors). Choosing the waypoints and the speed along each motion segment to achieve the performance requirement is challenging. At present, there is no automated solution, and laborious manual tuning by robot experts is needed to approach the desired performance. In this paper, we present a systematic three-step approach to designing and programming a dual-robot system to optimize system performance. The first step is to select the relative placement between the two robots based on the specified relative motion path. The second step is to select the relative waypoints and the motion primitives. The final step is to update the waypoints iteratively based on the actual relative motion. Waypoint iteration is first executed in simulation and then completed using the actual robots. For performance measures, we use the mean path speed subject to the relative position and orientation constraints and the path speed uniformity constraint. We have demonstrated the effectiveness of this method with ABB and FANUC robots on two challenging test curves. The performance improvement over the current industrial practice baseline is over 300%. Compared to the optimized single-arm case that we have previously reported, the improvement is over 14%. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.04844 [pdf, other]

Self-Evolving Wireless Communications: A Novel Intelligence Trend for 6G and Beyond

Authors: Liangxin Qian, ** Yang, Jun Zhao, Ze Chen, Wanbin Tang

Abstract: Wireless communication is rapidly evolving, and future wireless communications (6G and beyond) will be more heterogeneous, multi-layered, and complex, which poses challenges to traditional communications. Adaptive technologies in traditional communication systems respond to environmental changes by modifying system parameters and structures on their own and are not flexible and agile enough to sat… ▽ More Wireless communication is rapidly evolving, and future wireless communications (6G and beyond) will be more heterogeneous, multi-layered, and complex, which poses challenges to traditional communications. Adaptive technologies in traditional communication systems respond to environmental changes by modifying system parameters and structures on their own and are not flexible and agile enough to satisfy requirements in future communications. To tackle these challenges, we propose a novel self-evolving communication framework, which consists of three layers: data layer, information layer, and knowledge layer. The first two layers allow communication systems to sense environments, fuse data, and generate a knowledge base for the knowledge layer. When dealing with a variety of application scenarios and environments, the generated knowledge is subsequently fed back to the first two layers for communication in practical application scenarios to obtain self-evolving ability and enhance the robustness of the system. In this paper, we first highlight the limitations of current adaptive communication systems and the need for intelligence, automation, and self-evolution in future wireless communications. We overview the development of self-evolving technologies and conceive the concept of self-evolving communications with its hypothetical architecture. To demonstrate the power of self-evolving modules, we compare the performances of a communication system with and without evolution. We then provide some potential techniques that enable self-evolving communications and challenges in implementing them. △ Less

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2401.11090 [pdf, other]

Sharing Energy in Wide Area: A Two-Layer Energy Sharing Scheme for Massive Prosumers

Authors: Yifan Su, Peng Yang, Kai Kang, Zhaojian Wang, Ning Qi, Tonghua Liu, Feng Liu

Abstract: The popularization of distributed energy resources transforms end-users from consumers into prosumers. Inspired by the sharing economy principle, energy sharing markets for prosumers are proposed to facilitate the utilization of renewable energy. This paper proposes a novel two-layer energy sharing market for massive prosumers, which can promote social efficiency by wider-area sharing. In this mar… ▽ More The popularization of distributed energy resources transforms end-users from consumers into prosumers. Inspired by the sharing economy principle, energy sharing markets for prosumers are proposed to facilitate the utilization of renewable energy. This paper proposes a novel two-layer energy sharing market for massive prosumers, which can promote social efficiency by wider-area sharing. In this market, there is an upper-level wide-area market (WAM) in the distribution system and numerous lower-level local-area markets (LAMs) in communities. Prosumers in the same community share energy with each other in the LAM, which can be uncleared. The energy surplus and shortage of LAMs are cleared in the WAM. Thanks to the wide-area two-layer structure, the market outcome is near-social-optimal in large-scale systems. However, the proposed market forms a complex mathematical program with equilibrium constraints (MPEC). To solve the problem, we propose an efficient and hierarchically distributed bidding algorithm. The proposed two-layer market and bidding algorithm are verified on the IEEE 123-bus system with 11250 prosumers, which demonstrates the practicality and efficiency for large-scale markets. △ Less

Submitted 19 January, 2024; originally announced January 2024.

arXiv:2312.09429 [pdf]

Deep Learning-Enabled Swallowing Monitoring and Postoperative Recovery Biosensing System

Authors: Chih-Ning Tsai, Pei-Wen Yang, Tzu-Yen Huang, Jung-Chih Chen, Hsin-Yi Tseng, Che-Wei Wu, Amrit Sarmah, Tzu-En Lin

Abstract: This study introduces an innovative 3D printed dry electrode tailored for biosensing in postoperative recovery scenarios. Fabricated through a drop coating process, the electrode incorporates a novel 2D material. This study introduces an innovative 3D printed dry electrode tailored for biosensing in postoperative recovery scenarios. Fabricated through a drop coating process, the electrode incorporates a novel 2D material. △ Less

Submitted 24 November, 2023; originally announced December 2023.

Comments: the abstract can't uploaded fully

MSC Class: NA ACM Class: A.0

arXiv:2311.03557 [pdf, other]

Spatio-Temporal Similarity Measure based Multi-Task Learning for Predicting Alzheimer's Disease Progression using MRI Data

Authors: Xulong Wang, Yu Zhang, Menghui Zhou, Tong Liu, Jun Qi, Po Yang

Abstract: Identifying and utilising various biomarkers for tracking Alzheimer's disease (AD) progression have received many recent attentions and enable hel** clinicians make the prompt decisions. Traditional progression models focus on extracting morphological biomarkers in regions of interest (ROIs) from MRI/PET images, such as regional average cortical thickness and regional volume. They are effective… ▽ More Identifying and utilising various biomarkers for tracking Alzheimer's disease (AD) progression have received many recent attentions and enable hel** clinicians make the prompt decisions. Traditional progression models focus on extracting morphological biomarkers in regions of interest (ROIs) from MRI/PET images, such as regional average cortical thickness and regional volume. They are effective but ignore the relationships between brain ROIs over time, which would lead to synergistic deterioration. For exploring the synergistic deteriorating relationship between these biomarkers, in this paper, we propose a novel spatio-temporal similarity measure based multi-task learning approach for effectively predicting AD progression and sensitively capturing the critical relationships between biomarkers. Specifically, we firstly define a temporal measure for estimating the magnitude and velocity of biomarker change over time, which indicate a changing trend(temporal). Converting this trend into the vector, we then compare this variability between biomarkers in a unified vector space(spatial). The experimental results show that compared with directly ROI based learning, our proposed method is more effective in predicting disease progression. Our method also enables performing longitudinal stability selection to identify the changing relationships between biomarkers, which play a key role in disease progression. We prove that the synergistic deteriorating biomarkers between cortical volumes or surface areas have a significant effect on the cognitive prediction. △ Less

Submitted 6 November, 2023; originally announced November 2023.

arXiv:2310.09025 [pdf, other]

Survey on Near-Space Information Networks: Channel Modeling, Networking, and Transmission Perspectives

Authors: Xianbin Cao, Peng Yang, Xiaoning Su

Abstract: Near-space information networks (NSINs) composed of high-altitude platforms (HAPs) and high- and low-altitude unmanned aerial vehicles (UAVs) are a new regime for providing quick, robust, and cost-efficient sensing and communication services. Precipitated by innovations and breakthroughs in manufacturing, materials, communications, electronics, and control techniques, NSINs have been envisioned as… ▽ More Near-space information networks (NSINs) composed of high-altitude platforms (HAPs) and high- and low-altitude unmanned aerial vehicles (UAVs) are a new regime for providing quick, robust, and cost-efficient sensing and communication services. Precipitated by innovations and breakthroughs in manufacturing, materials, communications, electronics, and control techniques, NSINs have been envisioned as an essential component of the emerging sixth-generation of mobile communication systems. This article reveals some critical issues needing to be tackled in NSINs through conducting experiments and discusses the latest advances in NSINs in the research areas of channel modeling, networking, and transmission from a forward-looking, comparative, and technical evolutionary perspective. In this article, we highlight the characteristics of NSINs and present the promising use cases of NSINs. The impact of airborne platforms' unstable movements on the phase delays of onboard antenna arrays with diverse structures is mathematically analyzed. The recent advances in HAP channel modeling are elaborated on, along with the significant differences between HAP and UAV channel modeling. A comprehensive review of the networking techniques of NSINs in network deployment, handoff management, and network management aspects is provided. Besides, the promising techniques and communication protocols of the physical (PHY) layer, medium access control (MAC) layer, network layer, and transport layer of NSINs for achieving efficient transmission over NSINs are reviewed, and we have conducted experiments with practical NSINs to verify the performance of some techniques. Finally, we outline some open issues and promising directions for NSINs deserved for future study and discuss the corresponding challenges. △ Less

Submitted 13 May, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

arXiv:2309.12200 [pdf, other]

A Variational Auto-Encoder Enabled Multi-Band Channel Prediction Scheme for Indoor Localization

Authors: Ruihao Yuan, Kaixuan Huang, Pan Yang, Shunqing Zhang

Abstract: Indoor localization is getting increasing demands for various cutting-edged technologies, like Virtual/Augmented reality and smart home. Traditional model-based localization suffers from significant computational overhead, so fingerprint localization is getting increasing attention, which needs lower computation cost after the fingerprint database is built. However, the accuracy of indoor localiza… ▽ More Indoor localization is getting increasing demands for various cutting-edged technologies, like Virtual/Augmented reality and smart home. Traditional model-based localization suffers from significant computational overhead, so fingerprint localization is getting increasing attention, which needs lower computation cost after the fingerprint database is built. However, the accuracy of indoor localization is limited by the complicated indoor environment which brings the multipath signal refraction. In this paper, we provided a scheme to improve the accuracy of indoor fingerprint localization from the frequency domain by predicting the channel state information (CSI) values from another transmitting channel and spliced the multi-band information together to get more precise localization results. We tested our proposed scheme on COST 2100 simulation data and real time orthogonal frequency division multiplexing (OFDM) WiFi data collected from an office scenario. △ Less

Submitted 19 September, 2023; originally announced September 2023.

arXiv:2309.05026 [pdf, other]

Spatial Perceptual Quality Aware Adaptive Volumetric Video Streaming

Authors: Xi Wang, Wei Liu, Huitong Liu, Peng Yang

Abstract: Volumetric video offers a highly immersive viewing experience, but poses challenges in ensuring quality of experience (QoE) due to its high bandwidth requirements. In this paper, we explore the effect of viewing distance introduced by six degrees of freedom (6DoF) spatial navigation on user's perceived quality. By considering human visual resolution limitations, we propose a visual acuity model th… ▽ More Volumetric video offers a highly immersive viewing experience, but poses challenges in ensuring quality of experience (QoE) due to its high bandwidth requirements. In this paper, we explore the effect of viewing distance introduced by six degrees of freedom (6DoF) spatial navigation on user's perceived quality. By considering human visual resolution limitations, we propose a visual acuity model that describes the relationship between the virtual viewing distance and the tolerable boundary point cloud density. The proposed model satisfies spatial visual requirements during 6DoF exploration. Additionally, it dynamically adjusts quality levels to balance perceptual quality and bandwidth consumption. Furthermore, we present a QoE model to represent user's perceived quality at different viewing distances precisely. Extensive experimental results demonstrate that, the proposed scheme can effectively improve the overall average QoE by up to 26% over real networks and user traces, compared to existing baselines. △ Less

Submitted 10 September, 2023; originally announced September 2023.

Comments: Accepted byIEEE Globecom 2023

arXiv:2307.12264 [pdf, ps, other]

QoE-Driven Video Transmission: Energy-Efficient Multi-UAV Network Optimization

Authors: Kesong Wu, Xianbin Cao, Peng Yang, Zongyang Yu, Dapeng Oliver Wu, Tony Q. S. Quek

Abstract: This paper is concerned with the issue of improving video subscribers' quality of experience (QoE) by deploying a multi-unmanned aerial vehicle (UAV) network. Different from existing works, we characterize subscribers' QoE by video bitrates, latency, and frame freezing and propose to improve their QoE by energy-efficiently and dynamically optimizing the multi-UAV network in terms of serving UAV se… ▽ More This paper is concerned with the issue of improving video subscribers' quality of experience (QoE) by deploying a multi-unmanned aerial vehicle (UAV) network. Different from existing works, we characterize subscribers' QoE by video bitrates, latency, and frame freezing and propose to improve their QoE by energy-efficiently and dynamically optimizing the multi-UAV network in terms of serving UAV selection, UAV trajectory, and UAV transmit power. The dynamic multi-UAV network optimization problem is formulated as a challenging sequential-decision problem with the goal of maximizing subscribers' QoE while minimizing the total network power consumption, subject to some physical resource constraints. We propose a novel network optimization algorithm to solve this challenging problem, in which a Lyapunov technique is first explored to decompose the sequential-decision problem into several repeatedly optimized sub-problems to avoid the curse of dimensionality. To solve the sub-problems, iterative and approximate optimization mechanisms with provable performance guarantees are then developed. Finally, we design extensive simulations to verify the effectiveness of the proposed algorithm. Simulation results show that the proposed algorithm can effectively improve the QoE of subscribers and is 66.75\% more energy-efficient than benchmarks. △ Less

Submitted 23 July, 2023; originally announced July 2023.

arXiv:2307.09770 [pdf, other]

Perturbing a Neural Network to Infer Effective Connectivity: Evidence from Synthetic EEG Data

Authors: Peizhen Yang, Xinke Shen, Zongsheng Li, Zixiang Luo, Kexin Lou, Quanying Liu

Abstract: Identifying causal relationships among distinct brain areas, known as effective connectivity, holds key insights into the brain's information processing and cognitive functions. Electroencephalogram (EEG) signals exhibit intricate dynamics and inter-areal interactions within the brain. However, methods for characterizing nonlinear causal interactions among multiple brain regions remain relatively… ▽ More Identifying causal relationships among distinct brain areas, known as effective connectivity, holds key insights into the brain's information processing and cognitive functions. Electroencephalogram (EEG) signals exhibit intricate dynamics and inter-areal interactions within the brain. However, methods for characterizing nonlinear causal interactions among multiple brain regions remain relatively underdeveloped. In this study, we proposed a data-driven framework to infer effective connectivity by perturbing the trained neural networks. Specifically, we trained neural networks (i.e., CNN, vanilla RNN, GRU, LSTM, and Transformer) to predict future EEG signals according to historical data and perturbed the networks' input to obtain effective connectivity (EC) between the perturbed EEG channel and the rest of the channels. The EC reflects the causal impact of perturbing one node on others. The performance was tested on the synthetic EEG generated by a biological-plausible Jansen-Rit model. CNN and Transformer obtained the best performance on both 3-channel and 90-channel synthetic EEG data, outperforming the classical Granger causality method. Our work demonstrated the potential of perturbing an artificial neural network, learned to predict future system dynamics, to uncover the underlying causal structure. △ Less

Submitted 19 July, 2023; originally announced July 2023.

Comments: 7 pages, 3 figures, 1 table

arXiv:2307.03387 [pdf, ps, other]

A Joint Design for Full-duplex OFDM AF Relay System with Precoded Short Guard Interval

Authors: Pu Yang, Xiang-Gen Xia, Qingyue Qu, Han Wang, Yi Liu

Abstract: In-band full-duplex relay (FDR) has attracted much attention as an effective solution to improve the coverage and spectral efficiency in wireless communication networks. The basic problem for FDR transmission is how to eliminate the inherent self-interference and re-use the residual self-interference (RSI) at the relay to improve the end-to-end performance. Considering the RSI at the FDR, the over… ▽ More In-band full-duplex relay (FDR) has attracted much attention as an effective solution to improve the coverage and spectral efficiency in wireless communication networks. The basic problem for FDR transmission is how to eliminate the inherent self-interference and re-use the residual self-interference (RSI) at the relay to improve the end-to-end performance. Considering the RSI at the FDR, the overall equivalent channel can be modeled as an infinite impulse response (IIR) channel. For this IIR channel, a joint design for precoding, power gain control and equalization of cooperative OFDM relay systems is presented. Compared with the traditional OFDM systems, the length of the guard interval for the proposed design can be distinctly reduced, thereby improving the spectral efficiency. By analyzing the noise sources, this paper evaluates the signal to noise ratio (SNR) of the proposed scheme and presents a power gain control algorithm at the FDR. Compared with the existing schemes, the proposed scheme shows a superior bit error rate (BER) performance. △ Less

Submitted 7 July, 2023; originally announced July 2023.

Comments: 16 pages, 5 figures

MSC Class: 94-10 ACM Class: H.1.1

arXiv:2304.04163 [pdf, ps, other]

Energy-Efficient URLLC Service Provision via a Near-Space Information Network

Authors: Puguang An, Peng Yang, Xianbin Cao, Kun Guo, Yue Gao, Tony Q. S. Quek

Abstract: The integration of a near-space information network (NSIN) with the reconfigurable intelligent surface (RIS) is envisioned to significantly enhance the communication performance of future wireless communication systems by proactively altering wireless channels. This paper investigates the problem of deploying a RIS-integrated NSIN to provide energy-efficient, ultra-reliable and low-latency communi… ▽ More The integration of a near-space information network (NSIN) with the reconfigurable intelligent surface (RIS) is envisioned to significantly enhance the communication performance of future wireless communication systems by proactively altering wireless channels. This paper investigates the problem of deploying a RIS-integrated NSIN to provide energy-efficient, ultra-reliable and low-latency communications (URLLC) services. We mathematically formulate this problem as a resource optimization problem, aiming to maximize the effective throughput and minimize the system power consumption, subject to URLLC and physical resource constraints. The formulated problem is challenging in terms of accurate channel estimation, RIS phase alignment, theoretical analysis, and effective solution. We propose a joint resource allocation algorithm to handle these challenges. In this algorithm, we develop an accurate channel estimation approach by exploring message passing and optimize phase shifts of RIS reflecting elements to further increase the channel gain. Besides, we derive an analysis-friend expression of decoding error probability and decompose the problem into two-layered optimization problems by analyzing the monotonicity, which makes the formulated problem analytically tractable. Extensive simulations have been conducted to verify the performance of the proposed algorithm. Simulation results show that the proposed algorithm can achieve outstanding channel estimation performance and is more energy-efficient than diverse benchmark algorithms. △ Less

Submitted 14 February, 2024; v1 submitted 9 April, 2023; originally announced April 2023.

arXiv:2303.07711 [pdf, other]

Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

Authors: Chunyu Qiang, Peng Yang, Hao Che, Ying Zhang, Xiaorui Wang, Zhongyuan Wang

Abstract: Cross-speaker style transfer in speech synthesis aims at transferring a style from source speaker to synthesized speech of a target speaker's timbre. In most previous methods, the synthesized fine-grained prosody features often represent the source speaker's average style, similar to the one-to-many problem(i.e., multiple prosody variations correspond to the same text). In response to this problem… ▽ More Cross-speaker style transfer in speech synthesis aims at transferring a style from source speaker to synthesized speech of a target speaker's timbre. In most previous methods, the synthesized fine-grained prosody features often represent the source speaker's average style, similar to the one-to-many problem(i.e., multiple prosody variations correspond to the same text). In response to this problem, a strength-controlled semi-supervised style extractor is proposed to disentangle the style from content and timbre, improving the representation and interpretability of the global style embedding, which can alleviate the one-to-many map** and data imbalance problems in prosody prediction. A hierarchical prosody predictor is proposed to improve prosody modeling. We find that better style transfer can be achieved by using the source speaker's prosody features that are easily predicted. Additionally, a speaker-transfer-wise cycle consistency loss is proposed to assist the model in learning unseen style-timbre combinations during the training phase. Experimental results show that the method outperforms the baseline. We provide a website with audio samples. △ Less

Submitted 14 March, 2023; originally announced March 2023.

Comments: Accepted by ICASSP2023

arXiv:2301.02348 [pdf, other]

High-Speed High-Accuracy Spatial Curve Tracking Using Motion Primitives in Industrial Robots

Authors: Honglu He, Chen-lung Lu, Yunshi Wen, Glenn Saunders, **hai Yang, Jeffrey Schoonover, Agung Julius, John T. Wen

Abstract: Industrial robots are increasingly deployed in applications requiring an end effector tool to closely track a specified path, such as in spraying and welding. Performance and productivity present possibly conflicting objectives: tracking accuracy, path speed, and motion uniformity. Industrial robots are programmed through motion primitives consisting of waypoints connected by pre-defined motion se… ▽ More Industrial robots are increasingly deployed in applications requiring an end effector tool to closely track a specified path, such as in spraying and welding. Performance and productivity present possibly conflicting objectives: tracking accuracy, path speed, and motion uniformity. Industrial robots are programmed through motion primitives consisting of waypoints connected by pre-defined motion segments, with specified parameters such as path speed and blending zone. The actual executed robot motion depends on the robot joint servo controller and joint motion constraints (velocity, acceleration, etc.) which are largely unknown to the users. Programming a robot to achieve the desired performance today is time-consuming and mostly manual, requiring tuning a large number of coupled parameters in the motion primitives. The performance also depends on the choice of additional parameters: possible redundant degrees of freedom, location of the target curve, and the robot configuration. This paper presents a systematic approach to optimize the robot motion primitives for performance. The approach first selects the static parameters, then the motion primitives, and finally iteratively update the waypoints to minimize the tracking error. The ultimate performance objective is to maximize the path speed subject to the tracking accuracy and speed uniformity constraints over the entire path. We have demonstrated the effectiveness of this approach in simulation for ABB and FANUC robots for two challenging example curves, and experimentally for an ABB robot. Comparing with the baseline using the current industry practice, the optimized performance shows over 200% performance improvement. △ Less

Submitted 5 January, 2023; originally announced January 2023.

arXiv:2301.01182 [pdf, other]

PMT-IQA: Progressive Multi-task Learning for Blind Image Quality Assessment

Authors: Qingyi Pan, Ning Guo, Letu Qingge, **gyi Zhang, Pei Yang

Abstract: Blind image quality assessment (BIQA) remains challenging due to the diversity of distortion and image content variation, which complicate the distortion patterns crossing different scales and aggravate the difficulty of the regression problem for BIQA. However, existing BIQA methods often fail to consider multi-scale distortion patterns and image content, and little research has been done on lear… ▽ More Blind image quality assessment (BIQA) remains challenging due to the diversity of distortion and image content variation, which complicate the distortion patterns crossing different scales and aggravate the difficulty of the regression problem for BIQA. However, existing BIQA methods often fail to consider multi-scale distortion patterns and image content, and little research has been done on learning strategies to make the regression model produce better performance. In this paper, we propose a simple yet effective Progressive Multi-Task Image Quality Assessment (PMT-IQA) model, which contains a multi-scale feature extraction module (MS) and a progressive multi-task learning module (PMT), to help the model learn complex distortion patterns and better optimize the regression issue to align with the law of human learning process from easy to hard. To verify the effectiveness of the proposed PMT-IQA model, we conduct experiments on four widely used public datasets, and the experimental results indicate that the performance of PMT-IQA is superior to the comparison approaches, and both MS and PMT modules improve the model's performance. △ Less

Submitted 3 November, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

arXiv:2301.01140 [pdf, other]

Performance Analysis and Enhancement of Beamforming Training in 802.11ad

Authors: Wen Wu, Nan Cheng, Ning Zhang, Peng Yang, Khalid Aldubaikhy, Xuemin, Shen

Abstract: Beamforming (BF) training is crucial to establishing reliable millimeter-wave communication connections between stations (STAs) and an access point. In IEEE 802.11ad BF training protocol, all STAs contend for limited BF training opportunities, i.e., associated BF training (A-BFT) slots, which results in severe collisions and significant BF training latency, especially in dense user scenarios. In t… ▽ More Beamforming (BF) training is crucial to establishing reliable millimeter-wave communication connections between stations (STAs) and an access point. In IEEE 802.11ad BF training protocol, all STAs contend for limited BF training opportunities, i.e., associated BF training (A-BFT) slots, which results in severe collisions and significant BF training latency, especially in dense user scenarios. In this paper, we first develop an analytical model to evaluate the BF training protocol performance. Our analytical model accounts for various protocol components, including user density, the number of A-BFT slots, and protocol parameters, i.e., retry limit and contention window size. We then derive the average successful BF training probability, the BF training efficiency and latency. Since the derived BF training efficiency is an implicit function, to reveal the relationship between system parameters and BF training performance, we also derive an approximate expression of BF training efficiency. Theoretical analysis indicates that the BF training efficiency degrades drastically in dense user scenarios. To address this issue, we propose an enhancement scheme which adaptively adjusts the protocol parameters in tune with user density, to improve the BF training performance in dense user scenarios. Extensive simulations are carried out to validate the accuracy of the developed analytical model. In addition, simulation results show that the proposed enhancement scheme can improve the BF training efficiency by 35% in dense user scenarios. △ Less

Submitted 1 January, 2023; originally announced January 2023.

Comments: The paper is accepted by IEEE Transactions on Vehicular Technology (TVT)

arXiv:2301.00130 [pdf, other]

Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning

Authors: Wen Wu, Peng Yang, Weiting Zhang, Conghao Zhou, Xuemin, Shen

Abstract: Collaboration among industrial Internet of Things (IoT) devices and edge networks is essential to support computation-intensive deep neural network (DNN) inference services which require low delay and high accuracy. Sampling rate adaption which dynamically configures the sampling rates of industrial IoT devices according to network conditions, is the key in minimizing the service delay. In this pa… ▽ More Collaboration among industrial Internet of Things (IoT) devices and edge networks is essential to support computation-intensive deep neural network (DNN) inference services which require low delay and high accuracy. Sampling rate adaption which dynamically configures the sampling rates of industrial IoT devices according to network conditions, is the key in minimizing the service delay. In this paper, we investigate the collaborative DNN inference problem in industrial IoT networks. To capture the channel variation and task arrival randomness, we formulate the problem as a constrained Markov decision process (CMDP). Specifically, sampling rate adaption, inference task offloading and edge computing resource allocation are jointly considered to minimize the average service delay while guaranteeing the long-term accuracy requirements of different inference services. Since CMDP cannot be directly solved by general reinforcement learning (RL) algorithms due to the intractable long-term constraints, we first transform the CMDP into an MDP by leveraging the Lyapunov optimization technique. Then, a deep RL-based algorithm is proposed to solve the MDP. To expedite the training process, an optimization subroutine is embedded in the proposed algorithm to directly obtain the optimal edge computing resource allocation. Extensive simulation results are provided to demonstrate that the proposed RL-based algorithm can significantly reduce the average service delay while preserving long-term inference accuracy with a high probability. △ Less

Submitted 31 December, 2022; originally announced January 2023.

Comments: Accpeted by Transaction on Industrial Informatics (TII)

arXiv:2212.06397 [pdf, other]

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

Authors: Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang

Abstract: Cross-speaker style transfer in speech synthesis aims at transferring a style from source speaker to synthesised speech of a target speaker's timbre. Most previous approaches rely on data with style labels, but manually-annotated labels are expensive and not always reliable. In response to this problem, we propose Style-Label-Free, a cross-speaker style transfer method, which can realize the style… ▽ More Cross-speaker style transfer in speech synthesis aims at transferring a style from source speaker to synthesised speech of a target speaker's timbre. Most previous approaches rely on data with style labels, but manually-annotated labels are expensive and not always reliable. In response to this problem, we propose Style-Label-Free, a cross-speaker style transfer method, which can realize the style transfer from source speaker to target speaker without style labels. Firstly, a reference encoder structure based on quantized variational autoencoder (Q-VAE) and style bottleneck is designed to extract discrete style representations. Secondly, a speaker-wise batch normalization layer is proposed to reduce the source speaker leakage. In order to improve the style extraction ability of the reference encoder, a style invariant and contrastive data augmentation method is proposed. Experimental results show that the method outperforms the baseline. We provide a website with audio samples. △ Less

Submitted 13 December, 2022; originally announced December 2022.

Comments: Published to ISCSLP 2022

arXiv:2211.09495 [pdf, other]

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

Authors: Chunyu Qiang, Peng Yang, Hao Che, **ba Xiao, Xiaorui Wang, Zhongyuan Wang

Abstract: Conversion of Chinese Grapheme-to-Phoneme (G2P) plays an important role in Mandarin Chinese Text-To-Speech (TTS) systems, where one of the biggest challenges is the task of polyphone disambiguation. Most of the previous polyphone disambiguation models are trained on manually annotated datasets, and publicly available datasets for polyphone disambiguation are scarce. In this paper we propose a simp… ▽ More Conversion of Chinese Grapheme-to-Phoneme (G2P) plays an important role in Mandarin Chinese Text-To-Speech (TTS) systems, where one of the biggest challenges is the task of polyphone disambiguation. Most of the previous polyphone disambiguation models are trained on manually annotated datasets, and publicly available datasets for polyphone disambiguation are scarce. In this paper we propose a simple back-translation-style data augmentation method for mandarin Chinese polyphone disambiguation, utilizing a large amount of unlabeled text data. Inspired by the back-translation technique proposed in the field of machine translation, we build a Grapheme-to-Phoneme (G2P) model to predict the pronunciation of polyphonic character, and a Phoneme-to-Grapheme (P2G) model to predict pronunciation into text. Meanwhile, a window-based matching strategy and a multi-model scoring strategy are proposed to judge the correctness of the pseudo-label. We design a data balance strategy to improve the accuracy of some typical polyphonic characters in the training set with imbalanced distribution or data scarcity. The experimental result shows the effectiveness of the proposed back-translation-style data augmentation method. △ Less

Submitted 17 November, 2022; originally announced November 2022.

Comments: Published to APSIPA ASC 2022

arXiv:2210.17305 [pdf, other]

AccentSpeech: Learning Accent from Crowd-sourced Data for Target Speaker TTS with Accents

Authors: Yongmao Zhang, Zhichao Wang, Peiji Yang, Hongshen Sun, Zhisheng Wang, Lei Xie

Abstract: Learning accent from crowd-sourced data is a feasible way to achieve a target speaker TTS system that can synthesize accent speech. To this end, there are two challenging problems to be solved. First, direct use of the poor acoustic quality crowd-sourced data and the target speaker data in accent transfer will apparently lead to synthetic speech with degraded quality. To mitigate this problem, we… ▽ More Learning accent from crowd-sourced data is a feasible way to achieve a target speaker TTS system that can synthesize accent speech. To this end, there are two challenging problems to be solved. First, direct use of the poor acoustic quality crowd-sourced data and the target speaker data in accent transfer will apparently lead to synthetic speech with degraded quality. To mitigate this problem, we take a bottleneck feature (BN) based TTS approach, in which TTS is decomposed into a Text-to-BN (T2BN) module to learn accent and a BN-to-Mel (BN2Mel) module to learn speaker timbre, where neural network based BN feature serves as the intermediate representation that are robust to noise interference. Second, direct training T2BN using the crowd-sourced data in the two-stage system will produce accent speech of target speaker with poor prosody. This is because the the crowd-sourced recordings are contributed from the ordinary unprofessional speakers. To tackle this problem, we update the two-stage approach to a novel three-stage approach, where T2BN and BN2Mel are trained using the high-quality target speaker data and a new BN-to-BN module is plugged in between the two modules to perform accent transfer. To train the BN2BN module, the parallel unaccented and accented BN features are obtained by a proposed data augmentation procedure. Finally the proposed three-stage approach manages to produce accent speech for the target speaker with good prosody, as the prosody pattern is inherited from the professional target speaker and accent transfer is achieved by the BN2BN module at the same time. The proposed approach, named as AccentSpeech, is validated in a Mandarin TTS accent transfer task. △ Less

Submitted 31 October, 2022; originally announced October 2022.

Comments: Accepted by ISCSLP2022

arXiv:2210.07803 [pdf, other]

An Efficient FPGA Accelerator for Point Cloud

Authors: Zilun Wang, Wendong Mao, Peixiang Yang, Zhongfeng Wang, Jun Lin

Abstract: Deep learning-based point cloud processing plays an important role in various vision tasks, such as autonomous driving, virtual reality (VR), and augmented reality (AR). The submanifold sparse convolutional network (SSCN) has been widely used for the point cloud due to its unique advantages in terms of visual results. However, existing convolutional neural network accelerators suffer from non-triv… ▽ More Deep learning-based point cloud processing plays an important role in various vision tasks, such as autonomous driving, virtual reality (VR), and augmented reality (AR). The submanifold sparse convolutional network (SSCN) has been widely used for the point cloud due to its unique advantages in terms of visual results. However, existing convolutional neural network accelerators suffer from non-trivial performance degradation when employed to accelerate SSCN because of the extreme and unstructured sparsity, and the complex computational dependency between the sparsity of the central activation and the neighborhood ones. In this paper, we propose a high performance FPGA-based accelerator for SSCN. Firstly, we develop a zero removing strategy to remove the coarse-grained redundant regions, thus significantly improving computational efficiency. Secondly, we propose a concise encoding scheme to obtain the matching information for efficient point-wise multiplications. Thirdly, we develop a sparse data matching unit and a computing core based on the proposed encoding scheme, which can convert the irregular sparse operations into regular multiply-accumulate operations. Finally, an efficient hardware architecture for the submanifold sparse convolutional layer is developed and implemented on the Xilinx ZCU102 field-programmable gate array board, where the 3D submanifold sparse U-Net is taken as the benchmark. The experimental results demonstrate that our design drastically improves computational efficiency, and can dramatically improve the power efficiency by 51 times compared to GPU. △ Less

Submitted 14 October, 2022; originally announced October 2022.

Comments: 6 pages, 10 figures, accepted by 2022 IEEE INTERNATIONAL SYSTEM-ON-CHIP conference

arXiv:2203.15488 [pdf, other]

Over-the-Air Federated Learning via Second-Order Optimization

Authors: Peng Yang, Yuning Jiang, Ting Wang, Yong Zhou, Yuanming Shi, Colin N. Jones

Abstract: Federated learning (FL) is a promising learning paradigm that can tackle the increasingly prominent isolated data islands problem while kee** users' data locally with privacy and security guarantees. However, FL could result in task-oriented data traffic flows over wireless networks with limited radio resources. To design communication-efficient FL, most of the existing studies employ the first-… ▽ More Federated learning (FL) is a promising learning paradigm that can tackle the increasingly prominent isolated data islands problem while kee** users' data locally with privacy and security guarantees. However, FL could result in task-oriented data traffic flows over wireless networks with limited radio resources. To design communication-efficient FL, most of the existing studies employ the first-order federated optimization approach that has a slow convergence rate. This however results in excessive communication rounds for local model updates between the edge devices and edge server. To address this issue, in this paper, we instead propose a novel over-the-air second-order federated optimization algorithm to simultaneously reduce the communication rounds and enable low-latency global model aggregation. This is achieved by exploiting the waveform superposition property of a multi-access channel to implement the distributed second-order optimization algorithm over wireless networks. The convergence behavior of the proposed algorithm is further characterized, which reveals a linear-quadratic convergence rate with an accumulative error term in each iteration. We thus propose a system optimization approach to minimize the accumulated error gap by joint device selection and beamforming design. Numerical results demonstrate the system and communication efficiency compared with the state-of-the-art approaches. △ Less

Submitted 29 March, 2022; originally announced March 2022.

Comments: 30 pages, 9 figures

arXiv:2203.02794 [pdf]

Machine Learning Applications in Lung Cancer Diagnosis, Treatment and Prognosis

Authors: Yawei Li, Xin Wu, ** Yang, Guoqian Jiang, Yuan Luo

Abstract: The recent development of imaging and sequencing technologies enables systematic advances in the clinical study of lung cancer. Meanwhile, the human mind is limited in effectively handling and fully utilizing the accumulation of such enormous amounts of data. Machine learning-based approaches play a critical role in integrating and analyzing these large and complex datasets, which have extensively… ▽ More The recent development of imaging and sequencing technologies enables systematic advances in the clinical study of lung cancer. Meanwhile, the human mind is limited in effectively handling and fully utilizing the accumulation of such enormous amounts of data. Machine learning-based approaches play a critical role in integrating and analyzing these large and complex datasets, which have extensively characterized lung cancer through the use of different perspectives from these accrued data. In this article, we provide an overview of machine learning-based approaches that strengthen the varying aspects of lung cancer diagnosis and therapy, including early detection, auxiliary diagnosis, prognosis prediction and immunotherapy practice. Moreover, we highlight the challenges and opportunities for future applications of machine learning in lung cancer. △ Less

Submitted 25 March, 2022; v1 submitted 5 March, 2022; originally announced March 2022.

arXiv:2109.12514 [pdf, other]

Approaching the Transient Stability Boundary of a Power System: Theory and Applications

Authors: Peng Yang, Feng Liu, Wei Wei, Zhaojian Wang

Abstract: Estimating the stability boundary is a fundamental and challenging problem in transient stability studies. It is known that a proper level set of a Lyapunov function or an energy function can provide an inner approximation of the stability boundary, and the estimation can be expanded by trajectory reversing methods. In this paper, we streamline the theoretical foundation of the expansion methodolo… ▽ More Estimating the stability boundary is a fundamental and challenging problem in transient stability studies. It is known that a proper level set of a Lyapunov function or an energy function can provide an inner approximation of the stability boundary, and the estimation can be expanded by trajectory reversing methods. In this paper, we streamline the theoretical foundation of the expansion methodology, and generalize it by relaxing the request that the initial guess should be a subset of the stability region. We investigate topological characteristics of the expanded boundary, showing how an initial guess can approach the exact stability boundary locally or globally. We apply the theory to transient stability assessment, and propose expansion algorithms to improve the well-known Potential Energy Boundary Surface (PEBS) and Boundary of stability region based Controlling Unstable equilibrium point (BCU) methods. Case studies on the IEEE 39-bus system well verify our results and demonstrate that estimations of the stability boundary and the critical clearing time can be significantly improved with modest computational cost. △ Less

Submitted 30 September, 2021; v1 submitted 26 September, 2021; originally announced September 2021.

arXiv:2106.13166 [pdf, other]

Augmented Synchronization of Power Systems

Authors: Peng Yang, Feng Liu, Tao Liu, David J. Hill

Abstract: Power system transient stability has been translated into a Lyapunov stability problem of the post-disturbance equilibrium for decades. Despite substantial results, conventional theories suffer from the stringent requirement of knowing the post-disturbance equilibrium a priori. In contrast, the wisdom from practice, which certificates stability by only the observation of converging frequencies and… ▽ More Power system transient stability has been translated into a Lyapunov stability problem of the post-disturbance equilibrium for decades. Despite substantial results, conventional theories suffer from the stringent requirement of knowing the post-disturbance equilibrium a priori. In contrast, the wisdom from practice, which certificates stability by only the observation of converging frequencies and voltages, seems to provide an equilibrium-independent approach. Here, we formulate the empirical wisdom by the concept of augmented synchronization and aim to bridge such a theory-practice gap. First, we derive conditions under which the convergence to augmented synchronization implies the convergence to the equilibrium set, laying the first theoretical foundation for the empirical wisdom. Then, we reveal from what initial values the power system can achieve augmented synchronization. Our results open the possibility of an equilibrium-independent power system stability analytic that re-defines the nominal motion as augmented synchronization rather than certain equilibrium. Single-machine examples and the IEEE 9-bus system well verify our results and illustrate promising implications. △ Less

Submitted 18 September, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

Comments: 14 pages, 9 figures

arXiv:2102.12915 [pdf, ps, other]

doi 10.1109/JSTSP.2021.3121878

Fresh, Fair and Energy-Efficient Content Provision in a Private and Cache-Enabled UAV Network

Authors: Peng Yang, Kun Guo, Xing Xi, Tony Q. S. Quek, Xianbin Cao, Chenxi Liu

Abstract: In this paper, we investigate a private and cache-enabled unmanned aerial vehicle (UAV) network for content provision. Aiming at delivering fresh, fair, and energy-efficient content files to terrestrial users, we formulate a joint UAV caching, UAV trajectory, and UAV transmit power optimization problem. This problem is confirmed to be a sequential decision problem with mixed-integer non-convex con… ▽ More In this paper, we investigate a private and cache-enabled unmanned aerial vehicle (UAV) network for content provision. Aiming at delivering fresh, fair, and energy-efficient content files to terrestrial users, we formulate a joint UAV caching, UAV trajectory, and UAV transmit power optimization problem. This problem is confirmed to be a sequential decision problem with mixed-integer non-convex constraints, which is intractable directly. To this end, we propose a novel algorithm based on the techniques of subproblem decomposition and convex approximation. Particularly, we first propose to decompose the sequential decision problem into multiple repeated optimization subproblems via a Lyapunov technique. Next, an iterative optimization scheme incorporating a successive convex approximation (SCA) technique is explored to tackle the challenging mixed-integer non-convex subproblems. Besides, we analyze the convergence and computational complexity of the proposed algorithm and derive the theoretical value of the expected peak age of information (PAoI) to estimate the content freshness. Simulation results demonstrate that the proposed algorithm can achieve the expected PAoI close to the theoretical value and is more 22.11% and 70.51% energy-efficient and fairer than benchmark algorithms. △ Less

Submitted 26 February, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

arXiv:2010.01471 [pdf, ps, other]

Deep Reinforcement Learning for Delay-Oriented IoT Task Scheduling in Space-Air-Ground Integrated Network

Authors: Conghao Zhou, Wen Wu, Hongli He, Peng Yang, Feng Lyu, Nan Cheng, Xuemin, Shen

Abstract: In this paper, we investigate a computing task scheduling problem in space-air-ground integrated network (SAGIN) for delay-oriented Internet of Things (IoT) services. In the considered scenario, an unmanned aerial vehicle (UAV) collects computing tasks from IoT devices and then makes online offloading decisions, in which the tasks can be processed at the UAV or offloaded to the nearby base station… ▽ More In this paper, we investigate a computing task scheduling problem in space-air-ground integrated network (SAGIN) for delay-oriented Internet of Things (IoT) services. In the considered scenario, an unmanned aerial vehicle (UAV) collects computing tasks from IoT devices and then makes online offloading decisions, in which the tasks can be processed at the UAV or offloaded to the nearby base station or the remote satellite. Our objective is to design a task scheduling policy that minimizes offloading and computing delay of all tasks given the UAV energy capacity constraint. To this end, we first formulate the online scheduling problem as an energy-constrained Markov decision process (MDP). Then, considering the task arrival dynamics, we develop a novel deep risk-sensitive reinforcement learning algorithm. Specifically, the algorithm evaluates the risk, which measures the energy consumption that exceeds the constraint, for each state and searches the optimal parameter weighing the minimization of delay and risk while learning the optimal policy. Extensive simulation results demonstrate that the proposed algorithm can reduce the task processing delay by up to 30% compared to probabilistic configuration methods while satisfying the UAV energy capacity constraint. △ Less

Submitted 3 October, 2020; originally announced October 2020.

Comments: 14 pages, 8 figures

arXiv:2008.05826 [pdf, other]

Localizing the Common Action Among a Few Videos

Authors: Pengwan Yang, Vincent Tao Hu, Pascal Mettes, Cees G. M. Snoek

Abstract: This paper strives to localize the temporal extent of an action in a long untrimmed video. Where existing work leverages many examples with their start, their ending, and/or the class of the action during training time, we propose few-shot common action localization. The start and end of an action in a long untrimmed video is determined based on just a hand-full of trimmed video examples containin… ▽ More This paper strives to localize the temporal extent of an action in a long untrimmed video. Where existing work leverages many examples with their start, their ending, and/or the class of the action during training time, we propose few-shot common action localization. The start and end of an action in a long untrimmed video is determined based on just a hand-full of trimmed video examples containing the same action, without knowing their common class label. To address this task, we introduce a new 3D convolutional network architecture able to align representations from the support videos with the relevant query video segments. The network contains: (\textit{i}) a mutual enhancement module to simultaneously complement the representation of the few trimmed support videos and the untrimmed query video; (\textit{ii}) a progressive alignment module that iteratively fuses the support videos into the query branch; and (\textit{iii}) a pairwise matching module to weigh the importance of different support videos. Evaluation of few-shot common action localization in untrimmed videos containing a single or multiple action instances demonstrates the effectiveness and general applicability of our proposal. △ Less

Submitted 25 August, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

Comments: ECCV 2020

arXiv:2007.00857 [pdf, other]

doi 10.1109/TVT.2020.3000757

Efficient Hybrid Beamforming with Anti-Blockage Design for High-Speed Railway Communications

Authors: Meilin Gao, Bo Ai, Yong Niu, Wen Wu, Peng Yang, Feng Lyu, Xuemin, Shen

Abstract: Future railway is expected to accommodate both train operation services and passenger broadband services. The millimeter wave (mmWave) communication is a promising technology in providing multi-gigabit data rates to onboard users. However, mmWave communications suffer from severe propagation attenuation and vulnerability to blockage, which can be very challenging in high-speed railway (HSR) scenar… ▽ More Future railway is expected to accommodate both train operation services and passenger broadband services. The millimeter wave (mmWave) communication is a promising technology in providing multi-gigabit data rates to onboard users. However, mmWave communications suffer from severe propagation attenuation and vulnerability to blockage, which can be very challenging in high-speed railway (HSR) scenarios. In this paper, we investigate efficient hybrid beamforming (HBF) design for train-to-ground communications. First, we develop a two-stage HBF algorithm in blockage-free scenarios. In the first stage, the minimum mean square error method is adopted for optimal hybrid beamformer design with low complexity and fast convergence; in the second stage, the orthogonal matching pursuit method is utilized to approximately recover the analog and digital beamformers. Second, in blocked scenarios, we design an anti-blockage scheme by adaptively invoking the proposed HBF algorithm, which can efficiently deal with random blockages. Extensive simulation results are presented to show the sum rate performance of the proposed algorithms under various configurations, including transmission power, velocity of the train, blockage probability, etc. It is demonstrated that the proposed anti-blockage algorithm can improve the effective rate by 20% in severely-blocked scenarios while maintaining low outage probability. △ Less

Submitted 1 July, 2020; originally announced July 2020.

Comments: 11 Pages, 9 Figures

Journal ref: IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020

arXiv:2006.04648 [pdf, other]

doi 10.1109/TMM.2021.3082292

Graph-based Visual-Semantic Entanglement Network for Zero-shot Image Recognition

Authors: Yang Hu, Guihua Wen, Adriane Chapman, Pei Yang, Mingnan Luo, Yingxue Xu, Dan Dai, Wendy Hall

Abstract: Zero-shot learning uses semantic attributes to connect the search space of unseen objects. In recent years, although the deep convolutional network brings powerful visual modeling capabilities to the ZSL task, its visual features have severe pattern inertia and lack of representation of semantic relationships, which leads to severe bias and ambiguity. In response to this, we propose the Graph-base… ▽ More Zero-shot learning uses semantic attributes to connect the search space of unseen objects. In recent years, although the deep convolutional network brings powerful visual modeling capabilities to the ZSL task, its visual features have severe pattern inertia and lack of representation of semantic relationships, which leads to severe bias and ambiguity. In response to this, we propose the Graph-based Visual-Semantic Entanglement Network to conduct graph modeling of visual features, which is mapped to semantic attributes by using a knowledge graph, it contains several novel designs: 1. it establishes a multi-path entangled network with the convolutional neural network (CNN) and the graph convolutional network (GCN), which input the visual features from CNN to GCN to model the implicit semantic relations, then GCN feedback the graph modeled information to CNN features; 2. it uses attribute word vectors as the target for the graph semantic modeling of GCN, which forms a self-consistent regression for graph modeling and supervise GCN to learn more personalized attribute relations; 3. it fuses and supplements the hierarchical visual-semantic features refined by graph modeling into visual embedding. Our method outperforms state-of-the-art approaches on multiple representative ZSL datasets: AwA2, CUB, and SUN by promoting the semantic linkage modelling of visual features. △ Less

Submitted 11 June, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

Comments: 15 pages, 11 figures, on IEEE Transactions on Multimedia

Journal ref: [J]. IEEE Transactions on Multimedia, 2021

arXiv:2004.00753 [pdf]

Image Denoising Using Sparsifying Transform Learning and Weighted Singular Values Minimization

Authors: Yanwei Zhao, ** Yang, Qiu Guan, Jianwei Zheng, Wanliang Wang

Abstract: In image denoising (IDN) processing, the low-rank property is usually considered as an important image prior. As a convex relaxation approximation of low rank, nuclear norm based algorithms and their variants have attracted significant attention. These algorithms can be collectively called image domain based methods, whose common drawback is the requirement of great number of iterations for some a… ▽ More In image denoising (IDN) processing, the low-rank property is usually considered as an important image prior. As a convex relaxation approximation of low rank, nuclear norm based algorithms and their variants have attracted significant attention. These algorithms can be collectively called image domain based methods, whose common drawback is the requirement of great number of iterations for some acceptable solution. Meanwhile, the sparsity of images in a certain transform domain has also been exploited in image denoising problems. Sparsity transform learning algorithms can achieve extremely fast computations as well as desirable performance. By taking both advantages of image domain and transform domain in a general framework, we propose a sparsity transform learning and weighted singular values minimization method (STLWSM) for IDN problems. The proposed method can make full use of the preponderance of both domains. For solving the non-convex cost function, we also present an efficient alternative solution for acceleration. Experimental results show that the proposed STLWSM achieves improvement both visually and quantitatively with a large margin over state-of-the-art approaches based on an alternatively single domain. It also needs much less iteration than all the image domain algorithms. △ Less

Submitted 1 April, 2020; originally announced April 2020.

Comments: 17 pages, 10 figures, 5 tables

arXiv:2003.06309 [pdf, other]

doi 10.1109/TMC.2020.2976936

BuildSenSys: Reusing Building Sensing Data for Traffic Prediction with Cross-domain Learning

Authors: Xiaochen Fan, Chaocan Xiang, Chao Chen, Panlong Yang, Liangyi Gong, Xudong Song, Priyadarsi Nanda, Xiangjian He

Abstract: With the rapid development of smart cities, smart buildings are generating a massive amount of building sensing data by the equipped sensors. Indeed, building sensing data provides a promising way to enrich a series of data-demanding and cost-expensive urban mobile applications. In this paper, we study how to reuse building sensing data to predict traffic volume on nearby roads. Nevertheless, it i… ▽ More With the rapid development of smart cities, smart buildings are generating a massive amount of building sensing data by the equipped sensors. Indeed, building sensing data provides a promising way to enrich a series of data-demanding and cost-expensive urban mobile applications. In this paper, we study how to reuse building sensing data to predict traffic volume on nearby roads. Nevertheless, it is non-trivial to achieve accurate prediction on such cross-domain data with two major challenges. First, relationships between building sensing data and traffic data are not unknown as prior, and the spatio-temporal complexities impose more difficulties to uncover the underlying reasons behind the above relationships. Second, it is even more daunting to accurately predict traffic volume with dynamic building-traffic correlations, which are cross-domain, non-linear, and time-varying. To address the above challenges, we design and implement BuildSenSys, a first-of-its-kind system for nearby traffic volume prediction by reusing building sensing data. First, we conduct a comprehensive building-traffic analysis based on multi-source datasets, disclosing how and why building sensing data is correlated with nearby traffic volume. Second, we propose a novel recurrent neural network for traffic volume prediction based on cross-domain learning with two attention mechanisms. Specifically, a cross-domain attention mechanism captures the building-traffic correlations and adaptively extracts the most relevant building sensing data at each predicting step. Then, a temporal attention mechanism is employed to model the temporal dependencies of data across historical time intervals. The extensive experimental studies demonstrate that BuildSenSys outperforms all baseline methods with up to 65.3% accuracy improvement (e.g., 2.2% MAPE) in predicting nearby traffic volume. △ Less

Submitted 11 March, 2020; originally announced March 2020.

Comments: 17 pages; 17 figures. in IEEE Transactions on Mobile Computing

Journal ref: IEEE Transactions on Mobile Computing, 2020, Early Access

arXiv:2002.09194 [pdf, ps, other]

Multicast eMBB and Bursty URLLC Service Multiplexing in a CoMP-Enabled RAN

Authors: Peng Yang, Xing Xi, Yaru Fu, Tony Q. S. Quek, Xianbin Cao, Dapeng Wu

Abstract: This paper is concerned with slicing a radio access network (RAN) for simultaneously serving two typical 5G and beyond use cases, i.e., enhanced mobile broadband (eMBB) and ultra-reliable and low latency communications (URLLC). Although many researches have been conducted to tackle this issue, few of them have considered the impact of bursty URLLC. The bursty characteristic of URLLC traffic may si… ▽ More This paper is concerned with slicing a radio access network (RAN) for simultaneously serving two typical 5G and beyond use cases, i.e., enhanced mobile broadband (eMBB) and ultra-reliable and low latency communications (URLLC). Although many researches have been conducted to tackle this issue, few of them have considered the impact of bursty URLLC. The bursty characteristic of URLLC traffic may significantly increase the difficulty of RAN slicing on the aspect of ensuring a ultra-low packet blocking probability. To reduce the packet blocking probability, we re-visit the structure of physical resource blocks (PRBs) orchestrated for bursty URLLC traffic in the time-frequency plane based on our theoretical results. Meanwhile, we formulate the problem of slicing a RAN enabling coordinated multi-point (CoMP) transmissions for multicast eMBB and bursty URLLC service multiplexing as a multi-timescale optimization problem. The goal of this problem is to maximize multicast eMBB and bursty URLLC slice utilities, subject to physical resource constraints. To mitigate this thorny multi-timescale problem, we transform it into multiple single timescale problems by exploring the fundamental principle of a sample average approximation (SAA) technique. Next, an iterative algorithm with provable performance guarantees is developed to obtain solutions to these single timescale problems and aggregate the obtained solutions into those of the multi-timescale problem. We also design a prototype for the CoMP-enabled RAN slicing system incorporating with multicast eMBB and bursty URLLC traffic and compare the proposed iterative algorithm with the state-of-the-art algorithm to verify the effectiveness of the algorithm. △ Less

Submitted 21 February, 2020; originally announced February 2020.

arXiv:2001.04161 [pdf, ps, other]

RAN Slicing for Massive IoT and Bursty URLLC Service Multiplexing: Analysis and Optimization

Authors: Peng Yang, Xing Xi, Tony Q. S. Quek, **gxuan Chen, Xianbin Cao, Dapeng Wu

Abstract: Future wireless networks are envisioned to serve massive Internet of things (mIoT) via some radio access technologies, where the random access channel (RACH) procedure should be exploited for IoT devices to access the networks. However, the theoretical analysis of the RACH procedure for massive IoT devices is challenging. To address this challenge, we first correlate the RACH request of an IoT dev… ▽ More Future wireless networks are envisioned to serve massive Internet of things (mIoT) via some radio access technologies, where the random access channel (RACH) procedure should be exploited for IoT devices to access the networks. However, the theoretical analysis of the RACH procedure for massive IoT devices is challenging. To address this challenge, we first correlate the RACH request of an IoT device with the status of its maintained queue and analyze the evolution of the queue status. Based on the analysis result, we then derive the closed-form expression of the random access (RA) success probability, which is a significant indicator characterizing the RACH procedure of the device. Besides, considering the agreement on converging different services onto a shared infrastructure, we investigate the RAN slicing for mIoT and bursty ultra-reliable and low latency communications (URLLC) service multiplexing. Specifically, we formulate the RAN slicing problem as an optimization one to maximize the total RA success probabilities of all IoT devices and provide URLLC services for URLLC devices in an energy-efficient way. A slice resource optimization (SRO) algorithm exploiting relaxation and approximation with provable tightness and error bound is then proposed to mitigate the optimization problem. Simulation results demonstrate that the proposed SRO algorithm can effectively implement the service multiplexing of mIoT and bursty URLLC traffic. △ Less

Submitted 29 January, 2021; v1 submitted 13 January, 2020; originally announced January 2020.

arXiv:1912.06493 [pdf, other]

GuardRider: Towards Sustainable Backscattering System over WiFi in the Wild

Authors: Xin He, Weiwei Jiang, Meng Cheng, Xiaobo Zhou, Peng-Jun Wan, Panlong Yang, Brian Kurkoski

Abstract: The WiFi backscatter communications offer ultra-low power and ubiquitous connections for IoT systems. Caused by the intermittent-nature of the WiFi traffics, state-of-the-art WiFi backscatter communications are not reliable for backscatter link or simple for tag to do adaptive transmission. In order to build sustainable (reliable and simple) WiFi backscatter communications, we present GuardRider,… ▽ More The WiFi backscatter communications offer ultra-low power and ubiquitous connections for IoT systems. Caused by the intermittent-nature of the WiFi traffics, state-of-the-art WiFi backscatter communications are not reliable for backscatter link or simple for tag to do adaptive transmission. In order to build sustainable (reliable and simple) WiFi backscatter communications, we present GuardRider, a WiFi backscatter system that enables backscatter communications riding on WiFi signals in the wild. The key contribution of GuardRider is an optimization algorithm of designing RS codes to follow the statistical knowledge of WiFi traffics and adjust backscatter transmission. With GuardRider, the reliable baskscatter link is guaranteed and a backscatter tag is able to adaptively transmit information without heavily listening the excitation channel. We built a hardware prototype of GuardRider using a customized tag with FPGA implementation. Both the simulations and field experiments verify that GuardRider could achieve a notably gains in bit error rate and frame error rate, which are hundredfold reduction in simulations and around 99% in filed experiments. △ Less

Submitted 12 December, 2019; originally announced December 2019.

arXiv:1912.02483 [pdf, ps, other]

doi 10.1109/TNS.2020.2985071

ROI-Wise Material Decomposition in Spectral Photon-Counting CT

Authors: Bingqing Xie, Pei Niu, Ting Su, Valérie Kaftandjian, Loic Boussel, Philippe Douek Feng Yang, Philippe Duvauchelle, Yuemin Zhu

Abstract: Spectral photon-counting X-ray CT (sCT) opens up new possibilities for the quantitative measurement of materials in an object, compared to conventional energy-integrating CT or dual energy CT. However, achieving reliable and accurate material decomposition in sCT is extremely challenging, due to similarity between different basis materials, strong quantum noise and photon-counting detector limitat… ▽ More Spectral photon-counting X-ray CT (sCT) opens up new possibilities for the quantitative measurement of materials in an object, compared to conventional energy-integrating CT or dual energy CT. However, achieving reliable and accurate material decomposition in sCT is extremely challenging, due to similarity between different basis materials, strong quantum noise and photon-counting detector limitations. We propose a novel material decomposition method that works in a region-wise manner. The method consists in optimizing basis materials based on spatio-energy segmentation of regions-of-interests (ROIs) in sCT images and performing a fine material decomposition involving optimized decomposition matrix and sparsity regularization. The effectiveness of the proposed method was validated on both digital and physical data. The results showed that the proposed ROI-wise material decomposition method presents clearly higher reliability and accuracy compared to common decomposition methods based on total variation (TV) or L1-norm (lasso) regularization. △ Less

Submitted 5 December, 2019; originally announced December 2019.

arXiv:1912.00799 [pdf, other]

doi 10.1109/TIM.2020.3036654

A CNN-LSTM Hybrid Framework for Wrist Kinematics Estimation Using Surface Electromyography

Authors: Tianzhe Bao, Syed Ali Raza Zaidi, Shengquan Xie, Pengfei Yang, Zhiqiang Zhang

Abstract: Convolutional neural network (CNN) has been widely exploited for simultaneous and proportional myoelectric control due to its capability of deriving informative, representative and transferable features from surface electromyography (sEMG). However, muscle contractions have strong temporal dependencies but conventional CNN can only exploit spatial correlations. Considering that long short-term mem… ▽ More Convolutional neural network (CNN) has been widely exploited for simultaneous and proportional myoelectric control due to its capability of deriving informative, representative and transferable features from surface electromyography (sEMG). However, muscle contractions have strong temporal dependencies but conventional CNN can only exploit spatial correlations. Considering that long short-term memory neural network (LSTM) is able to capture long-term and non-linear dynamics of time-series data, in this paper we propose a CNNLSTM hybrid framework to fully explore the temporal-spatial information in sEMG. Firstly, CNN is utilized to extract deep features from sEMG spectrum, then these features are processed via LSTM-based sequence regression to estimate wrist kinematics. Six healthy participants are recruited for the participatory collection and motion analysis under various experimental setups. Estimation results in both intra-session and inter-session evaluations illustrate that CNN-LSTM significantly outperforms CNN and conventional machine learning approaches, particularly when complex wrist movements are activated. △ Less

Submitted 2 January, 2020; v1 submitted 28 November, 2019; originally announced December 2019.

arXiv:1912.00579 [pdf, ps, other]

How Should I Orchestrate Resources of My Slices for Bursty URLLC Service Provision?

Authors: Peng Yang, Xing Xi, Tony Q. S. Quek, **gxuan Chen, Xianbin Cao, Dapeng Wu

Abstract: Future wireless networks are convinced to provide flexible and cost-efficient services via exploiting network slicing techniques. However, it is challenging to configure network slicing systems for bursty ultra-reliable and low latency communications (URLLC) service provision due to its stringent requirements on low packet blocking probability and low codeword error decoding probability. In this p… ▽ More Future wireless networks are convinced to provide flexible and cost-efficient services via exploiting network slicing techniques. However, it is challenging to configure network slicing systems for bursty ultra-reliable and low latency communications (URLLC) service provision due to its stringent requirements on low packet blocking probability and low codeword error decoding probability. In this paper, we propose to orchestrate network resources for a network slicing system to guarantee a more reliable bursty URLLC service provision. We re-cut physical resource blocks (PRBs) and derive the minimum upper bound of bandwidth for URLLC transmission with a low packet blocking probability. We correlate coordinated multipoint (CoMP) beamforming with channel uses and derive the minimum upper bound of channel uses for URLLC transmission with a low codeword error decoding probability. Considering the agreement on converging diverse services onto shared infrastructures, we further investigate the network slicing for URLLC and enhanced mobile broadband (eMBB) service multiplexing. Particularly, we formulate the service multiplexing as an optimization problem to maximize the long-term total slice utility. The mitigation of this problem is challenging due to the requirements of future channel information and tackling a two timescale issue. To address the challenges, we develop a joint resource optimization algorithm based on a sample average approximate (SAA) technique and a distributed optimization method with provable performance guarantees. △ Less

Submitted 7 November, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

arXiv:1911.11417 [pdf, other]

COOK: Chirp-OOK Communication with Self-reliant Bitrate Adaptation in Backscatter Networks

Authors: Gang Huang, Panlong Yang, Xin He, Yubo Yan, Hao Zhou, Xiangyang Li, Pengjun Wan

Abstract: For large-scale Internet of Things (IoT), backscatter communication is a promising technology to reduce power consumption and simplify deployment. However, backscatter communication lacks stability, along with limited communication range within a few meters. Due to the limited computation ability of backscatter tags, it is burdensome to effectively adapt the bitrate for the time-varying channel. T… ▽ More For large-scale Internet of Things (IoT), backscatter communication is a promising technology to reduce power consumption and simplify deployment. However, backscatter communication lacks stability, along with limited communication range within a few meters. Due to the limited computation ability of backscatter tags, it is burdensome to effectively adapt the bitrate for the time-varying channel. Thus, backscatter tags are failed to fully utilize the optimal transmission rate. In this paper, we design a system named COOK with self-reliant bitrate adaptation in backscatter communication. Channel symmetry allows backscatter tags to adjust bitrate depending on the received signal strength of the excitation source (ES) without feedback. In addition, the chirp spreading signal is exploited as the ES signal to enable backscatter tags to work under noise floor. Our modulation approach is denoted as Chirp-OOK since the tags reflect the chirp signal by employing the on-off keying modulation. It allows that receiver can decode under the noise floor and the bitrate varies flexibly as the communication range changes. We have implemented the prototype system based on the universal software radio peripheral (USRP) platform. Extensive experiment results demonstrate the effectiveness of the proposed system. Our system provides valid communication distance up to $27m$, which is 7 times as compared with normal backscatter system. The system significantly increases the backscatter communication stability, by supporting bitrate adaptation ranges from 0.33kbps to 1.2Mbps, and guaranteeing the bit error rate (BER) is below 1%. △ Less

Submitted 26 November, 2019; originally announced November 2019.

Comments: 7 pages

arXiv:1910.12199 [pdf, other]

High-Resolution, Respiratory-Resolved Coronary MRA Using a Phyllotaxis-Reordered Variable-Density 3D Cones Trajectory

Authors: Srivathsan P. Koundinyan, Corey A. Baron, Mario O. Malave, Frank Ong, Nii Okai Addy, Joseph Y. Cheng, Phillip C. Yang, Bob S. Hu, Dwight G. Nishimura

Abstract: Purpose: To develop a respiratory-resolved motion-compensation method for free-breathing, high-resolution coronary magnetic resonance angiography using a 3D cones trajectory. Methods: To achieve respiratory-resolved 0.98 mm resolution images in a clinically relevant scan time, we undersample the imaging data with a variable-density 3D cones trajectory. For retrospective motion compensation, tran… ▽ More Purpose: To develop a respiratory-resolved motion-compensation method for free-breathing, high-resolution coronary magnetic resonance angiography using a 3D cones trajectory. Methods: To achieve respiratory-resolved 0.98 mm resolution images in a clinically relevant scan time, we undersample the imaging data with a variable-density 3D cones trajectory. For retrospective motion compensation, translational estimates from 3D image-based navigators (3D iNAVs) are used to bin the imaging data into four phases from end-expiration to end-inspiration. To ensure pseudo-random undersampling within each respiratory phase, we devise a phyllotaxis readout ordering scheme mindful of eddy current artifacts in steady state free precession imaging. Following binning, residual 3D translational motion within each phase is computed using the 3D iNAVs and corrected for in the imaging data. The noise-like aliasing characteristic of the combined phyllotaxis and cones sampling pattern is leveraged in a compressed sensing reconstruction with spatial and temporal regularization to reduce aliasing in each of the respiratory phases. Results: In a volunteer and 5 patients, respiratory motion compensation using the proposed method yields improved image quality compared to non-respiratory-resolved approaches with no motion correction and with 3D translational correction. Qualitative assessment by two cardiologists indicates the superior sharpness of coronary segments reconstructed with the proposed method (P < 0.01). Conclusion: The proposed method better mitigates motion artifacts in free-breathing, high-resolution coronary angiography exams compared to translational correction. △ Less

Submitted 27 October, 2019; originally announced October 2019.

arXiv:1910.12185 [pdf, other]

Unraveling the Effect of Spatial Resolution and Scan Acceleration on 3D Image-Based Navigators for Respiratory Motion Tracking in Coronary MR Angiography

Authors: Srivathsan P. Koundinyan, Joseph Y. Cheng, Mario O. Malave, Phillip C. Yang, Bob S. Hu, Dwight G. Nishimura, Corey A. Baron

Abstract: Purpose: To study the accuracy of motion information extracted from beat-to-beat 3D image-based navigators (3D iNAVs) collected using a variable-density cones trajectory with different combinations of spatial resolutions and scan acceleration factors. Methods: Fully sampled, breath-held 4.4 mm 3D iNAV datasets for six respiratory phases are acquired in a volunteer. Ground truth translational and… ▽ More Purpose: To study the accuracy of motion information extracted from beat-to-beat 3D image-based navigators (3D iNAVs) collected using a variable-density cones trajectory with different combinations of spatial resolutions and scan acceleration factors. Methods: Fully sampled, breath-held 4.4 mm 3D iNAV datasets for six respiratory phases are acquired in a volunteer. Ground truth translational and nonrigid motion information is derived from these datasets. Subsequently, the motion estimates from synthesized undersampled 3D iNAVs with isotropic spatial resolutions of 4.4 mm (acceleration factor = 10.9), 5.4 mm (acceleration factor = 7.2), 6.4 mm (acceleration factor = 4.2), and 7.8 mm (acceleration factor = 2.9) are assessed against the ground truth information. The undersampled 3D iNAV configuration with the highest accuracy motion estimates in simulation is then compared with the originally proposed 4.4 mm undersampled 3D iNAV in six volunteer studies. Results: The simulations indicate that for navigators beyond certain scan acceleration factors, the accuracy of motion estimates is compromised due to errors from residual aliasing and blurring/smoothening effects following compressed sensing reconstruction. The 6.4 mm 3D iNAV achieves an acceptable spatial resolution with a small acceleration factor, resulting in the highest accuracy motion information among all assessed undersampled 3D iNAVs. Reader scores for six volunteer studies demonstrate superior coronary vessel sharpness when applying an autofocusing nonrigid correction technique using the 6.4 mm 3D iNAVs in place of 4.4 mm 3D iNAVs. Conclusion: Undersampled 6.4 mm 3D iNAVs enable motion tracking with improved accuracy relative to previously proposed undersampled 4.4 mm 3D iNAVs. △ Less

Submitted 27 October, 2019; originally announced October 2019.

arXiv:1910.07992 [pdf, other]

Dual-Domain Fusion Convolutional Neural Network for Contrast Enhancement Forensics

Authors: Pengpeng Yang, Rongrong Ni, Yao Zhao, Gang Cao, Wei Zhao

Abstract: Contrast enhancement (CE) forensics techniques have always been of great interest for image forensics community, as they can be an effective tool for recovering image history and identifying tampered images. Although several CE forensic algorithms have been proposed, their accuracy and robustness against some kinds of processing are still unsatisfactory. In order to attenuate such deficiency, in t… ▽ More Contrast enhancement (CE) forensics techniques have always been of great interest for image forensics community, as they can be an effective tool for recovering image history and identifying tampered images. Although several CE forensic algorithms have been proposed, their accuracy and robustness against some kinds of processing are still unsatisfactory. In order to attenuate such deficiency, in this paper we propose a new framework based on dual-domain fusion convolutional neural network to fuse the features of pixel and histogram domains for CE forensics. Specifically, we first present a pixel-domain convolutional neural network (P-CNN) to automatically capture the patterns of contrast-enhanced images in the pixel domain. Then, we present a histogram-domain convolutional neural network (H-CNN) to extract the features in the histogram domain. The feature representations of pixel and histogram domains are fused and fed into two fully connected layers for the classification of contrast-enhanced images. Experimental results show that the proposed method achieve better performance and is robust against pre-JPEG compression and anti-forensics attacks. In addition, a strategy for performance improvement of CNN-based forensics is explored, which could provide guidance for the design of CNN-based forensics tools. △ Less

Submitted 17 October, 2019; originally announced October 2019.

Comments: 24 pages

arXiv:1909.03313 [pdf, other]

Fast mmwave Beam Alignment via Correlated Bandit Learning

Authors: Wen Wu, Nan Cheng, Ning Zhang, Peng Yang, Weihua Zhuang, Xuemin, Shen

Abstract: Beam alignment (BA) is to ensure the transmitter and receiver beams are accurately aligned to establish a reliable communication link in millimeter-wave (mmwave) systems. Existing BA methods search the entire beam space to identify the optimal transmit-receive beam pair, which incurs significant BA latency on the order of seconds in the worst case. In this paper, we develop a learning algorithm to… ▽ More Beam alignment (BA) is to ensure the transmitter and receiver beams are accurately aligned to establish a reliable communication link in millimeter-wave (mmwave) systems. Existing BA methods search the entire beam space to identify the optimal transmit-receive beam pair, which incurs significant BA latency on the order of seconds in the worst case. In this paper, we develop a learning algorithm to reduce BA latency, namely Hierarchical Beam Alignment (HBA) algorithm. We first formulate the BA problem as a stochastic multi-armed bandit problem with the objective to maximize the cumulative received signal strength within a certain period. The proposed algorithm takes advantage of the correlation structure among beams such that the information from nearby beams is extracted to identify the optimal beam, instead of searching the entire beam space. Furthermore, the prior knowledge on the channel fluctuation is incorporated in the proposed algorithm to further accelerate the BA process. Theoretical analysis indicates that the proposed algorithm is asymptotically optimal. Extensive simulation results demonstrate that the proposed algorithm can identify the optimal beam with a high probability and reduce the BA latency from hundreds of milliseconds to a few milliseconds in the multipath channel, as compared to the existing BA method in IEEE 802.11ad. △ Less

Submitted 7 September, 2019; originally announced September 2019.

Comments: Accepted by IEEE Transactions on Wireless Communications. In this article, we propose a learning-based fast beam alignment algorithm to reduce beam alignment latency

Showing 1–50 of 60 results for author: Yang, P