-
Biased Over-the-Air Federated Learning under Wireless Heterogeneity
Authors:
Muhammad Faraz Ul Abrar,
Nicolò Michelusi
Abstract:
Recently, Over-the-Air (OTA) computation has emerged as a promising federated learning (FL) paradigm that leverages the waveform superposition properties of the wireless channel to realize fast model updates. Prior work focused on the OTA device ``pre-scaler" design under \emph{homogeneous} wireless conditions, in which devices experience the same average path loss, resulting in zero-bias solution…
▽ More
Recently, Over-the-Air (OTA) computation has emerged as a promising federated learning (FL) paradigm that leverages the waveform superposition properties of the wireless channel to realize fast model updates. Prior work focused on the OTA device ``pre-scaler" design under \emph{homogeneous} wireless conditions, in which devices experience the same average path loss, resulting in zero-bias solutions. Yet, zero-bias designs are limited by the device with the worst average path loss and hence may perform poorly in \emph{heterogeneous} wireless settings. In this scenario, there may be a benefit in designing \emph{biased} solutions, in exchange for a lower variance in the model updates. To optimize this trade-off, we study the design of OTA device pre-scalers by focusing on the OTA-FL convergence. We derive an upper bound on the model ``optimality error", which explicitly captures the effect of bias and variance in terms of the choice of the pre-scalers. Based on this bound, we identify two solutions of interest: minimum noise variance, and minimum noise variance zero-bias solutions. Numerical evaluations show that using OTA device pre-scalers that minimize the variance of FL updates, while allowing a small bias, can provide high gains over existing schemes.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Analog-digital Scheduling for Federated Learning: A Communication-Efficient Approach
Authors:
Muhammad Faraz Ul Abrar,
Nicolò Michelusi
Abstract:
Over-the-air (OTA) computation has recently emerged as a communication-efficient Federated Learning (FL) paradigm to train machine learning models over wireless networks. However, its performance is limited by the device with the worst SNR, resulting in fast yet noisy updates. On the other hand, allocating orthogonal resource blocks (RB) to individual devices via digital channels mitigates the noi…
▽ More
Over-the-air (OTA) computation has recently emerged as a communication-efficient Federated Learning (FL) paradigm to train machine learning models over wireless networks. However, its performance is limited by the device with the worst SNR, resulting in fast yet noisy updates. On the other hand, allocating orthogonal resource blocks (RB) to individual devices via digital channels mitigates the noise problem, at the cost of increased communication latency. In this paper, we address this discrepancy and present ADFL, a novel Analog-Digital FL scheme: in each round, the parameter server (PS) schedules each device to either upload its gradient via the analog OTA scheme or transmit its quantized gradient over an orthogonal RB using the ``digital" scheme. Focusing on a single FL round, we cast the optimal scheduling problem as the minimization of the mean squared error (MSE) on the estimated global gradient at the PS, subject to a delay constraint, yielding the optimal device scheduling configuration and quantization bits for the digital devices. Our simulation results show that ADFL, by scheduling most of the devices in the OTA scheme while also occasionally employing the digital scheme for a few devices, consistently outperforms OTA-only and digital-only schemes, in both i.i.d. and non-i.i.d. settings.
△ Less
Submitted 2 February, 2024; v1 submitted 31 January, 2024;
originally announced February 2024.
-
Delay-Aware Hierarchical Federated Learning
Authors:
Frank Po-Chen Lin,
Seyyedali Hosseinalipour,
Nicolò Michelusi,
Christopher Brinton
Abstract:
Federated learning has gained popularity as a means of training models distributed across the wireless edge. The paper introduces delay-aware hierarchical federated learning (DFL) to improve the efficiency of distributed machine learning (ML) model training by accounting for communication delays between edge and cloud. Different from traditional federated learning, DFL leverages multiple stochasti…
▽ More
Federated learning has gained popularity as a means of training models distributed across the wireless edge. The paper introduces delay-aware hierarchical federated learning (DFL) to improve the efficiency of distributed machine learning (ML) model training by accounting for communication delays between edge and cloud. Different from traditional federated learning, DFL leverages multiple stochastic gradient descent iterations on local datasets within each global aggregation period and intermittently aggregates model parameters through edge servers in local subnetworks. During global synchronization, the cloud server consolidates local models with the outdated global model using a local-global combiner, thus preserving crucial elements of both, enhancing learning efficiency under the presence of delay. A set of conditions is obtained to achieve the sub-linear convergence rate of O(1/k) for strongly convex and smooth loss functions. Based on these findings, an adaptive control algorithm is developed for DFL, implementing policies to mitigate energy consumption and communication latency while aiming for sublinear convergence. Numerical evaluations show DFL's superior performance in terms of faster global model convergence, reduced resource consumption, and robustness against communication delays compared to existing FL algorithms. In summary, this proposed method offers improved efficiency and results when dealing with both convex and non-convex loss functions.
△ Less
Submitted 27 September, 2023; v1 submitted 22 March, 2023;
originally announced March 2023.
-
Propagation Measurements and Analyses at 28 GHz via an Autonomous Beam-Steering Platform
Authors:
Bharath Keshavamurthy,
Yaguang Zhang,
Christopher R. Anderson,
Nicolo Michelusi,
James V. Krogmeier,
David J. Love
Abstract:
This paper details the design of an autonomous alignment and tracking platform to mechanically steer directional horn antennas in a sliding correlator channel sounder setup for 28 GHz V2X propagation modeling. A pan-and-tilt subsystem facilitates uninhibited rotational mobility along the yaw and pitch axes, driven by open-loop servo units and orchestrated via inertial motion controllers. A geo-pos…
▽ More
This paper details the design of an autonomous alignment and tracking platform to mechanically steer directional horn antennas in a sliding correlator channel sounder setup for 28 GHz V2X propagation modeling. A pan-and-tilt subsystem facilitates uninhibited rotational mobility along the yaw and pitch axes, driven by open-loop servo units and orchestrated via inertial motion controllers. A geo-positioning subsystem augmented in accuracy by real-time kinematics enables navigation events to be shared between a transmitter and receiver over an Apache Kafka messaging middleware framework with fault tolerance. Herein, our system demonstrates a 3D geo-positioning accuracy of 17 cm, an average principal axes positioning accuracy of 1.1 degrees, and an average tracking response time of 27.8 ms. Crucially, fully autonomous antenna alignment and tracking facilitates continuous series of measurements, a unique yet critical necessity for millimeter wave channel modeling in vehicular networks. The power-delay profiles, collected along routes spanning urban and suburban neighborhoods on the NSF POWDER testbed, are used in pathloss evaluations involving the 3GPP TR38.901 and ITU-R M.2135 standards. Empirically, we demonstrate that these models fail to accurately capture the 28 GHz pathloss behavior in urban foliage and suburban radio environments. In addition to RMS direction-spread analyses for angles-of-arrival via the SAGE algorithm, we perform signal decoherence studies wherein we derive exponential models for the spatial/angular autocorrelation coefficient under distance and alignment effects.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Non-Coherent Over-the-Air Decentralized Gradient Descent
Authors:
Nicolo Michelusi
Abstract:
Decentralized Gradient Descent (DGD) is a popular algorithm used to solve decentralized optimization problems in diverse domains such as remote sensing, distributed inference, multi-agent coordination, and federated learning. Yet, executing DGD over wireless systems affected by noise, fading and limited bandwidth presents challenges, requiring scheduling of transmissions to mitigate interference a…
▽ More
Decentralized Gradient Descent (DGD) is a popular algorithm used to solve decentralized optimization problems in diverse domains such as remote sensing, distributed inference, multi-agent coordination, and federated learning. Yet, executing DGD over wireless systems affected by noise, fading and limited bandwidth presents challenges, requiring scheduling of transmissions to mitigate interference and the acquisition of topology and channel state information -- complex tasks in wireless decentralized systems. This paper proposes a DGD algorithm tailored to wireless systems. Unlike existing approaches, it operates without inter-agent coordination, topology information, or channel state information. Its core is a Non-Coherent Over-The-Air (NCOTA) consensus scheme, exploiting a noisy energy superposition property of wireless channels. With a randomized transmission strategy to accommodate half-duplex operation, transmitters map local optimization signals to energy levels across subcarriers in an OFDM frame, and transmit concurrently without coordination. It is shown that received energies form a noisy consensus signal, whose fluctuations are mitigated via a consensus stepsize. NCOTA-DGD leverages the channel pathloss for consensus formation, without explicit knowledge of the mixing weights. It is shown that, for the class of strongly-convex problems, the expected squared distance between the local and globally optimum models vanishes with rate $\mathcal O(1/\sqrt{k})$ after $k$ iterations, with a proper design of decreasing stepsizes. Extensions address a broad class of fading models and frequency-selective channels. Numerical results on an image classification task depict faster convergence vis-à-vis running time than state-of-the-art schemes, especially in densely deployed networks.
△ Less
Submitted 18 March, 2024; v1 submitted 19 November, 2022;
originally announced November 2022.
-
Decentralized Federated Learning via Non-Coherent Over-the-Air Consensus
Authors:
Nicolò Michelusi
Abstract:
This paper presents NCOTA-DGD, a Decentralized Gradient Descent (DGD) algorithm that combines local gradient descent with a novel Non-Coherent Over-The-Air (NCOTA) consensus scheme to solve distributed machine-learning problems over wirelessly-connected systems. NCOTA-DGD leverages the waveform superposition properties of the wireless channels: it enables simultaneous transmissions under half-dupl…
▽ More
This paper presents NCOTA-DGD, a Decentralized Gradient Descent (DGD) algorithm that combines local gradient descent with a novel Non-Coherent Over-The-Air (NCOTA) consensus scheme to solve distributed machine-learning problems over wirelessly-connected systems. NCOTA-DGD leverages the waveform superposition properties of the wireless channels: it enables simultaneous transmissions under half-duplex constraints, by map** local optimization signals to a mixture of preamble sequences, and consensus via non-coherent combining at the receivers. NCOTA-DGD operates without channel state information at transmitters and receivers, and leverages the average channel pathloss to mix signals, without explicit knowledge of the mixing weights (typically known in consensus-based optimization algorithms). It is shown both theoretically and numerically that, for smooth and strongly-convex problems with fixed consensus and learning stepsizes, the updates of NCOTA-DGD converge in Euclidean distance to the global optimum with rate $\mathcal O(K^{-1/4})$ for a target of $K$ iterations. NCOTA-DGD is evaluated numerically over a logistic regression problem, showing faster convergence vis-à-vis running time than implementations of the classical DGD algorithm over digital and analog orthogonal channels.
△ Less
Submitted 3 February, 2023; v1 submitted 27 October, 2022;
originally announced October 2022.
-
Multiscale Adaptive Scheduling and Path-Planning for Power-Constrained UAV-Relays via SMDPs
Authors:
Bharath Keshavamurthy,
Nicolo Michelusi
Abstract:
We describe the orchestration of a decentralized swarm of rotary-wing UAV-relays, augmenting the coverage and service capabilities of a terrestrial base station. Our goal is to minimize the time-average service latencies involved in handling transmission requests from ground users under Poisson arrivals, subject to an average UAV power constraint. Equipped with rate adaptation to efficiently lever…
▽ More
We describe the orchestration of a decentralized swarm of rotary-wing UAV-relays, augmenting the coverage and service capabilities of a terrestrial base station. Our goal is to minimize the time-average service latencies involved in handling transmission requests from ground users under Poisson arrivals, subject to an average UAV power constraint. Equipped with rate adaptation to efficiently leverage air-to-ground channel stochastics, we first derive the optimal control policy for a single relay via a semi-Markov decision process formulation, with competitive swarm optimization for UAV trajectory design. Accordingly, we detail a multiscale decomposition of this construction: outer decisions on radial wait velocities and end positions optimize the expected long-term delay-power trade-off; consequently, inner decisions on angular wait velocities, service schedules, and UAV trajectories greedily minimize the instantaneous delay-power costs. Next, generalizing to UAV swarms via replication and consensus-driven command-and-control, this policy is embedded with spread maximization and conflict resolution heuristics. We demonstrate that our framework offers superior performance with respect to average service latencies and average per-UAV power consumption: 11x faster data payload delivery relative to static UAV-relay deployments and 2x faster than a deep-Q network solution; remarkably, one relay with our scheme outclasses three relays under a joint successive convex approximation policy by 62%.
△ Less
Submitted 15 October, 2022; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Parallel Successive Learning for Dynamic Distributed Model Training over Heterogeneous Wireless Networks
Authors:
Seyyedali Hosseinalipour,
Su Wang,
Nicolo Michelusi,
Vaneet Aggarwal,
Christopher G. Brinton,
David J. Love,
Mung Chiang
Abstract:
Federated learning (FedL) has emerged as a popular technique for distributing model training over a set of wireless devices, via iterative local updates (at devices) and global aggregations (at the server). In this paper, we develop parallel successive learning (PSL), which expands the FedL architecture along three dimensions: (i) Network, allowing decentralized cooperation among the devices via d…
▽ More
Federated learning (FedL) has emerged as a popular technique for distributing model training over a set of wireless devices, via iterative local updates (at devices) and global aggregations (at the server). In this paper, we develop parallel successive learning (PSL), which expands the FedL architecture along three dimensions: (i) Network, allowing decentralized cooperation among the devices via device-to-device (D2D) communications. (ii) Heterogeneity, interpreted at three levels: (ii-a) Learning: PSL considers heterogeneous number of stochastic gradient descent iterations with different mini-batch sizes at the devices; (ii-b) Data: PSL presumes a dynamic environment with data arrival and departure, where the distributions of local datasets evolve over time, captured via a new metric for model/concept drift. (ii-c) Device: PSL considers devices with different computation and communication capabilities. (iii) Proximity, where devices have different distances to each other and the access point. PSL considers the realistic scenario where global aggregations are conducted with idle times in-between them for resource efficiency improvements, and incorporates data dispersion and model dispersion with local model condensation into FedL. Our analysis sheds light on the notion of cold vs. warmed up models, and model inertia in distributed machine learning. We then propose network-aware dynamic model tracking to optimize the model learning vs. resource efficiency tradeoff, which we show is an NP-hard signomial programming problem. We finally solve this problem through proposing a general optimization solver. Our numerical results reveal new findings on the interdependencies between the idle times in-between the global aggregations, model/concept drift, and D2D cooperation configuration.
△ Less
Submitted 14 June, 2023; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Resource-Efficient and Delay-Aware Federated Learning Design under Edge Heterogeneity
Authors:
David Nickel,
Frank Po-Chen Lin,
Seyyedali Hosseinalipour,
Nicolo Michelusi,
Christopher G. Brinton
Abstract:
Federated learning (FL) has emerged as a popular technique for distributing machine learning across wireless edge devices. We examine FL under two salient properties of contemporary networks: device-server communication delays and device computation heterogeneity. Our proposed StoFedDelAv algorithm incorporates a local-global model combiner into the FL synchronization step. We theoretically charac…
▽ More
Federated learning (FL) has emerged as a popular technique for distributing machine learning across wireless edge devices. We examine FL under two salient properties of contemporary networks: device-server communication delays and device computation heterogeneity. Our proposed StoFedDelAv algorithm incorporates a local-global model combiner into the FL synchronization step. We theoretically characterize the convergence behavior of StoFedDelAv and obtain the optimal combiner weights, which consider the global model delay and expected local gradient error at each device. We then formulate a network-aware optimization problem which tunes the minibatch sizes of the devices to jointly minimize energy consumption and machine learning training loss, and solve the non-convex problem through a series of convex approximations. Our simulations reveal that StoFedDelAv outperforms the current art in FL, evidenced by the obtained improvements in optimization objective.
△ Less
Submitted 7 February, 2022; v1 submitted 27 December, 2021;
originally announced December 2021.
-
A Robotic Antenna Alignment and Tracking System for Millimeter Wave Propagation Modeling
Authors:
Bharath Keshavamurthy,
Yaguang Zhang,
Christopher R. Anderson,
Nicolo Michelusi,
James V. Krogmeier,
David J. Love
Abstract:
In this paper, we discuss the design of a sliding-correlator channel sounder for 28 GHz propagation modeling on the NSF POWDER testbed in Salt Lake City, UT. Beam-alignment is mechanically achieved via a fully autonomous robotic antenna tracking platform, designed using commercial off-the-shelf components. Equipped with an Apache Zookeeper/Kafka managed fault-tolerant publish-subscribe framework,…
▽ More
In this paper, we discuss the design of a sliding-correlator channel sounder for 28 GHz propagation modeling on the NSF POWDER testbed in Salt Lake City, UT. Beam-alignment is mechanically achieved via a fully autonomous robotic antenna tracking platform, designed using commercial off-the-shelf components. Equipped with an Apache Zookeeper/Kafka managed fault-tolerant publish-subscribe framework, we demonstrate tracking response times of 27.8 ms, in addition to superior scalability over state-of-the-art mechanical beam-steering systems. Enhanced with real-time kinematic correction streams, our geo-positioning subsystem achieves a 3D accuracy of 17 cm, while our principal axes positioning subsystem achieves an average accuracy of 1.1 degrees across yaw and pitch movements. Finally, by facilitating remote orchestration (via managed containers), uninhibited rotation (via encapsulation), and real-time positioning visualization (via Dash/MapBox), we exhibit a proven prototype well-suited for V2X measurements.
△ Less
Submitted 13 October, 2021;
originally announced October 2021.
-
Federated Learning Beyond the Star: Local D2D Model Consensus with Global Cluster Sampling
Authors:
Frank Po-Chen Lin,
Seyyedali Hosseinalipour,
Sheikh Shams Azam,
Christopher G. Brinton,
Nicolò Michelusi
Abstract:
Federated learning has emerged as a popular technique for distributing model training across the network edge. Its learning architecture is conventionally a star topology between the devices and a central server. In this paper, we propose two timescale hybrid federated learning (TT-HF), which migrates to a more distributed topology via device-to-device (D2D) communications. In TT-HF, local model t…
▽ More
Federated learning has emerged as a popular technique for distributing model training across the network edge. Its learning architecture is conventionally a star topology between the devices and a central server. In this paper, we propose two timescale hybrid federated learning (TT-HF), which migrates to a more distributed topology via device-to-device (D2D) communications. In TT-HF, local model training occurs at devices via successive gradient iterations, and the synchronization process occurs at two timescales: (i) macro-scale, where global aggregations are carried out via device-server interactions, and (ii) micro-scale, where local aggregations are carried out via D2D cooperative consensus formation in different device clusters. Our theoretical analysis reveals how device, cluster, and network-level parameters affect the convergence of TT-HF, and leads to a set of conditions under which a convergence rate of O(1/t) is guaranteed. Experimental results demonstrate the improvements in convergence and utilization that can be obtained by TT-HF over state-of-the-art federated learning baselines.
△ Less
Submitted 12 September, 2021; v1 submitted 7 September, 2021;
originally announced September 2021.
-
Finite-Bit Quantization For Distributed Algorithms With Linear Convergence
Authors:
Nicolò Michelusi,
Gesualdo Scutari,
Chang-Shen Lee
Abstract:
This paper studies distributed algorithms for (strongly convex) composite optimization problems over mesh networks, subject to quantized communications. Instead of focusing on a specific algorithmic design, a black-box model is proposed, casting linearly convergent distributed algorithms in the form of fixed-point iterates. The algorithmic model is equipped with a novel random or deterministic Bia…
▽ More
This paper studies distributed algorithms for (strongly convex) composite optimization problems over mesh networks, subject to quantized communications. Instead of focusing on a specific algorithmic design, a black-box model is proposed, casting linearly convergent distributed algorithms in the form of fixed-point iterates. The algorithmic model is equipped with a novel random or deterministic Biased Compression (BC) rule on the quantizer design, and a new Adaptive encoding Nonuniform Quantizer (ANQ) coupled with a communication-efficient encoding scheme, which implements the BC-rule using a finite number of bits (below machine precision). This fills a gap existing in most state-of-the-art quantization schemes, such as those based on the popular compression rule, which rely on communication of some scalar signals with negligible quantization error (in practice quantized at the machine precision). A unified communication complexity analysis is developed for the black-box model, determining the average number of bits required to reach a solution of the optimization problem within a target accuracy. It is shown that the proposed BC-rule preserves linear convergence of the unquantized algorithms, and a trade-off between convergence rate and communication cost under ANQ-based quantization is characterized. Numerical results validate our theoretical findings and show that distributed algorithms equipped with the proposed ANQ have more favorable communication cost than algorithms using state-of-the-art quantization rules.
△ Less
Submitted 17 May, 2022; v1 submitted 23 July, 2021;
originally announced July 2021.
-
Learning-based Spectrum Sensing and Access in Cognitive Radios via Approximate POMDPs
Authors:
Bharath Keshavamurthy,
Nicolo Michelusi
Abstract:
A novel LEarning-based Spectrum Sensing and Access (LESSA) framework is proposed, wherein a cognitive radio (CR) learns a time-frequency correlation model underlying spectrum occupancy of licensed users (LUs) in a radio ecosystem; concurrently, it devises an approximately optimal spectrum sensing and access policy under sensing constraints. A Baum-Welch algorithm is proposed to learn a parametric…
▽ More
A novel LEarning-based Spectrum Sensing and Access (LESSA) framework is proposed, wherein a cognitive radio (CR) learns a time-frequency correlation model underlying spectrum occupancy of licensed users (LUs) in a radio ecosystem; concurrently, it devises an approximately optimal spectrum sensing and access policy under sensing constraints. A Baum-Welch algorithm is proposed to learn a parametric Markov transition model of LU spectrum occupancy based on noisy spectrum measurements. Spectrum sensing and access are cast as a Partially-Observable Markov Decision Process, approximately optimized via randomized point-based value iteration. Fragmentation, Hamming-distance state filters and Monte-Carlo methods are proposed to alleviate the inherent computational complexity, and a weighted reward metric to regulate the trade-off between CR throughput and LU interference. Numerical evaluations demonstrate that LESSA performs within 5 percent of a genie-aided upper bound with foreknowledge of LU spectrum occupancy, and outperforms state-of-the-art algorithms across the entire trade-off region: 71 percent over correlation-based clustering, 26 percent over Neyman-Pearson detection, 6 percent over the Viterbi algorithm, and 9 percent over an adaptive Deep Q-Network. LESSA is then extended to a distributed Multi-Agent setting (MA-LESSA), by proposing novel neighbor discovery and channel access rank allocation. MA-LESSA improves CR throughput by 43 percent over cooperative TD-SARSA, 84 percent over cooperative greedy distributed learning, and 3x over non-cooperative learning via g-statistics and ACKs. Finally, MA-LESSA is implemented on the DARPA SC2 platform, manifesting superior performance over competitors in a real-world TDWR-UNII WLAN emulation; its implementation feasibility is further validated on a testbed of ESP32 radios, exhibiting 96 percent success probability.
△ Less
Submitted 14 July, 2021;
originally announced July 2021.
-
Learning and Adaptation for Millimeter-Wave Beam Tracking and Training: a Dual Timescale Variational Framework
Authors:
Muddassar Hussain,
Nicolo Michelusi
Abstract:
Millimeter-wave vehicular networks incur enormous beam-training overhead to enable narrow-beam communications. This paper proposes a learning and adaptation framework in which the dynamics of the communication beams are learned and then exploited to design adaptive beam-tracking and training with low overhead: on a long-timescale, a deep recurrent variational autoencoder (DR-VAE) uses noisy beam-t…
▽ More
Millimeter-wave vehicular networks incur enormous beam-training overhead to enable narrow-beam communications. This paper proposes a learning and adaptation framework in which the dynamics of the communication beams are learned and then exploited to design adaptive beam-tracking and training with low overhead: on a long-timescale, a deep recurrent variational autoencoder (DR-VAE) uses noisy beam-training feedback to learn a probabilistic model of beam dynamics and enable predictive beam-tracking; on a short-timescale, an adaptive beam-training procedure is formulated as a partially observable (PO-) Markov decision process (MDP) and optimized via point-based value iteration (PBVI) by leveraging beam-training feedback and a probabilistic prediction of the strongest beam pair provided by the DR-VAE. In turn, beam-training feedback is used to refine the DR-VAE via stochastic gradient ascent in a continuous process of learning and adaptation. The proposed DR-VAE learning framework learns accurate beam dynamics: it reduces the Kullback-Leibler divergence between the ground truth and the learned model of beam dynamics by 95% over the Baum-Welch algorithm and a naive learning approach that neglects feedback errors. Numerical results on a line-of-sight (LOS) scenario with multipath reveal that the proposed dual timescale approach yields near-optimal spectral efficiency, and improves it by 130% over a policy that scans exhaustively over the dominant beam pairs, and by 20% over a state-of-the-art POMDP policy. Finally, a low-complexity policy is proposed by reducing the POMDP to an error-robust MDP, and is shown to perform well in regimes with infrequent feedback errors.
△ Less
Submitted 26 October, 2021; v1 submitted 27 June, 2021;
originally announced July 2021.
-
Semi-Decentralized Federated Learning with Cooperative D2D Local Model Aggregations
Authors:
Frank Po-Chen Lin,
Seyyedali Hosseinalipour,
Sheikh Shams Azam,
Christopher G. Brinton,
Nicolo Michelusi
Abstract:
Federated learning has emerged as a popular technique for distributing machine learning (ML) model training across the wireless edge. In this paper, we propose two timescale hybrid federated learning (TT-HF), a semi-decentralized learning architecture that combines the conventional device-to-server communication paradigm for federated learning with device-to-device (D2D) communications for model t…
▽ More
Federated learning has emerged as a popular technique for distributing machine learning (ML) model training across the wireless edge. In this paper, we propose two timescale hybrid federated learning (TT-HF), a semi-decentralized learning architecture that combines the conventional device-to-server communication paradigm for federated learning with device-to-device (D2D) communications for model training. In TT-HF, during each global aggregation interval, devices (i) perform multiple stochastic gradient descent iterations on their individual datasets, and (ii) aperiodically engage in consensus procedure of their model parameters through cooperative, distributed D2D communications within local clusters. With a new general definition of gradient diversity, we formally study the convergence behavior of TT-HF, resulting in new convergence bounds for distributed ML. We leverage our convergence bounds to develop an adaptive control algorithm that tunes the step size, D2D communication rounds, and global aggregation period of TT-HF over time to target a sublinear convergence rate of O(1/t) while minimizing network resource utilization. Our subsequent experiments demonstrate that TT-HF significantly outperforms the current art in federated learning in terms of model accuracy and/or network energy consumption in different scenarios where local device datasets exhibit statistical heterogeneity. Finally, our numerical evaluations demonstrate robustness against outages caused by fading channels, as well favorable performance with non-convex loss functions.
△ Less
Submitted 30 September, 2021; v1 submitted 18 March, 2021;
originally announced March 2021.
-
Federated Learning with Communication Delay in Edge Networks
Authors:
Frank Po-Chen Lin,
Christopher G. Brinton,
Nicolò Michelusi
Abstract:
Federated learning has received significant attention as a potential solution for distributing machine learning (ML) model training through edge networks. This work addresses an important consideration of federated learning at the network edge: communication delays between the edge nodes and the aggregator. A technique called FedDelAvg (federated delayed averaging) is developed, which generalizes…
▽ More
Federated learning has received significant attention as a potential solution for distributing machine learning (ML) model training through edge networks. This work addresses an important consideration of federated learning at the network edge: communication delays between the edge nodes and the aggregator. A technique called FedDelAvg (federated delayed averaging) is developed, which generalizes the standard federated averaging algorithm to incorporate a weighting between the current local model and the delayed global model received at each device during the synchronization step. Through theoretical analysis, an upper bound is derived on the global model loss achieved by FedDelAvg, which reveals a strong dependency of learning performance on the values of the weighting and learning rate. Experimental results on a popular ML task indicate significant improvements in terms of convergence speed when optimizing the weighting scheme to account for delays.
△ Less
Submitted 21 August, 2020;
originally announced August 2020.
-
Multi-Stage Hybrid Federated Learning over Large-Scale D2D-Enabled Fog Networks
Authors:
Seyyedali Hosseinalipour,
Sheikh Shams Azam,
Christopher G. Brinton,
Nicolo Michelusi,
Vaneet Aggarwal,
David J. Love,
Huaiyu Dai
Abstract:
Federated learning has generated significant interest, with nearly all works focused on a "star" topology where nodes/devices are each connected to a central server. We migrate away from this architecture and extend it through the network dimension to the case where there are multiple layers of nodes between the end devices and the server. Specifically, we develop multi-stage hybrid federated lear…
▽ More
Federated learning has generated significant interest, with nearly all works focused on a "star" topology where nodes/devices are each connected to a central server. We migrate away from this architecture and extend it through the network dimension to the case where there are multiple layers of nodes between the end devices and the server. Specifically, we develop multi-stage hybrid federated learning (MH-FL), a hybrid of intra- and inter-layer model learning that considers the network as a multi-layer cluster-based structure. MH-FL considers the topology structures among the nodes in the clusters, including local networks formed via device-to-device (D2D) communications, and presumes a semi-decentralized architecture for federated learning. It orchestrates the devices at different network layers in a collaborative/cooperative manner (i.e., using D2D interactions) to form local consensus on the model parameters and combines it with multi-stage parameter relaying between layers of the tree-shaped hierarchy. We derive the upper bound of convergence for MH-FL with respect to parameters of the network topology (e.g., the spectral radius) and the learning algorithm (e.g., the number of D2D rounds in different clusters). We obtain a set of policies for the D2D rounds at different clusters to guarantee either a finite optimality gap or convergence to the global optimum. We then develop a distributed control algorithm for MH-FL to tune the D2D rounds in each cluster over time to meet specific convergence criteria. Our experiments on real-world datasets verify our analytical results and demonstrate the advantages of MH-FL in terms of resource utilization metrics.
△ Less
Submitted 12 January, 2022; v1 submitted 18 July, 2020;
originally announced July 2020.
-
MAESTRO-X: Distributed Orchestration of Rotary-Wing UAV-Relay Swarms
Authors:
Bharath Keshavamurthy,
Matthew Bliss,
Nicolò Michelusi
Abstract:
This work details a scalable framework to orchestrate a swarm of rotary-wing UAVs serving as cellular relays to facilitate beyond line-of-sight connectivity and traffic offloading for ground users. First, a Multiscale Adaptive Energy-conscious Scheduling and TRajectory Optimization (MAESTRO) framework is developed for a single UAV. Aiming to minimize the time-averaged latency to serve user request…
▽ More
This work details a scalable framework to orchestrate a swarm of rotary-wing UAVs serving as cellular relays to facilitate beyond line-of-sight connectivity and traffic offloading for ground users. First, a Multiscale Adaptive Energy-conscious Scheduling and TRajectory Optimization (MAESTRO) framework is developed for a single UAV. Aiming to minimize the time-averaged latency to serve user requests, subject to an average UAV power constraint, it is shown that the optimization problem can be cast as a semi-Markov decision process, and exhibits a multiscale structure: outer actions on radial wait velocities and terminal service positions minimize the long-term delay-power trade-off, optimized via value iteration; given these outer actions, inner actions on angular wait velocities and service trajectories minimize a short-term delay-energy cost. A novel hierarchical competitive swarm optimization scheme is developed in the inner optimization, to devise high-resolution trajectories via iterative pair-wise updates. Next, MAESTRO is eXtended to UAV swarms (MAESTRO-X) via scalable policy replication: enabled by a decentralized command-and-control network, the optimal single-agent policy is augmented with spread maximization, consensus-driven conflict resolution, adaptive frequency reuse, and piggybacking. Numerical evaluations show that, for user requests of 10 Mbits, generated according to a Poisson arrival process with rate 0.2 req/min/UAV, single-agent MAESTRO offers 3.8x faster service than a high-altitude platform and 29% faster than a static UAV deployment; moreover, for a swarm of 3 UAV-relays, MAESTRO-X delivers data payloads 4.7x faster than a successive convex approximation scheme; and remarkably, a single UAV optimized via MAESTRO outclasses 3 UAVs optimized via a deep-Q network by 38%.
△ Less
Submitted 4 February, 2023; v1 submitted 2 July, 2020;
originally announced July 2020.
-
Adaptive Millimeter-Wave Communications Exploiting Mobility and Blockage Dynamics
Authors:
Muddassar Hussain,
Maria Scalabrin,
Michele Rossi,
Nicolò Michelusi
Abstract:
Mobility may degrade the performance of next-generation vehicular networks operating at the millimeter-wave spectrum: frequent loss of alignment and blockages require repeated beam training and handover, thus incurring huge overhead. In this paper, an adaptive and joint design of beam training, data transmission and handover is proposed, that exploits the mobility process of mobile users and the d…
▽ More
Mobility may degrade the performance of next-generation vehicular networks operating at the millimeter-wave spectrum: frequent loss of alignment and blockages require repeated beam training and handover, thus incurring huge overhead. In this paper, an adaptive and joint design of beam training, data transmission and handover is proposed, that exploits the mobility process of mobile users and the dynamics of blockages to optimally trade-off throughput and power consumption. At each time slot, the serving base station decides to perform either beam training, data communication, or handover when blockage is detected. The problem is cast as a partially observable Markov decision process, and solved via an approximate dynamic programming algorithm based on PERSEUS [2]. Numerical results show that the PERSEUS-based policy performs near-optimally, and achieves a 55% gain in spectral efficiency compared to a baseline scheme with periodic beam training. Inspired by its structure, an adaptive heuristic policy is proposed with low computational complexity and small performance degradation.
△ Less
Submitted 1 March, 2020;
originally announced March 2020.
-
Mobility and Blockage-aware Communications in Millimeter-Wave Vehicular Networks
Authors:
Muddassar Hussain,
Maria Scalabrin,
Michele Rossi,
Nicolo Michelusi
Abstract:
Mobility may degrade the performance of next-generation vehicular networks operating at the millimeter-wave spectrum: frequent mis-alignment and blockages require repeated beam training and handover, and incur enormous overhead. Nevertheless, mobility induces temporal correlations in the communication beams and in blockage events. In this paper, an adaptive design of beam training, data transmissi…
▽ More
Mobility may degrade the performance of next-generation vehicular networks operating at the millimeter-wave spectrum: frequent mis-alignment and blockages require repeated beam training and handover, and incur enormous overhead. Nevertheless, mobility induces temporal correlations in the communication beams and in blockage events. In this paper, an adaptive design of beam training, data transmission and handover is proposed, that learns and exploits these temporal correlations to reduce the beam training overhead and optimally trade-off throughput and power consumption. At each time-slot, the serving base station (BS) decides to perform either beam training, data communication, or handover when blockage is detected, under uncertainty in the system state. The decision problem is cast as a partially observable Markov decision process, and the goal is to maximize the throughput delivered to the UE, under an average power constraint. To address the high dimensional optimization, an approximate constrained point-based value iteration (C-PBVI) method is developed, which simultaneously optimizes the primal and dual functions to meet the power constraint. Numerical results demonstrate a good match between the analysis and a simulation based on 2D mobility and 3D analog beamforming via uniform planar arrays at both BSs and UE, and reveal that C-PBVI performs near-optimally, and outperforms a baseline scheme with periodic beam training by 38% in spectral efficiency. Motivated by the structure of the C-PBVI policy, two heuristics are proposed, that trade complexity with sub-optimality, and achieve only 4% and 15% loss in spectral efficiency.
△ Less
Submitted 3 September, 2020; v1 submitted 25 February, 2020;
originally announced February 2020.
-
Power-Constrained Trajectory Optimization for Wireless UAV Relays with Random Requests
Authors:
Matthew Bliss,
Nicolò Michelusi
Abstract:
This paper studies the adaptive trajectory design of a rotary-wing UAV serving as a relay between ground nodes dispersed in a circular cell and generating uplink data transmissions randomly according to a Poisson process, and a central base station. We seek to minimize the expected average communication delay to service the data transmission requests, subject to an average power constraint on the…
▽ More
This paper studies the adaptive trajectory design of a rotary-wing UAV serving as a relay between ground nodes dispersed in a circular cell and generating uplink data transmissions randomly according to a Poisson process, and a central base station. We seek to minimize the expected average communication delay to service the data transmission requests, subject to an average power constraint on the mobility of the UAV. The problem is cast as a semi-Markov decision process, and it is shown that the policy exhibits a two-scale structure, which can be efficiently optimized: in the outer decision, upon starting a communication phase, and given its current radius, the UAV selects a target end radius position so as to optimally balance a trade-off between average long-term communication delay and power consumption; in the inner decision, the UAV selects its trajectory between the start radius and the selected end radius, so as to greedily minimize the delay and energy consumption to serve the current request. Numerical evaluations show that, during waiting phases, the UAV circles at some optimal radius at the most energy efficient speed, until a new request is received. Lastly, the expected average communication delay and power consumption of the optimal policy is compared to that of several heuristics, demonstrating a reduction in latency by over 50% and 20%, respectively, compared to static and mobile heuristic schemes.
△ Less
Submitted 27 February, 2020; v1 submitted 21 February, 2020;
originally announced February 2020.
-
Second-best Beam-Alignment via Bayesian Multi-Armed Bandits
Authors:
Muddassar Hussain,
Nicolo Michelusi
Abstract:
Millimeter-wave (mm-wave) systems rely on narrow-beams to cope with the severe signal attenuation in the mm-wave frequency band. However, susceptibility to beam mis-alignment due to mobility or blockage requires the use of beam-alignment schemes, with huge cost in terms of overhead and use of system resources. In this paper, a beam-alignment scheme is proposed based on Bayesian multi-armed bandits…
▽ More
Millimeter-wave (mm-wave) systems rely on narrow-beams to cope with the severe signal attenuation in the mm-wave frequency band. However, susceptibility to beam mis-alignment due to mobility or blockage requires the use of beam-alignment schemes, with huge cost in terms of overhead and use of system resources. In this paper, a beam-alignment scheme is proposed based on Bayesian multi-armed bandits, with the goal to maximize the alignment probability and the data-communication throughput. A Bayesian approach is proposed, by considering the state as a posterior distribution over angles of arrival (AoA) and of departure (AoD), given the history of feedback signaling and of beam pairs scanned by the base-station (BS) and the user-end (UE). A simplified sufficient statistics for optimal control is derived, in the form of preference of BS-UE beam pairs. By bounding a value function, the second-best preference policy is derived, which strikes an optimal balance between exploration and exploitation by selecting the beam pair with the current second-best preference. Through Monte-Carlo simulation with analog beamforming, the superior performance of the second-best preference policy is demonstrated in comparison to existing schemes based on first-best preference, linear Thompson sampling, and upper confidence bounds, with up to 7%, 10% and 30% improvements in alignment probability, respectively.
△ Less
Submitted 21 August, 2019; v1 submitted 11 June, 2019;
originally announced June 2019.
-
Trajectory Optimization for Rotary-Wing UAVs in Wireless Networks with Random Requests
Authors:
Matthew Bliss,
Nicolò Michelusi
Abstract:
This paper studies the trajectory optimization problem in a scenario where a single rotary-wing UAV acts as a relay of data payloads for downlink transmission requests generated randomly by two ground nodes (GNs) in a wireless network. The goal is to optimize the UAV trajectory in order to minimize the expected average communication delay to serve these random requests. It is shown that the proble…
▽ More
This paper studies the trajectory optimization problem in a scenario where a single rotary-wing UAV acts as a relay of data payloads for downlink transmission requests generated randomly by two ground nodes (GNs) in a wireless network. The goal is to optimize the UAV trajectory in order to minimize the expected average communication delay to serve these random requests. It is shown that the problem can be cast as a semi-Markov decision process (SMDP), and the resulting minimization problem is solved via multi-chain policy iteration. The optimality of a two-scale optimization approach is proved: the optimal trajectory in the communication phase greedily minimizes the communication delay of the current request while moving between the current start position and a target end position (inner optimization); the end positions are selected to minimize the expected average long-term delay in the SMDP (outer optimization). Numerical simulations show that the expected average delay is minimized when the UAV moves towards the geometric center of the GNs during phases in which it is not actively servicing transmission requests, and demonstrate significant improvements over sensible heuristics. Finally, it is revealed that the optimal end positions of communication phases become increasingly independent of the data payload, for large data payload values.
△ Less
Submitted 21 August, 2019; v1 submitted 5 May, 2019;
originally announced May 2019.
-
Finite rate distributed weight-balancing and average consensus over digraphs
Authors:
Chang-Shen Lee,
Nicolò Michelusi,
Gesualdo Scutari
Abstract:
This paper proposes the first distributed algorithm that solves the weight-balancing problem using only finite rate and simplex communications among nodes, compliant with the directed nature of the graph edges. It is proved that the algorithm converges to a weight-balanced solution at sublinear rate. The analysis builds upon a new metric inspired by positional system representations, which charact…
▽ More
This paper proposes the first distributed algorithm that solves the weight-balancing problem using only finite rate and simplex communications among nodes, compliant with the directed nature of the graph edges. It is proved that the algorithm converges to a weight-balanced solution at sublinear rate. The analysis builds upon a new metric inspired by positional system representations, which characterizes the dynamics of information exchange over the network, and on a novel step-size rule. Building on this result, a novel distributed algorithm is proposed that solves the average consensus problem over digraphs, using, at each timeslot, finite rate simplex communications between adjacent nodes -- some bits for the weight-balancing problem and others for the average consensus. Convergence of the proposed quantized consensus algorithm to the average of the node's unquantized initial values is established, both almost surely and in the moment generating function of the error; and a sublinear convergence rate is proved for sufficiently large step-sizes. Numerical results validate our theoretical findings.
△ Less
Submitted 29 February, 2020; v1 submitted 3 January, 2019;
originally announced January 2019.
-
Coded Energy-Efficient Beam-Alignment for Millimeter-Wave Networks
Authors:
Muddassar Hussain,
Nicolo Michelusi
Abstract:
Millimeter-wave communications rely on narrow-beam transmissions to cope with the strong signal attenuation at these frequencies, thus demanding precise alignment between transmitter and receiver. However, the beam-alignment procedure may entail a huge overhead and its performance may be degraded by detection errors. This paper proposes a coded energy-efficient beam-alignment scheme, robust agains…
▽ More
Millimeter-wave communications rely on narrow-beam transmissions to cope with the strong signal attenuation at these frequencies, thus demanding precise alignment between transmitter and receiver. However, the beam-alignment procedure may entail a huge overhead and its performance may be degraded by detection errors. This paper proposes a coded energy-efficient beam-alignment scheme, robust against detection errors. Specifically, the beam-alignment sequence is designed such that the error-free feedback sequences are generated from a codebook with the desired error correction capabilities. Therefore, in the presence of detection errors, the error-free feedback sequences can be recovered with high probability. The assignment of beams to codewords is designed to optimize energy efficiency, and a water-filling solution is proved. The numerical results with analog beams depict up to 4dB and 8dB gains over exhaustive and uncoded beam-alignment schemes, respectively.
△ Less
Submitted 8 October, 2018;
originally announced October 2018.
-
Limited Rate Distributed Weight-Balancing and Average Consensus Over Digraphs
Authors:
Chang-Shen Lee,
Nicolò Michelusi,
Gesualdo Scutari
Abstract:
Distributed quantized weight-balancing and average consensus over fixed digraphs are considered. A digraph with non-negative weights associated to its edges is weight-balanced if, for each node, the sum of the weights of its out-going edges is equal to that of its incoming edges. This paper proposes and analyzes the first distributed algorithm that solves the weight-balancing problem using only fi…
▽ More
Distributed quantized weight-balancing and average consensus over fixed digraphs are considered. A digraph with non-negative weights associated to its edges is weight-balanced if, for each node, the sum of the weights of its out-going edges is equal to that of its incoming edges. This paper proposes and analyzes the first distributed algorithm that solves the weight-balancing problem using only finite rate and simplex communications among nodes (compliant to the directed nature of the graph edges). Asymptotic convergence of the scheme is proved and a convergence rate analysis is provided. Building on this result, a novel distributed algorithm is proposed that solves the average consensus problem over digraphs, using, at each iteration, finite rate simplex communications between adjacent nodes -- some bits for the weight-balancing problem, other for the average consensus. Convergence of the proposed quantized consensus algorithm to the average of the real (i.e., unquantized) agent's initial values is proved, both almost surely and in $r$th mean for all positive integer $r$. Finally, numerical results validate our theoretical findings.
△ Less
Submitted 17 September, 2018;
originally announced September 2018.
-
Energy-Efficient Interactive Beam-Alignment for Millimeter-Wave Networks
Authors:
Muddassar Hussain,
Nicolo Michelusi
Abstract:
Millimeter-wave will be a key technology in next-generation wireless networks thanks to abundant bandwidth availability. However, the use of large antenna arrays with beamforming demands precise beam-alignment between transmitter and receiver, and may entail huge overhead in mobile environments. This paper investigates the design of an optimal interactive beam-alignment and data communication prot…
▽ More
Millimeter-wave will be a key technology in next-generation wireless networks thanks to abundant bandwidth availability. However, the use of large antenna arrays with beamforming demands precise beam-alignment between transmitter and receiver, and may entail huge overhead in mobile environments. This paper investigates the design of an optimal interactive beam-alignment and data communication protocol, with the goal of minimizing power consumption under a minimum rate constraint. The base-station selects beam-alignment or data communication and the beam parameters, based on feedback from the user-end. Based on the sectored antenna model and uniform prior on the angles of departure and arrival (AoD/AoA), the optimality of a fixed-length beam-alignment phase followed by a data-communication phase is demonstrated. Moreover, a decoupled fractional beam-alignment method is shown to be optimal, which decouples over time the alignment of AoD and AoA, and iteratively scans a fraction of their region of uncertainty. A heuristic policy is proposed for non-uniform prior on AoD/AoA, with provable performance guarantees, and it is shown that the uniform prior is the worst-case scenario. The performance degradation due to detection errors is studied analytically and via simulation. The numerical results with analog beams depict up to 4dB, 7.5dB, and 14 dB gains over a state-of-the-art bisection method, conventional and interactive exhaustive search policies, respectively, and demonstrate that the sectored model provides valuable insights for beam-alignment design.
△ Less
Submitted 7 December, 2018; v1 submitted 15 May, 2018;
originally announced May 2018.
-
Beam Training and Data Transmission Optimization in Millimeter-Wave Vehicular Networks
Authors:
Maria Scalabrin,
Nicolò Michelusi,
Michele Rossi
Abstract:
Future vehicular communication networks call for new solutions to support their capacity demands, by leveraging the potential of the millimeter-wave (mm-wave) spectrum. Mobility, in particular, poses severe challenges in their design, and as such shall be accounted for. A key question in mm-wave vehicular networks is how to optimize the trade-off between directive Data Transmission (DT) and direct…
▽ More
Future vehicular communication networks call for new solutions to support their capacity demands, by leveraging the potential of the millimeter-wave (mm-wave) spectrum. Mobility, in particular, poses severe challenges in their design, and as such shall be accounted for. A key question in mm-wave vehicular networks is how to optimize the trade-off between directive Data Transmission (DT) and directional Beam Training (BT), which enables it. In this paper, learning tools are investigated to optimize this trade-off. In the proposed scenario, a Base Station (BS) uses BT to establish a mm-wave directive link towards a Mobile User (MU) moving along a road. To control the BT/DT trade-off, a Partially Observable (PO) Markov Decision Process (MDP) is formulated, where the system state corresponds to the position of the MU within the road link. The goal is to maximize the number of bits delivered by the BS to the MU over the communication session, under a power constraint. The resulting optimal policies reveal that adaptive BT/DT procedures significantly outperform common-sense heuristic schemes, and that specific mobility features, such as user position estimates, can be effectively used to enhance the overall system performance and optimize the available system resources.
△ Less
Submitted 26 April, 2018;
originally announced April 2018.
-
Multi-Scale Spectrum Sensing in Dense Multi-Cell Cognitive Networks
Authors:
Nicolo Michelusi,
Matthew Nokleby,
Urbashi Mitra,
Robert Calderbank
Abstract:
Multi-scale spectrum sensing is proposed to overcome the cost of full network state information on the spectrum occupancy of primary users (PUs) in dense multi-cell cognitive networks. Secondary users (SUs) estimate the local spectrum occupancies and aggregate them hierarchically to estimate spectrum occupancy at multiple spatial scales. Thus, SUs obtain fine-grained estimates of spectrum occupanc…
▽ More
Multi-scale spectrum sensing is proposed to overcome the cost of full network state information on the spectrum occupancy of primary users (PUs) in dense multi-cell cognitive networks. Secondary users (SUs) estimate the local spectrum occupancies and aggregate them hierarchically to estimate spectrum occupancy at multiple spatial scales. Thus, SUs obtain fine-grained estimates of spectrum occupancies of nearby cells, more relevant to scheduling tasks, and coarse-grained estimates of those of distant cells. An agglomerative clustering algorithm is proposed to design a cost-effective aggregation tree, matched to the structure of interference, robust to local estimation errors and delays. Given these multi-scale estimates, the SU traffic is adapted in a decentralized fashion in each cell, to optimize the trade-off among SU cell throughput, interference caused to PUs, and mutual SU interference. Numerical evaluations demonstrate a small degradation in SU cell throughput (up to 15% for a 0dB interference-to-noise ratio experienced at PUs) compared to a scheme with full network state information, using only one-third of the cost incurred in the exchange of spectrum estimates. The proposed interference-matched design is shown to significantly outperform a random tree design, by providing more relevant information for network control, and a state-of-the-art consensus-based algorithm, which does not leverage the spatio-temporal structure of interference across the network.
△ Less
Submitted 6 December, 2018; v1 submitted 22 February, 2018;
originally announced February 2018.
-
Optimal Beam Swee** and Communication in Mobile Millimeter-Wave Networks
Authors:
Nicolo Michelusi,
Muddassar Hussain
Abstract:
Millimeter-wave (mm-wave) communications incur a high beam alignment cost in mobile scenarios such as vehicular networks. Therefore, an efficient beam alignment mechanism is required to mitigate the resulting overhead. In this paper, a one-dimensional mobility model is proposed where a mobile user (MU), such as a vehicle, moves along a straight road with time-varying and random speed, and communic…
▽ More
Millimeter-wave (mm-wave) communications incur a high beam alignment cost in mobile scenarios such as vehicular networks. Therefore, an efficient beam alignment mechanism is required to mitigate the resulting overhead. In this paper, a one-dimensional mobility model is proposed where a mobile user (MU), such as a vehicle, moves along a straight road with time-varying and random speed, and communicates with base stations (BSs) located on the roadside over the mm-wave band. To compensate for location uncertainty, the BS widens its transmission beam and, when a critical beamwidth is achieved, it performs beam-swee** to refine the MU position estimate, followed by data communication over a narrow beam. The average rate and average transmission power are computed in closed form and the optimal beamwidth for communication, number of swee** beams, and transmission power allocation are derived so as to maximize the average rate under an average power constraint. Structural properties of the optimal design are proved, and a bisection algorithm to determine the optimal swee** -- communication parameters is designed. It is shown numerically that an adaptation of the IEEE 802.11ad standard to the proposed model exhibits up to 90% degradation in spectral efficiency compared to the proposed scheme.
△ Less
Submitted 28 January, 2018;
originally announced January 2018.
-
Optimal Spectrum Sharing with ARQ based Legacy Users via Chain Decoding
Authors:
Nicolo Michelusi
Abstract:
This paper investigates the design of access policies in spectrum sharing networks by exploiting the retransmission protocol of legacy primary users (PUs) to improve the spectral efficiency via opportunistic retransmissions at secondary users (SUs) and chain decoding. The optimal policy maximizing the SU throughput under an interference constraint to the PU and its performance are found in closed…
▽ More
This paper investigates the design of access policies in spectrum sharing networks by exploiting the retransmission protocol of legacy primary users (PUs) to improve the spectral efficiency via opportunistic retransmissions at secondary users (SUs) and chain decoding. The optimal policy maximizing the SU throughput under an interference constraint to the PU and its performance are found in closed form. It is shown that the optimal policy randomizes among three modes: Idle, the SU remains idle over the retransmission window of the PU, to avoid causing interference; Interference cancellation, the SU transmits only after decoding the PU packet, to improve its own throughput via interference cancellation; Always transmit, the SU transmits over the retransmission window of the PU to maximize the future potential of interference cancellation via chain decoding. This structure is exploited to design a stochastic optimization algorithm to facilitate learning and adaptation when the model parameters are unknown or vary over time, based on ARQ feedback from the PU and CSI measurements at the SU receiver. It is shown numerically that, for a 10% interference constraint, the optimal access policy yields 15% improvement over a state-of-the-art scheme without SU retransmissions, and up to 2x gain over a scheme using a non-adaptive access policy instead of the optimal one.
△ Less
Submitted 29 June, 2018; v1 submitted 24 January, 2018;
originally announced January 2018.
-
Multi-scale Spectrum Sensing in Small-Cell mm-Wave Cognitive Wireless Networks
Authors:
Nicolo Michelusi,
Matthew Nokleby,
Urbashi Mitra,
Robert Calderbank
Abstract:
In this paper, a multi-scale approach to spectrum sensing in cognitive cellular networks is proposed. In order to overcome the huge cost incurred in the acquisition of full network state information, a hierarchical scheme is proposed, based on which local state estimates are aggregated up the hierarchy to obtain aggregate state information at multiple scales, which are then sent back to each cell…
▽ More
In this paper, a multi-scale approach to spectrum sensing in cognitive cellular networks is proposed. In order to overcome the huge cost incurred in the acquisition of full network state information, a hierarchical scheme is proposed, based on which local state estimates are aggregated up the hierarchy to obtain aggregate state information at multiple scales, which are then sent back to each cell for local decision making. Thus, each cell obtains fine-grained estimates of the channel occupancies of nearby cells, but coarse-grained estimates of those of distant cells. The performance of the aggregation scheme is studied in terms of the trade-off between the throughput achievable by secondary users and the interference generated by the activity of these secondary users to primary users. In order to account for the irregular structure of interference patterns arising from path loss, shadowing, and blockages, which are especially relevant in millimeter wave networks, a greedy algorithm is proposed to find a multi-scale aggregation tree to optimize the performance. It is shown numerically that this tailored hierarchy outperforms a regular tree construction by 60%.
△ Less
Submitted 25 February, 2017;
originally announced February 2017.
-
Throughput Optimal Beam Alignment in Millimeter Wave Networks
Authors:
Muddassar Hussain,
Nicolo Michelusi
Abstract:
Millimeter wave communications rely on narrow-beam transmissions to cope with the strong signal attenuation at these frequencies, thus demanding precise beam alignment between transmitter and receiver. The communication overhead incurred to achieve beam alignment may become a severe impairment in mobile networks. This paper addresses the problem of optimizing beam alignment acquisition, with the g…
▽ More
Millimeter wave communications rely on narrow-beam transmissions to cope with the strong signal attenuation at these frequencies, thus demanding precise beam alignment between transmitter and receiver. The communication overhead incurred to achieve beam alignment may become a severe impairment in mobile networks. This paper addresses the problem of optimizing beam alignment acquisition, with the goal of maximizing throughput. Specifically, the algorithm jointly determines the portion of time devoted to beam alignment acquisition, as well as, within this portion of time, the optimal beam search parameters, using the framework of Markov decision processes. It is proved that a bisection search algorithm is optimal, and that it outperforms exhaustive and iterative search algorithms proposed in the literature. The duration of the beam alignment phase is optimized so as to maximize the overall throughput. The numerical results show that the throughput, optimized with respect to the duration of the beam alignment phase, achievable under the exhaustive algorithm is 88.3% lower than that achievable under the bisection algorithm. Similarly, the throughput achievable by the iterative search algorithm for a division factor of 4 and 8 is, respectively, 12.8% and 36.4% lower than that achievable by the bisection algorithm.
△ Less
Submitted 20 February, 2017;
originally announced February 2017.
-
Energy-Based Adaptive Multiple Access in LPWAN IoT Systems with Energy Harvesting
Authors:
Nicolo Michelusi,
Marco Levorato
Abstract:
This paper develops a control framework for a network of energy harvesting nodes connected to a Base Station (BS) over a multiple access channel. The objective is to adapt their transmission strategy to the state of the network, including the energy available to the individual nodes. In order to reduce the complexity of control, an optimization framework is proposed where energy storage dynamics a…
▽ More
This paper develops a control framework for a network of energy harvesting nodes connected to a Base Station (BS) over a multiple access channel. The objective is to adapt their transmission strategy to the state of the network, including the energy available to the individual nodes. In order to reduce the complexity of control, an optimization framework is proposed where energy storage dynamics are replaced by dynamic average power constraints induced by the time correlated energy supply, thus enabling lightweight and flexible network control. Specifically, the BS adapts the packet transmission probability of the "active" nodes (those currently under a favorable energy harvesting state) so as to maximize the average long-term throughput, under these dynamic average power constraints. The resulting policy takes the form of the packet transmission probability as a function of the energy harvesting state and number of active nodes. The structure of the throughput-optimal genie-aided policy, in which the number of active nodes is known non-causally at the BS, is proved. Inspired by the genie-aided policy, a Bayesian estimation approach is presented to address the case where the BS estimates the number of active nodes based on the observed network transmission pattern. It is shown that the proposed scheme outperforms by 20% a scheme in which the nodes operate based on local state information only, and performs well even when energy storage dynamics are taken into account.
△ Less
Submitted 13 January, 2017; v1 submitted 13 November, 2016;
originally announced November 2016.
-
Cognitive Access-Transmission Policies under a Primary ARQ process via Chain Decoding
Authors:
Nicolo Michelusi,
Petar Popovski,
Michele Zorzi
Abstract:
This paper introduces a novel technique that enables access by a cognitive secondary user (SU) to a spectrum occupied by an incumbent primary user (PU) that employs Type-I Hybrid ARQ. The technique allows the SU to perform selective retransmissions of SU data packets that have not been successfully decoded in the previous attempts. The temporal redundancy introduced by the PU ARQ protocol and by t…
▽ More
This paper introduces a novel technique that enables access by a cognitive secondary user (SU) to a spectrum occupied by an incumbent primary user (PU) that employs Type-I Hybrid ARQ. The technique allows the SU to perform selective retransmissions of SU data packets that have not been successfully decoded in the previous attempts. The temporal redundancy introduced by the PU ARQ protocol and by the selective retransmission process of the SU can be exploited by the SU receiver to perform interference cancellation (IC) over multiple transmission slots, thus creating a "clean" channel for the decoding of the concurrent SU or PU packets. The chain decoding technique is initiated by a successful decoding operation of a SU or PU packet and proceeds by an iterative application of IC in order to decode the buffered signals that represent packets that could not be decoded before. Based on this scheme, an optimal policy is designed that maximizes the SU throughput under a constraint on the average long-term PU performance. The optimality of the chain decoding protocol is proved, which determines which packet the SU should send at a given time. Moreover, a decoupling principle is proved, which establishes the optimality of decoupling the secondary access strategy from the chain decoding protocol. Specifically, first, the SU access policy, optimized via dynamic programming, specifies whether the SU should access the channel or not, based on a compact state representation of the protocol; and second, the chain decoding protocol embeds four basic rules that are used to determine which packet should be transmitted by the SU. Chain decoding provably yields the maximum improvement that can be achieved by any scheme under our assumptions, and thus it is the ultimate scheme, which completely closes the gap between previous schemes and optimality.
△ Less
Submitted 26 September, 2015;
originally announced September 2015.
-
A new result of the scaling law of weighted L1 minimization
Authors:
Jun Zhang,
Urbashi Mitra,
Kuan-Wen Huang,
Nicolo Michelusi
Abstract:
This paper study recovery conditions of weighted L1 minimization for signal reconstruction from compressed sensing measurements. A sufficient condition for exact recovery by using the general weighted L1 minimization is derived, which builds a direct relationship between the weights and the recoverability. Simulation results indicates that this sufficient condition provides a precise prediction of…
▽ More
This paper study recovery conditions of weighted L1 minimization for signal reconstruction from compressed sensing measurements. A sufficient condition for exact recovery by using the general weighted L1 minimization is derived, which builds a direct relationship between the weights and the recoverability. Simulation results indicates that this sufficient condition provides a precise prediction of the scaling law for the weighted L1 minimization.
△ Less
Submitted 26 September, 2015;
originally announced September 2015.
-
Queuing models for abstracting interactions in Bacterial communities
Authors:
Nicolò Michelusi,
James Boedicker,
Mohamed Y. El-Naggar,
Urbashi Mitra
Abstract:
Microbial communities play a significant role in bioremediation,plant growth,human and animal digestion,global elemental cycles including the carbon-cycle,and water treatment.They are also posed to be the engines of renewable energy via microbial fuel cells which can reverse the process of electrosynthesis.Microbial communication regulates many virulence mechanisms used by bacteria.Thus,it is of f…
▽ More
Microbial communities play a significant role in bioremediation,plant growth,human and animal digestion,global elemental cycles including the carbon-cycle,and water treatment.They are also posed to be the engines of renewable energy via microbial fuel cells which can reverse the process of electrosynthesis.Microbial communication regulates many virulence mechanisms used by bacteria.Thus,it is of fundamental importance to understand interactions in microbial communities and to develop predictive tools that help control them,in order to aid the design of systems exploiting bacterial capabilities.This position paper explores how abstractions from communications,networking and information theory can play a role in understanding and modeling bacterial interactions.In particular,two forms of interactions in bacterial systems will be examined:electron transfer and quorum sensing.While the diffusion of chemical signals has been heavily studied,electron transfer occurring in living cells and its role in cell-cell interaction is less understood.Recent experimental observations open up new frontiers in the design of microbial systems based on electron transfer,which may coexist with the more well-known interaction strategies based on molecular diffusion.In quorum sensing,the concentration of certain signature chemical compounds emitted by the bacteria is used to estimate the bacterial population size,so as to activate collective behaviors.In this position paper,queuing models for electron transfer are summarized and adapted to provide new models for quorum sensing.These models are stochastic,and thus capture the inherent randomness exhibited by cell colonies in nature.It is shown that queuing models allow the characterization of the state of a single cell as a function of interactions with other cells and the environment,while being amenable to complexity reduction.
△ Less
Submitted 10 January, 2016; v1 submitted 4 August, 2015;
originally announced August 2015.
-
Cross-layer estimation and control for Cognitive Radio: Exploiting Sparse Network Dynamics
Authors:
Nicolo Michelusi,
Urbashi Mitra
Abstract:
In this paper, a cross-layer framework to jointly optimize spectrum sensing and scheduling in resource constrained agile wireless networks is presented. A network of secondary users (SUs) accesses portions of the spectrum left unused by a network of licensed primary users (PUs). A central controller (CC) schedules the traffic of the SUs, based on distributed compressed measurements collected by th…
▽ More
In this paper, a cross-layer framework to jointly optimize spectrum sensing and scheduling in resource constrained agile wireless networks is presented. A network of secondary users (SUs) accesses portions of the spectrum left unused by a network of licensed primary users (PUs). A central controller (CC) schedules the traffic of the SUs, based on distributed compressed measurements collected by the SUs. Sensing and scheduling are jointly controlled to maximize the SU throughput, with constraints on PU throughput degradation and SU cost. The sparsity in the spectrum dynamics is exploited: leveraging a prior spectrum occupancy estimate, the CC needs to estimate only a residual uncertainty vector via sparse recovery techniques. The high complexity entailed by the POMDP formulation is reduced by a low-dimensional belief representation via minimization of the Kullback-Leibler divergence. It is proved that the optimization of sensing and scheduling can be decoupled. A partially myopic scheduling strategy is proposed for which structural properties can be proved showing that the myopic scheme allocates SU traffic to likely idle spectral bands. Simulation results show that this framework balances optimally the resources between spectrum sensing and data transmission. This framework defines sensing-scheduling schemes most informative for network control, yielding energy efficient resource utilization.
△ Less
Submitted 22 June, 2015;
originally announced June 2015.
-
Capacity of electron-based communication over bacterial cables: the full-CSI case
Authors:
Nicolo Michelusi,
Urbashi Mitra
Abstract:
Motivated by recent discoveries of microbial communities that transfer electrons across centimeter-length scales, this paper studies the information capacity of bacterial cables via electron transfer, which coexists with molecular communications, under the assumption of full causal channel state information (CSI). The bacterial cable is modeled as an electron queue that transfers electrons from th…
▽ More
Motivated by recent discoveries of microbial communities that transfer electrons across centimeter-length scales, this paper studies the information capacity of bacterial cables via electron transfer, which coexists with molecular communications, under the assumption of full causal channel state information (CSI). The bacterial cable is modeled as an electron queue that transfers electrons from the encoder at the electron donor source, which controls the desired input electron intensity, to the decoder at the electron acceptor sink. Clogging due to local ATP saturation along the cable is modeled. A discrete-time scheme is investigated, enabling the computation of an achievable rate. The regime of asymptotically small time-slot duration is analyzed, and the optimality of binary input distributions is proved, i.e., the encoder transmits at either maximum or minimum intensity, as dictated by the physical constraints of the cable. A dynamic programming formulation of the capacity is proposed, and the optimal binary signaling is determined via policy iteration. It is proved that the optimal signaling has smaller intensity than that given by the myopic policy, which greedily maximizes the instantaneous information rate but neglects its effect on the steady-state cable distribution. In contrast, the optimal scheme balances the tension between achieving high instantaneous information rate, and inducing a favorable steady-state distribution, such that those states characterized by high information rates are visited more frequently, thus revealing the importance of CSI. This work represents a first contribution towards the design of electron signaling schemes in complex microbial structures, e.g., bacterial cables and biofilms, where the tension between maximizing the transfer of information and guaranteeing the well-being of the overall bacterial community arises.
△ Less
Submitted 16 January, 2015;
originally announced January 2015.
-
Optimal Adaptive Random Multiaccess in Energy Harvesting Wireless Sensor Networks
Authors:
Nicolo Michelusi,
Michele Zorzi
Abstract:
Wireless sensors can integrate rechargeable batteries and energy-harvesting (EH) devices to enable long-term, autonomous operation, thus requiring intelligent energy management to limit the adverse impact of energy outages. This work considers a network of EH wireless sensors, which report packets with a random utility value to a fusion center (FC) over a shared wireless channel. Decentralized acc…
▽ More
Wireless sensors can integrate rechargeable batteries and energy-harvesting (EH) devices to enable long-term, autonomous operation, thus requiring intelligent energy management to limit the adverse impact of energy outages. This work considers a network of EH wireless sensors, which report packets with a random utility value to a fusion center (FC) over a shared wireless channel. Decentralized access schemes are designed, where each node performs a local decision to transmit/discard a packet, based on an estimate of the packet's utility, its own energy level, and the scenario state of the EH process, with the objective to maximize the average long-term aggregate utility of the packets received at the FC. Due to the non-convex structure of the problem, an approximate optimization is developed by resorting to a mathematical artifice based on a game theoretic formulation of the multiaccess scheme, where the nodes do not behave strategically, but rather attempt to maximize a \emph{common} network utility with respect to their own policy. The symmetric Nash equilibrium (SNE) is characterized, where all nodes employ the same policy; its uniqueness is proved, and it is shown to be a local maximum of the original problem. An algorithm to compute the SNE is presented, and a heuristic scheme is proposed, which is optimal for large battery capacity. It is shown numerically that the SNE typically achieves near-optimal performance, within 3% of the optimal policy, at a fraction of the complexity, and two operational regimes of EH-networks are identified and analyzed: an energy-limited scenario, where energy is scarce and the channel is under-utilized, and a network-limited scenario, where energy is abundant and the shared wireless channel represents the bottleneck of the system.
△ Less
Submitted 20 February, 2015; v1 submitted 15 October, 2014;
originally announced October 2014.
-
A Stochastic Model for Electron Transfer in Bacterial Cables
Authors:
Nicolo Michelusi,
Sahand Pirbadian,
Mohamed Y. El-Naggar,
Urbashi Mitra
Abstract:
Biological systems are known to communicate by diffusing chemical signals in the surrounding medium. However, most of the recent literature has neglected the electron transfer mechanism occurring amongst living cells, and its role in cell-cell communication. Each cell relies on a continuous flow of electrons from its electron donor to its electron acceptor through the electron transport chain to p…
▽ More
Biological systems are known to communicate by diffusing chemical signals in the surrounding medium. However, most of the recent literature has neglected the electron transfer mechanism occurring amongst living cells, and its role in cell-cell communication. Each cell relies on a continuous flow of electrons from its electron donor to its electron acceptor through the electron transport chain to produce energy in the form of the molecule adenosine triphosphate, and to sustain the cell's vital operations and functions. While the importance of biological electron transfer is well-known for individual cells, the past decade has also brought about remarkable discoveries of multi-cellular microbial communities that transfer electrons between cells and across centimeter length scales, e.g., biofilms and multi-cellular bacterial cables. These experimental observations open up new frontiers in the design of electron-based communications networks in microbial communities, which may coexist with the more well-known communication strategies based on molecular diffusion, while benefiting from a much shorter communication delay. This paper develops a stochastic model that links the electron transfer mechanism to the energetic state of the cell. The model is also extensible to larger communities, by allowing for electron exchange between neighboring cells. Moreover, the parameters of the stochastic model are fit to experimental data available in the literature, and are shown to provide a good fit.
△ Less
Submitted 20 February, 2015; v1 submitted 7 October, 2014;
originally announced October 2014.
-
Cognitive Access Policies under a Primary ARQ process via Forward-Backward Interference Cancellation
Authors:
Nicolò Michelusi,
Petar Popovski,
Osvaldo Simeone,
Marco Levorato,
Michele Zorzi
Abstract:
This paper introduces a novel technique for access by a cognitive Secondary User (SU) using best-effort transmission to a spectrum with an incumbent Primary User (PU), which uses Type-I Hybrid ARQ. The technique leverages the primary ARQ protocol to perform Interference Cancellation (IC) at the SU receiver (SUrx). Two IC mechanisms that work in concert are introduced: Forward IC, where SUrx, after…
▽ More
This paper introduces a novel technique for access by a cognitive Secondary User (SU) using best-effort transmission to a spectrum with an incumbent Primary User (PU), which uses Type-I Hybrid ARQ. The technique leverages the primary ARQ protocol to perform Interference Cancellation (IC) at the SU receiver (SUrx). Two IC mechanisms that work in concert are introduced: Forward IC, where SUrx, after decoding the PU message, cancels its interference in the (possible) following PU retransmissions of the same message, to improve the SU throughput; Backward IC, where SUrx performs IC on previous SU transmissions, whose decoding failed due to severe PU interference. Secondary access policies are designed that determine the secondary access probability in each state of the network so as to maximize the average long-term SU throughput by opportunistically leveraging IC, while causing bounded average long-term PU throughput degradation and SU power expenditure. It is proved that the optimal policy prescribes that the SU prioritizes its access in the states where SUrx knows the PU message, thus enabling IC. An algorithm is provided to optimally allocate additional secondary access opportunities in the states where the PU message is unknown. Numerical results are shown to assess the throughput gain provided by the proposed techniques.
△ Less
Submitted 30 January, 2013;
originally announced February 2013.