Search | arXiv e-print repository

Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization

Authors: Rui Sun, Li You, An-An Lu, Chen Sun, Xiqi Gao, Xiang-Gen Xia

Abstract: In this paper, we investigate the precoder design for user-centric network (UCN) massive multiple-input multiple-output (mMIMO) downlink with matrix manifold optimization. In UCN mMIMO systems, each user terminal (UT) is served by a subset of base stations (BSs) instead of all the BSs, facilitating the implementation of the system and lowering the dimension of the precoders to be designed. By prov… ▽ More In this paper, we investigate the precoder design for user-centric network (UCN) massive multiple-input multiple-output (mMIMO) downlink with matrix manifold optimization. In UCN mMIMO systems, each user terminal (UT) is served by a subset of base stations (BSs) instead of all the BSs, facilitating the implementation of the system and lowering the dimension of the precoders to be designed. By proving that the precoder set satisfying the per-BS power constraints forms a Riemannian submanifold of a linear product manifold, we transform the constrained precoder design problem in Euclidean space to an unconstrained one on the Riemannian submanifold. Riemannian ingredients, including orthogonal projection, Riemannian gradient, retraction and vector transport, of the problem on the Riemannian submanifold are further derived, with which the Riemannian conjugate gradient (RCG) design method is proposed for solving the unconstrained problem. The proposed method avoids the inverses of large dimensional matrices, which is beneficial in practice. The complexity analyses show the high computational efficiency of RCG precoder design. Simulation results demonstrate the numerical superiority of the proposed precoder design and the high efficiency of the UCN mMIMO system. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 13 pages, 9 figures, journal

arXiv:2403.12487 [pdf]

Unveiling Four Key Factors for Tire Force Control Allocation in 4WID-4WIS Electric Vehicles at Handling Limits

Authors: Ao Lu, Runfeng Li, Yunchang Yu, Ziwang Lu, Guangyu Tian

Abstract: The four-wheel independent drive and four-wheel independent steering (4WID-4WIS) configurations enhance control flexibility and dynamic performance potential for more integrated electric vehicles. This paper comprehensively analyzes the impacts of four key factors on tire force control allocation: vertical load estimation, actuator dynamic characteristics, tire force constraints, and wheel steerin… ▽ More The four-wheel independent drive and four-wheel independent steering (4WID-4WIS) configurations enhance control flexibility and dynamic performance potential for more integrated electric vehicles. This paper comprehensively analyzes the impacts of four key factors on tire force control allocation: vertical load estimation, actuator dynamic characteristics, tire force constraints, and wheel steering precision at handling limits. The study demonstrates that precise vertical load estimation enhances lateral force allocation accuracy. Additionally, the self-compensating effect of lateral tire forces minimizes the impact of small deviations in vertical load estimation on tire force control allocation. A novel control allocation method considering actuator dynamics is introduced, effectively improving yaw rate response and reducing tracking errors. Considering tire-road adhesion and actuator rate constraints, an innovative method to calculate the real-time attainable tire force volume is proposed based on the tire slip ratio and slip angle. Feedforward control with bump steer compensation is implemented to improve wheel steering precision and lateral tire force control accuracy. Matlab/Simulink and Carsim co-simulation results emphasize the importance of these key factors' individual impacts and combined effects. This analysis offers valuable insights for develo** advanced tire force control allocation strategies in 4WID-4WIS electric vehicles. △ Less

Submitted 19 March, 2024; originally announced March 2024.

arXiv:2310.00587 [pdf, other]

doi 10.1109/IWAENC53105.2022.9914771

Mechatronic Generation of Datasets for Acoustics Research

Authors: Austin Lu, Ethaniel Moore, Arya Nallanthighall, Kanad Sarkar, Manan Mittal, Ryan M. Corey, Paris Smaragdis, Andrew Singer

Abstract: We address the challenge of making spatial audio datasets by proposing a shared mechanized recording space that can run custom acoustic experiments: a Mechatronic Acoustic Research System (MARS). To accommodate a wide variety of experiments, we implement an extensible architecture for wireless multi-robot coordination which enables synchronized robot motion for dynamic scenes with moving speakers… ▽ More We address the challenge of making spatial audio datasets by proposing a shared mechanized recording space that can run custom acoustic experiments: a Mechatronic Acoustic Research System (MARS). To accommodate a wide variety of experiments, we implement an extensible architecture for wireless multi-robot coordination which enables synchronized robot motion for dynamic scenes with moving speakers and microphones. Using a virtual control interface, we can remotely design automated experiments to collect large-scale audio data. This data is shown to be similar across repeated runs, demonstrating the reliability of MARS. We discuss the potential for MARS to make audio data collection accessible for researchers without dedicated acoustic research spaces. △ Less

Submitted 1 October, 2023; originally announced October 2023.

Comments: 5 pages, 5 figures, IWAENC 2022

arXiv:2304.00201 [pdf, ps, other]

doi 10.1109/TSP.2024.3364914

Precoder Design for Massive MIMO Downlink with Matrix Manifold Optimization

Authors: Rui Sun, Chen Wang, An-An Lu, Xiqi Gao, Xiang-Gen Xia

Abstract: We investigate the weighted sum-rate (WSR) maximization linear precoder design for massive multiple-input multiple-output (MIMO) downlink. We consider a single-cell system with multiple users and propose a unified matrix manifold optimization framework applicable to total power constraint (TPC), per-user power constraint (PUPC) and per-antenna power constraint (PAPC). We prove that the precoders u… ▽ More We investigate the weighted sum-rate (WSR) maximization linear precoder design for massive multiple-input multiple-output (MIMO) downlink. We consider a single-cell system with multiple users and propose a unified matrix manifold optimization framework applicable to total power constraint (TPC), per-user power constraint (PUPC) and per-antenna power constraint (PAPC). We prove that the precoders under TPC, PUPC and PAPC are on distinct Riemannian submanifolds, and transform the constrained problems in Euclidean space to unconstrained ones on manifolds. In accordance with this, we derive Riemannian ingredients, including orthogonal projection, Riemannian gradient, Riemannian Hessian, retraction and vector transport, which are needed for precoder design in the matrix manifold framework. Then, Riemannian design methods using Riemannian steepest descent, Riemannian conjugate gradient and Riemannian trust region are provided to design the WSR-maximization precoders under TPC, PUPC or PAPC. Riemannian methods do not involve the inverses of the large dimensional matrices during the iterations, reducing the computational complexities of the algorithms. Complexity analyses and performance simulations demonstrate the advantages of the proposed precoder design. △ Less

Submitted 10 April, 2024; v1 submitted 31 March, 2023; originally announced April 2023.

Comments: 16 pages, 11 figures, journal

Journal ref: IEEE Transactions on Signal Processing, vol. 72, pp. 1065-1080, 2024

arXiv:2212.14156 [pdf, other]

Decentralized Voltage Control with Peer-to-peer Energy Trading in a Distribution Network

Authors: Chen Feng, Andrew L. Lu, Yihsu Chen

Abstract: Utilizing distributed renewable and energy storage resources via peer-to-peer (P2P) energy trading has long been touted as a solution to improve energy system's resilience and sustainability. Consumers and prosumers (those who have energy generation resources), however, do not have expertise to engage in repeated P2P trading, and the zero-marginal costs of renewables present challenges in determin… ▽ More Utilizing distributed renewable and energy storage resources via peer-to-peer (P2P) energy trading has long been touted as a solution to improve energy system's resilience and sustainability. Consumers and prosumers (those who have energy generation resources), however, do not have expertise to engage in repeated P2P trading, and the zero-marginal costs of renewables present challenges in determining fair market prices. To address these issues, we propose a multi-agent reinforcement learning (MARL) framework to help automate consumers' bidding and management of their solar PV and energy storage resources, under a specific P2P clearing mechanism that utilizes the so-called supply-demand ratio. In addition, we show how the MARL framework can integrate physical network constraints to realize decentralized voltage control, hence ensuring physical feasibility of the P2P energy trading and paving ways for real-world implementations. △ Less

Submitted 28 December, 2022; originally announced December 2022.

arXiv:2211.05256 [pdf, other]

Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report

Authors: Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang, Hsien-Kai Kuo, Yu-Syuan Xu, Man-Yu Lee, Allen Lu, Chia-Ming Cheng, Chih-Cheng Chen, Jia-Ying Yong, Hong-Han Shuai, Wen-Huang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang , et al. (29 additional authors not shown)

Abstract: Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices. In this Mobile AI challenge, we address this prob… ▽ More Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices. In this Mobile AI challenge, we address this problem and propose the participants to design an end-to-end real-time video super-resolution solution for mobile NPUs optimized for low energy consumption. The participants were provided with the REDS training dataset containing video sequences for a 4X video upscaling task. The runtime and power efficiency of all models was evaluated on the powerful MediaTek Dimensity 9000 platform with a dedicated AI processing unit capable of accelerating floating-point and quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating an up to 500 FPS rate and 0.2 [Watt / 30 FPS] power consumption. A detailed description of all models developed in the challenge is provided in this paper. △ Less

Submitted 7 November, 2022; originally announced November 2022.

Comments: arXiv admin note: text overlap with arXiv:2105.08826, arXiv:2105.07809, arXiv:2211.04470, arXiv:2211.03885

arXiv:2208.05163 [pdf, other]

Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization

Authors: Zhengang Li, Mengshu Sun, Alec Lu, Haoyu Ma, Geng Yuan, Yanyue Xie, Hao Tang, Yanyu Li, Miriam Leeser, Zhangyang Wang, Xue Lin, Zhenman Fang

Abstract: Vision transformers (ViTs) are emerging with significantly improved accuracy in computer vision tasks. However, their complex architecture and enormous computation/storage demand impose urgent needs for new hardware accelerator design methodology. This work proposes an FPGA-aware automatic ViT acceleration framework based on the proposed mixed-scheme quantization. To the best of our knowledge, thi… ▽ More Vision transformers (ViTs) are emerging with significantly improved accuracy in computer vision tasks. However, their complex architecture and enormous computation/storage demand impose urgent needs for new hardware accelerator design methodology. This work proposes an FPGA-aware automatic ViT acceleration framework based on the proposed mixed-scheme quantization. To the best of our knowledge, this is the first FPGA-based ViT acceleration framework exploring model quantization. Compared with state-of-the-art ViT quantization work (algorithmic approach only without hardware acceleration), our quantization achieves 0.47% to 1.36% higher Top-1 accuracy under the same bit-width. Compared with the 32-bit floating-point baseline FPGA accelerator, our accelerator achieves around 5.6x improvement on the frame rate (i.e., 56.8 FPS vs. 10.0 FPS) with 0.71% accuracy drop on ImageNet dataset for DeiT-base. △ Less

Submitted 10 August, 2022; originally announced August 2022.

Comments: Published in FPL2022

arXiv:2208.01142 [pdf]

doi 10.1016/j.sse.2022.108468

Vertical GaN Diode BV Maximization through Rapid TCAD Simulation and ML-enabled Surrogate Model

Authors: Albert Lu, Jordan Marshall, Yifan Wang, Ming Xiao, Yuhao Zhang, Hiu Yung Wong

Abstract: In this paper, two methodologies are used to speed up the maximization of the breakdown volt-age (BV) of a vertical GaN diode that has a theoretical maximum BV of ~2100V. Firstly, we demonstrated a 5X faster accurate simulation method in Technology Computer-Aided-Design (TCAD). This allows us to find 50% more numbers of high BV (>1400V) designs at a given simulation time. Secondly, a machine learn… ▽ More In this paper, two methodologies are used to speed up the maximization of the breakdown volt-age (BV) of a vertical GaN diode that has a theoretical maximum BV of ~2100V. Firstly, we demonstrated a 5X faster accurate simulation method in Technology Computer-Aided-Design (TCAD). This allows us to find 50% more numbers of high BV (>1400V) designs at a given simulation time. Secondly, a machine learning (ML) model is developed using TCAD-generated data and used as a surrogate model for differential evolution optimization. It can inversely design an out-of-the-training-range structure with BV as high as 1887V (89% of the ideal case) compared to ~1100V designed with human domain expertise. △ Less

Submitted 18 July, 2022; originally announced August 2022.

Comments: 4 pages, 7 figures

arXiv:2110.10714 [pdf, other]

Auction Design through Multi-Agent Learning in Peer-to-Peer Energy Trading

Authors: Zibo Zhao, Chen Feng, Andrew L. Lu

Abstract: Distributed energy resources (DERs), such as rooftop solar panels, are growing rapidly and are resha** power systems. To promote DERs, feed-in-tariff (FIT) is usually adopted by utilities to pay DER owners certain fixed rates for supplying energy to the grid. An alternative to FIT is a market-based approach; that is, consumers and DER owners trade energy in an auction-based peer-to-peer (P2P) ma… ▽ More Distributed energy resources (DERs), such as rooftop solar panels, are growing rapidly and are resha** power systems. To promote DERs, feed-in-tariff (FIT) is usually adopted by utilities to pay DER owners certain fixed rates for supplying energy to the grid. An alternative to FIT is a market-based approach; that is, consumers and DER owners trade energy in an auction-based peer-to-peer (P2P) market, and the rates are determined based on supply and demand. However, the auction complexity and market participants' bounded rationality may invalidate many well-established theories on auction design and hinder market development. To address the challenges, we propose an automated bidding framework based on multi-agent, multi-armed bandit learning for repeated auctions, which aims to minimize each bidder's cumulative regret. Numerical results indicate convergence of such a multi-agent learning game to a steady-state. Being particularly interested in auction designs, we have applied the framework to four different implementations of repeated double-side auctions to compare their market outcomes. While it is difficult to pick a clear winner, $k$-double auction (a variant of uniform pricing auction) and McAfee auction (a variant of Vickrey double-auction) appear to perform well in general, with their respective strengths and weaknesses. △ Less

Submitted 20 October, 2021; originally announced October 2021.

arXiv:2103.02025 [pdf]

Rightsizing the Railway Signal Workforce: a Zero-Based Resourcing Approach Towards Asset Management

Authors: Alex Lu, Zhiqi Zhong, Thomas Barger, Michael Brotzman

Abstract: Classic asset management approaches begin by inventorying all infrastructure assets and then assigning maintenance tasks and resources. Our approach collects similar data, but by starting with current personnel assignment and describing their job responsibilities and work processes, staff resistance in a railroad infrastructure owner-operator environment is minimized. Resulting "manning model" qua… ▽ More Classic asset management approaches begin by inventorying all infrastructure assets and then assigning maintenance tasks and resources. Our approach collects similar data, but by starting with current personnel assignment and describing their job responsibilities and work processes, staff resistance in a railroad infrastructure owner-operator environment is minimized. Resulting "manning model" quantitatively measures signal maintenance burden including Federally mandated tests, trouble tickets, non-FRA maintenance, overhead and vacation coverage, location/shift assignment, administrative process, and work curfew productivity losses. It is capable of delivering immediate results by rightsizing allocation of workforce across shifts and maintenance base locations--even before all assets are formally inventoried. Typical data from a commuter passenger railroad shows that work curfews and shift assignment constraints have significant impacts on workforce productivity. Just over half of signal maintenance employee-hours are devoted to Federally mandated tests, whilst non-FRA and repair maintenance consumes about 25% each. These indicators provide intelligence driving strategic management actions to improve signal maintenance cost-effectiveness. This model provides workload-based employee assignment by craft, location, gang, and shift for maintenance manager use, but also provides analytical basis for establishing or abolishing positions in the budgeting process. Comparing its results with current employee payroll provides a measure of how much staffing stress the maintenance organization is under, which can help measure whether the current overtime usage is appropriate. Asset and maintenance task inventories collected in this process can also feed normal asset management processes to assess replacement cycles, asset failure risk, and to inform strategic and investment decisions. △ Less

Submitted 2 March, 2021; originally announced March 2021.

Comments: 22 pages, 12 figures

arXiv:2102.04517 [pdf]

Power Off! Challenges in Planning and Executing Power Isolations on Shared-Use Electrified Railways

Authors: Alex Lu, Aleksandr Lukatskiy, Zhiqi Zhong, John G. Allen

Abstract: Electric railways are fast, clean, and safe, but complex to operate and maintain. Electric traction infrastructure includes signal power and feeder lines that remain live during isolations and complicate maintenance processes. Stakeholders involved in power outage planning include contractors, linemen, groundmen, power directors, dispatchers, conductor-flag, and support personnel. Weekly planning… ▽ More Electric railways are fast, clean, and safe, but complex to operate and maintain. Electric traction infrastructure includes signal power and feeder lines that remain live during isolations and complicate maintenance processes. Stakeholders involved in power outage planning include contractors, linemen, groundmen, power directors, dispatchers, conductor-flag, and support personnel. Weekly planning processes for track time requires many contingencies due to large number of moving parts and factors not known in advance, like personnel availability. Electrical and mechanical environments faced by crews working in adjacent areas may be entirely different and require a "bespoke" circuit configuration to de-energize catenary, which must be planned meticulously. Although recent automation improved real-time "plate order" communications between power directors and dispatchers, each outage still requires many manual switching operations. Net impact of this isolation process reduces available construction work windows nightly from a nominal 7 hours to 2 hrs 39 mins. We recommend joint design of electrical and civil infrastructure, cross-training between disciplines, limiting maximum number of concurrent outages, formal study of maintenance outage capacity, and further automation in power switching. △ Less

Submitted 8 February, 2021; originally announced February 2021.

Comments: 26 pages, 6 figures

arXiv:2006.08532 [pdf, other]

Improved Conditional Flow Models for Molecule to Image Synthesis

Authors: Karren Yang, Samuel Goldman, Wengong **, Alex Lu, Regina Barzilay, Tommi Jaakkola, Caroline Uhler

Abstract: In this paper, we aim to synthesize cell microscopy images under different molecular interventions, motivated by practical applications to drug development. Building on the recent success of graph neural networks for learning molecular embeddings and flow-based models for image generation, we propose Mol2Image: a flow-based generative model for molecule to cell image synthesis. To generate cell fe… ▽ More In this paper, we aim to synthesize cell microscopy images under different molecular interventions, motivated by practical applications to drug development. Building on the recent success of graph neural networks for learning molecular embeddings and flow-based models for image generation, we propose Mol2Image: a flow-based generative model for molecule to cell image synthesis. To generate cell features at different resolutions and scale to high-resolution images, we develop a novel multi-scale flow architecture based on a Haar wavelet image pyramid. To maximize the mutual information between the generated images and the molecular interventions, we devise a training strategy based on contrastive learning. To evaluate our model, we propose a new set of metrics for biological image generation that are robust, interpretable, and relevant to practitioners. We show quantitatively that our method learns a meaningful embedding of the molecular intervention, which is translated into an image representation reflecting the biological effects of the intervention. △ Less

Submitted 15 June, 2020; originally announced June 2020.

MSC Class: 92-08

Showing 1–12 of 12 results for author: Lu, A