-
FastFlip: Compositional Error Injection Analysis
Authors:
Keyur Joshi,
Rahul Singh,
Tommaso Bassetto,
Sarita Adve,
Darko Marinov,
Sasa Misailovic
Abstract:
Instruction-level error injection analyses aim to find instructions where errors often lead to unacceptable outcomes like Silent Data Corruptions (SDCs). These analyses require significant time, which is especially problematic if developers wish to regularly analyze software that evolves over time.
We present FastFlip, a combination of empirical error injection and symbolic SDC propagation analyses that enables fast, compositional error injection analysis of evolving programs. FastFlip calculates how SDCs propagate across program sections and correctly accounts for unexpected side effects that can occur due to errors. Using FastFlip, we analyze five benchmarks, plus two modified versions of each benchmark. FastFlip speeds up the analysis of incrementally modified programs by $3.2\times$ (geomean). FastFlip selects a set of instructions to protect against SDCs that minimizes the runtime cost of protection while protecting against a developer-specified target fraction of all SDC-causing errors.
Submitted 26 March, 2024; v1 submitted 20 March, 2024;
originally announced March 2024.
-
Coordinated Science Laboratory 70th Anniversary Symposium: The Future of Computing
Authors:
Klara Nahrstedt,
Naresh Shanbhag,
Vikram Adve,
Nancy Amato,
Romit Roy Choudhury,
Carl Gunter,
Nam Sung Kim,
Olgica Milenkovic,
Sayan Mitra,
Lav Varshney,
Yurii Vlasov,
Sarita Adve,
Rashid Bashir,
Andreas Cangellaris,
James DiCarlo,
Katie Driggs-Campbell,
Nick Feamster,
Mattia Gazzola,
Karrie Karahalios,
Sanmi Koyejo,
Paul Kwiat,
Bo Li,
Negar Mehr,
Ravish Mehra,
Andrew Miller, et al. (3 additional authors not shown)
Abstract:
In 2021, the Coordinated Science Laboratory (CSL), an Interdisciplinary Research Unit at the University of Illinois Urbana-Champaign, hosted the Future of Computing Symposium to celebrate its 70th anniversary. CSL's research covers the full computing stack, computing's impact on society, and the resulting need for social responsibility. In this white paper, we summarize the major technological points, insights, and directions that speakers brought forward during the Future of Computing Symposium.
Participants discussed topics related to new computing paradigms, technologies, algorithms, behaviors, and research challenges to be expected in the future. The symposium focused on new computing paradigms that go beyond traditional computing and the research needed to support their realization. These needs included an emphasis on security and privacy, end-to-end human cyber-physical systems, and with them the analysis of end-to-end artificial intelligence needs. Furthermore, with advances that enable immersive environments for users, the boundaries between humans and machines will blur and become seamless. Particular integration challenges were made clear in the final discussion on the integration of autonomous driving, robo taxis, pedestrians, and future cities. Innovative approaches were outlined to motivate the next generation of researchers to work on these challenges.
The discussion brought out the importance of considering not just individual research areas, but innovations at the intersections between computing research efforts and relevant application domains, such as health care, transportation, energy systems, and manufacturing.
Submitted 4 October, 2022;
originally announced October 2022.
-
On-Device CPU Scheduling for Sense-React Systems
Authors:
Aditi Partap,
Samuel Grayson,
Muhammad Huzaifa,
Sarita Adve,
Brighten Godfrey,
Saurabh Gupta,
Kris Hauser,
Radhika Mittal
Abstract:
Sense-react systems (e.g. robotics and AR/VR) have to take highly responsive real-time actions, driven by complex decisions involving a pipeline of sensing, perception, planning, and reaction tasks. These tasks must be scheduled on resource-constrained devices such that the performance goals and the requirements of the application are met. This is a difficult scheduling problem that requires handling multiple scheduling dimensions, and variations in resource usage and availability. In practice, system designers manually tune parameters for their specific hardware and application, which results in poor generalization and increases the development burden. In this work, we highlight the emerging need for scheduling CPU resources at runtime in sense-react systems. We study three canonical applications (face tracking, robot navigation, and VR) to first understand the key scheduling requirements for such systems. Armed with this understanding, we develop a scheduling framework, Catan, that dynamically schedules compute resources across different components of an app so as to meet the specified application requirements. Through experiments with a prototype implemented on a widely-used robotics framework (ROS) and an open-source AR/VR platform, we show the impact of system scheduling on meeting the performance goals for the three applications, how Catan is able to achieve better application performance than hand-tuned configurations, and how it dynamically adapts to runtime variations.
Submitted 14 August, 2022; v1 submitted 27 July, 2022;
originally announced July 2022.
-
Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration
Authors:
Georgios Zacharopoulos,
Adel Ejjeh,
Ying Jing,
En-Yu Yang,
Tianyu Jia,
Iulian Brumar,
Jeremy Intan,
Muhammad Huzaifa,
Sarita Adve,
Vikram Adve,
Gu-Yeon Wei,
David Brooks
Abstract:
The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for various forms of parallel execution, including loop level, task level and pipeline parallelism. To assist the design process and expose every possible level of parallelism, we present Trireme, a fully automated tool-chain that explores multiple levels of parallelism and produces domain specific accelerator designs and configurations that maximize performance, given an area budget. Experiments on demanding benchmarks from the XR domain revealed a speedup of up to 20x, as well as a speedup of up to 37x for smaller applications, compared to software-only implementations.
Submitted 21 January, 2022;
originally announced January 2022.
-
Statistical Program Slicing: a Hybrid Slicing Technique for Analyzing Deployed Software
Authors:
Bogdan Alexandru Stoica,
Swarup K. Sahoo,
James R. Larus,
Vikram S. Adve
Abstract:
Dynamic program slicing can significantly reduce the code developers need to inspect by narrowing it down to only a subset of relevant program statements. However, despite an extensive body of research showing its usefulness, dynamic slicing still falls short of production-level use due to the high cost of runtime instrumentation.
As an alternative, we propose statistical program slicing, a novel hybrid dynamic-static slicing technique that explores the trade-off between accuracy and runtime cost. Our approach relies on modern hardware support for control flow monitoring and a novel, cooperative heap memory tracing mechanism combined with static program analysis for data flow tracking. We evaluate statistical slicing for debugging on 21 failures from 6 widely deployed applications and show it recovers 94% of the program statements on a dynamic slice with only 5% overhead.
Submitted 31 December, 2021;
originally announced January 2022.
-
A Case for Fine-grain Coherence Specialization in Heterogeneous Systems
Authors:
Johnathan Alsop,
Weon Taek Na,
Matthew D. Sinclair,
Samuel Grayson,
Sarita V. Adve
Abstract:
Hardware specialization is becoming a key enabler of energy-efficient performance. Future systems will be increasingly heterogeneous, integrating multiple specialized and programmable accelerators, each with different memory demands. Traditionally, communication between accelerators has been inefficient, typically orchestrated through explicit DMA transfers between different address spaces. More recently, industry has proposed unified coherent memory which enables implicit data movement and more data reuse, but often these interfaces limit the coherence flexibility available to heterogeneous systems. This paper demonstrates the benefits of fine-grained coherence specialization for heterogeneous systems. We propose an architecture that enables low-complexity independent specialization of each individual coherence request in heterogeneous workloads by building upon a simple and flexible baseline coherence interface, Spandex. We then describe how to optimize individual memory requests to improve cache reuse and performance-critical memory latency in emerging heterogeneous workloads. Collectively, our techniques enable significant gains, reducing execution time by up to 61% or network traffic by up to 99% while adding minimal complexity to the Spandex protocol.
Submitted 23 April, 2021;
originally announced April 2021.
-
Modelling the Role of Inter-cellular Communication in Modulating Photosynthesis in Plants
Authors:
Hamdan Awan,
Raviraj S. Adve,
Nigel Wallbridge,
Carrol Plummer,
Andrew W. Eckford
Abstract:
In this paper we show how inter-cellular molecular communication may change the overall levels of photosynthesis in plants. Individual plant cells respond to external stimuli, such as illumination levels, to regulate their photosynthetic output. Here, we present a mathematical model which shows that by sharing information internally using molecular communication, plants may increase overall photosynthate production. Numerical results show that higher mutual information between cells corresponds to an increase in overall photosynthesis by as much as 25 per cent. This suggests that molecular communication plays a vital role in maximising the photosynthesis in plants and therefore suggests new routes to influence plant development in agriculture and elsewhere.
Submitted 29 March, 2021;
originally announced March 2021.
-
Exploring Extended Reality with ILLIXR: A New Playground for Architecture Research
Authors:
Muhammad Huzaifa,
Rishi Desai,
Samuel Grayson,
Xutao Jiang,
Ying Jing,
Jae Lee,
Fang Lu,
Yihan Pang,
Joseph Ravichandran,
Finn Sinclair,
Boyuan Tian,
Hengzhi Yuan,
Jeffrey Zhang,
Sarita V. Adve
Abstract:
As we enter the era of domain-specific architectures, systems researchers must understand the requirements of emerging application domains. Augmented and virtual reality (AR/VR) or extended reality (XR) is one such important domain. This paper presents ILLIXR, the first open source end-to-end XR system (1) with state-of-the-art components, (2) integrated with a modular and extensible multithreaded runtime, (3) providing an OpenXR compliant interface to XR applications (e.g., game engines), and (4) with the ability to report (and trade off) several quality of experience (QoE) metrics. We analyze performance, power, and QoE metrics for the complete ILLIXR system and for its individual components. Our analysis reveals several properties with implications for architecture and systems research. These include demanding performance, power, and QoE requirements, a large diversity of critical tasks, inter-dependent execution pipelines with challenges in scheduling and resource management, and a large tradeoff space between performance/power and human perception related QoE metrics. ILLIXR and our analysis have the potential to propel new directions in architecture and systems research in general, and impact XR in particular. ILLIXR is open-source and available at https://illixr.github.io
Submitted 2 March, 2021; v1 submitted 25 March, 2020;
originally announced April 2020.
-
Specializing Coherence, Consistency, and Push/Pull for GPU Graph Analytics
Authors:
Giordano Salvador,
Wesley H. Darvin,
Muhammad Huzaifa,
Johnathan Alsop,
Matthew D. Sinclair,
Sarita V. Adve
Abstract:
This work provides the first study to explore the interaction of update propagation with and without fine-grained synchronization (push vs. pull), emerging coherence protocols (GPU vs. DeNovo coherence), and software-centric consistency models (DRF0, DRF1, and DRFrlx) for graph workloads on emerging integrated GPU-CPU systems with native unified shared memory. We study 6 graph applications with 6 graph inputs for a total of 36 workloads running on 12 system (hardware+software) configurations reflecting the above design space of update propagation, coherence, and memory consistency. We make three key contributions. First, we show that there is no single best system configuration for all workloads, motivating systems with flexible coherence and consistency support. Second, we develop a model to accurately predict the best system configuration -- this model can be used by software designers to decide on push vs. pull and the consistency model and by flexible hardware to invoke the appropriate coherence and consistency configuration for the given workload. Third, we show that the design dimensions explored here are inter-dependent, reinforcing the need for software-hardware co-design in the above design dimensions. For example, software designers deciding on push vs. pull must consider the consistency model supported by hardware -- in some cases, push may be better if hardware supports DRFrlx while pull may be better if hardware does not support DRFrlx.
Submitted 25 February, 2020; v1 submitted 19 February, 2020;
originally announced February 2020.
-
HarDNN: Feature Map Vulnerability Evaluation in CNNs
Authors:
Abdulrahman Mahmoud,
Siva Kumar Sastry Hari,
Christopher W. Fletcher,
Sarita V. Adve,
Charbel Sakr,
Naresh Shanbhag,
Pavlo Molchanov,
Michael B. Sullivan,
Timothy Tsai,
Stephen W. Keckler
Abstract:
As Convolutional Neural Networks (CNNs) are increasingly being employed in safety-critical applications, it is important that they behave reliably in the face of hardware errors. Transient hardware errors may percolate undesirable state during execution, resulting in software-manifested errors which can adversely affect high-level decision making. This paper presents HarDNN, a software-directed approach to identify vulnerable computations during a CNN inference and selectively protect them based on their propensity towards corrupting the inference output in the presence of a hardware error. We show that HarDNN can accurately estimate relative vulnerability of a feature map (fmap) in CNNs using a statistical error injection campaign, and explore heuristics for fast vulnerability assessment. Based on these results, we analyze the tradeoff between error coverage and computational overhead that the system designers can use to employ selective protection. Results show that the improvement in resilience for the added computation is superlinear with HarDNN. For example, HarDNN improves SqueezeNet's resilience by 10x with just 30% additional computations.
Submitted 25 February, 2020; v1 submitted 22 February, 2020;
originally announced February 2020.
-
Communication in Plants: Comparison of Multiple Action Potential and Mechanosensitive Signals with Experiments
Authors:
Hamdan Awan,
Kareem Zeid,
Raviraj S. Adve,
Nigel Wallbridge,
Carrol Plummer,
Andrew W. Eckford
Abstract:
Both action potentials and mechanosensitive signalling are important communication mechanisms in plants. Considering an information theoretic framework, this paper explores the effective range of multiple action potentials for a long chain of cells (i.e., up to 100) in different configurations, and introduces the study of multiple mechanosensitive activation signals (generated due to a mechanical stimulus) in plants. For both these signals, we find that the mutual information per cell and information propagation speed tend to increase up to a certain number of receiver cells. However, as the number of cells increases beyond 10 to 12, the mutual information per cell starts to decrease. To validate our model and results, we include an experimental verification of the theoretical model, using a PhytlSigns biosignal amplifier, allowing us to measure the magnitude of the voltage associated with the multiple AP and mechanosensitive activation signals induced by different stimuli in plants. Experimental data is used to calculate the mutual information and information propagation speed, which is compared with corresponding numerical results. Since these signals are used for a variety of important tasks within the plant, understanding them may lead to new bioengineering methods for plants.
Submitted 12 November, 2019;
originally announced November 2019.
-
Scheduling for VoLTE: Resource Allocation Optimization and Low-Complexity Algorithms
Authors:
Maryam Mohseni,
S. Alireza Banani,
Andrew W. Eckford,
Raviraj S. Adve
Abstract:
We consider scheduling and resource allocation in long-term evolution (LTE) networks across voice over LTE (VoLTE) and best-effort data users. The difference between these two is that VoLTE users get scheduling priority to receive their required quality of service. As we show, strict priority causes data services to suffer. We propose new scheduling and resource allocation algorithms to maximize the sum- or proportional fair (PF) throughput amongst data users while meeting VoLTE demands. Essentially, we use VoLTE as an example application with both a guaranteed bit-rate and strict application-specific requirements. We first formulate and solve the frame-level optimization problem for throughput maximization; however, this leads to an integer problem coupled across the LTE transmission time intervals (TTIs). We then propose a TTI-level problem to decouple scheduling across TTIs. Finally, we propose a heuristic with extremely low complexity. The formulations illustrate the detail required to realize resource allocation in an implemented standard. Numerical results show that the performance of the TTI-level scheme is very close to that of the frame-level upper bound. Similarly, the heuristic scheme works well compared to TTI-level optimization and a baseline scheduling algorithm. Finally, we show that our PF optimization retains the high fairness index characterizing PF-scheduling.
Submitted 7 January, 2019;
originally announced January 2019.
-
Communication and Information Theory of Single Action Potential Signals in Plants
Authors:
Hamdan Awan,
Raviraj S. Adve,
Nigel Wallbridge,
Carrol Plummer,
Andrew W. Eckford
Abstract:
Many plants, such as Mimosa pudica (the sensitive plant), employ electrochemical signals known as action potentials (APs) for rapid intercellular communication. In this paper, we consider a reaction diffusion model of individual AP signals to analyze APs from a communication and information theoretic perspective. We use concepts from molecular communication to explain the underlying process of information transfer in a plant for a single AP pulse that is shared with one or more receiver cells. We also use the chemical Langevin equation to accommodate the deterministic as well as stochastic component of the system. Finally we present an information theoretic analysis of single action potentials, obtaining achievable information rates for these signals. We show that, in general, the presence of an AP signal can increase the mutual information and information propagation speed among neighboring cells with receivers in different settings.
Submitted 8 November, 2018;
originally announced November 2018.
-
Optimizing the MIMO Cellular Downlink: Multiplexing, Diversity, or Interference Nulling?
Authors:
Kianoush Hosseini,
Caiyi Zhu,
Ahmad Khan,
Raviraj S. Adve,
Wei Yu
Abstract:
A base-station (BS) equipped with multiple antennas can use its spatial dimensions in three different ways: (1) to serve multiple users, thereby achieving a multiplexing gain, (2) to provide spatial diversity in order to improve user rates and (3) to null interference in neighboring cells. This paper answers the following question: What is the optimal balance between these three competing benefits? We answer this question in the context of the downlink of a cellular network, where multi-antenna BSs serve multiple single-antenna users using zero-forcing beamforming with equal power assignment, while nulling interference at a subset of out-of-cell users. Any remaining spatial dimensions provide transmit diversity for the scheduled users. Utilizing tools from stochastic geometry, we show that, surprisingly, to maximize the per-BS ergodic sum rate, with an optimal allocation of spatial resources, interference nulling does not provide a tangible benefit. The strategy of avoiding inter-cell interference nulling, reserving some fraction of spatial resources for multiplexing and using the rest to provide diversity, is already close-to-optimal in terms of the sum-rate. However, interference nulling does bring significant benefit to cell-edge users, particularly when adopting a range-adaptive nulling strategy where the size of the cooperating BS cluster is increased for cell-edge users.
Submitted 21 August, 2018;
originally announced August 2018.
-
Multifunction Cognitive Radar Task Scheduling Using Monte Carlo Tree Search and Policy Networks
Authors:
Mahdi Shaghaghi,
Raviraj S. Adve,
Zhen Ding
Abstract:
A modern radar may be designed to perform multiple functions, such as surveillance, tracking, and fire control. Each function requires the radar to execute a number of transmit-receive tasks. A radar resource management (RRM) module makes decisions on parameter selection, prioritization, and scheduling of such tasks. RRM becomes especially challenging in overload situations, where some tasks may need to be delayed or even dropped. In general, task scheduling is an NP-hard problem. In this work, we develop the branch-and-bound (B&B) method which obtains the optimal solution but at exponential computational complexity. On the other hand, heuristic methods have low complexity but provide relatively poor performance. We resort to machine learning-based techniques to address this issue; specifically we propose an approximate algorithm based on the Monte Carlo tree search method. Along with using bound and dominance rules to eliminate nodes from the search tree, we use a policy network to help to reduce the width of the search. Such a network can be trained using solutions obtained by running the B&B method offline on problems with feasible complexity. We show that the proposed method provides near-optimal performance, but with computational complexity orders of magnitude smaller than the B&B algorithm.
Submitted 18 May, 2018;
originally announced May 2018.
-
Characterizing Information Propagation in Plants
Authors:
Hamdan Awan,
Raviraj S. Adve,
Nigel Wallbridge,
Carrol Plummer,
Andrew W. Eckford
Abstract:
This paper considers an electro-chemical based communication model for intercellular communication in plants. Many plants, such as Mimosa pudica (the "sensitive plant"), employ electrochemical signals known as action potentials (APs) for communication purposes. In this paper we present a simple model for action potential generation. We make use of concepts from molecular communication to explain the underlying process of information transfer in a plant. Using information-theoretic analysis, we compute the mutual information between the input and output. The key aim is to study the variations in the information propagation speed for a varying number of plant cells in one simple case. Furthermore, we study the impact of the AP signal on the mutual information and information propagation speed. We aim to explore further how the growth rate in plants can impact the information transfer rate and vice versa.
Submitted 25 April, 2018;
originally announced May 2018.
-
21st Century Computer Architecture
Authors:
Mark D. Hill,
Sarita Adve,
Luis Ceze,
Mary Jane Irwin,
David Kaeli,
Margaret Martonosi,
Josep Torrellas,
Thomas F. Wenisch,
David Wood,
Katherine Yelick
Abstract:
Because most technology and computer architecture innovations were (intentionally) invisible to higher layers, application and other software developers could reap the benefits of this progress without engaging in it. Higher performance has both made more computationally demanding applications feasible (e.g., virtual assistants, computer vision) and made less demanding applications easier to develop by enabling higher-level programming abstractions (e.g., scripting languages and reusable components). Improvements in computer system cost-effectiveness enabled value creation that could never have been imagined by the field's founders (e.g., distributed web search sufficiently inexpensive so as to be covered by advertising links).
The wide benefits of computer performance growth are clear. Recently, Danowitz et al. apportioned computer performance growth roughly equally between technology and architecture, with architecture credited with ~80x improvement since 1985. As semiconductor technology approaches its "end-of-the-road" (see below), computer architecture will need to play an increasing role in enabling future ICT innovation. But instead of asking, "How can I make my chip run faster?," architects must now ask, "How can I enable the 21st century infrastructure, from sensors to clouds, adding value from performance to privacy, but without the benefit of near-perfect technology scaling?". The challenges are many, but with appropriate investment, opportunities abound. Underlying these opportunities is a common theme that future architecture innovations will require the engagement of and investments from innovators in other ICT layers.
Submitted 21 September, 2016;
originally announced September 2016.
-
Analyzing Dependent Placements of Small Cells in a Two-Layer Heterogeneous Network with a Rate Coverage Constraint
Authors:
S. Alireza Banani,
Andrew W. Eckford,
Raviraj S. Adve
Abstract:
We consider the downlink of a two-layer heterogeneous network, comprising macro cells (MCs) and small cells (SCs). The existing literature generally assumes independent placements of the access points (APs) in different layers; in contrast, we analyze a dependent placement where SC APs are placed at locations with poor service from the MC layer. Our goal is to obtain an estimate of the number of SCs required to maintain a target outage rate. Such an analysis is trivial if the MCs are located according to a Poisson point process (PPP), which provides a lower bound on performance. Here, we consider MCs placed on a hexagonal grid, which complements the PPP model by providing an upper bound on performance. We first provide accurate bounds for the average interference within a MC when SCs are not used. Then, by obtaining the outage areas, we estimate the number of SCs required within an MC to overcome outage. If resource allocation amongst SCs is not used, we show that the problem of outage is not solved completely, and the residual outage area depends on whether co-channel or orthogonal SCs are used. Simulations show that a much smaller residual outage area is obtained with orthogonal SCs.
Submitted 20 September, 2016;
originally announced September 2016.
-
A Stochastic Analysis of Network MIMO Systems
Authors:
Kianoush Hosseini,
Wei Yu,
Raviraj S. Adve
Abstract:
This paper quantifies the benefits and limitations of cooperative communications by providing a statistical analysis of the downlink in network multiple-input multiple-output (MIMO) systems. We consider an idealized model where the multiple-antenna base-stations (BSs) are distributed according to a homogeneous Poisson point process and cooperate by forming disjoint clusters. We assume that perfect channel state information (CSI) is available at the cooperating BSs without any overhead. Multiple single-antenna users are served using zero-forcing beamforming with equal power allocation across the beams. For such a system, we obtain tractable, but accurate, approximations of the signal power and inter-cluster interference power distributions and derive a computationally efficient expression for the achievable per-BS ergodic sum rate using tools from stochastic geometry. This expression allows us to obtain the optimal loading factor, i.e., the ratio between the number of scheduled users and the number of BS antennas, that maximizes the per-BS ergodic sum rate. Further, it allows us to quantify the performance improvement of network MIMO systems as a function of the cooperating cluster size. We show that to perform zero-forcing across the distributed set of BSs within the cluster, the network MIMO system introduces a penalty in received signal power. Along with the inevitable out-of-cluster interference, we show that the per-BS ergodic sum rate of a network MIMO system does not approach that of an isolated cell even at unrealistically large cluster sizes. Nevertheless, network MIMO does provide significant rate improvement as compared to uncoordinated single-cell processing even at relatively modest cluster sizes.
Submitted 5 April, 2016;
originally announced April 2016.
-
Analyzing the Impact of Access Point Density on the Performance of Finite-Area Networks
Authors:
S. Alireza Banani,
Andrew W. Eckford,
Raviraj S. Adve
Abstract:
Assuming a network of infinite extent, several researchers have analyzed small-cell networks using a Poisson point process (PPP) location model, leading to simple analytic expressions. The general assumption has been that these results apply to finite-area networks as well. However, do the results of infinite-area networks apply to finite-area networks? In this paper, we answer this question by obtaining an accurate approximation for the achievable signal-to-interference-plus-noise ratio (SINR) and user capacity in the downlink of a finite-area network with a fixed number of access points (APs). The APs are uniformly distributed within the area of interest. Our analysis shows that, crucially, the results of infinite-area networks are very different from those for finite-area networks of low-to-medium AP density. Comprehensive simulations are used to illustrate the accuracy of our analysis. For practical values of signal transmit powers and AP densities, the analytic expressions capture the behavior of the system well. As an added benefit, the formulations developed here can be used in parametric studies for network design. Here, the analysis is used to obtain the required number of APs to guarantee a desired target capacity in a finite-area network.
Submitted 10 December, 2015;
originally announced December 2015.
-
Analyzing the Impact of Inter Cooperation Region Interference in Coordinated Multi-Point Uplink Networks
Authors:
S. Alireza Banani,
Raviraj S. Adve
Abstract:
We analyze the uplink of coordinated multi-point (CoMP) networks in which cooperation can be amongst 2 or 3 base stations (BSs). We consider a 2D network of BSs on a regular hexagonal lattice wherein the cooperation tessellates the 2D plane into cooperation regions (CRs); specifically, we analyze the impact of the interference between the CRs in the network. Our model accounts for realistic propagation conditions, particularly including shadowing. We obtain accurate, closed-form approximations for the user capacity coverage probability (CCP) and the ergodic capacity at each point within the CR. To provide a network-level analysis, we focus on the locations within each CR with the minimum CCP, the worst-case point(s). The worst-case CCP and/or ergodic capacity can be used in parametric studies for network design. Here, the analysis is applied to obtain the relationship between cell size and CCP and, thereby, the required density of BSs to achieve a chosen target capacity coverage. The analysis also allows for a comparison between different orders of BS cooperation, quantifying the reduced required BS density from higher orders of cooperation. Comprehensive simulations are used to illustrate the accuracy of our analysis, including the approximations used for analytic tractability.
Submitted 19 January, 2015;
originally announced January 2015.
-
Handoff Rate and Coverage Analysis in Multi-tier Heterogeneous Networks
Authors:
Sanam Sadr,
Raviraj S. Adve
Abstract:
This paper analyzes the impact of user mobility in multi-tier heterogeneous networks. We begin by obtaining the handoff rate for a mobile user in an irregular cellular network with the access point locations modeled as a homogeneous Poisson point process. The received signal-to-interference-ratio (SIR) distribution along with a chosen SIR threshold is then used to obtain the probability of coverage. To capture potential connection failures due to mobility, we assume that a fraction of handoffs result in such failures. Considering a multi-tier network with orthogonal spectrum allocation among tiers and the maximum biased average received power as the tier association metric, we derive the probability of coverage for two cases: 1) the user is stationary (i.e., handoffs do not occur, or the system is not sensitive to handoffs); 2) the user is mobile, and the system is sensitive to handoffs. We derive the optimal bias factors to maximize the coverage. We show that when the user is mobile, and the network is sensitive to handoffs, both the optimum tier association and the probability of coverage depend on the user's speed; a speed-dependent bias factor can then adjust the tier association to effectively improve the coverage, and hence system performance, in a fully-loaded network.
Submitted 7 January, 2015;
originally announced January 2015.
-
Partially-Distributed Resource Allocation in Small-Cell Networks
Authors:
Sanam Sadr,
Raviraj S. Adve
Abstract:
We propose a four-stage hierarchical resource allocation scheme for the downlink of a large-scale small-cell network in the context of orthogonal frequency-division multiple access (OFDMA). Since interference limits the capabilities of such networks, resource allocation and interference management are crucial. However, obtaining the globally optimum resource allocation is exponentially complex and mathematically intractable. Here, we develop a partially decentralized algorithm to obtain an effective solution. The three major advantages of our work are: 1) as opposed to a fixed resource allocation, we consider load demand at each access point (AP) when allocating spectrum; 2) to prevent overloaded APs, our scheme is dynamic in the sense that as the users move from one AP to the other, so do the allocated resources, if necessary, and such considerations generally result in huge computational complexity, which brings us to the third advantage: 3) we tackle complexity by introducing a hierarchical scheme comprising four phases: user association, load estimation, interference management via graph coloring, and scheduling. We provide mathematical analysis for the first three steps modeling the user and AP locations as Poisson point processes. Finally, we provide results of numerical simulations to illustrate the efficacy of our scheme.
Submitted 16 August, 2014;
originally announced August 2014.
-
Tier Association Probability and Spectrum Partitioning for Maximum Rate Coverage in Multi-tier Heterogeneous Networks
Authors:
Sanam Sadr,
Raviraj S. Adve
Abstract:
For a wireless multi-tier heterogeneous network with orthogonal spectrum allocation across tiers, we optimize the association probability and the fraction of spectrum allocated to each tier so as to maximize rate coverage. In practice, the association probability can be controlled using a biased received signal power. The optimization problem is non-convex and we are forced to explore locally optimal solutions. We make two contributions in this paper: first, we show that there exists a relation between the first derivatives of the objective function with respect to each of the optimization variables. This can be used to simplify numerical solutions to the optimization problem. Second, we explore the optimality of the intuitive solution that the fraction of spectrum allocated to each tier should be equal to the tier association probability. We show that, in this case, a closed-form solution exists. Importantly, our numerical results show that there is essentially zero performance loss. The results also illustrate the significant gains possible by jointly optimizing the user association and the resource allocation.
Submitted 16 August, 2014;
originally announced August 2014.
-
Large-Scale MIMO versus Network MIMO for Multicell Interference Mitigation
Authors:
Kianoush Hosseini,
Wei Yu,
Raviraj S. Adve
Abstract:
This paper compares two important downlink multicell interference mitigation techniques, namely, large-scale (LS) multiple-input multiple-output (MIMO) and network MIMO. We consider a cooperative wireless cellular system operating in time-division duplex (TDD) mode, wherein each cooperating cluster includes $B$ base-stations (BSs), each equipped with multiple antennas and scheduling $K$ single-antenna users. In an LS-MIMO system, each BS employs $BM$ antennas not only to serve its scheduled users, but also to null out interference caused to the other users within the cooperating cluster using zero-forcing (ZF) beamforming. In a network MIMO system, each BS is equipped with only $M$ antennas, but interference cancellation is realized by data and channel state information exchange over the backhaul links and joint downlink transmission using ZF beamforming. Both systems are able to completely eliminate intra-cluster interference and to provide the same number of spatial degrees of freedom per user. Assuming the uplink-downlink channel reciprocity provided by TDD, both systems are subject to identical channel acquisition overhead during the uplink pilot transmission stage. Further, the available sum power at each cluster is fixed and assumed to be equally distributed across the downlink beams in both systems. Building upon the channel distribution functions and using tools from stochastic ordering, this paper shows, however, that from a performance point of view, users experience better quality of service, averaged over small-scale fading, under an LS-MIMO system than under a network MIMO system. Numerical simulations for a multicell network reveal that this conclusion also holds true with a regularized ZF beamforming scheme. Hence, given the likely lower cost of adding an excess number of antennas at each BS, LS-MIMO could be the preferred route toward interference mitigation in cellular networks.
Submitted 22 July, 2014;
originally announced July 2014.
-
Required Base Station Density in Coordinated Multi-Point Uplink with Rate Constraints
Authors:
S. Alireza Banani,
Raviraj S. Adve
Abstract:
In this paper we obtain the required spatial density of base stations (BSs) in a coordinated multi-point uplink cellular network to meet a chosen quality of service metric. Our model assumes cooperation amongst two BSs and the required density is obtained under shadowing and Rayleigh fading for different LTE-A path loss models. The proposed approach guarantees that the worst-case achievable rate in the entire coverage region is above a target rate with chosen probability. Two models for the position of the BSs are considered: a hexagonal grid and a Poisson point process (PPP) modified to set a minimum cell size. First, for each cooperation region, the location with the minimum rate coverage probability - the worst-case point - is determined. Next, accurate closed-form approximations are obtained for the worst-case rate coverage probability. The approximations presented are useful for the quick assessment of network performance and can be utilized in parametric studies for network design. Here, they are applied to obtain the required density of BSs to achieve a target rate coverage probability. As an added benefit, the formulation here quantifies the penalty in moving from a regular BS deployment (the grid model) to a random BS deployment (the PPP model).
Submitted 27 September, 2013; v1 submitted 6 February, 2013;
originally announced February 2013.
-
Resource Allocation for Selection-Based Cooperative OFDM Networks
Authors:
Kianoush Hosseini,
Raviraj S. Adve
Abstract:
This paper considers resource allocation to achieve max-min fairness in a selection-based orthogonal frequency division multiplexing network wherein source nodes are assisted by fixed decode-and-forward relays. The joint problem of transmission strategy selection, relay assignment, and power allocation is a combinatorial problem with exponential complexity. To develop effective solutions to these questions, we approach these problems in two stages. The first set of problems assumes ideal source-relay channels; this simplification helps illustrate our general methodology and also why our solutions provide tight bounds. We then formulate the general problem of transmission strategy selection, relay assignment, and power allocation at the sources and relays considering all communication channels, i.e., finite power source-relay channels. In both sets of problems mentioned so far, transmissions over subcarriers are assumed to be independent. However, given the attendant problems of synchronization and the implementation using an FFT/IFFT pair, resource allocation at the subcarrier level appears impractical. We, therefore, consider resource allocation at the level of an entire OFDM block. While optimal resource management requires an exhaustive search, we develop tight bounds with lower complexity. Finally, we propose a decentralized block-based relaying scheme. Simulation results using the COST-231 channel model show that this scheme yields close-to-optimal performance while offering many computational benefits.
Submitted 13 December, 2011; v1 submitted 29 April, 2011;
originally announced April 2011.
-
Molecular communication in fluid media: The additive inverse Gaussian noise channel
Authors:
K. V. Srinivas,
Raviraj S. Adve,
Andrew W. Eckford
Abstract:
We consider molecular communication, with information conveyed in the time of release of molecules. The main contribution of this paper is the development of a theoretical foundation for such a communication system. Specifically, we develop the additive inverse Gaussian (IG) noise channel model: a channel in which the information is corrupted by noise with an inverse Gaussian distribution. We show that such a channel model is appropriate for molecular communication in fluid media - when propagation between transmitter and receiver is governed by Brownian motion and when there is positive drift from transmitter to receiver. Taking advantage of the available literature on the IG distribution, upper and lower bounds on channel capacity are developed, and a maximum likelihood receiver is derived. Theory and simulation results are presented which show that such a channel does not have a single quality measure analogous to signal-to-noise ratio in the AWGN channel. It is also shown that the use of multiple molecules leads to reduced error rate in a manner akin to diversity order in wireless communications. Finally, we discuss some open problems in molecular communications that arise from the IG system model.
Submitted 9 December, 2010; v1 submitted 30 November, 2010;
originally announced December 2010.
-
Molecular Communication Using Brownian Motion with Drift
Authors:
Sachin Kadloor,
Raviraj S. Adve,
Andrew W. Eckford
Abstract:
Inspired by biological communication systems, molecular communication has been proposed as a viable scheme to communicate between nano-sized devices separated by a very short distance. Here, molecules are released by the transmitter into the medium and are then sensed by the receiver. This paper develops a preliminary version of such a communication system focusing on the release of either one or two molecules into a fluid medium with drift. We analyze the mutual information between the transmitter and the receiver when information is encoded in the time of release of the molecule. Simplifying assumptions are required in order to calculate the mutual information, and theoretical results are provided to show that these calculations are upper bounds on the true mutual information. Furthermore, optimized degree distributions are provided, which suggest transmission strategies for a variety of drift velocities.
Submitted 23 July, 2011; v1 submitted 17 June, 2010;
originally announced June 2010.
-
Energy Optimization across Training and Data for Multiuser Minimum Sum-MSE Linear Precoding
Authors:
Adam J. Tenenbaum,
Raviraj S. Adve
Abstract:
This paper considers minimum sum mean-squared error (sum-MSE) linear transceiver designs in multiuser downlink systems with imperfect channel state information. Specifically, we derive the optimal energy allocations for training and data phases for such a system. Under MMSE estimation of uncorrelated Rayleigh block fading channels with equal average powers, we prove the separability of the energy allocation and transceiver design optimization problems. A closed-form optimum energy allocation is derived and applied to existing transceiver designs. Analysis and simulation results demonstrate the improvements that can be realized with the proposed design.
Submitted 18 January, 2010;
originally announced January 2010.
-
Minimizing Sum-MSE Implies Identical Downlink and Dual Uplink Power Allocations
Authors:
Adam J. Tenenbaum,
Raviraj S. Adve
Abstract:
In the multiuser downlink, power allocation for linear precoders that minimize the sum of mean squared errors under a sum power constraint is a non-convex problem. Many existing algorithms solve an equivalent convex problem in the virtual uplink and apply a transformation based on uplink-downlink duality to find a downlink solution. In this letter, we analyze the optimality criteria for the power allocation subproblem in the virtual uplink, and demonstrate that the optimal solution leads to identical power allocations in the downlink and virtual uplink. We thus extend the known duality results and, importantly, simplify the existing algorithms used for iterative transceiver design.
Submitted 19 July, 2010; v1 submitted 17 December, 2009;
originally announced December 2009.
-
Linear Processing and Sum Throughput in the Multiuser MIMO Downlink
Authors:
Adam J. Tenenbaum,
Raviraj S. Adve
Abstract:
We consider linear precoding and decoding in the downlink of a multiuser multiple-input, multiple-output (MIMO) system, wherein each user may receive more than one data stream. We propose several mean squared error (MSE) based criteria for joint transmit-receive optimization and establish a series of relationships linking these criteria to the signal-to-interference-plus-noise ratios of individual data streams and the information theoretic channel capacity under linear minimum MSE decoding. In particular, we show that achieving the maximum sum throughput is equivalent to minimizing the product of MSE matrix determinants (PDetMSE). Since the PDetMSE minimization problem does not admit a computationally efficient solution, a simplified scalar version of the problem is considered that minimizes the product of mean squared errors (PMSE). An iterative algorithm is proposed to solve the PMSE problem, and is shown to provide near-optimal performance with greatly reduced computational complexity. Our simulations compare the achievable sum rates under linear precoding strategies to the sum capacity for the broadcast channel.
Submitted 10 February, 2009; v1 submitted 10 November, 2008;
originally announced November 2008.
-
Sum Rate Maximization using Linear Precoding and Decoding in the Multiuser MIMO Downlink
Authors:
Adam J. Tenenbaum,
Raviraj S. Adve
Abstract:
We propose an algorithm to maximize the instantaneous sum data rate transmitted by a base station in the downlink of a multiuser multiple-input, multiple-output system. The transmitter and the receivers may each be equipped with multiple antennas and each user may receive more than one data stream. We show that maximizing the sum rate is closely linked to minimizing the product of mean squared errors (PMSE). The algorithm employs an uplink/downlink duality to iteratively design transmit-receive linear precoders, decoders, and power allocations that minimize the PMSE for all data streams under a sum power constraint. Numerical simulations illustrate the effectiveness of the algorithm and support the use of the PMSE criterion in maximizing the overall instantaneous data rate.
Submitted 2 January, 2008;
originally announced January 2008.