-
Enhancing Training Efficiency Using Packing with Flash Attention
Authors:
Achintya Kundu,
Rhui Dih Lee,
Laura Wynter,
Raghu Kiran Ganti
Abstract:
Padding is often used in tuning LLM models by adding special tokens to shorter training examples to match the length of the longest sequence in each batch. While this ensures uniformity for batch processing, it introduces inefficiencies by including irrelevant padding tokens in the computation and wastes GPU resources. On the other hand, the Hugging Face SFT trainer offers the option to use packin…
▽ More
Padding is often used in tuning LLM models by adding special tokens to shorter training examples to match the length of the longest sequence in each batch. While this ensures uniformity for batch processing, it introduces inefficiencies by including irrelevant padding tokens in the computation and wastes GPU resources. On the other hand, the Hugging Face SFT trainer offers the option to use packing to combine multiple training examples up to the maximum sequence length. This allows for maximal utilization of GPU resources. However, without proper masking of each packed training example, attention will not be computed correctly when using SFT trainer. We enable and then analyse packing and Flash Attention with proper attention masking of each example and show the benefits of this training paradigm.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Centimeter Positioning Accuracy using AI/ML for 6G Applications
Authors:
Sai Prasanth Kotturi,
Radha Krishna Ganti
Abstract:
This research looks at using AI/ML to achieve centimeter-level user positioning in 6G applications such as the Industrial Internet of Things (IIoT). Initial results show that our AI/ML-based method can estimate user positions with an accuracy of 17 cm in an indoor factory environment. In this proposal, we highlight our approaches and future directions.
This research looks at using AI/ML to achieve centimeter-level user positioning in 6G applications such as the Industrial Internet of Things (IIoT). Initial results show that our AI/ML-based method can estimate user positions with an accuracy of 17 cm in an indoor factory environment. In this proposal, we highlight our approaches and future directions.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Accelerating Production LLMs with Combined Token/Embedding Speculators
Authors:
Davis Wertheimer,
Joshua Rosenkranz,
Thomas Parnell,
Sahil Suneja,
Pavithra Ranganathan,
Raghu Ganti,
Mudhakar Srivatsa
Abstract:
This technical report describes the design and training of novel speculative decoding draft models, for accelerating the inference speeds of large language models in a production environment. By conditioning draft predictions on both context vectors and sampled tokens, we can train our speculators to efficiently predict high-quality n-grams, which the base model then accepts or rejects. This allow…
▽ More
This technical report describes the design and training of novel speculative decoding draft models, for accelerating the inference speeds of large language models in a production environment. By conditioning draft predictions on both context vectors and sampled tokens, we can train our speculators to efficiently predict high-quality n-grams, which the base model then accepts or rejects. This allows us to effectively predict multiple tokens per inference forward pass, accelerating wall-clock inference speeds of highly optimized base model implementations by a factor of 2-3x. We explore these initial results and describe next steps for further improvements.
△ Less
Submitted 6 June, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
UCINet0: A Machine Learning based Receiver for 5G NR PUCCH Format 0
Authors:
Anil Kumar Yerrapragada,
Jeeva Keshav Sattianarayanin,
Radha Krishna Ganti
Abstract:
Accurate decoding of Uplink Control Information (UCI) on the Physical Uplink Control Channel (PUCCH) is essential for enabling 5G wireless links. This paper explores an AI/ML-based receiver design for PUCCH Format 0. Format 0 signaling encodes the UCI content within the phase of a known base waveform and even supports multiplexing of up to 12 users within the same time-frequency resources. Our fir…
▽ More
Accurate decoding of Uplink Control Information (UCI) on the Physical Uplink Control Channel (PUCCH) is essential for enabling 5G wireless links. This paper explores an AI/ML-based receiver design for PUCCH Format 0. Format 0 signaling encodes the UCI content within the phase of a known base waveform and even supports multiplexing of up to 12 users within the same time-frequency resources. Our first-of-a-kind neural network classifier, which we term UCINet0, is capable of predicting when no user is transmitting on the PUCCH, as well as decoding the UCI content of any number of multiplexed users, up to 12. Inference results with both simulated and hardware-captured field datasets show that the UCINet0 model outperforms conventional DFT-based decoders across all SNR ranges.
△ Less
Submitted 10 March, 2024;
originally announced April 2024.
-
Optimum Beamforming and Grating Lobe Mitigation for Intelligent Reflecting Surfaces
Authors:
Sai Sanjay Narayanan,
Uday K Khankhoje,
Radha Krishna Ganti
Abstract:
Ensuring adequate wireless coverage in upcoming communication technologies such as 6G is expected to be challenging. This is because user demands of higher datarate require an increase in carrier frequencies, which in turn reduce the diffraction effects (and hence coverage) in complex multipath environments. Intelligent reflecting surfaces have been proposed as a way of restoring coverage by adapt…
▽ More
Ensuring adequate wireless coverage in upcoming communication technologies such as 6G is expected to be challenging. This is because user demands of higher datarate require an increase in carrier frequencies, which in turn reduce the diffraction effects (and hence coverage) in complex multipath environments. Intelligent reflecting surfaces have been proposed as a way of restoring coverage by adaptively reflecting incoming electromagnetic waves in desired directions. This is accomplished by judiciously adding extra phases at different points on the surface. In practice, these extra phases are only available in discrete quantities due to hardware constraints. Computing these extra phases is computationally challenging when they can only be picked from a discrete distribution, and existing approaches for solving this problem were either heuristic or based on evolutionary algorithms. We solve this problem by proposing fast algorithms with provably optimal solutions. Our algorithms have linear complexity, and are presented with rigorous proofs for their optimality. We show that the proposed algorithms exhibit better performance. We analyze situations when unwanted grating lobes arise in the radiation pattern, and discuss mitigation strategies, such as the use of triangular lattices and prephasing techniques, to eliminate them. We also demonstrate how our algorithms can leverage these techniques to deliver optimum beamforming solutions.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
TP-Aware Dequantization
Authors:
Adnan Hoque,
Mudhakar Srivatsa,
Chih-Chieh Yang,
Raghu Ganti
Abstract:
In this paper, we present a novel method that reduces model inference latency during distributed deployment of Large Language Models (LLMs). Our contribution is an optimized inference deployment scheme that address the current limitations of state-of-the-art quantization kernels when used in conjunction with Tensor Parallel (TP). Our method preserves data locality in GPU memory access patterns and…
▽ More
In this paper, we present a novel method that reduces model inference latency during distributed deployment of Large Language Models (LLMs). Our contribution is an optimized inference deployment scheme that address the current limitations of state-of-the-art quantization kernels when used in conjunction with Tensor Parallel (TP). Our method preserves data locality in GPU memory access patterns and exploits a priori knowledge of TP to reduce global communication. We demonstrate an up to 1.81x speedup over existing methods for Llama-70B and up to 1.78x speedup for IBM WatsonX's Granite-20B MLP layer problem sizes on A100 and H100 NVIDIA DGX Systems for a variety of TP settings.
△ Less
Submitted 15 January, 2024;
originally announced February 2024.
-
SudokuSens: Enhancing Deep Learning Robustness for IoT Sensing Applications using a Generative Approach
Authors:
Tianshi Wang,
**yang Li,
Ruijie Wang,
Denizhan Kara,
Shengzhong Liu,
Davis Wertheimer,
Antoni Viros-i-Martin,
Raghu Ganti,
Mudhakar Srivatsa,
Tarek Abdelzaher
Abstract:
This paper introduces SudokuSens, a generative framework for automated generation of training data in machine-learning-based Internet-of-Things (IoT) applications, such that the generated synthetic data mimic experimental configurations not encountered during actual sensor data collection. The framework improves the robustness of resulting deep learning models, and is intended for IoT applications…
▽ More
This paper introduces SudokuSens, a generative framework for automated generation of training data in machine-learning-based Internet-of-Things (IoT) applications, such that the generated synthetic data mimic experimental configurations not encountered during actual sensor data collection. The framework improves the robustness of resulting deep learning models, and is intended for IoT applications where data collection is expensive. The work is motivated by the fact that IoT time-series data entangle the signatures of observed objects with the confounding intrinsic properties of the surrounding environment and the dynamic environmental disturbances experienced. To incorporate sufficient diversity into the IoT training data, one therefore needs to consider a combinatorial explosion of training cases that are multiplicative in the number of objects considered and the possible environmental conditions in which such objects may be encountered. Our framework substantially reduces these multiplicative training needs. To decouple object signatures from environmental conditions, we employ a Conditional Variational Autoencoder (CVAE) that allows us to reduce data collection needs from multiplicative to (nearly) linear, while synthetically generating (data for) the missing conditions. To obtain robustness with respect to dynamic disturbances, a session-aware temporal contrastive learning approach is taken. Integrating the aforementioned two approaches, SudokuSens significantly improves the robustness of deep learning for IoT applications. We explore the degree to which SudokuSens benefits downstream inference tasks in different data sets and discuss conditions under which the approach is particularly effective.
△ Less
Submitted 8 February, 2024; v1 submitted 3 February, 2024;
originally announced February 2024.
-
Accelerating a Triton Fused Kernel for W4A16 Quantized Inference with SplitK work decomposition
Authors:
Adnan Hoque,
Less Wright,
Chih-Chieh Yang,
Mudhakar Srivatsa,
Raghu Ganti
Abstract:
We propose an implementation of an efficient fused matrix multiplication kernel for W4A16 quantized inference, where we perform dequantization and GEMM in a fused kernel using a SplitK work decomposition. Our implementation shows improvement for the type of skinny matrix-matrix multiplications found in foundation model inference workloads. In particular, this paper surveys the type of matrix multi…
▽ More
We propose an implementation of an efficient fused matrix multiplication kernel for W4A16 quantized inference, where we perform dequantization and GEMM in a fused kernel using a SplitK work decomposition. Our implementation shows improvement for the type of skinny matrix-matrix multiplications found in foundation model inference workloads. In particular, this paper surveys the type of matrix multiplication between a skinny activation matrix and a square weight matrix. Our results show an average of 65% speed improvement on A100, and an average of 124% speed improvement on H100 (with a peak of 295%) for a range of matrix dimensions including those found in a llama-style model, where m < n = k.
△ Less
Submitted 22 February, 2024; v1 submitted 5 January, 2024;
originally announced February 2024.
-
Enhancements for 5G NR PRACH Reception: An AI/ML Approach
Authors:
Rohit Singh,
Anil Kumar Yerrapragada,
Jeeva Keshav S,
Radha Krishna Ganti
Abstract:
Random Access is an important step in enabling the initial attachment of a User Equipment (UE) to a Base Station (gNB). The UE identifies itself by embedding a Preamble Index (RAPID) in the phase rotation of a known base sequence, which it transmits on the Physical Random Access Channel (PRACH). The signal on the PRACH also enables the estimation of propagation delay, often known as Timing Advance…
▽ More
Random Access is an important step in enabling the initial attachment of a User Equipment (UE) to a Base Station (gNB). The UE identifies itself by embedding a Preamble Index (RAPID) in the phase rotation of a known base sequence, which it transmits on the Physical Random Access Channel (PRACH). The signal on the PRACH also enables the estimation of propagation delay, often known as Timing Advance (TA), which is induced by virtue of the UE's position. Traditional receivers estimate the RAPID and TA using correlation-based techniques. This paper presents an alternative receiver approach that uses AI/ML models, wherein two neural networks are proposed, one for the RAPID and one for the TA. Different from other works, these two models can run in parallel as opposed to sequentially. Experiments with both simulated data and over-the-air hardware captures highlight the improved performance of the proposed AI/ML-based techniques compared to conventional correlation methods.
△ Less
Submitted 12 January, 2024;
originally announced January 2024.
-
Foundation Models for Generalist Geospatial Artificial Intelligence
Authors:
Johannes Jakubik,
Sujit Roy,
C. E. Phillips,
Paolo Fraccaro,
Denys Godwin,
Bianca Zadrozny,
Daniela Szwarcman,
Carlos Gomes,
Gabby Nyirjesy,
Blair Edwards,
Daiki Kimura,
Naomi Simumba,
Linsong Chu,
S. Karthik Mukkavilli,
Devyani Lambhate,
Kamal Das,
Ran**i Bangalore,
Dario Oliveira,
Michal Muszynski,
Kumar Ankur,
Muthukumaran Ramasubramanian,
Iksha Gurung,
Sam Khallaghi,
Hanxi,
Li
, et al. (8 additional authors not shown)
Abstract:
Significant progress in the development of highly adaptable and reusable Artificial Intelligence (AI) models is expected to have a significant impact on Earth science and remote sensing. Foundation models are pre-trained on large unlabeled datasets through self-supervision, and then fine-tuned for various downstream tasks with small labeled datasets. This paper introduces a first-of-a-kind framewo…
▽ More
Significant progress in the development of highly adaptable and reusable Artificial Intelligence (AI) models is expected to have a significant impact on Earth science and remote sensing. Foundation models are pre-trained on large unlabeled datasets through self-supervision, and then fine-tuned for various downstream tasks with small labeled datasets. This paper introduces a first-of-a-kind framework for the efficient pre-training and fine-tuning of foundational models on extensive geospatial data. We have utilized this framework to create Prithvi, a transformer-based geospatial foundational model pre-trained on more than 1TB of multispectral satellite imagery from the Harmonized Landsat-Sentinel 2 (HLS) dataset. Our study demonstrates the efficacy of our framework in successfully fine-tuning Prithvi to a range of Earth observation tasks that have not been tackled by previous work on foundation models involving multi-temporal cloud gap imputation, flood map**, wildfire scar segmentation, and multi-temporal crop segmentation. Our experiments show that the pre-trained model accelerates the fine-tuning process compared to leveraging randomly initialized weights. In addition, pre-trained Prithvi compares well against the state-of-the-art, e.g., outperforming a conditional GAN model in multi-temporal cloud imputation by up to 5pp (or 5.7%) in the structural similarity index. Finally, due to the limited availability of labeled data in the field of Earth observation, we gradually reduce the quantity of available labeled data for refining the model to evaluate data efficiency and demonstrate that data can be decreased significantly without affecting the model's accuracy. The pre-trained 100 million parameter model and corresponding fine-tuning workflows have been released publicly as open source contributions to the global Earth sciences community through Hugging Face.
△ Less
Submitted 8 November, 2023; v1 submitted 28 October, 2023;
originally announced October 2023.
-
AI Foundation Models for Weather and Climate: Applications, Design, and Implementation
Authors:
S. Karthik Mukkavilli,
Daniel Salles Civitarese,
Johannes Schmude,
Johannes Jakubik,
Anne Jones,
Nam Nguyen,
Christopher Phillips,
Sujit Roy,
Shraddha Singh,
Campbell Watson,
Raghu Ganti,
Hendrik Hamann,
Udaysankar Nair,
Rahul Ramachandran,
Kommy Weldemariam
Abstract:
Machine learning and deep learning methods have been widely explored in understanding the chaotic behavior of the atmosphere and furthering weather forecasting. There has been increasing interest from technology companies, government institutions, and meteorological agencies in building digital twins of the Earth. Recent approaches using transformers, physics-informed machine learning, and graph n…
▽ More
Machine learning and deep learning methods have been widely explored in understanding the chaotic behavior of the atmosphere and furthering weather forecasting. There has been increasing interest from technology companies, government institutions, and meteorological agencies in building digital twins of the Earth. Recent approaches using transformers, physics-informed machine learning, and graph neural networks have demonstrated state-of-the-art performance on relatively narrow spatiotemporal scales and specific tasks. With the recent success of generative artificial intelligence (AI) using pre-trained transformers for language modeling and vision with prompt engineering and fine-tuning, we are now moving towards generalizable AI. In particular, we are witnessing the rise of AI foundation models that can perform competitively on multiple domain-specific downstream tasks. Despite this progress, we are still in the nascent stages of a generalizable AI model for global Earth system models, regional climate models, and mesoscale weather models. Here, we review current state-of-the-art AI approaches, primarily from transformer and operator learning literature in the context of meteorology. We provide our perspective on criteria for success towards a family of foundation models for nowcasting and forecasting weather and climate predictions. We also discuss how such models can perform competitively on downstream tasks such as downscaling (super-resolution), identifying conditions conducive to the occurrence of wildfires, and predicting consequential meteorological phenomena across various spatiotemporal scales such as hurricanes and atmospheric rivers. In particular, we examine current AI methodologies and contend they have matured enough to design and implement a weather foundation model.
△ Less
Submitted 19 September, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Pilotless Uplink for Massive MIMO Systems
Authors:
P Aswathylakshmi,
Radha Krishna Ganti
Abstract:
Massive MIMO OFDM waveforms help support a large number of users in the same time-frequency resource and also provide significant array gain for uplink reception in cellular systems. However, channel estimation in such large antenna systems can be tricky as pilot assignment for multiple users becomes more challenging with increasing number of users. Additionally, the pilot overhead especially for…
▽ More
Massive MIMO OFDM waveforms help support a large number of users in the same time-frequency resource and also provide significant array gain for uplink reception in cellular systems. However, channel estimation in such large antenna systems can be tricky as pilot assignment for multiple users becomes more challenging with increasing number of users. Additionally, the pilot overhead especially for wideband rapidly changing channels can diminish the system throughput quite significantly. In this paper, we propose an iterative matrix decomposition algorithm for the blind demodulation of massive MIMO OFDM signals without using any pilots. This new decomposition technique provides estimates of both the user symbols and the user channel in the frequency domain simultaneously (to a scaling factor) without any pilots. We discuss methods for finding the appropriate initial points for the algorithm that ensure its convergence in different types of wireless channels. We also propose new methods for resolving the scaling factor in the estimated signal that do not increase pilot overhead. We show how the method can be adapted to both single-user and multi-user systems. Simulation results demonstrate that the lack of pilots does not affect the error performance of the proposed algorithm when compared to the conventional pilot-based channel estimation and equalization methods across a wide range of channels for both single and multi-user cases. We also demonstrate techniques to reduce the complexity of the estimation algorithm over multiple OFDM symbols in a 5G MIMO system by leveraging the temporal correlations in the channel.
△ Less
Submitted 3 June, 2024; v1 submitted 21 May, 2023;
originally announced May 2023.
-
Beyond Single Items: Exploring User Preferences in Item Sets with the Conversational Playlist Curation Dataset
Authors:
Arun Tejasvi Chaganty,
Megan Leszczynski,
Shu Zhang,
Ravi Ganti,
Krisztian Balog,
Filip Radlinski
Abstract:
Users in consumption domains, like music, are often able to more efficiently provide preferences over a set of items (e.g. a playlist or radio) than over single items (e.g. songs). Unfortunately, this is an underexplored area of research, with most existing recommendation systems limited to understanding preferences over single items. Curating an item set exponentiates the search space that recomm…
▽ More
Users in consumption domains, like music, are often able to more efficiently provide preferences over a set of items (e.g. a playlist or radio) than over single items (e.g. songs). Unfortunately, this is an underexplored area of research, with most existing recommendation systems limited to understanding preferences over single items. Curating an item set exponentiates the search space that recommender systems must consider (all subsets of items!): this motivates conversational approaches-where users explicitly state or refine their preferences and systems elicit preferences in natural language-as an efficient way to understand user needs. We call this task conversational item set curation and present a novel data collection methodology that efficiently collects realistic preferences about item sets in a conversational setting by observing both item-level and set-level feedback. We apply this methodology to music recommendation to build the Conversational Playlist Curation Dataset (CPCD), where we show that it leads raters to express preferences that would not be otherwise expressed. Finally, we propose a wide range of conversational retrieval models as baselines for this task and evaluate them on the dataset.
△ Less
Submitted 5 May, 2023; v1 submitted 12 March, 2023;
originally announced March 2023.
-
Talk the Walk: Synthetic Data Generation for Conversational Music Recommendation
Authors:
Megan Leszczynski,
Shu Zhang,
Ravi Ganti,
Krisztian Balog,
Filip Radlinski,
Fernando Pereira,
Arun Tejasvi Chaganty
Abstract:
Recommender systems are ubiquitous yet often difficult for users to control, and adjust if recommendation quality is poor. This has motivated conversational recommender systems (CRSs), with control provided through natural language feedback. However, as with most application domains, building robust CRSs requires training data that reflects system usage$\unicode{x2014}$here conversations with user…
▽ More
Recommender systems are ubiquitous yet often difficult for users to control, and adjust if recommendation quality is poor. This has motivated conversational recommender systems (CRSs), with control provided through natural language feedback. However, as with most application domains, building robust CRSs requires training data that reflects system usage$\unicode{x2014}$here conversations with user utterances paired with items that cover a wide range of preferences. This has proved challenging to collect scalably using conventional methods. We address the question of whether it can be generated synthetically, building on recent advances in natural language. We evaluate in the setting of item set recommendation, noting the increasing attention to this task motivated by use cases like music, news, and recipe recommendation. We present TalkTheWalk, which synthesizes realistic high-quality conversational data by leveraging domain expertise encoded in widely available curated item collections, generating a sequence of hypothetical yet plausible item sets, then using a language model to produce corresponding user utterances. We generate over one million diverse playlist curation conversations in the music domain, and show these contain consistent utterances with relevant item sets nearly matching the quality of an existing but small human-collected dataset for this task. We demonstrate the utility of the generated synthetic dataset on a conversational item retrieval task and show that it improves over both unsupervised baselines and systems trained on a real dataset.
△ Less
Submitted 17 November, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
MAQA: A Multimodal QA Benchmark for Negation
Authors:
Judith Yue Li,
Aren Jansen,
Qingqing Huang,
Joonseok Lee,
Ravi Ganti,
Dima Kuzmin
Abstract:
Multimodal learning can benefit from the representation power of pretrained Large Language Models (LLMs). However, state-of-the-art transformer based LLMs often ignore negations in natural language and there is no existing benchmark to quantitatively evaluate whether multimodal transformers inherit this weakness. In this study, we present a new multimodal question answering (QA) benchmark adapted…
▽ More
Multimodal learning can benefit from the representation power of pretrained Large Language Models (LLMs). However, state-of-the-art transformer based LLMs often ignore negations in natural language and there is no existing benchmark to quantitatively evaluate whether multimodal transformers inherit this weakness. In this study, we present a new multimodal question answering (QA) benchmark adapted from labeled music videos in AudioSet (Gemmeke et al., 2017) with the goal of systematically evaluating if multimodal transformers can perform complex reasoning to recognize new concepts as negation of previously learned concepts. We show that with standard fine-tuning approach multimodal transformers are still incapable of correctly interpreting negation irrespective of model size. However, our experiments demonstrate that augmenting the original training task distributions with negated QA examples allow the model to reliably reason with negation. To do this, we describe a novel data generation procedure that prompts the 540B-parameter PaLM model to automatically generate negated QA examples as compositions of easily accessible video tags. The generated examples contain more natural linguistic patterns and the gains compared to template-based task augmentation approach are significant.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
Rethinking Data-driven Networking with Foundation Models: Challenges and Opportunities
Authors:
Franck Le,
Mudhakar Srivatsa,
Raghu Ganti,
Vyas Sekar
Abstract:
Foundational models have caused a paradigm shift in the way artificial intelligence (AI) systems are built. They have had a major impact in natural language processing (NLP), and several other domains, not only reducing the amount of required labeled data or even eliminating the need for it, but also significantly improving performance on a wide range of tasks. We argue foundation models can have…
▽ More
Foundational models have caused a paradigm shift in the way artificial intelligence (AI) systems are built. They have had a major impact in natural language processing (NLP), and several other domains, not only reducing the amount of required labeled data or even eliminating the need for it, but also significantly improving performance on a wide range of tasks. We argue foundation models can have a similar profound impact on network traffic analysis, and management. More specifically, we show that network data shares several of the properties that are behind the success of foundational models in linguistics. For example, network data contains rich semantic content, and several of the networking tasks (e.g., traffic classification, generation of protocol implementations from specification text, anomaly detection) can find similar counterparts in NLP (e.g., sentiment analysis, translation from natural language to code, out-of-distribution). However, network settings also present unique characteristics and challenges that must be overcome. Our contribution is in highlighting the opportunities and challenges at the intersection of foundation models and networking.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
Machine Learning Decoder for 5G NR PUCCH Format 0
Authors:
Anil Kumar Yerrapragada,
Jeeva Keshav S,
Ankit Gautam,
Radha Krishna Ganti
Abstract:
5G cellular systems depend on the timely exchange of feedback control information between the user equipment and the base station. Proper decoding of this control information is necessary to set up and sustain high throughput radio links. This paper makes the first attempt at using Machine Learning techniques to improve the decoding performance of the Physical Uplink Control Channel Format 0. We u…
▽ More
5G cellular systems depend on the timely exchange of feedback control information between the user equipment and the base station. Proper decoding of this control information is necessary to set up and sustain high throughput radio links. This paper makes the first attempt at using Machine Learning techniques to improve the decoding performance of the Physical Uplink Control Channel Format 0. We use fully connected neural networks to classify the received samples based on the uplink control information content embedded within them. The trained neural network, tested on real-time wireless captures, shows significant improvement in accuracy over conventional DFT-based decoders, even at low SNR. The obtained accuracy results also demonstrate conformance with 3GPP requirements.
△ Less
Submitted 26 August, 2022;
originally announced September 2022.
-
MuLan: A Joint Embedding of Music Audio and Natural Language
Authors:
Qingqing Huang,
Aren Jansen,
Joonseok Lee,
Ravi Ganti,
Judith Yue Li,
Daniel P. W. Ellis
Abstract:
Music tagging and content-based retrieval systems have traditionally been constructed using pre-defined ontologies covering a rigid set of music attributes or text queries. This paper presents MuLan: a first attempt at a new generation of acoustic models that link music audio directly to unconstrained natural language music descriptions. MuLan takes the form of a two-tower, joint audio-text embedd…
▽ More
Music tagging and content-based retrieval systems have traditionally been constructed using pre-defined ontologies covering a rigid set of music attributes or text queries. This paper presents MuLan: a first attempt at a new generation of acoustic models that link music audio directly to unconstrained natural language music descriptions. MuLan takes the form of a two-tower, joint audio-text embedding model trained using 44 million music recordings (370K hours) and weakly-associated, free-form text annotations. Through its compatibility with a wide range of music genres and text styles (including conventional music tags), the resulting audio-text representation subsumes existing ontologies while graduating to true zero-shot functionalities. We demonstrate the versatility of the MuLan embeddings with a range of experiments including transfer learning, zero-shot music tagging, language understanding in the music domain, and cross-modal retrieval applications.
△ Less
Submitted 25 August, 2022;
originally announced August 2022.
-
Fronthaul Compression for Uplink Massive MIMO using Matrix Decomposition
Authors:
Aswathylakshmi P,
Radha Krishna Ganti
Abstract:
Massive MIMO opens up attractive possibilities for next generation wireless systems with its large number of antennas offering spatial diversity and multiplexing gain. However, the fronthaul link that connects a massive MIMO Remote Radio Head (RRH) and carries IQ samples to the Baseband Unit (BBU) of the base station can throttle the network capacity/speed if appropriate data compression technique…
▽ More
Massive MIMO opens up attractive possibilities for next generation wireless systems with its large number of antennas offering spatial diversity and multiplexing gain. However, the fronthaul link that connects a massive MIMO Remote Radio Head (RRH) and carries IQ samples to the Baseband Unit (BBU) of the base station can throttle the network capacity/speed if appropriate data compression techniques are not applied. In this paper, we propose an iterative technique for fronthaul load reduction in the uplink for massive MIMO systems that utilizes the convolution structure of the received signals. We use an alternating minimisation algorithm for blind deconvolution of the received data matrix that provides compression ratios of 30-50. In addition, the technique presented here can be used for blind decoding of OFDM signals in massive MIMO systems.
△ Less
Submitted 24 October, 2021;
originally announced October 2021.
-
A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option
Authors:
P Sharoff,
Nishant A. Mehta,
Ravi Ganti
Abstract:
We consider a sequential decision-making problem where an agent can take one action at a time and each action has a stochastic temporal extent, i.e., a new action cannot be taken until the previous one is finished. Upon completion, the chosen action yields a stochastic reward. The agent seeks to maximize its cumulative reward over a finite time budget, with the option of "giving up" on a current a…
▽ More
We consider a sequential decision-making problem where an agent can take one action at a time and each action has a stochastic temporal extent, i.e., a new action cannot be taken until the previous one is finished. Upon completion, the chosen action yields a stochastic reward. The agent seeks to maximize its cumulative reward over a finite time budget, with the option of "giving up" on a current action -- hence forfeiting any reward -- in order to choose another action. We cast this problem as a variant of the stochastic multi-armed bandits problem with stochastic consumption of resource. For this problem, we first establish that the optimal arm is the one that maximizes the ratio of the expected reward of the arm to the expected waiting time before the agent sees the reward due to pulling that arm. Using a novel upper confidence bound on this ratio, we then introduce an upper confidence based-algorithm, WAIT-UCB, for which we establish logarithmic, problem-dependent regret bound which has an improved dependence on problem parameters compared to previous works. Simulations on various problem configurations comparing WAIT-UCB against the state-of-the-art algorithms are also presented.
△ Less
Submitted 6 March, 2020;
originally announced March 2020.
-
QR Approximation for Massive MIMO Fronthaul Compression
Authors:
Aswathylakshmi P,
Radha Krishna Ganti
Abstract:
Massive MIMO's immense potential to serve large number of users at fast data rates also comes with the caveat of requiring tremendous processing power. This favours a centralized radio access network (C-RAN) architecture that concentrates the processing power at a common baseband unit (BBU) connected to multiple remote radio heads (RRH) via fronthaul links. The high bandwidths of 5G make the front…
▽ More
Massive MIMO's immense potential to serve large number of users at fast data rates also comes with the caveat of requiring tremendous processing power. This favours a centralized radio access network (C-RAN) architecture that concentrates the processing power at a common baseband unit (BBU) connected to multiple remote radio heads (RRH) via fronthaul links. The high bandwidths of 5G make the fronthaul data rate a major bottleneck. Since the number of active users in a massive MIMO system is much smaller than the number of antennas, we propose a dimension reduction scheme based on low rank approximation for fronthaul data compression. Link level simulations show that the proposed method achieves more than 17x compression while also improving the error performance of the system through denoising.
△ Less
Submitted 12 March, 2019;
originally announced March 2019.
-
neuralRank: Searching and ranking ANN-based model repositories
Authors:
Nirmit Desai,
Linsong Chu,
Raghu K. Ganti,
Sebastian Stein,
Mudhakar Srivatsa
Abstract:
Widespread applications of deep learning have led to a plethora of pre-trained neural network models for common tasks. Such models are often adapted from other models via transfer learning. The models may have varying training sets, training algorithms, network architectures, and hyper-parameters. For a given application, what isthe most suitable model in a model repository? This is a critical que…
▽ More
Widespread applications of deep learning have led to a plethora of pre-trained neural network models for common tasks. Such models are often adapted from other models via transfer learning. The models may have varying training sets, training algorithms, network architectures, and hyper-parameters. For a given application, what isthe most suitable model in a model repository? This is a critical question for practical deployments but it has not received much attention. This paper introduces the novel problem of searching and ranking models based on suitability relative to a target dataset and proposes a ranking algorithm called \textit{neuralRank}. The key idea behind this algorithm is to base model suitability on the discriminating power of a model, using a novel metric to measure it. With experimental results on the MNIST, Fashion, and CIFAR10 datasets, we demonstrate that (1) neuralRank is independent of the domain, the training set, or the network architecture and (2) that the models ranked highly by neuralRank ranking tend to have higher model accuracy in practice.
△ Less
Submitted 2 March, 2019;
originally announced March 2019.
-
Thompson Sampling for Dynamic Pricing
Authors:
Ravi Ganti,
Matyas Sustik,
Quoc Tran,
Brian Seaman
Abstract:
In this paper we apply active learning algorithms for dynamic pricing in a prominent e-commerce website. Dynamic pricing involves changing the price of items on a regular basis, and uses the feedback from the pricing decisions to update prices of the items. Most popular approaches to dynamic pricing use a passive learning approach, where the algorithm uses historical data to learn various paramete…
▽ More
In this paper we apply active learning algorithms for dynamic pricing in a prominent e-commerce website. Dynamic pricing involves changing the price of items on a regular basis, and uses the feedback from the pricing decisions to update prices of the items. Most popular approaches to dynamic pricing use a passive learning approach, where the algorithm uses historical data to learn various parameters of the pricing problem, and uses the updated parameters to generate a new set of prices. We show that one can use active learning algorithms such as Thompson sampling to more efficiently learn the underlying parameters in a pricing problem. We apply our algorithms to a real e-commerce system and show that the algorithms indeed improve revenue compared to pricing algorithms that use passive learning.
△ Less
Submitted 8 February, 2018;
originally announced February 2018.
-
Pressure Gradients Fail to Predict Diffusio-Osmosis
Authors:
Yawei Liu,
Raman Ganti,
Daan Frenkel
Abstract:
We present numerical simulations of diffusio-osmotic flow, i.e. the fluid flow generated by a concentration gradient along a solid-fluid interface. In our study, we compare a number of distinct approaches that have been proposed for computing such flows and compare them with a reference calculation based on direct, non-equilibrium Molecular Dynamics simulations. As alternatives, we consider scheme…
▽ More
We present numerical simulations of diffusio-osmotic flow, i.e. the fluid flow generated by a concentration gradient along a solid-fluid interface. In our study, we compare a number of distinct approaches that have been proposed for computing such flows and compare them with a reference calculation based on direct, non-equilibrium Molecular Dynamics simulations. As alternatives, we consider schemes that compute diffusio-osmotic flow from the gradient of the chemical potentials of the constituent species and from the gradient of the component of the stress tensor parallel to the interface. We find that the approach based on treating chemical potential gradients as external forces acting on various species agrees with the direct simulations, thereby supporting the approach of Marbach et al. (J Chem Phys 146, 194701 (2017)). In contrast, an approach based on computing the gradients of the microscopic pressure tensor does not reproduce the direct non-equilibrium results.
△ Less
Submitted 26 January, 2018;
originally announced January 2018.
-
Interference Characterization in Downlink Li-Fi Optical Attocell Networks
Authors:
Atchutananda Surampudi,
Radha Krishna Ganti
Abstract:
Wireless access to data using visible light, popularly known as light-fidelity (Li-Fi), is one of the key emerging technologies which promises huge bandwidths and data rates. In Li-Fi, the data is modulated on optical intensities and transmitted and detected using light-emitting-diodes (LED) and photodiodes respectively. A network of such LED access points illuminates a given region in the form of…
▽ More
Wireless access to data using visible light, popularly known as light-fidelity (Li-Fi), is one of the key emerging technologies which promises huge bandwidths and data rates. In Li-Fi, the data is modulated on optical intensities and transmitted and detected using light-emitting-diodes (LED) and photodiodes respectively. A network of such LED access points illuminates a given region in the form of attocells. Akin, to wireless networks, co-channel interference or simply interference is a major impediment in Li-Fi attocell networks. Also, when in such networks, the field-of-view (FOV) of a photodiode is limited, the network interference distribution gets affected significantly. So, for any given network scenario, interference characterization is critical for good system design. Currently, there are no good closed-form approximations to interference in Li-Fi attocell networks, that can be used for the analysis of signal-to-interference-plus-noise-ratio (or coverage), particularly for the case of limited FOVs. In this paper, using a technique from Fourier analysis, we provide a very close approximation to interference in one and two dimension Li-Fi attocell networks for any given finite inter-LED separation. We validate the interference approximation by providing theoretical error bounds using asymptotics and by performing numerical simulations. We show that our method of approximation can be extended to characterize interference in limited FOV scenarios as well.
△ Less
Submitted 13 December, 2017;
originally announced December 2017.
-
Hamiltonian transformation to compute Thermo-osmotic Forces
Authors:
Raman Ganti,
Yawei Liu,
Daan Frenkel
Abstract:
If a thermal gradient is applied along a fluid-solid interface, the fluid experiences a thermo-osmotic force. In steady state this force is balanced by the gradient of the shear stress. Surprisingly, there appears to be no unique microscopic expression that can be used for computing the magnitude of the thermo-osmotic force.
Here we report how, by treating the mass $M$ of the fluid particles as…
▽ More
If a thermal gradient is applied along a fluid-solid interface, the fluid experiences a thermo-osmotic force. In steady state this force is balanced by the gradient of the shear stress. Surprisingly, there appears to be no unique microscopic expression that can be used for computing the magnitude of the thermo-osmotic force.
Here we report how, by treating the mass $M$ of the fluid particles as a tensor in the Hamiltonian, we can eliminate the balancing shear force in a non-equilibrium simulation and therefore compute the thermo-osmotic force at simple solid-fluid interfaces. We compare the non-equilibrium force measurement with estimates of the thermo-osmotic force based on computing gradients of the stress tensor. We find that the thermo-osmotic force as measured in our simulations cannot be derived from the most common microscopic definitions of the stress tensor.
△ Less
Submitted 4 October, 2017;
originally announced October 2017.
-
Microscopic Marangoni flows cannot be predicted on the basis of pressure gradients
Authors:
Yawei Liu,
Raman Ganti,
Hugh G. A. Burton,
Xianren Zhang,
Wenchuan Wang,
Daan Frenkel
Abstract:
A concentration gradient along a fluid-fluid interface can cause flow. On a microscopic level, this so-called Marangoni effect can be viewed as being caused by a gradient in the pressures acting on the fluid elements, or as the chemical-potential gradients acting on the excess densities of different species at the interface. If the interfacial thickness can be ignored, all approaches should result…
▽ More
A concentration gradient along a fluid-fluid interface can cause flow. On a microscopic level, this so-called Marangoni effect can be viewed as being caused by a gradient in the pressures acting on the fluid elements, or as the chemical-potential gradients acting on the excess densities of different species at the interface. If the interfacial thickness can be ignored, all approaches should result in the same flow profile away from the interface. However, on a more microscopic scale, the different expressions result in different flow profiles, only one of which can be correct. Here we compare the results of direct non-equilibrium Molecular Dynamics simulations with the flows that would be generated by pressure and chemical potential gradients. We find that the approach based on the chemical potential gradients agrees with the direct simulations, whereas the calculations based on the pressure gradients do not.
△ Less
Submitted 1 October, 2017;
originally announced October 2017.
-
Coverage Analysis in Millimeter Wave Cellular Networks with Reflections
Authors:
Aroon Narayanan,
Sreejith T. V,
Radha Krishna Ganti
Abstract:
The coverage probability of a user in a mmwave system depends on the availability of line-of-sight paths or reflected paths from any base station. Many prior works modelled blockages using random shape theory and analyzed the SIR distribution with and without interference. While, it is intuitive that the reflected paths do not significantly contribute to the coverage (because of longer path length…
▽ More
The coverage probability of a user in a mmwave system depends on the availability of line-of-sight paths or reflected paths from any base station. Many prior works modelled blockages using random shape theory and analyzed the SIR distribution with and without interference. While, it is intuitive that the reflected paths do not significantly contribute to the coverage (because of longer path lengths), there are no works which provide a model and study the coverage with reflections. In this paper, we model and analyze the impact of reflectors using stochastic geometry. We observe that the reflectors have very little impact on the coverage probability.
△ Less
Submitted 5 August, 2017;
originally announced August 2017.
-
Error Vector Magnitude Analysis in Generalized Fading with Co-Channel Interference
Authors:
Sudharsan Parthasarathy,
Suman Kumar,
Radha Krishna Ganti,
Sheetal Kalyani,
K. Giridhar
Abstract:
In this paper, we derive the data-aided Error Vector Magnitude (EVM) in an interference limited system when both the desired signal and interferers experience independent and non identically distributed $κ$-$μ$ shadowed fading. Then it is analytically shown that the EVM is equal to the square root of number of interferers when the desired signal and interferers do not experience fading. Further, E…
▽ More
In this paper, we derive the data-aided Error Vector Magnitude (EVM) in an interference limited system when both the desired signal and interferers experience independent and non identically distributed $κ$-$μ$ shadowed fading. Then it is analytically shown that the EVM is equal to the square root of number of interferers when the desired signal and interferers do not experience fading. Further, EVM is derived in the presence of interference and noise, when the desired signal experiences $κ$-$μ$ shadowed fading and the interferers experience independent and identical Nakagami fading. Moreover, using the properties of the special functions, the derived EVM expressions are also simplified for various special cases.
△ Less
Submitted 11 April, 2017;
originally announced April 2017.
-
Efficient CSMA using Regional Free Energy Approximations
Authors:
Peruru Subrahmanya Swamy,
Venkata Pavan Kumar Bellam,
Radha Krishna Ganti,
Krishna Jagannathan
Abstract:
CSMA (Carrier Sense Multiple Access) algorithms based on Gibbs sampling can achieve throughput optimality if certain parameters called the fugacities are appropriately chosen. However, the problem of computing these fugacities is NP-hard. In this work, we derive estimates of the fugacities by using a framework called the regional free energy approximations. In particular, we derive explicit expres…
▽ More
CSMA (Carrier Sense Multiple Access) algorithms based on Gibbs sampling can achieve throughput optimality if certain parameters called the fugacities are appropriately chosen. However, the problem of computing these fugacities is NP-hard. In this work, we derive estimates of the fugacities by using a framework called the regional free energy approximations. In particular, we derive explicit expressions for approximate fugacities corresponding to any feasible service rate vector. We further prove that our approximate fugacities are exact for the class of chordal graphs. A distinguishing feature of our work is that the regional approximations that we propose are tailored to conflict graphs with small cycles, which is a typical characteristic of wireless networks. Numerical results indicate that the fugacities obtained by the proposed method are quite accurate and significantly outperform the existing Bethe approximation based techniques.
△ Less
Submitted 22 February, 2017;
originally announced February 2017.
-
Molecular Simulation of Thermo-osmotic slip
Authors:
Raman Ganti,
Yawei Liu,
Daan Frenkel
Abstract:
Thermo-osmotic slip -- the flow induced by a thermal gradient along a surface -- is a well-known phenomenon, but curiously there is a lack of robust molecular-simulation techniques to predict its magnitude. Here, we compare three different molecular simulation techniques to compute the thermo-osmotic slip at a simple solid-fluid interface. Although we do not expect the different approaches to be i…
▽ More
Thermo-osmotic slip -- the flow induced by a thermal gradient along a surface -- is a well-known phenomenon, but curiously there is a lack of robust molecular-simulation techniques to predict its magnitude. Here, we compare three different molecular simulation techniques to compute the thermo-osmotic slip at a simple solid-fluid interface. Although we do not expect the different approaches to be in perfect agreement, we find that the differences are barely significant for a range of different physical conditions, suggesting that practical molecular simulations of thermo-osmotic slip are feasible.
△ Less
Submitted 8 February, 2017;
originally announced February 2017.
-
SIR Asymptotics in General Network Models
Authors:
An** Guo,
Martin Haenggi,
Radha Krishna Ganti
Abstract:
In the performance analyses of wireless networks, asymptotic quantities and properties often pro- vide useful results and insights. The asymptotic analyses become especially important when complete analytical expressions of the performance metrics of interest are not available, which is often the case if one departs from very specific modeling assumptions. In this paper, we consider the asymptotic…
▽ More
In the performance analyses of wireless networks, asymptotic quantities and properties often pro- vide useful results and insights. The asymptotic analyses become especially important when complete analytical expressions of the performance metrics of interest are not available, which is often the case if one departs from very specific modeling assumptions. In this paper, we consider the asymptotics of the SIR distribution in general wireless network models, including ad hoc and cellular networks, simple and non-simple point processes, and singular and bounded path loss models, for which, in most cases, finding analytical expressions of the complete SIR distribution seems hopeless. We show that the lower tails of the SIR distributions decay polynomially with the order solely determined by the path loss exponent or the fading parameter, while the upper tails decay exponentially, with the exception of cellular networks with singular path loss. In addition, we analyze the impact of the nearest interferer on the asymptotic properties of the SIR distributions, and we formulate three crisp conjectures that -if true- determine the asymptotic behavior in many cases based on the large-scale path loss properties of the desired signal and/or nearest interferer only.
△ Less
Submitted 14 November, 2016;
originally announced November 2016.
-
Maximal Packing with Interference Constraints
Authors:
Rakshith Jagannath,
Radha Krishna Ganti,
Neelesh S Upadhye
Abstract:
In this work, we study the problem of scheduling a maximal set of transmitters subjected to an interference constraint across all the nodes. Given a set of nodes, the problem reduces to finding the maximum cardinality of a subset of nodes that can concurrently transmit without violating interference constraints. The resulting packing problem is a binary optimization problem and is NP hard. We prop…
▽ More
In this work, we study the problem of scheduling a maximal set of transmitters subjected to an interference constraint across all the nodes. Given a set of nodes, the problem reduces to finding the maximum cardinality of a subset of nodes that can concurrently transmit without violating interference constraints. The resulting packing problem is a binary optimization problem and is NP hard. We propose a semi-definite relaxation (SDR) for this problem and provide bounds on the relaxation.
△ Less
Submitted 31 October, 2016;
originally announced October 2016.
-
Beyond Spatial Auto-Regressive Models: Predicting Housing Prices with Satellite Imagery
Authors:
Archith J. Bency,
Swati Rallapalli,
Raghu K. Ganti,
Mudhakar Srivatsa,
B. S. Manjunath
Abstract:
When modeling geo-spatial data, it is critical to capture spatial correlations for achieving high accuracy. Spatial Auto-Regression (SAR) is a common tool used to model such data, where the spatial contiguity matrix (W) encodes the spatial correlations. However, the efficacy of SAR is limited by two factors. First, it depends on the choice of contiguity matrix, which is typically not learnt from d…
▽ More
When modeling geo-spatial data, it is critical to capture spatial correlations for achieving high accuracy. Spatial Auto-Regression (SAR) is a common tool used to model such data, where the spatial contiguity matrix (W) encodes the spatial correlations. However, the efficacy of SAR is limited by two factors. First, it depends on the choice of contiguity matrix, which is typically not learnt from data, but instead, is assumed to be known apriori. Second, it assumes that the observations can be explained by linear models. In this paper, we propose a Convolutional Neural Network (CNN) framework to model geo-spatial data (specifi- cally housing prices), to learn the spatial correlations automatically. We show that neighborhood information embedded in satellite imagery can be leveraged to achieve the desired spatial smoothing. An additional upside of our framework is the relaxation of linear assumption on the data. Specific challenges we tackle while implementing our framework include, (i) how much of the neighborhood is relevant while estimating housing prices? (ii) what is the right approach to capture multiple resolutions of satellite imagery? and (iii) what other data-sources can help improve the estimation of spatial correlations? We demonstrate a marked improvement of 57% on top of the SAR baseline through the use of features from deep neural networks for the cities of London, Birmingham and Liverpool.
△ Less
Submitted 15 October, 2016;
originally announced October 2016.
-
Adaptive Modulation with Impulsive Interference
Authors:
Sudharsan Parthasarathy,
Radha Krishna Ganti
Abstract:
In this letter, we analyze power and rate adaptation in a point-to-point link with Rayleigh fading and impulsive interference. We model the impulsive interference as a Bernoulli-Gaussian random process. Adaptation is used to maximize the average spectral efficiency by changing power and rate of the transmission subject to an average power and instantaneous probability of error constraints. Without…
▽ More
In this letter, we analyze power and rate adaptation in a point-to-point link with Rayleigh fading and impulsive interference. We model the impulsive interference as a Bernoulli-Gaussian random process. Adaptation is used to maximize the average spectral efficiency by changing power and rate of the transmission subject to an average power and instantaneous probability of error constraints. Without impulsive interference, it is well known that water-filling is optimal for block fading. We provide two simple schemes that show that the conventional water-filling algorithm is not optimal in an impulsive interference channel.
△ Less
Submitted 12 May, 2016;
originally announced May 2016.
-
Coverage Analysis In Downlink Poisson Cellular Network With $κ$-$μ$ Shadowed Fading
Authors:
Sudharsan Parthasarathy,
Radha Krishna Ganti
Abstract:
The downlink coverage probability of a cellular network, when the base station locations are modelled by a Poisson point process (PPP), is known when the desired channel is Nakagami distributed with an integer shape parameter. However, for many interesting fading distributions such as Rician, Rician shadowing, $κ$-$μ$, $η$-$μ$, etc., the coverage probability is unknown. $κ$-$μ$ shadowed fading is…
▽ More
The downlink coverage probability of a cellular network, when the base station locations are modelled by a Poisson point process (PPP), is known when the desired channel is Nakagami distributed with an integer shape parameter. However, for many interesting fading distributions such as Rician, Rician shadowing, $κ$-$μ$, $η$-$μ$, etc., the coverage probability is unknown. $κ$-$μ$ shadowed fading is a generic fading distribution whose special cases are many of these popular distributions known so far. In this letter, we derive the coverage probability when the desired channel experiences $κ$-$μ$ shadowed fading. Using numerical simulations, we verify our analytical expressions.
△ Less
Submitted 24 October, 2016; v1 submitted 12 May, 2016;
originally announced May 2016.
-
A Linearization Technique for Self-Interference Cancellation in Full-Duplex Radios
Authors:
Arjun Nadh,
Joseph Samuel,
Ankit Sharma,
S. Aniruddhan,
Radha Krishna Ganti
Abstract:
The fundamental problem in the design of a full-duplex radio is the cancellation of the self-interference (SI) signal generated by the transmitter.Current techniques for suppressing SI rely on generating a copy of the SI signal and subtracting it partly in the RF (radio frequency) and digital domains. A critical step in replicating the self-interference is the estimation of the multi-path channel…
▽ More
The fundamental problem in the design of a full-duplex radio is the cancellation of the self-interference (SI) signal generated by the transmitter.Current techniques for suppressing SI rely on generating a copy of the SI signal and subtracting it partly in the RF (radio frequency) and digital domains. A critical step in replicating the self-interference is the estimation of the multi-path channel through which the transmitted signal propagates to the antenna. Since there is no prior model on the number of multipath reflections, current techniques assume a tap delay line filter (in the RF and digital domain) with a large number of taps, and estimate the taps in the analog and the digital domain. Assuming such a model leads to a large form-factor for the analog and RF circuits and increased complexity in the digital domain.
In this paper, using a linearization technique, we show that the self-interference channel in an indoor environment can be effectively modelled as $H(f)=C_0 + C_1f$ in the frequency domain. Thus, the effective self-interference channel can be represented by two parameters $C_0$ and $C_1$, irrespective of the multipath environment. We also provide experimental evidence to verify the above channel model and propose novel low-complexity designs for self-interference cancellation. Linearization not only aids in the practicality of analog cancellation by reducing the form factor, but also results in a simpler SI filter model in the digital domain due to dimensionality reduction of the channel parameters. Therefore this method can enable the widespread adoption of full-duplex techniques to portable devices in addition to infrastructure base-stations.
△ Less
Submitted 4 May, 2016;
originally announced May 2016.
-
Active Algorithms For Preference Learning Problems with Multiple Populations
Authors:
Aniruddha Bhargava,
Ravi Ganti,
Robert Nowak
Abstract:
In this paper we model the problem of learning preferences of a population as an active learning problem. We propose an algorithm can adaptively choose pairs of items to show to users coming from a heterogeneous population, and use the obtained reward to decide which pair of items to show next. We provide computationally efficient algorithms with provable sample complexity guarantees for this prob…
▽ More
In this paper we model the problem of learning preferences of a population as an active learning problem. We propose an algorithm can adaptively choose pairs of items to show to users coming from a heterogeneous population, and use the obtained reward to decide which pair of items to show next. We provide computationally efficient algorithms with provable sample complexity guarantees for this problem in both the noiseless and noisy cases. In the process of establishing sample complexity guarantees for our algorithms, we establish new results using a Nystr{ö}m-like method which can be of independent interest. We supplement our theoretical results with experimental comparisons.
△ Less
Submitted 22 June, 2016; v1 submitted 13 March, 2016;
originally announced March 2016.
-
On Learning High Dimensional Structured Single Index Models
Authors:
Nikhil Rao,
Ravi Ganti,
Laura Balzano,
Rebecca Willett,
Robert Nowak
Abstract:
Single Index Models (SIMs) are simple yet flexible semi-parametric models for machine learning, where the response variable is modeled as a monotonic function of a linear combination of features. Estimation in this context requires learning both the feature weights and the nonlinear function that relates features to observations. While methods have been described to learn SIMs in the low dimension…
▽ More
Single Index Models (SIMs) are simple yet flexible semi-parametric models for machine learning, where the response variable is modeled as a monotonic function of a linear combination of features. Estimation in this context requires learning both the feature weights and the nonlinear function that relates features to observations. While methods have been described to learn SIMs in the low dimensional regime, a method that can efficiently learn SIMs in high dimensions, and under general structural assumptions, has not been forthcoming. In this paper, we propose computationally efficient algorithms for SIM inference in high dimensions with structural constraints. Our general approach specializes to sparsity, group sparsity, and low-rank assumptions among others. Experiments show that the proposed method enjoys superior predictive performance when compared to generalized linear models, and achieves results comparable to or better than single layer feedforward neural networks with significantly less computational cost.
△ Less
Submitted 29 November, 2016; v1 submitted 12 March, 2016;
originally announced March 2016.
-
Adaptive CSMA under the SINR Model: Efficient Approximation Algorithms for Throughput and Utility Maximization
Authors:
Peruru Subrahmanya Swamy,
Radha Krishna Ganti,
Krishna Jagannathan
Abstract:
We consider a Carrier Sense Multiple Access (CSMA) based scheduling algorithm for a single-hop wireless network under a realistic Signal-to-interference-plus-noise ratio (SINR) model for the interference. We propose two local optimization based approximation algorithms to efficiently estimate certain attempt rate parameters of CSMA called fugacities. It is known that adaptive CSMA can achieve thro…
▽ More
We consider a Carrier Sense Multiple Access (CSMA) based scheduling algorithm for a single-hop wireless network under a realistic Signal-to-interference-plus-noise ratio (SINR) model for the interference. We propose two local optimization based approximation algorithms to efficiently estimate certain attempt rate parameters of CSMA called fugacities. It is known that adaptive CSMA can achieve throughput optimality by sampling feasible schedules from a Gibbs distribution, with appropriate fugacities. Unfortunately, obtaining these optimal fugacities is an NP-hard problem. Further, the existing adaptive CSMA algorithms use a stochastic gradient descent based method, which usually entails an impractically slow (exponential in the size of the network) convergence to the optimal fugacities. To address this issue, we first propose an algorithm to estimate the fugacities, that can support a given set of desired service rates. The convergence rate and the complexity of this algorithm are independent of the network size, and depend only on the neighborhood size of a link. Further, we show that the proposed algorithm corresponds exactly to performing the well-known Bethe approximation to the underlying Gibbs distribution. Then, we propose another local algorithm to estimate the optimal fugacities under a utility maximization framework, and characterize its accuracy. Numerical results indicate that the proposed methods have a good degree of accuracy, and achieve extremely fast convergence to near-optimal fugacities, and often outperform the convergence rate of the stochastic gradient descent by a few orders of magnitude.
△ Less
Submitted 23 February, 2017; v1 submitted 22 January, 2016;
originally announced January 2016.
-
Joint Backhaul-Access Analysis of Full Duplex Self-Backhauling Heterogeneous Networks
Authors:
Ankit Sharma,
Radha Krishna Ganti,
J. Klutto Milleth
Abstract:
With the successful demonstration of in-band full-duplex (IBFD) transceivers, a new research dimension has been added to wireless networks. This paper proposes an interesting use case of this capability for IBFD self-backhauling heterogeneous networks (HetNet). IBFD self-backhauling in a HetNet refers to IBFD-enabled small cells backhauling themselves with macro cells over the wireless channel. Ow…
▽ More
With the successful demonstration of in-band full-duplex (IBFD) transceivers, a new research dimension has been added to wireless networks. This paper proposes an interesting use case of this capability for IBFD self-backhauling heterogeneous networks (HetNet). IBFD self-backhauling in a HetNet refers to IBFD-enabled small cells backhauling themselves with macro cells over the wireless channel. Owing to their IBFD capability, the small cells simultaneously communicate over the access and backhaul links, using the same frequency band. The idea is doubly advantageous, as it obviates the need for fiber backhauling small cells every hundred meters and allows the access spectrum to be reused for backhauling at no extra cost. This work considers the case of a two-tier cellular network with IBFD-enabled small cells, wirelessly backhauling themselves with conventional macro cells. For clear exposition, the case considered is that of FDD network, where within access and backhaul links, the downlink (DL) and uplink (UL) are frequency duplexed ($f1$, $f2$ respectively), while the total frequency spectrum used at access and backhaul ($f1+f2$) is the same. Analytical expressions for coverage and average downlink (DL) rate in such a network are derived using tools from the field of stochastic geometry. It is shown that DL rate in such networks could be close to double that of a conventional TDD/FDD self-backhauling network, at the expense of reduced coverage due to higher interference in IBFD networks. For the proposed IBFD network, the conflicting aspects of increased interference on one side and high spectral efficiency on the other are captured into a mathematical model. The mathematical model introduces an end-to-end joint analysis of backhaul (or fronthaul) and access links, in contrast to the largely available access-centric studies.
△ Less
Submitted 10 January, 2017; v1 submitted 8 January, 2016;
originally announced January 2016.
-
Matrix Completion Under Monotonic Single Index Models
Authors:
Ravi Ganti,
Laura Balzano,
Rebecca Willett
Abstract:
Most recent results in matrix completion assume that the matrix under consideration is low-rank or that the columns are in a union of low-rank subspaces. In real-world settings, however, the linear structure underlying these models is distorted by a (typically unknown) nonlinear transformation. This paper addresses the challenge of matrix completion in the face of such nonlinearities. Given a few…
▽ More
Most recent results in matrix completion assume that the matrix under consideration is low-rank or that the columns are in a union of low-rank subspaces. In real-world settings, however, the linear structure underlying these models is distorted by a (typically unknown) nonlinear transformation. This paper addresses the challenge of matrix completion in the face of such nonlinearities. Given a few observations of a matrix that are obtained by applying a Lipschitz, monotonic function to a low rank matrix, our task is to estimate the remaining unobserved entries. We propose a novel matrix completion method that alternates between low-rank matrix estimation and monotonic function estimation to estimate the missing matrix elements. Mean squared error bounds provide insight into how well the matrix can be estimated based on the size, rank of the matrix and properties of the nonlinear transformation. Empirical results on synthetic and real-world datasets demonstrate the competitiveness of the proposed approach.
△ Less
Submitted 29 December, 2015;
originally announced December 2015.
-
Performance of Cloud Radio Networks
Authors:
Sreejith T. Veetil,
Kiran Kuchi,
Radha Krishna Ganti
Abstract:
Cloud radio networks coordinate transmission among base stations (BSs) to reduce the interference effects, particularly for the cell-edge users. In this paper, we analyze the performance of a cloud network with static clustering where geographically close BSs form a cloud network of cooperating BSs. Because, of finite cooperation, the interference in a practical cloud radio cannot be removed and i…
▽ More
Cloud radio networks coordinate transmission among base stations (BSs) to reduce the interference effects, particularly for the cell-edge users. In this paper, we analyze the performance of a cloud network with static clustering where geographically close BSs form a cloud network of cooperating BSs. Because, of finite cooperation, the interference in a practical cloud radio cannot be removed and in this paper, the distance based interference is taken into account in the analysis. In particular, we consider centralized zero forcing equalizer and dirty paper precoding for cancelling the interference. Bounds are developed on the signal-to-interference ratio distribution and achievable rate with full and limited channel feedback from the cluster users. The adverse effect of finite clusters on the achievable rate is quantified. We show that, the number of cooperating BSs is more crucial than the cluster area when full channel state information form the cluster is available for precoding. Also, we study the impact of limiting the channel state information on the achievable rate. We show that even with a practically feasible feedback of about five to six channel states from each user, significant gain in mean rate and cell edge rate compared to conventional cellular systems can be obtained.
△ Less
Submitted 18 December, 2015;
originally announced December 2015.
-
Joint Source Selection and Data Extrapolation in Social Sensing for Disaster Response
Authors:
Mohammad Hosseini,
Nooreddin Nagibolhosseini,
Amotz Barnoy,
Peter Terlecky,
Hengchang Liu,
Shaohan Hu,
Shiguang Wang,
Tanvir Amin,
Lu Su,
Dong Wang,
Ramesh Govindan,
Raghu Ganti,
Mudhakar Srivatsa,
Charu Aggrawal,
Tarek Abdelzaher,
Siyu Gu,
Chenji Pan
Abstract:
This paper complements the large body of social sensing literature by develo** means for augmenting sensing data with inference results that "fill-in" missing pieces. It specifically explores the synergy between (i) inference techniques used for filling-in missing pieces and (ii) source selection techniques used to determine which pieces to retrieve in order to improve inference results. We focu…
▽ More
This paper complements the large body of social sensing literature by develo** means for augmenting sensing data with inference results that "fill-in" missing pieces. It specifically explores the synergy between (i) inference techniques used for filling-in missing pieces and (ii) source selection techniques used to determine which pieces to retrieve in order to improve inference results. We focus on prediction in disaster scenarios, where disruptive trend changes occur. We first discuss our previous conference study that compared a set of prediction heuristics and developed a hybrid prediction algorithm. We then enhance the prediction scheme by considering algorithms for sensor selection that improve inference quality. Our proposed source selection and extrapolation algorithms are tested using data collected during the New York City crisis in the aftermath of Hurricane Sandy in November 2012. The evaluation results show that consistently good predictions are achieved. The work is notable for addressing the bi-modal nature of damage propagation in complex systems subjected to stress, where periods of calm are interspersed with periods of severe change. It is novel in offering a new solution to the problem that jointly leverages source selection and extrapolation components thereby improving the results.
△ Less
Submitted 1 December, 2015;
originally announced December 2015.
-
Learning Single Index Models in High Dimensions
Authors:
Ravi Ganti,
Nikhil Rao,
Rebecca M. Willett,
Robert Nowak
Abstract:
Single Index Models (SIMs) are simple yet flexible semi-parametric models for classification and regression. Response variables are modeled as a nonlinear, monotonic function of a linear combination of features. Estimation in this context requires learning both the feature weights, and the nonlinear function. While methods have been described to learn SIMs in the low dimensional regime, a method t…
▽ More
Single Index Models (SIMs) are simple yet flexible semi-parametric models for classification and regression. Response variables are modeled as a nonlinear, monotonic function of a linear combination of features. Estimation in this context requires learning both the feature weights, and the nonlinear function. While methods have been described to learn SIMs in the low dimensional regime, a method that can efficiently learn SIMs in high dimensions has not been forthcoming. We propose three variants of a computationally and statistically efficient algorithm for SIM inference in high dimensions. We establish excess risk bounds for the proposed algorithms and experimentally validate the advantages that our SIM learning methods provide relative to Generalized Linear Model (GLM) and low dimensional SIM based learning methods.
△ Less
Submitted 29 June, 2015;
originally announced June 2015.
-
Asymptotics and Approximation of the SIR Distribution in General Cellular Networks
Authors:
Radha K. Ganti,
Martin Haenggi
Abstract:
It has recently been observed that the SIR distributions of a variety of cellular network models and transmission techniques look very similar in shape. As a result, they are well approximated by a simple horizontal shift (or gain) of the distribution of the most tractable model, the Poisson point process (PPP). To study and explain this behavior, this paper focuses on general single-tier network…
▽ More
It has recently been observed that the SIR distributions of a variety of cellular network models and transmission techniques look very similar in shape. As a result, they are well approximated by a simple horizontal shift (or gain) of the distribution of the most tractable model, the Poisson point process (PPP). To study and explain this behavior, this paper focuses on general single-tier network models with nearest-base station association and studies the asymptotic gain both at 0 and at infinity.
We show that the gain at 0 is determined by the so-called mean interference-to-signal ratio (MISR) between the PPP and the network model under consideration, while the gain at infinity is determined by the expected fading-to-interference ratio (EFIR).
The analysis of the MISR is based on a novel type of point process, the so-called relative distance process, which is a one-dimensional point process on the unit interval [0,1] that fully determines the SIR. A comparison of the gains at 0 and infinity shows that the gain at 0 indeed provides an excellent approximation for the entire SIR distribution. Moreover, the gain is mostly a function of the network geometry and barely depends on the path loss exponent and the fading. The results are illustrated using several examples of repulsive point processes.
△ Less
Submitted 25 November, 2015; v1 submitted 9 May, 2015;
originally announced May 2015.
-
Approximation of Capacity for ISI Channels with One-bit Output Quantization
Authors:
Radha Krishna Ganti,
Andrew Thangaraj,
Arijit Mondal
Abstract:
Motivated by recent high bandwidth communication systems, Inter-Symbol Interference (ISI) channels with 1-bit quantized output are considered under an average-power-constrained continuous input. While the exact capacity is difficult to characterize, an approximation that matches with the exact channel output up to a probability of error is provided. The approximation does not have additive noise,…
▽ More
Motivated by recent high bandwidth communication systems, Inter-Symbol Interference (ISI) channels with 1-bit quantized output are considered under an average-power-constrained continuous input. While the exact capacity is difficult to characterize, an approximation that matches with the exact channel output up to a probability of error is provided. The approximation does not have additive noise, but constrains the channel output (without noise) to be above a threshold in absolute value. The capacity under the approximation is computed using methods involving standard Gibbs distributions. Markovian achievable schemes approaching the approximate capacity are provided. The methods used over the approximate ISI channel result in ideas for practical coding schemes for ISI channels with 1-bit output quantization.
△ Less
Submitted 4 May, 2015;
originally announced May 2015.
-
Spatial CSMA: A Distributed Scheduling Algorithm for the SIR Model with Time-varying Channels
Authors:
Peruru Subrahmanya Swamy,
Radha Krishna Ganti,
Krishna Jagannathan
Abstract:
Recent work has shown that adaptive CSMA algorithms can achieve throughput optimality. However, these adaptive CSMA algorithms assume a rather simplistic model for the wireless medium. Specifically, the interference is typically modelled by a conflict graph, and the channels are assumed to be static. In this work, we propose a distributed and adaptive CSMA algorithm under a more realistic signal-t…
▽ More
Recent work has shown that adaptive CSMA algorithms can achieve throughput optimality. However, these adaptive CSMA algorithms assume a rather simplistic model for the wireless medium. Specifically, the interference is typically modelled by a conflict graph, and the channels are assumed to be static. In this work, we propose a distributed and adaptive CSMA algorithm under a more realistic signal-to-interference ratio (SIR) based interference model, with time-varying channels. We prove that our algorithm is throughput optimal under this generalized model. Further, we augment our proposed algorithm by using a parallel update technique. Numerical results show that our algorithm outperforms the conflict graph based algorithms, in terms of supportable throughput and the rate of convergence to steady-state.
△ Less
Submitted 29 April, 2015;
originally announced April 2015.
-
Active Model Aggregation via Stochastic Mirror Descent
Authors:
Ravi Ganti
Abstract:
We consider the problem of learning convex aggregation of models, that is as good as the best convex aggregation, for the binary classification problem. Working in the stream based active learning setting, where the active learner has to make a decision on-the-fly, if it wants to query for the label of the point currently seen in the stream, we propose a stochastic-mirror descent algorithm, called…
▽ More
We consider the problem of learning convex aggregation of models, that is as good as the best convex aggregation, for the binary classification problem. Working in the stream based active learning setting, where the active learner has to make a decision on-the-fly, if it wants to query for the label of the point currently seen in the stream, we propose a stochastic-mirror descent algorithm, called SMD-AMA, with entropy regularization. We establish an excess risk bounds for the loss of the convex aggregate returned by SMD-AMA to be of the order of $O\left(\sqrt{\frac{\log(M)}{T^{1-μ}}}\right)$, where $μ\in [0,1)$ is an algorithm dependent parameter, that trades-off the number of labels queried, and excess risk.
△ Less
Submitted 28 March, 2015;
originally announced March 2015.
-
Sparse Linear Regression With Missing Data
Authors:
Ravi Ganti,
Rebecca M. Willett
Abstract:
This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the regression coefficients, and the proposed algorithm jointly learns the low-dimensional structure of the data and a linear regressor with sparse coefficients. The propos…
▽ More
This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the regression coefficients, and the proposed algorithm jointly learns the low-dimensional structure of the data and a linear regressor with sparse coefficients. The proposed stochastic optimization method, Sparse Linear Regression with Missing Data (SLRM), performs an alternating minimization procedure and scales well with the problem size. Large deviation inequalities shed light on the impact of the various problem-dependent parameters on the expected squared loss of the learned regressor. Extensive simulations on both synthetic and real datasets show that SLRM performs better than competing algorithms in a variety of contexts.
△ Less
Submitted 28 March, 2015;
originally announced March 2015.