Search | arXiv e-print repository

TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes

Authors: Aamod Khatiwada, Harsha Kokel, Ibrahim Abdelaziz, Subhajit Chaudhury, Julian Dolby, Oktie Hassanzadeh, Zhenhan Huang, Tejaswini Pedapati, Horst Samulowitz, Kavitha Srinivas

Abstract: Enterprises have a growing need to identify relevant tables in data lakes; e.g. tables that are unionable, joinable, or subsets of each other. Tabular neural models can be helpful for such data discovery tasks. In this paper, we present TabSketchFM, a neural tabular model for data discovery over data lakes. First, we propose a novel pre-training sketch-based approach to enhance the effectiveness o… ▽ More Enterprises have a growing need to identify relevant tables in data lakes; e.g. tables that are unionable, joinable, or subsets of each other. Tabular neural models can be helpful for such data discovery tasks. In this paper, we present TabSketchFM, a neural tabular model for data discovery over data lakes. First, we propose a novel pre-training sketch-based approach to enhance the effectiveness of data discovery techniques in neural tabular models. Second, to further finetune the pretrained model for several downstream tasks, we develop LakeBench, a collection of 8 benchmarks to help with different data discovery tasks such as finding tasks that are unionable, joinable, or subsets of each other. We then show on these finetuning tasks that TabSketchFM achieves state-of-the art performance compared to existing neural models. Third, we use these finetuned models to search for tables that are unionable, joinable, or can be subsets of each other. Our results demonstrate improvements in F1 scores for search compared to state-of-the-art techniques (even up to 70% improvement in a joinable search benchmark). Finally, we show significant transfer across datasets and tasks establishing that our model can generalize across different tasks over different data lakes △ Less

Submitted 28 June, 2024; originally announced July 2024.

Comments: arXiv admin note: text overlap with arXiv:2307.04217

arXiv:2406.10320 [pdf, other]

Out of style: Misadventures with LLMs and code style transfer

Authors: Karl Munson, Chih-Kai Ting, Serenity Wade, Anish Savla, Julian Dolby, Kiran Kate, Kavitha Srinivas

Abstract: Like text, programs have styles, and certain programming styles are more desirable than others for program readability, maintainability, and performance. Code style transfer, however, is difficult to automate except for trivial style guidelines such as limits on line length. Inspired by the success of using language models for text style transfer, we investigate if code language models can perform… ▽ More Like text, programs have styles, and certain programming styles are more desirable than others for program readability, maintainability, and performance. Code style transfer, however, is difficult to automate except for trivial style guidelines such as limits on line length. Inspired by the success of using language models for text style transfer, we investigate if code language models can perform code style transfer. Code style transfer, unlike text transfer, has rigorous requirements: the system needs to identify lines of code to change, change them correctly, and leave the rest of the program untouched. We designed CSB (Code Style Benchmark), a benchmark suite of code style transfer tasks across five categories including converting for-loops to list comprehensions, eliminating duplication in code, adding decorators to methods, etc. We then used these tests to see if large pre-trained code language models or fine-tuned models perform style transfer correctly, based on rigorous metrics to test that the transfer did occur, and the code still passes functional tests. Surprisingly, language models failed to perform all of the tasks, suggesting that they perform poorly on tasks that require code understanding. We will make available the large-scale corpora to help the community build better code models. △ Less

Submitted 14 June, 2024; originally announced June 2024.

arXiv:2405.11525 [pdf, other]

Overcoming Data and Model Heterogeneities in Decentralized Federated Learning via Synthetic Anchors

Authors: Chun-Yin Huang, Kartik Srinivas, Xin Zhang, Xiaoxiao Li

Abstract: Conventional Federated Learning (FL) involves collaborative training of a global model while maintaining user data privacy. One of its branches, decentralized FL, is a serverless network that allows clients to own and optimize different local models separately, which results in saving management and communication resources. Despite the promising advancements in decentralized FL, it may reduce mode… ▽ More Conventional Federated Learning (FL) involves collaborative training of a global model while maintaining user data privacy. One of its branches, decentralized FL, is a serverless network that allows clients to own and optimize different local models separately, which results in saving management and communication resources. Despite the promising advancements in decentralized FL, it may reduce model generalizability due to lacking a global model. In this scenario, managing data and model heterogeneity among clients becomes a crucial problem, which poses a unique challenge that must be overcome: How can every client's local model learn generalizable representation in a decentralized manner? To address this challenge, we propose a novel Decentralized FL technique by introducing Synthetic Anchors, dubbed as DeSA. Based on the theory of domain adaptation and Knowledge Distillation (KD), we theoretically and empirically show that synthesizing global anchors based on raw data distribution facilitates mutual knowledge transfer. We further design two effective regularization terms for local training: 1) REG loss that regularizes the distribution of the client's latent embedding with the anchors and 2) KD loss that enables clients to learn from others. Through extensive experiments on diverse client data distributions, we showcase the effectiveness of DeSA in enhancing both inter- and intra-domain accuracy of each client. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: Paper Accepted at ICML 2024, 23 pages

arXiv:2405.06650 [pdf, other]

Large Language Models as Planning Domain Generators

Authors: James Oswald, Kavitha Srinivas, Harsha Kokel, Junkyu Lee, Michael Katz, Shirin Sohrabi

Abstract: Develo** domain models is one of the few remaining places that require manual human labor in AI planning. Thus, in order to make planning more accessible, it is desirable to automate the process of domain model generation. To this end, we investigate if large language models (LLMs) can be used to generate planning domain models from simple textual descriptions. Specifically, we introduce a frame… ▽ More Develo** domain models is one of the few remaining places that require manual human labor in AI planning. Thus, in order to make planning more accessible, it is desirable to automate the process of domain model generation. To this end, we investigate if large language models (LLMs) can be used to generate planning domain models from simple textual descriptions. Specifically, we introduce a framework for automated evaluation of LLM-generated domains by comparing the sets of plans for domain instances. Finally, we perform an empirical analysis of 7 large language models, including coding and chat models across 9 different planning domains, and under three classes of natural language domain descriptions. Our results indicate that LLMs, particularly those with high parameter counts, exhibit a moderate level of proficiency in generating correct planning domains from natural language descriptions. Our code is available at https://github.com/IBM/NL2PDDL. △ Less

Submitted 2 April, 2024; originally announced May 2024.

Comments: Published at ICAPS 2024

arXiv:2404.11833 [pdf, ps, other]

Thought of Search: Planning with Language Models Through The Lens of Efficiency

Authors: Michael Katz, Harsha Kokel, Kavitha Srinivas, Shirin Sohrabi

Abstract: Among the most important properties of algorithms investigated in computer science are soundness, completeness, and complexity. These properties, however, are rarely analyzed for the vast collection of recently proposed methods for planning with large language models. In this work, we alleviate this gap. We analyse these properties of using LLMs for planning and highlight that recent trends abando… ▽ More Among the most important properties of algorithms investigated in computer science are soundness, completeness, and complexity. These properties, however, are rarely analyzed for the vast collection of recently proposed methods for planning with large language models. In this work, we alleviate this gap. We analyse these properties of using LLMs for planning and highlight that recent trends abandon both soundness and completeness for the sake of inefficiency. We propose a significantly more efficient approach that can, at the same time, maintain both soundness and completeness. We exemplify on four representative search problems, comparing to the LLM-based solutions from the literature that attempt to solve these problems. We show that by using LLMs to produce the code for the search components we can solve the entire datasets with 100\% accuracy with only a few calls to the LLM. We argue for a responsible use of compute resources; urging research community to investigate sound and complete LLM-based approaches that uphold efficiency. △ Less

Submitted 21 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.05151 [pdf, other]

STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs

Authors: Kush Hari, Hansoul Kim, Will Panitch, Kishore Srinivas, Vincent Schorp, Karthik Dharmarajan, Shreya Ganti, Tara Sadjadpour, Ken Goldberg

Abstract: We present STITCH: an augmented dexterity pipeline that performs Suture Throws Including Thread Coordination and Handoffs. STITCH iteratively performs needle insertion, thread swee**, needle extraction, suture cinching, needle handover, and needle pose correction with failure recovery policies. We introduce a novel visual 6D needle pose estimation framework using a stereo camera pair and new sut… ▽ More We present STITCH: an augmented dexterity pipeline that performs Suture Throws Including Thread Coordination and Handoffs. STITCH iteratively performs needle insertion, thread swee**, needle extraction, suture cinching, needle handover, and needle pose correction with failure recovery policies. We introduce a novel visual 6D needle pose estimation framework using a stereo camera pair and new suturing motion primitives. We compare STITCH to baselines, including a proprioception-only and a policy without visual servoing. In physical experiments across 15 trials, STITCH achieves an average of 2.93 sutures without human intervention and 4.47 sutures with human intervention. See https://sites.google.com/berkeley.edu/stitch for code and supplemental materials. △ Less

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2403.08900 [pdf, other]

Handoffs in User-Centric Cell-Free MIMO Networks: A POMDP Framework

Authors: Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, Kothapalli Venkata Srinivas

Abstract: We study the problem of managing handoffs (HOs) in user-centric cell-free massive MIMO (UC-mMIMO) networks. Motivated by the importance of controlling the number of HOs and by the correlation between efficient HO decisions and the temporal evolution of the channel conditions, we formulate a partially observable Markov decision process (POMDP) with the state space representing the discrete versions… ▽ More We study the problem of managing handoffs (HOs) in user-centric cell-free massive MIMO (UC-mMIMO) networks. Motivated by the importance of controlling the number of HOs and by the correlation between efficient HO decisions and the temporal evolution of the channel conditions, we formulate a partially observable Markov decision process (POMDP) with the state space representing the discrete versions of the large-scale fading and the action space representing the association decisions of the user with the access points (APs). We develop a novel algorithm that employs this model to derive a HO policy for a mobile user based on current and future rewards. To alleviate the high complexity of our POMDP, we follow a divide-and-conquer approach by breaking down the POMDP formulation into sub-problems, each solved separately. Then, the policy and the candidate pool of APs for the sub-problem that produced the best total expected reward are used to perform HOs within a specific time horizon. We then introduce modifications to our algorithm to decrease the number of HOs. The results show that half of the number of HOs in the UC-mMIMO networks can be eliminated. Namely, our novel solution can control the number of HOs while maintaining a rate guarantee, where a 47%-70% reduction of the cumulative number of HOs is observed in networks with a density of 125 APs per km2. Most importantly, our results show that a POMDP-based HO scheme is promising to control HOs. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: Accepted in IEEE Transactions on Wireless Communications (TWC)

arXiv:2311.02044 [pdf, other]

Occlusion-Aware 2D and 3D Centerline Detection for Urban Driving via Automatic Label Generation

Authors: David Paz, Narayanan E. Ranganatha, Srinidhi K. Srinivas, Yunchao Yao, Henrik I. Christensen

Abstract: This research work seeks to explore and identify strategies that can determine road topology information in 2D and 3D under highly dynamic urban driving scenarios. To facilitate this exploration, we introduce a substantial dataset comprising nearly one million automatically labeled data frames. A key contribution of our research lies in develo** an automatic label-generation process and an occlu… ▽ More This research work seeks to explore and identify strategies that can determine road topology information in 2D and 3D under highly dynamic urban driving scenarios. To facilitate this exploration, we introduce a substantial dataset comprising nearly one million automatically labeled data frames. A key contribution of our research lies in develo** an automatic label-generation process and an occlusion handling strategy. This strategy is designed to model a wide range of occlusion scenarios, from mild disruptions to severe blockages. Furthermore, we present a comprehensive ablation study wherein multiple centerline detection methods are developed and evaluated. This analysis not only benchmarks the performance of various approaches but also provides valuable insights into the interpretability of these methods. Finally, we demonstrate the practicality of our methods and assess their adaptability across different sensor configurations, highlighting their versatility and relevance in real-world scenarios. Our dataset and experimental models are publicly available. △ Less

Submitted 3 November, 2023; originally announced November 2023.

Comments: 7 pages, 8 figures, 1 algorithm, 11 equations

arXiv:2308.15027 [pdf, ps, other]

Improving Neural Ranking Models with Traditional IR Methods

Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

Abstract: Neural ranking methods based on large transformer models have recently gained significant attention in the information retrieval community, and have been adopted by major commercial solutions. Nevertheless, they are computationally expensive to create, and require a great deal of labeled data for specialized corpora. In this paper, we explore a low resource alternative which is a bag-of-embedding… ▽ More Neural ranking methods based on large transformer models have recently gained significant attention in the information retrieval community, and have been adopted by major commercial solutions. Nevertheless, they are computationally expensive to create, and require a great deal of labeled data for specialized corpora. In this paper, we explore a low resource alternative which is a bag-of-embedding model for document retrieval and find that it is competitive with large transformer models fine tuned on information retrieval tasks. Our results show that a simple combination of TF-IDF, a traditional keyword matching method, with a shallow embedding model provides a low cost path to compete well with the performance of complex neural ranking models on 3 datasets. Furthermore, adding TF-IDF measures improves the performance of large-scale fine tuned models on these tasks. △ Less

Submitted 29 August, 2023; originally announced August 2023.

Comments: Short paper, 4 pages

arXiv:2308.03891 [pdf, other]

A Cross-Domain Evaluation of Approaches for Causal Knowledge Extraction

Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

Abstract: Causal knowledge extraction is the task of extracting relevant causes and effects from text by detecting the causal relation. Although this task is important for language understanding and knowledge discovery, recent works in this domain have largely focused on binary classification of a text segment as causal or non-causal. In this regard, we perform a thorough analysis of three sequence tagging… ▽ More Causal knowledge extraction is the task of extracting relevant causes and effects from text by detecting the causal relation. Although this task is important for language understanding and knowledge discovery, recent works in this domain have largely focused on binary classification of a text segment as causal or non-causal. In this regard, we perform a thorough analysis of three sequence tagging models for causal knowledge extraction and compare it with a span based approach to causality extraction. Our experiments show that embeddings from pre-trained language models (e.g. BERT) provide a significant performance boost on this task compared to previous state-of-the-art models with complex architectures. We observe that span based models perform better than simple sequence tagging models based on BERT across all 4 data sets from diverse domains with different types of cause-effect phrases. △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2307.04217 [pdf, other]

LakeBench: Benchmarks for Data Discovery over Data Lakes

Authors: Kavitha Srinivas, Julian Dolby, Ibrahim Abdelaziz, Oktie Hassanzadeh, Harsha Kokel, Aamod Khatiwada, Tejaswini Pedapati, Subhajit Chaudhury, Horst Samulowitz

Abstract: Within enterprises, there is a growing need to intelligently navigate data lakes, specifically focusing on data discovery. Of particular importance to enterprises is the ability to find related tables in data repositories. These tables can be unionable, joinable, or subsets of each other. There is a dearth of benchmarks for these tasks in the public domain, with related work targeting private data… ▽ More Within enterprises, there is a growing need to intelligently navigate data lakes, specifically focusing on data discovery. Of particular importance to enterprises is the ability to find related tables in data repositories. These tables can be unionable, joinable, or subsets of each other. There is a dearth of benchmarks for these tasks in the public domain, with related work targeting private datasets. In LakeBench, we develop multiple benchmarks for these tasks by using the tables that are drawn from a diverse set of data sources such as government data from CKAN, Socrata, and the European Central Bank. We compare the performance of 4 publicly available tabular foundational models on these tasks. None of the existing models had been trained on the data discovery tasks that we developed for this benchmark; not surprisingly, their performance shows significant room for improvement. The results suggest that the establishment of such benchmarks may be useful to the community to build tabular models usable for data discovery in data lakes. △ Less

Submitted 9 July, 2023; originally announced July 2023.

arXiv:2307.03882 [pdf, other]

The Busboy Problem: Efficient Tableware Decluttering Using Consolidation and Multi-Object Grasps

Authors: Kishore Srinivas, Shreya Ganti, Rishi Parikh, Ayah Ahmad, Wisdom Agboh, Mehmet Dogar, Ken Goldberg

Abstract: We present the "Busboy Problem": automating an efficient decluttering of cups, bowls, and silverware from a planar surface. As gras** and transporting individual items is highly inefficient, we propose policies to generate grasps for multiple items. We introduce the metric of Objects per Trip (OpT) carried by the robot to the collection bin to analyze the improvement seen as a result of our poli… ▽ More We present the "Busboy Problem": automating an efficient decluttering of cups, bowls, and silverware from a planar surface. As gras** and transporting individual items is highly inefficient, we propose policies to generate grasps for multiple items. We introduce the metric of Objects per Trip (OpT) carried by the robot to the collection bin to analyze the improvement seen as a result of our policies. In physical experiments with singulated items, we find that consolidation and multi-object grasps resulted in an 1.8x improvement in OpT, compared to methods without multi-object grasps. See https://sites.google.com/berkeley.edu/busboyproblem for code and supplemental materials. △ Less

Submitted 7 July, 2023; originally announced July 2023.

arXiv:2305.11014 [pdf, other]

Generalized Planning in PDDL Domains with Pretrained Large Language Models

Authors: Tom Silver, Soham Dan, Kavitha Srinivas, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Michael Katz

Abstract: Recent work has considered whether large language models (LLMs) can function as planners: given a task, generate a plan. We investigate whether LLMs can serve as generalized planners: given a domain and training tasks, generate a program that efficiently produces plans for other tasks in the domain. In particular, we consider PDDL domains and use GPT-4 to synthesize Python programs. We also consid… ▽ More Recent work has considered whether large language models (LLMs) can function as planners: given a task, generate a plan. We investigate whether LLMs can serve as generalized planners: given a domain and training tasks, generate a program that efficiently produces plans for other tasks in the domain. In particular, we consider PDDL domains and use GPT-4 to synthesize Python programs. We also consider (1) Chain-of-Thought (CoT) summarization, where the LLM is prompted to summarize the domain and propose a strategy in words before synthesizing the program; and (2) automated debugging, where the program is validated with respect to the training tasks, and in case of errors, the LLM is re-prompted with four types of feedback. We evaluate this approach in seven PDDL domains and compare it to four ablations and four baselines. Overall, we find that GPT-4 is a surprisingly powerful generalized planner. We also conclude that automated debugging is very important, that CoT summarization has non-uniform impact, that GPT-4 is far superior to GPT-3.5, and that just two training tasks are often sufficient for strong generalization. △ Less

Submitted 18 December, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

Comments: AAAI 2024

arXiv:2303.13906 [pdf, ps, other]

Some congruences for $(\ell, k)$ and $(\ell, k, r)$-regular partitions

Authors: T Kathiravan, K Srinivas, Usha K Sangale

Abstract: Let $b_{\ell, k}(n), b_{\ell, k, r}(n)$ count the number of $(\ell, k)$, $(\ell, k, r)$-regular partitions respectively. In this paper we shall derive infinite families of congruences for $b_{\ell, k}(n)$ modulo $2$ when $ (\ell, k) = (3,8), (4, 7)$, for $b_{\ell, k}(n)$ modulo $8$, modulo $9$ and modulo $12$ when $(\ell, k) = (4, 9)$ and $b_{\ell, k, r}(n)$ modulo $2$ when… ▽ More Let $b_{\ell, k}(n), b_{\ell, k, r}(n)$ count the number of $(\ell, k)$, $(\ell, k, r)$-regular partitions respectively. In this paper we shall derive infinite families of congruences for $b_{\ell, k}(n)$ modulo $2$ when $ (\ell, k) = (3,8), (4, 7)$, for $b_{\ell, k}(n)$ modulo $8$, modulo $9$ and modulo $12$ when $(\ell, k) = (4, 9)$ and $b_{\ell, k, r}(n)$ modulo $2$ when $(\ell, k, r) = (3, 5, 8)$. △ Less

Submitted 24 March, 2023; originally announced March 2023.

MSC Class: 11P83; 05A17

arXiv:2303.01378 [pdf, other]

A Vision for Semantically Enriched Data Science

Authors: Udayan Khurana, Kavitha Srinivas, Sainyam Galhotra, Horst Samulowitz

Abstract: The recent efforts in automation of machine learning or data science has achieved success in various tasks such as hyper-parameter optimization or model selection. However, key areas such as utilizing domain knowledge and data semantics are areas where we have seen little automation. Data Scientists have long leveraged common sense reasoning and domain knowledge to understand and enrich data for b… ▽ More The recent efforts in automation of machine learning or data science has achieved success in various tasks such as hyper-parameter optimization or model selection. However, key areas such as utilizing domain knowledge and data semantics are areas where we have seen little automation. Data Scientists have long leveraged common sense reasoning and domain knowledge to understand and enrich data for building predictive models. In this paper we discuss important shortcomings of current data science and machine learning solutions. We then envision how leveraging "semantic" understanding and reasoning on data in combination with novel tools for data science automation can help with consistent and explainable data augmentation and transformation. Additionally, we discuss how semantics can assist data scientists in a new manner by hel** with challenges related to trust, bias, and explainability in machine learning. Semantic annotation can also help better explore and organize large data sources. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2205.08018

arXiv:2302.02259 [pdf, other]

CLiNet: Joint Detection of Road Network Centerlines in 2D and 3D

Authors: David Paz, Srinidhi Kalgundi Srinivas, Yunchao Yao, Henrik I. Christensen

Abstract: This work introduces a new approach for joint detection of centerlines based on image data by localizing the features jointly in 2D and 3D. In contrast to existing work that focuses on detection of visual cues, we explore feature extraction methods that are directly amenable to the urban driving task. To develop and evaluate our approach, a large urban driving dataset dubbed AV Breadcrumbs is auto… ▽ More This work introduces a new approach for joint detection of centerlines based on image data by localizing the features jointly in 2D and 3D. In contrast to existing work that focuses on detection of visual cues, we explore feature extraction methods that are directly amenable to the urban driving task. To develop and evaluate our approach, a large urban driving dataset dubbed AV Breadcrumbs is automatically labeled by leveraging vector map representations and projective geometry to annotate over 900,000 images. Our results demonstrate potential for dynamic scene modeling across various urban driving scenarios. Our model achieves an F1 score of 0.684 and an average normalized depth error of 2.083. The code and data annotations are publicly available. △ Less

Submitted 4 February, 2023; originally announced February 2023.

Comments: 5 pages, 4 figures, 1 table. Under review at IEEE Intelligent Vehicles Symposium 2023

arXiv:2301.05108 [pdf, other]

Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning

Authors: Wenting Zhao, Ibrahim Abdelaziz, Julian Dolby, Kavitha Srinivas, Mossad Helali, Essam Mansour

Abstract: Dynamically typed languages such as Python have become very popular. Among other strengths, Python's dynamic nature and its straightforward linking to native code have made it the de-facto language for many research areas such as Artificial Intelligence. This flexibility, however, makes static analysis very hard. While creating a sound, or a soundy, analysis for Python remains an open problem, we… ▽ More Dynamically typed languages such as Python have become very popular. Among other strengths, Python's dynamic nature and its straightforward linking to native code have made it the de-facto language for many research areas such as Artificial Intelligence. This flexibility, however, makes static analysis very hard. While creating a sound, or a soundy, analysis for Python remains an open problem, we present in this work Serenity, a framework for static analysis of Python that turns out to be sufficient for some tasks. The Serenity framework exploits two basic mechanisms: (a) reliance on dynamic dispatch at the core of language translation, and (b) extreme abstraction of libraries, to generate an abstraction of the code. We demonstrate the efficiency and usefulness of Serenity's analysis in two applications: code completion and automated machine learning. In these two applications, we demonstrate that such analysis has a strong signal, and can be leveraged to establish state-of-the-art performance, comparable to neural models and dynamic analysis respectively. △ Less

Submitted 4 January, 2023; originally announced January 2023.

arXiv:2211.02293 [pdf, other]

Automating Vascular Shunt Insertion with the dVRK Surgical Robot

Authors: Karthik Dharmarajan, Will Panitch, Muyan Jiang, Kishore Srinivas, Baiyu Shi, Yahav Avigal, Huang Huang, Thomas Low, Danyal Fer, Ken Goldberg

Abstract: Vascular shunt insertion is a fundamental surgical procedure used to temporarily restore blood flow to tissues. It is often performed in the field after major trauma. We formulate a problem of automated vascular shunt insertion and propose a pipeline to perform Automated Vascular Shunt Insertion (AVSI) using a da Vinci Research Kit. The pipeline uses a learned visual model to estimate the locus of… ▽ More Vascular shunt insertion is a fundamental surgical procedure used to temporarily restore blood flow to tissues. It is often performed in the field after major trauma. We formulate a problem of automated vascular shunt insertion and propose a pipeline to perform Automated Vascular Shunt Insertion (AVSI) using a da Vinci Research Kit. The pipeline uses a learned visual model to estimate the locus of the vessel rim, plans a grasp on the rim, and moves to grasp at that point. The first robot gripper then pulls the rim to stretch open the vessel with a dilation motion. The second robot gripper then proceeds to insert a shunt into the vessel phantom (a model of the blood vessel) with a chamfer tilt followed by a screw motion. Results suggest that AVSI achieves a high success rate even with tight tolerances and varying vessel orientations up to 30°. Supplementary material, dataset, videos, and visualizations can be found at https://sites.google.com/berkeley.edu/autolab-avsi. △ Less

Submitted 8 March, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

Comments: Published in: IEEE International Conference on Robotics and Automation (ICRA) 2023

arXiv:2210.07420 [pdf, other]

Learning to Efficiently Plan Robust Frictional Multi-Object Grasps

Authors: Wisdom C. Agboh, Satvik Sharma, Kishore Srinivas, Mallika Parulekar, Gaurav Datta, Tianshuang Qiu, Jeffrey Ichnowski, Eugen Solowjow, Mehmet Dogar, Ken Goldberg

Abstract: We consider a decluttering problem where multiple rigid convex polygonal objects rest in randomly placed positions and orientations on a planar surface and must be efficiently transported to a packing box using both single and multi-object grasps. Prior work considered frictionless multi-object gras**. In this paper, we introduce friction to increase the number of potential grasps for a given gr… ▽ More We consider a decluttering problem where multiple rigid convex polygonal objects rest in randomly placed positions and orientations on a planar surface and must be efficiently transported to a packing box using both single and multi-object grasps. Prior work considered frictionless multi-object gras**. In this paper, we introduce friction to increase the number of potential grasps for a given group of objects, and thus increase picks per hour. We train a neural network using real examples to plan robust multi-object grasps. In physical experiments, we find a 13.7% increase in success rate, a 1.6x increase in picks per hour, and a 6.3x decrease in grasp planning time compared to prior work on multi-object gras**. Compared to single-object gras**, we find a 3.1x increase in picks per hour. △ Less

Submitted 2 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

Comments: IEEE IROS 2023

arXiv:2209.08197 [pdf, other]

Thompson Sampling with Virtual Hel** Agents

Authors: Kartik Anand Pant, Amod Hegde, K. V. Srinivas

Abstract: We address the problem of online sequential decision making, i.e., balancing the trade-off between exploiting the current knowledge to maximize immediate performance and exploring the new information to gain long-term benefits using the multi-armed bandit framework. Thompson sampling is one of the heuristics for choosing actions that address this exploration-exploitation dilemma. We first propose… ▽ More We address the problem of online sequential decision making, i.e., balancing the trade-off between exploiting the current knowledge to maximize immediate performance and exploring the new information to gain long-term benefits using the multi-armed bandit framework. Thompson sampling is one of the heuristics for choosing actions that address this exploration-exploitation dilemma. We first propose a general framework that helps heuristically tune the exploration versus exploitation trade-off in Thompson sampling using multiple samples from the posterior distribution. Utilizing this framework, we propose two algorithms for the multi-armed bandit problem and provide theoretical bounds on the cumulative regret. Next, we demonstrate the empirical improvement in the cumulative regret performance of the proposed algorithm over Thompson Sampling. We also show the effectiveness of the proposed algorithm on real-world datasets. Contrary to the existing methods, our framework provides a mechanism to vary the amount of exploration/ exploitation based on the task at hand. Towards this end, we extend our framework for two additional problems, i.e., best arm identification and time-sensitive learning in bandits and compare our algorithm with existing methods. △ Less

Submitted 16 September, 2022; originally announced September 2022.

Comments: 14 pages, 8 figures

arXiv:2209.06273 [pdf, other]

Exploring Code Style Transfer with Neural Networks

Authors: Karl Munson, Anish Savla, Chih-Kai Ting, Serenity Wade, Kiran Kate, Kavitha Srinivas

Abstract: Style is a significant component of natural language text, reflecting a change in the tone of text while kee** the underlying information the same. Even though programming languages have strict syntax rules, they also have style. Code can be written with the same functionality but using different language features. However, programming style is difficult to quantify, and thus as part of this wor… ▽ More Style is a significant component of natural language text, reflecting a change in the tone of text while kee** the underlying information the same. Even though programming languages have strict syntax rules, they also have style. Code can be written with the same functionality but using different language features. However, programming style is difficult to quantify, and thus as part of this work, we define style attributes, specifically for Python. To build a definition of style, we utilized hierarchical clustering to capture a style definition without needing to specify transformations. In addition to defining style, we explore the capability of a pre-trained code language model to capture information about code style. To do this, we fine-tuned pre-trained code-language models and evaluated their performance in code style transfer tasks. △ Less

Submitted 13 September, 2022; originally announced September 2022.

arXiv:2208.02384 [pdf, other]

POMDP-based Handoffs for User-Centric Cell-Free MIMO Networks

Authors: Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, Kothapalli Venkata Srinivas

Abstract: We propose to control handoffs (HOs) in user-centric cell-free massive MIMO networks through a partially observable Markov decision process (POMDP) with the state space representing the discrete versions of the large-scale fading (LSF) and the action space representing the association decisions of the user with the access points. Our proposed formulation accounts for the temporal evolution and the… ▽ More We propose to control handoffs (HOs) in user-centric cell-free massive MIMO networks through a partially observable Markov decision process (POMDP) with the state space representing the discrete versions of the large-scale fading (LSF) and the action space representing the association decisions of the user with the access points. Our proposed formulation accounts for the temporal evolution and the partial observability of the channel states. This allows us to consider future rewards when performing HO decisions, and hence obtain a robust HO policy. To alleviate the high complexity of solving our POMDP, we follow a divide-and-conquer approach by breaking down the POMDP formulation into sub-problems, each solved individually. Then, the policy and the candidate cluster of access points for the best solved sub-problem is used to perform HOs within a specific time horizon. We control the number of HOs by determining when to use the HO policy. Our simulation results show that our proposed solution reduces HOs by 47% compared to time-triggered LSF-based HOs and by 70% compared to data rate threshold-triggered LSF-based HOs. This amount can be further reduced through increasing the time horizon of the POMDP. △ Less

Submitted 7 August, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

Comments: IEEE Global Communications Conference 2022

arXiv:2205.08018 [pdf, other]

A Survey on Semantics in Automated Data Science

Authors: Udayan Khurana, Kavitha Srinivas, Horst Samulowitz

Abstract: Data Scientists leverage common sense reasoning and domain knowledge to understand and enrich data for building predictive models. In recent years, we have witnessed a surge in tools and techniques for {\em automated machine learning}. While data scientists can employ various such tools to help with model building, many other aspects such as {\em feature engineering} that require semantic understa… ▽ More Data Scientists leverage common sense reasoning and domain knowledge to understand and enrich data for building predictive models. In recent years, we have witnessed a surge in tools and techniques for {\em automated machine learning}. While data scientists can employ various such tools to help with model building, many other aspects such as {\em feature engineering} that require semantic understanding of concepts, remain manual to a large extent. In this paper we discuss important shortcomings of current automated data science solutions and machine learning. We discuss how leveraging basic semantic reasoning on data in combination with novel tools for data science automation can help with consistent and explainable data augmentation and transformation. Moreover, semantics can assist data scientists in a new manner by hel** with challenges related to {\em trust}, {\em bias}, and {\em explainability}. △ Less

Submitted 16 May, 2022; originally announced May 2022.

arXiv:2202.08467 [pdf, other]

Absolute and "upstream" convective instabilities in plane Couette-Poiseuille flow

Authors: Kirthy K. Srinivas, Sourabh S. Diwan

Abstract: Here we report some interesting new features of the spatio-temporal instability of the incompressible plane Couette-Poiseuille flow (CPF). First of all, this flow represents the first instance of a "non-inflectional" absolute instability, within constant-viscosity formulation, which is triggered when one of the plates moves opposite to the bulk motion. More strikingly, with further increase in the… ▽ More Here we report some interesting new features of the spatio-temporal instability of the incompressible plane Couette-Poiseuille flow (CPF). First of all, this flow represents the first instance of a "non-inflectional" absolute instability, within constant-viscosity formulation, which is triggered when one of the plates moves opposite to the bulk motion. More strikingly, with further increase in the negative plate motion, the absolute instability ($\textrm{AI}$) transitions to an "upstream" convective instability ($\textrm{CI}^-$), wherein an unstable wave packet moves opposite to the direction of the bulk flow. Thus, the CPF exhibits a unique $\textrm{CI}^+ \to \textrm{AI} \to \textrm{CI}^-$ transition, for a given Reynolds number ($Re$), where $\textrm{CI}^+$ denotes the commonly-observed case of a "downstream" convective instability. This type of transition has not been reported for other known examples of absolutely unstable flows. We compute the leading and trailing edge velocities for an amplifying wave packet and find that, for the plane Poiseuille flow, both these velocities approach zero as $Re \to \infty$. As a result, at high $Re$, even the slightest of negative plate motions is sufficient to trigger $\textrm{AI}$ and subsequently $\textrm{CI}^-$, as observed for the CPF. The wave-packet dispersion first increases with $Re$, followed by a decrease, which points to a peculiar "dual" role of viscosity in sustaining $\textrm{AI}$ in the CPF, namely, viscosity promotes sustenance of $\textrm{AI}$ at moderate Reynolds numbers but suppresses it at low and high Reynolds numbers. These results can be well understood within the Ginzburg-Landau framework, and therefore can be expected to have a wider applicability. △ Less

Submitted 17 February, 2022; originally announced February 2022.

arXiv:2201.12242 [pdf, other]

Large Scale Generation of Labeled Type Data for Python

Authors: Ibrahim Abdelaziz, Julian Dolby, Kavitha Srinivas

Abstract: Recently, dynamically typed languages, such as Python, have gained unprecedented popularity. Although these languages alleviate the need for mandatory type annotations, types still play a critical role in program understanding and preventing runtime errors. An attractive option is to infer types automatically to get static guarantees without writing types. Existing inference techniques rely mostly… ▽ More Recently, dynamically typed languages, such as Python, have gained unprecedented popularity. Although these languages alleviate the need for mandatory type annotations, types still play a critical role in program understanding and preventing runtime errors. An attractive option is to infer types automatically to get static guarantees without writing types. Existing inference techniques rely mostly on static ty** tools such as PyType for direct type inference; more recently, neural type inference has been proposed. However, neural type inference is data hungry, and depends on collecting labeled data based on static ty**. Such tools, however, are poor at inferring user defined types. Furthermore, type annotation by developers in these languages is quite sparse. In this work, we propose novel techniques for generating high quality types using 1) information retrieval techniques that work on well documented libraries to extract types and 2) usage patterns by analyzing a large repository of programs. Our results show that these techniques are more precise and address the weaknesses of static tools, and can be useful for generating a large labeled dataset for type inference by machine learning methods. F1 scores are 0.52-0.58 for our techniques, compared to static ty** tools which are at 0.06, and we use them to generate over 37,000 types for over 700 modules. △ Less

Submitted 6 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

arXiv:2111.13186 [pdf, other]

Federated Data Science to Break Down Silos [Vision]

Authors: Essam Mansour, Kavitha Srinivas, Katja Hose

Abstract: Similar to Open Data initiatives, data science as a community has launched initiatives for sharing not only data but entire pipelines, derivatives, artifacts, etc. (Open Data Science). However, the few efforts that exist focus on the technical part on how to facilitate sharing, conversion, etc. This vision paper goes a step further and proposes KEK, an open federated data science platform that doe… ▽ More Similar to Open Data initiatives, data science as a community has launched initiatives for sharing not only data but entire pipelines, derivatives, artifacts, etc. (Open Data Science). However, the few efforts that exist focus on the technical part on how to facilitate sharing, conversion, etc. This vision paper goes a step further and proposes KEK, an open federated data science platform that does not only allow for sharing data science pipelines and their (meta)data but also provides methods for efficient search and, in the ideal case, even allows for combining and defining pipelines across platforms in a federated manner. In doing so, KEK addresses the so far neglected challenge of actually finding artifacts that are semantically related and that can be combined to achieve a certain goal. △ Less

Submitted 25 November, 2021; originally announced November 2021.

Comments: Accepted at SIGMOD Record

arXiv:2111.00083 [pdf, other]

A Scalable AutoML Approach Based on Graph Neural Networks

Authors: Mossad Helali, Essam Mansour, Ibrahim Abdelaziz, Julian Dolby, Kavitha Srinivas

Abstract: AutoML systems build machine learning models automatically by performing a search over valid data transformations and learners, along with hyper-parameter optimization for each learner. Many AutoML systems use meta-learning to guide search for optimal pipelines. In this work, we present a novel meta-learning system called KGpip which, (1) builds a database of datasets and corresponding pipelines b… ▽ More AutoML systems build machine learning models automatically by performing a search over valid data transformations and learners, along with hyper-parameter optimization for each learner. Many AutoML systems use meta-learning to guide search for optimal pipelines. In this work, we present a novel meta-learning system called KGpip which, (1) builds a database of datasets and corresponding pipelines by mining thousands of scripts with program analysis, (2) uses dataset embeddings to find similar datasets in the database based on its content instead of metadata-based features, (3) models AutoML pipeline creation as a graph generation problem, to succinctly characterize the diverse pipelines seen for a single dataset. KGpip's meta-learning is a sub-component for AutoML systems. We demonstrate this by integrating KGpip with two AutoML systems. Our comprehensive evaluation using 126 datasets, including those used by the state-of-the-art systems, shows that KGpip significantly outperforms these systems. △ Less

Submitted 14 July, 2022; v1 submitted 29 October, 2021; originally announced November 2021.

Comments: 14 pages, 9 figures. Accepted in VLDB22

arXiv:2110.08154 [pdf, ps, other]

Distributed Resource Allocation Optimization for User-Centric Cell-Free MIMO Networks

Authors: Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, Kothapalli Venkata Srinivas

Abstract: We develop two distributed downlink resource allocation algorithms for user-centric, cell-free, spatially-distributed, multiple-input multiple-output (MIMO) networks. In such networks, each user is served by a subset of nearby transmitters that we call distributed units or DUs. The operation of the DUs in a region is controlled by a central unit (CU). Our first scheme is implemented at the DUs, wh… ▽ More We develop two distributed downlink resource allocation algorithms for user-centric, cell-free, spatially-distributed, multiple-input multiple-output (MIMO) networks. In such networks, each user is served by a subset of nearby transmitters that we call distributed units or DUs. The operation of the DUs in a region is controlled by a central unit (CU). Our first scheme is implemented at the DUs, while the second is implemented at the CUs controlling these DUs. We define a hybrid quality of service metric that enables distributed optimization of system resources in a proportional fair manner. Specifically, each of our algorithms performs user scheduling, beamforming, and power control while accounting for channel estimation errors. Importantly, our algorithm does not require information exchange amongst DUs (CUs) for the DU-distributed (CU-distributed) system, while also smoothly converging. Our results show that our CU-distributed system provides 1.3- to 1.8-fold network throughput compared to the DU-distributed system, with minor increases in complexity and front-haul load - and substantial gains over benchmark schemes like local zero-forcing. We also analyze the trade-offs provided by the CU-distributed system, hence highlighting the significance of deploying multiple CUs in user-centric cell-free networks. △ Less

Submitted 15 October, 2021; originally announced October 2021.

Comments: To appear in IEEE Transactions on Wireless Communications

arXiv:2109.07452 [pdf, other]

Can Machines Read Coding Manuals Yet? -- A Benchmark for Building Better Language Models for Code Understanding

Authors: Ibrahim Abdelaziz, Julian Dolby, Jamie McCusker, Kavitha Srinivas

Abstract: Code understanding is an increasingly important application of Artificial Intelligence. A fundamental aspect of understanding code is understanding text about code, e.g., documentation and forum discussions. Pre-trained language models (e.g., BERT) are a popular approach for various NLP tasks, and there are now a variety of benchmarks, such as GLUE, to help improve the development of such models f… ▽ More Code understanding is an increasingly important application of Artificial Intelligence. A fundamental aspect of understanding code is understanding text about code, e.g., documentation and forum discussions. Pre-trained language models (e.g., BERT) are a popular approach for various NLP tasks, and there are now a variety of benchmarks, such as GLUE, to help improve the development of such models for natural language understanding. However, little is known about how well such models work on textual artifacts about code, and we are unaware of any systematic set of downstream tasks for such an evaluation. In this paper, we derive a set of benchmarks (BLANCA - Benchmarks for LANguage models on Coding Artifacts) that assess code understanding based on tasks such as predicting the best answer to a question in a forum post, finding related forum posts, or predicting classes related in a hierarchy from class documentation. We evaluate the performance of current state-of-the-art language models on these tasks and show that there is a significant improvement on each task from fine tuning. We also show that multi-task training over BLANCA tasks helps build better language models for code understanding. △ Less

Submitted 15 September, 2021; originally announced September 2021.

arXiv:2109.05978 [pdf, ps, other]

RWP+: A New Random Waypoint Model for High-Speed Mobility

Authors: Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, Kothapalli Venkata Srinivas

Abstract: In this letter, we emulate real-world statistics for mobility patterns on road systems. We then propose modifications to the assumptions of the random waypoint (RWP) model to better represent high-mobility profiles. We call the model under our new framework as RWP+. Specifically, we show that the lengths of the transitions which constitute a trip, are best represented by a lognormal distribution,… ▽ More In this letter, we emulate real-world statistics for mobility patterns on road systems. We then propose modifications to the assumptions of the random waypoint (RWP) model to better represent high-mobility profiles. We call the model under our new framework as RWP+. Specifically, we show that the lengths of the transitions which constitute a trip, are best represented by a lognormal distribution, and that the velocities are best described by a linear combination of normal distributions with different mean values. Compared to the assumptions used in the literature for mobile cellular networks, our modeling provides mobility metrics, such as handoff rates, that better characterize actual emulated trips from the collected statistics. △ Less

Submitted 13 September, 2021; originally announced September 2021.

Comments: To appear in IEEE Communications Letters

arXiv:2108.08049 [pdf, ps, other]

Euclidean algorithm in Galois Quartic Fields

Authors: K Srinivas, M Subramani, Usha K Sangale

Abstract: We prove that all imaginary biquadratic fields and cyclic quartic fields of class number $1$ are Euclidean. We prove that all imaginary biquadratic fields and cyclic quartic fields of class number $1$ are Euclidean. △ Less

Submitted 18 August, 2021; originally announced August 2021.

Comments: 7 pages

MSC Class: 11A05; 11R04

arXiv:2108.06316 [pdf, ps, other]

Downlink Resource Allocation in Multiuser Cell-free MIMO Networks with User-centric Clustering

Authors: Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, Kothapalli Venkata Srinivas

Abstract: In this paper, we optimize user scheduling, power allocation and beamforming in distributed multiple-input multiple-output (MIMO) networks implementing user-centric clustering. We study both the coherent and non-coherent transmission modes, formulating a weighted sum rate maximization problem for each; finding the optimal solution to these problems is known to be NP-hard. We use tools from fractio… ▽ More In this paper, we optimize user scheduling, power allocation and beamforming in distributed multiple-input multiple-output (MIMO) networks implementing user-centric clustering. We study both the coherent and non-coherent transmission modes, formulating a weighted sum rate maximization problem for each; finding the optimal solution to these problems is known to be NP-hard. We use tools from fractional programming, block coordinate descent, and compressive sensing to construct an algorithm that optimizes the beamforming weights and user scheduling and converges in a smooth non-decreasing pattern. Channel state information (CSI) being crucial for optimization, we highlight the importance of employing a low-overhead pilot assignment policy for scheduling problems. In this regard, we use a variant of hierarchical agglomerative clustering, which provides a suboptimal, but feasible, pilot assignment scheme; for our cell-free case, we formulate an area-based pilot reuse factor. Our results show that our scheme provides large gains in the long-term network sum spectral efficiency compared to benchmark schemes such as zero-forcing and conjugate beamforming (with round-robin scheduling) respectively. Furthermore, the results show the superiority of coherent transmission compared to the non-coherent mode under ideal and imperfect CSI for the area-based pilot-reuse factors we consider. △ Less

Submitted 13 August, 2021; originally announced August 2021.

Comments: To appear in IEEE Transactions on Wireless Communications

arXiv:2106.03906 [pdf, other]

Learning to Guide a Saturation-Based Theorem Prover

Authors: Ibrahim Abdelaziz, Maxwell Crouse, Bassem Makni, Vernon Austil, Cristina Cornelio, Shajith Ikbal, Pavan Kapanipathi, Ndivhuwo Makondo, Kavitha Srinivas, Michael Witbrock, Achille Fokoue

Abstract: Traditional automated theorem provers have relied on manually tuned heuristics to guide how they perform proof search. Recently, however, there has been a surge of interest in the design of learning mechanisms that can be integrated into theorem provers to improve their performance automatically. In this work, we introduce TRAIL, a deep learning-based approach to theorem proving that characterizes… ▽ More Traditional automated theorem provers have relied on manually tuned heuristics to guide how they perform proof search. Recently, however, there has been a surge of interest in the design of learning mechanisms that can be integrated into theorem provers to improve their performance automatically. In this work, we introduce TRAIL, a deep learning-based approach to theorem proving that characterizes core elements of saturation-based theorem proving within a neural framework. TRAIL leverages (a) an effective graph neural network for representing logical formulas, (b) a novel neural representation of the state of a saturation-based theorem prover in terms of processed clauses and available actions, and (c) a novel representation of the inference selection process as an attention-based action policy. We show through a systematic analysis that these components allow TRAIL to significantly outperform previous reinforcement learning-based theorem provers on two standard benchmark datasets (up to 36% more theorems proved). In addition, to the best of our knowledge, TRAIL is the first reinforcement learning-based approach to exceed the performance of a state-of-the-art traditional theorem prover on a standard theorem proving benchmark (solving up to 17% more problems). △ Less

Submitted 7 June, 2021; originally announced June 2021.

arXiv:2104.14589 [pdf, other]

User-centric Cell-free Massive MIMO Networks: A Survey of Opportunities, Challenges and Solutions

Authors: Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, Kothapalli Venkata Srinivas

Abstract: Densification of network base stations is indispensable to achieve the stringent Quality of Service (QoS) requirements of future mobile networks. However, with a dense deployment of transmitters, interference management becomes an arduous task. To solve this issue, exploring radically new network architectures with intelligent coordination and cooperation capabilities is crucial. This survey paper… ▽ More Densification of network base stations is indispensable to achieve the stringent Quality of Service (QoS) requirements of future mobile networks. However, with a dense deployment of transmitters, interference management becomes an arduous task. To solve this issue, exploring radically new network architectures with intelligent coordination and cooperation capabilities is crucial. This survey paper investigates the emerging user-centric cell-free massive Multiple-input multiple-output (MIMO) network architecture that sets a foundation for future mobile networks. Such networks use a dense deployment of distributed units (DUs) to serve users; the crucial difference from the traditional cellular paradigm is that a specific serving cluster of DUs is defined for each user. This framework provides macro diversity, power efficiency, interference management, and robust connectivity. Most importantly, the user-centric approach eliminates cell edges, thus contributing to uniform coverage and performance for users across the network area. We present here a guide to the key challenges facing the deployment of this network scheme and contemplate the solutions being proposed for the main bottlenecks facing cell-free communications. Specifically, we survey the literature targeting the fronthaul, then we scan the details of the channel estimation required, resource allocation, delay, and scalability issues. Furthermore, we highlight some technologies that can provide a management platform for this scheme such as distributed software-defined network (SDN). Our article serves as a check point that delineates the current status and indicates future directions for this area in a comprehensive manner. △ Less

Submitted 10 December, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

Comments: To appear in IEEE Communications Surveys & Tutorials

arXiv:2102.07977 [pdf, ps, other]

On the Diophantine equation $cx^2+p^{2m}=4y^n$

Authors: Kalyan Chakraborty, Azizul Hoque, Kotyada Srinivas

Abstract: Let $c$ be a square-free positive integer and $p$ a prime satisfying $p\nmid c$. Let $h(-c)$ denote the class number of the imaginary quadratic field $\mathbb{Q}(\sqrt{-c})$. In this paper, we consider the Diophantine equation $$cx^2+p^{2m}=4y^n,~~x,y\geq 1, m\geq 0, n\geq 3, \gcd(x,y)=1, \gcd(n,2h(-c))=1,$$ and we describe all its integer solutions. Our main tool here is the prominent result of B… ▽ More Let $c$ be a square-free positive integer and $p$ a prime satisfying $p\nmid c$. Let $h(-c)$ denote the class number of the imaginary quadratic field $\mathbb{Q}(\sqrt{-c})$. In this paper, we consider the Diophantine equation $$cx^2+p^{2m}=4y^n,~~x,y\geq 1, m\geq 0, n\geq 3, \gcd(x,y)=1, \gcd(n,2h(-c))=1,$$ and we describe all its integer solutions. Our main tool here is the prominent result of Bilu, Hanrot and Voutier on existence of primitive divisors in Lehmer sequences. △ Less

Submitted 16 February, 2021; originally announced February 2021.

Comments: 12 pages. To appear in `Results in Mathematics'

MSC Class: 11D61; 11D41; 11Y50

arXiv:2102.02968 [pdf, other]

Resource Allocation and Scheduling in Non-coherent User-centric Cell-free MIMO

Authors: Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, Kothapalli Srinivas

Abstract: We study the problem of user-scheduling and resource allocation in distributed multi-user, multiple-input multiple-output (MIMO) networks implementing user-centric clustering and non-coherent transmission. We formulate a weighted sum-rate maximization problem which can provide user proportional fairness. As in this setup, users can be served by many transmitters, user scheduling is particularly di… ▽ More We study the problem of user-scheduling and resource allocation in distributed multi-user, multiple-input multiple-output (MIMO) networks implementing user-centric clustering and non-coherent transmission. We formulate a weighted sum-rate maximization problem which can provide user proportional fairness. As in this setup, users can be served by many transmitters, user scheduling is particularly difficult. To solve this issue, we use block coordinate descent, fractional programming, and compressive sensing to construct an algorithm that performs user-scheduling and beamforming. Our results show that the proposed framework provides an 8- to 10-fold gain in the long-term user spectral efficiency compared to benchmark schemes such as round-robin scheduling. Furthermore, we quantify the performance loss due to imperfect channel state information and pilot training overhead using a defined area-based pilot-reuse factor. △ Less

Submitted 4 February, 2021; originally announced February 2021.

Comments: To appear in the IEEE International Conference on Communications (ICC'21)

arXiv:2008.08012 [pdf, other]

doi 10.1016/j.patcog.2020.107812

Linguistically-aware Attention for Reducing the Semantic-Gap in Vision-Language Tasks

Authors: Gouthaman KV, Athira Nambiar, Kancheti Sai Srinivas, Anurag Mittal

Abstract: Attention models are widely used in Vision-language (V-L) tasks to perform the visual-textual correlation. Humans perform such a correlation with a strong linguistic understanding of the visual world. However, even the best performing attention model in V-L tasks lacks such a high-level linguistic understanding, thus creating a semantic gap between the modalities. In this paper, we propose an atte… ▽ More Attention models are widely used in Vision-language (V-L) tasks to perform the visual-textual correlation. Humans perform such a correlation with a strong linguistic understanding of the visual world. However, even the best performing attention model in V-L tasks lacks such a high-level linguistic understanding, thus creating a semantic gap between the modalities. In this paper, we propose an attention mechanism - Linguistically-aware Attention (LAT) - that leverages object attributes obtained from generic object detectors along with pre-trained language models to reduce this semantic gap. LAT represents visual and textual modalities in a common linguistically-rich space, thus providing linguistic awareness to the attention process. We apply and demonstrate the effectiveness of LAT in three V-L tasks: Counting-VQA, VQA, and Image captioning. In Counting-VQA, we propose a novel counting-specific VQA model to predict an intuitive count and achieve state-of-the-art results on five datasets. In VQA and Captioning, we show the generic nature and effectiveness of LAT by adapting it into various baselines and consistently improving their performance. △ Less

Submitted 18 August, 2020; originally announced August 2020.

Journal ref: Pattern Recognition, 2021

arXiv:2004.02822 [pdf, other]

LaNet: Real-time Lane Identification by Learning Road SurfaceCharacteristics from Accelerometer Data

Authors: Madhumitha Harishankar, Jun Han, Sai Vineeth Kalluru Srinivas, Faisal Alqarni, Shi Su, Shijia Pan, Hae Young Noh, Pei Zhang, Marco Gruteser, Patrick Tague

Abstract: The resolution of GPS measurements, especially in urban areas, is insufficient for identifying a vehicle's lane. In this work, we develop a deep LSTM neural network model LaNet that determines the lane vehicles are on by periodically classifying accelerometer samples collected by vehicles as they drive in real time. Our key finding is that even adjacent patches of road surfaces contain characteris… ▽ More The resolution of GPS measurements, especially in urban areas, is insufficient for identifying a vehicle's lane. In this work, we develop a deep LSTM neural network model LaNet that determines the lane vehicles are on by periodically classifying accelerometer samples collected by vehicles as they drive in real time. Our key finding is that even adjacent patches of road surfaces contain characteristics that are sufficiently unique to differentiate between lanes, i.e., roads inherently exhibit differing bumps, cracks, potholes, and surface unevenness. Cars can capture this road surface information as they drive using inexpensive, easy-to-install accelerometers that increasingly come fitted in cars and can be accessed via the CAN-bus. We collect an aggregate of 60 km driving data and synthesize more based on this that capture factors such as variable driving speed, vehicle suspensions, and accelerometer noise. Our formulated LSTM-based deep learning model, LaNet, learns lane-specific sequences of road surface events (bumps, cracks etc.) and yields 100% lane classification accuracy with 200 meters of driving data, achieving over 90% with just 100 m (correspondingly to roughly one minute of driving). We design the LaNet model to be practical for use in real-time lane classification and show with extensive experiments that LaNet yields high classification accuracy even on smooth roads, on large multi-lane roads, and on drives with frequent lane changes. Since different road surfaces have different inherent characteristics or entropy, we excavate our neural network model and discover a mechanism to easily characterize the achievable classification accuracies in a road over various driving distances by training the model just once. We present LaNet as a low-cost, easily deployable and highly accurate way to achieve fine-grained lane identification. △ Less

Submitted 6 April, 2020; originally announced April 2020.

arXiv:2002.09440 [pdf, other]

A Toolkit for Generating Code Knowledge Graphs

Authors: Ibrahim Abdelaziz, Julian Dolby, Jamie McCusker, Kavitha Srinivas

Abstract: Knowledge graphs have been proven extremely useful in powering diverse applications in semantic search and natural language understanding. In this paper, we present GraphGen4Code, a toolkit to build code knowledge graphs that can similarly power various applications such as program search, code understanding, bug detection, and code automation. GraphGen4Code uses generic techniques to capture code… ▽ More Knowledge graphs have been proven extremely useful in powering diverse applications in semantic search and natural language understanding. In this paper, we present GraphGen4Code, a toolkit to build code knowledge graphs that can similarly power various applications such as program search, code understanding, bug detection, and code automation. GraphGen4Code uses generic techniques to capture code semantics with the key nodes in the graph representing classes, functions, and methods. Edges indicate function usage (e.g., how data flows through function calls, as derived from program analysis of real code), and documentation about functions (e.g., code documentation, usage documentation, or forum discussions such as StackOverflow). Our toolkit uses named graphs in RDF to model graphs per program, or can output graphs as JSON. We show the scalability of the toolkit by applying it to 1.3 million Python files drawn from GitHub, 2,300 Python modules, and 47 million forum posts. This results in an integrated code graph with over 2 billion triples. We make the toolkit to build such graphs as well as the sample extraction of the 2 billion triples graph publicly available to the community for use. △ Less

Submitted 27 September, 2021; v1 submitted 21 February, 2020; originally announced February 2020.

arXiv:1911.02065 [pdf, other]

A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

Authors: Maxwell Crouse, Ibrahim Abdelaziz, Bassem Makni, Spencer Whitehead, Cristina Cornelio, Pavan Kapanipathi, Kavitha Srinivas, Veronika Thost, Michael Witbrock, Achille Fokoue

Abstract: Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics, however, its deployment in automated theorem proving remains a challenge. In this paper we introduce TRAIL, a system that applies deep reinforcement learning to saturation-based theor… ▽ More Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics, however, its deployment in automated theorem proving remains a challenge. In this paper we introduce TRAIL, a system that applies deep reinforcement learning to saturation-based theorem proving. TRAIL leverages (a) a novel neural representation of the state of a theorem prover and (b) a novel characterization of the inference selection process in terms of an attention-based action policy. We show through systematic analysis that these mechanisms allow TRAIL to significantly outperform previous reinforcement-learning-based theorem provers on two benchmark datasets for first-order logic automated theorem proving (proving around 15% more theorems). △ Less

Submitted 15 September, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

arXiv:1906.07214 [pdf, other]

Hardware Aware Neural Network Architectures using FbNet

Authors: Sai Vineeth Kalluru Srinivas, Harideep Nair, Vinay Vidyasagar

Abstract: We implement a differentiable Neural Architecture Search (NAS) method inspired by FBNet for discovering neural networks that are heavily optimized for a particular target device. The FBNet NAS method discovers a neural network from a given search space by optimizing over a loss function which accounts for accuracy and target device latency. We extend this loss function by adding an energy term. Th… ▽ More We implement a differentiable Neural Architecture Search (NAS) method inspired by FBNet for discovering neural networks that are heavily optimized for a particular target device. The FBNet NAS method discovers a neural network from a given search space by optimizing over a loss function which accounts for accuracy and target device latency. We extend this loss function by adding an energy term. This will potentially enhance the ``hardware awareness" and help us find a neural network architecture that is optimal in terms of accuracy, latency and energy consumption, given a target device (Raspberry Pi in our case). We name our trained child architecture obtained at the end of search process as Hardware Aware Neural Network Architecture (HANNA). We prove the efficacy of our approach by benchmarking HANNA against two other state-of-the-art neural networks designed for mobile/embedded applications, namely MobileNetv2 and CondenseNet for CIFAR-10 dataset. Our results show that HANNA provides a speedup of about 2.5x and 1.7x, and reduces energy consumption by 3.8x and 2x compared to MobileNetv2 and CondenseNet respectively. HANNA is able to provide such significant speedup and energy efficiency benefits over the state-of-the-art baselines at the cost of a tolerable 4-5% drop in accuracy. △ Less

Submitted 17 June, 2019; originally announced June 2019.

Comments: 8 pages, 11 figures

arXiv:1809.01604 [pdf, other]

Merging datasets through deep learning

Authors: Kavitha Srinivas, Abraham Gale, Julian Dolby

Abstract: Merging datasets is a key operation for data analytics. A frequent requirement for merging is joining across columns that have different surface forms for the same entity (e.g., the name of a person might be represented as "Douglas Adams" or "Adams, Douglas"). Similarly, ontology alignment can require recognizing distinct surface forms of the same entity, especially when ontologies are independent… ▽ More Merging datasets is a key operation for data analytics. A frequent requirement for merging is joining across columns that have different surface forms for the same entity (e.g., the name of a person might be represented as "Douglas Adams" or "Adams, Douglas"). Similarly, ontology alignment can require recognizing distinct surface forms of the same entity, especially when ontologies are independently developed. However, data management systems are currently limited to performing merges based on string equality, or at best using string similarity. We propose an approach to performing merges based on deep learning models. Our approach depends on (a) creating a deep learning model that maps surface forms of an entity into a set of vectors such that alternate forms for the same entity are closest in vector space, (b) indexing these vectors using a nearest neighbors algorithm to find the forms that can be potentially joined together. To build these models, we had to adapt techniques from metric learning due to the characteristics of the data; specifically we describe novel sample selection techniques and loss functions that work for this problem. To evaluate our approach, we used Wikidata as ground truth and built models from datasets with approximately 1.1M people's names (200K identities) and 130K company names (70K identities). We developed models that allow for joins with precision@1 of .75-.81 and recall of .74-.81. We make the models available for aligning people or companies across multiple datasets. △ Less

Submitted 5 September, 2018; originally announced September 2018.

arXiv:1603.00813 [pdf, ps, other]

Some remarks related to Maeda's conjecture

Authors: M. Ram Murty, K. Srinivas

Abstract: In this article we deal with the problem of counting the number of pairs of normalized eigenforms $ (f,g) $ of weight $k$ and level $N$ such that $ a_p (f) = a_p (g) $ where $a_p (f) $ denotes the $p-$th Fourier coefficient of $f$. Here $p$ is a fixed prime. In this article we deal with the problem of counting the number of pairs of normalized eigenforms $ (f,g) $ of weight $k$ and level $N$ such that $ a_p (f) = a_p (g) $ where $a_p (f) $ denotes the $p-$th Fourier coefficient of $f$. Here $p$ is a fixed prime. △ Less

Submitted 2 March, 2016; originally announced March 2016.

Comments: 6 pages, to appear in PAMS

MSC Class: 11F30

arXiv:1308.5703 [pdf, other]

A Principled Approach to Bridging the Gap between Graph Data and their Schemas

Authors: Marcelo Arenas, Gonzalo I. Diaz, Achille Fokoue, Anastasios Kementsietsidis, Kavitha Srinivas

Abstract: Although RDF graphs have schema information associated with them, in practice it is very common to find cases in which data do not fully conform to their schema. A prominent example of this is DBpedia, which is RDF data extracted from Wikipedia, a publicly editable source of information. In such situations, it becomes interesting to study the structural properties of the actual data, because the s… ▽ More Although RDF graphs have schema information associated with them, in practice it is very common to find cases in which data do not fully conform to their schema. A prominent example of this is DBpedia, which is RDF data extracted from Wikipedia, a publicly editable source of information. In such situations, it becomes interesting to study the structural properties of the actual data, because the schema gives an incomplete description of the organization of a dataset. In this paper we have approached the study of the structuredness of an RDF graph in a principled way: we propose a framework for specifying structuredness functions, which gauge the degree to which an RDF graph conforms to a schema. In particular, we first define a formal language for specifying structuredness functions with expressions we call rules. This language allows a user or a database administrator to state a rule to which an RDF graph may fully or partially conform. Then we consider the issue of discovering a refinement of a sort (type) by partitioning the dataset into subsets whose structuredness is over a specified threshold. In particular, we prove that the natural decision problem associated to this refinement problem is NP-complete, and we provide a natural translation of this problem into Integer Linear Programming (ILP). Finally, we test this ILP solution with two real world datasets, DBpedia Persons and WordNet Nouns, and 4 different and intuitive rules, which gauge the structuredness in different ways. The rules give meaningful refinements of the datasets, showing that our language can be a powerful tool for understanding the structure of RDF data. △ Less

Submitted 4 March, 2014; v1 submitted 26 August, 2013; originally announced August 2013.

Comments: 18 pages, 8 figures. To be published in PVLDB Vol. 8, No. 9

arXiv:1102.0367 [pdf, ps, other]

On the zeros of the Epstein zeta function

Authors: Anirban Mukhopadhyay, Krishnan Rajkumar, Kotyada Srinivas

Abstract: In this article, we count the number of consecutive zeros of the Epstein zeta-function, associated to a certain quadratic form, on the critical line with ordinates lying in $[0,T], T$ sufficiently large and which are separated apart by a given positive number $V$. In this article, we count the number of consecutive zeros of the Epstein zeta-function, associated to a certain quadratic form, on the critical line with ordinates lying in $[0,T], T$ sufficiently large and which are separated apart by a given positive number $V$. △ Less

Submitted 2 February, 2011; originally announced February 2011.

Comments: 13 pages, to appear in proceedings of CINTAA, 2006

Journal ref: Proc. in honour of Prof. T. C. Vasudevan, RMS Lecture Note Series, No. 15 (2011), pp. 73-87

arXiv:1012.2965 [pdf, ps, other]

Digital watermarking : An approach based on Hilbert transform

Authors: Rashmi Agarwal, R. Krishnan, M. S. Santhanam, K. Srinivas, K. Venugopalan

Abstract: Most of the well known algorithms for watermarking of digital images involve transformation of the image data to Fourier or singular vector space. In this paper, we introduce watermarking in Hilbert transform domain for digital media. Generally, if the image is a matrix of order $m$ by $n$, then the transformed space is also an image of the same order. However, with Hilbert transforms, the transfo… ▽ More Most of the well known algorithms for watermarking of digital images involve transformation of the image data to Fourier or singular vector space. In this paper, we introduce watermarking in Hilbert transform domain for digital media. Generally, if the image is a matrix of order $m$ by $n$, then the transformed space is also an image of the same order. However, with Hilbert transforms, the transformed space is of order $2m$ by $2n$. This allows for more latitude in storing the watermark in the host image. Based on this idea, we propose an algorithm for embedding and extracting watermark in a host image and analytically obtain a parameter related to this procedure. Using extensive simulations, we show that the algorithm performs well even if the host image is corrupted by various attacks. △ Less

Submitted 14 December, 2010; originally announced December 2010.

Comments: 17 Pages, 52 Figures

arXiv:1012.0081 [pdf, ps, other]

doi 10.1109/TIT.2012.2193554

Molecular communication in fluid media: The additive inverse Gaussian noise channel

Authors: K. V. Srinivas, Raviraj S. Adve, Andrew W. Eckford

Abstract: We consider molecular communication, with information conveyed in the time of release of molecules. The main contribution of this paper is the development of a theoretical foundation for such a communication system. Specifically, we develop the additive inverse Gaussian (IG) noise channel model: a channel in which the information is corrupted by noise with an inverse Gaussian distribution. We show… ▽ More We consider molecular communication, with information conveyed in the time of release of molecules. The main contribution of this paper is the development of a theoretical foundation for such a communication system. Specifically, we develop the additive inverse Gaussian (IG) noise channel model: a channel in which the information is corrupted by noise with an inverse Gaussian distribution. We show that such a channel model is appropriate for molecular communication in fluid media - when propagation between transmitter and receiver is governed by Brownian motion and when there is positive drift from transmitter to receiver. Taking advantage of the available literature on the IG distribution, upper and lower bounds on channel capacity are developed, and a maximum likelihood receiver is derived. Theory and simulation results are presented which show that such a channel does not have a single quality measure analogous to signal-to-noise ratio in the AWGN channel. It is also shown that the use of multiple molecules leads to reduced error rate in a manner akin to diversity order in wireless communications. Finally, we discuss some open problems in molecular communications that arise from the IG system model. △ Less

Submitted 9 December, 2010; v1 submitted 30 November, 2010; originally announced December 2010.

Comments: 28 pages, 8 figures. Submitted to IEEE Transactions on Information Theory. Corrects minor typos in the first version

arXiv:0808.0418 [pdf, ps, other]

Counting squarefree discriminants of trinomials under abc

Authors: Anirban Mukhopadhyay, M. Ram Murty, Kotyada Srinivas

Abstract: For an odd positive integer $n\ge 5$, assuming the truth of the $abc$ conjecture, we show that for a positive proportion of pairs $(a,b)$ of integers the trinomials of the form $t^n+at+b (a,b\in \mathbb Z)$ are irreducible and their discriminants are squarefree. For an odd positive integer $n\ge 5$, assuming the truth of the $abc$ conjecture, we show that for a positive proportion of pairs $(a,b)$ of integers the trinomials of the form $t^n+at+b (a,b\in \mathbb Z)$ are irreducible and their discriminants are squarefree. △ Less

Submitted 4 August, 2008; originally announced August 2008.

Comments: 9 Pages

MSC Class: 11R09; 11C08

arXiv:0807.0986 [pdf, ps, other]

On the Oppenheim's "factorisatio numerorum" function

Authors: Florian Luca, Anirban Mukhopadhyay, Kotyada Srinivas

Abstract: Let $f(n)$ denote the number of distinct unordered factorisations of the natural number $n$ into factors larger than 1.In this paper, we address some aspects of the function $f(n)$. Let $f(n)$ denote the number of distinct unordered factorisations of the natural number $n$ into factors larger than 1.In this paper, we address some aspects of the function $f(n)$. △ Less

Submitted 7 July, 2008; originally announced July 2008.

Comments: 14 pages

arXiv:0804.0715 [pdf, ps, other]

On the zeros of functions in the Selberg class

Authors: Anirban Mukhopadhyay, Kotyada Srinivas, Krishnan Rajkumar

Abstract: It is proved that under some suitable conditions, the degree two functions in the Selberg class have infinitely many zeros on the critical line. It is proved that under some suitable conditions, the degree two functions in the Selberg class have infinitely many zeros on the critical line. △ Less

Submitted 5 February, 2011; v1 submitted 4 April, 2008; originally announced April 2008.

Comments: 9 pages

MSC Class: 11E45

Journal ref: Funct. Approx. Comment. Math. 38 (2008), part 2, 121-130

Showing 1–50 of 50 results for author: Srinivas, K