Skip to main content

Showing 1–50 of 477 results for author: Krishnamurthy

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19461  [pdf, other

    cs.RO cs.CV

    Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features

    Authors: Halil Utku Unlu, Anthony Tzes, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: A robust, resource-efficient, distributed, and minimally parameterized 3D map matching and merging algorithm is proposed. The suggested algorithm utilizes tomographic features from 2D projections of horizontal cross-sections of gravity-aligned local maps, and matches these projection slices at all possible height differences, enabling the estimation of four degrees of freedom in an efficient and p… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Submitted to Elsevier Journal: Robotics and Autonomous Systems (RAS)

  2. arXiv:2406.16807  [pdf, other

    cs.LG cs.CL cs.CV

    Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

    Authors: Katherine M. Collins, Najoung Kim, Yonatan Bitton, Verena Rieser, Shayegan Omidshafiei, Yushi Hu, Sherol Chen, Senjuti Dutta, Minsuk Chang, Kimin Lee, Youwei Liang, Georgina Evans, Sahil Singla, Gang Li, Adrian Weller, Junfeng He, Deepak Ramachandran, Krishnamurthy Dj Dvijotham

    Abstract: Human feedback plays a critical role in learning and refining reward models for text-to-image generation, but the optimal form the feedback should take for learning an accurate reward function has not been conclusively established. This paper investigates the effectiveness of fine-grained feedback which captures nuanced distinctions in image quality and prompt-alignment, compared to traditional co… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.14799  [pdf, other

    cs.RO eess.SY

    Capture Point Control in Thruster-Assisted Bipedal Locomotion

    Authors: Shreyansh Pitroda, Aditya Bondada, Kaushik Venkatesh Krishnamurthy, Adarsh Salagame, Chenghao Wang, Taoran Liu, Bibek Gupta, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

    Abstract: Despite major advancements in control design that are robust to unplanned disturbances, bipedal robots are still susceptible to falling over and struggle to negotiate rough terrains. By utilizing thrusters in our bipedal robot, we can perform additional posture manipulation and expand the modes of locomotion to enhance the robot's stability and ability to negotiate rough and difficult-to-navigate… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Submitted and to be presented at IEEE AIM 2024. arXiv admin note: substantial text overlap with arXiv:2103.15952

  4. arXiv:2406.13118  [pdf, other

    cs.RO eess.SY

    Thruster-Assisted Incline Walking

    Authors: Kaushik Venkatesh Krishnamurthy, Chenghao Wang, Shreyansh Pitroda, Adarsh Salagame, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

    Abstract: In this study, our aim is to evaluate the effectiveness of thruster-assisted steep slope walking for the Husky Carbon, a quadrupedal robot equipped with custom-designed actuators and plural electric ducted fans, through simulation prior to conducting experimental trials. Thruster-assisted steep slope walking draws inspiration from wing-assisted incline running (WAIR) observed in birds, and intrigu… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 7 pages, 7 figures, submitted to CDC 2024 conference. arXiv admin note: text overlap with arXiv:2405.06070

  5. arXiv:2406.11810  [pdf, ps, other

    cs.LG cs.RO eess.SY

    Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

    Authors: Runzhe Wu, Ayush Sekhari, Akshay Krishnamurthy, Wen Sun

    Abstract: We study computationally and statistically efficient Reinforcement Learning algorithms for the linear Bellman Complete setting, a setting that uses linear function approximation to capture value functions and unifies existing models like linear Markov Decision Processes (MDP) and Linear Quadratic Regulators (LQR). While it is known from the prior works that this setting is statistically tractable,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2406.05590  [pdf, other

    cs.CR cs.AI cs.CY cs.LG

    NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

    Authors: Minghao Shao, Sofija Jancheska, Meet Udeshi, Brendan Dolan-Gavitt, Haoran Xi, Kimberly Milner, Boyuan Chen, Max Yin, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique

    Abstract: Large Language Models (LLMs) are being deployed across various domains today. However, their capacity to solve Capture the Flag (CTF) challenges in cybersecurity has not been thoroughly evaluated. To address this, we develop a novel method to assess LLMs in solving CTF challenges by creating a scalable, open-source benchmark database specifically designed for these applications. This database incl… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  7. arXiv:2405.21046  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

    Authors: Tengyang Xie, Dylan J. Foster, Akshay Krishnamurthy, Corby Rosset, Ahmed Awadallah, Alexander Rakhlin

    Abstract: Reinforcement learning from human feedback (RLHF) has emerged as a central tool for language model alignment. We consider online exploration in RLHF, which exploits interactive access to human or AI feedback by deliberately encouraging the model to produce diverse, maximally informative responses. By allowing RLHF to confidently stray from the pre-trained model, online exploration offers the possi… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  8. arXiv:2405.19269  [pdf, other

    cs.LG

    Rich-Observation Reinforcement Learning with Continuous Latent Dynamics

    Authors: Yuda Song, Lili Wu, Dylan J. Foster, Akshay Krishnamurthy

    Abstract: Sample-efficiency and reliability remain major bottlenecks toward wide adoption of reinforcement learning algorithms in continuous settings with high-dimensional perceptual inputs. Toward addressing these challenges, we introduce a new theoretical framework, RichCLD (Rich-Observation RL with Continuous Latent Dynamics), in which the agent performs control based on high-dimensional observations, bu… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 63 pages, 4 figures, published at ICML 2024

  9. arXiv:2405.14737  [pdf, other

    cs.CV

    CLIPScope: Enhancing Zero-Shot OOD Detection with Bayesian Scoring

    Authors: Hao Fu, Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Detection of out-of-distribution (OOD) samples is crucial for safe real-world deployment of machine learning models. Recent advances in vision language foundation models have made them capable of detecting OOD samples without requiring in-distribution (ID) images. However, these zero-shot methods often underperform as they do not adequately consider ID class likelihoods in their detection confiden… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2405.07417  [pdf, other

    cs.SI eess.SP

    Identifying Hate Speech Peddlers in Online Platforms. A Bayesian Social Learning Approach for Large Language Model Driven Decision-Makers

    Authors: Adit Jain, Vikram Krishnamurthy

    Abstract: This paper studies the problem of autonomous agents performing Bayesian social learning for sequential detection when the observations of the state belong to a high-dimensional space and are expensive to analyze. Specifically, when the observations are textual, the Bayesian agent can use a large language model (LLM) as a map to get a low-dimensional private observation. The agent performs Bayesian… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  11. arXiv:2405.07415  [pdf, ps, other

    cs.LG eess.SY

    Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization

    Authors: Adit Jain, Vikram Krishnamurthy

    Abstract: This paper studies how a stochastic gradient algorithm (SG) can be controlled to hide the estimate of the local stationary point from an eavesdropper. Such problems are of significant interest in distributed optimization settings like federated learning and inventory management. A learner queries a stochastic oracle and incentivizes the oracle to obtain noisy gradient measurements and perform SG.… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  12. arXiv:2405.06070  [pdf, other

    cs.RO eess.SY

    Narrow-Path, Dynamic Walking Using Integrated Posture Manipulation and Thrust Vectoring

    Authors: Kaushik Venkatesh Krishnamurthy, Chenghao Wang, Shreyansh Pitroda, Adarsh Salagame, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

    Abstract: This research concentrates on enhancing the navigational capabilities of Northeastern Universitys Husky, a multi-modal quadrupedal robot, that can integrate posture manipulation and thrust vectoring, to traverse through narrow pathways such as walking over pipes and slacklining. The Husky is outfitted with thrusters designed to stabilize its body during dynamic walking over these narrow paths. The… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.12586

  13. arXiv:2405.04829  [pdf, other

    cs.CL

    Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages

    Authors: Sankalp Bahad, Pruthwik Mishra, Karunesh Arora, Rakesh Chandra Balabantaray, Dipti Misra Sharma, Parameswari Krishnamurthy

    Abstract: Named Entity Recognition (NER) is a useful component in Natural Language Processing (NLP) applications. It is used in various tasks such as Machine Translation, Summarization, Information Retrieval, and Question-Answering systems. The research on NER is centered around English and some other major languages, whereas limited attention has been given to Indian languages. We analyze the challenges an… ▽ More

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 8 pages, accepted in NAACL-SRW, 2024

  14. arXiv:2405.04636  [pdf, ps, other

    cs.LG stat.ML

    Data-driven Error Estimation: Upper Bounding Multiple Errors with No Technical Debt

    Authors: Sanath Kumar Krishnamurthy, Susan Athey, Emma Brunskill

    Abstract: We formulate the problem of constructing multiple simultaneously valid confidence intervals (CIs) as estimating a high probability upper bound on the maximum error for a class/set of estimate-estimand-error tuples, and refer to this as the error estimation problem. For a single such tuple, data-driven confidence intervals can often be used to bound the error in our estimate. However, for a class o… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  15. arXiv:2405.04260  [pdf, other

    cs.LG cs.AI

    Verified Neural Compressed Sensing

    Authors: Rudy Bunel, Krishnamurthy Dvijotham, M. Pawan Kumar, Alessandro De Palma, Robert Stanforth

    Abstract: We develop the first (to the best of our knowledge) provably correct neural networks for a precise computational task, with the proof of correctness generated by an automated verification algorithm without any human input. Prior work on neural network verification has focused on partial specifications that, even when satisfied, are not sufficient to ensure that a neural network never makes errors.… ▽ More

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  16. arXiv:2405.00942  [pdf, other

    cs.CV cs.CL

    LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs

    Authors: Somesh Singh, Harini S I, Yaman K Singla, Veeky Baths, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy

    Abstract: Communication is defined as "Who says what to whom with what effect." A message from a communicator generates downstream receiver effects, also known as behavior. Receiver behavior, being a downstream effect of the message, carries rich signals about it. Even after carrying signals about the message, the behavior data is often ignored while training large language models. We show that training LLM… ▽ More

    Submitted 16 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  17. arXiv:2405.00717  [pdf, other

    cs.CL cs.AI

    Exploring News Summarization and Enrichment in a Highly Resource-Scarce Indian Language: A Case Study of Mizo

    Authors: Abhinaba Bala, Ashok Urlana, Rahul Mishra, Parameswari Krishnamurthy

    Abstract: Obtaining sufficient information in one's mother tongue is crucial for satisfying the information needs of the users. While high-resource languages have abundant online resources, the situation is less than ideal for very low-resource languages. Moreover, the insufficient reporting of vital national and international events continues to be a worry, especially in languages with scarce resources, li… ▽ More

    Submitted 25 April, 2024; originally announced May 2024.

    Comments: Accepted at LREC-COLING2024 WILDRE Workshop

    ACM Class: I.2.7

  18. arXiv:2404.16706  [pdf, other

    cs.DS cs.CC cs.CR cs.LG

    Efficient and Near-Optimal Noise Generation for Streaming Differential Privacy

    Authors: Krishnamurthy Dvijotham, H. Brendan McMahan, Krishna Pillutla, Thomas Steinke, Abhradeep Thakurta

    Abstract: In the task of differentially private (DP) continual counting, we receive a stream of increments and our goal is to output an approximate running total of these increments, without revealing too much about any specific increment. Despite its simplicity, differentially private continual counting has attracted significant attention both in theory and in practice. Existing algorithms for differential… ▽ More

    Submitted 6 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  19. arXiv:2404.15446  [pdf, other

    cs.CR eess.SY

    OffRAMPS: An FPGA-based Intermediary for Analysis and Modification of Additive Manufacturing Control Systems

    Authors: Jason Blocklove, Md Raz, Prithwish Basu Roy, Hammond Pearce, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri

    Abstract: Cybersecurity threats in Additive Manufacturing (AM) are an increasing concern as AM adoption continues to grow. AM is now being used for parts in the aerospace, transportation, and medical domains. Threat vectors which allow for part compromise are particularly concerning, as any failure in these domains would have life-threatening consequences. A major challenge to investigation of AM part-compr… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  20. arXiv:2404.15391  [pdf, ps, other

    cs.GT econ.GN

    Adaptive Mechanism Design using Multi-Agent Revealed Preferences

    Authors: Luke Snow, Vikram Krishnamurthy

    Abstract: This paper constructs an algorithmic framework for adaptively achieving the mechanism design objective, finding a mechanism inducing socially optimal Nash equilibria, without knowledge of the utility functions of the agents. We consider a probing scheme where the designer can iteratively enact mechanisms and observe Nash equilibria responses. We first derive necessary and sufficient conditions, ta… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  21. arXiv:2404.01863  [pdf, other

    cs.LG cs.AI

    Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

    Authors: Kyuyoung Kim, Jongheon Jeong, Minyong An, Mohammad Ghavamzadeh, Krishnamurthy Dvijotham, **woo Shin, Kimin Lee

    Abstract: Fine-tuning text-to-image models with reward functions trained on human feedback data has proven effective for aligning model behavior with human intent. However, excessive optimization with such reward models, which serve as mere proxy objectives, can compromise the performance of fine-tuned models, a phenomenon known as reward overoptimization. To investigate this issue in depth, we introduce th… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: ICLR 2024

  22. arXiv:2404.00538  [pdf, ps, other

    cs.CR stat.AP

    Eclipse Attack Detection on a Blockchain Network as a Non-Parametric Change Detection Problem

    Authors: Anurag Gupta, Vikram Krishnamurthy, Brian M. Sadler

    Abstract: This paper introduces a novel non-parametric change detection algorithm to identify eclipse attacks on a blockchain network; the non-parametric algorithm relies only on the empirical mean and variance of the dataset, making it highly adaptable. An eclipse attack occurs when malicious actors isolate blockchain users, disrupting their ability to reach consensus with the broader network, thereby dist… ▽ More

    Submitted 30 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  23. arXiv:2403.18206  [pdf, other

    cs.RO

    Sailing Through Point Clouds: Safe Navigation Using Point Cloud Based Control Barrier Functions

    Authors: Bolun Dai, Rooholla Khorrambakht, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: The capability to navigate safely in an unstructured environment is crucial when deploying robotic systems in real-world scenarios. Recently, control barrier function (CBF) based approaches have been highly effective in synthesizing safety-critical controllers. In this work, we propose a novel CBF-based local planner comprised of two components: Vessel and Mariner. The Vessel is a novel scaling fa… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  24. arXiv:2403.17277  [pdf, other

    cs.NI

    Relational Network Verification

    Authors: Xieyang Xu, Yifei Yuan, Zachary Kincaid, Arvind Krishnamurthy, Ratul Mahajan, David Walker, Ennan Zhai

    Abstract: Relational network verification is a new approach to validating network changes. In contrast to traditional network verification, which analyzes specifications for a single network snapshot, relational network verification analyzes specifications concerning two network snapshots (e.g., pre- and post-change snapshots) and captures their similarities and differences. Relational change specifications… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  25. arXiv:2403.15371  [pdf, other

    cs.LG cs.AI cs.CL

    Can large language models explore in-context?

    Authors: Akshay Krishnamurthy, Keegan Harris, Dylan J. Foster, Cyril Zhang, Aleksandrs Slivkins

    Abstract: We investigate the extent to which contemporary Large Language Models (LLMs) can engage in exploration, a core capability in reinforcement learning and decision making. We focus on native performance of existing LLMs, without training interventions. We deploy LLMs as agents in simple multi-armed bandit environments, specifying the environment description and interaction history entirely in-context… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  26. arXiv:2403.11411  [pdf, other

    cs.NI

    Laconic: Streamlined Load Balancers for SmartNICs

    Authors: Tianyi Cui, Chenxingyu Zhao, Wei Zhang, Kaiyuan Zhang, Arvind Krishnamurthy

    Abstract: Load balancers are pervasively used inside today's clouds to scalably distribute network requests across data center servers. Given the extensive use of load balancers and their associated operating costs, several efforts have focused on improving their efficiency by implementing Layer-4 load-balancing logic within the kernel or using hardware acceleration. This work explores whether the more comp… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  27. Detecting Security-Relevant Methods using Multi-label Machine Learning

    Authors: Oshando Johnson, Goran Piskachev, Ranjith Krishnamurthy, Eric Bodden

    Abstract: To detect security vulnerabilities, static analysis tools need to be configured with security-relevant methods. Current approaches can automatically identify such methods using binary relevance machine learning approaches. However, they ignore dependencies among security-relevant methods, over-generalize and perform poorly in practice. Additionally, users have to nevertheless manually configure st… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures, The IDE Workshop

  28. arXiv:2403.06634  [pdf, other

    cs.CR

    Stealing Part of a Production Language Model

    Authors: Nicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Eric Wallace, David Rolnick, Florian Tramèr

    Abstract: We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the embedding projection layer (up to symmetries) of a transformer model, given typical API access. For under \… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  29. arXiv:2403.06571  [pdf, other

    cs.LG math.OC stat.ML

    Scalable Online Exploration via Coverability

    Authors: Philip Amortila, Dylan J. Foster, Akshay Krishnamurthy

    Abstract: Exploration is a major challenge in reinforcement learning, especially for high-dimensional domains that require function approximation. We propose exploration objectives -- policy optimization objectives that enable downstream maximization of any reward function -- as a conceptual framework to systematize the study of exploration. Within this framework, we introduce a new objective, $L_1$-Coverag… ▽ More

    Submitted 4 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  30. arXiv:2403.05576  [pdf

    cs.HC cs.AI

    Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction

    Authors: Senjuti Dutta, Sherol Chen, Sunny Mak, Amnah Ahmad, Katherine Collins, Alena Butryna, Deepak Ramachandran, Krishnamurthy Dvijotham, Ellie Pavlick, Ravi Rajakumar

    Abstract: Image generation models are poised to become ubiquitous in a range of applications. These models are often fine-tuned and evaluated using human quality judgments that assume a universal standard, failing to consider the subjectivity of such tasks. To investigate how to quantify subjectivity, and the scale of its impact, we measure how assessments differ among human annotators across different use… ▽ More

    Submitted 26 February, 2024; originally announced March 2024.

  31. arXiv:2403.03622  [pdf, other

    cs.GR math.AG

    Medial Parametrization of Arbitrary Planar Compact Domains with Dipoles

    Authors: Vinayak Krishnamurthy, Ergun Akleman

    Abstract: We present medial parametrization, a new approach to parameterizing any compact planar domain bounded by simple closed curves. The basic premise behind our proposed approach is to use two close Voronoi sites, which we call dipoles, to construct and reconstruct an approximate piecewise-linear version of the original boundary and medial axis through Voronoi tessellation. The boundaries and medial ax… ▽ More

    Submitted 7 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 17 pages

  32. arXiv:2402.16926  [pdf, other

    cs.CR cs.AI cs.LG stat.ML

    On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem

    Authors: Georg Pichler, Marco Romanelli, Divya Prakash Manivannan, Prashanth Krishnamurthy, Farshad Khorrami, Siddharth Garg

    Abstract: We introduce a formal statistical definition for the problem of backdoor detection in machine learning systems and use it to analyze the feasibility of such problems, providing evidence for the utility and applicability of our definition. The main contributions of this work are an impossibility result and an achievability result for backdoor detection. We show a no-free-lunch theorem, proving that… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  33. arXiv:2402.13531  [pdf, other

    cs.LG cs.CR

    Private Gradient Descent for Linear Regression: Tighter Error Bounds and Instance-Specific Uncertainty Estimation

    Authors: Gavin Brown, Krishnamurthy Dvijotham, Georgina Evans, Daogao Liu, Adam Smith, Abhradeep Thakurta

    Abstract: We provide an improved analysis of standard differentially private gradient descent for linear regression under the squared error loss. Under modest assumptions on the input, we characterize the distribution of the iterate at each time step. Our analysis leads to new results on the algorithm's accuracy: for a proper fixed choice of hyperparameters, the sample complexity depends only linearly on… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 22 pages, 11 figures

  34. arXiv:2402.10344  [pdf, other

    cs.CV

    Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions

    Authors: Muhammad Arbab Arshad, Talukder Jubery, James Afful, Anushrut Jignasu, Aditya Balu, Baskar Ganapathysubramanian, Soumik Sarkar, Adarsh Krishnamurthy

    Abstract: We evaluate different Neural Radiance Fields (NeRFs) techniques for reconstructing (3D) plants in varied environments, from indoor settings to outdoor fields. Traditional techniques often struggle to capture the complex details of plants, which is crucial for botanical and agricultural understanding. We evaluate three scenarios with increasing complexity and compare the results with the point clou… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  35. arXiv:2402.08769  [pdf, other

    cs.LG cs.DC

    FLASH: Federated Learning Across Simultaneous Heterogeneities

    Authors: Xiangyu Chang, Sk Miraj Ahmed, Srikanth V. Krishnamurthy, Basak Guler, Ananthram Swami, Samet Oymak, Amit K. Roy-Chowdhury

    Abstract: The key premise of federated learning (FL) is to train ML models across a diverse set of data-owners (clients), without exchanging local data. An overarching challenge to this date is client heterogeneity, which may arise not only from variations in data distribution, but also in data quality, as well as compute/communication latency. An integrated view of these diverse and concurrent sources of h… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  36. arXiv:2402.08546  [pdf, other

    cs.RO

    Grounding LLMs For Robot Task Planning Using Closed-loop State Feedback

    Authors: Vineet Bhat, Ali Umut Kaypak, Prashanth Krishnamurthy, Ramesh Karri, Farshad Khorrami

    Abstract: Robotic planning algorithms direct agents to perform actions within diverse environments to accomplish a task. Large Language Models (LLMs) like PaLM 2, GPT-3.5, and GPT-4 have revolutionized this domain, using their embedded real-world knowledge to tackle complex tasks involving multiple agents and objects. This paper introduces an innovative planning algorithm that integrates LLMs into the robot… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  37. arXiv:2402.06787  [pdf, other

    cs.NI cs.DC cs.LG

    ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics

    Authors: Liangyu Zhao, Saeed Maleki, Ziyue Yang, Hossein Pourreza, Aashaka Shah, Changho Hwang, Arvind Krishnamurthy

    Abstract: As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck. Designing efficient communication schedules is challenging given today's highly diverse and heterogeneous network fabrics. In this paper, we present ForestColl, a tool that generates efficient schedules for any network topology. ForestColl cons… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.18461

  38. arXiv:2402.06091  [pdf, other

    cs.CV

    Early Fusion of Features for Semantic Segmentation

    Authors: Anupam Gupta, Ashok Krishnamurthy, Lisa Singh

    Abstract: This paper introduces a novel segmentation framework that integrates a classifier network with a reverse HRNet architecture for efficient image segmentation. Our approach utilizes a ResNet-50 backbone, pretrained in a semi-supervised manner, to generate feature maps at various scales. These maps are then processed by a reverse HRNet, which is adapted to handle varying channel dimensions through 1x… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  39. arXiv:2402.01344  [pdf, other

    cs.LG

    Monotone, Bi-Lipschitz, and Polyak-Lojasiewicz Networks

    Authors: Ruigang Wang, Krishnamurthy Dvijotham, Ian R. Manchester

    Abstract: This paper presents a new bi-Lipschitz invertible neural network, the BiLipNet, which has the ability to smoothly control both its Lipschitzness (output sensitivity to input perturbations) and inverse Lipschitzness (input distinguishability from different outputs). The second main contribution is a new scalar-output network, the PLNet, which is a composition of a BiLipNet and a quadratic potential… ▽ More

    Submitted 5 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: International Conference on Machine Learning, Vienna, Austria, July 21 -- 17, 2024

  40. arXiv:2402.01155  [pdf, other

    cs.CL

    CABINET: Content Relevance based Noise Reduction for Table Question Answering

    Authors: Sohan Patnaik, Heril Changwal, Milan Aggarwal, Sumit Bhatia, Yaman Kumar, Balaji Krishnamurthy

    Abstract: Table understanding capability of Large Language Models (LLMs) has been extensively studied through the task of question-answering (QA) over tables. Typically, only a small part of the whole table is relevant to derive the answer for a given question. The irrelevant parts act as noise and are distracting information, resulting in sub-optimal performance due to the vulnerability of LLMs to noise. T… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024 (spotlight)

  41. arXiv:2401.17172  [pdf, other

    physics.comp-ph cs.LG math.NA

    Learning Domain-Independent Green's Function For Elliptic Partial Differential Equations

    Authors: Pawan Negi, Maggie Cheng, Mahesh Krishnamurthy, Wenjun Ying, Shuwang Li

    Abstract: Green's function characterizes a partial differential equation (PDE) and maps its solution in the entire domain as integrals. Finding the analytical form of Green's function is a non-trivial exercise, especially for a PDE defined on a complex domain or a PDE with variable coefficients. In this paper, we propose a novel boundary integral network to learn the domain-independent Green's function, ref… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  42. arXiv:2401.12216  [pdf, other

    stat.ML cs.LG math.OC

    Mitigating Covariate Shift in Misspecified Regression with Applications to Reinforcement Learning

    Authors: Philip Amortila, Tongyi Cao, Akshay Krishnamurthy

    Abstract: A pervasive phenomenon in machine learning applications is distribution shift, where training and deployment conditions for a machine learning model differ. As distribution shift typically results in a degradation in performance, much attention has been devoted to algorithmic interventions that mitigate these detrimental effects. In this paper, we study the effect of distribution shift in the pres… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  43. arXiv:2401.12032  [pdf, other

    cs.HC cs.AI

    MINT: A wrapper to make multi-modal and multi-image AI models interactive

    Authors: Jan Freyberg, Abhijit Guha Roy, Terry Spitz, Beverly Freeman, Mike Schaekermann, Patricia Strachan, Eva Schnider, Renee Wong, Dale R Webster, Alan Karthikesalingam, Yun Liu, Krishnamurthy Dvijotham, Umesh Telang

    Abstract: During the diagnostic process, doctors incorporate multimodal information including imaging and the medical history - and similarly medical AI development has increasingly become multimodal. In this paper we tackle a more subtle challenge: doctors take a targeted medical history to obtain only the most pertinent pieces of information; how do we enable AI to do the same? We develop a wrapper method… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 15 pages, 7 figures

  44. arXiv:2401.07123  [pdf, other

    cs.HC cs.CL

    One Agent Too Many: User Perspectives on Approaches to Multi-agent Conversational AI

    Authors: Christopher Clarke, Karthik Krishnamurthy, Walter Talamonti, Yi** Kang, Lingjia Tang, Jason Mars

    Abstract: Conversational agents have been gaining increasing popularity in recent years. Influenced by the widespread adoption of task-oriented agents such as Apple Siri and Amazon Alexa, these agents are being deployed into various applications to enhance user experience. Although these agents promote "ask me anything" functionality, they are typically built to focus on a single or finite set of expertise.… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  45. arXiv:2401.04130  [pdf, other

    cs.LG cs.AI

    Plug-and-Play Transformer Modules for Test-Time Adaptation

    Authors: Xiangyu Chang, Sk Miraj Ahmed, Srikanth V. Krishnamurthy, Basak Guler, Ananthram Swami, Samet Oymak, Amit K. Roy-Chowdhury

    Abstract: Parameter-efficient tuning (PET) methods such as LoRA, Adapter, and Visual Prompt Tuning (VPT) have found success in enabling adaptation to new domains by tuning small modules within a transformer model. However, the number of domains encountered during test time can be very large, and the data is usually unlabeled. Thus, adaptation to new domains is challenging; it is also impractical to generate… ▽ More

    Submitted 8 February, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  46. arXiv:2312.12586  [pdf

    cs.RO

    Towards dynamic Narrow path walking on NU's Husky

    Authors: Kaushik Venkatesh Krishnamurthy

    Abstract: This research focuses on enabling Northeastern University's Husky, a multi-modal quadrupedal robot, to navigate narrow paths akin to various animals in nature. The Husky is equipped with thrusters to stabilize its body during dynamic maneuvers, addressing challenges inherent in aerial-legged systems. The approach involves modeling the robot as HROM (Husky Reduced Model) and creating an optimal con… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 60 pages, 27 figures

  47. arXiv:2312.10240  [pdf, other

    cs.CV

    Rich Human Feedback for Text-to-Image Generation

    Authors: Youwei Liang, Junfeng He, Gang Li, Peizhao Li, Arseniy Klimovskiy, Nicholas Carolan, Jiao Sun, Jordi Pont-Tuset, Sarah Young, Feng Yang, Junjie Ke, Krishnamurthy Dj Dvijotham, Katie Collins, Yiwen Luo, Yang Li, Kai J Kohlhoff, Deepak Ramachandran, Vidhya Navalpakkam

    Abstract: Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen have made significant progress in generating high-resolution images based on text descriptions. However, many generated images still suffer from issues such as artifacts/implausibility, misalignment with text descriptions, and low aesthetic quality. Inspired by the success of Reinforcement Learning with Human Feedback… ▽ More

    Submitted 8 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: CVPR'24

  48. arXiv:2312.03052  [pdf, other

    cs.CV cs.CL

    Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models

    Authors: Yushi Hu, Otilia Stretcu, Chun-Ta Lu, Krishnamurthy Viswanathan, Kenji Hata, Enming Luo, Ranjay Krishna, Ariel Fuxman

    Abstract: Solving complex visual tasks such as "Who invented the musical instrument on the right?" involves a composition of skills: understanding space, recognizing instruments, and also retrieving prior knowledge. Recent work shows promise by decomposing such tasks using a large language model (LLM) into an executable program that invokes specialized vision models. However, generated programs are error-pr… ▽ More

    Submitted 5 April, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Oral

  49. arXiv:2311.10995  [pdf, other

    cs.CV cs.CL

    Behavior Optimized Image Generation

    Authors: Varun Khurana, Yaman K Singla, Jayakumar Subramanian, Rajiv Ratn Shah, Changyou Chen, Zhiqiang Xu, Balaji Krishnamurthy

    Abstract: The last few years have witnessed great success on image generation, which has crossed the acceptance thresholds of aesthetics, making it directly applicable to personal and commercial applications. However, images, especially in marketing and advertising applications, are often created as a means to an end as opposed to just aesthetic concerns. The goal can be increasing sales, getting more click… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  50. arXiv:2311.09216  [pdf, other

    cs.CL cs.AI

    Assessing Translation capabilities of Large Language Models involving English and Indian Languages

    Authors: Vandan Mujadia, Ashok Urlana, Yash Bhaskar, Penumalla Aditya Pavani, Kukkapalli Shravya, Parameswari Krishnamurthy, Dipti Misra Sharma

    Abstract: Generative Large Language Models (LLMs) have achieved remarkable advancements in various NLP tasks. In this work, our aim is to explore the multilingual capabilities of large language models by using machine translation as a task involving English and 22 Indian languages. We first investigate the translation capabilities of raw large language models, followed by exploring the in-context learning c… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.