
Showing 1–50 of 327 results for author: Srikanth

Searching in archive cs.
  1. arXiv:2407.06706  [pdf, other]

    cs.RO

    Exploring Unstructured Environments using Minimal Sensing on Cooperative Nano-Drones

    Authors: Pedro Arias-Perez, Alvika Gautam, Miguel Fernandez-Cortizas, David Perez-Saura, Srikanth Saripalli, Pascual Campoy

    Abstract: Recent advances have improved autonomous navigation and mapping under payload constraints, but current multi-robot inspection algorithms are unsuitable for nano-drones due to their need for heavy sensors and high computational resources. To address these challenges, we introduce ExploreBug, a novel hybrid frontier range bug algorithm designed to handle limited sensing capabilities for a swarm of n…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE Robotics and Automation Letters

  2. arXiv:2407.04525  [pdf]

    q-bio.NC cs.AI cs.LG

    Enhancing learning in artificial neural networks through cellular heterogeneity and neuromodulatory signaling

    Authors: Alejandro Rodriguez-Garcia, Jie Mei, Srikanth Ramaswamy

    Abstract: Recent progress in artificial intelligence (AI) has been driven by insights from neuroscience, particularly with the development of artificial neural networks (ANNs). This has significantly enhanced the replication of complex cognitive tasks such as vision and natural language processing. Despite these advances, ANNs struggle with continual learning, adaptable knowledge transfer, robustness, and r…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 34 pages, 4 figures, 3 boxes

  3. arXiv:2407.04444  [pdf, other]

    cs.CL cs.SD eess.AS

    TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR

    Authors: Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Nigmatulina, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia, Aravind Ganapathiraju

    Abstract: In traditional conversational intelligence from speech, a cascaded pipeline is used, involving tasks such as voice activity detection, diarization, transcription, and subsequent processing with different NLP models for tasks like semantic endpointing and named entity recognition (NER). Our paper introduces TokenVerse, a single Transducer-based model designed to handle multiple tasks. This is achie…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 5 pages, double column

  4. arXiv:2407.03387  [pdf, other]

    cs.SE cs.AI cs.CL

    ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages

    Authors: Mehant Kammakomati, Sameer Pimparkhede, Srikanth Tamilselvam, Prince Kumar, Pushpak Bhattacharyya

    Abstract: Recent work shows Large Language Models (LLMs) struggle to understand natural language constraints for various text generation tasks in zero- and few-shot settings. While, in the code domain, there is wide usage of constraints in code format to maintain the integrity of code written in Domain-Specific Languages (DSLs), yet there has been no work evaluating LLMs with these constraints. We propose t…

    Submitted 3 July, 2024; originally announced July 2024.

  5. arXiv:2406.17935  [pdf, other]

    cs.CL cs.SD eess.AS

    Sequential Editing for Lifelong Training of Speech Recognition Models

    Authors: Devang Kulshreshtha, Saket Dingliwal, Brady Houston, Nikolaos Pappas, Srikanth Ronanki

    Abstract: Automatic Speech Recognition (ASR) traditionally assumes known domains, but adding data from a new domain raises concerns about computational inefficiencies linked to retraining models on both existing and new domains. Fine-tuning solely on new domain risks Catastrophic Forgetting (CF). To address this, Lifelong Learning (LLL) algorithms have been proposed for ASR. Prior research has explored tech…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: INTERSPEECH 2024

  6. arXiv:2406.11925  [pdf, other]

    cs.SE cs.AI cs.CL

    DocCGen: Document-based Controlled Code Generation

    Authors: Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya

    Abstract: Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and Python. However, their practical usage for structured domain-specific languages (DSLs) such as YAML, JSON is limited due to domain-specific schema, grammar, and customizations generally unseen by…

    Submitted 3 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2406.08900  [pdf, other]

    eess.AS cs.SD eess.SP

    On Improving Error Resilience of Neural End-to-End Speech Coders

    Authors: Kishan Gupta, Nicola Pia, Srikanth Korse, Andreas Brendel, Guillaume Fuchs, Markus Multrus

    Abstract: Error resilient tools like Packet Loss Concealment (PLC) and Forward Error Correction (FEC) are essential to maintain a reliable speech communication for applications like Voice over Internet Protocol (VoIP), where packets are frequently delayed and lost. In recent times, end-to-end neural speech codecs have seen a significant rise, due to their ability to transmit speech signal at low bitrates bu…

    Submitted 13 June, 2024; originally announced June 2024.

  8. arXiv:2406.07568  [pdf, other]

    cs.AI cs.LG cs.RO

    Reinforcement Learning Based Escape Route Generation in Low Visibility Environments

    Authors: Hari Srikanth

    Abstract: Structure fires are responsible for the majority of fire-related deaths nationwide. In order to assist with the rapid evacuation of trapped people, this paper proposes the use of a system that determines optimal search paths for firefighters and exit paths for civilians in real time based on environmental measurements. Through the use of a LiDAR mapping system evaluated and verified by a trust ran…

    Submitted 27 May, 2024; originally announced June 2024.

  9. arXiv:2406.06835  [pdf, other]

    cs.SE

    Large language models for generating rules, yay or nay?

    Authors: Shangeetha Sivasothy, Scott Barnett, Rena Logothetis, Mohamed Abdelrazek, Zafaryab Rasool, Srikanth Thudumu, Zac Brannelly

    Abstract: Engineering safety-critical systems such as medical devices and digital health intervention systems is complex, where long-term engagement with subject-matter experts (SMEs) is needed to capture the systems' expected behaviour. In this paper, we present a novel approach that leverages Large Language Models (LLMs), such as GPT-3.5 and GPT-4, as a potential world model to accelerate the engineering…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 1 figure

  10. arXiv:2406.00314  [pdf, other]

    cs.CL cs.AI cs.LG

    CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models

    Authors: Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi

    Abstract: The limited availability of psychologists necessitates efficient identification of individuals requiring urgent mental healthcare. This study explores the use of Natural Language Processing (NLP) pipelines to analyze text data from online mental health forums used for consultations. By analyzing forum posts, these pipelines can flag users who may require immediate professional attention. A crucial…

    Submitted 16 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  11. arXiv:2405.20350  [pdf, other]

    cs.LG

    Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges

    Authors: Hari Srikanth

    Abstract: Neural Network based approximations of the Value function make up the core of leading Policy Based methods such as Trust Regional Policy Optimization (TRPO) and Proximal Policy Optimization (PPO). While this adds significant value when dealing with very complex environments, we note that in sufficiently low State and action space environments, a computationally expensive Neural Network architectur…

    Submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2405.13019  [pdf, other]

    cs.CL cs.AI

    A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models

    Authors: Mahsa Khoshnoodi, Vinija Jain, Mingye Gao, Malavika Srikanth, Aman Chadha

    Abstract: Despite the crucial importance of accelerating text generation in large language models (LLMs) for efficiently producing content, the sequential nature of this process often leads to high inference latency, posing challenges for real-time applications. Various techniques have been proposed and developed to address these challenges and improve efficiency. This paper presents a comprehensive survey…

    Submitted 24 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  13. arXiv:2405.08317  [pdf, other]

    cs.CL cs.SD eess.AS

    SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

    Authors: Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Integrated Speech and Large Language Models (SLMs) that can follow speech instructions and generate relevant text responses have gained popularity lately. However, the safety and robustness of these models remains largely unclear. In this work, we investigate the potential vulnerabilities of such instruction-following speech-language models to adversarial attacks and jailbreaking. Specifically, we…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 9+6 pages, Submitted to ACL 2024

  14. arXiv:2405.08295  [pdf, other]

    cs.CL cs.SD eess.AS

    SpeechVerse: A Large-scale Generalizable Audio Language Model

    Authors: Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Large language models (LLMs) have shown incredible proficiency in performing tasks that require semantic understanding of natural language instructions. Recently, many works have further expanded this capability to perceive multimodal audio and text inputs, but their capabilities are often limited to specific fine-tuned tasks such as automatic speech recognition and translation. We therefore devel…

    Submitted 31 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Single Column, 13 pages

  15. arXiv:2405.06149  [pdf, other]

    cs.AI cs.CV

    DisBeaNet: A Deep Neural Network to augment Unmanned Surface Vessels for maritime situational awareness

    Authors: Srikanth Vemula, Eulises Franco, Michael Frye

    Abstract: Intelligent detection and tracking of the vessels on the sea play a significant role in conducting traffic avoidance in unmanned surface vessels (USV). Current traffic avoidance software relies mainly on Automated Identification System (AIS) and radar to track other vessels to avoid collisions and acts as a typical perception system to detect targets. However, in a contested environment, emitting r…

    Submitted 17 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  16. arXiv:2405.02347  [pdf, other]

    cs.LG cs.AI cs.CL

    COPAL: Continual Pruning in Large Language Generative Models

    Authors: Srikanth Malla, Joon Hee Choi, Chiho Choi

    Abstract: Adapting pre-trained large language models to different domains in natural language processing requires two key considerations: high computational demands and model's inability to continual adaptation. To simultaneously address both issues, this paper presents COPAL (COntinual Pruning in Adaptive Language settings), an algorithm developed for pruning large language generative models under a contin…

    Submitted 14 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: ICML2024

  17. arXiv:2404.19422  [pdf, other]

    cs.DS

    Efficient Algorithms for Earliest and Fastest Paths in Public Transport Networks

    Authors: Mithinti Srikanth, G. Ramakrishna

    Abstract: Public transport administrators rely on efficient algorithms for various problems that arise in public transport networks. In particular, our study focused on designing linear-time algorithms for two fundamental path problems: the earliest arrival time (\textsc{eat}) and the fastest path duration (\textsc{fpd}) on public transportation data. We conduct a comparative analysis with state-of-the-art…

    Submitted 30 April, 2024; originally announced April 2024.

  18. arXiv:2404.16326  [pdf, other]

    cs.LG

    NeuroKoopman Dynamic Causal Discovery

    Authors: Rahmat Adesunkanmi, Balaji Sesha Srikanth Pokuri, Ratnesh Kumar

    Abstract: In many real-world applications where the system dynamics has an underlying interdependency among its variables (such as power grid, economics, neuroscience, omics networks, environmental ecosystems, and others), one is often interested in knowing whether the past values of one time series influences the future of another, known as Granger causality, and the associated underlying dynamics. This pa…

    Submitted 25 April, 2024; originally announced April 2024.

  19. arXiv:2404.11819  [pdf, other]

    cs.CV

    Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement

    Authors: Pushkar Shukla, Dhruv Srikanth, Lee Cohen, Matthew Turk

    Abstract: We propose a novel approach to mitigate biases in computer vision models by utilizing counterfactual generation and fine-tuning. While counterfactuals have been used to analyze and address biases in DNN models, the counterfactuals themselves are often generated from biased generative models, which can introduce additional biases or spurious correlations. To address this issue, we propose using adv…

    Submitted 27 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  20. arXiv:2404.11717  [pdf, other]

    cs.CL

    How often are errors in natural language reasoning due to paraphrastic variability?

    Authors: Neha Srikanth, Marine Carpuat, Rachel Rudinger

    Abstract: Large language models have been shown to behave inconsistently in response to meaning-preserving paraphrastic inputs. At the same time, researchers evaluate the knowledge and reasoning abilities of these models with test evaluations that do not disaggregate the effect of paraphrastic variability on performance. We propose a metric for evaluating the paraphrastic consistency of natural language rea…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: accepted to TACL 2024 (pre-MIT Press publication version)

  21. arXiv:2403.20305  [pdf, ps, other]

    cs.CC

    Local Correction of Linear Functions over the Boolean Cube

    Authors: Prashanth Amireddy, Amik Raj Behera, Manaswi Paraashar, Srikanth Srinivasan, Madhu Sudan

    Abstract: We consider the task of locally correcting, and locally list-correcting, multivariate linear functions over the domain $\{0,1\}^n$ over arbitrary fields and more generally Abelian groups. Such functions form error-correcting codes of relative distance $1/2$ and we give local-correction algorithms correcting up to nearly $1/4$-fraction errors making $\widetilde{\mathcal{O}}(\log n)$ queries. This q…

    Submitted 25 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: 61 pages, To Appear in the Proceedings of the 56th Annual ACM Symposium on Theory of Computing, June 24-28 2024, Vancouver, Canada. Added a remark on local testing in the revision

  22. arXiv:2403.19816  [pdf, other]

    cs.LG eess.SP

    The State of Lithium-Ion Battery Health Prognostics in the CPS Era

    Authors: Gaurav Shinde, Rohan Mohapatra, Pooja Krishan, Harish Garg, Srikanth Prabhu, Sanchari Das, Mohammad Masum, Saptarshi Sengupta

    Abstract: Lithium-ion batteries (Li-ion) have revolutionized energy storage technology, becoming integral to our daily lives by powering a diverse range of devices and applications. Their high energy density, fast power response, recyclability, and mobility advantages have made them the preferred choice for numerous sectors. This paper explores the seamless integration of Prognostics and Health Management w…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 18 pages, 12 figures, 6 tables. arXiv admin note: text overlap with arXiv:2310.00023

    MSC Class: 68; ACM Class: B.8.1

  23. arXiv:2403.13188  [pdf, ps, other]

    cs.CV cs.RO eess.IV

    Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation

    Authors: Kasi Viswanath, Peng Jiang, Srikanth Saripalli

    Abstract: LiDAR semantic segmentation frameworks predominantly leverage geometry-based features to differentiate objects within a scan. While these methods excel in scenarios with clear boundaries and distinct shapes, their performance declines in environments where boundaries are blurred, particularly in off-road contexts. To address this, recent strides in 3D segmentation algorithms have focused on harnes…

    Submitted 19 March, 2024; originally announced March 2024.

  24. arXiv:2403.11367  [pdf, other]

    cs.CV cs.GR cs.RO

    3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization

    Authors: Peng Jiang, Gaurav Pandey, Srikanth Saripalli

    Abstract: This paper presents a novel system designed for 3D mapping and visual relocalization using 3D Gaussian Splatting. Our proposed method uses LiDAR and camera data to create accurate and visually plausible representations of the environment. By leveraging LiDAR data to initiate the training of the 3D Gaussian Splatting map, our system constructs maps that are both detailed and geometrically accurate.…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 8 pages, 7 figures

  25. arXiv:2403.10205  [pdf, other]

    cs.CL cs.AI

    Read between the lines -- Functionality Extraction From READMEs

    Authors: Prince Kumar, Srikanth Tamilselvam, Dinesh Garg

    Abstract: While text summarization is a well-known NLP task, in this paper, we introduce a novel and useful variant of it called functionality extraction from Git README files. Though this task is a text2text generation at an abstract level, it involves its own peculiarities and challenges making existing text2text generation systems not very useful. The motivation behind this task stems from a recent surge…

    Submitted 15 March, 2024; originally announced March 2024.

  26. arXiv:2402.11760  [pdf, other]

    cs.LG cs.CV

    Reinforcement Learning as a Parsimonious Alternative to Prediction Cascades: A Case Study on Image Segmentation

    Authors: Bharat Srikishan, Anika Tabassum, Srikanth Allu, Ramakrishnan Kannan, Nikhil Muralidhar

    Abstract: Deep learning architectures have achieved state-of-the-art (SOTA) performance on computer vision tasks such as object detection and image segmentation. This may be attributed to the use of over-parameterized, monolithic deep learning architectures executed on large datasets. Although such architectures lead to increased accuracy, this is usually accompanied by a large increase in computation and m…

    Submitted 18 February, 2024; originally announced February 2024.

  27. arXiv:2402.08769  [pdf, other]

    cs.LG cs.DC

    FLASH: Federated Learning Across Simultaneous Heterogeneities

    Authors: Xiangyu Chang, Sk Miraj Ahmed, Srikanth V. Krishnamurthy, Basak Guler, Ananthram Swami, Samet Oymak, Amit K. Roy-Chowdhury

    Abstract: The key premise of federated learning (FL) is to train ML models across a diverse set of data-owners (clients), without exchanging local data. An overarching challenge to this date is client heterogeneity, which may arise not only from variations in data distribution, but also in data quality, as well as compute/communication latency. An integrated view of these diverse and concurrent sources of h…

    Submitted 13 February, 2024; originally announced February 2024.

  28. arXiv:2402.07118  [pdf, other]

    cs.HC cs.AI cs.LG eess.IV eess.SP

    Next-Generation Teleophthalmology: AI-enabled Quality Assessment Aiding Remote Smartphone-based Consultation

    Authors: Dhruv Srikanth, Jayang Gurung, N Satya Deepika, Vineet Joshi, Pravin Vaddavalli, Soumya Jana

    Abstract: Blindness and other eye diseases are a global health concern, particularly in low- and middle-income countries like India. In this regard, during the COVID-19 pandemic, teleophthalmology became a lifeline, and the Grabi attachment for smartphone-based eye imaging gained in use. However, quality of user-captured image often remained inadequate, requiring clinician vetting and delays. In this backdr…

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 4 pages, Submitted to IEEE EMBC 2024

  29. arXiv:2402.01968  [pdf, other]

    cs.MA cs.AI cs.LG

    A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions

    Authors: Hung Du, Srikanth Thudumu, Rajesh Vasa, Kon Mouzakis

    Abstract: Research interest in autonomous agents is on the rise as an emerging topic. The notable achievements of Large Language Models (LLMs) have demonstrated the considerable potential to attain human-like intelligence in autonomous agents. However, the challenge lies in enabling these agents to learn, reason, and navigate uncertainties in dynamic environments. Context awareness emerges as a pivotal elem…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 11 pages, 1 figure

  30. arXiv:2401.08138  [pdf, other]

    cs.SE cs.AI

    LLMs for Test Input Generation for Semantic Caches

    Authors: Zafaryab Rasool, Scott Barnett, David Willie, Stefanus Kurniawan, Sherwin Balugo, Srikanth Thudumu, Mohamed Abdelrazek

    Abstract: Large language models (LLMs) enable state-of-the-art semantic capabilities to be added to software systems such as semantic search of unstructured documents and text generation. However, these models are computationally expensive. At scale, the cost of serving thousands of users increases massively affecting also user experience. To address this problem, semantic caches are used to check for answe…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted in International Conference on AI Engineering Software Engineering (CAIN 2024)

  31. arXiv:2401.05856  [pdf, other]

    cs.SE cs.AI

    Seven Failure Points When Engineering a Retrieval Augmented Generation System

    Authors: Scott Barnett, Stefanus Kurniawan, Srikanth Thudumu, Zach Brannelly, Mohamed Abdelrazek

    Abstract: Software engineers are increasingly adding semantic search capabilities to applications using a strategy known as Retrieval Augmented Generation (RAG). A RAG system involves finding documents that semantically match a query and then passing the documents to a large language model (LLM) such as ChatGPT to extract the right answer using an LLM. RAG systems aim to: a) reduce the problem of hallucinat…

    Submitted 11 January, 2024; originally announced January 2024.

  32. arXiv:2401.04130  [pdf, other]

    cs.LG cs.AI

    Plug-and-Play Transformer Modules for Test-Time Adaptation

    Authors: Xiangyu Chang, Sk Miraj Ahmed, Srikanth V. Krishnamurthy, Basak Guler, Ananthram Swami, Samet Oymak, Amit K. Roy-Chowdhury

    Abstract: Parameter-efficient tuning (PET) methods such as LoRA, Adapter, and Visual Prompt Tuning (VPT) have found success in enabling adaptation to new domains by tuning small modules within a transformer model. However, the number of domains encountered during test time can be very large, and the data is usually unlabeled. Thus, adaptation to new domains is challenging; it is also impractical to generate…

    Submitted 8 February, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  33. arXiv:2401.01439  [pdf, other]

    cs.CV cs.RO

    Off-Road LiDAR Intensity Based Semantic Segmentation

    Authors: Kasi Viswanath, Peng Jiang, Sujit PB, Srikanth Saripalli

    Abstract: LiDAR is used in autonomous driving to provide 3D spatial information and enable accurate perception in off-road environments, aiding in obstacle detection, mapping, and path planning. Learning-based LiDAR semantic segmentation utilizes machine learning techniques to automatically classify objects and regions in LiDAR point clouds. Learning-based models struggle in off-road environments due to the…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted to ISER 2023

  34. arXiv:2312.02200  [pdf, other]

    cs.CV cs.AI stat.AP

    An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets

    Authors: Maya Srikanth, Jeremy Irvin, Brian Wesley Hill, Felipe Godoy, Ishan Sabane, Andrew Y. Ng

    Abstract: Major advancements in computer vision can primarily be attributed to the use of labeled datasets. However, acquiring labels for datasets often results in errors which can harm model performance. Recent works have proposed methods to automatically identify mislabeled images, but developing strategies to effectively implement them in real world datasets has been sparsely explored. Towards improved d…

    Submitted 2 December, 2023; originally announced December 2023.

  35. arXiv:2312.00434  [pdf, other]

    cs.LG cs.AI cs.CY

    PEFTDebias : Capturing debiasing information using PEFTs

    Authors: Sumit Agarwal, Aditya Srikanth Veerubhotla, Srijan Bansal

    Abstract: The increasing use of foundation models highlights the urgent need to address and eliminate implicit biases present in them that arise during pretraining. In this paper, we introduce PEFTDebias, a novel approach that employs parameter-efficient fine-tuning (PEFT) to mitigate the biases within foundation models. PEFTDebias consists of two main phases: an upstream phase for acquiring debiasing param…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023

  36. arXiv:2311.15072  [pdf, other]

    cs.CV cs.AI

    Introducing SSBD+ Dataset with a Convolutional Pipeline for detecting Self-Stimulatory Behaviours in Children using raw videos

    Authors: Vaibhavi Lokegaonkar, Vijay Jaisankar, Pon Deepika, Madhav Rao, T K Srikanth, Sarbani Mallick, Manjit Sodhi

    Abstract: Conventionally, evaluation for the diagnosis of Autism spectrum disorder is done by a trained specialist through questionnaire-based formal assessments and by observation of behavioral cues under various settings to capture the early warning signs of autism. These evaluation techniques are highly subjective and their accuracy relies on the experience of the specialist. In this regard, machine lear…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  37. arXiv:2311.14786  [pdf, other]

    cs.CV cs.AI cs.RO

    GPT-4V Takes the Wheel: Promises and Challenges for Pedestrian Behavior Prediction

    Authors: Jia Huang, Peng Jiang, Alvika Gautam, Srikanth Saripalli

    Abstract: Predicting pedestrian behavior is the key to ensure safety and reliability of autonomous vehicles. While deep learning methods have been promising by learning from annotated video frame sequences, they often fail to fully grasp the dynamic interactions between pedestrians and traffic, crucial for accurate predictions. These models also lack nuanced common sense reasoning. Moreover, the manual anno…

    Submitted 25 January, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

  38. arXiv:2311.12779  [pdf, other]

    cs.NI cs.GT

    Finding Adversarial Inputs for Heuristics using Multi-level Optimization

    Authors: Pooria Namyar, Behnaz Arzani, Ryan Beckett, Santiago Segarra, Himanshu Raj, Umesh Krishnaswamy, Ramesh Govindan, Srikanth Kandula

    Abstract: Production systems use heuristics because they are faster or scale better than their optimal counterparts. Yet, practitioners are often unaware of the performance gap between a heuristic and the optimum or between two heuristics in realistic scenarios. We present MetaOpt, a system that helps analyze heuristics. Users specify the heuristic and the optimal (or another heuristic) as input, and MetaOp…

    Submitted 21 November, 2023; originally announced November 2023.

  39. arXiv:2311.09542  [pdf, other]

    cs.CL

    Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering

    Authors: Neha Srikanth, Rupak Sarkar, Heran Mane, Elizabeth M. Aparicio, Quynh C. Nguyen, Rachel Rudinger, Jordan Boyd-Graber

    Abstract: Questions posed by information-seeking users often contain implicit false or potentially harmful assumptions. In a high-risk domain such as maternal and infant health, a question-answering system must recognize these pragmatic constraints and go beyond simply answering user questions, examining them in context to respond helpfully. To achieve this, we study assumptions and implications, or pragmat…

    Submitted 2 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024

  40. arXiv:2311.08402  [pdf, other]

    cs.CL cs.IR cs.SD eess.AS

    Retrieve and Copy: Scaling ASR Personalization to Large Catalogs

    Authors: Sai Muralidhar Jayanthi, Devang Kulshreshtha, Saket Dingliwal, Srikanth Ronanki, Sravan Bodapati

    Abstract: Personalization of automatic speech recognition (ASR) models is a widely studied topic because of its many practical applications. Most recently, attention-based contextual biasing techniques are used to improve the recognition of rare words and domain specific entities. However, due to performance constraints, the biasing is often limited to a few thousand entities, restricting real-world usabili…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  41. arXiv:2311.02482  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Generalized zero-shot audio-to-intent classification

    Authors: Veera Raghavendra Elluru, Devang Kulshreshtha, Rohit Paturi, Sravan Bodapati, Srikanth Ronanki

    Abstract: Spoken language understanding systems using audio-only data are gaining popularity, yet their ability to handle unseen intents remains limited. In this study, we propose a generalized zero-shot audio-to-intent classification framework with only a few sample text sentences per intent. To achieve this, we first train a supervised audio-to-intent classifier by making use of a self-supervised pre-trai…

    Submitted 4 November, 2023; originally announced November 2023.

  42. arXiv:2310.17620  [pdf, other]

    cs.RO

    Radar-Only Off-Road Local Navigation

    Authors: Timothy Overbye, Srikanth Saripalli

    Abstract: Off-road robotics have traditionally utilized lidar for local navigation due to its accuracy and high resolution. However, the limitations of lidar, such as reduced performance in harsh environmental conditions and limited range, have prompted the exploration of alternative sensing technologies. This paper investigates the potential of radar for off-road local navigation, as it offers the advantag…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 7 pages, 17 figures, ITSC 2023

  43. arXiv:2310.13551  [pdf, other]

    cs.CV cs.RO

    ROSS: Radar Off-road Semantic Segmentation

    Authors: Peng Jiang, Srikanth Saripalli

    Abstract: As the demand for autonomous navigation in off-road environments increases, the need for effective solutions to understand these surroundings becomes essential. In this study, we confront the inherent complexities of semantic segmentation in RADAR data for off-road scenarios. We present a novel pipeline that utilizes LIDAR data and an existing annotated off-road LIDAR dataset for generating RADAR…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 figures, accepted by the 18th International Symposium on Experimental Robotics (ISER 2023)

  44. Online Multi-IMU Calibration Using Visual-Inertial Odometry

    Authors: Jacob Hartzer, Srikanth Saripalli

    Abstract: This work presents a centralized multi-IMU filter framework with online intrinsic and extrinsic calibration for unsynchronized inertial measurement units that is robust against changes in calibration parameters. The novel EKF-based method estimates the positional and rotational offsets of the system of sensors as well as their intrinsic biases without the use of rigid body geometric constraints. A…

    Submitted 3 January, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  45. arXiv:2310.09699  [pdf, other]

    cs.NI cs.DC

    Solving Max-Min Fair Resource Allocations Quickly on Large Graphs

    Authors: Pooria Namyar, Behnaz Arzani, Srikanth Kandula, Santiago Segarra, Daniel Crankshaw, Umesh Krishnaswamy, Ramesh Govindan, Himanshu Raj

    Abstract: We consider the max-min fair resource allocation problem. The best-known solutions use either a sequence of optimizations or waterfilling, which only applies to a narrow set of cases. These solutions have become a practical bottleneck in WAN traffic engineering and cluster scheduling, especially at larger problem sizes. We improve both approaches: (1) we show how to convert the optimization sequen…

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted to USENIX NSDI 2024

  46. arXiv:2309.16057  [pdf, other]

    cs.RO

    WiDEVIEW: An UltraWideBand and Vision Dataset for Deciphering Pedestrian-Vehicle Interactions

    Authors: Jia Huang, Alvika Gautam, Junghun Choi, Srikanth Saripalli

    Abstract: Robust and accurate tracking and localization of road users like pedestrians and cyclists is crucial to ensure safe and effective navigation of Autonomous Vehicles (AVs), particularly so in urban driving scenarios with complex vehicle-pedestrian interactions. Existing datasets that are useful to investigate vehicle-pedestrian interactions are mostly image-centric and thus vulnerable to vision fail…

    Submitted 27 September, 2023; originally announced September 2023.

  47. arXiv:2309.11414  [pdf, other]

    cs.RO cs.AI cs.LG

    EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning

    Authors: Kallol Saha, Vishal Mandadi, Jayaram Reddy, Ajit Srikanth, Aditya Agarwal, Bipasha Sen, Arun Singh, Madhava Krishna

    Abstract: Classical motion planning for robotic manipulation includes a set of general algorithms that aim to minimize a scene-specific cost of executing a given plan. This approach offers remarkable adaptability, as they can be directly used off-the-shelf for any new scene without needing specific training datasets. However, without a prior understanding of what diverse valid trajectories are and without s…

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 8 pages, 8 figures, submitted to ICRA 2024 (International Conference on Robotics and Automation)

  48. arXiv:2309.10047  [pdf, other]

    cs.LG cs.DB

    A Modular Spatial Clustering Algorithm with Noise Specification

    Authors: Akhil K, Srikanth H R

    Abstract: Clustering techniques have been the key drivers of data mining, machine learning and pattern recognition for decades. One of the most popular clustering algorithms is DBSCAN due to its high accuracy and noise tolerance. Many superior algorithms such as DBSCAN have input parameters that are hard to estimate. Therefore, finding those parameters is a time consuming process. In this paper, we propose…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Presented at International Conference for Machine Learning and Data Science 2018

  49. arXiv:2309.08703  [pdf, ps, other]

    cs.CC math.CO

    The discrepancy of greater-than

    Authors: Srikanth Srinivasan, Amir Yehudayoff

    Abstract: The discrepancy of the $n \times n$ greater-than matrix is shown to be $\frac{\pi}{2 \ln n}$ up to lower order terms.

    Submitted 15 September, 2023; originally announced September 2023.

  50. arXiv:2308.14670  [pdf, other]

    cs.RO

    Symmetric Models for Visual Force Policy Learning

    Authors: Colin Kohler, Anuj Shrivatsav Srikanth, Eshan Arora, Robert Platt

    Abstract: While it is generally acknowledged that force feedback is beneficial to robotic control, applications of policy learning to robotic manipulation typically only leverage visual feedback. Recently, symmetric neural models have been used to significantly improve the sample efficiency and performance of policy learning across a variety of robotic manipulation domains. This paper explores an applicatio…

    Submitted 28 August, 2023; originally announced August 2023.