Skip to main content

Showing 1–50 of 152 results for author: Deshpande, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19150  [pdf, other

    cs.CV cs.AI cs.IR

    RAVEN: Multitask Retrieval Augmented Vision-Language Learning

    Authors: Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju

    Abstract: The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is under explored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resour… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.03142  [pdf, ps, other

    cs.LG

    On the Power of Randomization in Fair Classification and Representation

    Authors: Sushant Agarwal, Amit Deshpande

    Abstract: Fair classification and fair representation learning are two important problems in supervised and unsupervised fair machine learning, respectively. Fair classification asks for a classifier that maximizes accuracy on a given data distribution subject to fairness constraints. Fair representation maps a given data distribution over the original feature space to a distribution over a new representati… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Appeared in ACM FAccT 2022

  3. arXiv:2405.19307  [pdf, other

    cs.RO

    Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels

    Authors: Abhay Deshpande, Liyiming Ke, Quinn Pfeifer, Abhishek Gupta, Siddhartha S. Srinivasa

    Abstract: We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by l… ▽ More

    Submitted 3 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.04325  [pdf, other

    cs.CL

    Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation

    Authors: Atharvan Dogra, Ameet Deshpande, John Nay, Tanmay Rajpurohit, Ashwin Kalyan, Balaraman Ravindran

    Abstract: Recent developments in large language models (LLMs), while offering a powerful foundation for develo** natural language agents, raise safety concerns about them and the autonomous agents built upon them. Deception is one potential capability of AI agents of particular concern, which we refer to as an act or statement that misleads, hides the truth, or promotes a belief that is not true in its en… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2405.01573  [pdf, other

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: A**kya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  6. arXiv:2404.08555  [pdf, other

    cs.LG cs.AI cs.CL

    RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

    Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

    Abstract: State-of-the-art large language models (LLMs) have become indispensable tools for various tasks. However, training LLMs to serve as effective assistants for humans requires careful consideration. A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hal… ▽ More

    Submitted 15 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  7. arXiv:2403.05749  [pdf, other

    eess.SY cs.DM

    Characterizing Flow Complexity in Transportation Networks using Graph Homology

    Authors: Shashank A Deshpande, Hamsa Balakrishnan

    Abstract: Series-parallel network topologies generally exhibit simplified dynamical behavior and avoid high combinatorial complexity. A comprehensive analysis of how flow complexity emerges with a graph's deviation from series-parallel topology is therefore of fundamental interest. We introduce the notion of a robust $k$-path on a directed acycylic graph, with increasing values of the length $k$ reflecting… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures, letter

  8. arXiv:2402.11741  [pdf, other

    cs.DS cs.CC cs.DB cs.DC

    To Store or Not to Store: a graph theoretical approach for Dataset Versioning

    Authors: Anxin Guo, **gwei Li, Pattara Sukprasert, Samir Khuller, Amol Deshpande, Koyel Mukherjee

    Abstract: In this work, we study the cost efficient data versioning problem, where the goal is to optimize the storage and reconstruction (retrieval) costs of data versions, given a graph of datasets as nodes and edges capturing edit/delta information. One central variant we study is MinSum Retrieval (MSR) where the goal is to minimize the total retrieval costs, while kee** the storage costs bounded. This… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted by IPDPS 2024

  9. arXiv:2402.06733  [pdf, other

    cs.CL cs.AI cs.LG

    NICE: To Optimize In-Context Examples or Not?

    Authors: Pragya Srivastava, Satvik Golechha, Amit Deshpande, Amit Sharma

    Abstract: Recent work shows that in-context learning and optimization of in-context examples (ICE) can significantly improve the accuracy of large language models (LLMs) on a wide range of tasks, leading to an apparent consensus that ICE optimization is crucial for better performance. However, most of these studies assume a fixed or no instruction provided in the prompt. We challenge this consensus by inves… ▽ More

    Submitted 6 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted as a full paper (9 pages) at ACL 2024 (Main)

    Journal ref: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics 2024 (Volume 1: Long Papers)

  10. arXiv:2312.16735  [pdf, other

    cs.DB cs.DC

    Flock: A Low-Cost Streaming Query Engine on FaaS Platforms

    Authors: Gang Liao, Amol Deshpande, Daniel J. Abadi

    Abstract: Existing serverless data analytics systems rely on external storage services like S3 for data shuffling and communication between cloud functions. While this approach provides the elasticity benefits of serverless computing, it incurs additional latency and cost overheads. We present Flock, a novel cloud-native streaming query engine that leverages the on-demand scalability of FaaS platforms for r… ▽ More

    Submitted 21 April, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

  11. arXiv:2312.10534  [pdf, other

    cs.LG cs.CR cs.CV

    Rethinking Robustness of Model Attributions

    Authors: Sandesh Kamath, Sankalp Mittal, Amit Deshpande, Vineeth N Balasubramanian

    Abstract: For machine learning models to be reliable and trustworthy, their decisions must be interpretable. As these models find increasing use in safety-critical applications, it is important that not just the model predictions but also their explanations (as feature attributions) be robust to small human-imperceptible input perturbations. Recent works have shown that many attribution methods are fragile… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted AAAI 2024

  12. arXiv:2312.10396  [pdf, ps, other

    cs.LG cs.AI

    How Far Can Fairness Constraints Help Recover From Biased Data?

    Authors: Mohit Sharma, Amit Deshpande

    Abstract: A general belief in fair classification is that fairness constraints incur a trade-off with accuracy, which biased data may worsen. Contrary to this belief, Blum & Stangl (2019) show that fair classification with equal opportunity constraints even on extremely biased data can recover optimally accurate and fair classifiers on the original data distribution. Their result is interesting because it d… ▽ More

    Submitted 1 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted for publication at ICML 2024

  13. arXiv:2312.05323  [pdf, other

    cs.RO

    BaRiFlex: A Robotic Gripper with Versatility and Collision Robustness for Robot Learning

    Authors: Gu-Cheol Jeong, Arpit Bahety, Gabriel Pedraza, Ashish D. Deshpande, Roberto Martín-Martín

    Abstract: We present a new approach to robot hand design specifically suited for successfully implementing robot learning methods to accomplish tasks in daily human environments. We introduce BaRiFlex, an innovative gripper design that alleviates the issues caused by unexpected contact and collisions during robot learning, offering robustness, gras** versatility, task versatility, and simplicity to the le… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 8 pages, 6 figures, project website: https://robin-lab.cs.utexas.edu/bariflex/

  14. arXiv:2312.04294  [pdf, ps, other

    cs.NI

    Energy-Efficient Internet of Things Monitoring with Content-Based Wake-Up Radio

    Authors: Anay Ajit Deshpande, Federico Chiariotti, Andrea Zanella

    Abstract: The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled. However, polling-based WUR may still lead to wasted energy if values sensed by the polled sensors provide no new information to the… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  15. arXiv:2312.00348  [pdf, other

    cs.CV

    Student Activity Recognition in Classroom Environments using Transfer Learning

    Authors: Anagha Deshpande, Vedant Deshpande

    Abstract: The recent advances in artificial intelligence and deep learning facilitate automation in various applications including home automation, smart surveillance systems, and healthcare among others. Human Activity Recognition is one of its emerging applications, which can be implemented in a classroom environment to enhance safety, efficiency, and overall educational quality. This paper proposes a sys… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 6 pages, 12 figures, accepted at the IEEE International Conference on Computational Intelligence, Networks and Security (ICCINS) 2023

  16. arXiv:2311.09735  [pdf, other

    cs.LG cs.IR

    GEO: Generative Engine Optimization

    Authors: Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande

    Abstract: The advent of large language models (LLMs) has ushered in a new paradigm of search engines that use generative models to gather and summarize information to answer user queries. This emerging technology, which we formalize under the unified framework of generative engines (GEs), can generate accurate and personalized responses, rapidly replacing traditional search engines like Google and Bing. Gen… ▽ More

    Submitted 28 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to KDD 2024

  17. arXiv:2311.04892  [pdf, other

    cs.CL

    Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

    Authors: Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot

    Abstract: Recent works have showcased the ability of LLMs to embody diverse personas in their responses, exemplified by prompts like 'You are Yoda. Explain the Theory of Relativity.' While this ability allows personalization of LLMs and enables human behavior simulation, its effect on LLMs' capabilities remains unclear. To fill this gap, we present the first extensive study of the unintended side-effects of… ▽ More

    Submitted 27 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Project page: https://allenai.github.io/persona-bias. Paper to appear at ICLR 2024. Added results for other LLMs in v2 (similar findings)

  18. arXiv:2311.02807  [pdf, other

    cs.LG cs.AI cs.CL

    QualEval: Qualitative Evaluation for Model Improvement

    Authors: Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik Narasimhan, Ashwin Kalyan

    Abstract: Quantitative evaluation metrics have traditionally been pivotal in gauging the advancements of artificial intelligence systems, including large language models (LLMs). However, these metrics have inherent limitations. Given the intricate nature of real-world tasks, a single scalar to quantify and compare is insufficient to capture the fine-grained nuances of model behavior. Metrics serve only as a… ▽ More

    Submitted 5 May, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  19. arXiv:2310.12972  [pdf, other

    cs.RO

    CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning

    Authors: Liyiming Ke, Yunchu Zhang, Abhay Deshpande, Siddhartha Srinivasa, Abhishek Gupta

    Abstract: We present a new technique to enhance the robustness of imitation learning methods by generating corrective data to account for compounding errors and disturbances. While existing methods rely on interactive expert labeling, additional offline datasets, or domain-specific invariances, our approach requires minimal additional assumptions beyond access to expert data. The key insight is to leverage… ▽ More

    Submitted 3 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  20. arXiv:2310.10294  [pdf, other

    cs.CL cs.AI

    Key-phrase boosted unsupervised summary generation for FinTech organization

    Authors: Aadit Deshpande, Shreya Goyal, Prateek Nagwanshi, Avinash Tripathy

    Abstract: With the recent advances in social media, the use of NLP techniques in social media data analysis has become an emerging research direction. Business organizations can particularly benefit from such an analysis of social media discourse, providing an external perspective on consumer behavior. Some of the NLP applications such as intent detection, sentiment classification, text summarization can he… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures

  21. arXiv:2310.01892  [pdf, ps, other

    cs.LG cs.AI

    FiGURe: Simple and Efficient Unsupervised Node Representations with Filter Augmentations

    Authors: Chanakya Ekbote, A**kya Pankaj Deshpande, Arun Iyer, Ramakrishna Bairi, Sundararajan Sellamanickam

    Abstract: Unsupervised node representations learnt using contrastive learning-based methods have shown good performance on downstream tasks. However, these methods rely on augmentations that mimic low-pass filters, limiting their performance on tasks requiring different eigen-spectrum parts. This paper presents a simple filter-based augmentation method to capture different parts of the eigen-spectrum. We sh… ▽ More

    Submitted 4 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  22. arXiv:2309.03750  [pdf, other

    cs.CV

    PBP: Path-based Trajectory Prediction for Autonomous Driving

    Authors: Sepideh Afshar, Nachiket Deo, Akshay Bhagat, Titas Chakraborty, Yunming Shao, Balarama Raju Buddharaju, Adwait Deshpande, Henggang Cui

    Abstract: Trajectory prediction plays a crucial role in the autonomous driving stack by enabling autonomous vehicles to anticipate the motion of surrounding agents. Goal-based prediction models have gained traction in recent years for addressing the multimodal nature of future trajectories. Goal-based prediction models simplify multimodal prediction by first predicting 2D goal locations of agents and then p… ▽ More

    Submitted 2 March, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Published at ICRA 2024; Sepideh Afshar and Nachiket Deo contributed equally

  23. arXiv:2309.02710  [pdf, ps, other

    cs.LG cs.CG cs.DS

    Improved Outlier Robust Seeding for k-means

    Authors: Amit Deshpande, Rameshwar Pratap

    Abstract: The $k$-means is a popular clustering objective, although it is inherently non-robust and sensitive to outliers. Its popular seeding or initialization called $k$-means++ uses $D^{2}$ sampling and comes with a provable $O(\log k)$ approximation guarantee \cite{AV2007}. However, in the presence of adversarial noise or outliers, $D^{2}$ sampling is more likely to pick centers from distant outliers in… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  24. arXiv:2309.00133  [pdf, other

    cs.CV

    Distraction-free Embeddings for Robust VQA

    Authors: Atharvan Dogra, Deeksha Varshney, Ashwin Kalyan, Ameet Deshpande, Neeraj Kumar

    Abstract: The generation of effective latent representations and their subsequent refinement to incorporate precise information is an essential prerequisite for Vision-Language Understanding (VLU) tasks such as Video Question Answering (VQA). However, most existing methods for VLU focus on sparsely sampling or fine-graining the input information (e.g., sampling a sparse set of frames or text tokens), or add… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

  25. arXiv:2308.13242  [pdf, other

    cs.LG cs.CY cs.IR

    Optimizing Group-Fair Plackett-Luce Ranking Models for Relevance and Ex-Post Fairness

    Authors: Sruthi Gorantla, Eshaan Bhansali, Amit Deshpande, Anand Louis

    Abstract: In learning-to-rank (LTR), optimizing only the relevance (or the expected ranking utility) can cause representational harm to certain categories of items. Moreover, if there is implicit bias in the relevance scores, LTR models may fail to optimize for true relevance. Previous works have proposed efficient algorithms to train stochastic ranking models that achieve fairness of exposure to the groups… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 20 pages

  26. Low-Latency Massive Access with Multicast Wake Up Radio

    Authors: Anay Ajit Deshpande, Federico Chiariotti, Andrea Zanella

    Abstract: The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled, saving energy. However, polling-based Time Division Multiple Access (TDMA) may significantly increase data transmission delay if pac… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 2023 21st Mediterranean Communication and Computer Networking Conference (MedComNet)

  27. arXiv:2307.08593  [pdf, other

    physics.acc-ph cs.LG hep-ex nucl-ex nucl-th

    Artificial Intelligence for the Electron Ion Collider (AI4EIC)

    Authors: C. Allaire, R. Ammendola, E. -C. Aschenauer, M. Balandat, M. Battaglieri, J. Bernauer, M. Bondì, N. Branson, T. Britton, A. Butter, I. Chahrour, P. Chatagnon, E. Cisbani, E. W. Cline, S. Dash, C. Dean, W. Deconinck, A. Deshpande, M. Diefenthaler, R. Ent, C. Fanelli, M. Finger, M. Finger, Jr., E. Fol, S. Furletov , et al. (70 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 27 pages, 11 figures, AI4EIC workshop, tutorials and hackathon

  28. arXiv:2307.00259  [pdf, other

    cs.CL cs.AI

    InstructEval: Systematic Evaluation of Instruction Selection Methods

    Authors: Anirudh Ajith, Chris Pan, Mengzhou Xia, Ameet Deshpande, Karthik Narasimhan

    Abstract: In-context learning (ICL) performs tasks by prompting a large language model (LLM) using an instruction and a small set of annotated examples called demonstrations. Recent work has shown that precise details of the inputs used in the ICL prompt significantly impact performance, which has incentivized instruction selection algorithms. The effect of instruction-choice however is severely underexplor… ▽ More

    Submitted 16 July, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: 8 content pages + 3 pages of supplementary material, 3 figures, 10 tables

  29. arXiv:2306.11964  [pdf, other

    cs.CY cs.DS cs.IR cs.LG stat.ML

    Sampling Individually-Fair Rankings that are Always Group Fair

    Authors: Sruthi Gorantla, Anay Mehrotra, Amit Deshpande, Anand Louis

    Abstract: Rankings on online platforms help their end-users find the relevant information -- people, news, media, and products -- quickly. Fair ranking tasks, which ask to rank a set of items to maximize utility subject to satisfying group-fairness constraints, have gained significant interest in the Algorithmic Fairness, Information Retrieval, and Machine Learning literature. Recent works, however, identif… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Full version of a paper accepted for presentation in ACM AIES 2023

  30. arXiv:2306.11072  [pdf, other

    cs.LG

    Causal Effect Regularization: Automated Detection and Removal of Spurious Attributes

    Authors: Abhinav Kumar, Amit Deshpande, Amit Sharma

    Abstract: In many classification datasets, the task labels are spuriously correlated with some input attributes. Classifiers trained on such datasets often rely on these attributes for prediction, especially when the spurious correlation is high, and thus fail to generalize whenever there is a shift in the attributes' correlation at deployment. If we assume that the spurious attributes are known a priori, s… ▽ More

    Submitted 7 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  31. arXiv:2305.15093  [pdf, other

    cs.CL cs.AI cs.LG

    C-STS: Conditional Semantic Textual Similarity

    Authors: Ameet Deshpande, Carlos E. Jimenez, Howard Chen, Vishvak Murahari, Victoria Graf, Tanmay Rajpurohit, Ashwin Kalyan, Danqi Chen, Karthik Narasimhan

    Abstract: Semantic textual similarity (STS), a cornerstone task in NLP, measures the degree of similarity between a pair of sentences, and has broad application in fields such as information retrieval and natural language understanding. However, sentence similarity can be inherently ambiguous, depending on the specific aspect of interest. We resolve this ambiguity by proposing a novel task called Conditiona… ▽ More

    Submitted 6 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Published in EMNLP 2023

  32. arXiv:2305.14784  [pdf, other

    cs.AI cs.CL cs.CY cs.LG

    Anthropomorphization of AI: Opportunities and Risks

    Authors: Ameet Deshpande, Tanmay Rajpurohit, Karthik Narasimhan, Ashwin Kalyan

    Abstract: Anthropomorphization is the tendency to attribute human-like traits to non-human entities. It is prevalent in many social contexts -- children anthropomorphize toys, adults do so with brands, and it is a literary device. It is also a versatile tool in science, with behavioral psychology and evolutionary biology meticulously documenting its consequences. With widespread adoption of AI systems, and… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  33. arXiv:2304.05335  [pdf, other

    cs.CL cs.AI cs.LG

    Toxicity in ChatGPT: Analyzing Persona-assigned Language Models

    Authors: Ameet Deshpande, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan

    Abstract: Large language models (LLMs) have shown incredible capabilities and transcended the natural language processing (NLP) community, with adoption throughout many services like healthcare, therapy, education, and customer service. Since users include people with critical information needs like students or patients engaging with chatbots, the safety of these systems is of prime importance. Therefore, a… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  34. arXiv:2303.05508  [pdf, other

    cs.RO

    Cherry-Picking with Reinforcement Learning : Robust Dynamic Gras** in Unstable Conditions

    Authors: Yunchu Zhang, Liyiming Ke, Abhay Deshpande, Abhishek Gupta, Siddhartha Srinivasa

    Abstract: Gras** small objects surrounded by unstable or non-rigid material plays a crucial role in applications such as surgery, harvesting, construction, disaster recovery, and assisted feeding. This task is especially difficult when fine manipulation is required in the presence of sensor noise and perception errors; errors inevitably trigger dynamic motion, which is challenging to model precisely. Circ… ▽ More

    Submitted 28 June, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  35. arXiv:2302.13191  [pdf

    cs.RO cs.AI cs.LG cs.NE eess.SY

    DeepCPG Policies for Robot Locomotion

    Authors: Aditya M. Deshpande, Eric Hurd, Ali A. Minai, Manish Kumar

    Abstract: Central Pattern Generators (CPGs) form the neural basis of the observed rhythmic behaviors for locomotion in legged animals. The CPG dynamics organized into networks allow the emergence of complex locomotor behaviors. In this work, we take this inspiration for develo** walking behaviors in multi-legged robots. We present novel DeepCPG policies that embed CPGs as a layer in a larger neural networ… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Preprint of paper accepted for publication in IEEE Transaction On Cognitive and Developmental Systems

  36. arXiv:2302.12441  [pdf, other

    cs.LG cs.CL

    MUX-PLMs: Data Multiplexing for High-throughput Language Models

    Authors: Vishvak Murahari, Ameet Deshpande, Carlos E. Jimenez, Izhak Shafran, Mingqiu Wang, Yuan Cao, Karthik Narasimhan

    Abstract: The widespread adoption of large language models such as ChatGPT and Bard has led to unprecedented demand for these technologies. The burgeoning cost of inference for ever-increasing model sizes coupled with hardware shortages has limited affordable access and poses a pressing need for efficiency approaches geared towards high throughput and performance. Multi-input multi-output (MIMO) algorithms… ▽ More

    Submitted 22 May, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  37. arXiv:2302.05906  [pdf, other

    cs.LG cs.AI

    On Comparing Fair Classifiers under Data Bias

    Authors: Mohit Sharma, Amit Deshpande, Rajiv Ratn Shah

    Abstract: In this paper, we consider a theoretical model for injecting data bias, namely, under-representation and label bias (Blum & Stangl, 2019). We empirically study the effect of varying data biases on the accuracy and fairness of fair classifiers. Through extensive experiments on both synthetic and real-world datasets (e.g., Adult, German Credit, Bank Marketing, COMPAS), we empirically audit pre-, in-… ▽ More

    Submitted 10 December, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: Accepted as a Spotlight Presentation at Algorithmic Fairness through the Lens of Time, Neurips 2023 Workshop

  38. arXiv:2301.11309  [pdf, other

    cs.CL

    SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification

    Authors: Pranjal Aggarwal, Ameet Deshpande, Karthik Narasimhan

    Abstract: Extreme classification (XC) involves predicting over large numbers of classes (thousands to millions), with real-world applications like news article classification and e-commerce product tagging. The zero-shot version of this task requires generalization to novel classes without additional supervision. In this paper, we develop SemSup-XC, a model that achieves state-of-the-art zero-shot and few-s… ▽ More

    Submitted 22 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Published at ICML 2023. V2: camera ready version at ICML 2023

  39. arXiv:2301.09158  [pdf, other

    cs.RO

    Differential Spiral Joint Mechanism for Coupled Variable Stiffness Actuation

    Authors: Mincheol Kim, Ashish Deshpande

    Abstract: In this study, we present the Differential Spiral Joint (DSJ) mechanism for variable stiffness actuation in tendon-driven robots. The DSJ mechanism semi-decouples the modulation of position and mechanical stiffness, allowing independent trajectory tracking in different parameter space. Past studies show that increasing the mechanical stiffness achieves the wider range of renderable stiffness, wher… ▽ More

    Submitted 22 January, 2023; originally announced January 2023.

  40. Sharp complexity phase transitions generated by entanglement

    Authors: Soumik Ghosh, Abhinav Deshpande, Dominik Hangleiter, Alexey V. Gorshkov, Bill Fefferman

    Abstract: Entanglement is one of the physical properties of quantum systems responsible for the computational hardness of simulating quantum systems. But while the runtime of specific algorithms, notably tensor network algorithms, explicitly depends on the amount of entanglement in the system, it is unknown whether this connection runs deeper and entanglement can also cause inherent, algorithm-independent c… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  41. arXiv:2211.16634  [pdf, other

    cs.CL cs.AI cs.LG

    SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers

    Authors: Ameet Deshpande, Md Arafat Sultan, Anthony Ferritto, Ashwin Kalyan, Karthik Narasimhan, Avirup Sil

    Abstract: Fine-tuning pre-trained language models (PLMs) achieves impressive performance on a range of downstream tasks, and their sizes have consequently been getting bigger. Since a different copy of the model is required for each task, this paradigm is infeasible for storage-constrained edge devices like mobile phones. In this paper, we propose SPARTAN, a parameter efficient (PE) and computationally fast… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  42. arXiv:2211.08547  [pdf, other

    cs.CL cs.AI cs.LG

    ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training

    Authors: Henry Tang, Ameet Deshpande, Karthik Narasimhan

    Abstract: Multilingual pre-trained models exhibit zero-shot cross-lingual transfer, where a model fine-tuned on a source language achieves surprisingly good performance on a target language. While studies have attempted to understand transfer, they focus only on MLM, and the large number of differences between natural languages makes it hard to disentangle the importance of different properties. In this wor… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  43. arXiv:2211.08388  [pdf, other

    astro-ph.GA astro-ph.IM cs.LG

    Photometric identification of compact galaxies, stars and quasars using multiple neural networks

    Authors: Siddharth Chaini, Atharva Bagul, Anish Deshpande, Rishi Gondkar, Kaushal Sharma, M. Vivek, Ajit Kembhavi

    Abstract: We present MargNet, a deep learning-based classifier for identifying stars, quasars and compact galaxies using photometric parameters and images from the Sloan Digital Sky Survey (SDSS) Data Release 16 (DR16) catalogue. MargNet consists of a combination of Convolutional Neural Network (CNN) and Artificial Neural Network (ANN) architectures. Using a carefully curated dataset consisting of 240,000 c… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 14 pages, 10 figures, Accepted for publication in MNRAS

  44. arXiv:2211.06106  [pdf, other

    cs.LG cs.AI cs.CY

    Identifying, measuring, and mitigating individual unfairness for supervised learning models and application to credit risk models

    Authors: Rasoul Shahsavarifar, Jithu Chandran, Mario Inchiosa, Amit Deshpande, Mario Schlener, Vishal Gossain, Yara Elias, Vinaya Murali

    Abstract: In the past few years, Artificial Intelligence (AI) has garnered attention from various industries including financial services (FS). AI has made a positive impact in financial services by enhancing productivity and improving risk management. While AI can offer efficient solutions, it has the potential to bring unintended consequences. One such consequence is the pronounced effect of AI-related un… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 12 pages, 3 figures

  45. arXiv:2208.10095  [pdf, other

    cs.LG cs.CY cs.DS

    Socially Fair Center-based and Linear Subspace Clustering

    Authors: Sruthi Gorantla, Kishen N. Gowda, Amit Deshpande, Anand Louis

    Abstract: Center-based clustering (e.g., $k$-means, $k$-medians) and clustering using linear subspaces are two most popular techniques to partition real-world data into smaller clusters. However, when the data consists of sensitive demographic groups, significantly different clustering cost per point for different sensitive groups can lead to fairness-related harms (e.g., different quality-of-service). The… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 17 pages

  46. arXiv:2205.14036  [pdf, other

    cs.CL

    StereoKG: Data-Driven Knowledge Graph Construction for Cultural Knowledge and Stereotypes

    Authors: Awantee Deshpande, Dana Ruiter, Marius Mosbach, Dietrich Klakow

    Abstract: Analyzing ethnic or religious bias is important for improving fairness, accountability, and transparency of natural language processing models. However, many techniques rely on human-compiled lists of bias terms, which are expensive to create and are limited in coverage. In this study, we present a fully data-driven pipeline for generating a knowledge graph (KG) of cultural knowledge and stereotyp… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 12 pages, 2 figures, accepted as a long paper at WOAH at NAACL 2022

  47. arXiv:2205.11843  [pdf, other

    cs.NI eess.SP

    Beam Aware Stochastic Multihop Routing for Flying Ad-hoc Networks

    Authors: Anay Ajit Deshpande, Roberto Pereira, Federico Chiariotti, Adriano Pastore, Xavier Mestre, Andrea Zanella

    Abstract: Routing is a crucial component in the design of Flying Ad-Hoc Networks (FANETs). State of the art routing solutions exploit the position of Unmanned Aerial Vehicles (UAVs) and their mobility information to determine the existence of links between them, but this information is often unreliable, as the topology of FANETs can change quickly and unpredictably. In order to improve the tracking performa… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  48. arXiv:2205.11373  [pdf, ps, other

    eess.SP cs.LG

    User Clustering for Rate Splitting using Machine Learning

    Authors: Roberto Pereira, Anay Ajit Deshpande, Cristian J. Vaca-Rubio, Xavier Mestre, Andrea Zanella, David Gregoratti, Elisabeth de Carvalho, Petar Popovski

    Abstract: Hierarchical Rate Splitting (HRS) schemes proposed in recent years have shown to provide significant improvements in exploiting spatial diversity in wireless networks and provide high throughput for all users while minimising interference among them. Hence, one of the major challenges for such HRS schemes is the necessity to know the optimal clustering of these users based only on their Channel St… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  49. arXiv:2204.12073  [pdf, other

    cs.LG cs.CG stat.ML

    One-pass additive-error subset selection for $\ell_{p}$ subspace approximation

    Authors: Amit Deshpande, Rameshwar Pratap

    Abstract: We consider the problem of subset selection for $\ell_{p}$ subspace approximation, that is, to efficiently find a \emph{small} subset of data points such that solving the problem optimally for this subset gives a good approximation to solving the problem optimally for the original input. Previously known subset selection algorithms based on volume sampling and adaptive sampling \cite{DeshpandeV07}… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: text overlap with arXiv:2103.11107

  50. arXiv:2203.00887  [pdf, other

    cs.LG cs.DS

    Sampling Ex-Post Group-Fair Rankings

    Authors: Sruthi Gorantla, Amit Deshpande, Anand Louis

    Abstract: Randomized rankings have been of recent interest to achieve ex-ante fairer exposure and better robustness than deterministic rankings. We propose a set of natural axioms for randomized group-fair rankings and prove that there exists a unique distribution $D$ that satisfies our axioms and is supported only over ex-post group-fair rankings, i.e., rankings that satisfy given lower and upper bounds on… ▽ More

    Submitted 29 May, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: 31 pages. Accepted for publication as a full paper in IJCAI 2023