Skip to main content

Showing 1–50 of 269 results for author: Sinha, A

Searching in archive cs. Search in all archives.
.
  1. A Linear Programming Enhanced Genetic Algorithm for Hyperparameter Tuning in Machine Learning

    Authors: Ankur Sinha, Paritosh Pankaj

    Abstract: In this paper, we formulate the hyperparameter tuning problem in machine learning as a bilevel program. The bilevel program is solved using a micro genetic algorithm that is enhanced with a linear program. While the genetic algorithm searches over discrete hyperparameters, the linear program enhancement allows hyper local search over continuous hyperparameters. The major contribution in this paper… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 8 pages; https://ieeexplore.ieee.org/document/10254162/

    MSC Class: 49M37 ACM Class: I.2.6

  2. arXiv:2406.17104  [pdf, other

    cs.CL

    Automated Adversarial Discovery for Safety Classifiers

    Authors: Yash Kumar Lal, Preethi Lahoti, Aradhana Sinha, Yao Qin, Ananth Balashankar

    Abstract: Safety classifiers are critical in mitigating toxicity on online forums such as social media and in chatbots. Still, they continue to be vulnerable to emergent, and often innumerable, adversarial attacks. Traditional automated adversarial data generation methods, however, tend to produce attacks that are not diverse, but variations of previously observed harm types. We formalize the task of automa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Published at Fourth Workshop on TrustworthyNLP (TrustNLP) at NAACL 2024

  3. arXiv:2406.16135  [pdf, other

    cs.CL cs.LG

    Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models

    Authors: Lynn Chua, Badih Ghazi, Yangsibo Huang, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chulin Xie, Chiyuan Zhang

    Abstract: Large language models (LLMs) are typically multilingual due to pretraining on diverse multilingual corpora. But can these models relate corresponding concepts across languages, effectively being crosslingual? This study evaluates six state-of-the-art LLMs on inherently crosslingual tasks. We observe that while these models show promising surface-level crosslingual abilities on machine translation… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  4. arXiv:2406.14322  [pdf, other

    cs.CL cs.CR cs.LG

    Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

    Authors: Lynn Chua, Badih Ghazi, Yangsibo Huang, Pritish Kamath, Ravi Kumar, Daogao Liu, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

    Abstract: Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy (DP) offers a promising solution by ensuring models are 'almost indistinguishable' with or without any particular privacy unit, current evaluations on LLMs most… ▽ More

    Submitted 3 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2406.09760  [pdf, other

    cs.CL cs.LG

    Bootstrap** Language Models with DPO Implicit Rewards

    Authors: Changyu Chen, Zichen Liu, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha, Pradeep Varakantham, Min Lin

    Abstract: Human alignment in large language models (LLMs) is an active area of research. A recent groundbreaking work, direct preference optimization (DPO), has greatly simplified the process from past work in reinforcement learning from human feedback (RLHF) by bypassing the reward learning stage in RLHF. DPO, after training, provides an implicit reward model. In this work, we make a novel observation that… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2406.09390  [pdf, other

    cs.CV cs.LG

    LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

    Authors: Rajatsubhra Chakraborty, Arkaprava Sinha, Dominick Reilly, Manish Kumar Govind, Pu Wang, Francois Bremond, Srijan Das

    Abstract: Large Language Vision Models (LLVMs) have demonstrated effectiveness in processing internet videos, yet they struggle with the visually perplexing dynamics present in Activities of Daily Living (ADL) due to limited pertinent datasets and models tailored to relevant cues. To this end, we propose a framework for curating ADL multiview datasets to fine-tune LLVMs, resulting in the creation of ADL-X,… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  7. arXiv:2406.06967  [pdf, other

    cs.CV cs.AI eess.IV

    Dual Thinking and Perceptual Analysis of Deep Learning Models using Human Adversarial Examples

    Authors: Kailas Dayanandan, Anand Sinha, Brejesh Lall

    Abstract: The dual thinking framework considers fast, intuitive processing and slower, logical processing. The perception of dual thinking in vision requires images where inferences from intuitive and logical processing differ. We introduce an adversarial dataset to provide evidence for the dual thinking framework in human vision, which also aids in studying the qualitative behavior of deep learning models.… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  8. arXiv:2406.04724  [pdf, other

    cs.LG cs.AI

    Probabilistic Perspectives on Error Minimization in Adversarial Reinforcement Learning

    Authors: Roman Belaire, Arunesh Sinha, Pradeep Varakantham

    Abstract: Deep Reinforcement Learning (DRL) policies are critically vulnerable to adversarial noise in observations, posing severe risks in safety-critical scenarios. For example, a self-driving car receiving manipulated sensory inputs about traffic signs could lead to catastrophic outcomes. Existing strategies to fortify RL algorithms against such adversarial perturbations generally fall into two categorie… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  9. arXiv:2406.03253  [pdf, other

    cs.LG

    Generating Explanations for Cellular Neural Networks

    Authors: Akshit Sinha, Sreeram Vennam, Charu Sharma, Ponnurangam Kumaraguru

    Abstract: Recent advancements in graph learning contributed to explaining predictions generated by Graph Neural Networks. However, existing methodologies often fall short when applied to real-world datasets. We introduce HOGE, a framework to capture higher-order structures using cell complexes, which excel at modeling higher-order relationships. In the real world, higher-order structures are ubiquitous like… ▽ More

    Submitted 5 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    ACM Class: I.2.4

  10. arXiv:2405.18506  [pdf, other

    cs.DM

    An Algorithm for the Decomposition of Complete Graph into Minimum Number of Edge-disjoint Trees

    Authors: Antika Sinha, Sanjoy Kumar Saha, Partha Basuchowdhuri

    Abstract: In this work, we study methodical decomposition of an undirected, unweighted complete graph ($K_n$ of order $n$, size $m$) into minimum number of edge-disjoint trees. We find that $x$, a positive integer, is minimum and $x=\lceil\frac{n}{2}\rceil$ as the edge set of $K_n$ is decomposed into edge-disjoint trees of size sequence $M = \{m_1,m_2,...,m_x\}$ where $m_i\le(n-1)$ and $Σ_{i=1}^{x} m_i$ =… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 10 pages, 4 figures and 3 tables

  11. arXiv:2405.17533  [pdf, other

    cs.AI cs.CV cs.LG

    PAE: LLM-based Product Attribute Extraction for E-Commerce Fashion Trends

    Authors: Apurva Sinha, Ekta Gujral

    Abstract: Product attribute extraction is an growing field in e-commerce business, with several applications including product ranking, product recommendation, future assortment planning and improving online shop** customer experiences. Understanding the customer needs is critical part of online business, specifically fashion products. Retailers uses assortment planning to determine the mix of products to… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Attribute Extraction, PDF files, Bert Embedding, Hashtag, Large Language Model (LLM), Text and Images

  12. arXiv:2405.13081  [pdf

    cs.HC cs.AI

    Children's Mental Models of Generative Visual and Text Based AI Models

    Authors: Eliza Kosoy, Soo** Jeong, Anoop Sinha, Alison Gopnik, Tanya Kraljic

    Abstract: In this work we investigate how children ages 5-12 perceive, understand, and use generative AI models such as a text-based LLMs ChatGPT and a visual-based model DALL-E. Generative AI is newly being used widely since chatGPT. Children are also building mental models of generative AI. Those haven't been studied before and it is also the case that the children's models are dynamic as they use the too… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures

  13. arXiv:2405.09296  [pdf, ps, other

    cs.LG math.OC

    Tight Bounds for Online Convex Optimization with Adversarial Constraints

    Authors: Abhishek Sinha, Rahul Vaze

    Abstract: A well-studied generalization of the standard online convex optimization (OCO) is constrained online convex optimization (COCO). In COCO, on every round, a convex cost function and a convex constraint function are revealed to the learner after the action for that round is chosen. The objective is to design an online policy that simultaneously achieves a small regret while ensuring small cumulative… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  14. arXiv:2405.02705  [pdf, other

    cs.IT

    Peak Age of Information under Tandem of Queues

    Authors: Ashirwad Sinha, Shubhransh Singhvi, Praful D. Mankar, Harpreet S. Dhillon

    Abstract: This paper considers a communication system where a source sends time-sensitive information to its destination via queues in tandem. We assume that the arrival process as well as the service process (of each server) are memoryless, and each of the servers has no buffer. For this setup, we develop a recursive framework to characterize the mean peak age of information (PAoI) under preemptive and non… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted at IEEE ISIT'24

  15. arXiv:2404.16312  [pdf, other

    eess.SY cs.MA cs.RO

    3D Guidance Law for Maximal Coverage and Target Enclosing with Inherent Safety

    Authors: Praveen Kumar Ranjan, Abhinav Sinha, Yongcan Cao

    Abstract: In this paper, we address the problem of enclosing an arbitrarily moving target in three dimensions by a single pursuer, which is an unmanned aerial vehicle (UAV), for maximum coverage while also ensuring the pursuer's safety by preventing collisions with the target. The proposed guidance strategy steers the pursuer to a safe region of space surrounding the target, allowing it to maintain a certai… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  16. arXiv:2404.13252  [pdf, other

    cs.CV cs.LG eess.IV

    3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification

    Authors: Shyam Varahagiri, Aryaman Sinha, Shiv Ram Dubey, Satish Kumar Singh

    Abstract: In recent years, Vision Transformers (ViTs) have shown promising classification performance over Convolutional Neural Networks (CNNs) due to their self-attention mechanism. Many researchers have incorporated ViTs for Hyperspectral Image (HSI) classification. HSIs are characterised by narrow contiguous spectral bands, providing rich spectral data. Although ViTs excel with sequential data, they cann… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted in IEEE Conference on Artificial Intelligence, 2024

  17. arXiv:2404.09473  [pdf, other

    cs.IR

    Exploring the Nexus Between Retrievability and Query Generation Strategies

    Authors: Aman Sinha, Priyanshu Raj Mall, Dwaipayan Roy

    Abstract: Quantifying bias in retrieval functions through document retrievability scores is vital for assessing recall-oriented retrieval systems. However, many studies investigating retrieval model bias lack validation of their query generation methods as accurate representations of retrievability for real users and their queries. This limitation results from the absence of established criteria for query g… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted at ECIR 2024

  18. arXiv:2404.05938  [pdf, other

    cs.LG cs.CE math.NA

    Neural networks can be FLOP-efficient integrators of 1D oscillatory integrands

    Authors: Anshuman Sinha, Spencer H. Bryngelson

    Abstract: We demonstrate that neural networks can be FLOP-efficient integrators of one-dimensional oscillatory integrands. We train a feed-forward neural network to compute integrals of highly oscillatory 1D functions. The training set is a parametric combination of functions with varying characters and oscillatory behavior degrees. Numerical examples show that these networks are FLOP-efficient for sufficie… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 11 pages, 7 figures, 3 tables. Published in TMLR 03/2024. Code at https://github.com/comp-physics/deepOscillations

    Journal ref: Transactions on Machine Learning Research; ISSN 2835-8856 (2024); https://openreview.net/forum?id=z9SIj-IM7tn

  19. arXiv:2404.04497  [pdf, other

    eess.SY cs.MA cs.RO math.OC

    Self-organizing Multiagent Target Enclosing under Limited Information and Safety Guarantees

    Authors: Praveen Kumar Ranjan, Abhinav Sinha, Yongcan Cao

    Abstract: This paper introduces an approach to address the target enclosing problem using non-holonomic multiagent systems, where agents autonomously self-organize themselves in the desired formation around a fixed target. Our approach combines global enclosing behavior and local collision avoidance mechanisms by devising a novel potential function and sliding manifold. In our approach, agents independently… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  20. arXiv:2404.00618  [pdf, other

    cs.LG cs.CV cs.NE

    A Multi-Branched Radial Basis Network Approach to Predicting Complex Chaotic Behaviours

    Authors: Aarush Sinha

    Abstract: In this study, we propose a multi branched network approach to predict the dynamics of a physics attractor characterized by intricate and chaotic behavior. We introduce a unique neural network architecture comprised of Radial Basis Function (RBF) layers combined with an attention mechanism designed to effectively capture nonlinear inter-dependencies inherent in the attractor's temporal evolution.… ▽ More

    Submitted 30 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures

  21. arXiv:2403.17673  [pdf, other

    cs.LG cs.CR cs.DS

    How Private are DP-SGD Implementations?

    Authors: Lynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

    Abstract: We demonstrate a substantial gap between the privacy guarantees of the Adaptive Batch Linear Queries (ABLQ) mechanism under different types of batch sampling: (i) Shuffling, and (ii) Poisson subsampling; the typical analysis of Differentially Private Stochastic Gradient Descent (DP-SGD) follows by interpreting it as a post-processing of ABLQ. While shuffling-based DP-SGD is more commonly used in p… ▽ More

    Submitted 6 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Proceedings of ICML 2024

  22. arXiv:2403.08974  [pdf, other

    cs.CV cs.AI

    $TrIND$: Representing Anatomical Trees by Denoising Diffusion of Implicit Neural Fields

    Authors: Ashish Sinha, Ghassan Hamarneh

    Abstract: Anatomical trees play a central role in clinical diagnosis and treatment planning. However, accurately representing anatomical trees is challenging due to their varying and complex topology and geometry. Traditional methods for representing tree structures, captured using medical imaging, while invaluable for visualizing vascular and bronchial networks, exhibit drawbacks in terms of limited resolu… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted to MICCAI 2024

  23. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  24. arXiv:2402.19292  [pdf, other

    cs.GT

    Fundamental Limits of Throughput and Availability: Applications to prophet inequalities & transaction fee mechanism design

    Authors: Aadityan Ganesh, Jason Hartline, Atanu R Sinha, Matthew vonAllmen

    Abstract: This paper studies the fundamental limits of availability and throughput for independent and heterogeneous demands of a limited resource. Availability is the probability that the demands are below the capacity of the resource. Throughput is the expected fraction of the resource that is utilized by the demands. We offer a concentration inequality generator that gives lower bounds on feasible availa… ▽ More

    Submitted 19 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 34 pages, 7 figures; updated author information to include institutions and email addresses

  25. arXiv:2402.17721  [pdf, other

    cs.HC cs.SE

    Content-Centric Prototy** of Generative AI Applications: Emerging Approaches and Challenges in Collaborative Software Teams

    Authors: Hari Subramonyam, Divy Thakkar, Jürgen Dieber, Anoop Sinha

    Abstract: Generative AI models are increasingly powering software applications, offering the capability to produce expressive content across varied contexts. However, unlike previous iterations of human-AI design, the emerging design process for generative capabilities primarily hinges on prompt engineering strategies. Given this fundamental shift in approach, our work aims to understand how collaborative s… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  26. arXiv:2402.06176  [pdf, other

    eess.SY cs.MA cs.RO math.DS math.OC

    Cooperative Nonlinear Guidance Strategies for Guaranteed Pursuit-Evasion

    Authors: Saurabh Kumar, Shashi Ranjan Kumar, Abhinav Sinha

    Abstract: This paper addresses the pursuit-evasion problem involving three agents -- a purser, an evader, and a defender. We develop cooperative guidance laws for the evader-defender team that guarantee that the defender intercepts the pursuer before it reaches the vicinity of the evader. Unlike heuristic methods, optimal control, differential game formulation, and recently proposed time-constrained guidanc… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  27. arXiv:2402.05918  [pdf, other

    eess.SY cs.MA math.DS math.OC nlin.AO

    Consensus-driven Deviated Pursuit for Guaranteed Simultaneous Interception of Moving Targets

    Authors: Abhinav Sinha, Dwaipayan Mukherjee, Shashi Ranjan Kumar

    Abstract: This work proposes a cooperative strategy that employs deviated pursuit guidance to simultaneously intercept a moving (but not manoeuvring) target. As opposed to many existing cooperative guidance strategies which use estimates of time-to-go, based on proportional-navigation guidance, the proposed strategy uses an exact expression for time-to-go to ensure simultaneous interception. The guidance de… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  28. arXiv:2402.03388  [pdf, other

    cs.AI cs.IR cs.LG

    Delivery Optimized Discovery in Behavioral User Segmentation under Budget Constraint

    Authors: Harshita Chopra, Atanu R. Sinha, Sunav Choudhary, Ryan A. Rossi, Paavan Kumar Indela, Veda Pranav Parwatala, Srinjayee Paul, Aurghya Maiti

    Abstract: Users' behavioral footprints online enable firms to discover behavior-based user segments (or, segments) and deliver segment specific messages to users. Following the discovery of segments, delivery of messages to users through preferred media channels like Facebook and Google can be challenging, as only a portion of users in a behavior segment find match in a medium, and only a fraction of those… ▽ More

    Submitted 15 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  29. arXiv:2401.15897  [pdf, other

    cs.CY cs.HC cs.LG

    Red-Teaming for Generative AI: Silver Bullet or Security Theater?

    Authors: Michael Feffer, Anusha Sinha, Wesley Hanwen Deng, Zachary C. Lipton, Hoda Heidari

    Abstract: In response to rising concerns surrounding the safety, security, and trustworthiness of Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red-teaming as a key component of their strategies for identifying and mitigating these risks. However, despite AI red-teaming's central role in policy discussions and corporate messaging, significant questions remain about what… ▽ More

    Submitted 15 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  30. arXiv:2401.15724  [pdf, other

    cs.CL

    RE-GAINS & EnChAnT: Intelligent Tool Manipulation Systems For Enhanced Query Responses

    Authors: Sahil Girhepuje, Siva Sankar Sajeev, Purvam Jain, Arya Sikder, Adithya Rama Varma, Ryan George, Akshay Govind Srinivasan, Mahendra Kurup, Ashmit Sinha, Sudip Mondal

    Abstract: Large Language Models (LLMs) currently struggle with tool invocation and chaining, as they often hallucinate or miss essential steps in a sequence. We propose RE-GAINS and EnChAnT, two novel frameworks that empower LLMs to tackle complex user queries by making API calls to external tools based on tool descriptions and argument lists. Tools are chained based on the expected output, without receivin… ▽ More

    Submitted 20 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  31. arXiv:2401.15246  [pdf, other

    cs.LG cs.CR cs.IR

    Training Differentially Private Ad Prediction Models with Semi-Sensitive Features

    Authors: Lynn Chua, Qiliang Cui, Badih Ghazi, Charlie Harrison, Pritish Kamath, Walid Krichene, Ravi Kumar, Pasin Manurangsi, Krishna Giri Narra, Amer Sinha, Avinash Varadarajan, Chiyuan Zhang

    Abstract: Motivated by problems arising in digital advertising, we introduce the task of training differentially private (DP) machine learning models with semi-sensitive features. In this setting, a subset of the features is known to the attacker (and thus need not be protected) while the remaining features as well as the label are unknown to the attacker and should be protected by the DP guarantee. This ta… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 7 pages, 4 figures

  32. arXiv:2401.14322  [pdf, other

    cs.CV cs.CY

    Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images

    Authors: Hansa Srinivasan, Candice Schumann, Aradhana Sinha, David Madras, Gbolahan Oluwafemi Olanubi, Alex Beutel, Susanna Ricco, Jilin Chen

    Abstract: Capturing the diversity of people in images is challenging: recent literature tends to focus on diversifying one or two attributes, requiring expensive attribute labels or building classifiers. We introduce a diverse people image ranking method which more flexibly aligns with human notions of people diversity in a less prescriptive, label-free manner. The Perception-Aligned Text-derived Human repr… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  33. arXiv:2401.01690  [pdf, other

    cs.LG

    Zero-shot Active Learning Using Self Supervised Learning

    Authors: Abhishek Sinha, Shreya Singh

    Abstract: Deep learning algorithms are often said to be data hungry. The performance of such algorithms generally improve as more and more annotated data is fed into the model. While collecting unlabelled data is easier (as they can be scraped easily from the internet), annotating them is a tedious and expensive task. Given a fixed budget available for data annotation, Active Learning helps selecting the be… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  34. Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters

    Authors: Manikanta Loya, Divya Anand Sinha, Richard Futrell

    Abstract: The advancement of Large Language Models (LLMs) has led to their widespread use across a broad spectrum of tasks including decision making. Prior studies have compared the decision making abilities of LLMs with those of humans from a psychological perspective. However, these studies have not always properly accounted for the sensitivity of LLMs' behavior to hyperparameters and variations in the pr… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023

  35. arXiv:2312.16564  [pdf, other

    cs.RO cs.MA

    Balancing Priorities in Patrolling with Rabbit Walks

    Authors: Rugved Katole, Deepak Mallya, Leena Vachhani, Arpita Sinha

    Abstract: In an environment with certain locations of higher priority, it is required to patrol these locations as frequently as possible due to their importance. However, the Non-Priority locations are often neglected during the task. It is necessary to balance the patrols on both kinds of sites to avoid breaches in security. We present a distributed online algorithm that assigns the routes to agents that… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 8 pages, 8 figures, 2 tables

  36. arXiv:2312.16177  [pdf, other

    cs.IR cs.LG stat.ME stat.ML

    Learning to Infer Unobserved Behaviors: Estimating User's Preference for a Site over Other Sites

    Authors: Atanu R Sinha, Tanay Anand, Paridhi Maheshwari, A V Lakshmy, Vishal Jain

    Abstract: A site's recommendation system relies on knowledge of its users' preferences to offer relevant recommendations to them. These preferences are for attributes that comprise items and content shown on the site, and are estimated from the data of users' interactions with the site. Another form of users' preferences is material too, namely, users' preferences for the site over other sites, since that s… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  37. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  38. arXiv:2312.04566  [pdf, other

    cs.CV

    Gen2Det: Generate to Detect

    Authors: Saksham Suri, Fanyi Xiao, Animesh Sinha, Sean Chang Culatana, Raghuraman Krishnamoorthi, Chenchen Zhu, Abhinav Shrivastava

    Abstract: Recently diffusion models have shown improvement in synthetic image quality as well as better control in generation. We motivate and present Gen2Det, a simple modular pipeline to create synthetic training data for object detection for free by leveraging state-of-the-art grounded image generation methods. Unlike existing works which generate individual object instances, require identifying foregrou… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  39. arXiv:2312.04557  [pdf, other

    cs.CV

    GenTron: Diffusion Transformers for Image and Video Generation

    Authors: Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Animesh Sinha, ** Luo, Tao Xiang, Juan-Manuel Perez-Rua

    Abstract: In this study, we explore Transformer-based diffusion models for image and video generation. Despite the dominance of Transformer architectures in various fields due to their flexibility and scalability, the visual generative domain primarily utilizes CNN-based U-Net architectures, particularly in diffusion-based models. We introduce GenTron, a family of Generative models employing Transformer-bas… ▽ More

    Submitted 2 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: CVPR2024 Camera Ready. Website: https://www.shoufachen.com/gentron_website/

  40. arXiv:2312.03584  [pdf, other

    cs.CV

    Context Diffusion: In-Context Aware Image Generation

    Authors: Ivona Najdenkoska, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic

    Abstract: We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and fidelity of the generated images deteriorate when the prompt is not present, demonst… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  41. arXiv:2311.17681  [pdf, other

    cs.RO cs.MA

    A low-cost Framework for Decentralized Autonomous Intersection Management

    Authors: Rugved Katole, Arpita Sinha

    Abstract: This paper addresses the traffic management problem for autonomous vehicles at intersections without traffic signals. In the current system, a road junction has no traffic signals when the traffic volume is low to medium. Installing infrastructure at each unsignalled crossing to coordinate autonomous cars can be formidable. We propose a novel low-cost solution strategy where the vehicles use a har… ▽ More

    Submitted 28 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Revised version

  42. arXiv:2311.15373  [pdf, other

    cs.LG cs.AI cs.CR

    Confidence Is All You Need for MI Attacks

    Authors: Abhishek Sinha, Himanshi Tibrewal, Mansi Gupta, Nikhar Waghela, Shivank Garg

    Abstract: In this evolving era of machine learning security, membership inference attacks have emerged as a potent threat to the confidentiality of sensitive data. In this attack, adversaries aim to determine whether a particular point was used during the training of a target model. This paper proposes a new method to gauge a data point's membership in a model's training set. Instead of correlating loss wit… ▽ More

    Submitted 19 June, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 2 pages, 1 figure

  43. arXiv:2311.15341  [pdf, other

    cs.LG

    Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning

    Authors: Changyu Chen, Ramesha Karunasena, Thanh Hong Nguyen, Arunesh Sinha, Pradeep Varakantham

    Abstract: Many problems in Reinforcement Learning (RL) seek an optimal policy with large discrete multidimensional yet unordered action spaces; these include problems in randomized allocation of resources such as placements of multiple security resources and emergency response units, etc. A challenge in this setting is that the underlying action space is categorical (discrete and unordered) and large, for w… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: Accepted in NeurIPS 2023. Website: https://cameron-chen.github.io/flow-iar/

  44. arXiv:2311.12875  [pdf, other

    quant-ph cs.AI cs.LG

    Nav-Q: Quantum Deep Reinforcement Learning for Collision-Free Navigation of Self-Driving Cars

    Authors: Akash Sinha, Antonio Macaluso, Matthias Klusch

    Abstract: The task of collision-free navigation (CFN) of self-driving cars is an NP-hard problem usually tackled using Deep Reinforcement Learning (DRL). While DRL methods have proven to be effective, their implementation requires substantial computing resources and extended training periods to develop a robust agent. On the other hand, quantum reinforcement learning has recently demonstrated faster converg… ▽ More

    Submitted 23 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 28 pages, 12 figures, 4 tables

  45. arXiv:2311.10794  [pdf, other

    cs.CV

    Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

    Authors: Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

    Abstract: We introduce Style Tailoring, a recipe to finetune Latent Diffusion Models (LDMs) in a distinct domain with high visual quality, prompt alignment and scene diversity. We choose sticker image generation as the target domain, as the images significantly differ from photorealistic samples typically generated by large-scale LDMs. We start with a competent text-to-image model, like Emu, and show that r… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  46. arXiv:2311.10348  [pdf, other

    cs.IR

    A Comparative Analysis of Retrievability and PageRank Measures

    Authors: Aman Sinha, Priyanshu Raj Mall, Dwaipayan Roy

    Abstract: The accessibility of documents within a collection holds a pivotal role in Information Retrieval, signifying the ease of locating specific content in a collection of documents. This accessibility can be achieved via two distinct avenues. The first is through some retrieval model using a keyword or other feature-based search, and the other is where a document can be navigated using links associated… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Accepted at FIRE 2023

  47. arXiv:2311.09998  [pdf, other

    cs.LG cs.CV

    DeepEMD: A Transformer-based Fast Estimation of the Earth Mover's Distance

    Authors: Atul Kumar Sinha, Francois Fleuret

    Abstract: The Earth Mover's Distance (EMD) is the measure of choice between point clouds. However the computational cost to compute it makes it prohibitive as a training loss, and the standard approach is to use a surrogate such as the Chamfer distance. We propose an attention-based model to compute an accurate approximation of the EMD that can be used as a training loss for generative models. To get the ne… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  48. arXiv:2311.08357  [pdf, other

    cs.LG cs.CR

    Sparsity-Preserving Differentially Private Training of Large Embedding Models

    Authors: Badih Ghazi, Yangsibo Huang, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

    Abstract: As the use of large embedding models in recommendation systems and language applications increases, concerns over user data privacy have also risen. DP-SGD, a training algorithm that combines differential privacy with stochastic gradient descent, has been the workhorse in protecting user privacy without compromising model accuracy by much. However, applying DP-SGD naively to embedding models can d… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Neural Information Processing Systems (NeurIPS) 2023

  49. arXiv:2311.06542  [pdf

    cs.CV

    Generation Of Colors using Bidirectional Long Short Term Memory Networks

    Authors: A. Sinha

    Abstract: Human vision can distinguish between a vast spectrum of colours, estimated to be between 2 to 7 million discernible shades. However, this impressive range does not inherently imply that all these colours have been precisely named and described within our lexicon. We often associate colours with familiar objects and concepts in our daily lives. This research endeavors to bridge the gap between our… ▽ More

    Submitted 31 December, 2023; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: 8 pages

  50. Achieving Maximum Utilization in Optimal Time for Learning or Convergence in the Kolkata Paise Restaurant Problem

    Authors: Aniruddha Biswas, Antika Sinha, Bikas K. Chakrabarti

    Abstract: The objective of the KPR agents are to learn themselves in the minimum (learning) time to have maximum success or utilization probability ($f$). A dictator can easily solve the problem with $f = 1$ in no time, by asking every one to form a queue and go to the respective restaurant, resulting in no fluctuation and full utilization from the first day (convergence time $τ= 0$). It has already been sh… ▽ More

    Submitted 15 February, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 9 pages, 6 figures included in manuscript; Accepted for publication in Indian Journal of Physics