Showing 1–50 of 387 results for author: Singh, M

Searching in archive cs.
  1. arXiv:2406.19091  [pdf, other]

    cs.CR

    SubLock: Sub-Circuit Replacement based Input Dependent Key-based Logic Locking for Robust IP Protection

    Authors: Vijaypal Singh Rathor, Munesh Singh, Kshira Sagar Sahoo, Saraju P. Mohanty

    Abstract: Intellectual Property (IP) piracy, overbuilding, reverse engineering, and hardware Trojans are serious security concerns during integrated circuit (IC) development. Logic locking has proven to be a solid defence for mitigating these threats. The existing logic locking techniques are vulnerable to SAT-based attacks. However, several SAT-resistant logic locking methods are reported; they require sign…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 22 pages, 12 figures, Journal

  2. arXiv:2406.08816  [pdf, other]

    cs.CV

    ToSA: Token Selective Attention for Efficient Vision Transformers

    Authors: Manish Kumar Singh, Rajeev Yasarla, Hong Cai, Mingu Lee, Fatih Porikli

    Abstract: In this paper, we propose a novel token selective attention approach, ToSA, which can identify tokens that need to be attended as well as those that can skip a transformer layer. More specifically, a token selector parses the current attention maps and predicts the attention maps for the next layer, which are then used to select the important tokens that should participate in the attention operati…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPRW 2024

  3. arXiv:2406.06774  [pdf, other]

    eess.AS cs.SD

    ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

    Authors: Orchid Chetia Phukan, Sarthak Jain, Shubham Singh, Muskaan Singh, Arun Balaji Buduru, Rajesh Sharma

    Abstract: In this work, we focus on the detection of depression through speech analysis. Previous research has widely explored features extracted from pre-trained models (PTMs) primarily trained for paralinguistic tasks. Although these features have led to sufficient advances in speech-based depression detection, their performance declines in real-world settings. To address this, in this paper, we introduce…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024 Show & Tell Demonstrations

  4. arXiv:2406.05505  [pdf, other]

    cs.IR cs.AI

    I-SIRch: AI-Powered Concept Annotation Tool For Equitable Extraction And Analysis Of Safety Insights From Maternity Investigations

    Authors: Mohit Kumar Singh, Georgina Cosma, Patrick Waterson, Jonathan Back, Gyuchan Thomas Jun

    Abstract: Maternity care is a complex system involving treatments and interactions between patients, providers, and the care environment. To improve patient safety and outcomes, understanding the human factors (e.g. individuals' decisions, local facilities) influencing healthcare delivery is crucial. However, most current tools for analysing healthcare data focus only on biomedical concepts (e.g. health cond…

    Submitted 8 June, 2024; originally announced June 2024.

  5. arXiv:2406.03822  [pdf, other]

    cs.SD cs.CR eess.AS

    SilentCipher: Deep Audio Watermarking

    Authors: Mayank Kumar Singh, Naoya Takahashi, Weihsiang Liao, Yuki Mitsufuji

    Abstract: In the realm of audio watermarking, it is challenging to simultaneously encode imperceptible messages while enhancing the message capacity and robustness. Although recent advancements in deep learning-based methods bolster the message capacity and robustness over traditional methods, the encoded messages introduce audible artefacts that restrict their usage in professional settings. In this study…

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2406.01650  [pdf, other]

    q-bio.BM cs.AI cs.LG

    TAGMol: Target-Aware Gradient-guided Molecule Generation

    Authors: Vineeth Dorna, D. Subhalingam, Keshav Kolluru, Shreshth Tuli, Mrityunjay Singh, Saurabh Singal, N. M. Anoop Krishnan, Sayan Ranu

    Abstract: 3D generative models have shown significant promise in structure-based drug design (SBDD), particularly in discovering ligands tailored to specific target binding sites. Existing algorithms often focus primarily on ligand-target binding, characterized by binding affinity. Moreover, models trained solely on target-ligand distribution may fall short in addressing the broader objectives of drug disco…

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.20163  [pdf, other]

    cs.CL cs.AI

    Reasoning about concepts with LLMs: Inconsistencies abound

    Authors: Rosario Uceda-Sosa, Karthikeyan Natesan Ramamurthy, Maria Chang, Moninder Singh

    Abstract: The ability to summarize and organize knowledge into abstract concepts is key to learning and reasoning. Many industrial applications rely on the consistent and systematic use of concepts, especially when dealing with decision-critical knowledge. However, we demonstrate that, when methodically questioned, large language models (LLMs) often display and demonstrate significant inconsistencies in the…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 15 pages, 5 figures, 3 tables

  8. arXiv:2405.05988  [pdf, other]

    physics.ao-ph cs.LG

    CloudSense: A Model for Cloud Type Identification using Machine Learning from Radar data

    Authors: Mehzooz Nizar, Jha K. Ambuj, Manmeet Singh, Vaisakh S. B, G. Pandithurai

    Abstract: Knowledge of the type of precipitating cloud is crucial for radar-based quantitative estimates of precipitation. We propose a novel model called CloudSense which uses machine learning to accurately identify the type of precipitating clouds over the complex terrain locations in the Western Ghats (WGs) of India. CloudSense uses vertical reflectivity profiles collected during July-August 2018 from an…

    Submitted 8 May, 2024; originally announced May 2024.

  9. arXiv:2405.01858  [pdf, other]

    cs.CL cs.CY

    SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India

    Authors: Salam Michael Singh, Shubhmoy Kumar Garg, Amitesh Misra, Aaditeshwar Seth, Tanmoy Chakraborty

    Abstract: Sexual education aims to foster a healthy lifestyle in terms of emotional, mental and social well-being. In countries like India, where adolescents form the largest demographic group, they face significant vulnerabilities concerning sexual health. Unfortunately, sexual education is often stigmatized, creating barriers to providing essential counseling and information to this at-risk population. Co…

    Submitted 3 May, 2024; originally announced May 2024.

  10. arXiv:2405.01556  [pdf, other]

    cs.SE cs.AI cs.CL

    Semantically Aligned Question and Code Generation for Automated Insight Generation

    Authors: Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani, Austin Z. Henley, Vu Le, Chris Parnin, Mukul Singh, Gust Verbruggen

    Abstract: Automated insight generation is a common tactic for helping knowledge workers, such as data scientists, to quickly understand the potential value of new and unfamiliar data. Unfortunately, automated insights produced by large-language models can generate code that does not correctly correspond (or align) to the insight. In this paper, we leverage the semantic knowledge of large language models to…

    Submitted 21 March, 2024; originally announced May 2024.

  11. arXiv:2404.15765  [pdf, other]

    cs.CV

    3D Face Morphing Attack Generation using Non-Rigid Registration

    Authors: Jag Mohan Singh, Raghavendra Ramachandra

    Abstract: Face Recognition Systems (FRS) are widely used in commercial environments, such as e-commerce and e-banking, owing to their high accuracy in real-world conditions. However, these systems are vulnerable to facial morphing attacks, which are generated by blending face color images of different subjects. This paper presents a new method for generating 3D face morphs from two bona fide point clouds. T…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted to 2024 18th International Conference on Automatic Face and Gesture Recognition (FG) as short paper

  12. arXiv:2404.12680  [pdf, other]

    cs.CV cs.CR

    VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection

    Authors: Raghavendra Ramachandra, Narayan Vetrekar, Sushma Venkatesh, Savita Nageshker, Jag Mohan Singh, R. S. Gad

    Abstract: Facial biometrics are an essential component of smartphones to ensure reliable and trustworthy authentication. However, face biometric systems are vulnerable to Presentation Attacks (PAs), and the availability of more sophisticated presentation attack instruments such as 3D silicone face masks will allow attackers to deceive face recognition systems easily. In this work, we propose a novel Presen…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted in 2024 18th International Conference on Automatic Face and Gesture Recognition (FG)

  13. arXiv:2404.02269  [pdf, other]

    cs.CL cs.AI

    Extracting Norms from Contracts Via ChatGPT: Opportunities and Challenges

    Authors: Amanul Haque, Munindar P. Singh

    Abstract: We investigate the effectiveness of ChatGPT in extracting norms from contracts. Norms provide a natural way to engineer multiagent systems by capturing how to govern the interactions between two or more autonomous parties. We extract norms of commitment, prohibition, authorization, and power, along with associated norm elements (the parties involved, antecedents, and consequents) from contracts. O…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at COINE-AAMAS 2024

  14. arXiv:2404.00401  [pdf, other]

    cs.CL

    How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Dataset

    Authors: Akash Ghosh, B Venkata Sahith, Niloy Ganguly, Pawan Goyal, Mayank Singh

    Abstract: Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning. In recent years, while tabular QA has seen rapid progress, understanding their robustness on scientific information is lacking due to the absence of any benchmark dataset. To investigate the robustness of the existing state-of-the-art QA models on scientif…

    Submitted 30 March, 2024; originally announced April 2024.

  15. Information Security and Privacy in the Digital World: Some Selected Topics

    Authors: Jaydip Sen, Joceli Mayer, Subhasis Dasgupta, Subrata Nandi, Srinivasan Krishnaswamy, Pinaki Mitra, Mahendra Pratap Singh, Naga Prasanthi Kundeti, Chandra Sekhara Rao MVP, Sudha Sree Chekuri, Seshu Babu Pallapothu, Preethi Nanjundan, Jossy P. George, Abdelhadi El Allahi, Ilham Morino, Salma AIT Oussous, Siham Beloualid, Ahmed Tamtaoui, Abderrahim Bajit

    Abstract: In the era of generative artificial intelligence and the Internet of Things, while there is explosive growth in the volume of data and the associated need for processing, analysis, and storage, several new challenges are faced in identifying spurious and fake information and protecting the privacy of sensitive data. This has led to an increasing demand for more robust and resilient schemes for aut…

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: Published by IntechOpen, London, UK, in Nov 2023; the book contains 8 chapters spanning 131 pages. arXiv admin note: text overlap with arXiv:2307.02055, arXiv:2304.00258

  16. arXiv:2403.17243  [pdf, other]

    cs.CY cs.SI

    Review Ecosystems to access Educational XR Experiences: a Scoping Review

    Authors: Shaun Bangay, Adam P. A. Cardilini, Sophie McKenzie, Maria Nicholas, Manjeet Singh

    Abstract: Educators, developers, and other stakeholders face challenges when creating, adapting, and utilizing virtual and augmented reality (XR) experiences for teaching curriculum topics. User-created reviews of these applications provide important information about their relevance and effectiveness in supporting achievement of educational outcomes. To make these reviews accessible, relevant, and useful,…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 29 pages, 13 figures

    ACM Class: I.7.4; K.3.1

  17. arXiv:2403.12953  [pdf, other]

    cs.CV

    FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

    Authors: Rajeev Yasarla, Manish Kumar Singh, Hong Cai, Yunxiao Shi, Jisoo Jeong, Yinhao Zhu, Shizhong Han, Risheek Garrepalli, Fatih Porikli

    Abstract: In this paper, we propose a novel video depth estimation approach, FutureDepth, which enables the model to implicitly leverage multi-frame and motion cues to improve depth estimation by making it learn to predict the future at training. More specifically, we propose a future prediction network, F-Net, which takes the features of multiple consecutive frames and is trained to predict multi-frame fea…

    Submitted 19 March, 2024; originally announced March 2024.

  18. arXiv:2403.12202  [pdf, other]

    cs.CV

    DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions

    Authors: Yunxiao Shi, Manish Kumar Singh, Hong Cai, Fatih Porikli

    Abstract: In this paper, we introduce a novel approach that harnesses both 2D and 3D attentions to enable highly accurate depth completion without requiring iterative spatial propagations. Specifically, we first enhance a baseline convolutional depth completion model by applying attention to 2D features in the bottleneck and skip connections. This effectively improves the performance of this simple network…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  19. arXiv:2403.09704  [pdf, other]

    cs.CL cs.AI cs.LG

    Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations

    Authors: Swapnaja Achintalwar, Ioana Baldini, Djallel Bouneffouf, Joan Byamugisha, Maria Chang, Pierre Dognin, Eitan Farchi, Ndivhuwo Makondo, Aleksandra Mojsilovic, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Inkit Padhi, Orna Raz, Jesus Rios, Prasanna Sattigeri, Moninder Singh, Siphiwe Thwala, Rosario A. Uceda-Sosa, Kush R. Varshney

    Abstract: The alignment of large language models is usually done by model providers to add or control behaviors that are common or universally understood across use cases and contexts. In contrast, in this article, we present an approach and architecture that empowers application developers to tune a model to their particular values, social norms, laws and other regulations, and orchestrate between potentia…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures

  20. arXiv:2403.09349  [pdf, other]

    cs.SI physics.soc-ph

    From Pro, Anti to Informative and Hesitant: An Infoveillance study of COVID-19 vaccines and vaccination discourse on Twitter

    Authors: Pardeep Singh, Rabindra Lamsal, Monika Singh, Satish Chand, Bhawna Shishodia

    Abstract: The COVID-19 pandemic has brought unprecedented challenges to the world, and vaccination has been a key strategy to combat the disease. Since Twitter is one of the most widely used public microblogging platforms, researchers have analysed COVID-19 vaccines and vaccination Twitter discourse to explore the conversational dynamics around the topic. While contributing to the crisis informatics literature,…

    Submitted 14 March, 2024; originally announced March 2024.

  21. arXiv:2403.00096  [pdf]

    cs.CY

    Future of Pandemic Prevention and Response CCC Workshop Report

    Authors: David Danks, Rada Mihalcea, Katie Siek, Mona Singh, Brian Dixon, Haley Griffin

    Abstract: This report summarizes the discussions and conclusions of a 2-day multidisciplinary workshop that brought together researchers and practitioners in healthcare, computer science, and social sciences to explore what lessons were learned and what actions, primarily in research, could be taken. One consistent observation was that there is significant merit in thinking not only about pandemic situation…

    Submitted 29 February, 2024; originally announced March 2024.

  22. arXiv:2402.19052  [pdf]

    cs.CL cs.HC

    Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: A Benchmark Study

    Authors: Prottay Kumar Adhikary, Aseem Srivastava, Shivani Kumar, Salam Michael Singh, Puneet Manuja, Jini K Gopinath, Vijay Krishnan, Swati Kedia, Koushik Sinha Deb, Tanmoy Chakraborty

    Abstract: Comprehensive summaries of sessions enable an effective continuity in mental health counseling, facilitating informed therapy planning. Yet, manual summarization presents a significant challenge, diverting experts' attention from the core counseling process. This study evaluates the effectiveness of state-of-the-art Large Language Models (LLMs) in selectively summarizing various components of ther…

    Submitted 29 February, 2024; originally announced February 2024.

  23. arXiv:2402.16078  [pdf, other]

    cs.LG

    Beyond Spatio-Temporal Representations: Evolving Fourier Transform for Temporal Graphs

    Authors: Anson Bastos, Kuldeep Singh, Abhishek Nadgeri, Manish Singh, Toyotaro Suzumura

    Abstract: We present the Evolving Graph Fourier Transform (EFT), the first invertible spectral transform that captures evolving representations on temporal graphs. We motivate our work by the inadequacy of existing methods for capturing the evolving graph spectra, which are also computationally expensive due to the temporal aspect along with the graph vertex domain. We view the problem as an optimization ov…

    Submitted 18 April, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted as a full conference paper in the International Conference on Learning Representations 2024

  24. arXiv:2402.14860  [pdf, other]

    cs.CL cs.AI cs.LG

    Ranking Large Language Models without Ground Truth

    Authors: Amit Dhurandhar, Rahul Nair, Moninder Singh, Elizabeth Daly, Karthikeyan Natesan Ramamurthy

    Abstract: Evaluation and ranking of large language models (LLMs) has become an important problem with the proliferation of these models and their impact. Evaluation methods either require human responses which are expensive to acquire or use pairs of LLMs to evaluate each other which can be unreliable. In this paper, we provide a novel perspective where, given a dataset of prompts (viz. questions, instructi…

    Submitted 10 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  25. arXiv:2402.11997  [pdf, other]

    cs.CL cs.AI cs.LG

    Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

    Authors: Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

    Abstract: Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited. This hinders their application in real-world scenarios where understanding the sequential nature of events is crucial. This paper experiments with state-of-the-art models on a novel, large-scale temporal dataset, TempUN, to reveal significan…

    Submitted 19 February, 2024; originally announced February 2024.

  26. arXiv:2402.09948  [pdf, other]

    eess.SP cs.LG

    Neural 5G Indoor Localization with IMU Supervision

    Authors: Aleksandr Ermolov, Shreya Kadambi, Maximilian Arnold, Mohammed Hirzallah, Roohollah Amiri, Deepak Singh Mahendar Singh, Srinivas Yerramalli, Daniel Dijkman, Fatih Porikli, Taesang Yoo, Bence Major

    Abstract: Radio signals are well suited for user localization because they are ubiquitous, can operate in the dark and maintain privacy. Many prior works learn mappings between channel state information (CSI) and position fully-supervised. However, that approach relies on position labels which are very expensive to acquire. In this work, this requirement is relaxed by using pseudo-labels during deployment,…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: IEEE GLOBECOM 2023

  27. arXiv:2402.06023  [pdf, other]

    cs.LG cs.AI cs.GT

    Decision Theory-Guided Deep Reinforcement Learning for Fast Learning

    Authors: Zelin Wan, Jin-Hee Cho, Mu Zhu, Ahmed H. Anwar, Charles Kamhoua, Munindar P. Singh

    Abstract: This paper introduces a novel approach, Decision Theory-guided Deep Reinforcement Learning (DT-guided DRL), to address the inherent cold start problem in DRL. By integrating decision theory principles, DT-guided DRL enhances agents' initial performance and robustness in complex environments, enabling more efficient and reliable convergence during learning. Our investigation encompasses two primary…

    Submitted 8 February, 2024; originally announced February 2024.

  28. arXiv:2401.16461  [pdf, other]

    cs.MA cs.AI cs.LG

    Norm Enforcement with a Soft Touch: Faster Emergence, Happier Agents

    Authors: Sz-Ting Tzeng, Nirav Ajmeri, Munindar P. Singh

    Abstract: A multiagent system is a society of autonomous agents whose interactions can be regulated via social norms. In general, the norms of a society are not hardcoded but emerge from the agents' interactions. Specifically, how the agents in a society react to each other's behavior and respond to the reactions of others determines which norms emerge in the society. We think of these reactions by an agent…

    Submitted 5 March, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 12 pages, 11 figures, 5 tables (and supplementary material with code availability and additional results), accepted at AAMAS 2024

  29. arXiv:2401.14317  [pdf, ps, other]

    cs.DS

    Maximizing the Minimum Eigenvalue in Constant Dimension

    Authors: Adam Brown, Aditi Laddha, Mohit Singh

    Abstract: In an instance of the minimum eigenvalue problem, we are given a collection of $n$ vectors $v_1,\ldots, v_n \subset {\mathbb{R}^d}$, and the goal is to pick a subset $B\subseteq [n]$ of given vectors to maximize the minimum eigenvalue of the matrix $\sum_{i\in B} v_i v_i^{\top} $. Often, additional combinatorial constraints such as cardinality constraint $\left(|B|\leq k\right)$ or matroid constra…

    Submitted 25 January, 2024; originally announced January 2024.

  30. arXiv:2401.10521  [pdf, other]

    cs.CL cs.AI

    Cross-lingual Editing in Multilingual Language Models

    Authors: Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

    Abstract: The training of large language models (LLMs) necessitates substantial data and computational resources, and updating outdated LLMs entails significant efforts and resources. While numerous model editing techniques (METs) have emerged to efficiently update model outputs without retraining, their effectiveness in multilingual LLMs, where knowledge is stored in diverse languages, remains an underexpl…

    Submitted 3 February, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted at EACL 2024

  31. arXiv:2401.06233  [pdf, other]

    cs.CL

    LEGOBench: Scientific Leaderboard Generation Benchmark

    Authors: Shruti Singh, Shoaib Alam, Husain Malwat, Mayank Singh

    Abstract: The ever-increasing volume of paper submissions makes it difficult to stay informed about the latest state-of-the-art research. To address this challenge, we introduce LEGOBench, a benchmark for evaluating systems that generate scientific leaderboards. LEGOBench is curated from 22 years of preprint submission data on arXiv and more than 11k machine learning leaderboards on the PapersWithCode porta…

    Submitted 21 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  32. arXiv:2401.04732  [pdf, other]

    cs.IR cs.AI cs.LG

    A case study of Generative AI in MSX Sales Copilot: Improving seller productivity with a real-time question-answering system for content recommendation

    Authors: Manpreet Singh, Ravdeep Pasricha, Nitish Singh, Ravi Prasad Kondapalli, Manoj R, Kiran R, Laurent Boué

    Abstract: In this paper, we design a real-time question-answering system specifically targeted for helping sellers get relevant material/documentation they can share live with their customers or refer to during a call. Taking the Seismic content repository as a relatively large scale example of a diverse dataset of sales material, we demonstrate how LLM embeddings of sellers' queries can be matched with the…

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: Microsoft Journal of Applied Research, Volume 20, 2024

  33. arXiv:2401.03855  [pdf, other]

    cs.CL cs.AI

    PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLM

    Authors: Ankit Yadav, Mayank Singh

    Abstract: Driven by the surge in code generation using large language models (LLMs), numerous benchmarks have emerged to evaluate these LLMs' capabilities. We conducted a large-scale human evaluation of HumanEval and MBPP, two popular benchmarks for Python code generation, analyzing their diversity and difficulty. Our findings unveil a critical bias towards a limited set of programming concepts, neglecting m…

    Submitted 26 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  34. arXiv:2401.02918  [pdf, ps, other]

    cs.DS cs.GT

    Approximation Algorithms for the Weighted Nash Social Welfare via Convex and Non-Convex Programs

    Authors: Adam Brown, Aditi Laddha, Madhusudhan Reddy Pittu, Mohit Singh

    Abstract: In an instance of the weighted Nash Social Welfare problem, we are given a set of $m$ indivisible items, $\mathscr{G}$, and $n$ agents, $\mathscr{A}$, where each agent $i \in \mathscr{A}$ has a valuation $v_{ij}\geq 0$ for each item $j\in \mathscr{G}$. In addition, every agent $i$ has a non-negative weight $w_i$ such that the weights collectively sum up to $1$. The goal is to find an assignment…

    Submitted 5 January, 2024; originally announced January 2024.

  35. arXiv:2312.14920  [pdf, ps, other]

    cs.LG cs.AI

    A Novel Sampled Clustering Algorithm for Rice Phenotypic Data

    Authors: Mithun Singh, Kapil Ahuja, Milind B. Ratnaparkhe

    Abstract: Phenotypic (or Physical) characteristics of plant species are commonly used to perform clustering. In one of our recent works (Shastri et al. (2021)), we used a probabilistically sampled (using pivotal sampling) and spectrally clustered algorithm to group soybean species. These techniques were used to obtain highly accurate clusterings at a reduced cost. In this work, we extend the earlier algorit…

    Submitted 12 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 31 Pages, 3 Figures, 7 Tables

    MSC Class: 68T01; 68T10

    ACM Class: I.2.1; I.5.3

  36. GroupMixNorm Layer for Learning Fair Models

    Authors: Anubha Pandey, Aditi Rai, Maneet Singh, Deepak Bhatt, Tanmoy Bhowmik

    Abstract: Recent research has identified discriminatory behavior of automated prediction algorithms towards groups identified on specific protected attributes (e.g., gender, ethnicity, age group, etc.). When deployed in real-world scenarios, such techniques may demonstrate biased predictions resulting in unfair outcomes. Recent literature has witnessed algorithms for mitigating such biased behavior mostly b…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 12 pages, 6 figures, Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2023

  37. arXiv:2312.11524  [pdf, other]

    cs.CL cs.AI cs.CV

    Assessing GPT4-V on Structured Reasoning Tasks

    Authors: Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Gust Verbruggen

    Abstract: Multi-modality promises to unlock further uses for large language models. Recently, the state-of-the-art language model GPT-4 was enhanced with vision capabilities. We carry out a prompting evaluation of GPT-4V and five other baselines on structured reasoning tasks, such as mathematical reasoning, visual data analysis, and code generation. We show that visual Chain-of-Thought, an extension of Chai…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 9 pages, 9 figures

  38. arXiv:2312.08213  [pdf, other]

    cs.MM cs.CV

    Accelerated Event-Based Feature Detection and Compression for Surveillance Video Systems

    Authors: Andrew C. Freeman, Ketan Mayer-Patel, Montek Singh

    Abstract: The strong temporal consistency of surveillance video enables compelling compression performance with traditional methods, but downstream vision applications operate on decoded image frames with a high data rate. Since it is not straightforward for applications to extract information on temporal redundancy from the compressed video representations, we propose a novel system which conveys temporal…

    Submitted 8 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in the proceedings of ACM Multimedia Systems '24

  39. arXiv:2312.07492  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models

    Authors: Manish Nagireddy, Lamogha Chiazor, Moninder Singh, Ioana Baldini

    Abstract: Current datasets for unwanted social bias auditing are limited to studying protected demographic features such as race and gender. In this work, we introduce a comprehensive benchmark that is meant to capture the amplification of social bias, via stigmas, in generative language models. Taking inspiration from social science research, we start with a documented list of 93 US-centric stigmas and cur…

    Submitted 27 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  40. arXiv:2312.01239  [pdf, other]

    eess.IV cs.CV cs.LG

    Motion Informed Needle Segmentation in Ultrasound Images

    Authors: Raghavv Goel, Cecilia Morales, Manpreet Singh, Artur Dubrawski, John Galeotti, Howie Choset

    Abstract: Segmenting a moving needle in ultrasound images is challenging due to the presence of artifacts, noise, and needle occlusion. This task becomes even more demanding in scenarios where data availability is limited. In this paper, we present a novel approach for needle segmentation for 2D ultrasound that combines classical Kalman Filter (KF) techniques with data-driven learning, incorporating both ne…

    Submitted 3 May, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: 7 pages, 4 figures, accepted at ISBI 2024

  41. arXiv:2311.10794  [pdf, other]

    cs.CV

    Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

    Authors: Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

    Abstract: We introduce Style Tailoring, a recipe to finetune Latent Diffusion Models (LDMs) in a distinct domain with high visual quality, prompt alignment and scene diversity. We choose sticker image generation as the target domain, as the images significantly differ from photorealistic samples typically generated by large-scale LDMs. We start with a competent text-to-image model, like Emu, and show that r…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  42. arXiv:2311.10709  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

    Authors: Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

    Abstract: We present Emu Video, a text-to-video generation model that factorizes the generation into two steps: first generating an image conditioned on the text, and then generating a video conditioned on the text and the generated image. We identify critical design decisions--adjusted noise schedules for diffusion, and multi-stage training--that enable us to directly generate high quality and high resolut…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Project page: https://emu-video.metademolab.com

  43. arXiv:2311.03230  [pdf, other]

    cs.DS

    Balancing Notions of Equity: Approximation Algorithms for Fair Portfolio of Solutions in Combinatorial Optimization

    Authors: Swati Gupta, Jai Moondra, Mohit Singh

    Abstract: Inspired by equity considerations, we consider top-$k$ norm, ordered norm, and symmetric monotonic norm objectives for various combinatorial optimization problems. Top-$k$ norms and ordered norms have natural interpretations in terms of minimizing the impact on individuals bearing largest costs. To model decision-making with multiple equity criteria, we study the notion of portfolios of solutions…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 37 pages, 3 figures

    MSC Class: 68W25; ACM Class: F.2.0

  44. arXiv:2310.19452  [pdf, other]

    cs.CR

    Incorporating Zero-Knowledge Succinct Non-interactive Argument of Knowledge for Blockchain-based Identity Management with off-chain computations

    Authors: Pranay Kothari, Deepak Chopra, Manjot Singh, Shivam Bhardwaj, Rudresh Dwivedi

    Abstract: In today's world, secure and efficient biometric authentication is of keen importance. Traditional authentication methods are no longer considered reliable due to their susceptibility to cyber-attacks. Biometric authentication, particularly fingerprint authentication, has emerged as a promising alternative, but it raises concerns about the storage and use of biometric data, as well as centralized…

    Submitted 30 October, 2023; originally announced October 2023.

  45. arXiv:2310.19268  [pdf, other]

    cs.SI cs.CL cs.CY

    Moral Sparks in Social Media Narratives

    Authors: Ruijie Xi, Munindar P. Singh

    Abstract: There is increasing interest in building computational models of moral reasoning by people to enable effective interaction by Artificial Intelligence (AI) agents. We examine interactions on social media to understand human moral judgments in real-life ethical scenarios. Specifically, we examine posts from a popular Reddit subreddit (i.e., a subcommunity) called r/AmITheAsshole, where authors and c…

    Submitted 21 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

  46. arXiv:2310.17680   

    cs.SE cs.AI cs.CL cs.PL

    CodeFusion: A Pre-trained Diffusion Model for Code Generation

    Authors: Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

    Abstract: Imagine a developer who can only change their last line of code, how often would they have to start writing a function from scratch before it is correct? Auto-regressive models for code generation from natural language have a similar limitation: they do not easily allow reconsidering earlier tokens generated. We introduce CodeFusion, a pre-trained diffusion code generation model that addresses thi…

    Submitted 1 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Contains inappropriately sourced conjecture of OpenAI's ChatGPT parameter count from www.forbes.com/sites/forbestechcouncil/2023/02/17/is-bigger-better-why-the-chatgpt-vs-gpt-3-vs-gpt-4-battle-is-just-a-family-chat, a citation which was omitted. The authors do not have direct knowledge or verification of this information, and relied solely on this article, which may lead to public confusion

  47. arXiv:2310.17306   

    cs.AI cs.CL cs.DB cs.PL

    FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language

    Authors: Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Elnaz Nouri, Mohammad Raza, Gust Verbruggen

    Abstract: Formatting is an important property in tables for visualization, presentation, and analysis. Spreadsheet software allows users to automatically format their tables by writing data-dependent conditional formatting (CF) rules. Writing such rules is often challenging for users as it requires them to understand and implement the underlying logic. We present FormaT5, a transformer-based model that can…

    Submitted 1 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Contains inappropriately sourced conjecture of OpenAI's ChatGPT parameter count from www.forbes.com/sites/forbestechcouncil/2023/02/17/is-bigger-better-why-the-chatgpt-vs-gpt-3-vs-gpt-4-battle-is-just-a-family-chat, a citation which was omitted. The authors do not have direct knowledge or verification of this information, and relied solely on this article, which may lead to public confusion

  48. arXiv:2310.17228  [pdf, other]

    cs.AI cs.CL cs.SE

    TST$^\mathrm{R}$: Target Similarity Tuning Meets the Real World

    Authors: Anirudh Khatry, Sumit Gulwani, Priyanshu Gupta, Vu Le, Ananya Singha, Mukul Singh, Gust Verbruggen

    Abstract: Target similarity tuning (TST) is a method of selecting relevant examples in natural language (NL) to code generation through large language models (LLMs) to improve performance. Its goal is to adapt a sentence embedding model to have the similarity between two NL inputs match the similarity between their associated code outputs. In this paper, we propose different methods to apply and improve TST…

    Submitted 28 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted for EMNLP-Findings, 2023

  49. arXiv:2310.16658  [pdf, other]

    cs.RO

    An Online Self-calibrating Refractive Camera Model with Application to Underwater Odometry

    Authors: Mohit Singh, Mihir Dharmadhikari, Kostas Alexis

    Abstract: This work presents a camera model for refractive media such as water and its application in underwater visual-inertial odometry. The model is self-calibrating in real-time and is free of known correspondences or calibration targets. It is separable as a distortion model (dependent on refractive index $n$ and radial pixel coordinate) and a virtual pinhole model (as a function of $n$). We derive the…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 7 pages, 6 figures, Submitted to the IEEE International Conference on Robotics and Automation, 2024

  50. arXiv:2310.14885  [pdf, other]

    cs.CR

    Location Estimation and Recovery using 5G Positioning: Thwarting GNSS Spoofing Attacks

    Authors: Aneet Kumar Dutta, Sebastian Brandt, Mridula Singh

    Abstract: The availability of cheap GNSS spoofers can prevent safe navigation and tracking of road users. It can lead to loss of assets, inaccurate fare estimation, enforcing the wrong speed limit, miscalculated toll tax, passengers reaching an incorrect location, etc. The techniques designed to prevent and detect spoofing by using cryptographic solutions or receivers capable of differentiating legitimate a…

    Submitted 23 October, 2023; originally announced October 2023.