Showing 1–50 of 213 results for author: Singh, A K

Searching in archive cs.
  1. arXiv:2407.00434  [pdf, other]

    cs.CL

    Brevity is the soul of wit: Pruning long files for code generation

    Authors: Aaditya K. Singh, Yu Yang, Kushal Tirumala, Mostafa Elhoushi, Ari S. Morcos

    Abstract: Data curation is commonly considered a "secret-sauce" for LLM training, with higher quality data usually leading to better LLM performance. Given the scale of internet-scraped corpora, data pruning has become a larger and larger focus. Specifically, many have shown that de-duplicating data, or sub-selecting higher quality data, can lead to efficiency or performance improvements. Generally, three t…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 15 pages, 5 figures

  2. arXiv:2406.17720  [pdf, other]

    cs.CV

    Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity

    Authors: Chih-Hsuan Yang, Benjamin Feuer, Zaki Jubery, Zi K. Deng, Andre Nakkab, Md Zahid Hasan, Shivani Chiranjeevi, Kelly Marshall, Nirmal Baishnab, Asheesh K Singh, Arti Singh, Soumik Sarkar, Nirav Merchant, Chinmay Hegde, Baskar Ganapathysubramanian

    Abstract: We introduce Arboretum, the largest publicly accessible dataset designed to advance AI for biodiversity applications. This dataset, curated from the iNaturalist community science platform and vetted by domain experts to ensure accuracy, includes 134.6 million images, surpassing existing datasets in scale by an order of magnitude. The dataset encompasses image-language paired data for a diverse set…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Preprint under review

  3. arXiv:2406.17339  [pdf, other]

    cs.IT eess.SP

    Optimizing Configuration Selection in Reconfigurable-Antenna MIMO Systems: Physics-Inspired Heuristic Solvers

    Authors: I. Krikidis, C. Psomas, A. K. Singh, K. Jamieson

    Abstract: Reconfigurable antenna multiple-input multiple-output (MIMO) is a foundational technology for the continuing evolution of cellular systems, including upcoming 6G communication systems. In this paper, we address the problem of flexible/reconfigurable antenna configuration selection for point-to-point MIMO antenna systems by using physics-inspired heuristics. Firstly, we optimize the antenna configu…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.12571

    Journal ref: IEEE Transactions on Communications, 2024

  4. arXiv:2406.16176  [pdf, other]

    cs.AI cs.CL cs.LG

    GraphEval2000: Benchmarking and Improving Large Language Models on Graph Datasets

    Authors: Qiming Wu, Zichen Chen, Will Corcoran, Misha Sra, Ambuj K. Singh

    Abstract: Large language models (LLMs) have achieved remarkable success in natural language processing (NLP), demonstrating significant capabilities in processing and understanding text data. However, recent studies have identified limitations in LLMs' ability to reason about graph-structured data. To address this gap, we introduce GraphEval2000, the first comprehensive graph dataset, comprising 40 graph da…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPs 2024 Dataset and Benchmark track, under review

    MSC Class: H.2.8; I.2.6; I.2.7

  5. arXiv:2406.14639  [pdf, other]

    cs.RO

    Differentiable-Optimization Based Neural Policy for Occlusion-Aware Target Tracking

    Authors: Houman Masnavi, Arun Kumar Singh, Farrokh Janabi-Sharifi

    Abstract: Tracking a target in cluttered and dynamic environments is challenging but forms a core component in applications like aerial cinematography. The obstacles in the environment not only pose collision risk but can also occlude the target from the field-of-view of the robot. Moreover, the target future trajectory may be unknown and only its current state can be estimated. In this paper, we propose a…

    Submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2406.13081  [pdf, other]

    cs.CV

    Class-specific Data Augmentation for Plant Stress Classification

    Authors: Nasla Saleem, Aditya Balu, Talukder Zaki Jubery, Arti Singh, Asheesh K. Singh, Soumik Sarkar, Baskar Ganapathysubramanian

    Abstract: Data augmentation is a powerful tool for improving deep learning-based image classifiers for plant stress identification and classification. However, selecting an effective set of augmentations from a large pool of candidates remains a key challenge, particularly in imbalanced and confounding datasets. We propose an approach for automated class-specific data augmentation using a genetic algorithm.…

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.10229  [pdf, other]

    cs.LG cs.AI

    Quantifying Variance in Evaluation Benchmarks

    Authors: Lovish Madaan, Aaditya K. Singh, Rylan Schaeffer, Andrew Poulton, Sanmi Koyejo, Pontus Stenetorp, Sharan Narang, Dieuwke Hupkes

    Abstract: Evaluation benchmarks are the cornerstone of measuring capabilities of large language models (LLMs), as well as driving progress in said capabilities. Originally designed to make claims about capabilities (or lack thereof) in fully pretrained models, evaluation benchmarks are now also extensively used to decide between various training choices. Despite this widespread usage, we rarely quantify the…

    Submitted 14 June, 2024; originally announced June 2024.

  8. arXiv:2405.15766  [pdf, other]

    cs.AI cs.CL cs.CV

    Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development

    Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal

    Abstract: The mining of adverse drug events (ADEs) is pivotal in pharmacovigilance, enhancing patient safety by identifying potential risks associated with medications, facilitating early detection of adverse events, and guiding regulatory decision-making. Traditional ADE detection methods are reliable but slow, not easily adaptable to large-scale operations, and offer limited information. With the exponent…

    Submitted 26 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: ACL Findings 2024

  9. arXiv:2405.11487  [pdf, other]

    cs.CV

    "Previously on ..." From Recaps to Story Summarization

    Authors: Aditya Kumar Singh, Dhruv Srivastava, Makarand Tapaswi

    Abstract: We introduce multimodal story summarization by leveraging TV episode recaps - short video sequences interweaving key story moments from previous episodes to bring viewers up to speed. We propose PlotSnap, a dataset featuring two crime thriller TV shows with rich recaps and long episodes of 40 minutes. Story summarization labels are unlocked by matching recap shots to corresponding sub-stories in t…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: CVPR 2024; Project page: https://katha-ai.github.io/projects/recap-story-summ/

  10. arXiv:2405.11200  [pdf, other]

    cs.CL

    LexGen: Domain-aware Multilingual Lexicon Generation

    Authors: Karthika NJ, Ayush Maheshwari, Atul Kumar Singh, Preethi Jyothi, Ganesh Ramakrishnan, Krishnakant Bhatt

    Abstract: Lexicon or dictionary generation across domains is of significant societal importance, as it can potentially enhance information accessibility for a diverse user base while preserving language identity. Prior work in the field primarily focuses on bilingual lexical induction, which deals with word alignments using mapping-based or corpora-based approaches. Though initiated by researchers, the rese…

    Submitted 18 May, 2024; originally announced May 2024.

  11. arXiv:2404.18591  [pdf, other]

    cs.CV cs.AI

    FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion

    Authors: Abhishek Kumar Singh, Ioannis Patras

    Abstract: The rapid evolution of the fashion industry increasingly intersects with technological advancements, particularly through the integration of generative AI. This study introduces a novel generative pipeline designed to transform the fashion design process by employing latent diffusion models. Utilizing ControlNet and LoRA fine-tuning, our approach generates high-quality images from multimodal input…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 9 pages, 8 figures

  12. arXiv:2404.07129  [pdf, other]

    cs.LG

    What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

    Authors: Aaditya K. Singh, Ted Moskovitz, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe

    Abstract: In-context learning is a powerful emergent ability in transformer models. Prior work in mechanistic interpretability has identified a circuit element that may be critical for in-context learning -- the induction head (IH), which performs a match-and-copy operation. During training of large transformers on natural language data, IHs emerge around the same time as a notable phase change in the loss.…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 26 pages, 18 figures

  13. arXiv:2404.05631  [pdf, other]

    cs.ET

    Multi Digit Ising Mapping for Low Precision Ising Solvers

    Authors: Abhishek Kumar Singh, Kyle Jamieson

    Abstract: The last couple of years have seen an ever-increasing interest in using different Ising solvers, like Quantum annealers, Coherent Ising machines, and Oscillator-based Ising machines, for solving tough computational problems in various domains. Although the simulations predict massive performance improvements for several tough computational problems, the real implementations of the Ising solvers te…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: version 1.0

  14. arXiv:2404.03307  [pdf, other]

    cs.RO eess.SY

    Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model

    Authors: Amith Manoharan, Aditya Sharma, Himani Belsare, Kaustab Pal, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Navigation of wheeled vehicles on uneven terrain necessitates going beyond the 2D approaches for trajectory planning. Specifically, it is essential to incorporate the full 6dof variation of vehicle pose and its associated stability cost in the planning process. To this end, most recent works aim to learn a neural network model to predict the vehicle evolution. However, such approaches are data-int…

    Submitted 11 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, 7 figures, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  15. arXiv:2403.20116  [pdf, other]

    cs.RO

    LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving

    Authors: Pranjal Paul, Anant Garg, Tushar Choudhary, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Existing Vision-Language models (VLMs) estimate either long-term trajectory waypoints or a set of control actions as a reactive solution for closed-loop planning based on their rich scene comprehension. However, these estimations are coarse and are subjective to their "world understanding" which may generate sub-optimal decisions due to perception errors. In this paper, we introduce LeGo-Drive, wh…

    Submitted 29 March, 2024; originally announced March 2024.

  16. arXiv:2403.19461  [pdf, other]

    cs.RO

    Learning Sampling Distribution and Safety Filter for Autonomous Driving with VQ-VAE and Differentiable Optimization

    Authors: Simon Idoko, Basant Sharma, Arun Kumar Singh

    Abstract: Sampling trajectories from a distribution followed by ranking them based on a specified cost function is a common approach in autonomous driving. Typically, the sampling distribution is hand-crafted (e.g a Gaussian, or a grid). Recently, there have been efforts towards learning the sampling distribution through generative models such as Conditional Variational Autoencoder (CVAE). However, these ap…

    Submitted 25 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  17. arXiv:2403.16592  [pdf, other]

    cs.CL

    TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques

    Authors: Ashok Urlana, Aditya Saibewar, Bala Mallikarjunarao Garlapati, Charaka Vinayak Kumar, Ajeet Kumar Singh, Srinivasa Rao Chalamala

    Abstract: The Large Language Models (LLMs) exhibit remarkable ability to generate fluent content across a wide spectrum of user queries. However, this capability has raised concerns regarding misinformation and personal information leakage. In this paper, we present our methods for the SemEval2024 Task8, aiming to detect machine-generated text across various domains in both mono-lingual and multi-lingual co…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 8 pages, 1 Figure

    ACM Class: I.2.7

  18. arXiv:2403.12571  [pdf, other]

    cs.IT eess.SP

    Optimizing Reconfigurable Antenna MIMO Systems with Coherent Ising Machines

    Authors: Ioannis Krikidis, Abhishek Kumar Singh, Kyle Jamieson

    Abstract: Reconfigurable antenna multiple-input multiple-output (MIMO) is a promising technology for upcoming 6G communication systems. In this paper, we deal with the problem of configuration selection for reconfigurable antenna MIMO by leveraging Coherent Ising Machines (CIMs). By adopting the CIM as a heuristic solver for the Ising problem, the optimal antenna configuration that maximizes the received si…

    Submitted 19 March, 2024; originally announced March 2024.

    Journal ref: IEEE International Conference on Communications (ICC), June 2024

  19. arXiv:2402.18778  [pdf, other]

    cs.NI quant-ph

    X-ResQ: Reverse Annealing for Quantum MIMO Detection with Flexible Parallelism

    Authors: Minsung Kim, Abhishek Kumar Singh, Davide Venturelli, John Kaewell, Kyle Jamieson

    Abstract: Quantum Annealing (QA)-accelerated MIMO detection is an emerging research approach in the context of NextG wireless networks. The opportunity is to enable large MIMO systems and thus improve wireless performance. The approach aims to leverage QA to expedite the computation required for theoretically optimal but computationally-demanding Maximum Likelihood detection to overcome the limitations of t…

    Submitted 9 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 22 pages

  20. arXiv:2402.18751  [pdf, other]

    cs.LG cs.CV

    Multi-Sensor and Multi-temporal High-Throughput Phenotyping for Monitoring and Early Detection of Water-Limiting Stress in Soybean

    Authors: Sarah E. Jones, Timilehin Ayanlade, Benjamin Fallen, Talukder Z. Jubery, Arti Singh, Baskar Ganapathysubramanian, Soumik Sarkar, Asheesh K. Singh

    Abstract: Soybean production is susceptible to biotic and abiotic stresses, exacerbated by extreme weather events. Water limiting stress, i.e. drought, emerges as a significant risk for soybean production, underscoring the need for advancements in stress monitoring for crop breeding and production. This project combines multi-modal information to identify the most effective and efficient automated methods t…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 25 pages, 5 figures

  21. arXiv:2402.14903  [pdf, other]

    cs.CL cs.LG

    Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs

    Authors: Aaditya K. Singh, DJ Strouse

    Abstract: Tokenization, the division of input text into input tokens, is an often overlooked aspect of the large language model (LLM) pipeline and could be the source of useful or harmful inductive biases. Historically, LLMs have relied on byte pair encoding, without care to specific input domains. With the increased use of LLMs for reasoning, various number-specific tokenization schemes have been adopted,…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 21 pages, 18 figures

  22. arXiv:2402.14558  [pdf, other]

    cs.CL

    LLMs with Industrial Lens: Deciphering the Challenges and Prospects -- A Survey

    Authors: Ashok Urlana, Charaka Vinayak Kumar, Ajeet Kumar Singh, Bala Mallikarjunarao Garlapati, Srinivasa Rao Chalamala, Rahul Mishra

    Abstract: Large language models (LLMs) have become the secret ingredient driving numerous industrial applications, showcasing their remarkable versatility across a diverse spectrum of tasks. From natural language processing and sentiment analysis to content generation and personalized recommendations, their unparalleled adaptability has facilitated widespread adoption across industries. This transformative…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 25 pages, 7 figures

  23. arXiv:2402.09654  [pdf, other]

    cs.AI cs.CL cs.HC cs.MA stat.ML

    GPT-4's assessment of its performance in a USMLE-based case study

    Authors: Uttam Dhakal, Aniket Kumar Singh, Suman Devkota, Yogesh Sapkota, Bishal Lamichhane, Suprinsa Paudyal, Chandra Dhakal

    Abstract: This study investigates GPT-4's assessment of its performance in healthcare applications. A simple prompting technique was used to prompt the LLM with questions taken from the United States Medical Licensing Examination (USMLE) questionnaire and it was tasked to evaluate its confidence score before posing the question and after asking the question. The questionnaire was categorized into two groups…

    Submitted 26 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  24. arXiv:2402.07927  [pdf, other]

    cs.AI cs.CL cs.HC

    A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications

    Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Vinija Jain, Samrat Mondal, Aman Chadha

    Abstract: Prompt engineering has emerged as an indispensable technique for extending the capabilities of large language models (LLMs) and vision-language models (VLMs). This approach leverages task-specific instructions, known as prompts, to enhance model efficacy without modifying the core model parameters. Rather than updating the model parameters, prompts allow seamless integration of pre-trained models…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 9 pages, 2 figures

  25. arXiv:2401.08943  [pdf, other]

    cs.CV

    Fluid Dynamic DNNs for Reliable and Adaptive Distributed Inference on Edge Devices

    Authors: Lei Xun, Mingyu Hu, Hengrui Zhao, Amit Kumar Singh, Jonathon Hare, Geoff V. Merrett

    Abstract: Distributed inference is a popular approach for efficient DNN inference at the edge. However, traditional Static and Dynamic DNNs are not distribution-friendly, causing system reliability and adaptability issues. In this paper, we introduce Fluid Dynamic DNNs (Fluid DyDNNs), tailored for distributed inference. Distinct from Static and Dynamic DNNs, Fluid DyDNNs utilize a novel nested incremental t…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at Design, Automation & Test in Europe Conference (DATE) 2024

  26. arXiv:2312.12338  [pdf, other]

    cs.CY

    Smart Connected Farms and Networked Farmers to Tackle Climate Challenges Impacting Agricultural Production

    Authors: Behzad J. Balabaygloo, Barituka Bekee, Samuel W. Blair, Suzanne Fey, Fateme Fotouhi, Ashish Gupta, Kevin Menke, Anusha Vangala, Jorge C. M. Palomares, Aaron Prestholt, Vishesh K. Tanwar, Xu Tao, Matthew E. Carroll, Sajal Das, Gil Depaula, Peter Kyveryga, Soumik Sarkar, Michelle Segovia, Simone Sylvestri, Corinne Valdivia, Asheesh K. Singh

    Abstract: To meet the grand challenges of agricultural production including climate change impacts on crop production, a tight integration of social science, technology and agriculture experts including farmers are needed. There are rapid advances in information and communication technology, precision agriculture and data analytics, which are creating a fertile field for the creation of smart connected farm…

    Submitted 19 December, 2023; originally announced December 2023.

  27. arXiv:2312.07759  [pdf, ps, other]

    cs.LG

    IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means

    Authors: Sean Jaffe, Ambuj K. Singh, Francesco Bullo

    Abstract: Compressing large neural networks with minimal performance loss is crucial to enabling their deployment on edge devices. (Cho et al., 2022) proposed a weight quantization method that uses an attention-based clustering algorithm called differentiable $k$-means (DKM). Despite achieving state-of-the-art results, DKM's performance is constrained by its heavy memory dependency. We propose an implicit,…

    Submitted 15 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  28. arXiv:2312.02418  [pdf, other]

    cs.CL cs.AI cs.LG

    Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data

    Authors: Yu Yang, Aaditya K. Singh, Mostafa Elhoushi, Anas Mahmoud, Kushal Tirumala, Fabian Gloeckle, Baptiste Rozière, Carole-Jean Wu, Ari S. Morcos, Newsha Ardalani

    Abstract: Code datasets, often collected from diverse and uncontrolled sources such as GitHub, potentially suffer from quality issues, thereby affecting the performance and training efficiency of Large Language Models (LLMs) optimized for code generation. Previous studies demonstrated the benefit of using embedding spaces for data pruning, but they mainly focused on duplicate removal or increasing variety,…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 12 pages, 4 figures, Oral Presentation at 3rd Workshop on Efficient Natural Language and Speech Processing (ENLSP-III), NeurIPS 2023

  29. arXiv:2311.08360  [pdf, other]

    cs.LG cs.AI cs.CL

    The Transient Nature of Emergent In-Context Learning in Transformers

    Authors: Aaditya K. Singh, Stephanie C. Y. Chan, Ted Moskovitz, Erin Grant, Andrew M. Saxe, Felix Hill

    Abstract: Transformer neural networks can exhibit a surprising capacity for in-context learning (ICL) despite not being explicitly trained for it. Prior work has provided a deeper understanding of how ICL emerges in transformers, e.g. through the lens of mechanistic interpretability, Bayesian inference, or by examining the distributional properties of training data. However, in each of these cases, ICL is t…

    Submitted 11 December, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 19 pages, 16 figures

  30. arXiv:2310.17808  [pdf, other]

    quant-ph cs.ET

    A Novel Fast Path Planning Approach for Mobile Devices using Hybrid Quantum Ant Colony Optimization Algorithm

    Authors: Mayukh Sarkar, Jitesh Pradhan, Anil Kumar Singh, Hathiram Nenavath

    Abstract: With IoT systems' increasing scale and complexity, maintenance of a large number of nodes using stationary devices is becoming increasingly difficult. Hence, mobile devices are being employed that can traverse through a set of target locations and provide the necessary services. In order to reduce energy consumption and time requirements, the devices are required to traverse following a Hamiltonia…

    Submitted 25 October, 2023; originally announced October 2023.

  31. arXiv:2310.14766  [pdf, other]

    cs.RO

    End-to-End Learning of Behavioural Inputs for Autonomous Driving in Dense Traffic

    Authors: Jatan Shrestha, Simon Idoko, Basant Sharma, Arun Kumar Singh

    Abstract: Trajectory sampling in the Frenet(road-aligned) frame, is one of the most popular methods for motion planning of autonomous vehicles. It operates by sampling a set of behavioural inputs, such as lane offset and forward speed, before solving a trajectory optimization problem conditioned on the sampled inputs. The sampling is handcrafted based on simple heuristics, does not adapt to driving scenario…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to IROS 2023. arXiv admin note: text overlap with arXiv:2212.02224

  32. arXiv:2310.09195  [pdf, other]

    cs.RO

    AMSwarmX: Safe Swarm Coordination in CompleX Environments via Implicit Non-Convex Decomposition of the Obstacle-Free Space

    Authors: Vivek K. Adajania, Siqi Zhou, Arun Kumar Singh, Angela P. Schoellig

    Abstract: Quadrotor motion planning in complex environments leverage the concept of safe flight corridor (SFC) to facilitate static obstacle avoidance. Typically, SFCs are constructed through convex decomposition of the environment's free space into cuboids, convex polyhedra, or spheres. However, when dealing with a quadrotor swarm, such SFCs can be overly conservative, substantially limiting the available…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Submitted to ICRA 2024

  33. arXiv:2310.08270  [pdf, other]

    cs.RO

    Hilbert Space Embedding-based Trajectory Optimization for Multi-Modal Uncertain Obstacle Trajectory Prediction

    Authors: Basant Sharma, Aditya Sharma, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Safe autonomous driving critically depends on how well the ego-vehicle can predict the trajectories of neighboring vehicles. To this end, several trajectory prediction algorithms have been presented in the existing literature. Many of these approaches output a multi-modal distribution of obstacle trajectories instead of a single deterministic prediction to account for the underlying uncertainty. H…

    Submitted 12 October, 2023; originally announced October 2023.

  34. arXiv:2310.04373  [pdf, other]

    cs.LG cs.AI

    Confronting Reward Model Overoptimization with Constrained RLHF

    Authors: Ted Moskovitz, Aaditya K. Singh, DJ Strouse, Tuomas Sandholm, Ruslan Salakhutdinov, Anca D. Dragan, Stephen McAleer

    Abstract: Large language models are typically aligned with human preferences by optimizing $\textit{reward models}$ (RMs) fitted to human feedback. However, human preferences are multi-faceted, and it is increasingly common to derive reward from a composition of simpler reward models which each capture a different aspect of language quality. This itself presents a challenge, as it is difficult to appropriat…

    Submitted 10 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  35. arXiv:2310.02251  [pdf, other]

    cs.CV cs.RO

    Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving

    Authors: Tushar Choudhary, Vikrant Dewangan, Shivam Chandhok, Shubham Priyadarshan, Anushka Jain, Arun K. Singh, Siddharth Srivastava, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Talk2BEV is a large vision-language model (LVLM) interface for bird's-eye view (BEV) maps in autonomous driving contexts. While existing perception systems for autonomous driving scenarios have largely focused on a pre-defined (closed) set of object categories and driving scenarios, Talk2BEV blends recent advances in general-purpose language and vision models with BEV-structured map representation…

    Submitted 14 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Project page at https://llmbev.github.io/talk2bev/

  36. arXiv:2309.16145  [pdf, other]

    cs.CL cs.CY cs.HC

    The Confidence-Competence Gap in Large Language Models: A Cognitive Study

    Authors: Aniket Kumar Singh, Suman Devkota, Bishal Lamichhane, Uttam Dhakal, Chandra Dhakal

    Abstract: Large Language Models (LLMs) have acquired ubiquitous attention for their performances across diverse domains. Our study here searches through LLMs' cognitive abilities and confidence dynamics. We dive deep into understanding the alignment between their self-assessed confidence and actual performance. We exploit these models with diverse sets of questionnaires and real-world scenarios and extract…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 19 pages, 8 Figures, to be published in a journal (Journal TBD), All Authors contributed equally and were Supervised by Chandra Dhakal

    MSC Class: ACM-class: I.2.0

  37. arXiv:2309.15881  [pdf, other]

    cs.LG cs.AI

    Enhancing Cross-Category Learning in Recommendation Systems with Multi-Layer Embedding Training

    Authors: Zihao Deng, Benjamin Ghaemmaghami, Ashish Kumar Singh, Benjamin Cho, Leo Orshansky, Mattan Erez, Michael Orshansky

    Abstract: Modern DNN-based recommendation systems rely on training-derived embeddings of sparse features. Input sparsity makes obtaining high-quality embeddings for rarely-occurring categories harder as their representations are updated infrequently. We demonstrate a training-time technique to produce superior embeddings via effective cross-category learning and theoretically explain its surprising effectiv…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: This is the preprint of our paper accepted at ACML 2023

  38. arXiv:2309.08235  [pdf, other]

    cs.RO

    PRIEST: Projection Guided Sampling-Based Optimization For Autonomous Navigation

    Authors: Fatemeh Rastgar, Houman Masnavi, Basant Sharma, Alvo Aabloo, Jan Swevers, Arun Kumar Singh

    Abstract: Efficient navigation in unknown and dynamic environments is crucial for expanding the application domain of mobile robots. The core challenge stems from the nonavailability of a feasible global path for guiding optimization-based local planners. As a result, existing local planners often get trapped in poor local minima. In this paper, we present a novel optimizer that can explore multiple homotop…

    Submitted 15 September, 2023; originally announced September 2023.

  39. arXiv:2309.07878  [pdf]

    cs.SI cs.CV

    Using network metrics to explore the community structure that underlies movement patterns

    Authors: Anh Pham Thi Minh, Abhishek Kumar Singh, Soumya Snigdha Kundu

    Abstract: This work aims to explore the community structure of Santiago de Chile by analyzing the movement patterns of its residents. We use a dataset containing the approximate locations of home and work places for a subset of anonymized residents to construct a network that represents the movement patterns within the city. Through the analysis of this network, we aim to identify the communities or sub-cit…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 6 pages excluding References

    ACM Class: J.4

  40. An AI-Driven VM Threat Prediction Model for Multi-Risks Analysis-Based Cloud Cybersecurity

    Authors: Deepika Saxena, Ishu Gupta, Rishabh Gupta, Ashutosh Kumar Singh, Xiaoqing Wen

    Abstract: Cloud virtualization technology, ingrained with physical resource sharing, prompts cybersecurity threats on users' virtual machines (VM)s due to the presence of inevitable vulnerabilities on the offsite servers. Contrary to the existing works which concentrated on reducing resource sharing and encryption and decryption of data before transfer for improving cybersecurity which raises computational…

    Submitted 18 August, 2023; originally announced August 2023.

    Journal ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems Journal, 2023

  41. arXiv:2306.11640  [pdf]

    physics.app-ph cs.ET

    Process Voltage Temperature Variability Estimation of Tunneling Current for Band-to-Band-Tunneling based Neuron

    Authors: Shubham Patil, Anand Sharma, Gaurav R, Abhishek Kadam, Ajay Kumar Singh, Sandip Lashkare, Nihar Ranjan Mohapatra, Udayan Ganguly

    Abstract: Compact and energy-efficient Synapse and Neurons are essential to realize the full potential of neuromorphic computing. In addition, a low variability is indeed needed for neurons in Deep neural networks for higher accuracy. Further, process (P), voltage (V), and temperature (T) variation (PVT) are essential considerations for low-power circuits as performance impact and compensation complexities…

    Submitted 20 June, 2023; originally announced June 2023.

  42. arXiv:2306.08322  [pdf, other]

    cs.CR

    Cryptography approach for Secure Outsourced Data Storage in Cloud Environment

    Authors: Rishabh Gupta, Deepika Saxena, Ashutosh Kumar Singh

    Abstract: A large amount of data and applications are migrated by researchers, stakeholders, academia, and business organizations to the cloud environment due to its large variety of services, which involve the least maintenance cost, maximum flexibility, and on-demand service for storage, computation, and data distribution intentions. Despite the various characteristics the cloud environment supports, it a…

    Submitted 14 June, 2023; originally announced June 2023.

  43. arXiv:2306.04939  [pdf, other]

    cs.RO

    UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images

    Authors: Vikrant Dewangan, Basant Sharma, Tushar Choudhary, Sarthak Sharma, Aakash Aanegola, Arun K. Singh, K. Madhava Krishna

    Abstract: Autonomous driving requires accurate reasoning of the location of objects from raw sensor data. Recent end-to-end learning methods go from raw sensor data to a trajectory output via Bird's Eye View (BEV) segmentation as an interpretable intermediate representation. Motion planning over cost maps generated via Bird's Eye View (BEV) segmentation has emerged as a prominent approach in autonomous drivin…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to CASE 2023. Project video available at https://vikr-182.github.io/UAP-BEV

  44. arXiv:2306.02507  [pdf, other]

    cs.CV

    Deep learning powered real-time identification of insects using citizen science data

    Authors: Shivani Chiranjeevi, Mojdeh Sadaati, Zi K Deng, Jayanth Koushik, Talukder Z Jubery, Daren Mueller, Matthew E O Neal, Nirav Merchant, Aarti Singh, Asheesh K Singh, Soumik Sarkar, Arti Singh, Baskar Ganapathysubramanian

    Abstract: Insect-pests significantly impact global agricultural productivity and quality. Effective management involves identifying the full insect community, including beneficial insects and harmful pests, to develop and implement integrated pest management strategies. Automated identification of insects under real-world conditions presents several challenges, including differentiating similar-looking spec…

    Submitted 4 June, 2023; originally announced June 2023.

  45. arXiv:2306.00011  [pdf, other]

    cs.LG cs.CV

    DeepVAT: A Self-Supervised Technique for Cluster Assessment in Image Datasets

    Authors: Alokendu Mazumder, Tirthajit Baruah, Akash Kumar Singh, Pagadla Krishna Murthy, Vishwajeet Pattanaik, Punit Rathore

    Abstract: Estimating the number of clusters and cluster structures in unlabeled, complex, and high-dimensional datasets (like images) is challenging for traditional clustering algorithms. In recent years, a matrix reordering-based algorithm called Visual Assessment of Tendency (VAT), and its variants have attracted many researchers from various domains to estimate the number of clusters and inherent cluster…

    Submitted 31 July, 2023; v1 submitted 29 May, 2023; originally announced June 2023.

    Comments: Accepted at ViPriors @ ICCV 2023

  46. arXiv:2305.14004  [pdf]

    cs.CL

    Sāmayik: A Benchmark and Dataset for English-Sanskrit Translation

    Authors: Ayush Maheshwari, Ashim Gupta, Amrith Krishna, Atul Kumar Singh, Ganesh Ramakrishnan, G. Anil Kumar, Jitin Singla

    Abstract: We release Sāmayik, a dataset of around 53,000 parallel English-Sanskrit sentences, written in contemporary prose. Sanskrit is a classical language still in sustenance and has a rich documented heritage. However, due to the limited availability of digitized content, it still remains a low-resource language. Existing Sanskrit corpora, whether monolingual or bilingual, have predominantly focused on…

    Submitted 29 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: LREC-COLING, 2024

  47. arXiv:2305.12371  [pdf]

    cs.CL

    Machine Translation by Projecting Text into the Same Phonetic-Orthographic Space Using a Common Encoding

    Authors: Amit Kumar, Shantipriya Parida, Ajay Pratap, Anil Kumar Singh

    Abstract: The use of subword embedding has proved to be a major innovation in Neural Machine Translation (NMT). It helps NMT to learn better context vectors for Low Resource Languages (LRLs) so as to predict the target words by better modelling the morphologies of the two languages and also the morphosyntax transfer. Even so, their performance for translation in Indian language to Indian language scenario i…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 17 pages, 2 figures

  48. arXiv:2305.01823  [pdf, other]

    cs.CV cs.LG

    Out-of-distribution detection algorithms for robust insect classification

    Authors: Mojdeh Saadati, Aditya Balu, Shivani Chiranjeevi, Talukder Zaki Jubery, Asheesh K Singh, Soumik Sarkar, Arti Singh, Baskar Ganapathysubramanian

    Abstract: Deep learning-based approaches have produced models with good insect classification accuracy; most of these models are conducive for application in controlled environmental conditions. One of the primary emphases of researchers is to implement identification and classification models in the real agriculture fields, which is challenging because input images that are wildly out of the distribution (…

    Submitted 2 May, 2023; originally announced May 2023.

  49. arXiv:2304.12830  [pdf, other]

    cs.NI cs.IT eess.SP

    Uplink MIMO Detection using Ising Machines: A Multi-Stage Ising Approach

    Authors: Abhishek Kumar Singh, Ari Kapelyan, Davide Venturelli, Kyle Jamieson

    Abstract: Multiple-Input-Multiple-Output (MIMO) signal detection is central to every state-of-the-art communication system, and enhancements in error performance and computation complexity of MIMO detection would significantly enhance data rate and latency experienced by the users. Theoretically, the optimal MIMO detector is the maximum-likelihood (ML) MIMO detector; however, due to its extremely high compl…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Initial draft. arXiv admin note: text overlap with arXiv:2205.05020

  50. arXiv:2304.08504  [pdf]

    cs.ET physics.app-ph

    Schottky Barrier MOSFET Enabled Ultra-Low Power Real-Time Neuron for Neuromorphic Computing

    Authors: Shubham Patil, Jayatika Sakhuja, Ajay Kumar Singh, Anmol Biswas, Vivek Saraswat, Sandeep Kumar, Sandip Lashkare, Udayan Ganguly

    Abstract: Energy-efficient real-time synapses and neurons are essential to enable large-scale neuromorphic computing. In this paper, we propose and demonstrate the Schottky-Barrier MOSFET-based ultra-low power voltage-controlled current source to enable real-time neurons for neuromorphic computing. Schottky-Barrier MOSFET is fabricated on a Silicon-on-insulator platform with polycrystalline Silicon as the c…

    Submitted 17 April, 2023; originally announced April 2023.