Skip to main content

Showing 1–50 of 497 results for author: Kumar, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18325  [pdf, ps, other

    cs.IT

    Linear codes with few weights over $\mathbb{F}_{p}+u\mathbb{F}_{p}$

    Authors: Pavan Kumar, Noor Mohammad Khan

    Abstract: For any positive integer $m$ and an odd prime $p$; let $\mathbb{F}_{q}+u\mathbb{F}_{q}$, where $q=p^{m}$, be a ring extension of the ring $\mathbb{F}_{p}+u\mathbb{F}_{p}.$ In this paper, we construct linear codes over $\mathbb{F}_{p}+u\mathbb{F}_{p}$ by using trace function defined on $\mathbb{F}_{q}+u\mathbb{F}_{q}$ and determine their Hamming weight distributions by employing symplectic-weight… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    MSC Class: 94B05; 11T71

  2. arXiv:2406.18307  [pdf, ps, other

    cs.IT

    Five-Lee-weight linear codes over $\mathbb{F}_{q}+u\mathbb{F}_{q}$

    Authors: Dr. Pavan Kumar, Dr. Noor Mohammad Khan

    Abstract: In this study, linear codes having their Lee-weight distributions over the semi-local ring $\mathbb{F}_{q}+u\mathbb{F}_{q}$ with $u^{2}=1$ are constructed using the defining set and Gauss sums for an odd prime $q $. Moreover, we derive complete Hamming-weight enumerators for the images of the constructed linear codes under the Gray map. We finally show an application to secret sharing schemes.

    Submitted 26 June, 2024; originally announced June 2024.

    MSC Class: 94B05; 11T71

  3. arXiv:2406.13248  [pdf, other

    cs.IT eess.SP

    Overlay Space-Air-Ground Integrated Networks with SWIPT-Empowered Aerial Communications

    Authors: Anuradha Verma, Pankaj Kumar Sharma, Pawan Kumar, Dong In Kim

    Abstract: In this article, we consider overlay space-air-ground integrated networks (OSAGINs) where a low earth orbit (LEO) satellite communicates with ground users (GUs) with the assistance of an energy-constrained coexisting air-to-air (A2A) network. Particularly, a non-linear energy harvester with a hybrid SWIPT utilizing both power-splitting and time-switching energy harvesting (EH) techniques is employ… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 36 pages, 14 figures, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. arXiv:2406.11925  [pdf, other

    cs.SE cs.AI cs.CL

    DocCGen: Document-based Controlled Code Generation

    Authors: Sameer Pimparkhede, Mehant Kammakomati, Srikanth G. Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya

    Abstract: Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and Python. However, their practical usage for structured domain-specific languages (DSLs) such as YAML, JSON is limited due to domain-specific schema, grammar, and customizations generally unseen by… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.10288  [pdf, other

    cs.CL cs.LG

    Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models

    Authors: Francisco Eiras, Aleksandar Petrov, Phillip H. S. Torr, M. Pawan Kumar, Adel Bibi

    Abstract: Fine-tuning large language models on small, high-quality datasets can enhance their performance on specific downstream tasks. Recent research shows that fine-tuning on benign, instruction-following data can inadvertently undo the safety alignment process and increase a model's propensity to comply with harmful queries. Although critical, understanding and mitigating safety risks in well-defined ta… ▽ More

    Submitted 1 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2405.11013  [pdf, other

    cs.LG cs.AI

    ARDDQN: Attention Recurrent Double Deep Q-Network for UAV Coverage Path Planning and Data Harvesting

    Authors: Praveen Kumar, Priyadarshni, Rajiv Misra

    Abstract: Unmanned Aerial Vehicles (UAVs) have gained popularity in data harvesting (DH) and coverage path planning (CPP) to survey a given area efficiently and collect data from aerial perspectives, while data harvesting aims to gather information from various Internet of Things (IoT) sensor devices, coverage path planning guarantees that every location within the designated area is visited with minimal re… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  7. arXiv:2405.09373  [pdf, other

    cs.CL

    PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models

    Authors: Devansh Jain, Priyanshu Kumar, Samuel Gehman, Xuhui Zhou, Thomas Hartvigsen, Maarten Sap

    Abstract: Recent advances in large language models (LLMs) have led to their extensive global deployment, and ensuring their safety calls for comprehensive and multilingual toxicity evaluations. However, existing toxicity benchmarks are overwhelmingly focused on English, posing serious risks to deploying LLMs in other languages. We address this by introducing PolygloToxicityPrompts (PTP), the first large-sca… ▽ More

    Submitted 20 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  8. arXiv:2405.07733  [pdf, other

    cs.CE

    TOPress3D: 3D topology optimization with design-dependent pressure loads in MATLAB

    Authors: Prabhat Kumar

    Abstract: This paper introduces ``TOPress3D," a 3D topology optimization MATLAB code for structures subjected to design-dependent pressure loads. With a primary focus on pedagogical objectives, the code provides an easy learning experience, making it a valuable tool and practical gateway for newcomers, students, and researchers towards this topic. TOPress3D uses Darcy's law with a drainage term to link the… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 Figures

  9. arXiv:2405.06641  [pdf, other

    cs.IT

    On Existence of Latency Optimal Uncoded Storage Schemes in Geo-Distributed Data Storage Systems

    Authors: Srivathsa Acharya, P. Vijay Kumar, Viveck R. Cadambe

    Abstract: We consider the problem of geographically distributed data storage in a network of servers (or nodes) where the nodes are connected to each other via communication links having certain round-trip times (RTTs). Each node serves a specific set of clients, where a client can request for any of the files available in the distributed system. The parent node provides the requested file if available loca… ▽ More

    Submitted 13 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  10. arXiv:2405.06621  [pdf, other

    cs.IT

    On Streaming Codes for Simultaneously Correcting Burst and Random Erasures

    Authors: Shobhit Bhatnagar, Biswadip Chakraborty, P. Vijay Kumar

    Abstract: Streaming codes are packet-level codes that recover dropped packets within a strict decoding-delay constraint. We study streaming codes over a sliding-window (SW) channel model which admits only those erasure patterns which allow either a single burst erasure of $\le b$ packets along with $\le e$ random packet erasures, or else, $\le a$ random packet erasures, in any sliding-window of $w$ time slo… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  11. arXiv:2405.06606  [pdf, other

    cs.IT

    On Streaming Codes for Burst and Random Errors

    Authors: Shobhit Bhatnagar, P. Vijay Kumar

    Abstract: Streaming codes (SCs) are packet-level codes that recover erased packets within a strict decoding-delay deadline. Streaming codes for various packet erasure channel models such as sliding-window (SW) channel models that admit random or burst erasures in any SW of a fixed length have been studied in the literature, and the optimal rate as well as rate-optimal code constructions of SCs over such cha… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  12. arXiv:2405.04260  [pdf, other

    cs.LG cs.AI

    Verified Neural Compressed Sensing

    Authors: Rudy Bunel, Krishnamurthy Dvijotham, M. Pawan Kumar, Alessandro De Palma, Robert Stanforth

    Abstract: We develop the first (to the best of our knowledge) provably correct neural networks for a precise computational task, with the proof of correctness generated by an automated verification algorithm without any human input. Prior work on neural network verification has focused on partial specifications that, even when satisfied, are not sufficient to ensure that a neural network never makes errors.… ▽ More

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  13. arXiv:2405.02769  [pdf, other

    cs.LG cs.MA math.OC

    Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization

    Authors: Youbang Sun, Tao Liu, P. R. Kumar, Shahin Shahrampour

    Abstract: This work focuses on the entropy-regularized independent natural policy gradient (NPG) algorithm in multi-agent reinforcement learning. In this work, agents are assumed to have access to an oracle with exact policy evaluation and seek to maximize their respective independent rewards. Each individual's reward is assumed to depend on the actions of all the agents in the multi-agent system, leading t… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  14. arXiv:2404.18470  [pdf, other

    cs.CE cs.AI cs.CL q-fin.RM q-fin.TR

    ECC Analyzer: Extract Trading Signal from Earnings Conference Calls using Large Language Model for Stock Performance Prediction

    Authors: Yupeng Cao, Zhi Chen, Qingyun Pei, Prashant Kumar, K. P. Subbalakshmi, Papa Momar Ndiaye

    Abstract: In the realm of financial analytics, leveraging unstructured data, such as earnings conference calls (ECCs), to forecast stock performance is a critical challenge that has attracted both academics and investors. While previous studies have used deep learning-based models to obtain a general view of ECCs, they often fail to capture detailed, complex information. Our study introduces a novel framewo… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 15 pages, 3 figures, 5 tables

  15. arXiv:2404.12244  [pdf, other

    cs.CE

    PyTOaCNN: Topology optimization using an adaptive convolutional neural network in Python

    Authors: Khaish Singh Chadha, Prabhat Kumar

    Abstract: This paper introduces an adaptive convolutional neural network (CNN) architecture capable of automating various topology optimization (TO) problems with diverse underlying physics. The proposed architecture has an encoder-decoder-type structure with dense layers added at the bottleneck region to capture complex geometrical features. The network is trained using datasets obtained by the problem-spe… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 24 pages

  16. arXiv:2404.09474  [pdf, other

    cs.CV

    TCCT-Net: Two-Stream Network Architecture for Fast and Efficient Engagement Estimation via Behavioral Feature Signals

    Authors: Alexander Vedernikov, Puneet Kumar, Haoyu Chen, Tapio Seppanen, Xiaobai Li

    Abstract: Engagement analysis finds various applications in healthcare, education, advertisement, services. Deep Neural Networks, used for analysis, possess complex architecture and need large amounts of input data, computational power, inference time. These constraints challenge embedding systems into devices for real-time use. To address these limitations, we present a novel two-stream feature fusion "Ten… ▽ More

    Submitted 14 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted for the CVPR 2024 workshop (ABAW)

  17. arXiv:2404.07452  [pdf, other

    q-fin.RM cs.AI cs.CE cs.LG q-fin.PM

    RiskLabs: Predicting Financial Risk Using Large Language Model Based on Multi-Sources Data

    Authors: Yupeng Cao, Zhi Chen, Qingyun Pei, Fabrizio Dimino, Lorenzo Ausiello, Prashant Kumar, K. P. Subbalakshmi, Papa Momar Ndiaye

    Abstract: The integration of Artificial Intelligence (AI) techniques, particularly large language models (LLMs), in finance has garnered increasing academic attention. Despite progress, existing studies predominantly focus on tasks like financial text summarization, question-answering (Q$\&$A), and stock movement prediction (binary classification), with a notable gap in the application of LLMs for financial… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 24 pages, 7 figures, 5 tables, 1 algorithm

  18. arXiv:2404.07172  [pdf, other

    cs.LG math.NA math.OC

    A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks

    Authors: Neel Mishra, Bamdev Mishra, Pratik Jawanpuria, Pawan Kumar

    Abstract: A novel first-order method is proposed for training generative adversarial networks (GANs). It modifies the Gauss-Newton method to approximate the min-max Hessian and uses the Sherman-Morrison inversion formula to calculate the inverse. The method corresponds to a fixed-point method that ensures necessary contraction. To evaluate its effectiveness, numerical experiments are conducted on various da… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: accepted in IJCNN 2023, 9 pages

  19. arXiv:2404.04118  [pdf, other

    cs.LG cs.DC

    GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System

    Authors: Yidong Gong, Pradeep Kumar

    Abstract: We hypothesize that the absence of a standardized benchmark has allowed several fundamental pitfalls in GNN System design and evaluation that the community has overlooked. In this work, we propose GNNBench, a plug-and-play benchmarking platform focused on system innovation. GNNBench presents a new protocol to exchange their captive tensor data, supports custom classes in System APIs, and allows au… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  20. arXiv:2404.03995  [pdf, other

    cs.SE cs.AI

    Balancing Progress and Responsibility: A Synthesis of Sustainability Trade-Offs of AI-Based Systems

    Authors: Apoorva Nalini Pradeep Kumar, Justus Bogner, Markus Funke, Patricia Lago

    Abstract: Recent advances in artificial intelligence (AI) capabilities have increased the eagerness of companies to integrate AI into software systems. While AI can be used to have a positive impact on several dimensions of sustainability, this is often overshadowed by its potential negative influence. While many studies have explored sustainability factors in isolation, there is insufficient holistic cover… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted for publication at the 8th International Workshop on Green and Sustainable Software (GREENS'24), collocated with ICSA'24

  21. arXiv:2404.03908  [pdf, other

    cs.LG cs.AI cs.SD

    Multi-Task Learning for Lung sound & Lung disease classification

    Authors: Suma K V, Deepali Koppad, Preethi Kumar, Neha A Kantikar, Surabhi Ramesh

    Abstract: In recent years, advancements in deep learning techniques have considerably enhanced the efficiency and accuracy of medical diagnostics. In this work, a novel approach using multi-task learning (MTL) for the simultaneous classification of lung sounds and lung diseases is proposed. Our proposed model leverages MTL with four different deep learning models such as 2D CNN, ResNet50, MobileNet and Dens… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  22. arXiv:2403.17199  [pdf, other

    cs.CL

    Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large Language Model

    Authors: Braja Gopal Patra, Lauren A. Lepow, Praneet Kasi Reddy Jagadeesh Kumar, Veer Vekaria, Mohit Manoj Sharma, Prakash Adekkanattu, Brian Fennessy, Gavin Hynes, Isotta Landi, Jorge A. Sanchez-Ruiz, Euijung Ryu, Joanna M. Biernacka, Girish N. Nadkarni, Ardesheer Talati, Myrna Weissman, Mark Olfson, J. John Mann, Alexander W. Charney, Jyotishman Pathak

    Abstract: Background: Social support (SS) and social isolation (SI) are social determinants of health (SDOH) associated with psychiatric outcomes. In electronic health records (EHRs), individual-level SS/SI is typically documented as narrative clinical notes rather than structured coded data. Natural language processing (NLP) algorithms can automate the otherwise labor-intensive process of data extraction.… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 2 figures, 3 tables

  23. arXiv:2403.15328  [pdf, other

    cs.ET cs.AR

    Cross-layer Modeling and Design of Content Addressable Memories in Advanced Technology Nodes for Similarity Search

    Authors: Siri Narla, Piyush Kumar, Mohammad Adnaan, Azad Naeemi

    Abstract: In this paper we present a comprehensive design and benchmarking study of Content Addressable Memory (CAM) at the 7nm technology node in the context of similarity search applications. We design CAM cells based on SRAM, spin-orbit torque, and ferroelectric field effect transistor devices and from their layouts extract cell parasitics using state of the art EDA tools. These parasitics are used to de… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures

  24. arXiv:2403.10205  [pdf, other

    cs.CL cs.AI

    Read between the lines -- Functionality Extraction From READMEs

    Authors: Prince Kumar, Srikanth Tamilselvam, Dinesh Garg

    Abstract: While text summarization is a well-known NLP task, in this paper, we introduce a novel and useful variant of it called functionality extraction from Git README files. Though this task is a text2text generation at an abstract level, it involves its own peculiarities and challenges making existing text2text generation systems not very useful. The motivation behind this task stems from a recent surge… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  25. arXiv:2403.10088  [pdf, other

    cs.CL cs.AI

    Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF

    Authors: Amey Hengle, Aswini Kumar, Sahajpreet Singh, Anil Bandhakavi, Md Shad Akhtar, Tanmoy Chakroborty

    Abstract: Counterspeech, defined as a response to mitigate online hate speech, is increasingly used as a non-censorial solution. Addressing hate speech effectively involves dispelling the stereotypes, prejudices, and biases often subtly implied in brief, single-sentence statements or abuses. These implicit expressions challenge language models, especially in seq2seq tasks, as model performance typically exc… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  26. arXiv:2403.08824  [pdf, other

    cs.HC cs.AI cs.MM

    Measuring Non-Typical Emotions for Mental Health: A Survey of Computational Approaches

    Authors: Puneet Kumar, Alexander Vedernikov, Xiaobai Li

    Abstract: Analysis of non-typical emotions, such as stress, depression and engagement is less common and more complex compared to that of frequently discussed emotions like happiness, sadness, fear, and anger. The importance of these non-typical emotions has been increasingly recognized due to their implications on mental health and well-being. Stress and depression impact the engagement in daily tasks, hig… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Under review in IEEE Transactions on Affective Computing

  27. arXiv:2403.06350  [pdf, other

    cs.CL

    IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

    Authors: Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad G, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M. Khapra

    Abstract: Despite the considerable advancements in English LLMs, the progress in building comparable models for other languages has been hindered due to the scarcity of tailored resources. Our work aims to bridge this divide by introducing an expansive suite of resources specifically designed for the development of Indic LLMs, covering 22 languages, containing a total of 251B tokens and 74.8M instruction-re… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  28. arXiv:2403.05738  [pdf, other

    cs.LG cs.GT

    Provable Policy Gradient Methods for Average-Reward Markov Potential Games

    Authors: Min Cheng, Ruida Zhou, P. R. Kumar, Chao Tian

    Abstract: We study Markov potential games under the infinite horizon average reward criterion. Most previous studies have been for discounted rewards. We prove that both algorithms based on independent policy gradient and independent natural policy gradient converge globally to a Nash equilibrium for the average reward criterion. To set the stage for gradient-based methods, we first establish that the avera… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 38 pages, 7 figures, published to AISTAT-24

  29. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  30. Effect of turbulent diffusion in modeling anaerobic digestion

    Authors: Jeremy Z. Yan, Prashant Kumar, Wolfgang Rauch

    Abstract: In this study, the impact of turbulent diffusion on mixing of biochemical reaction models is explored by implementing and validating different models. An original codebase called CHAD (Coupled Hydrodynamics and Anaerobic Digestion) is extended to incorporate turbulent diffusion and validate it against results from OpenFOAM with 2D Rayleigh-Taylor Instability and lid-driven cavity simulations. The… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  31. arXiv:2403.04085  [pdf, other

    cs.CL cs.CY

    Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations

    Authors: Abhishek Anand, Negar Mokhberian, Prathyusha Naresh Kumar, Anweasha Saha, Zihao He, Ashwin Rao, Fred Morstatter, Kristina Lerman

    Abstract: Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels show low confidence on high-disagreement data instances. While previous studies consider such instances as mislabeled, we argue that the reason the high-disagreem… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  32. arXiv:2403.01926  [pdf, other

    cs.CL

    IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages

    Authors: Tahir Javed, Janki Atul Nawale, Eldho Ittan George, Sakshi Joshi, Kaushal Santosh Bhogale, Deovrat Mehendale, Ishvinder Virender Sethi, Aparna Ananthanarayanan, Hafsah Faquih, Pratiti Palit, Sneha Ravishankar, Saranya Sukumaran, Tripura Panchagnula, Sunjay Murali, Kunal Sharad Gandhi, Ambujavalli R, Manickam K M, C Venkata Vaijayanthi, Krishnan Srinivasa Raghavan Karunganni, Pratyush Kumar, Mitesh M Khapra

    Abstract: We present INDICVOICES, a dataset of natural and spontaneous speech containing a total of 7348 hours of read (9%), extempore (74%) and conversational (17%) audio from 16237 speakers covering 145 Indian districts and 22 languages. Of these 7348 hours, 1639 hours have already been transcribed, with a median of 73 hours per language. Through this paper, we share our journey of capturing the cultural,… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  33. arXiv:2402.17768  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning

    Authors: Xiaoyu Zhang, Matthew Chang, Pranav Kumar, Saurabh Gupta

    Abstract: A common failure mode for policies trained with imitation is compounding execution errors at test time. When the learned policy encounters states that are not present in the expert demonstrations, the policy fails, leading to degenerate behavior. The Dataset Aggregation, or DAgger approach to this problem simply collects more data to cover these failure states. However, in practice, this is often… ▽ More

    Submitted 5 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by Robotics: Science and Systems (RSS) 2024. project website with video, see https://sites.google.com/view/diffusion-meets-dagger

  34. arXiv:2402.07640  [pdf, other

    cs.MM cs.AI

    CMFeed: A Benchmark Dataset for Controllable Multimodal Feedback Synthesis

    Authors: Puneet Kumar, Sarthak Malik, Balasubramanian Raman, Xiaobai Li

    Abstract: The Controllable Multimodal Feedback Synthesis (CMFeed) dataset enables the generation of sentiment-controlled feedback from multimodal inputs. It contains images, text, human comments, comments' metadata and sentiment labels. Existing datasets for related tasks such as multimodal summarization, visual question answering, visual dialogue, and sentiment-aware text generation do not incorporate trai… ▽ More

    Submitted 5 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  35. arXiv:2402.03548  [pdf, other

    cs.LG cs.DC

    Single-GPU GNN Systems: Traps and Pitfalls

    Authors: Yidong Gong, Arnab Tarafder, Saima Afrin, Pradeep Kumar

    Abstract: The current graph neural network (GNN) systems have established a clear trend of not showing training accuracy results, and directly or indirectly relying on smaller datasets for evaluations majorly. Our in-depth analysis shows that it leads to a chain of pitfalls in the system design and evaluation process, questioning the practicality of many of the proposed system optimizations, and affecting c… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  36. arXiv:2401.10217  [pdf, other

    cs.CV

    Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions

    Authors: Namitha Padmanabhan, Matthew Gwilliam, Pulkit Kumar, Shishira R Maiya, Max Ehrlich, Abhinav Shrivastava

    Abstract: The many variations of Implicit Neural Representations (INRs), where a neural network is trained as a continuous representation of a signal, have tremendous practical utility for downstream tasks including novel view synthesis, video compression, and image superresolution. Unfortunately, the inner workings of these networks are seriously under-studied. Our work, eXplaining the Implicit Neural Canv… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Project site: https://namithap10.github.io/xinc

  37. SoRoTop: a hitchhiker's guide to topology optimization MATLAB code for design-dependent pneumatic-driven soft robots

    Authors: Prabhat Kumar

    Abstract: Demands for pneumatic-driven soft robots are constantly rising for various applications. However, they are often designed manually due to the lack of systematic methods. Moreover, design-dependent characteristics of pneumatic actuation pose distinctive challenges. This paper provides a compact MATLAB code, named SoRoTop, and its various extensions for designing pneumatic-driven soft robots using t… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 31 pages, MATLAB codes

    Journal ref: Optimization and Engineering, 2023

  38. arXiv:2401.02665  [pdf, other

    cs.LG cs.AI physics.ao-ph

    Zero-shot Microclimate Prediction with Deep Learning

    Authors: Iman Deznabi, Peeyush Kumar, Madalina Fiterau

    Abstract: Weather station data is a valuable resource for climate prediction, however, its reliability can be limited in remote locations. To compound the issue, making local predictions often relies on sensor data that may not be accessible for a new, previously unmonitored location. In response to these challenges, we propose a novel zero-shot learning approach designed to forecast various climate measure… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Journal ref: Tackling Climate Change with Machine Learning: workshop at NeurIPS 2023

  39. arXiv:2312.15488  [pdf, other

    cs.CC

    The Zeta ($ζ$) Notation for Complex Asymptotes

    Authors: Anurag Dutta, K. Lakshmanan, John Harshith, A. Ramamoorthy, C. Pradeep, Pijush Kanti Kumar

    Abstract: Time Complexity is an important metric to compare algorithms based on their cardinality. The commonly used, trivial notations to qualify the same are the Big-Oh, Big-Omega, Big-Theta, Small-Oh, and Small-Omega Notations. All of them, consider time a part of the real entity, i.e., Time coincides with the horizontal axis in the argand plane. But what if the Time rather than completely coinciding wit… ▽ More

    Submitted 1 February, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  40. arXiv:2312.13211  [pdf, other

    cs.CL

    DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization

    Authors: Rahul Chand, Yashoteja Prabhu, Pratyush Kumar

    Abstract: With the tremendous success of large transformer models in natural language understanding, down-sizing them for cost-effective deployments has become critical. Recent studies have explored the low-rank weight factorization techniques which are efficient to train, and apply out-of-the-box to any transformer architecture. Unfortunately, the low-rank assumption tends to be over-restrictive and hinder… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 9 page main paper. 1 page appendix

  41. arXiv:2312.05686  [pdf, other

    cs.AI

    Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains

    Authors: Ananta Mukherjee, Peeyush Kumar, Boling Yang, Nishanth Chandran, Divya Gupta

    Abstract: This paper addresses privacy concerns in multi-agent reinforcement learning (MARL), specifically within the context of supply chains where individual strategic data must remain confidential. Organizations within the supply chain are modeled as agents, each seeking to optimize their own objectives while interacting with others. As each organization's strategy is contingent on neighboring strategies… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  42. arXiv:2312.00068  [pdf, other

    cs.RO cs.CV

    GLiDR: Topologically Regularized Graph Generative Network for Sparse LiDAR Point Clouds

    Authors: Prashant Kumar, Kshitij Madhav Bhat, Vedang Bhupesh Shenvi Nadkarni, Prem Kalra

    Abstract: Sparse LiDAR point clouds cause severe loss of detail of static structures and reduce the density of static points available for navigation. Reduced density can be detrimental to navigation under several scenarios. We observe that despite high sparsity, in most cases, the global topology of LiDAR outlining the static structures can be inferred. We utilize this property to obtain a backbone skeleto… ▽ More

    Submitted 24 May, 2024; v1 submitted 29 November, 2023; originally announced December 2023.

    Comments: IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR)

  43. arXiv:2311.16484  [pdf, other

    cs.CV

    Eye vs. AI: Human Gaze and Model Attention in Video Memorability

    Authors: Prajneya Kumar, Eshika Khandelwal, Makarand Tapaswi, Vishnu Sreekumar

    Abstract: Understanding the factors that determine video memorability has important applications in areas such as educational technology and advertising. Towards this goal, we investigate the semantic and temporal attention mechanisms underlying video memorability. We propose a Transformer-based model with spatio-temporal attention that matches SoTA performance on video memorability prediction on a large na… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  44. arXiv:2311.08927  [pdf

    cs.CE physics.flu-dyn

    Introducing CHAD -- An ADM1 Solver for Direct Linking to Lagrangian CFD Software

    Authors: Prashant Kumar, Zhenghao Yan, Soroush Dabiri, Nikolaus Rauch, Wolfgang Rauch

    Abstract: Standard methods for modeling anaerobic digestion processes assume homogeneous conditions inside the tank and thus suffer from the negligence of hydrodynamics. In this work, we present the software toolbox Coupled Hydrodynamics and Anaerobic Digestion (CHAD), a novel parallelized solver that is capable of utilizing CFD results as the basis for Anaerobic digestion model No.1 (ADMno1) simulations. C… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: conference paper

    ACM Class: I.6.5; G.1.10

  45. arXiv:2310.11515  [pdf, ps, other

    cs.LG

    Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

    Authors: Yu-Heng Hung, **-Chun Hsieh, Akshay Mete, P. R. Kumar

    Abstract: We consider the infinite-horizon linear Markov Decision Processes (MDPs), where the transition probabilities of the dynamic model can be linearly parameterized with the help of a predefined low-dimensional feature map**. While the existing regression-based approaches have been theoretically shown to achieve nearly-optimal regret, they are computationally rather inefficient due to the need for a… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  46. arXiv:2310.10691  [pdf, other

    cs.LG cs.AR

    Enhancing ML model accuracy for Digital VLSI circuits using diffusion models: A study on synthetic data generation

    Authors: Prasha Srivastava, Pawan Kumar, Zia Abbas

    Abstract: Generative AI has seen remarkable growth over the past few years, with diffusion models being state-of-the-art for image generation. This study investigates the use of diffusion models in generating artificial data generation for electronic circuits for enhancing the accuracy of subsequent machine learning models in tasks such as performance assessment, design, and testing when training data is us… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 7 pages, submitted to NeurIPS workshop 2023

  47. arXiv:2310.10355  [pdf, other

    cs.CE

    Topology optimization of fluidic pressure-driven multi-material compliant mechanisms

    Authors: Prabhat Kumar, Josh Pinskier, David Howard, Matthijs Langelaar

    Abstract: Compliant mechanisms actuated by pneumatic loads are receiving increasing attention due to their direct applicability as soft robots that perform tasks using their flexible bodies. Using multiple materials to build them can further improve their performance and efficiency. Due to developments in additive manufacturing, the fabrication of multi-material soft robots is becoming a real possibility. T… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Proceedings of the ASME IDETC conference 2023

  48. arXiv:2310.09852  [pdf, other

    cs.LG

    Alpha Elimination: Using Deep Reinforcement Learning to Reduce Fill-In during Sparse Matrix Decomposition

    Authors: Arpan Dasgupta, Pawan Kumar

    Abstract: A large number of computational and scientific methods commonly require decomposing a sparse matrix into triangular factors as LU decomposition. A common problem faced during this decomposition is that even though the given matrix may be very sparse, the decomposition may lead to a denser triangular factors due to fill-in. A significant fill-in may lead to prohibitively larger computational costs… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: accepted to ECML 2023, Research Track

  49. arXiv:2310.09727  [pdf, other

    cs.LG math.OC

    Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games

    Authors: Youbang Sun, Tao Liu, Ruida Zhou, P. R. Kumar, Shahin Shahrampour

    Abstract: This work studies an independent natural policy gradient (NPG) algorithm for the multi-agent reinforcement learning problem in Markov potential games. It is shown that, under mild technical assumptions and the introduction of the \textit{suboptimality gap}, the independent NPG method with an oracle providing exact policy evaluation asymptotically reaches an $ε$-Nash Equilibrium (NE) within… ▽ More

    Submitted 27 October, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: Will appear in NeurIPS 2023

  50. arXiv:2310.02069  [pdf, other

    cs.CE

    TOaCNN: Adaptive Convolutional Neural Network for Multidisciplinary Topology Optimization

    Authors: Khaish Singh Chadha, Prabhat Kumar

    Abstract: This paper presents an adaptive convolutional neural network (CNN) architecture that can automate diverse topology optimization (TO) problems having different underlying physics. The architecture uses the encoder-decoder networks with dense layers in the middle which includes an additional adaptive layer to capture complex geometrical features. The network is trained using the dataset obtained fro… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted in 6th NCMDAO 2023