Skip to main content

Showing 1–50 of 690 results for author: Das, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19391  [pdf, other

    cs.CV

    Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

    Authors: Ali Khaleghi Rahimian, Manish Kumar Govind, Subhajit Maity, Dominick Reilly, Christian Kümmerle, Srijan Das, Aritra Dutta

    Abstract: Visual perception tasks are predominantly solved by Vision Transformer (ViT) architectures, which, despite their effectiveness, encounter a computational bottleneck due to the quadratic complexity of computing self-attention. This inefficiency is largely due to the self-attention heads capturing redundant token interactions, reflecting inherent redundancy within visual data. Many works have aimed… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: The code is publicly available at https://github.com/Charlotte-CharMLab/Fibottention

  2. arXiv:2406.19299  [pdf, other

    cs.CV

    PNeRV: A Polynomial Neural Representation for Videos

    Authors: Sonam Gupta, Snehal Singh Tomar, Grigorios G Chrysos, Sukhendu Das, A. N. Rajagopalan

    Abstract: Extracting Implicit Neural Representations (INRs) on video data poses unique challenges due to the additional temporal dimension. In the context of videos, INRs have predominantly relied on a frame-only parameterization, which sacrifices the spatiotemporal continuity observed in pixel-level (spatial) representations. To mitigate this, we introduce Polynomial Neural Representation for Videos (PNeRV… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 25 pages, 17 figures, published at TMLR, Feb 2024

  3. arXiv:2406.17652  [pdf, other

    cs.GR cs.CG cs.CV

    Time-varying Extremum Graphs

    Authors: Somenath Das, Raghavendra Sridharamurthy, Vijay Natarajan

    Abstract: We introduce time-varying extremum graph (TVEG), a topological structure to support visualization and analysis of a time-varying scalar field. The extremum graph is a substructure of the Morse-Smale complex. It captures the adjacency relationship between cells in the Morse decomposition of a scalar field. We define the TVEG as a time-varying extension of the extremum graph and demonstrate how it c… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.11704  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek , et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.10133  [pdf, other

    cs.CL cs.AI

    Evaluation of Large Language Models: STEM education and Gender Stereotypes

    Authors: Smilla Due, Sneha Das, Marianne Andersen, Berta Plandolit López, Sniff Andersen Nexø, Line Clemmensen

    Abstract: Large Language Models (LLMs) have an increasing impact on our lives with use cases such as chatbots, study support, coding support, ideation, writing assistance, and more. Previous studies have revealed linguistic biases in pronouns used to describe professions or adjectives used to describe men vs women. These issues have to some degree been addressed in updated LLM versions, at least to pass exi… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2406.09390  [pdf, other

    cs.CV cs.LG

    LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

    Authors: Rajatsubhra Chakraborty, Arkaprava Sinha, Dominick Reilly, Manish Kumar Govind, Pu Wang, Francois Bremond, Srijan Das

    Abstract: Large Language Vision Models (LLVMs) have demonstrated effectiveness in processing internet videos, yet they struggle with the visually perplexing dynamics present in Activities of Daily Living (ADL) due to limited pertinent datasets and models tailored to relevant cues. To this end, we propose a framework for curating ADL multiview datasets to fine-tune LLVMs, resulting in the creation of ADL-X,… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  7. arXiv:2406.06046  [pdf, other

    cs.CL cs.LG

    MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models

    Authors: Zichun Yu, Spandan Das, Chenyan Xiong

    Abstract: Pretraining data selection has the potential to improve language model pretraining efficiency by utilizing higher-quality data from massive web data corpora. Current data selection methods, which rely on either hand-crafted rules or larger reference models, are conducted statically and do not capture the evolving data preferences during pretraining. In this paper, we introduce model-aware data sel… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: The code is open-sourced at https://github.com/cxcscmu/MATES

  8. arXiv:2406.02722  [pdf, other

    cs.RO

    Control of Microrobots Using Model Predictive Control and Gaussian Processes for Disturbance Estimation

    Authors: Mehdi Kermanshah, Logan E. Beaver, Max Sokolich, Sambeeta Das, Ron Weiss, Roberto Tron, Calin Belta

    Abstract: This paper presents a control framework for magnetically actuated micron-scale robots ($μ$bots) designed to mitigate disturbances and improve trajectory tracking. To address the challenges posed by unmodeled dynamics and environmental variability, we combine data-driven modeling with model-based control to accurately track desired trajectories using a relatively small amount of data. The system is… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  9. arXiv:2405.20488  [pdf, other

    cs.DC

    Shoal++: High Throughput DAG BFT Can Be Fast!

    Authors: Balaji Arun, Zekun Li, Florian Suri-Payer, Sourav Das, Alexander Spiegelman

    Abstract: Today's practical partially synchronous Byzantine Fault Tolerant (BFT) consensus protocols trade off low latency and high throughput. On the one end, traditional BFT protocols such as PBFT and its derivatives optimize for latency. They require, in fault-free executions, only 3 message exchanges to commit, the optimum for BFT consensus. However, this class of protocols typically relies on a single… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  10. arXiv:2405.19519  [pdf, other

    cs.CL cs.AI

    Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data

    Authors: Sudeshna Das, Yao Ge, Yuting Guo, Swati Rajwal, JaMor Hairston, Jeanne Powell, Drew Walker, Snigdha Peddireddy, Sahithi Lakamana, Selen Bozkurt, Matthew Reyna, Reza Sameni, Yunyu Xiao, Sangmi Kim, Rasheeta Chandler, Natalie Hernandez, Danielle Mowery, Rachel Wightman, Jennifer Love, Anthony Spadaro, Jeanmarie Perrone, Abeed Sarker

    Abstract: Retrieval augmented generation (RAG) provides the capability to constrain generative model outputs, and mitigate the possibility of hallucination, by providing relevant in-context text. The number of tokens a generative large language model (LLM) can incorporate as context is finite, thus limiting the volume of knowledge from which to generate an answer. We propose a two-layer RAG framework for qu… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  11. arXiv:2405.18572  [pdf, other

    cs.LG cs.AI cs.CL

    Low-rank finetuning for LLMs: A fairness perspective

    Authors: Saswat Das, Marco Romanelli, Cuong Tran, Zarreen Reza, Bhavya Kailkhura, Ferdinando Fioretto

    Abstract: Low-rank approximation techniques have become the de facto standard for fine-tuning Large Language Models (LLMs) due to their reduced computational and memory requirements. This paper investigates the effectiveness of these methods in capturing the shift of fine-tuning datasets from the initial pre-trained data distribution. Our findings reveal that there are cases in which low-rank fine-tuning fa… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  12. arXiv:2405.16639  [pdf, ps, other

    cs.LG

    A unified law of robustness for Bregman divergence losses

    Authors: Santanu Das, Jatin Batra, Piyush Srivastava

    Abstract: In contemporary deep learning practice, models are often trained to near zero loss i.e. to nearly interpolate the training data. However, the number of parameters in the model is usually far more than the number of data points $n$, the theoretical minimum needed for interpolation: a phenomenon referred to as overparameterization. In an interesting piece of work that contributes to the considerable… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 16 pages

  13. arXiv:2405.15218  [pdf, other

    cs.LG

    AGS-GNN: Attribute-guided Sampling for Graph Neural Networks

    Authors: Siddhartha Shankar Das, S M Ferdous, Mahantesh M Halappanavar, Edoardo Serra, Alex Pothen

    Abstract: We propose AGS-GNN, a novel attribute-guided sampling algorithm for Graph Neural Networks (GNNs) that exploits node features and connectivity structure of a graph while simultaneously adapting for both homophily and heterophily in graphs. (In homophilic graphs vertices of the same class are more likely to be connected, and vertices of different classes tend to be linked in heterophilic graphs.) Wh… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: The paper has been accepted to KDD'24 in the research track

  14. arXiv:2405.12122  [pdf, other

    cs.LG

    An Active Learning Framework with a Class Balancing Strategy for Time Series Classification

    Authors: Shemonto Das

    Abstract: Training machine learning models for classification tasks often requires labeling numerous samples, which is costly and time-consuming, especially in time series analysis. This research investigates Active Learning (AL) strategies to reduce the amount of labeled data needed for effective time series classification. Traditional AL techniques cannot control the selection of instances per class for l… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Master's thesis accepted by Memorial University of Newfoundland. Chapter 3 published in the Journal of Frontiers in Robotics and AI. Chapter 4 published in the IEEE Systems Conference 2024

  15. arXiv:2405.11128  [pdf

    cs.PL cs.SE

    Parsimonious Optimal Dynamic Partial Order Reduction

    Authors: Parosh Aziz Abdulla, Mohamed Faouzi Atig, Sarbojit Das, Bengt Jonsson, Konstantinos Sagonas

    Abstract: Stateless model checking is a fully automatic verification technique for concurrent programs that checks for safety violations by exploring all possible thread schedulings. It becomes effective when coupled with Dynamic Partial Order Reduction (DPOR), which introduces an equivalence on schedulings and reduces the amount of needed exploration. DPOR algorithms that are \emph{optimal} are particularl… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  16. arXiv:2405.11008  [pdf

    cs.LG cs.AI

    A Systematic Review and Meta-Analysis on Sleep Stage Classification and Sleep Disorder Detection Using Artificial Intelligence

    Authors: Tayab Uddin Wara, Ababil Hossain Fahad, Adri Shankar Das, Md. Mehedi Hasan Shawon

    Abstract: Sleep is vital for people's physical and mental health, and sound sleep can help them focus on daily activities. Therefore, a sleep study that includes sleep patterns and disorders is crucial to enhancing our knowledge about individuals' health status. The findings on sleep stages and sleep disorders relied on polysomnography and self-report measures, and then the study went through clinical asses… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 40 pages, 11 Figures, 8 Tables

  17. arXiv:2405.06145  [pdf, other

    cs.CL cs.AI cs.LG

    Reddit-Impacts: A Named Entity Recognition Dataset for Analyzing Clinical and Social Effects of Substance Use Derived from Social Media

    Authors: Yao Ge, Sudeshna Das, Karen O'Connor, Mohammed Ali Al-Garadi, Graciela Gonzalez-Hernandez, Abeed Sarker

    Abstract: Substance use disorders (SUDs) are a growing concern globally, necessitating enhanced understanding of the problem and its trends through data-driven research. Social media are unique and important sources of information about SUDs, particularly since the data in such sources are often generated by people with lived experiences. In this paper, we introduce Reddit-Impacts, a challenging Named Entit… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 7 pages, 1 figure, 4 tables

  18. arXiv:2405.06142  [pdf, other

    quant-ph cs.IT

    Sequentially Encodable Codeword Stabilized Codes

    Authors: Sowrabh Sudevan, Sourin Das, Thamadathil Aswanth, Navin Kashyap

    Abstract: An m-uniform quantum state on n qubits is an entangled state in which every m-qubit subsystem is maximally mixed. Such a state spans a pure [[n,0,m+1]] quantum error correcting code (QECC). Starting with an m-uniform state realized as the graph state associated with an m-regular graph, and a classical [n,k,d \ge m+1] binary linear code with certain additional properties, we construct codeword stab… ▽ More

    Submitted 12 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: A shorter version of this manuscript will appear in the Proceedings of the 2024 International Symposium on Information Theory (ISIT 2024)

  19. arXiv:2405.05813  [pdf

    cs.SE

    Revitalising Stagecraft: NLP-Driven Sentiment Analysis for Traditional Theater Revival

    Authors: Saikat Samanta, Saptarshi Karmakar, Satayajay Behuria, Shibam Dutta, Soujit Das, Soumik Saha

    Abstract: This paper explores the application of FilmFrenzy, a python based ticket booking web application, in the revival of traditional Indian theatres. Additionally, this research paper explores how NLP can be implemented to improve user experience. Through clarifying audience views and pinpointing opportunities for development, FilmFrenzy aims to promote involvement and rejuvenation in India's conventio… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  20. arXiv:2405.05574  [pdf, other

    cs.CV

    Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft

    Authors: Debabrata Pal, Anvita Singh, Saumya Saumya, Shouvik Das

    Abstract: The intrinsic capability to perceive depth of field and extract salient information by the Human Vision System (HVS) stimulates a pilot to perform manual landing over an autoland approach. However, harsh weather creates visibility hindrances, and a pilot must have a clear view of runway elements before the minimum decision altitude. To help a pilot in manual landing, a vision-based system tailored… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  21. arXiv:2405.05511  [pdf, other

    quant-ph cs.ET

    Investigating impact of bit-flip errors in control electronics on quantum computation

    Authors: Subrata Das, Avimita Chatterjee, Swaroop Ghosh

    Abstract: In this paper, we investigate the impact of bit flip errors in FPGA memories in control electronics on quantum computing systems. FPGA memories are integral in storing the amplitude and phase information pulse envelopes, which are essential for generating quantum gate pulses. However, these memories can incur faults due to physical and environmental stressors such as electromagnetic interference,… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 9 pages, 9 figures, conference

  22. arXiv:2405.05204  [pdf

    cs.CL

    CARE-SD: Classifier-based analysis for recognizing and eliminating stigmatizing and doubt marker labels in electronic health records: model development and validation

    Authors: Drew Walker, Annie Thorne, Sudeshna Das, Jennifer Love, Hannah LF Cooper, Melvin Livingston III, Abeed Sarker

    Abstract: Objective: To detect and classify features of stigmatizing and biased language in intensive care electronic health records (EHRs) using natural language processing techniques. Materials and Methods: We first created a lexicon and regular expression lists from literature-driven stem words for linguistic features of stigmatizing patient labels, doubt markers, and scare quotes within EHRs. The lexico… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 28 pages, 3 figures, 4 tables. 5 Appendices

  23. arXiv:2405.04292  [pdf, other

    cs.CL cs.AI

    Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask Learning

    Authors: Sayantan Pal, Souvik Das, Rohini K. Srihari

    Abstract: This study introduces 'clickbait spoiling', a novel technique designed to detect, categorize, and generate spoilers as succinct text responses, countering the curiosity induced by clickbait content. By leveraging a multi-task learning framework, our model's generalization capabilities are significantly enhanced, effectively addressing the pervasive issue of clickbait. The crux of our research lies… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted in ICON 2023

  24. arXiv:2404.18198  [pdf, other

    quant-ph cs.AI cs.CV cs.LG

    Permutation-equivariant quantum convolutional neural networks

    Authors: Sreetama Das, Filippo Caruso

    Abstract: The Symmetric group $S_{n}$ manifests itself in large classes of quantum systems as the invariance of certain characteristics of a quantum state with respect to permuting the qubits. The subgroups of $S_{n}$ arise, among many other contexts, to describe label symmetry of classical images with respect to spatial transformations, e.g. reflection or rotation. Equipped with the formalism of geometric… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures

  25. arXiv:2404.15487  [pdf, other

    cs.CG cs.DS

    Minimum Consistent Subset in Trees and Interval Graphs

    Authors: Aritra Banik, Sayani Das, Anil Maheshwari, Bubai Manna, Subhas C Nandy, Krishna Priya K M, Bodhayan Roy, Sasanka Roy, Abhishek Sahu

    Abstract: In the Minimum Consistent Subset (MCS) problem, we are presented with a connected simple undirected graph $G=(V,E)$, consisting of a vertex set $V$ of size $n$ and an edge set $E$. Each vertex in $V$ is assigned a color from the set $\{1,2,\ldots, c\}$. The objective is to determine a subset $V' \subseteq V$ with minimum possible cardinality, such that for every vertex $v \in V$, at least one of i… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  26. arXiv:2404.12957  [pdf, other

    cs.CL cs.LG

    Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction

    Authors: Qinyuan Wu, Mohammad Aflah Khan, Soumi Das, Vedant Nanda, Bishwamittra Ghosh, Camila Kolling, Till Speicher, Laurent Bindschaedler, Krishna P. Gummadi, Evimaria Terzi

    Abstract: We propose an approach for estimating the latent knowledge embedded inside large language models (LLMs). We leverage the in-context learning (ICL) abilities of LLMs to estimate the extent to which an LLM knows the facts stored in a knowledge base. Our knowledge estimator avoids reliability concerns with previous prompting-based methods, is both conceptually simpler and easier to apply, and we demo… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  27. arXiv:2404.12081  [pdf, other

    cs.CV

    MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification

    Authors: Weikang Yu, Xiaokang Zhang, Samiran Das, Xiao Xiang Zhu, Pedram Ghamisi

    Abstract: Change detection (CD) from remote sensing (RS) images using deep learning has been widely investigated in the literature. It is typically regarded as a pixel-wise labeling task that aims to classify each pixel as changed or unchanged. Although per-pixel classification networks in encoder-decoder structures have shown dominance, they still suffer from imprecise boundaries and incomplete object deli… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  28. arXiv:2404.09338  [pdf, other

    cs.CL

    Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models

    Authors: Souvik Das, Lifeng **, Linfeng Song, Haitao Mi, Baolin Peng, Dong Yu

    Abstract: Large language models (LLMs) exhibit impressive natural language capabilities but suffer from hallucination -- generating content ungrounded in the realities of training data. Recent work has focused on decoding techniques to improve factuality during inference by leveraging LLMs' hierarchical representation of factual knowledge, manipulating the predicted distributions at inference time. Current… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Work in Progress

  29. arXiv:2404.09136  [pdf, other

    cs.CL cs.AI cs.LG

    TLDR at SemEval-2024 Task 2: T5-generated clinical-Language summaries for DeBERTa Report Analysis

    Authors: Spandan Das, Vinay Samuel, Shahriar Noroozizadeh

    Abstract: This paper introduces novel methodologies for the Natural Language Inference for Clinical Trials (NLI4CT) task. We present TLDR (T5-generated clinical-Language summaries for DeBERTa Report Analysis) which incorporates T5-model generated premise summaries for improved entailment and contradiction analysis in clinical NLI tasks. This approach overcomes the challenges posed by small context windows a… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Journal ref: In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pages 507-516, Mexico City, Mexico. Association for Computational Linguistics

  30. arXiv:2404.08655  [pdf, other

    cs.CL cs.AI cs.LG

    Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection

    Authors: Sourya Dipta Das, Yash Vadi, Kuldeep Yadav

    Abstract: Automated Essay Scoring (AES) systems are widely popular in the market as they constitute a cost-effective and time-effective option for grading systems. Nevertheless, many studies have demonstrated that the AES system fails to assign lower grades to irrelevant responses. Thus, detecting the off-topic response in automated essay scoring is crucial in practical tasks where candidates write unrelate… ▽ More

    Submitted 24 March, 2024; originally announced April 2024.

    Comments: Accepted in LREC-COLING 2024

  31. arXiv:2404.08160  [pdf, other

    cs.CR

    A Survey on Security of Ultra/Hyper Reliable Low Latency Communication: Recent Advancements, Challenges, and Future Directions

    Authors: Annapurna Pradhan, Susmita Das, Md. Jalil Piran, Zhu Han

    Abstract: Ultra-reliable low latency communication (URLLC) is an innovative service offered by fifth-generation (5G) wireless systems. URLLC enables various mission-critical applications by facilitating reliable and low-latency signal transmission to support extreme Quality of Service (QoS) requirements. Apart from reliability and latency, ensuring secure data transmission for URLLC has been a prominent iss… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  32. arXiv:2404.06294  [pdf, other

    eess.IV cs.CV cs.LG

    Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures

    Authors: Arkaprabha Basu, Kushal Bose, Sankha Subhra Mullick, Anish Chakrabarty, Swagatam Das

    Abstract: Super-Resolution (SR) is a time-hallowed image processing problem that aims to improve the quality of a Low-Resolution (LR) sample up to the standard of its High-Resolution (HR) counterpart. We aim to address this by introducing Super-Resolution Generator (SuRGe), a fully-convolutional Generative Adversarial Network (GAN)-based architecture for SR. We show that distinct convolutional features obta… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  33. arXiv:2404.05049  [pdf, other

    cs.CV

    PlateSegFL: A Privacy-Preserving License Plate Detection Using Federated Segmentation Learning

    Authors: Md. Shahriar Rahman Anuvab, Mishkat Sultana, Md. Atif Hossain, Shashwata Das, Suvarthi Chowdhury, Rafeed Rahman, Dibyo Fabian Dofadar, Shahriar Rahman Rana

    Abstract: Automatic License Plate Recognition (ALPR) is an integral component of an intelligent transport system with extensive applications in secure transportation, vehicle-to-vehicle communication, stolen vehicles detection, traffic violations, and traffic flow management. The existing license plate detection system focuses on one-shot learners or pre-trained models that operate with a geometric bounding… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  34. Advances in Differential Privacy and Differentially Private Machine Learning

    Authors: Saswat Das, Subhankar Mishra

    Abstract: There has been an explosion of research on differential privacy (DP) and its various applications in recent years, ranging from novel variants and accounting techniques in differential privacy to the thriving field of differentially private machine learning (DPML) to newer implementations in practice, like those by various companies and organisations such as census bureaus. Most recent surveys foc… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Journal ref: Information Technology Security, 2024, pp 147 to 188, Springer Tracts in Electrical and Electronics Engineering, Springer, Singapore

  35. arXiv:2404.03602  [pdf, other

    cs.CL

    Evaluating LLMs at Detecting Errors in LLM Responses

    Authors: Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang

    Abstract: With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Benchmark and code: https://github.com/psunlpgroup/ReaLMistake

  36. arXiv:2404.02806  [pdf, other

    cs.SE cs.AI cs.HC

    The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers

    Authors: Hussein Mozannar, Valerie Chen, Mohammed Alsobay, Subhro Das, Sebastian Zhao, Dennis Wei, Manish Nagireddy, Prasanna Sattigeri, Ameet Talwalkar, David Sontag

    Abstract: Evaluation of large language models (LLMs) for code has primarily relied on static benchmarks, including HumanEval (Chen et al., 2021), which measure the ability of LLMs to generate complete code that passes unit tests. As LLMs are increasingly used as programmer assistants, we study whether gains on existing benchmarks translate to gains in programmer productivity when coding with LLMs, including… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  37. arXiv:2404.00686   

    cs.LG

    Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning

    Authors: Srinjoy Roy, Swagatam Das

    Abstract: Accounting for the uncertainty of value functions boosts exploration in Reinforcement Learning (RL). Our work introduces Maximum Mean Discrepancy Q-Learning (MMD-QL) to improve Wasserstein Q-Learning (WQL) for uncertainty propagation during Temporal Difference (TD) updates. MMD-QL uses the MMD barycenter for this purpose, as MMD provides a tighter estimate of closeness between probability measures… ▽ More

    Submitted 3 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: We found some flaws in our analysis and we are in the process of rectifying those

  38. arXiv:2404.00464  [pdf, other

    cs.LG

    Leveraging Pre-trained and Transformer-derived Embeddings from EHRs to Characterize Heterogeneity Across Alzheimer's Disease and Related Dementias

    Authors: Matthew West, Colin Magdamo, Lily Cheng, Yingnan He, Sudeshna Das

    Abstract: Alzheimer's disease is a progressive, debilitating neurodegenerative disease that affects 50 million people globally. Despite this substantial health burden, available treatments for the disease are limited and its fundamental causes remain poorly understood. Previous work has suggested the existence of clinically-meaningful sub-types, which it is suggested may correspond to distinct etiologies, d… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 14 pages, 5 figures in main text

  39. arXiv:2403.19831  [pdf, other

    cs.GT

    TASR: A Novel Trust-Aware Stackelberg Routing Algorithm to Mitigate Traffic Congestion

    Authors: Doris E. M. Brown, Venkata Sriram Siddhardh Nadendla, Sajal K. Das

    Abstract: Stackelberg routing platforms (SRP) reduce congestion in one-shot traffic networks by proposing optimal route recommendations to selfish travelers. Traditionally, Stackelberg routing is cast as a partial control problem where a fraction of traveler flow complies with route recommendations, while the remaining respond as selfish travelers. In this paper, a novel Stackelberg routing framework is for… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  40. arXiv:2403.19816  [pdf, other

    cs.LG eess.SP

    The State of Lithium-Ion Battery Health Prognostics in the CPS Era

    Authors: Gaurav Shinde, Rohan Mohapatra, Pooja Krishan, Harish Garg, Srikanth Prabhu, Sanchari Das, Mohammad Masum, Saptarshi Sengupta

    Abstract: Lithium-ion batteries (Li-ion) have revolutionized energy storage technology, becoming integral to our daily lives by powering a diverse range of devices and applications. Their high energy density, fast power response, recyclability, and mobility advantages have made them the preferred choice for numerous sectors. This paper explores the seamless integration of Prognostics and Health Management w… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 18 pages, 12 figures, 6 tables. arXiv admin note: text overlap with arXiv:2310.00023

    MSC Class: 68 ACM Class: B.8.1

  41. arXiv:2403.19497  [pdf, other

    cs.CV

    Surface-based parcellation and vertex-wise analysis of ultra high-resolution ex vivo 7 tesla MRI in neurodegenerative diseases

    Authors: Pulkit Khandelwal, Michael Tran Duong, Constanza Fuentes, Amanda Denning, Winifred Trotman, Ranjit Ittyerah, Alejandra Bahena, Theresa Schuck, Marianna Gabrielyan, Karthik Prabhakaran, Daniel Ohm, Gabor Mizsei, John Robinson, Monica Munoz, John Detre, Edward Lee, David Irwin, Corey McMillan, M. Dylan Tisdall, Sandhitsu Das, David Wolk, Paul A. Yushkevich

    Abstract: Magnetic resonance imaging (MRI) is the standard modality to understand human brain structure and function in vivo (antemortem). Decades of research in human neuroimaging has led to the widespread development of methods and tools to provide automated volume-based segmentations and surface-based parcellations which help localize brain functions to specialized anatomical regions. Recently ex vivo (p… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Under review at MICCAI 2024

  42. arXiv:2403.19435  [pdf, other

    cs.CV

    BAMM: Bidirectional Autoregressive Motion Model

    Authors: Ekkasit Pinyoanuntapong, Muhammad Usama Saleem, Pu Wang, Minwoo Lee, Srijan Das, Chen Chen

    Abstract: Generating human motion from text has been dominated by denoising motion models either through diffusion or generative masking process. However, these models face great limitations in usability by requiring prior knowledge of the motion length. Conversely, autoregressive motion models address this limitation by adaptively predicting motion endpoints, at the cost of degraded generation quality and… ▽ More

    Submitted 1 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  43. arXiv:2403.16175  [pdf, other

    eess.IV cs.CV

    Enhancing MRI-Based Classification of Alzheimer's Disease with Explainable 3D Hybrid Compact Convolutional Transformers

    Authors: Arindam Majee, Avisek Gupta, Sourav Raha, Swagatam Das

    Abstract: Alzheimer's disease (AD), characterized by progressive cognitive decline and memory loss, presents a formidable global health challenge, underscoring the critical importance of early and precise diagnosis for timely interventions and enhanced patient outcomes. While MRI scans provide valuable insights into brain structures, traditional analysis methods often struggle to discern intricate 3D patter… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  44. arXiv:2403.14290  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Exploring Green AI for Audio Deepfake Detection

    Authors: Subhajit Saha, Md Sahidullah, Swagatam Das

    Abstract: The state-of-the-art audio deepfake detectors leveraging deep neural networks exhibit impressive recognition performance. Nonetheless, this advantage is accompanied by a significant carbon footprint. This is mainly due to the use of high-performance computing with accelerators and high training time. Studies show that average deep NLP model produces around 626k lbs of CO\textsubscript{2} which is… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: This manuscript is under review in a conference

  45. arXiv:2403.12161  [pdf

    cs.CE cs.CY q-fin.GN

    Effect of Leaders Voice on Financial Market: An Empirical Deep Learning Expedition on NASDAQ, NSE, and Beyond

    Authors: Arijit Das, Tanmoy Nandi, Prasanta Saha, Suman Das, Saronyo Mukherjee, Sudip Kumar Naskar, Diganta Saha

    Abstract: Financial market like the price of stock, share, gold, oil, mutual funds are affected by the news and posts on social media. In this work deep learning based models are proposed to predict the trend of financial market based on NLP analysis of the twitter handles of leaders of different fields. There are many models available to predict financial market based on only the historical data of the fin… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 20 pages original research

  46. arXiv:2403.11337  [pdf, other

    cs.CV cs.AI

    Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction

    Authors: Xue Bai, Tasmiah Haque, Sumit Mohan, Yuliang Cai, Byungheon Jeong, Adam Halasz, Srinjoy Das

    Abstract: We propose a deep learning based novel prediction framework for enhanced bandwidth reduction in motion transfer enabled video applications such as video conferencing, virtual reality gaming and privacy preservation for patient health monitoring. To model complex motion, we use the First Order Motion Model (FOMM) that represents dynamic objects using learned keypoints along with their local affine… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  47. arXiv:2403.08819  [pdf, other

    cs.LG cs.CL stat.ML

    Thermometer: Towards Universal Calibration for Large Language Models

    Authors: Maohao Shen, Subhro Das, Kristjan Greenewald, Prasanna Sattigeri, Gregory Wornell, Soumya Ghosh

    Abstract: We consider the issue of calibration in large language models (LLM). Recent studies have found that common interventions such as instruction tuning often result in poorly calibrated LLMs. Although calibration is well-explored in traditional applications, calibrating LLMs is uniquely challenging. These challenges stem as much from the severe computational requirements of LLMs as from their versatil… ▽ More

    Submitted 27 June, 2024; v1 submitted 19 February, 2024; originally announced March 2024.

    Comments: Camera ready version for ICML 2024

  48. arXiv:2403.08149  [pdf, other

    cs.RO

    On the Feasibility of EEG-based Motor Intention Detection for Real-Time Robot Assistive Control

    Authors: Ho ** Choi, Satyajeet Das, Shaoting Peng, Ruzena Bajcsy, Nadia Figueroa

    Abstract: This paper explores the feasibility of employing EEG-based intention detection for real-time robot assistive control. We focus on predicting and distinguishing motor intentions of left/right arm movements by presenting: i) an offline data collection and training pipeline, used to train a classifier for left/right motion intention prediction, and ii) an online real-time prediction pipeline leveragi… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  49. arXiv:2403.05174  [pdf, other

    cs.LG

    VTruST: Controllable value function based subset selection for Data-Centric Trustworthy AI

    Authors: Soumi Das, Shubhadip Nag, Shreyyash Sharma, Suparna Bhattacharya, Sourangshu Bhattacharya

    Abstract: Trustworthy AI is crucial to the widespread adoption of AI in high-stakes applications with fairness, robustness, and accuracy being some of the key trustworthiness metrics. In this work, we propose a controllable framework for data-centric trustworthy AI (DCTAI)- VTruST, that allows users to control the trade-offs between the different trustworthiness metrics of the constructed training datasets.… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted in ICLR 2024 DMLR workshop

  50. arXiv:2403.03024  [pdf, other

    cs.SE

    Toward Improved Deep Learning-based Vulnerability Detection

    Authors: Adriana Sejfia, Satyaki Das, Saad Shafiq, Nenad Medvidović

    Abstract: Deep learning (DL) has been a common thread across several recent techniques for vulnerability detection. The rise of large, publicly available datasets of vulnerabilities has fueled the learning process underpinning these techniques. While these datasets help the DL-based vulnerability detectors, they also constrain these detectors' predictive abilities. Vulnerabilities in these datasets have to… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.