Skip to main content

Showing 1–50 of 242 results for author: Chowdhury, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17740  [pdf, other

    cs.LG cs.AI cs.CV

    Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning

    Authors: Arijit Sehanobish, Avinava Dubey, Krzysztof Choromanski, Somnath Basu Roy Chowdhury, Deepali Jain, Vikas Sindhwani, Snigdha Chaturvedi

    Abstract: Recent efforts to scale Transformer models have demonstrated rapid progress across a wide range of tasks (Wei et al., 2022). However, fine-tuning these models for downstream tasks is expensive due to their large parameter counts. Parameter-efficient fine-tuning (PEFT) approaches have emerged as a viable alternative by allowing us to fine-tune models by updating only a small number of parameters. I… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Work in progress

  2. arXiv:2406.16612  [pdf, other

    cs.RO cs.MA

    Towards Physically Talented Aerial Robots with Tactically Smart Swarm Behavior thereof: An Efficient Co-design Approach

    Authors: Prajit KrisshnaKumar, Steve Paul, Hemanth Manjunatha, Mary Corra, Ehsan Esfahani, Souma Chowdhury

    Abstract: The collective performance or capacity of collaborative autonomous systems such as a swarm of robots is jointly influenced by the morphology and the behavior of individual systems in that collective. In that context, this paper explores how morphology impacts the learned tactical behavior of unmanned aerial/ground robots performing reconnaissance and search & rescue. This is achieved by presenting… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted for presentation in proceedings of ASME IDETC-CIE 2024

  3. arXiv:2406.16355  [pdf, other

    cs.LG eess.SY

    Compact Model Parameter Extraction via Derivative-Free Optimization

    Authors: Rafael Perez Martinez, Masaya Iwamoto, Kelly Woo, Zhengliang Bian, Roberto Tinti, Stephen Boyd, Srabanti Chowdhury

    Abstract: In this paper, we address the problem of compact model parameter extraction to simultaneously extract tens of parameters via derivative-free optimization. Traditionally, parameter extraction is performed manually by dividing the complete set of parameters into smaller subsets, each targeting different operational regions of the device, a process that can take several days or even weeks. Our approa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.16257  [pdf, other

    cs.LG

    Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning

    Authors: Somnath Basu Roy Chowdhury, Krzysztof Choromanski, Arijit Sehanobish, Avinava Dubey, Snigdha Chaturvedi

    Abstract: Machine unlearning is the process of efficiently removing the influence of a training data instance from a trained machine learning model without retraining it from scratch. A popular subclass of unlearning approaches is exact machine unlearning, which focuses on techniques that explicitly guarantee the removal of the influence of a data instance from a model. Exact unlearning approaches use a mac… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  5. arXiv:2406.16099  [pdf, other

    cs.SD eess.AS

    Speech Representation Analysis based on Inter- and Intra-Model Similarities

    Authors: Yassine El Kheir, Ahmed Ali, Shammur Absar Chowdhury

    Abstract: Self-supervised models have revolutionized speech processing, achieving new levels of performance in a wide variety of tasks with limited resources. However, the inner workings of these models are still opaque. In this paper, we aim to analyze the encoded contextual representation of these foundation models based on their inter- and intra-model similarity, independent of any external annotation an… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 5 pages, Accepted to appear in ICASSP XAI-SA Workshop

  6. arXiv:2406.15881  [pdf, other

    cs.LG cs.AI

    Fast Tree-Field Integrators: From Low Displacement Rank to Topological Transformers

    Authors: Krzysztof Choromanski, Arijit Sehanobish, Somnath Basu Roy Chowdhury, Han Lin, Avinava Dubey, Tamas Sarlos, Snigdha Chaturvedi

    Abstract: We present a new class of fast polylog-linear algorithms based on the theory of structured matrices (in particular low displacement rank) for integrating tensor fields defined on weighted trees. Several applications of the resulting fast tree-field integrators (FTFIs) are presented, including (a) approximation of graph metrics with tree metrics, (b) graph classification, (c) modeling on meshes, an… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Preprint. Comments welcome

  7. arXiv:2406.13431  [pdf, other

    cs.CL cs.SD eess.AS

    Children's Speech Recognition through Discrete Token Enhancement

    Authors: Vrunda N. Sukhadia, Shammur Absar Chowdhury

    Abstract: Children's speech recognition is considered a low-resource task mainly due to the lack of publicly available data. There are several reasons for such data scarcity, including expensive data collection and annotation processes, and data privacy, among others. Transforming speech signals into discrete tokens that do not carry sensitive information but capture both linguistic and acoustic information… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  8. arXiv:2406.11107  [pdf, other

    cs.CL

    Exploring Safety-Utility Trade-Offs in Personalized Language Models

    Authors: Anvesh Rao Vij**i, Somnath Basu Roy Chowdhury, Snigdha Chaturvedi

    Abstract: As large language models (LLMs) become increasingly integrated into daily applications, it is essential to ensure they operate fairly across diverse user demographics. In this work, we show that LLMs suffer from personalization bias, where their performance is impacted when they are personalized to a user's identity. We quantify personalization bias by evaluating the performance of LLMs along two… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  9. arXiv:2406.04673  [pdf, other

    cs.CV cs.AI cs.MM eess.AS

    MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models

    Authors: Sanjoy Chowdhury, Sayan Nag, K J Joseph, Balaji Vasan Srinivasan, Dinesh Manocha

    Abstract: Music is a universal language that can communicate emotions and feelings. It forms an essential part of the whole spectrum of creative media, ranging from movies to social media posts. Machine learning models that can synthesize music are predominantly conditioned on textual descriptions of it. Inspired by how musicians compose music not just from a movie script, but also through visualizations, w… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024 as Highlight paper. Webpage: https://schowdhury671.github.io/melfusion_cvpr2024/

  10. arXiv:2406.04511  [pdf, other

    cs.CV

    Classification of Non-native Handwritten Characters Using Convolutional Neural Network

    Authors: F. A. Mamun, S. A. H. Chowdhury, J. E. Giti, H. Sarker

    Abstract: The use of convolutional neural networks (CNNs) has accelerated the progress of handwritten character classification/recognition. Handwritten character recognition (HCR) has found applications in various domains, such as traffic signal detection, language translation, and document information extraction. However, the widespread use of existing HCR technology is yet to be seen as it does not provid… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  11. xNose: A Test Smell Detector for C#

    Authors: Partha P. Paul, Md Tonoy Akanda, M. Raihan Ullah, Dipto Mondal, Nazia S. Chowdhury, Fazle M. Tawsif

    Abstract: Test smells, similar to code smells, can negatively impact both the test code and the production code being tested. Despite extensive research on test smells in languages like Java, Scala, and Python, automated tools for detecting test smells in C# are lacking. This paper aims to bridge this gap by extending the study of test smells to C#, and develo** a tool (xNose) to identify test smells in t… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Full report of our ICSE'24 poster

  12. arXiv:2405.02571  [pdf, other

    cs.CV

    ViTALS: Vision Transformer for Action Localization in Surgical Nephrectomy

    Authors: Soumyadeep Chandra, Sayeed Shafayet Chowdhury, Courtney Yong, Chandru P. Sundaram, Kaushik Roy

    Abstract: Surgical action localization is a challenging computer vision problem. While it has promising applications including automated training of surgery procedures, surgical workflow optimization, etc., appropriate model design is pivotal to accomplishing this task. Moreover, the lack of suitable medical datasets adds an additional layer of complexity. To that effect, we introduce a new complex dataset… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Nephrectomy surgery, Surgical Phase Recognition, Surgical Workflow Segmentation, 11 pages, 2 figures, 2 tables

  13. arXiv:2404.09147  [pdf

    cs.HC

    Evaluating the efficacy of haptic feedback, 360° treadmill-integrated Virtual Reality framework and longitudinal training on decision-making performance in a complex search-and-shoot simulation

    Authors: Akash K Rao, Arnav Bhavsar, Shubhajit Roy Chowdhury, Sushil Chandra, Ramsingh Negi, Prakash Duraisamy, Varun Dutt

    Abstract: Virtual Reality (VR) has made significant strides, offering users a multitude of ways to interact with virtual environments. Each sensory modality in VR provides distinct inputs and interactions, enhancing the user's immersion and presence. However, the potential of additional sensory modalities, such as haptic feedback and 360° locomotion, to improve decision-making performance has not been thoro… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 13 pages, 6 figures, 1 Table

  14. arXiv:2404.05049  [pdf, other

    cs.CV

    PlateSegFL: A Privacy-Preserving License Plate Detection Using Federated Segmentation Learning

    Authors: Md. Shahriar Rahman Anuvab, Mishkat Sultana, Md. Atif Hossain, Shashwata Das, Suvarthi Chowdhury, Rafeed Rahman, Dibyo Fabian Dofadar, Shahriar Rahman Rana

    Abstract: Automatic License Plate Recognition (ALPR) is an integral component of an intelligent transport system with extensive applications in secure transportation, vehicle-to-vehicle communication, stolen vehicles detection, traffic violations, and traffic flow management. The existing license plate detection system focuses on one-shot learners or pre-trained models that operate with a geometric bounding… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  15. arXiv:2404.03570  [pdf, other

    cs.RO

    Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity

    Authors: Jake Varley, Sumeet Singh, Deepali Jain, Krzysztof Choromanski, Andy Zeng, Somnath Basu Roy Chowdhury, Avinava Dubey, Vikas Sindhwani

    Abstract: We present an embodied AI system which receives open-ended natural language instructions from a human, and controls two arms to collaboratively accomplish potentially long-horizon tasks over a large workspace. Our system is modular: it deploys state of the art Large Language Models for task planning,Vision-Language models for semantic perception, and Point Cloud transformers for gras**. With sem… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  16. arXiv:2403.11487  [pdf, other

    cs.RO cs.AI

    Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis

    Authors: Vishnu Sashank Dorbala, Sanjoy Chowdhury, Dinesh Manocha

    Abstract: We present a novel approach to automatically synthesize "wayfinding instructions" for an embodied robot agent. In contrast to prior approaches that are heavily reliant on human-annotated datasets designed exclusively for specific simulation platforms, our algorithm uses in-context learning to condition an LLM to generate instructions using just a few references. Using an LLM-based Visual Question… ▽ More

    Submitted 2 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 14 Pages

  17. arXiv:2403.07131  [pdf, other

    cs.AI cs.MA

    Bigraph Matching Weighted with Learnt Incentive Function for Multi-Robot Task Allocation

    Authors: Steve Paul, Nathan Maurer, Souma Chowdhury

    Abstract: Most real-world Multi-Robot Task Allocation (MRTA) problems require fast and efficient decision-making, which is often achieved using heuristics-aided methods such as genetic algorithms, auction-based methods, and bipartite graph matching methods. These methods often assume a form that lends better explainability compared to an end-to-end (learnt) neural network based policy for MRTA. However, der… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: This paper was accepted for presentation in proceedings of IEEE International Conference on Robotics and Automation 2024

  18. arXiv:2403.03307  [pdf, other

    cs.CL

    Book2Dial: Generating Teacher-Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots

    Authors: Junling Wang, Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Mrinmaya Sachan

    Abstract: Educational chatbots are a promising tool for assisting student learning. However, the development of effective chatbots in education has been challenging, as high-quality data is seldom available in this domain. In this paper, we propose a framework for generating synthetic teacher-student interactions grounded in a set of textbooks. Our approaches capture one aspect of learning interactions wher… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 24 pages, 19 tables, 2 figures

  19. arXiv:2403.01789  [pdf, other

    cs.CR eess.SY

    DECOR: Enhancing Logic Locking Against Machine Learning-Based Attacks

    Authors: Yinghua Hu, Kaixin Yang, Subhajit Dutta Chowdhury, Pierluigi Nuzzo

    Abstract: Logic locking (LL) has gained attention as a promising intellectual property protection measure for integrated circuits. However, recent attacks, facilitated by machine learning (ML), have shown the potential to predict the correct key in multiple LL schemes by exploiting the correlation of the correct key value with the circuit structure. This paper presents a generic LL enhancement method based… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 8 pages. Accepted at the International Symposium on Quality Electronic Design (ISQED), 2024

  20. arXiv:2403.00409  [pdf, other

    cs.LG cs.CL

    Provably Robust DPO: Aligning Language Models with Noisy Feedback

    Authors: Sayak Ray Chowdhury, Anush Kini, Nagarajan Natarajan

    Abstract: Learning from preference-based feedback has recently gained traction as a promising approach to align language models with human interests. While these aligned generative models have demonstrated impressive capabilities across various tasks, their dependence on high-quality human preference data poses a bottleneck in practical applications. Specifically, noisy (incorrect and ambiguous) preference… ▽ More

    Submitted 11 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  21. arXiv:2402.10500  [pdf, other

    cs.LG cs.AI cs.CL

    Active Preference Optimization for Sample Efficient RLHF

    Authors: Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is pivotal in aligning Large Language Models (LLMs) with human preferences. Although aligned generative models have shown remarkable abilities in various tasks, their reliance on high-quality human preference data creates a costly bottleneck in the practical application of RLHF. One primary reason is that current methods rely on uniformly picking p… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: New experimental results added. Some reorganization

  22. arXiv:2402.09216  [pdf, other

    cs.CL cs.HC

    AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails

    Authors: Sankalan Pal Chowdhury, Vilém Zouhar, Mrinmaya Sachan

    Abstract: Large Language Models (LLMs) have found several use cases in education, ranging from automatic question generation to essay evaluation. In this paper, we explore the potential of using Large Language Models (LLMs) to author Intelligent Tutoring Systems. A common pitfall of LLMs is their straying from desired pedagogical strategies such as leaking the answer to the student, and in general, providin… ▽ More

    Submitted 25 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: To be presented at Learning@Scale 2024

  23. arXiv:2402.05122  [pdf

    cs.GL cs.CL cs.HC

    History of generative Artificial Intelligence (AI) chatbots: past, present, and future development

    Authors: Md. Al-Amin, Mohammad Shazed Ali, Abdus Salam, Arif Khan, Ashraf Ali, Ahsan Ullah, Md Nur Alam, Shamsul Kabir Chowdhury

    Abstract: This research provides an in-depth comprehensive review of the progress of chatbot technology over time, from the initial basic systems relying on rules to today's advanced conversational bots powered by artificial intelligence. Spanning many decades, the paper explores the major milestones, innovations, and paradigm shifts that have driven the evolution of chatbots. Looking back at the very basic… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  24. arXiv:2402.00090  [pdf

    q-bio.NC cs.HC

    Classification of attention performance post-longitudinal tDCS via functional connectivity and machine learning methods

    Authors: Akash K Rao, Vishnu K Menon, Arnav Bhavsar, Shubhajit Roy Chowdhury, Ramsingh Negi, Varun Dutt

    Abstract: Attention is the brain's mechanism for selectively processing specific stimuli while filtering out irrelevant information. Characterizing changes in attention following long-term interventions (such as transcranial direct current stimulation (tDCS)) has seldom been emphasized in the literature. To classify attention performance post-tDCS, this study uses functional connectivity and machine learnin… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 6 pages, to be presented in the IEEE 9th International Conference for Convergence in Technology (I2CT),Pune, April 2024. arXiv admin note: substantial text overlap with arXiv:2401.17700

  25. arXiv:2401.17711  [pdf

    cs.HC cs.AI

    Prediction of multitasking performance post-longitudinal tDCS via EEG-based functional connectivity and machine learning methods

    Authors: Akash K Rao, Shashank Uttrani, Vishnu K Menon, Darshil Shah, Arnav Bhavsar, Shubhajit Roy Chowdhury, Varun Dutt

    Abstract: Predicting and understanding the changes in cognitive performance, especially after a longitudinal intervention, is a fundamental goal in neuroscience. Longitudinal brain stimulation-based interventions like transcranial direct current stimulation (tDCS) induce short-term changes in the resting membrane potential and influence cognitive processes. However, very little research has been conducted o… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 16 pages, presented at the 30th International Conference on Neural Information Processing (ICONIP2023), Changsha, China, November 2023

  26. arXiv:2401.17497  [pdf, other

    cs.CV

    Towards Visual Syntactical Understanding

    Authors: Sayeed Shafayet Chowdhury, Soumyadeep Chandra, Kaushik Roy

    Abstract: Syntax is usually studied in the realm of linguistics and refers to the arrangement of words in a sentence. Similarly, an image can be considered as a visual 'sentence', with the semantic parts of the image acting as 'words'. While visual syntactic understanding occurs naturally to humans, it is interesting to explore whether deep neural networks (DNNs) are equipped with such reasoning. To that en… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  27. arXiv:2401.14826  [pdf, other

    cs.SD cs.IR eess.AS

    Expressivity-aware Music Performance Retrieval using Mid-level Perceptual Features and Emotion Word Embeddings

    Authors: Shreyan Chowdhury, Gerhard Widmer

    Abstract: This paper explores a specific sub-task of cross-modal music retrieval. We consider the delicate task of retrieving a performance or rendition of a musical piece based on a description of its style, expressive character, or emotion from a set of different performances of the same piece. We observe that a general purpose cross-modal system trained to learn a common text-audio embedding space does n… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Presented at FIRE 2023 (Forum for Information Retrieval Evaluation) conference, Goa, India

  28. arXiv:2401.14098  [pdf, other

    cs.CR

    Carry Your Fault: A Fault Propagation Attack on Side-Channel Protected LWE-based KEM

    Authors: Suparna Kundu, Siddhartha Chowdhury, Sayandeep Saha, Angshuman Karmakar, Debdeep Mukhopadhyay, Ingrid Verbauwhede

    Abstract: Post-quantum cryptographic (PQC) algorithms, especially those based on the learning with errors (LWE) problem, have been subjected to several physical attacks in the recent past. Although the attacks broadly belong to two classes - passive side-channel attacks and active fault attacks, the attack strategies vary significantly due to the inherent complexities of such algorithms. Exploring further a… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    ACM Class: E.3.3

  29. arXiv:2401.08047  [pdf, other

    cs.CL cs.LG

    Incremental Extractive Opinion Summarization Using Cover Trees

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Avinava Dubey, Manzil Zaheer, Andrew McCallum, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Extractive opinion summarization involves automatically producing a summary of text about an entity (e.g., a product's reviews) by extracting representative sentences that capture prevalent opinions in the review set. Typically, in online marketplaces user reviews accumulate over time, and opinion summaries need to be updated periodically to provide customers with up-to-date information. In this w… ▽ More

    Submitted 12 April, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted at TMLR

  30. arXiv:2401.04851  [pdf, other

    cs.MA cs.AI cs.LG

    Graph Learning-based Fleet Scheduling for Urban Air Mobility under Operational Constraints, Varying Demand & Uncertainties

    Authors: Steve Paul, Jhoel Witter, Souma Chowdhury

    Abstract: This paper develops a graph reinforcement learning approach to online planning of the schedule and destinations of electric aircraft that comprise an urban air mobility (UAM) fleet operating across multiple vertiports. This fleet scheduling problem is formulated to consider time-varying demand, constraints related to vertiport capacity, aircraft capacity and airspace safety guidelines, uncertainti… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: This paper is accepted to be presented at the ACM Symposium on Applied Computing 2024

  31. arXiv:2401.02979  [pdf, other

    cs.CL cs.AI cs.IR

    Are we describing the same sound? An analysis of word embedding spaces of expressive piano performance

    Authors: Silvan David Peter, Shreyan Chowdhury, Carlos Eduardo Cancino-Chacón, Gerhard Widmer

    Abstract: Semantic embeddings play a crucial role in natural language-based information retrieval. Embedding models represent words and contexts as vectors whose spatial configuration is derived from the distribution of words in large text corpora. While such representations are generally very powerful, they might fail to account for fine-grained domain-specific nuances. In this article, we investigate this… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Journal ref: Proceedings of the Forum for Information Retrieval Evaluation, FIRE, 2023, Panjim, India

  32. arXiv:2401.01996  [pdf, other

    cs.ET cs.AR cs.LG cs.NE

    Mean-Field Assisted Deep Boltzmann Learning with Probabilistic Computers

    Authors: Shuvro Chowdhury, Shaila Niazi, Kerem Y. Camsari

    Abstract: Despite their appeal as physics-inspired, energy-based and generative nature, general Boltzmann Machines (BM) are considered intractable to train. This belief led to simplified models of BMs with restricted intralayer connections or layer-by-layer training of deep BMs. Recent developments in domain-specific hardware -- specifically probabilistic computers (p-computer) with probabilistic bits (p-bi… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Journal ref: Workshop on Machine Learning with New Compute Paradigms at NeurIPS 2023(MLNCP 2023)

  33. arXiv:2312.08748  [pdf, other

    cs.DC cs.ET cs.NE quant-ph

    All-to-all reconfigurability with sparse and higher-order Ising machines

    Authors: Srijan Nikhar, Sidharth Kannan, Navid Anjum Aadit, Shuvro Chowdhury, Kerem Y. Camsari

    Abstract: Domain-specific hardware to solve computationally hard optimization problems has generated tremendous excitement recently. Here, we evaluate probabilistic bit (p-bit) based on Ising Machines (IM) or p-computers with a benchmark combinatorial optimization problem, namely the 3-regular 3-XOR Satisfiability (3R3X). The 3R3X problem has a glassy energy landscape, and it has recently been used to bench… ▽ More

    Submitted 21 May, 2024; v1 submitted 21 November, 2023; originally announced December 2023.

    Comments: First three authors are equally contributing

  34. arXiv:2312.06568  [pdf, other

    cs.LG cs.AI cs.CR

    Sparse but Strong: Crafting Adversarially Robust Graph Lottery Tickets

    Authors: Subhajit Dutta Chowdhury, Zhiyu Ni, Qingyuan Peng, Souvik Kundu, Pierluigi Nuzzo

    Abstract: Graph Lottery Tickets (GLTs), comprising a sparse adjacency matrix and a sparse graph neural network (GNN), can significantly reduce the inference latency and compute footprint compared to their dense counterparts. Despite these benefits, their performance against adversarial structure perturbations remains to be fully explored. In this work, we first investigate the resilience of GLTs against dif… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted at NeurIPS 2023 GLFrontiers Workshop

  35. arXiv:2312.01564  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    APoLLo: Unified Adapter and Prompt Learning for Vision Language Models

    Authors: Sanjoy Chowdhury, Sayan Nag, Dinesh Manocha

    Abstract: The choice of input text prompt plays a critical role in the performance of Vision-Language Pretrained (VLP) models such as CLIP. We present APoLLo, a unified multi-modal approach that combines Adapter and Prompt learning for Vision-Language models. Our method is designed to substantially improve the generalization capabilities of VLP models when they are fine-tuned in a few-shot setting. We intro… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted at EMNLP 2023 (Main track)

  36. arXiv:2312.00597  [pdf

    cs.RO cs.AI cs.CV

    UAVs and Birds: Enhancing Short-Range Navigation through Budgerigar Flight Studies

    Authors: Md. Mahmudur Rahman, Sajid Islam, Showren Chowdhury, Sadia Jahan Zeba, Debajyoti Karmaker

    Abstract: This study delves into the flight behaviors of Budgerigars (Melopsittacus undulatus) to gain insights into their flight trajectories and movements. Using 3D reconstruction from stereo video camera recordings, we closely examine the velocity and acceleration patterns during three flight motion takeoff, flying and landing. The findings not only contribute to our understanding of bird behaviors but a… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 26 pages, 10 figures

  37. arXiv:2312.00194  [pdf

    cs.LG cs.CL

    Robust Concept Erasure via Kernelized Rate-Distortion Maximization

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Distributed representations provide a vector space that captures meaningful relationships between data instances. The distributed nature of these representations, however, entangles together multiple attributes or concepts of data instances (e.g., the topic or sentiment of a text, characteristics of the author (age, gender, etc), etc). Recent work has proposed the task of concept erasure, in which… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  38. arXiv:2311.10731  [pdf

    cs.LG physics.med-ph physics.soc-ph

    Gender-Based Comparative Study of Type 2 Diabetes Risk Factors in Kolkata, India: A Machine Learning Approach

    Authors: Rahul Jain, Anoushka Saha, Gourav Daga, Durba Bhattacharya, Madhura Das Gupta, Sourav Chowdhury, Suparna Roychowdhury

    Abstract: Type 2 diabetes mellitus represents a prevalent and widespread global health concern, necessitating a comprehensive assessment of its risk factors. This study aimed towards learning whether there is any differential impact of age, Lifestyle, BMI and Waist to height ratio on the risk of Type 2 diabetes mellitus in males and females in Kolkata, West Bengal, India based on a sample observed from the… ▽ More

    Submitted 14 October, 2023; originally announced November 2023.

    Comments: 10 pages, 7 tables,3 figures, submitted to a conference

  39. arXiv:2311.10727  [pdf

    cs.CY

    Unveiling the Potential of Big Data Analytics for Transforming Higher Education in Bangladesh; Needs, Prospects, and Challenges

    Authors: Sabbir Ahmed Chowdhury, Md Aminul Islam, Mostafa Azad Kamal

    Abstract: Big Data Analytics has gained tremendous momentum in many sectors worldwide. Big Data has substantial influence in the field of Learning Analytics that may allow academic institutions to better understand the learners needs and proactively address them. Hence, it is essential to understand Big Data and its application. With the capability of Big Data to find a broad understanding of the scientific… ▽ More

    Submitted 24 November, 2023; v1 submitted 10 October, 2023; originally announced November 2023.

  40. arXiv:2311.03196  [pdf, other

    cs.CL cs.AI

    Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition

    Authors: Rabindra Nath Nandi, Mehadi Hasan Menon, Tareq Al Muntasir, Sagor Sarker, Quazi Sarwar Muhtaseem, Md. Tariqul Islam, Shammur Absar Chowdhury, Firoj Alam

    Abstract: One of the major challenges for develo** automatic speech recognition (ASR) for low-resource languages is the limited access to labeled data with domain-specific variations. In this study, we propose a pseudo-labeling approach to develop a large-scale domain-agnostic ASR dataset. With the proposed methodology, we developed a 20k+ hours labeled Bangla speech dataset covering diverse topics, speak… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at BLP-2023 (at EMNLP 2023), ASR, low-resource, out-of-distribution, domain-agnostic

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

  41. arXiv:2311.03078  [pdf

    cs.CL

    BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer

    Authors: Sadia Afrin, Md. Shahad Mahmud Chowdhury, Md. Ekramul Islam, Faisal Ahamed Khan, Labib Imam Chowdhury, MD. Motahar Mahtab, Nazifa Nuha Chowdhury, Massud Forkan, Neelima Kundu, Hakim Arif, Mohammad Mamun Or Rashid, Mohammad Ruhul Amin, Nabeel Mohammed

    Abstract: Lemmatization holds significance in both natural language processing (NLP) and linguistics, as it effectively decreases data density and aids in comprehending contextual meaning. However, due to the highly inflected nature and morphological richness, lemmatization in Bangla text poses a complex challenge. In this study, we propose linguistic rules for lemmatization and utilize a dictionary along w… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  42. arXiv:2310.20158  [pdf, other

    cs.CL

    GAR-meets-RAG Paradigm for Zero-Shot Information Retrieval

    Authors: Daman Arora, Anush Kini, Sayak Ray Chowdhury, Nagarajan Natarajan, Gaurav Sinha, Amit Sharma

    Abstract: Given a query and a document corpus, the information retrieval (IR) task is to output a ranked list of relevant documents. Combining large language models (LLMs) with embedding-based retrieval models, recent work shows promising results on the zero-shot retrieval problem, i.e., no access to labeled data from the target domain. Two such popular paradigms are generation-augmented retrieval or GAR (g… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: preprint

  43. arXiv:2310.19733  [pdf, other

    cs.LG cs.CR

    Differentially Private Reward Estimation with Preference Feedback

    Authors: Sayak Ray Chowdhury, Xingyu Zhou, Nagarajan Natarajan

    Abstract: Learning from preference-based feedback has recently gained considerable traction as a promising approach to align generative models with human interests. Instead of relying on numerical rewards, the generative models are trained using reinforcement learning with human feedback (RLHF). These approaches first solicit feedback from human labelers typically in the form of pairwise comparisons between… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  44. arXiv:2310.13974  [pdf, other

    cs.CL cs.SD eess.AS

    Automatic Pronunciation Assessment -- A Review

    Authors: Yassine El Kheir, Ahmed Ali, Shammur Absar Chowdhury

    Abstract: Pronunciation assessment and its application in computer-aided pronunciation training (CAPT) have seen impressive progress in recent years. With the rapid growth in language processing and deep learning over the past few years, there is a need for an updated review. In this paper, we review methods employed in pronunciation assessment for both phonemic and prosodic. We categorize the main challeng… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 9 pages, accepted to EMNLP Findings

  45. arXiv:2310.11401  [pdf, other

    cs.LG

    Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami, Rahul Kidambi, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Fairness, especially group fairness, is an important consideration in the context of machine learning systems. The most commonly adopted group fairness-enhancing techniques are in-processing methods that rely on a mixture of a fairness objective (e.g., demographic parity) and a task-specific objective (e.g., cross-entropy) during the training process. However, when data arrives in an online fashio… ▽ More

    Submitted 27 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 (Spotlight)

  46. Machine Learning Quantum Systems with Magnetic p-bits

    Authors: Shuvro Chowdhury, Kerem Y. Camsari

    Abstract: The slowing down of Moore's Law has led to a crisis as the computing workloads of Artificial Intelligence (AI) algorithms continue skyrocketing. There is an urgent need for scalable and energy-efficient hardware catering to the unique requirements of AI algorithms and applications. In this environment, probabilistic computing with p-bits emerged as a scalable, domain-specific, and energy-efficient… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Journal ref: 2023 IEEE International Magnetic Conference - Short Papers (INTERMAG Short Papers), Sendai, Japan, 2023, pp. 1-2

  47. arXiv:2310.02572  [pdf, other

    cs.LG

    Improving Knowledge Distillation with Teacher's Explanation

    Authors: Sayantan Chowdhury, Ben Liang, Ali Tizghadam, Ilijc Albanese

    Abstract: Knowledge distillation (KD) improves the performance of a low-complexity student model with the help of a more powerful teacher. The teacher in KD is a black-box model, imparting knowledge to the student only through its predictions. This limits the amount of transferred knowledge. In this work, we introduce a novel Knowledge Explaining Distillation (KED) framework, which allows the student to lea… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  48. arXiv:2309.15674  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Speech collage: code-switched audio generation by collaging monolingual corpora

    Authors: Amir Hussein, Dorsa Zeinali, Ondřej Klejch, Matthew Wiesner, Brian Yan, Shammur Chowdhury, Ahmed Ali, Shinji Watanabe, Sanjeev Khudanpur

    Abstract: Designing effective automatic speech recognition (ASR) systems for Code-Switching (CS) often depends on the availability of the transcribed CS resources. To address data scarcity, this paper introduces Speech Collage, a method that synthesizes CS data from monolingual corpora by splicing audio segments. We further improve the smoothness quality of audio generation using an overlap-add approach. We… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  49. arXiv:2309.10016  [pdf

    cs.LG cs.AI

    Evaluation of GPT-3 for Anti-Cancer Drug Sensitivity Prediction

    Authors: Shaika Chowdhury, Sivaraman Rajaganapathy, Lichao Sun, James Cerhan, Nansu Zong

    Abstract: In this study, we investigated the potential of GPT-3 for the anti-cancer drug sensitivity prediction task using structured pharmacogenomics data across five tissue types and evaluated its performance with zero-shot prompting and fine-tuning paradigms. The drug's smile representation and cell line's genomic mutation features were predictive of the drug response. The results from this study have th… ▽ More

    Submitted 23 January, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: AMIA Informatics Summit 2024

  50. arXiv:2309.07739  [pdf, other

    cs.CL cs.SD eess.AS

    The complementary roles of non-verbal cues for Robust Pronunciation Assessment

    Authors: Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

    Abstract: Research on pronunciation assessment systems focuses on utilizing phonetic and phonological aspects of non-native (L2) speech, often neglecting the rich layer of information hidden within the non-verbal cues. In this study, we proposed a novel pronunciation assessment framework, IntraVerbalPA. % The framework innovatively incorporates both fine-grained frame- and abstract utterance-level non-verba… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 5 pages, submitted to ICASSP 2024