Skip to main content

Showing 1–50 of 196 results for author: Sinha, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13302  [pdf, other

    cs.CV

    Situational Instructions Database: Task Guidance in Dynamic Environments

    Authors: Muhammad Saif Ullah Khan, Sankalp Sinha, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: The Situational Instructions Database (SID) addresses the need for enhanced situational awareness in artificial intelligence (AI) systems operating in dynamic environments. By integrating detailed scene graphs with dynamically generated, task-specific instructions, SID provides a novel dataset that allows AI systems to perform complex, real-world tasks with improved context sensitivity and operati… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 9 pages, 6 figures

  2. arXiv:2406.10764  [pdf, other

    cs.CL

    GNOME: Generating Negotiations through Open-Domain Map** of Exchanges

    Authors: Darshan Deshpande, Shambhavi Sinha, Anirudh Ravi Kumar, Debaditya Pal, Jonathan May

    Abstract: Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes exi… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2406.10247  [pdf, other

    cs.CL cs.AI

    QCQA: Quality and Capacity-aware grouped Query Attention

    Authors: Vinay Joshi, Prashant Laddha, Shambhavi Sinha, Om Ji Omer, Sreenivas Subramoney

    Abstract: Excessive memory requirements of key and value features (KV-cache) present significant challenges in the autoregressive inference of large language models (LLMs), restricting both the speed and length of text generation. Approaches such as Multi-Query Attention (MQA) and Grouped Query Attention (GQA) mitigate these challenges by grou** query heads and consequently reducing the number of correspo… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  4. arXiv:2406.08787  [pdf, ps, other

    cs.AI

    A Survey on Compositional Learning of AI Models: Theoretical and Experimetnal Practices

    Authors: Sania Sinha, Tanawan Premsri, Parisa Kordjamshidi

    Abstract: Compositional learning, mastering the ability to combine basic concepts and construct more intricate ones, is crucial for human cognition, especially in human language comprehension and visual perception. This notion is tightly connected to generalization over unobserved situations. Despite its integral role in intelligence, there is a lack of systematic theoretical and experimental research metho… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2405.19653  [pdf, other

    cs.LG cs.CL eess.SY

    SysCaps: Language Interfaces for Simulation Surrogates of Complex Systems

    Authors: Patrick Emami, Zhaonan Li, Saumya Sinha, Truc Nguyen

    Abstract: Data-driven simulation surrogates help computational scientists study complex systems. They can also help inform impactful policy decisions. We introduce a learning framework for surrogate modeling where language is used to interface with the underlying system being simulated. We call a language description of a system a "system caption", or SysCap. To address the lack of datasets of paired natura… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages. Under review

  6. arXiv:2405.11446  [pdf, other

    cs.CL cs.LG

    MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning

    Authors: Sanchit Sinha, Yuguang Yue, Victor Soto, Mayank Kulkarni, Jianhua Lu, Aidong Zhang

    Abstract: Adapting large language models (LLMs) to unseen tasks with in-context training samples without fine-tuning remains an important research problem. To learn a robust LLM that adapts well to unseen tasks, multiple meta-training approaches have been proposed such as MetaICL and MetaICT, which involve meta-training pre-trained LLMs on a wide variety of diverse tasks. These meta-training approaches esse… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: KDD 2024, 11 pages(9 main, 2 ref, 1 App) Openreview https://openreview.net/forum?id=JwecLNhWDy&referrer=%5BAuthor%20Console%5D(%2Fgroup%3Fid%3DKDD.org%2F2024%2FResearch_Track%2FAuthors%23your-submissions)

  7. arXiv:2405.03660  [pdf, other

    cs.CV

    CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification

    Authors: Sankalp Sinha, Muhammad Saif Ullah Khan, Talha Uddin Sheikh, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: Zero-shot learning has been extensively investigated in the broader field of visual recognition, attracting significant interest recently. However, the current work on zero-shot learning in document image classification remains scarce. The existing studies either focus exclusively on zero-shot inference, or their evaluation does not align with the established criteria of zero-shot evaluation in th… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 18 Pages, 4 Figures and Accepted in ICDAR 2024

  8. arXiv:2405.00349  [pdf, other

    cs.LG

    A Self-explaining Neural Architecture for Generalizable Concept Learning

    Authors: Sanchit Sinha, Guangzhi Xiong, Aidong Zhang

    Abstract: With the wide proliferation of Deep Neural Networks in high-stake applications, there is a growing demand for explainability behind their decision-making process. Concept learning models attempt to learn high-level 'concepts' - abstract entities that align with human understanding, and thus provide interpretability to DNN architectures. However, in this paper, we demonstrate that present SOTA conc… ▽ More

    Submitted 5 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024. 16 pages (7 main content, 2 references, 7 Appendix) Code available at https://github.com/sanchit97/secl

  9. arXiv:2404.06405  [pdf, other

    cs.AI cs.CG cs.CL cs.LG

    Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry

    Authors: Shiven Sinha, Ameya Prabhu, Ponnurangam Kumaraguru, Siddharth Bhat, Matthias Bethge

    Abstract: Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered a notable milestone in human-level automated reasoning. The introduction of AlphaGeometry, a neuro-symbolic model trained with 100 million synthetic samples, marked a major breakthrough. It solved 2… ▽ More

    Submitted 11 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Work in Progress. Released for wider feedback

  10. arXiv:2403.18074  [pdf, other

    cs.CV eess.IV

    Every Shot Counts: Using Exemplars for Repetition Counting in Videos

    Authors: Saptarshi Sinha, Alexandros Stergiou, Dima Damen

    Abstract: Video repetition counting infers the number of repetitions of recurring actions or motion within a video. We propose an exemplar-based approach that discovers visual correspondence of video exemplars across repetitions within target videos. Our proposed Every Shot Counts (ESCounts) model is an attention-based encoder-decoder that encodes videos of varying lengths alongside exemplars from the same… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project website: https://sinhasaptarshi.github.io/escounts

  11. arXiv:2402.15589  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Prompting LLMs to Compose Meta-Review Drafts from Peer-Review Narratives of Scholarly Manuscripts

    Authors: Shubhra Kanti Karmaker Santu, Sanjeev Kumar Sinha, Naman Bansal, Alex Knipper, Souvika Sarkar, John Salvador, Yash Mahajan, Sri Guttikonda, Mousumi Akter, Matthew Freestone, Matthew C. Williams Jr

    Abstract: One of the most important yet onerous tasks in the academic peer-reviewing process is composing meta-reviews, which involves understanding the core contributions, strengths, and weaknesses of a scholarly manuscript based on peer-review narratives from multiple experts and then summarizing those multiple experts' perspectives into a concise holistic overview. Given the latest major developments in… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    ACM Class: I.2.7

  12. arXiv:2402.15037  [pdf, other

    cs.GT econ.GN

    Analyzing Games in Maker Protocol Part One: A Multi-Agent Influence Diagram Approach Towards Coordination

    Authors: Abhimanyu Nag, Samrat Gupta, Sudipan Sinha, Arka Datta

    Abstract: Decentralized Finance (DeFi) ecosystems, exemplified by the Maker Protocol, rely on intricate games to maintain stability and security. Understanding the dynamics of these games is crucial for ensuring the robustness of the system. This motivating research proposes a novel methodology leveraging Multi-Agent Influence Diagrams (MAID), originally proposed by Koller and Milch, to dissect and analyze… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  13. arXiv:2402.12629  [pdf, other

    cs.MM cs.CY cs.SI

    Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale

    Authors: Anmol Agarwal, Pratyush Priyadarshi, Shiven Sinha, Shrey Gupta, Hitkul Jangra, Kiran Garimella, Ponnurangam Kumaraguru

    Abstract: In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimedia essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scal… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  14. arXiv:2402.08823  [pdf, other

    cs.CV cs.LG

    RanDumb: A Simple Approach that Questions the Efficacy of Continual Representation Learning

    Authors: Ameya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip H. S. Torr, Ozan Sener, Puneet K. Dokania

    Abstract: We propose RanDumb to examine the efficacy of continual representation learning. RanDumb embeds raw pixels using a fixed random transform which approximates an RBF-Kernel, initialized before seeing any data, and learns a simple linear classifier on top. We present a surprising and consistent finding: RanDumb significantly outperforms the continually learned representations using deep networks acro… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Tech Report

  15. arXiv:2402.05466  [pdf, other

    cs.HC

    Engineering End-to-End Remote Labs using IoT-based Retrofitting

    Authors: K. S. Viswanadh, Akshit Gureja, Nagesh Walchatwar, Rishabh Agrawal, Shiven Sinha, Sachin Chaudhari, Karthik Vaidhyanathan, Venkatesh Choppella, Prabhakar Bhimalapuram, Harikumar Kandath, Aftab Hussain

    Abstract: Remote labs are a groundbreaking development in the education industry, providing students with access to laboratory education anytime, anywhere. However, most remote labs are costly and difficult to scale, especially in develo** countries. With this as a motivation, this paper proposes a new remote labs (RLabs) solution that includes two use case experiments: Vanishing Rod and Focal Length. The… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 30 pages, 7 tables and 20 figures. Submitted to ACM Transactions on IoT

  16. arXiv:2402.04466  [pdf, other

    cs.SE cs.AI cs.LG cs.OS

    Towards Deterministic End-to-end Latency for Medical AI Systems in NVIDIA Holoscan

    Authors: Soham Sinha, Shekhar Dwivedi, Mahdi Azizian

    Abstract: The introduction of AI and ML technologies into medical devices has revolutionized healthcare diagnostics and treatments. Medical device manufacturers are keen to maximize the advantages afforded by AI and ML by consolidating multiple applications onto a single platform. However, concurrent execution of several AI applications, each with its own visualization components, leads to unpredictable end… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    ACM Class: C.3; J.7; D.2.11; D.2.10; D.4.8

  17. arXiv:2402.01980  [pdf, other

    cs.CL

    SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

    Authors: Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Andrew Schwartz

    Abstract: Social science NLP tasks, such as emotion or humor detection, are required to capture the semantics along with the implicit pragmatics from text, often with limited amounts of training data. Instruction tuning has been shown to improve the many capabilities of large language models (LLMs) such as commonsense reasoning, reading comprehension, and computer programming. However, little is known about… ▽ More

    Submitted 14 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Short paper accepted to EACL 2024. 4 pgs, 2 tables

  18. arXiv:2401.18083  [pdf, other

    cs.CV cs.RO

    Improved Scene Landmark Detection for Camera Localization

    Authors: Tien Do, Sudipta N. Sinha

    Abstract: Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-spe… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: To be presented at 3DV 2024

  19. arXiv:2401.01596  [pdf, other

    cs.AI cs.CL

    MedSumm: A Multimodal Approach to Summarizing Code-Mixed Hindi-English Clinical Queries

    Authors: Akash Ghosh, Arkadeep Acharya, Prince Jha, Aniket Gaudgaul, Rajdeep Majumdar, Sriparna Saha, Aman Chadha, Raghav Jain, Setu Sinha, Shivani Agarwal

    Abstract: In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical quest… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: ECIR 2024

  20. arXiv:2312.11541  [pdf, other

    cs.AI cs.CL

    CLIPSyntel: CLIP and LLM Synergy for Multimodal Question Summarization in Healthcare

    Authors: Akash Ghosh, Arkadeep Acharya, Raghav Jain, Sriparna Saha, Aman Chadha, Setu Sinha

    Abstract: In the era of modern healthcare, swiftly generating medical question summaries is crucial for informed and timely patient care. Despite the increasing complexity and volume of medical data, existing studies have focused solely on text-based summarization, neglecting the integration of visual information. Recognizing the untapped potential of combining textual queries with visual representations of… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  21. arXiv:2312.00894  [pdf, other

    cs.SE

    Leveraging Large Language Models to Improve REST API Testing

    Authors: Myeongsoo Kim, Tyler Stennett, Dhruv Shah, Saurabh Sinha, Alessandro Orso

    Abstract: The widespread adoption of REST APIs, coupled with their growing complexity and size, has led to the need for automated REST API testing tools. Current tools focus on the structured data in REST API specifications but often neglect valuable insights available in unstructured natural-language descriptions in the specifications, which leads to suboptimal test coverage. Recently, to address this gap,… ▽ More

    Submitted 29 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: To be published in the 46th IEEE/ACM International Conference on Software Engineering - New Ideas and Emerging Results Track (ICSE-NIER 2024)

  22. arXiv:2311.18820  [pdf, other

    cs.IT cs.NI eess.SP

    Adversarial Attacks and Defenses for Wireless Signal Classifiers using CDI-aware GANs

    Authors: Sujata Sinha, Alkan Soysal

    Abstract: We introduce a Channel Distribution Information (CDI)-aware Generative Adversarial Network (GAN), designed to address the unique challenges of adversarial attacks in wireless communication systems. The generator in this CDI-aware GAN maps random input noise to the feature space, generating perturbations intended to deceive a target modulation classifier. Its discriminators play a dual role: one en… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  23. arXiv:2310.04540  [pdf, other

    cs.LG

    Multi-decadal Sea Level Prediction using Neural Networks and Spectral Clustering on Climate Model Large Ensembles and Satellite Altimeter Data

    Authors: Saumya Sinha, John Fasullo, R. Steven Nerem, Claire Monteleoni

    Abstract: Sea surface height observations provided by satellite altimetry since 1993 show a rising rate (3.4 mm/year) for global mean sea level. While on average, sea level has risen 10 cm over the last 30 years, there is considerable regional variation in the sea level change. Through this work, we predict sea level trends 30 years into the future at a 2-degree spatial resolution and investigate the future… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  24. arXiv:2309.15782  [pdf

    cs.CV

    Joint-YODNet: A Light-weight Object Detector for UAVs to Achieve Above 100fps

    Authors: Vipin Gautam, Shitala Prasad, Sharad Sinha

    Abstract: Small object detection via UAV (Unmanned Aerial Vehicle) images captured from drones and radar is a complex task with several formidable challenges. This domain encompasses numerous complexities that impede the accurate detection and localization of small objects. To address these challenges, we propose a novel method called JointYODNet for UAVs to detect small objects, leveraging a joint loss fun… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  25. arXiv:2309.15780  [pdf

    cs.CV

    AaP-ReID: Improved Attention-Aware Person Re-identification

    Authors: Vipin Gautam, Shitala Prasad, Sharad Sinha

    Abstract: Person re-identification (ReID) is a well-known problem in the field of computer vision. The primary objective is to identify a specific individual within a gallery of images. However, this task is challenging due to various factors, such as pose variations, illumination changes, obstructions, and the presence ofconfusing backgrounds. Existing ReID methods often fail to capture discriminative feat… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  26. arXiv:2309.13387  [pdf

    cs.CV

    YOLORe-IDNet: An Efficient Multi-Camera System for Person-Tracking

    Authors: Vipin Gautam, Shitala Prasad, Sharad Sinha

    Abstract: The growing need for video surveillance in public spaces has created a demand for systems that can track individuals across multiple cameras feeds in real-time. While existing tracking systems have achieved impressive performance using deep learning models, they often rely on pre-existing images of suspects or historical data. However, this is not always feasible in cases where suspicious individu… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  27. arXiv:2309.04583  [pdf, other

    cs.SE

    Adaptive REST API Testing with Reinforcement Learning

    Authors: Myeongsoo Kim, Saurabh Sinha, Alessandro Orso

    Abstract: Modern web services increasingly rely on REST APIs. Effectively testing these APIs is challenging due to the vast search space to be explored, which involves selecting API operations for sequence creation, choosing parameters for each operation from a potentially large set of parameters, and sampling values from the virtually infinite parameter input space. Current testing tools lack efficient exp… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: To be published in the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  28. arXiv:2309.01086  [pdf, other

    cs.CV

    MILA: Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection

    Authors: Onkar Krishna, Hiroki Ohashi, Saptarshi Sinha

    Abstract: Cross-domain object detection is challenging, and it involves aligning labeled source and unlabeled target domains. Previous approaches have used adversarial training to align features at both image-level and instance-level. At the instance level, finding a suitable source sample that aligns with a target sample is crucial. A source sample is considered suitable if it differs from the target sampl… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  29. Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code

    Authors: Rangeet Pan, Ali Reza Ibrahimzada, Rahul Krishna, Divya Sankar, Lambert Pouguem Wassi, Michele Merler, Boris Sobolev, Raju Pavuluri, Saurabh Sinha, Reyhaneh Jabbarvand

    Abstract: Code translation aims to convert source code from one programming language (PL) to another. Given the promising abilities of large language models (LLMs) in code synthesis, researchers are exploring their potential to automate code translation. The prerequisite for advancing the state of LLM-based code translation is to understand their promises and limitations over existing techniques. To that en… ▽ More

    Submitted 16 January, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: Published in ICSE 2024

  30. arXiv:2308.02460  [pdf, other

    physics.ao-ph cs.LG

    Sea level Projections with Machine Learning using Altimetry and Climate Model ensembles

    Authors: Saumya Sinha, John Fasullo, R. Steven Nerem, Claire Monteleoni

    Abstract: Satellite altimeter observations retrieved since 1993 show that the global mean sea level is rising at an unprecedented rate (3.4mm/year). With almost three decades of observations, we can now investigate the contributions of anthropogenic climate-change signals such as greenhouse gases, aerosols, and biomass burning in this rising sea level. We use machine learning (ML) to investigate future patt… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  31. arXiv:2307.00453  [pdf, other

    cs.CL cs.SD eess.AS

    Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters

    Authors: Anshu Bhatia, Sanchit Sinha, Saket Dingliwal, Karthik Gopalakrishnan, Sravan Bodapati, Katrin Kirchhoff

    Abstract: Speech representations learned in a self-supervised fashion from massive unlabeled speech corpora have been adapted successfully toward several downstream tasks. However, such representations may be skewed toward canonical data characteristics of such corpora and perform poorly on atypical, non-native accented speaker populations. With the state-of-the-art HuBERT model as a baseline, we propose an… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

  32. arXiv:2306.16722  [pdf, other

    cs.CL cs.AI

    Evaluating Paraphrastic Robustness in Textual Entailment Models

    Authors: Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha, Benjamin Van Durme, Adam Poliak

    Abstract: We present PaRTE, a collection of 1,126 pairs of Recognizing Textual Entailment (RTE) examples to evaluate whether models are robust to paraphrasing. We posit that if RTE models understand language, their predictions should be consistent across inputs that share the same meaning. We use the evaluation set to determine if RTE models' predictions change when examples are paraphrased. In our experime… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  33. Enhance Diffusion to Improve Robust Generalization

    Authors: Jianhui Sun, Sanchit Sinha, Aidong Zhang

    Abstract: Deep neural networks are susceptible to human imperceptible adversarial perturbations. One of the strongest defense mechanisms is \emph{Adversarial Training} (AT). In this paper, we aim to address two predominant problems in AT. First, there is still little consensus on how to set hyperparameters with a performance guarantee for AT research, and customized settings impede a fair comparison between… ▽ More

    Submitted 17 August, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted at KDD 2023

  34. arXiv:2306.00952  [pdf, other

    eess.AS cs.LG cs.SD

    Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition

    Authors: Ashutosh Chaubey, Sparsh Sinha, Susmita Ghose

    Abstract: Speaker identification systems are deployed in diverse environments, often different from the lab conditions on which they are trained and tested. In this paper, first, we show the problem of generalization using fixed thresholds (computed using EER metric) for imposter identification in unseen speaker recognition and then introduce a robust speaker-specific thresholding technique for better perfo… ▽ More

    Submitted 30 September, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  35. arXiv:2305.14692  [pdf, other

    cs.SE

    Carving UI Tests to Generate API Tests and API Specification

    Authors: Rahulkrishna Yandrapally, Saurabh Sinha, Rachel Tzoref-Brill, Ali Mesbah

    Abstract: Modern web applications make extensive use of API calls to update the UI state in response to user events or server-side changes. For such applications, API-level testing can play an important role, in-between unit-level testing and UI-level (or end-to-end) testing. Existing API testing tools require API specifications (e.g., OpenAPI), which often may not be available or, when available, be incons… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    ACM Class: D.2.5

  36. arXiv:2305.13650  [pdf, other

    cs.LG cs.AI

    Robust Model-Based Optimization for Challenging Fitness Landscapes

    Authors: Saba Ghaffari, Ehsan Saleh, Alexander G. Schwing, Yu-Xiong Wang, Martin D. Burke, Saurabh Sinha

    Abstract: Protein design, a grand challenge of the day, involves optimization on a fitness landscape, and leading methods adopt a model-based approach where a model is trained on a training set (protein sequences and fitness) and proposes candidates to explore next. These methods are challenged by sparsity of high-fitness samples in the training set, a problem that has been in the literature. A less recogni… ▽ More

    Submitted 27 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  37. arXiv:2304.12523  [pdf

    cs.LG q-bio.QM stat.ME

    CIMLA: Interpretable AI for inference of differential causal networks

    Authors: Payam Dibaeinia, Saurabh Sinha

    Abstract: The discovery of causal relationships from high-dimensional data is a major open problem in bioinformatics. Machine learning and feature attribution models have shown great promise in this context but lack causal interpretation. Here, we show that a popular feature attribution model estimates a causal quantity reflecting the influence of one variable on another, under certain assumptions. We lever… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  38. arXiv:2304.11262  [pdf, other

    cs.RO

    Stochastic MPC Based Attacks on Object Tracking in Autonomous Driving Systems

    Authors: Sourav Sinha, Mazen Farhood

    Abstract: Decision making in advanced driver assistance systems involves in general the estimated trajectories of the surrounding objects. Multiple object tracking refers to the process of estimating in real time these trajectories, leveraging for this purpose sensors to detect the objects. This paper deals with devising attacks on object tracking in automated vehicles. The vehicle is assumed to have a dete… ▽ More

    Submitted 1 May, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: Accepted to IFAC World Congress 2023

  39. arXiv:2304.02925  [pdf

    eess.IV cs.CV

    Computer-aided Diagnosis of Malaria through Transfer Learning using the ResNet50 Backbone

    Authors: Sanya Sinha, Nilay Gupta

    Abstract: According to the World Malaria Report of 2022, 247 million cases of malaria and 619,000 related deaths were reported in 2021. This highlights the predominance of the disease, especially in the tropical and sub-tropical regions of Africa, parts of South-east Asia, Central and Southern America. Malaria is caused due to the Plasmodium parasite which is circulated through the bites of the female Anoph… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    ACM Class: I.4.9

  40. arXiv:2304.01143  [pdf, other

    cs.CV

    Use Your Head: Improving Long-Tail Video Recognition

    Authors: Toby Perrett, Saptarshi Sinha, Tilo Burghardt, Majid Mirmehdi, Dima Damen

    Abstract: This paper presents an investigation into long-tail video recognition. We demonstrate that, unlike naturally-collected video datasets and existing long-tail image benchmarks, current video benchmarks fall short on multiple long-tailed properties. Most critically, they lack few-shot classes in their tails. In response, we propose new video benchmarks that better assess long-tail recognition, by sam… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  41. arXiv:2303.14772  [pdf, other

    cs.CV

    $Δ$-Patching: A Framework for Rapid Adaptation of Pre-trained Convolutional Networks without Base Performance Loss

    Authors: Chaitanya Devaguptapu, Samarth Sinha, K J Joseph, Vineeth N Balasubramanian, Animesh Garg

    Abstract: Models pre-trained on large-scale datasets are often fine-tuned to support newer tasks and datasets that arrive over time. This process necessitates storing copies of the model over time for each task that the pre-trained model is fine-tuned to. Building on top of recent model patching work, we propose $Δ$-Patching for fine-tuning neural network models in an efficient manner, without the need to s… ▽ More

    Submitted 21 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  42. arXiv:2212.14405  [pdf, other

    cs.LG

    Offline Policy Optimization in RL with Variance Regularizaton

    Authors: Riashat Islam, Samarth Sinha, Homanga Bharadhwaj, Samin Yeasar Arnob, Zhuoran Yang, Animesh Garg, Zhaoran Wang, Lihong Li, Doina Precup

    Abstract: Learning policies from fixed offline datasets is a key challenge to scale up reinforcement learning (RL) algorithms towards practical applications. This is often because off-policy RL algorithms suffer from distributional shift, due to mismatch between dataset and the target policy, leading to high variance and over-estimation of value functions. In this work, we propose variance regularization fo… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: Old Draft, Offline RL Workshop, NeurIPS'20;

  43. FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

    Authors: Linfeng Du, Tingyuan Liang, Sharad Sinha, Zhiyao Xie, Wei Zhang

    Abstract: Multi-die FPGAs are widely adopted to deploy large hardware accelerators. Two factors impede the performance optimization of HLS designs implemented on multi-die FPGAs. On the one hand, the long net delay due to nets crossing die-boundaries results in an NP-hard problem to properly floorplan and pipeline an application. On the other hand, traditional automated searching flow for HLS directive opti… ▽ More

    Submitted 5 February, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted as a conference paper at FPGA '23. Open source at: https://github.com/RipperJ/FADO

  44. arXiv:2212.05259  [pdf, other

    math.DS cs.LG

    Online Real-time Learning of Dynamical Systems from Noisy Streaming Data: A Koopman Operator Approach

    Authors: S. Sinha, Sai P. Nandanoori, David Barajas-Solano

    Abstract: Recent advancements in sensing and communication facilitate obtaining high-frequency real-time data from various physical systems like power networks, climate systems, biological networks, etc. However, since the data are recorded by physical sensors, it is natural that the obtained data is corrupted by measurement noise. In this paper, we present a novel algorithm for online real-time learning of… ▽ More

    Submitted 24 December, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

  45. arXiv:2211.16991  [pdf, other

    cs.CV

    SparsePose: Sparse-View Camera Pose Regression and Refinement

    Authors: Samarth Sinha, Jason Y. Zhang, Andrea Tagliasacchi, Igor Gilitschenski, David B. Lindell

    Abstract: Camera pose estimation is a key step in standard 3D reconstruction pipelines that operate on a dense set of images of a single object or scene. However, methods for pose estimation often fail when only a few images are available because they rely on the ability to robustly identify and match visual features between image pairs. While these methods can work robustly with dense camera views, capturi… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  46. arXiv:2211.16080  [pdf, other

    cs.LG cs.AI

    Understanding and Enhancing Robustness of Concept-based Models

    Authors: Sanchit Sinha, Mengdi Huai, Jianhui Sun, Aidong Zhang

    Abstract: Rising usage of deep neural networks to perform decision making in critical applications like medical diagnosis and financial analysis have raised concerns regarding their reliability and trustworthiness. As automated systems become more mainstream, it is important their decisions be transparent, reliable and understandable by humans for better trust and confidence. To this effect, concept-based m… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted at AAAI 2023. Extended Version

  47. Classification of Melanocytic Nevus Images using BigTransfer (BiT)

    Authors: Sanya Sinha, Nilay Gupta

    Abstract: Skin cancer is a fatal disease that takes a heavy toll over human lives annually. The colored skin images show a significant degree of resemblance between different skin lesions such as melanoma and nevus, making identification and diagnosis more challenging. Melanocytic nevi may mature to cause fatal melanoma. Therefore, the current management protocol involves the removal of those nevi that appe… ▽ More

    Submitted 6 April, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: 5 pages, 3 figures

    ACM Class: I.4.9

  48. arXiv:2211.10972  [pdf

    cs.CV cs.AI cs.LG

    A Comparative Analysis of Transfer Learning-based Techniques for the Classification of Melanocytic Nevi

    Authors: Sanya Sinha, Nilay Gupta

    Abstract: Skin cancer is a fatal manifestation of cancer. Unrepaired deoxyribo-nucleic acid (DNA) in skin cells, causes genetic defects in the skin and leads to skin cancer. To deal with lethal mortality rates coupled with skyrocketing costs of medical treatment, early diagnosis is mandatory. To tackle these challenges, researchers have developed a variety of rapid detection tools for skin cancer. Lesion-sp… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: 12 pages, 5 figures, submitted to International Conference on Advances and Applications of Artificial Intelligence and Machine Learning (ICAAAIML) 2022, to be published in Springer's Lecture Notes in Electrical Engineering

    ACM Class: I.4.9

  49. arXiv:2211.03889  [pdf, other

    cs.CV

    Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories

    Authors: Samarth Sinha, Roman Shapovalov, Jeremy Reizenstein, Ignacio Rocco, Natalia Neverova, Andrea Vedaldi, David Novotny

    Abstract: Obtaining photorealistic reconstructions of objects from sparse views is inherently ambiguous and can only be achieved by learning suitable reconstruction priors. Earlier works on sparse rigid object reconstruction successfully learned such priors from large datasets such as CO3D. In this paper, we extend this approach to dynamic objects. We use cats and dogs as a representative example and introd… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  50. arXiv:2211.03292  [pdf, ps, other

    cs.DS

    Approximate Trace Reconstruction from a Single Trace

    Authors: Xi Chen, Anindya De, Chin Ho Lee, Rocco A. Servedio, Sandip Sinha

    Abstract: The well-known trace reconstruction problem is the problem of inferring an unknown source string $x \in \{0,1\}^n$ from independent "traces", i.e. copies of $x$ that have been corrupted by a $δ$-deletion channel which independently deletes each bit of $x$ with probability $δ$ and concatenates the surviving bits. The current paper considers the extreme data-limited regime in which only a single tra… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.