Skip to main content

Showing 1–50 of 354 results for author: Garg, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19549  [pdf, other

    cs.CR cs.LG

    ASCENT: Amplifying Power Side-Channel Resilience via Learning & Monte-Carlo Tree Search

    Authors: Jitendra Bhandari, Animesh Basak Chowdhury, Mohammed Nabeel, Ozgur Sinanoglu, Siddharth Garg, Ramesh Karri, Johann Knechtel

    Abstract: Power side-channel (PSC) analysis is pivotal for securing cryptographic hardware. Prior art focused on securing gate-level netlists obtained as-is from chip design automation, neglecting all the complexities and potential side-effects for security arising from the design automation process. That is, automation traditionally prioritizes power, performance, and area (PPA), sidelining security. We pr… ▽ More

    Submitted 1 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at 2024 ACM/IEEE International Conference on Computer-Aided Design

  2. arXiv:2406.17132  [pdf, other

    cs.AR

    LLM-Aided Testbench Generation and Bug Detection for Finite-State Machines

    Authors: Jitendra Bhandari, Johann Knechtel, Ramesh Narayanaswamy, Siddharth Garg, Ramesh Karri

    Abstract: This work investigates the potential of tailoring Large Language Models (LLMs), specifically GPT3.5 and GPT4, for the domain of chip testing. A key aspect of chip design is functional testing, which relies on testbenches to evaluate the functionality and coverage of Register-Transfer Level (RTL) designs. We aim to enhance testbench generation by incorporating feedback from commercial-grade Electro… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.14532  [pdf, other

    cs.LG cs.CL

    RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold

    Authors: Amrith Setlur, Saurabh Garg, Xinyang Geng, Naman Garg, Virginia Smith, Aviral Kumar

    Abstract: Training on model-generated synthetic data is a promising approach for finetuning LLMs, but it remains unclear when it helps or hurts. In this paper, we investigate this question for math reasoning via an empirical study, followed by building a conceptual understanding of our observations. First, we find that while the typical approach of finetuning a model on synthetic correct or positive problem… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.13473  [pdf, other

    cs.CV

    Snowy Scenes,Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions

    Authors: Shivank Garg, Abhishek Baghel, Amit Agarwal, Durga Toshniwal

    Abstract: With the rise of autonomous vehicles and advanced driver-assistance systems (ADAS), ensuring reliable object detection in all weather conditions is crucial for safety and efficiency. Adverse weather like snow, rain, and fog presents major challenges for current detection systems, often resulting in failures and potential safety risks. This paper introduces a novel framework and pipeline designed t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.12592  [pdf, other

    cs.CV

    Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images

    Authors: Shivank Garg, Manyana Tiwari

    Abstract: In this paper, we extend the study of concept ablation within pre-trained models as introduced in 'Ablating Concepts in Text-to-Image Diffusion Models' by (Kumari et al.,2022). Our work focuses on reproducing the results achieved by the different variants of concept ablation proposed and validated through predefined metrics. We also introduce a novel variant of concept ablation, namely 'trademark… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.11794  [pdf, other

    cs.LG cs.CL

    DataComp-LM: In search of the next generation of training sets for language models

    Authors: Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Gadre, Hritik Bansal, Etash Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner , et al. (34 additional authors not shown)

    Abstract: We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effective pretraining recipes based on the OpenLM framework, and a broad suite of 53 downstream evaluations. Participants in the DCLM benchmark can experiment with dat… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Project page: https://www.datacomp.ai/dclm/

  7. arXiv:2406.09233  [pdf, other

    cs.AR

    C2HLSC: Can LLMs Bridge the Software-to-Hardware Design Gap?

    Authors: Luca Collini, Siddharth Garg, Ramesh Karri

    Abstract: High Level Synthesis (HLS) tools offer rapid hardware design from C code, but their compatibility is limited by code constructs. This paper investigates Large Language Models (LLMs) for refactoring C code into HLS-compatible formats. We present several case studies by using an LLM to rewrite C code for NIST 800-22 randomness tests, a QuickSort algorithm and AES-128 into HLS-synthesizable c. The LL… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at The First IEEE International Workshop on LLM-Aided Design

  8. arXiv:2406.05590  [pdf, other

    cs.CR cs.AI cs.CY cs.LG

    NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

    Authors: Minghao Shao, Sofija Jancheska, Meet Udeshi, Brendan Dolan-Gavitt, Haoran Xi, Kimberly Milner, Boyuan Chen, Max Yin, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique

    Abstract: Large Language Models (LLMs) are being deployed across various domains today. However, their capacity to solve Capture the Flag (CTF) challenges in cybersecurity has not been thoroughly evaluated. To address this, we develop a novel method to assess LLMs in solving CTF challenges by creating a scalable, open-source benchmark database specifically designed for these applications. This database incl… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  9. arXiv:2406.03592  [pdf, other

    cs.CL cs.AI

    Measuring Retrieval Complexity in Question Answering Systems

    Authors: Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti

    Abstract: In this paper, we investigate which questions are challenging for retrieval-based Question Answering (QA). We (i) propose retrieval complexity (RC), a novel metric conditioned on the completeness of retrieved documents, which measures the difficulty of answering questions, and (ii) propose an unsupervised pipeline to measure RC given an arbitrary retrieval system. Our proposed pipeline measures RC… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 (findings)

  10. arXiv:2405.15842  [pdf, other

    cs.SE cs.LG

    Model Cascading for Code: Reducing Inference Costs with Model Cascading for LLM Based Code Generation

    Authors: Boyuan Chen, Mingzhi Zhu, Brendan Dolan-Gavitt, Muhammad Shafique, Siddharth Garg

    Abstract: The rapid development of large language models (LLMs) has led to significant advancements in code completion tasks. While larger models have higher accuracy, they also cost much more to run. Meanwhile, model cascading has been proven effective to conserve computational resources while enhancing accuracy in LLMs on natural language generation tasks. It generates output with the smallest model in a… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  11. arXiv:2405.09534  [pdf, other

    cs.IT

    Learning-Based Compress-and-Forward Schemes for the Relay Channel

    Authors: Ezgi Ozyilkan, Fabrizio Carpi, Siddharth Garg, Elza Erkip

    Abstract: The relay channel, consisting of a source-destination pair along with a relay, is a fundamental component of cooperative communications. While the capacity of a general relay channel remains unknown, various relaying strategies, including compress-and-forward (CF), have been proposed. In CF, the relay forwards a quantized version of its received signal to the destination. Given the correlated sign… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: journal submission under review. arXiv admin note: substantial text overlap with arXiv:2404.14594

  12. arXiv:2405.05792  [pdf, other

    cs.RO cs.AI cs.CV cs.HC cs.LG

    RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation

    Authors: Sourav Garg, Krishan Rana, Mehdi Hosseinzadeh, Lachlan Mares, Niko Sünderhauf, Feras Dayoub, Ian Reid

    Abstract: Map** is crucial for spatial reasoning, planning and robot navigation. Existing approaches range from metric, which require precise geometry-based optimization, to purely topological, where image-as-node based graphs lack explicit object-level reasoning and interconnectivity. In this paper, we propose a novel topological representation of an environment based on "image segments", which are seman… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Published at ICRA 2024; 9 pages, 8 figures

  13. arXiv:2405.05343  [pdf, other

    cs.DS cs.LG math.NA

    Distributed Least Squares in Small Space via Sketching and Bias Reduction

    Authors: Sachin Garg, Kevin Tan, Michał Dereziński

    Abstract: Matrix sketching is a powerful tool for reducing the size of large data matrices. Yet there are fundamental limitations to this size reduction when we want to recover an accurate estimator for a task such as least square regression. We show that these limitations can be circumvented in the distributed setting by designing sketching methods that minimize the bias of the estimator, rather than its e… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  14. arXiv:2405.04678  [pdf, other

    cs.NI cs.MA cs.RO

    Pipe Routing with Topology Control for UAV Networks

    Authors: Shreyas Devaraju, Shivam Garg, Alexander Ihler, Sunil Kumar

    Abstract: Routing protocols help in transmitting the sensed data from UAVs monitoring the targets (called target UAVs) to the BS. However, the highly dynamic nature of an autonomous, decentralized UAV network leads to frequent route breaks or traffic disruptions. Traditional routing schemes cannot quickly adapt to dynamic UAV networks and/or incur large control overhead and delays. To establish stable, high… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  15. arXiv:2405.02326  [pdf, other

    cs.AR cs.AI cs.CL cs.LG cs.PL

    Evaluating LLMs for Hardware Design and Test

    Authors: Jason Blocklove, Siddharth Garg, Ramesh Karri, Hammond Pearce

    Abstract: Large Language Models (LLMs) have demonstrated capabilities for producing code in Hardware Description Languages (HDLs). However, most of the focus remains on their abilities to write functional code, not test code. The hardware design process consists of both design and test, and so eschewing validation and verification leaves considerable potential benefit unexplored, given that a design and tes… ▽ More

    Submitted 23 April, 2024; originally announced May 2024.

  16. arXiv:2405.01858  [pdf, other

    cs.CL cs.CY

    SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India

    Authors: Salam Michael Singh, Shubhmoy Kumar Garg, Amitesh Misra, Aaditeshwar Seth, Tanmoy Chakraborty

    Abstract: Sexual education aims to foster a healthy lifestyle in terms of emotional, mental and social well-being. In countries like India, where adolescents form the largest demographic group, they face significant vulnerabilities concerning sexual health. Unfortunately, sexual education is often stigmatized, creating barriers to providing essential counseling and information to this at-risk population. Co… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  17. arXiv:2405.01073  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    Poisoning Attacks on Federated Learning for Autonomous Driving

    Authors: Sonakshi Garg, Hugo Jönsson, Gustav Kalander, Axel Nilsson, Bhhaanu Pirange, Viktor Valadi, Johan Östman

    Abstract: Federated Learning (FL) is a decentralized learning paradigm, enabling parties to collaboratively train models while kee** their data confidential. Within autonomous driving, it brings the potential of reducing data storage costs, reducing bandwidth requirements, and to accelerate the learning. FL is, however, susceptible to poisoning attacks. In this paper, we introduce two novel poisoning atta… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted to SCAI2024

  18. arXiv:2404.16137  [pdf, ps, other

    cs.IT cs.LG eess.SP

    Learned Pulse Sha** Design for PAPR Reduction in DFT-s-OFDM

    Authors: Fabrizio Carpi, Soheil Rostami, Joonyoung Cho, Siddharth Garg, Elza Erkip, Charlie Jianzhong Zhang

    Abstract: High peak-to-average power ratio (PAPR) is one of the main factors limiting cell coverage for cellular systems, especially in the uplink direction. Discrete Fourier transform spread orthogonal frequency-domain multiplexing (DFT-s-OFDM) with spectrally-extended frequency-domain spectrum sha** (FDSS) is one of the efficient techniques deployed to lower the PAPR of the uplink waveforms. In this wor… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 5 pages, under review

  19. arXiv:2404.14758  [pdf, other

    math.OC cs.LG stat.ML

    Second-order Information Promotes Mini-Batch Robustness in Variance-Reduced Gradients

    Authors: Sachin Garg, Albert S. Berahas, Michał Dereziński

    Abstract: We show that, for finite-sum minimization problems, incorporating partial second-order information of the objective function can dramatically improve the robustness to mini-batch size of variance-reduced stochastic gradient methods, making them more scalable while retaining their benefits over traditional Newton-type approaches. We demonstrate this phenomenon on a prototypical stochastic second-or… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    MSC Class: 65K05; 90C06; 90C30

  20. arXiv:2404.14594  [pdf, other

    cs.IT eess.SP

    Neural Compress-and-Forward for the Relay Channel

    Authors: Ezgi Ozyilkan, Fabrizio Carpi, Siddharth Garg, Elza Erkip

    Abstract: The relay channel, consisting of a source-destination pair and a relay, is a fundamental component of cooperative communications. While the capacity of a general relay channel remains unknown, various relaying strategies, including compress-and-forward (CF), have been proposed. For CF, given the correlated signals at the relay and destination, distributed compression techniques, such as Wyner-Ziv… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: in submission, under review

  21. arXiv:2404.07815  [pdf, other

    cs.LG cs.AI stat.ML

    Post-Hoc Reversal: Are We Selecting Models Prematurely?

    Authors: Rishabh Ranjan, Saurabh Garg, Mrigank Raman, Carlos Guestrin, Zachary Chase Lipton

    Abstract: Trained models are often composed with post-hoc transforms such as temperature scaling (TS), ensembling and stochastic weight averaging (SWA) to improve performance, robustness, uncertainty estimation, etc. However, such transforms are typically applied only after the base models have already been finalized by standard means. In this paper, we challenge this practice with an extensive empirical st… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 9 pages + references + appendix, 7 figures

  22. arXiv:2404.07377  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.IT

    Deep Generative Sampling in the Dual Divergence Space: A Data-efficient & Interpretative Approach for Generative AI

    Authors: Sahil Garg, Anderson Schneider, Anant Raj, Kashif Rasul, Yuriy Nevmyvaka, Sneihil Gopal, Amit Dhurandhar, Guillermo Cecchi, Irina Rish

    Abstract: Building on the remarkable achievements in generative sampling of natural images, we propose an innovative challenge, potentially overly ambitious, which involves generating samples of entire multivariate time series that resemble images. However, the statistical challenge lies in the small sample size, sometimes consisting of a few hundred subjects. This issue is especially problematic for deep g… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  23. arXiv:2404.06442  [pdf, other

    cs.CV cs.RO

    QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding

    Authors: Yash Mehan, Kumaraditya Gupta, Rohit Jayanti, Anirudh Govil, Sourav Garg, Madhava Krishna

    Abstract: Understanding the structural organisation of 3D indoor scenes in terms of rooms is often accomplished via floorplan extraction. Robotic tasks such as planning and navigation require a semantic understanding of the scene as well. This is typically achieved via object-level semantic segmentation. However, such methods struggle to segment out topological regions like "kitchen" in the scene. In this w… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  24. arXiv:2403.20283  [pdf, ps, other

    cs.CC cs.DS

    A New Information Complexity Measure for Multi-pass Streaming with Applications

    Authors: Mark Braverman, Sumegha Garg, Qian Li, Shuo Wang, David P. Woodruff, Jiapeng Zhang

    Abstract: We introduce a new notion of information complexity for multi-pass streaming problems and use it to resolve several important questions in data streams. In the coin problem, one sees a stream of $n$ i.i.d. uniform bits and one would like to compute the majority with constant advantage. We show that any constant pass algorithm must use $Ω(\log n)$ bits of memory, significantly extending an earlie… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: To appear in STOC 2024

  25. arXiv:2403.17373  [pdf, other

    cs.CV cs.AI cs.LG

    AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving

    Authors: Mingfu Liang, Jong-Chyi Su, Samuel Schulter, Sparsh Garg, Shiyu Zhao, Ying Wu, Manmohan Chandraker

    Abstract: Autonomous vehicle (AV) systems rely on robust perception models as a cornerstone of safety assurance. However, objects encountered on the road exhibit a long-tailed distribution, with rare or unseen categories posing challenges to a deployed perception model. This necessitates an expensive process of continuously curating and annotating data with significant human effort. We propose to leverage r… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR-2024

  26. arXiv:2403.05798  [pdf, other

    cs.LG

    $\textbf{S}^2$IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series Forecasting

    Authors: Zijie Pan, Yushan Jiang, Sahil Garg, Anderson Schneider, Yuriy Nevmyvaka, Dong** Song

    Abstract: Recently, there has been a growing interest in leveraging pre-trained large language models (LLMs) for various time series applications. However, the semantic space of LLMs, established through the pre-training, is still underexplored and may help yield more distinctive and informative representations to facilitate time series forecasting. To this end, we propose Semantic Space Informed Prompt lea… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  27. arXiv:2402.16926  [pdf, other

    cs.CR cs.AI cs.LG stat.ML

    On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem

    Authors: Georg Pichler, Marco Romanelli, Divya Prakash Manivannan, Prashanth Krishnamurthy, Farshad Khorrami, Siddharth Garg

    Abstract: We introduce a formal statistical definition for the problem of backdoor detection in machine learning systems and use it to analyze the feasibility of such problems, providing evidence for the utility and applicability of our definition. The main contributions of this work are an impossibility result and an achievability result for backdoor detection. We show a no-free-lunch theorem, proving that… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  28. arXiv:2402.15487  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation

    Authors: Hanxiao Jiang, Binghao Huang, Ruihai Wu, Zhuoran Li, Shubham Garg, Hooshang Nayyeri, Shenlong Wang, Yunzhu Li

    Abstract: Robots need to explore their surroundings to adapt to and tackle tasks in unknown environments. Prior work has proposed building scene graphs of the environment but typically assumes that the environment is static, omitting regions that require active interactions. This severely limits their ability to handle more complex tasks in household and office environments: before setting up a table, robot… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Project Page: https://jianghanxiao.github.io/roboexp-web/

  29. arXiv:2402.12722  [pdf, other

    cs.LG

    Structural Knowledge Informed Continual Multivariate Time Series Forecasting

    Authors: Zijie Pan, Yushan Jiang, Dong** Song, Sahil Garg, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka

    Abstract: Recent studies in multivariate time series (MTS) forecasting reveal that explicitly modeling the hidden dependencies among different time series can yield promising forecasting performance and reliable explanations. However, modeling variable dependencies remains underexplored when MTS is continuously accumulated under different regimes (stages). Due to the potential distribution and dependency di… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  30. arXiv:2402.11814  [pdf, other

    cs.CR

    An Empirical Evaluation of LLMs for Solving Offensive Security Challenges

    Authors: Minghao Shao, Boyuan Chen, Sofija Jancheska, Brendan Dolan-Gavitt, Siddharth Garg, Ramesh Karri, Muhammad Shafique

    Abstract: Capture The Flag (CTF) challenges are puzzles related to computer security scenarios. With the advent of large language models (LLMs), more and more CTF participants are using LLMs to understand and solve the challenges. However, so far no work has evaluated the effectiveness of LLMs in solving CTF challenges with a fully automated workflow. We develop two CTF-solving workflows, human-in-the-loop… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  31. arXiv:2402.03486  [pdf, other

    cs.LG cs.AI cs.IR

    Early prediction of onset of sepsis in Clinical Setting

    Authors: Fahim Mohammad, Lakshmi Arunachalam, Samanway Sadhu, Boudewijn Aasman, Shweta Garg, Adil Ahmed, Silvie Colman, Meena Arunachalam, Sudhir Kulkarni, Parsa Mirhaji

    Abstract: This study proposes the use of Machine Learning models to predict the early onset of sepsis using deidentified clinical data from Montefiore Medical Center in Bronx, NY, USA. A supervised learning approach was adopted, wherein an XGBoost model was trained utilizing 80\% of the train dataset, encompassing 107 features (including the original and derived features). Subsequently, the model was evalua… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 16 pages, 6 figures and 7 tables

  32. arXiv:2402.03289  [pdf, other

    cs.LG cs.AI cs.AR

    Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS

    Authors: Matthew DeLorenzo, Animesh Basak Chowdhury, Vasudev Gohil, Shailja Thakur, Ramesh Karri, Siddharth Garg, Jeyavijayan Rajendran

    Abstract: Existing large language models (LLMs) for register transfer level code generation face challenges like compilation failures and suboptimal power, performance, and area (PPA) efficiency. This is due to the lack of PPA awareness in conventional transformer decoding algorithms. In response, we present an automated transformer decoding algorithm that integrates Monte Carlo tree-search for lookahead, g… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  33. arXiv:2402.03182  [pdf, other

    cs.LG

    Empowering Time Series Analysis with Large Language Models: A Survey

    Authors: Yushan Jiang, Zijie Pan, Xikun Zhang, Sahil Garg, Anderson Schneider, Yuriy Nevmyvaka, Dong** Song

    Abstract: Recently, remarkable progress has been made over large language models (LLMs), demonstrating their unprecedented capability in varieties of natural language tasks. However, completely training a large general-purpose model from the scratch is challenging for time series analysis, due to the large volumes and varieties of time series data, as well as the non-stationarity that leads to concept drift… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  34. arXiv:2401.14033  [pdf, ps, other

    cs.LG

    Novel Quadratic Constraints for Extending LipSDP beyond Slope-Restricted Activations

    Authors: Patricia Pauli, Aaron Havens, Alexandre Araujo, Siddharth Garg, Farshad Khorrami, Frank Allgöwer, Bin Hu

    Abstract: Recently, semidefinite programming (SDP) techniques have shown great promise in providing accurate Lipschitz bounds for neural networks. Specifically, the LipSDP approach (Fazlyab et al., 2019) has received much attention and provides the least conservative Lipschitz upper bounds that can be computed with polynomial time guarantees. However, one main restriction of LipSDP is that its formulation r… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: accepted as a conference paper at ICLR 2024

  35. arXiv:2401.12205  [pdf, other

    cs.LG cs.AI cs.AR

    Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization

    Authors: Animesh Basak Chowdhury, Marco Romanelli, Benjamin Tan, Ramesh Karri, Siddharth Garg

    Abstract: Logic synthesis, a pivotal stage in chip design, entails optimizing chip specifications encoded in hardware description languages like Verilog into highly efficient implementations using Boolean logic gates. The process involves a sequential application of logic minimization heuristics (``synthesis recipe"), with their arrangement significantly impacting crucial metrics such as area and delay. Add… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted in ICLR 2024

  36. arXiv:2401.10423  [pdf, other

    cs.FL cs.PL

    Verification under TSO with an infinite Data Domain

    Authors: Parosh Aziz Abdulla, Mohamed Faouzi Atig, Florian Furbach, Shashwat Garg

    Abstract: We examine verification of concurrent programs under the total store ordering (TSO) semantics used by the x86 architecture. In our model, threads manipulate variables over infinite domains and they can check whether variables are related for a range of relations. We show that, in general, the control state reachability problem is undecidable. This result is derived through a reduction from the sta… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  37. arXiv:2312.03318  [pdf, other

    cs.LG cs.CV stat.ML

    Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift

    Authors: Saurabh Garg, Amrith Setlur, Zachary Chase Lipton, Sivaraman Balakrishnan, Virginia Smith, Aditi Raghunathan

    Abstract: Self-training and contrastive learning have emerged as leading techniques for incorporating unlabeled data, both under distribution shift (unsupervised domain adaptation) and when it is absent (semi-supervised learning). However, despite the popularity and compatibility of these techniques, their efficacy in combination remains unexplored. In this paper, we undertake a systematic empirical investi… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  38. arXiv:2311.15373  [pdf, other

    cs.LG cs.AI cs.CR

    Confidence Is All You Need for MI Attacks

    Authors: Abhishek Sinha, Himanshi Tibrewal, Mansi Gupta, Nikhar Waghela, Shivank Garg

    Abstract: In this evolving era of machine learning security, membership inference attacks have emerged as a potent threat to the confidentiality of sensitive data. In this attack, adversaries aim to determine whether a particular point was used during the training of a target model. This paper proposes a new method to gauge a data point's membership in a model's training set. Instead of correlating loss wit… ▽ More

    Submitted 19 June, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 2 pages, 1 figure

  39. arXiv:2311.14710  [pdf, other

    cs.NE cs.LG

    Neuroscience inspired scientific machine learning (Part-2): Variable spiking wavelet neural operator

    Authors: Shailesh Garg, Souvik Chakraborty

    Abstract: We propose, in this paper, a Variable Spiking Wavelet Neural Operator (VS-WNO), which aims to bridge the gap between theoretical and practical implementation of Artificial Intelligence (AI) algorithms for mechanics applications. With recent developments like the introduction of neural operators, AI's potential for being used in mechanics applications has increased significantly. However, AI's imme… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  40. arXiv:2311.11194  [pdf, other

    cs.DS cs.IT cs.LG stat.ML

    Testing with Non-identically Distributed Samples

    Authors: Shivam Garg, Chirag Pabbaraju, Kirankumar Shiragur, Gregory Valiant

    Abstract: We examine the extent to which sublinear-sample property testing and estimation applies to settings where samples are independently but not identically distributed. Specifically, we consider the following distributional property testing framework: Suppose there is a set of distributions over a discrete support of size $k$, $\textbf{p}_1, \textbf{p}_2,\ldots,\textbf{p}_T$, and we obtain $c$ indepen… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  41. arXiv:2311.09267  [pdf, other

    cs.NE cs.LG

    Neuroscience inspired scientific machine learning (Part-1): Variable spiking neuron for regression

    Authors: Shailesh Garg, Souvik Chakraborty

    Abstract: Redundant information transfer in a neural network can increase the complexity of the deep learning model, thus increasing its power consumption. We introduce in this paper a novel spiking neuron, termed Variable Spiking Neuron (VSN), which can reduce the redundant firing using lessons from biological neuron inspired Leaky Integrate and Fire Spiking Neurons (LIF-SN). The proposed VSN blends LIF-SN… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  42. arXiv:2311.04887  [pdf, other

    cs.PL

    AutoChip: Automating HDL Generation Using LLM Feedback

    Authors: Shailja Thakur, Jason Blocklove, Hammond Pearce, Benjamin Tan, Siddharth Garg, Ramesh Karri

    Abstract: Traditionally, designs are written in Verilog hardware description language (HDL) and debugged by hardware engineers. While this approach is effective, it is time-consuming and error-prone for complex designs. Large language models (LLMs) are promising in automating HDL code generation. LLMs are trained on massive datasets of text and code, and they can learn to generate code that compiles and is… ▽ More

    Submitted 4 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  43. arXiv:2310.18491  [pdf, other

    cs.LG cs.CL cs.CR

    Publicly-Detectable Watermarking for Language Models

    Authors: Jaiden Fairoze, Sanjam Garg, Somesh Jha, Saeed Mahloujifar, Mohammad Mahmoody, Mingyuan Wang

    Abstract: We present a highly detectable, trustless watermarking scheme for LLMs: the detection algorithm contains no secret information, and it is executable by anyone. We embed a publicly-verifiable cryptographic signature into LLM output using rejection sampling. We prove that our scheme is cryptographically correct, sound, and distortion-free. We make novel uses of error-correction techniques to overcom… ▽ More

    Submitted 28 May, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  44. arXiv:2310.18274  [pdf, other

    cs.CV cs.LG

    LipSim: A Provably Robust Perceptual Similarity Metric

    Authors: Sara Ghazanfari, Alexandre Araujo, Prashanth Krishnamurthy, Farshad Khorrami, Siddharth Garg

    Abstract: Recent years have seen growing interest in develo** and applying perceptual similarity metrics. Research has shown the superiority of perceptual metrics over pixel-wise metrics in aligning with human perception and serving as a proxy for the human visual system. On the other hand, as perceptual metrics rely on neural networks, there is a growing concern regarding their resilience, given the esta… ▽ More

    Submitted 29 March, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  45. arXiv:2310.17768  [pdf, other

    cs.CV

    A Dataset of Relighted 3D Interacting Hands

    Authors: Gyeongsik Moon, Shunsuke Saito, Weipeng Xu, Rohan Joshi, Julia Buffalini, Harley Bellan, Nicholas Rosen, Jesse Richardson, Mallorie Mize, Philippe de Bree, Tomas Simon, Bo Peng, Shubham Garg, Kevyn McPhail, Takaaki Shiratori

    Abstract: The two-hand interaction is one of the most challenging signals to analyze due to the self-similarity, complicated articulations, and occlusions of hands. Although several datasets have been proposed for the two-hand interaction analysis, all of them do not achieve 1) diverse and realistic image appearances and 2) diverse and large-scale groundtruth (GT) 3D poses at the same time. In this work, we… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023 (Datasets and Benchmarks Track)

  46. arXiv:2310.16273  [pdf, other

    cs.CV

    Deep Learning for Plant Identification and Disease Classification from Leaf Images: Multi-prediction Approaches

    Authors: Jian** Yao, Son N. Tran, Saurabh Garg, Samantha Sawyer

    Abstract: Deep learning plays an important role in modern agriculture, especially in plant pathology using leaf images where convolutional neural networks (CNN) are attracting a lot of attention. While numerous reviews have explored the applications of deep learning within this research domain, there remains a notable absence of an empirical study to offer insightful comparisons due to the employment of var… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Jian** and Son are joint first authors (equal contribution)

  47. arXiv:2310.16226  [pdf, other

    cs.CV cs.CL cs.LG

    TiC-CLIP: Continual Training of CLIP Models

    Authors: Saurabh Garg, Mehrdad Farajtabar, Hadi Pouransari, Raviteja Vemulapalli, Sachin Mehta, Oncel Tuzel, Vaishaal Shankar, Fartash Faghri

    Abstract: Kee** large foundation models up to date on latest data is inherently expensive. To avoid the prohibitive costs of constantly retraining, it is imperative to continually train these models. This problem is exacerbated by the lack of any large scale continual learning benchmarks or baselines. We introduce the first set of web-scale Time-Continual (TiC) benchmarks for training vision-language mode… ▽ More

    Submitted 21 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  48. Machine Learning for Leaf Disease Classification: Data, Techniques and Applications

    Authors: Jian** Yao, Son N. Tran, Samantha Sawyer, Saurabh Garg

    Abstract: The growing demand for sustainable development brings a series of information technologies to help agriculture production. Especially, the emergence of machine learning applications, a branch of artificial intelligence, has shown multiple breakthroughs which can enhance and revolutionize plant pathology approaches. In recent years, machine learning has been adopted for leaf disease classification… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Journal ref: Artificial Intelligence Review 2023

  49. arXiv:2310.10560  [pdf, other

    cs.LG cs.AI cs.AR cs.PL

    Towards the Imagenets of ML4EDA

    Authors: Animesh Basak Chowdhury, Shailja Thakur, Hammond Pearce, Ramesh Karri, Siddharth Garg

    Abstract: Despite the growing interest in ML-guided EDA tools from RTL to GDSII, there are no standard datasets or prototypical learning tasks defined for the EDA problem domain. Experience from the computer vision community suggests that such datasets are crucial to spur further progress in ML for EDA. Here we describe our experience curating two large-scale, high-quality datasets for Verilog code generati… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Invited paper, ICCAD 2023

    Report number: October 16 Update

    Journal ref: ICCAD 2023

  50. arXiv:2310.10380  [pdf, ps, other

    cs.CL

    Contextual Data Augmentation for Task-Oriented Dialog Systems

    Authors: Dustin Axman, Avik Ray, Shubham Garg, **g Huang

    Abstract: Collection of annotated dialogs for training task-oriented dialog systems have been one of the key bottlenecks in improving current models. While dialog response generation has been widely studied on the agent side, it is not evident if similar generative models can be used to generate a large variety of, and often unexpected, user inputs that real dialog systems encounter in practice. Existing da… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: ECML-PKDD 2023 Workshop on Challenges and Opportunities of Large Language Models in Real-World Machine Learning Applications (COLLM)