Skip to main content

Showing 1–50 of 254 results for author: Balasubramanian, V

.
  1. arXiv:2406.09689  [pdf, other

    cond-mat.dis-nn cond-mat.soft cond-mat.stat-mech

    Physical networks become what they learn

    Authors: Menachem Stern, Marcelo Guzman, Felipe Martins, Andrea J Liu, Vijay Balasubramanian

    Abstract: Physical networks can develop diverse responses, or functions, by design, evolution or learning. We focus on electrical networks of nodes connected by resistive edges. Such networks can learn by adapting edge conductances to lower a cost function that penalizes deviations from a desired response. The network must also satisfy Kirchhoff's law, balancing currents at nodes, or, equivalently, minimizi… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures

  2. arXiv:2405.07921  [pdf, other

    cs.CV

    Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?

    Authors: Hari Chandana Kuchibhotla, Sai Srinivas Kancheti, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

    Abstract: Going beyond mere fine-tuning of vision-language models (VLMs), learnable prompt tuning has emerged as a promising, resource-efficient alternative. Despite their potential, effectively learning prompts faces the following challenges: (i) training in a low-shot scenario results in overfitting, limiting adaptability, and yielding weaker performance on newer classes or datasets; (ii) prompt-tuning's… ▽ More

    Submitted 20 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2404.19276  [pdf, other

    cs.CV

    C2FDrone: Coarse-to-Fine Drone-to-Drone Detection using Vision Transformer Networks

    Authors: Sairam VC Rebbapragada, Pranoy Panda, Vineeth N Balasubramanian

    Abstract: A vision-based drone-to-drone detection system is crucial for various applications like collision avoidance, countering hostile drones, and search-and-rescue operations. However, detecting drones presents unique challenges, including small object sizes, distortion, occlusion, and real-time processing requirements. Current methods integrating multi-scale feature fusion and temporal information have… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted at ICRA 2024

  4. arXiv:2402.15044  [pdf, other

    cs.CV cs.LG

    Fiducial Focus Augmentation for Facial Landmark Detection

    Authors: Purbayan Kar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth Balasubramanian

    Abstract: Deep learning methods have led to significant improvements in the performance on the facial landmark detection (FLD) task. However, detecting landmarks in challenging settings, such as head pose changes, exaggerated expressions, or uneven illumination, continue to remain a challenge due to high variability and insufficient samples. This inadequacy can be attributed to the model's inability to effe… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted to BMVC'23

  5. arXiv:2401.04647  [pdf, other

    cs.CV cs.AI cs.LG

    Advancing Ante-Hoc Explainable Models through Generative Adversarial Networks

    Authors: Tanmay Garg, Deepika Vemuri, Vineeth N Balasubramanian

    Abstract: This paper presents a novel concept learning framework for enhancing model interpretability and performance in visual classification tasks. Our approach appends an unsupervised explanation generator to the primary classifier network and makes use of adversarial training. During training, the explanation module is optimized to extract visual concepts from the classifier's latent representations, wh… ▽ More

    Submitted 3 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/). Paper accepted and presented at Deployable AI Workshop at AAAI-2024 (https://sites.google.com/view/dai-2024/home)

  6. arXiv:2312.10534  [pdf, other

    cs.LG cs.CR cs.CV

    Rethinking Robustness of Model Attributions

    Authors: Sandesh Kamath, Sankalp Mittal, Amit Deshpande, Vineeth N Balasubramanian

    Abstract: For machine learning models to be reliable and trustworthy, their decisions must be interpretable. As these models find increasing use in safety-critical applications, it is important that not just the model predictions but also their explanations (as feature attributions) be robust to small human-imperceptible input perturbations. Recent works have shown that many attribution methods are fragile… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted AAAI 2024

  7. arXiv:2312.08434  [pdf, other

    hep-th gr-qc

    The entropy of finite gravitating regions

    Authors: Vijay Balasubramanian, Charlie Cummings

    Abstract: We develop a formalism for calculating the entanglement entropy of an arbitrary spatial region of a gravitating spacetime at a moment of time symmetry. The crucial ingredient is a path integral over embeddings of the region into the overall spacetime, interpretable as a sum over the edge modes associated with the region. We find that the entanglement entropy of a gravitating region equals the mini… ▽ More

    Submitted 11 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 29 pages, 8 figures. v3: Expanded Sec. 3.4 and 4, added references, fixed typos

  8. arXiv:2312.03848  [pdf, other

    hep-th cond-mat.stat-mech nlin.CD quant-ph

    Quantum chaos, integrability, and late times in the Krylov basis

    Authors: Vijay Balasubramanian, Javier M. Magan, Qingyue Wu

    Abstract: Quantum chaotic systems are conjectured to display a spectrum whose fine-grained features (gaps and correlations) are well described by Random Matrix Theory (RMT). We propose and develop a complementary version of this conjecture: quantum chaotic systems display a Lanczos spectrum whose local means and covariances are well described by RMT. To support this proposal, we first demonstrate its validi… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  9. arXiv:2311.17166  [pdf, other

    cond-mat.stat-mech cs.CC cs.DC

    Is stochastic thermodynamics the key to understanding the energy costs of computation?

    Authors: David Wolpert, Jan Korbel, Christopher Lynn, Farita Tasnim, Joshua Grochow, Gülce Kardeş, James Aimone, Vijay Balasubramanian, Eric de Giuli, David Doty, Nahuel Freitas, Matteo Marsili, Thomas E. Ouldridge, Andrea Richa, Paul Riechers, Édgar Roldán, Brenda Rubenstein, Zoltan Toroczkai, Joseph Paradiso

    Abstract: The relationship between the thermodynamic and computational characteristics of dynamical physical systems has been a major theoretical interest since at least the 19th century, and has been of increasing practical importance as the energetic cost of digital devices has exploded over the last half century. One of the most important thermodynamic features of real-world computers is that they operat… ▽ More

    Submitted 30 November, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Typo fix

  10. arXiv:2311.08503  [pdf, other

    cs.CV cs.LG

    MADG: Margin-based Adversarial Learning for Domain Generalization

    Authors: Aveen Dayal, Vimal K. B., Linga Reddy Cenkeramaddi, C. Krishna Mohan, Abhinav Kumar, Vineeth N Balasubramanian

    Abstract: Domain Generalization (DG) techniques have emerged as a popular approach to address the challenges of domain shift in Deep Learning (DL), with the goal of generalizing well to the target domain unseen during the training. In recent years, numerous methods have been proposed to address the DG setting, among which one popular approach is the adversarial learning-based methodology. The main idea behi… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  11. arXiv:2310.15117  [pdf, other

    cs.AI cs.CL

    Causal Inference Using LLM-Guided Discovery

    Authors: Aniket Vashishtha, Abbavaram Gowtham Reddy, Abhinav Kumar, Saketh Bachu, Vineeth N Balasubramanian, Amit Sharma

    Abstract: At the core of causal inference lies the challenge of determining reliable causal graphs solely based on observational data. Since the well-known backdoor criterion depends on the graph, any errors in the graph can propagate downstream to effect inference. In this work, we initially show that complete graph information is not necessary for causal effect inference; the topological order over graph… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  12. arXiv:2310.00377  [pdf, other

    cs.LG

    Mitigating the Effect of Incidental Correlations on Part-based Learning

    Authors: Gaurav Bhatt, Deepayan Das, Leonid Sigal, Vineeth N Balasubramanian

    Abstract: Intelligent systems possess a crucial characteristic of breaking complicated problems into smaller reusable components or parts and adjusting to new tasks using these part representations. However, current part-learners encounter difficulties in dealing with incidental correlations resulting from the limited observations of objects that may appear only in specific arrangements or with specific bac… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: Accepted in 37th Conference on Neural Information Processing Systems (NeurIPS'2023)

  13. arXiv:2309.14715  [pdf, other

    cs.CV cs.HC cs.LG

    Explaining Deep Face Algorithms through Visualization: A Survey

    Authors: Thrupthi Ann John, Vineeth N Balasubramanian, C. V. Jawahar

    Abstract: Although current deep models for face tasks surpass human performance on some benchmarks, we do not understand how they work. Thus, we cannot predict how it will react to novel inputs, resulting in catastrophic failures and unwanted biases in the algorithms. Explainable AI helps bridge the gap, but currently, there are very few visualization algorithms designed for faces. This work undertakes a fi… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    ACM Class: I.2.10; I.4.10; I.5.1

    Journal ref: IEEE Transactions in Biometrics, Behaviour and Identity Science (IEEE T-BIOM) 2023

  14. arXiv:2309.02429  [pdf, other

    cs.CV cs.AI cs.LG

    Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach

    Authors: Vimal K B, Saketh Bachu, Tanmay Garg, Niveditha Lakshmi Narasimhan, Raghavan Konuru, Vineeth N Balasubramanian

    Abstract: Estimating the transferability of publicly available pretrained models to a target task has assumed an important place for transfer learning tasks in recent years. Existing efforts propose metrics that allow a user to choose one model from a pool of pre-trained models without having to fine-tune each model individually and identify one explicitly. With the growth in the number of available pre-tra… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: To appear at ICCV 2023

  15. arXiv:2308.09748  [pdf, other

    hep-th

    de Sitter space is sometimes not empty

    Authors: Vijay Balasubramanian, Yasunori Nomura, Tomonori Uga**

    Abstract: Multiple lines of evidence suggest that the Hilbert space of an isolated de Sitter universe is one dimensional but can appear larger when probed by a gravitating observer. To test this idea, we compute the von Neumann entropy of a field theory in a two-dimensional de Sitter universe which is entangled in a thermal-like state with the same field theory on a disjoint, asymptotically anti-de Sitter (… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 50 pages, 13 figures

    Report number: RIKEN-iTHEMS-Report-23

  16. arXiv:2306.12928  [pdf, other

    cond-mat.dis-nn cond-mat.soft cond-mat.stat-mech

    The Physical Effects of Learning

    Authors: Menachem Stern, Andrea J. Liu, Vijay Balasubramanian

    Abstract: Interacting many-body physical systems ranging from neural networks in the brain to folding proteins to self-modifying electrical circuits can learn to perform diverse tasks. This learning, both in nature and in engineered systems, can occur through evolutionary selection or through dynamical rules that drive active learning from experience. Here, we show that \added{learning in linear physical ne… ▽ More

    Submitted 27 January, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: 24 pages, 10 figures

  17. arXiv:2305.18183  [pdf, other

    cs.LG cs.CV stat.ML

    On Counterfactual Data Augmentation Under Confounding

    Authors: Abbavaram Gowtham Reddy, Saketh Bachu, Saloni Dash, Charchit Sharma, Amit Sharma, Vineeth N Balasubramanian

    Abstract: Counterfactual data augmentation has recently emerged as a method to mitigate confounding biases in the training data. These biases, such as spurious correlations, arise due to various observed and unobserved confounding variables in the data generation process. In this paper, we formally analyze how confounding biases impact downstream classifiers and present a causal viewpoint to the solutions b… ▽ More

    Submitted 21 November, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  18. arXiv:2304.08976  [pdf, other

    q-bio.PE cond-mat.stat-mech math.ST q-bio.QM

    Playing it safe: information constrains collective betting strategies

    Authors: Philipp Fleig, Vijay Balasubramanian

    Abstract: Every interaction of a living organism with its environment involves the placement of a bet. Armed with partial knowledge about a stochastic world, the organism must decide its next step or near-term strategy, an act that implicitly or explicitly involves the assumption of a model of the world. Better information about environmental statistics can improve the bet quality, but in practice resources… ▽ More

    Submitted 28 May, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: 23 pages, 10 figures; replotted figs S1-S3; corrected typos; added references

  19. arXiv:2304.01957  [pdf, ps, other

    cond-mat.soft cond-mat.mtrl-sci

    Dispersion and Orientation patterns in nanorod-infused polymer melts

    Authors: Navid Afrasiabian, Venkat Balasubramanian, Colin Denniston

    Abstract: Introducing nanorods into a polymeric matrix can enhance the physical and mechanical properties of the resulting material. In this paper, we focus on understanding the dispersion and orientation patterns of nanorods in an unentangled polymer melt, particularly as a function of nanorod concentration, using Molecular Dynamics (MD) simulations. The system is comprised of flexible polymer chains and m… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 16 pages, 15 figures

    Journal ref: J. Chem. Phys. 158, 054902 (2023)

  20. arXiv:2303.14772  [pdf, other

    cs.CV

    $Δ$-Patching: A Framework for Rapid Adaptation of Pre-trained Convolutional Networks without Base Performance Loss

    Authors: Chaitanya Devaguptapu, Samarth Sinha, K J Joseph, Vineeth N Balasubramanian, Animesh Garg

    Abstract: Models pre-trained on large-scale datasets are often fine-tuned to support newer tasks and datasets that arrive over time. This process necessitates storing copies of the model over time for each task that the pre-trained model is fine-tuned to. Building on top of recent model patching work, we propose $Δ$-Patching for fine-tuning neural network models in an efficient manner, without the need to s… ▽ More

    Submitted 21 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  21. arXiv:2303.13850  [pdf, other

    cs.LG cs.AI stat.ME

    Towards Learning and Explaining Indirect Causal Effects in Neural Networks

    Authors: Abbavaram Gowtham Reddy, Saketh Bachu, Harsharaj Pathak, Benin L Godfrey, Vineeth N. Balasubramanian, Varshaneya V, Satya Narayanan Kar

    Abstract: Recently, there has been a growing interest in learning and explaining causal effects within Neural Network (NN) models. By virtue of NN architectures, previous approaches consider only direct and total causal effects assuming independence among input variables. We view an NN as a structural causal model (SCM) and extend our focus to include indirect causal effects by introducing feedforward conne… ▽ More

    Submitted 8 January, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: AAAI 2024

  22. Classification of Methods to Reduce Clinical Alarm Signals for Remote Patient Monitoring: A Critical Review

    Authors: Teena Arora, Venki Balasubramanian, Andrew Stranieri, Shenhan Mai, Rajkumar Buyya, Sardar Islam

    Abstract: Remote Patient Monitoring (RPM) is an emerging technology paradigm that helps reduce clinician workload by automated monitoring and raising intelligent alarm signals. High sensitivity and intelligent data-processing algorithms used in RPM devices result in frequent false-positive alarms, resulting in alarm fatigue. This study aims to critically review the existing literature to identify the causes… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 25 pages, 6 figures

    ACM Class: A.1

  23. Quantum Error Correction from Complexity in Brownian SYK

    Authors: Vijay Balasubramanian, Arjun Kar, Cathy Li, Onkar Parrikar, Harshit Rajgadia

    Abstract: We study the robustness of quantum error correction in a one-parameter ensemble of codes generated by the Brownian SYK model, where the parameter quantifies the encoding complexity. The robustness of error correction by a quantum code is upper bounded by the "mutual purity" of a certain entangled state between the code subspace and environment in the isometric extension of the error channel, where… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: 40+14 pages, 8 figures

  24. arXiv:2301.06928  [pdf, other

    cs.LG cs.AI

    Towards Estimating Transferability using Hard Subsets

    Authors: Tarun Ram Menta, Surgan Jandial, Akash Patil, Vimal KB, Saketh Bachu, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Chirag Agarwal, Mausoom Sarkar

    Abstract: As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine tuning. In this work, we propose HASTE (HArd Subset TransfErability), a new strategy to estimate the transferability of a source model to a pa… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: First three authors contributed equally

  25. arXiv:2212.08623  [pdf, other

    hep-th gr-qc

    Microscopic origin of the entropy of astrophysical black holes

    Authors: Vijay Balasubramanian, Albion Lawrence, Javier M. Magan, Martin Sasieta

    Abstract: We construct an infinite family of microstates for black holes in Minkowski spacetime which have effective semiclassical descriptions in terms of collapsing dust shells in the black hole interior. Quantum mechanical wormholes cause these states to have exponentially small, but universal, overlaps. We show that these overlaps imply that the microstates span a Hilbert space of log dimension equal to… ▽ More

    Submitted 2 November, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: 9 pages, 4 figures. v3: minor edits, discussion section added

  26. arXiv:2212.02447  [pdf, other

    hep-th gr-qc

    Microscopic origin of the entropy of black holes in general relativity

    Authors: Vijay Balasubramanian, Albion Lawrence, Javier M. Magan, Martin Sasieta

    Abstract: We construct an infinite family of microstates with geometric interiors for eternal black holes in general relativity with negative cosmological constant in any dimension. Wormholes in the Euclidean path integral for gravity cause these states to have small, but non-zero, quantum mechanical overlaps that have a universal form. The overlaps have a dramatic consequence: the microstates span a Hilber… ▽ More

    Submitted 17 October, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 46 pages + appendices. v4: minor edits, added references

  27. arXiv:2211.04780  [pdf, other

    cs.LG cs.CR cs.CV

    On the Robustness of Explanations of Deep Neural Network Models: A Survey

    Authors: Amlan Jyoti, Karthik Balaji Ganesh, Manoj Gayala, Nandita Lakshmi Tunuguntla, Sandesh Kamath, Vineeth N Balasubramanian

    Abstract: Explainability has been widely stated as a cornerstone of the responsible and trustworthy use of machine learning models. With the ubiquitous use of Deep Neural Network (DNN) models expanding to risk-sensitive and safety-critical domains, many methods have been proposed to explain the decisions of these models. Recent years have also seen concerted efforts that have shown how such explanations can… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Under Review ACM Computing Surveys "Special Issue on Trustworthy AI"

  28. arXiv:2211.04370  [pdf, other

    cs.AI cs.LG stat.ME

    NESTER: An Adaptive Neurosymbolic Method for Causal Effect Estimation

    Authors: Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

    Abstract: Causal effect estimation from observational data is a central problem in causal inference. Methods based on potential outcomes framework solve this problem by exploiting inductive biases and heuristics from causal inference. Each of these methods addresses a specific aspect of causal effect estimation, such as controlling propensity score, enforcing randomization, etc., by designing neural network… ▽ More

    Submitted 8 January, 2024; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: AAAI 2024

  29. arXiv:2210.12368  [pdf, other

    cs.LG cs.AI

    Counterfactual Generation Under Confounding

    Authors: Abbavaram Gowtham Reddy, Saloni Dash, Amit Sharma, Vineeth N Balasubramanian

    Abstract: A machine learning model, under the influence of observed or unobserved confounders in the training data, can learn spurious correlations and fail to generalize when deployed. For image classifiers, augmenting a training dataset using counterfactual examples has been empirically shown to break spurious correlations. However, the counterfactual generation task itself becomes more difficult as the l… ▽ More

    Submitted 10 December, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

  30. arXiv:2210.11728  [pdf, other

    cs.CV

    Distilling the Undistillable: Learning from a Nasty Teacher

    Authors: Surgan Jandial, Yash Khasbage, Arghya Pal, Vineeth N Balasubramanian, Balaji Krishnamurthy

    Abstract: The inadvertent stealing of private/sensitive information using Knowledge Distillation (KD) has been getting significant attention recently and has guided subsequent defense efforts considering its critical nature. Recent work Nasty Teacher proposed to develop teachers which can not be distilled or imitated by models attacking it. However, the promise of confidentiality offered by a nasty teacher… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Published in main track of ECCV 2022, 17 pages with references, 5 figures, 6 tables

    Journal ref: ECCV 2022

  31. arXiv:2210.04574  [pdf, other

    cs.CV

    ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object Detection

    Authors: Rebbapragada V C Sairam, Monish Keswani, Uttaran Sinha, Nishit Shah, Vineeth N Balasubramanian

    Abstract: Deep neural networks tend to reciprocate the bias of their training dataset. In object detection, the bias exists in the form of various imbalances such as class, background-foreground, and object size. In this paper, we denote size of an object as the number of pixels it covers in an image and size imbalance as the over-representation of certain sizes of objects in a dataset. We aim to address th… ▽ More

    Submitted 18 November, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  32. arXiv:2208.08452  [pdf, other

    hep-th cond-mat.stat-mech math-ph nlin.CD

    A Tale of Two Hungarians: Tridiagonalizing Random Matrices

    Authors: Vijay Balasubramanian, Javier M. Magan, Qingyue Wu

    Abstract: The Hungarian physicist Eugene Wigner introduced random matrix models in physics to describe the energy spectra of atomic nuclei. As such, the main goal of Random Matrix Theory (RMT) has been to derive the eigenvalue statistics of matrices drawn from a given distribution. The Wigner approach gives powerful insights into the properties of complex, chaotic systems in thermal equilibrium. Another Hun… ▽ More

    Submitted 30 September, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: 30 pages. References added

  33. arXiv:2208.03767  [pdf, other

    cs.CV cs.AI cs.LG

    Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer

    Authors: Arjun Ashok, K J Joseph, Vineeth Balasubramanian

    Abstract: In class-incremental learning, the model is expected to learn new classes continually while maintaining knowledge on previous classes. The challenge here lies in preserving the model's ability to effectively represent prior classes in the feature space, while adapting it to represent incoming new classes. We propose two distillation-based objectives for class incremental learning that leverage the… ▽ More

    Submitted 16 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted at ECCV 2022; Project Page at http://cscct.github.io/

  34. Learning Modular Structures That Generalize Out-of-Distribution

    Authors: Arjun Ashok, Chaitanya Devaguptapu, Vineeth Balasubramanian

    Abstract: Out-of-distribution (O.O.D.) generalization remains to be a key challenge for real-world machine learning systems. We describe a method for O.O.D. generalization that, through training, encourages models to only preserve features in the network that are well reused across multiple training domains. Our method combines two complementary neuron-level regularizers with a probabilistic differentiable… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted at AAAI 2022 Student Abstract and Poster Program

  35. arXiv:2207.10659  [pdf, other

    cs.CV cs.AI cs.LG

    Novel Class Discovery without Forgetting

    Authors: K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian

    Abstract: Humans possess an innate ability to identify and differentiate instances that they are not familiar with, by leveraging and adapting the knowledge that they have acquired so far. Importantly, they achieve this without deteriorating the performance on their earlier learning. Inspired by this, we identify and formulate a new, pragmatic problem setting of NCDwF: Novel Class Discovery without Forgetti… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  36. arXiv:2206.05912  [pdf, other

    cs.CV

    INDIGO: Intrinsic Multimodality for Domain Generalization

    Authors: Puneet Mangla, Shivam Chandhok, Milan Aggarwal, Vineeth N Balasubramanian, Balaji Krishnamurthy

    Abstract: For models to generalize under unseen domains (a.k.a domain generalization), it is crucial to learn feature representations that are domain-agnostic and capture the underlying semantics that makes up an object category. Recent advances towards weakly supervised vision-language models that learn holistic representations from cheap weakly supervised noisy text annotations have shown their ability on… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: Under Submission

  37. arXiv:2205.03859  [pdf, other

    cs.CV cs.LG

    On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models

    Authors: Vedant Singh, Surgan Jandial, Ayush Chopra, Siddharth Ramesh, Balaji Krishnamurthy, Vineeth N. Balasubramanian

    Abstract: Conditional image generation has paved the way for several breakthroughs in image editing, generating stock photos and 3-D object generation. This continues to be a significant area of interest with the rise of new state-of-the-art methods that are based on diffusion models. However, diffusion models provide very little control over the generated image, which led to subsequent works exploring tech… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: Accepted at the workshop on AI for Content Creation at CVPR 2022

  38. arXiv:2204.11830  [pdf, other

    cs.CV

    Proto2Proto: Can you recognize the car, the way I do?

    Authors: Monish Keswani, Sriranjani Ramakrishnan, Nishant Reddy, Vineeth N Balasubramanian

    Abstract: Prototypical methods have recently gained a lot of attention due to their intrinsic interpretable nature, which is obtained through the prototypes. With growing use cases of model reuse and distillation, there is a need to also study transfer of interpretability from one model to another. We present Proto2Proto, a novel method to transfer interpretability of one prototypical part network to anothe… ▽ More

    Submitted 2 May, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: To appear in CVPR 2022. Code is available at https://github.com/archmaester/proto2proto

  39. arXiv:2204.10595  [pdf, other

    cs.CV cs.AI cs.LG

    Spacing Loss for Discovering Novel Categories

    Authors: K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian

    Abstract: Novel Class Discovery (NCD) is a learning paradigm, where a machine learning model is tasked to semantically group instances from unlabeled data, by utilizing labeled instances from a disjoint set of classes. In this work, we first characterize existing NCD approaches into single-stage and two-stage methods based on whether they require access to labeled and unlabeled data together while discoveri… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted to Continual Learning in Computer Vision Workshop (CLVision) at CVPR 2022

  40. CholecTriplet2021: A benchmark challenge for surgical action triplet recognition

    Authors: Chinedu Innocent Nwoye, Deepak Alapatt, Tong Yu, Armine Vardazaryan, Fangfang Xia, Zixuan Zhao, Tong Xia, Fucang Jia, Yuxuan Yang, Hao Wang, Derong Yu, Guoyan Zheng, Xiaotian Duan, Neil Getty, Ricardo Sanchez-Matilla, Maria Robu, Li Zhang, Huabin Chen, Jiacheng Wang, Liansheng Wang, Bokai Zhang, Beerend Gerats, Sista Raviteja, Rachana Sathish, Rong Tao , et al. (37 additional authors not shown)

    Abstract: Context-aware decision support in the operating room can foster surgical safety and efficiency by leveraging real-time feedback from surgical workflow analysis. Most existing works recognize surgical activities at a coarse-grained level, such as phases, steps or events, leaving out fine-grained interaction details about the surgical activity; yet those are needed for more helpful AI assistance in… ▽ More

    Submitted 29 December, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: CholecTriplet2021 challenge report. Paper accepted at Elsevier journal of Medical Image Analysis. 22 pages, 8 figures, 11 tables. Challenge website: https://cholectriplet2021.grand-challenge.org

    Journal ref: Medical Image Analysis 86 (2023) 102803

  41. arXiv:2203.16517  [pdf, other

    cs.CV

    Unseen Classes at a Later Time? No Problem

    Authors: Hari Chandana Kuchibhotla, Sumitra S Malagi, Shivam Chandhok, Vineeth N Balasubramanian

    Abstract: Recent progress towards learning from limited supervision has encouraged efforts towards designing models that can recognize novel classes at test time (generalized zero-shot learning or GZSL). GZSL approaches assume knowledge of all classes, with or without labeled data, beforehand. However, practical scenarios demand models that are adaptable and can handle dynamic addition of new seen and unsee… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: To appear in CVPR 2022. Code is available @ (https://github.com/sumitramalagi/Unseen-classes-at-a-later-time)

  42. arXiv:2203.14952  [pdf, other

    cs.CV cs.AI cs.LG

    Energy-based Latent Aligner for Incremental Learning

    Authors: K J Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, Vineeth N Balasubramanian

    Abstract: Deep learning models tend to forget their earlier knowledge while incrementally learning new tasks. This behavior emerges because the parameter updates optimized for the new tasks may not align well with the updates suitable for older tasks. The resulting latent representation mismatch causes forgetting. In this work, we propose ELI: Energy-based Latent Aligner for Incremental Learning, which firs… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: To appear in CVPR 2022. Code is available in https://github.com/JosephKJ/ELI

  43. Quantum Error Correction in the Black Hole Interior

    Authors: Vijay Balasubramanian, Arjun Kar, Cathy Li, Onkar Parrikar

    Abstract: We study the quantum error correction properties of the black hole interior in a toy model for an evaporating black hole: Jackiw-Teitelboim gravity entangled with a non-gravitational bath. After the Page time, the black hole interior degrees of freedom in this system are encoded in the bath Hilbert space. We use the gravitational path integral to show that the interior density matrix is correctabl… ▽ More

    Submitted 11 April, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 48+5 pages, 10 figures

  44. arXiv:2202.06957  [pdf, other

    hep-th cond-mat.stat-mech quant-ph

    Quantum chaos and the complexity of spread of states

    Authors: Vijay Balasubramanian, Pawel Caputa, Javier Magan, Qingyue Wu

    Abstract: We propose a measure of quantum state complexity defined by minimizing the spread of the wave-function over all choices of basis. Our measure is controlled by the "survival amplitude" for a state to remain unchanged, and can be efficiently computed in theories with discrete spectra. For continuous Hamiltonian evolution, it generalizes Krylov operator complexity to quantum states. We apply our meth… ▽ More

    Submitted 20 April, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 23 pages, double column format. Added references and improved title

  45. arXiv:2112.05746  [pdf, other

    cs.LG stat.ML

    On Causally Disentangled Representations

    Authors: Abbavaram Gowtham Reddy, Benin Godfrey L, Vineeth N Balasubramanian

    Abstract: Representation learners that disentangle factors of variation have already proven to be important in addressing various real world concerns such as fairness and interpretability. Initially consisting of unsupervised models with independence assumptions, more recently, weak supervision and correlated features have been explored, but without a causal view of the generative process. In contrast, we w… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: https://causal-disentanglement.github.io/IITH-CANDLE/ ; Accepted at the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)

  46. arXiv:2111.12490  [pdf, other

    cs.LG cs.AI

    Matching Learned Causal Effects of Neural Networks with Domain Priors

    Authors: Sai Srinivas Kancheti, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian, Amit Sharma

    Abstract: A trained neural network can be interpreted as a structural causal model (SCM) that provides the effect of changing input variables on the model's output. However, if training data contains both causal and correlational relationships, a model that optimizes prediction accuracy may not necessarily learn the true causal relationships between input and output variables. On the other hand, expert user… ▽ More

    Submitted 29 June, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted at International Conference on Machine Learning (ICML'22)

  47. arXiv:2111.05956  [pdf, other

    cs.CV cs.LG

    Feature Generation for Long-tail Classification

    Authors: Rahul Vigneswaran, Marc T. Law, Vineeth N. Balasubramanian, Makarand Tapaswi

    Abstract: The visual world naturally exhibits an imbalance in the number of object or scene instances resulting in a \emph{long-tailed distribution}. This imbalance poses significant challenges for classification models based on deep learning. Oversampling instances of the tail classes attempts to solve this imbalance. However, the limited visual diversity results in a network with poor representation abili… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: Accepted at ICVGIP'21. Code available at https://github.com/rahulvigneswaran/TailCalibX

  48. arXiv:2111.00295  [pdf, other

    cs.LG cs.CR cs.CV

    Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided Curriculum Learning Approach

    Authors: Anindya Sarkar, Anirban Sarkar, Sowrya Gali, Vineeth N Balasubramanian

    Abstract: Current SOTA adversarially robust models are mostly based on adversarial training (AT) and differ only by some regularizers either at inner maximization or outer minimization steps. Being repetitive in nature during the inner maximization step, they take a huge time to train. We propose a non-iterative method that enforces the following ideas during training. Attribution maps are more aligned to t… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: 16 pages, 9 figures, Accepted at NeurIPS 2021, Code at https://github.com/sowgali/Get-Fooled-for-the-Right-Reason

  49. arXiv:2110.12205  [pdf, other

    cs.CV

    Multi-Domain Incremental Learning for Semantic Segmentation

    Authors: Prachi Garg, Rohit Saluja, Vineeth N Balasubramanian, Chetan Arora, Anbumani Subramanian, C. V. Jawahar

    Abstract: Recent efforts in multi-domain learning for semantic segmentation attempt to learn multiple geographical datasets in a universal, joint model. A simple fine-tuning experiment performed sequentially on three popular road scene segmentation datasets demonstrates that existing segmentation frameworks fail at incrementally learning on a series of visually disparate geographical domains. When learning… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: 11 pages, 5 figures, Accepted in WACV 2022

  50. arXiv:2108.11761  [pdf, other

    cs.LG cs.CV

    A Framework for Learning Ante-hoc Explainable Models via Concepts

    Authors: Anirban Sarkar, Deepak Vijaykeerthy, Anindya Sarkar, Vineeth N Balasubramanian

    Abstract: Self-explaining deep models are designed to learn the latent concept-based explanations implicitly during training, which eliminates the requirement of any post-hoc explanation generation technique. In this work, we propose one such model that appends an explanation generation module on top of any basic network and jointly trains the whole module that shows high predictive performance and generate… ▽ More

    Submitted 30 November, 2021; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: 16 pages, 15 figures