Skip to main content

Showing 1–50 of 85 results for author: Balasubramanian, V N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07921  [pdf, other

    cs.CV

    Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?

    Authors: Hari Chandana Kuchibhotla, Sai Srinivas Kancheti, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

    Abstract: Going beyond mere fine-tuning of vision-language models (VLMs), learnable prompt tuning has emerged as a promising, resource-efficient alternative. Despite their potential, effectively learning prompts faces the following challenges: (i) training in a low-shot scenario results in overfitting, limiting adaptability, and yielding weaker performance on newer classes or datasets; (ii) prompt-tuning's… ▽ More

    Submitted 20 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2404.19276  [pdf, other

    cs.CV

    C2FDrone: Coarse-to-Fine Drone-to-Drone Detection using Vision Transformer Networks

    Authors: Sairam VC Rebbapragada, Pranoy Panda, Vineeth N Balasubramanian

    Abstract: A vision-based drone-to-drone detection system is crucial for various applications like collision avoidance, countering hostile drones, and search-and-rescue operations. However, detecting drones presents unique challenges, including small object sizes, distortion, occlusion, and real-time processing requirements. Current methods integrating multi-scale feature fusion and temporal information have… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted at ICRA 2024

  3. arXiv:2401.04647  [pdf, other

    cs.CV cs.AI cs.LG

    Advancing Ante-Hoc Explainable Models through Generative Adversarial Networks

    Authors: Tanmay Garg, Deepika Vemuri, Vineeth N Balasubramanian

    Abstract: This paper presents a novel concept learning framework for enhancing model interpretability and performance in visual classification tasks. Our approach appends an unsupervised explanation generator to the primary classifier network and makes use of adversarial training. During training, the explanation module is optimized to extract visual concepts from the classifier's latent representations, wh… ▽ More

    Submitted 3 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/). Paper accepted and presented at Deployable AI Workshop at AAAI-2024 (https://sites.google.com/view/dai-2024/home)

  4. arXiv:2312.10534  [pdf, other

    cs.LG cs.CR cs.CV

    Rethinking Robustness of Model Attributions

    Authors: Sandesh Kamath, Sankalp Mittal, Amit Deshpande, Vineeth N Balasubramanian

    Abstract: For machine learning models to be reliable and trustworthy, their decisions must be interpretable. As these models find increasing use in safety-critical applications, it is important that not just the model predictions but also their explanations (as feature attributions) be robust to small human-imperceptible input perturbations. Recent works have shown that many attribution methods are fragile… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted AAAI 2024

  5. arXiv:2311.08503  [pdf, other

    cs.CV cs.LG

    MADG: Margin-based Adversarial Learning for Domain Generalization

    Authors: Aveen Dayal, Vimal K. B., Linga Reddy Cenkeramaddi, C. Krishna Mohan, Abhinav Kumar, Vineeth N Balasubramanian

    Abstract: Domain Generalization (DG) techniques have emerged as a popular approach to address the challenges of domain shift in Deep Learning (DL), with the goal of generalizing well to the target domain unseen during the training. In recent years, numerous methods have been proposed to address the DG setting, among which one popular approach is the adversarial learning-based methodology. The main idea behi… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  6. arXiv:2310.15117  [pdf, other

    cs.AI cs.CL

    Causal Inference Using LLM-Guided Discovery

    Authors: Aniket Vashishtha, Abbavaram Gowtham Reddy, Abhinav Kumar, Saketh Bachu, Vineeth N Balasubramanian, Amit Sharma

    Abstract: At the core of causal inference lies the challenge of determining reliable causal graphs solely based on observational data. Since the well-known backdoor criterion depends on the graph, any errors in the graph can propagate downstream to effect inference. In this work, we initially show that complete graph information is not necessary for causal effect inference; the topological order over graph… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  7. arXiv:2310.00377  [pdf, other

    cs.LG

    Mitigating the Effect of Incidental Correlations on Part-based Learning

    Authors: Gaurav Bhatt, Deepayan Das, Leonid Sigal, Vineeth N Balasubramanian

    Abstract: Intelligent systems possess a crucial characteristic of breaking complicated problems into smaller reusable components or parts and adjusting to new tasks using these part representations. However, current part-learners encounter difficulties in dealing with incidental correlations resulting from the limited observations of objects that may appear only in specific arrangements or with specific bac… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: Accepted in 37th Conference on Neural Information Processing Systems (NeurIPS'2023)

  8. arXiv:2309.14715  [pdf, other

    cs.CV cs.HC cs.LG

    Explaining Deep Face Algorithms through Visualization: A Survey

    Authors: Thrupthi Ann John, Vineeth N Balasubramanian, C. V. Jawahar

    Abstract: Although current deep models for face tasks surpass human performance on some benchmarks, we do not understand how they work. Thus, we cannot predict how it will react to novel inputs, resulting in catastrophic failures and unwanted biases in the algorithms. Explainable AI helps bridge the gap, but currently, there are very few visualization algorithms designed for faces. This work undertakes a fi… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    ACM Class: I.2.10; I.4.10; I.5.1

    Journal ref: IEEE Transactions in Biometrics, Behaviour and Identity Science (IEEE T-BIOM) 2023

  9. arXiv:2309.02429  [pdf, other

    cs.CV cs.AI cs.LG

    Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach

    Authors: Vimal K B, Saketh Bachu, Tanmay Garg, Niveditha Lakshmi Narasimhan, Raghavan Konuru, Vineeth N Balasubramanian

    Abstract: Estimating the transferability of publicly available pretrained models to a target task has assumed an important place for transfer learning tasks in recent years. Existing efforts propose metrics that allow a user to choose one model from a pool of pre-trained models without having to fine-tune each model individually and identify one explicitly. With the growth in the number of available pre-tra… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: To appear at ICCV 2023

  10. arXiv:2305.18183  [pdf, other

    cs.LG cs.CV stat.ML

    On Counterfactual Data Augmentation Under Confounding

    Authors: Abbavaram Gowtham Reddy, Saketh Bachu, Saloni Dash, Charchit Sharma, Amit Sharma, Vineeth N Balasubramanian

    Abstract: Counterfactual data augmentation has recently emerged as a method to mitigate confounding biases in the training data. These biases, such as spurious correlations, arise due to various observed and unobserved confounding variables in the data generation process. In this paper, we formally analyze how confounding biases impact downstream classifiers and present a causal viewpoint to the solutions b… ▽ More

    Submitted 21 November, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  11. arXiv:2303.14772  [pdf, other

    cs.CV

    $Δ$-Patching: A Framework for Rapid Adaptation of Pre-trained Convolutional Networks without Base Performance Loss

    Authors: Chaitanya Devaguptapu, Samarth Sinha, K J Joseph, Vineeth N Balasubramanian, Animesh Garg

    Abstract: Models pre-trained on large-scale datasets are often fine-tuned to support newer tasks and datasets that arrive over time. This process necessitates storing copies of the model over time for each task that the pre-trained model is fine-tuned to. Building on top of recent model patching work, we propose $Δ$-Patching for fine-tuning neural network models in an efficient manner, without the need to s… ▽ More

    Submitted 21 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  12. arXiv:2303.13850  [pdf, other

    cs.LG cs.AI stat.ME

    Towards Learning and Explaining Indirect Causal Effects in Neural Networks

    Authors: Abbavaram Gowtham Reddy, Saketh Bachu, Harsharaj Pathak, Benin L Godfrey, Vineeth N. Balasubramanian, Varshaneya V, Satya Narayanan Kar

    Abstract: Recently, there has been a growing interest in learning and explaining causal effects within Neural Network (NN) models. By virtue of NN architectures, previous approaches consider only direct and total causal effects assuming independence among input variables. We view an NN as a structural causal model (SCM) and extend our focus to include indirect causal effects by introducing feedforward conne… ▽ More

    Submitted 8 January, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: AAAI 2024

  13. arXiv:2301.06928  [pdf, other

    cs.LG cs.AI

    Towards Estimating Transferability using Hard Subsets

    Authors: Tarun Ram Menta, Surgan Jandial, Akash Patil, Vimal KB, Saketh Bachu, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Chirag Agarwal, Mausoom Sarkar

    Abstract: As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine tuning. In this work, we propose HASTE (HArd Subset TransfErability), a new strategy to estimate the transferability of a source model to a pa… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: First three authors contributed equally

  14. arXiv:2211.04780  [pdf, other

    cs.LG cs.CR cs.CV

    On the Robustness of Explanations of Deep Neural Network Models: A Survey

    Authors: Amlan Jyoti, Karthik Balaji Ganesh, Manoj Gayala, Nandita Lakshmi Tunuguntla, Sandesh Kamath, Vineeth N Balasubramanian

    Abstract: Explainability has been widely stated as a cornerstone of the responsible and trustworthy use of machine learning models. With the ubiquitous use of Deep Neural Network (DNN) models expanding to risk-sensitive and safety-critical domains, many methods have been proposed to explain the decisions of these models. Recent years have also seen concerted efforts that have shown how such explanations can… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Under Review ACM Computing Surveys "Special Issue on Trustworthy AI"

  15. arXiv:2211.04370  [pdf, other

    cs.AI cs.LG stat.ME

    NESTER: An Adaptive Neurosymbolic Method for Causal Effect Estimation

    Authors: Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

    Abstract: Causal effect estimation from observational data is a central problem in causal inference. Methods based on potential outcomes framework solve this problem by exploiting inductive biases and heuristics from causal inference. Each of these methods addresses a specific aspect of causal effect estimation, such as controlling propensity score, enforcing randomization, etc., by designing neural network… ▽ More

    Submitted 8 January, 2024; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: AAAI 2024

  16. arXiv:2210.12368  [pdf, other

    cs.LG cs.AI

    Counterfactual Generation Under Confounding

    Authors: Abbavaram Gowtham Reddy, Saloni Dash, Amit Sharma, Vineeth N Balasubramanian

    Abstract: A machine learning model, under the influence of observed or unobserved confounders in the training data, can learn spurious correlations and fail to generalize when deployed. For image classifiers, augmenting a training dataset using counterfactual examples has been empirically shown to break spurious correlations. However, the counterfactual generation task itself becomes more difficult as the l… ▽ More

    Submitted 10 December, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

  17. arXiv:2210.11728  [pdf, other

    cs.CV

    Distilling the Undistillable: Learning from a Nasty Teacher

    Authors: Surgan Jandial, Yash Khasbage, Arghya Pal, Vineeth N Balasubramanian, Balaji Krishnamurthy

    Abstract: The inadvertent stealing of private/sensitive information using Knowledge Distillation (KD) has been getting significant attention recently and has guided subsequent defense efforts considering its critical nature. Recent work Nasty Teacher proposed to develop teachers which can not be distilled or imitated by models attacking it. However, the promise of confidentiality offered by a nasty teacher… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Published in main track of ECCV 2022, 17 pages with references, 5 figures, 6 tables

    Journal ref: ECCV 2022

  18. arXiv:2210.04574  [pdf, other

    cs.CV

    ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object Detection

    Authors: Rebbapragada V C Sairam, Monish Keswani, Uttaran Sinha, Nishit Shah, Vineeth N Balasubramanian

    Abstract: Deep neural networks tend to reciprocate the bias of their training dataset. In object detection, the bias exists in the form of various imbalances such as class, background-foreground, and object size. In this paper, we denote size of an object as the number of pixels it covers in an image and size imbalance as the over-representation of certain sizes of objects in a dataset. We aim to address th… ▽ More

    Submitted 18 November, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  19. arXiv:2207.10659  [pdf, other

    cs.CV cs.AI cs.LG

    Novel Class Discovery without Forgetting

    Authors: K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian

    Abstract: Humans possess an innate ability to identify and differentiate instances that they are not familiar with, by leveraging and adapting the knowledge that they have acquired so far. Importantly, they achieve this without deteriorating the performance on their earlier learning. Inspired by this, we identify and formulate a new, pragmatic problem setting of NCDwF: Novel Class Discovery without Forgetti… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  20. arXiv:2206.05912  [pdf, other

    cs.CV

    INDIGO: Intrinsic Multimodality for Domain Generalization

    Authors: Puneet Mangla, Shivam Chandhok, Milan Aggarwal, Vineeth N Balasubramanian, Balaji Krishnamurthy

    Abstract: For models to generalize under unseen domains (a.k.a domain generalization), it is crucial to learn feature representations that are domain-agnostic and capture the underlying semantics that makes up an object category. Recent advances towards weakly supervised vision-language models that learn holistic representations from cheap weakly supervised noisy text annotations have shown their ability on… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: Under Submission

  21. arXiv:2205.03859  [pdf, other

    cs.CV cs.LG

    On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models

    Authors: Vedant Singh, Surgan Jandial, Ayush Chopra, Siddharth Ramesh, Balaji Krishnamurthy, Vineeth N. Balasubramanian

    Abstract: Conditional image generation has paved the way for several breakthroughs in image editing, generating stock photos and 3-D object generation. This continues to be a significant area of interest with the rise of new state-of-the-art methods that are based on diffusion models. However, diffusion models provide very little control over the generated image, which led to subsequent works exploring tech… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: Accepted at the workshop on AI for Content Creation at CVPR 2022

  22. arXiv:2204.11830  [pdf, other

    cs.CV

    Proto2Proto: Can you recognize the car, the way I do?

    Authors: Monish Keswani, Sriranjani Ramakrishnan, Nishant Reddy, Vineeth N Balasubramanian

    Abstract: Prototypical methods have recently gained a lot of attention due to their intrinsic interpretable nature, which is obtained through the prototypes. With growing use cases of model reuse and distillation, there is a need to also study transfer of interpretability from one model to another. We present Proto2Proto, a novel method to transfer interpretability of one prototypical part network to anothe… ▽ More

    Submitted 2 May, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: To appear in CVPR 2022. Code is available at https://github.com/archmaester/proto2proto

  23. arXiv:2204.10595  [pdf, other

    cs.CV cs.AI cs.LG

    Spacing Loss for Discovering Novel Categories

    Authors: K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian

    Abstract: Novel Class Discovery (NCD) is a learning paradigm, where a machine learning model is tasked to semantically group instances from unlabeled data, by utilizing labeled instances from a disjoint set of classes. In this work, we first characterize existing NCD approaches into single-stage and two-stage methods based on whether they require access to labeled and unlabeled data together while discoveri… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted to Continual Learning in Computer Vision Workshop (CLVision) at CVPR 2022

  24. arXiv:2203.16517  [pdf, other

    cs.CV

    Unseen Classes at a Later Time? No Problem

    Authors: Hari Chandana Kuchibhotla, Sumitra S Malagi, Shivam Chandhok, Vineeth N Balasubramanian

    Abstract: Recent progress towards learning from limited supervision has encouraged efforts towards designing models that can recognize novel classes at test time (generalized zero-shot learning or GZSL). GZSL approaches assume knowledge of all classes, with or without labeled data, beforehand. However, practical scenarios demand models that are adaptable and can handle dynamic addition of new seen and unsee… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: To appear in CVPR 2022. Code is available @ (https://github.com/sumitramalagi/Unseen-classes-at-a-later-time)

  25. arXiv:2203.14952  [pdf, other

    cs.CV cs.AI cs.LG

    Energy-based Latent Aligner for Incremental Learning

    Authors: K J Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, Vineeth N Balasubramanian

    Abstract: Deep learning models tend to forget their earlier knowledge while incrementally learning new tasks. This behavior emerges because the parameter updates optimized for the new tasks may not align well with the updates suitable for older tasks. The resulting latent representation mismatch causes forgetting. In this work, we propose ELI: Energy-based Latent Aligner for Incremental Learning, which firs… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: To appear in CVPR 2022. Code is available in https://github.com/JosephKJ/ELI

  26. arXiv:2112.05746  [pdf, other

    cs.LG stat.ML

    On Causally Disentangled Representations

    Authors: Abbavaram Gowtham Reddy, Benin Godfrey L, Vineeth N Balasubramanian

    Abstract: Representation learners that disentangle factors of variation have already proven to be important in addressing various real world concerns such as fairness and interpretability. Initially consisting of unsupervised models with independence assumptions, more recently, weak supervision and correlated features have been explored, but without a causal view of the generative process. In contrast, we w… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: https://causal-disentanglement.github.io/IITH-CANDLE/ ; Accepted at the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)

  27. arXiv:2111.12490  [pdf, other

    cs.LG cs.AI

    Matching Learned Causal Effects of Neural Networks with Domain Priors

    Authors: Sai Srinivas Kancheti, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian, Amit Sharma

    Abstract: A trained neural network can be interpreted as a structural causal model (SCM) that provides the effect of changing input variables on the model's output. However, if training data contains both causal and correlational relationships, a model that optimizes prediction accuracy may not necessarily learn the true causal relationships between input and output variables. On the other hand, expert user… ▽ More

    Submitted 29 June, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted at International Conference on Machine Learning (ICML'22)

  28. arXiv:2111.05956  [pdf, other

    cs.CV cs.LG

    Feature Generation for Long-tail Classification

    Authors: Rahul Vigneswaran, Marc T. Law, Vineeth N. Balasubramanian, Makarand Tapaswi

    Abstract: The visual world naturally exhibits an imbalance in the number of object or scene instances resulting in a \emph{long-tailed distribution}. This imbalance poses significant challenges for classification models based on deep learning. Oversampling instances of the tail classes attempts to solve this imbalance. However, the limited visual diversity results in a network with poor representation abili… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: Accepted at ICVGIP'21. Code available at https://github.com/rahulvigneswaran/TailCalibX

  29. arXiv:2111.00295  [pdf, other

    cs.LG cs.CR cs.CV

    Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided Curriculum Learning Approach

    Authors: Anindya Sarkar, Anirban Sarkar, Sowrya Gali, Vineeth N Balasubramanian

    Abstract: Current SOTA adversarially robust models are mostly based on adversarial training (AT) and differ only by some regularizers either at inner maximization or outer minimization steps. Being repetitive in nature during the inner maximization step, they take a huge time to train. We propose a non-iterative method that enforces the following ideas during training. Attribution maps are more aligned to t… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: 16 pages, 9 figures, Accepted at NeurIPS 2021, Code at https://github.com/sowgali/Get-Fooled-for-the-Right-Reason

  30. arXiv:2110.12205  [pdf, other

    cs.CV

    Multi-Domain Incremental Learning for Semantic Segmentation

    Authors: Prachi Garg, Rohit Saluja, Vineeth N Balasubramanian, Chetan Arora, Anbumani Subramanian, C. V. Jawahar

    Abstract: Recent efforts in multi-domain learning for semantic segmentation attempt to learn multiple geographical datasets in a universal, joint model. A simple fine-tuning experiment performed sequentially on three popular road scene segmentation datasets demonstrates that existing segmentation frameworks fail at incrementally learning on a series of visually disparate geographical domains. When learning… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: 11 pages, 5 figures, Accepted in WACV 2022

  31. arXiv:2108.11761  [pdf, other

    cs.LG cs.CV

    A Framework for Learning Ante-hoc Explainable Models via Concepts

    Authors: Anirban Sarkar, Deepak Vijaykeerthy, Anindya Sarkar, Vineeth N Balasubramanian

    Abstract: Self-explaining deep models are designed to learn the latent concept-based explanations implicitly during training, which eliminates the requirement of any post-hoc explanation generation technique. In this work, we propose one such model that appends an explanation generation module on top of any basic network and jointly trains the whole module that shows high predictive performance and generate… ▽ More

    Submitted 30 November, 2021; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: 16 pages, 15 figures

  32. arXiv:2107.07497  [pdf, other

    cs.CV

    Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains

    Authors: Puneet Mangla, Shivam Chandhok, Vineeth N Balasubramanian, Fahad Shahbaz Khan

    Abstract: Recent progress towards designing models that can generalize to unseen domains (i.e domain generalization) or unseen classes (i.e zero-shot learning) has embarked interest towards building models that can tackle both domain-shift and semantic shift simultaneously (i.e zero-shot domain generalization). For models to generalize to unseen classes in unseen domains, it is crucial to learn feature repr… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

  33. arXiv:2107.05622  [pdf, other

    cs.CV

    Structured Latent Embeddings for Recognizing Unseen Classes in Unseen Domains

    Authors: Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N Balasubramanian, Fahad Shahbaz Khan, Ling Shao

    Abstract: The need to address the scarcity of task-specific annotated data has resulted in concerted efforts in recent years for specific settings such as zero-shot learning (ZSL) and domain generalization (DG), to separately address the issues of semantic shift and domain shift, respectively. However, real-world applications often do not have constrained settings and necessitate handling unseen classes in… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  34. arXiv:2107.04952  [pdf, other

    cs.CV cs.AI cs.LG

    Learn from Anywhere: Rethinking Generalized Zero-Shot Learning with Limited Supervision

    Authors: Gaurav Bhatt, Shivam Chandhok, Vineeth N Balasubramanian

    Abstract: A common problem with most zero and few-shot learning approaches is they suffer from bias towards seen classes resulting in sub-optimal performance. Existing efforts aim to utilize unlabeled images from unseen classes (i.e transductive zero-shot) during training to enable generalization. However, this limits their use in practical scenarios where data from target unseen classes is unavailable or i… ▽ More

    Submitted 13 July, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: Accepted at IJCAI'21 workshop on Weakly Supervised Representation Learning

  35. arXiv:2105.01386  [pdf, other

    cs.CV cs.LG

    Canonical Saliency Maps: Decoding Deep Face Models

    Authors: Thrupthi Ann John, Vineeth N Balasubramanian, C V Jawahar

    Abstract: As Deep Neural Network models for face processing tasks approach human-like performance, their deployment in critical applications such as law enforcement and access control has seen an upswing, where any failure may have far-reaching consequences. We need methods to build trust in deployed systems by making their working as transparent as possible. Existing visualization algorithms are designed f… ▽ More

    Submitted 16 August, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: Under review. Added three new experiments, cleaned up some figures and equations

    ACM Class: I.4

  36. arXiv:2104.12759  [pdf, other

    cs.LG cs.AI

    Instance-wise Causal Feature Selection for Model Interpretation

    Authors: Pranoy Panda, Sai Srinivas Kancheti, Vineeth N Balasubramanian

    Abstract: We formulate a causal extension to the recently introduced paradigm of instance-wise feature selection to explain black-box visual classifiers. Our method selects a subset of input features that has the greatest causal effect on the models output. We quantify the causal influence of a subset of features by the Relative Entropy Distance measure. Under certain assumptions this is equivalent to the c… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: 6 pages, 5 figures. Accepted at the Causality in Vision workshop, CVPR 2021

  37. arXiv:2104.09073  [pdf, other

    cs.LG

    Improving Attribution Methods by Learning Submodular Functions

    Authors: Piyushi Manupriya, Tarun Ram Menta, J. Saketha Nath, Vineeth N Balasubramanian

    Abstract: This work explores the novel idea of learning a submodular scoring function to improve the specificity/selectivity of existing feature attribution methods. Submodular scores are natural for attribution as they are known to accurately model the principle of diminishing returns. A new formulation for learning a deep submodular set function that is consistent with the real-valued attribution maps obt… ▽ More

    Submitted 21 February, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

  38. arXiv:2103.02603  [pdf, other

    cs.CV cs.AI cs.LG

    Towards Open World Object Detection

    Authors: K J Joseph, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian

    Abstract: Humans have a natural instinct to identify unknown object instances in their environments. The intrinsic curiosity about these unknown instances aids in learning about them, when the corresponding knowledge is eventually available. This motivates us to propose a novel computer vision problem called: `Open World Object Detection', where a model is tasked to: 1) identify objects that have not been i… ▽ More

    Submitted 9 May, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: To appear in CVPR 2021 as an ORAL paper. Code is available in https://github.com/JosephKJ/OWOD

  39. arXiv:2012.14395  [pdf, other

    cs.CV cs.AI

    Enhanced Regularizers for Attributional Robustness

    Authors: Anindya Sarkar, Anirban Sarkar, Vineeth N Balasubramanian

    Abstract: Deep neural networks are the default choice of learning models for computer vision tasks. Extensive work has been carried out in recent years on explaining deep models for vision tasks such as classification. However, recent work has shown that it is possible for these models to produce substantially different attribution maps even when two very similar images are given to the network, raising ser… ▽ More

    Submitted 13 August, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: 15 pages, 18 figures, Accepted at AAAI 2021. Code at https://github.com/tataiani/Enhanced_regularizers_attributional_robustness

  40. arXiv:2012.04256  [pdf, other

    cs.CV

    Data InStance Prior (DISP) in Generative Adversarial Networks

    Authors: Puneet Mangla, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy, Vineeth N Balasubramanian

    Abstract: Recent advances in generative adversarial networks (GANs) have shown remarkable progress in generating high-quality images. However, this gain in performance depends on the availability of a large amount of training data. In limited data regimes, training typically diverges, and therefore the generated samples are of low quality and lack diversity. Previous works have addressed training in low dat… ▽ More

    Submitted 21 September, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted at WACV 2022

  41. arXiv:2012.03801  [pdf, other

    cs.LG

    A Deeper Look at the Hessian Eigenspectrum of Deep Neural Networks and its Applications to Regularization

    Authors: Adepu Ravi Sankar, Yash Khasbage, Rahul Vigneswaran, Vineeth N Balasubramanian

    Abstract: Loss landscape analysis is extremely useful for a deeper understanding of the generalization ability of deep neural network models. In this work, we propose a layerwise loss landscape analysis where the loss surface at every layer is studied independently and also on how each correlates to the overall loss surface. We study the layerwise loss landscape by studying the eigenspectra of the Hessian a… ▽ More

    Submitted 7 December, 2020; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Accepted at AAAI 2021

  42. arXiv:2011.14696  [pdf, other

    cs.LG cs.CV

    On Initial Pools for Deep Active Learning

    Authors: Akshay L Chandra, Sai Vikas Desai, Chaitanya Devaguptapu, Vineeth N Balasubramanian

    Abstract: Active Learning (AL) techniques aim to minimize the training data required to train a model for a given task. Pool-based AL techniques start with a small initial labeled pool and then iteratively pick batches of the most informative samples for labeling. Generally, the initial pool is sampled randomly and labeled to seed the AL iterations. While recent studies have focused on evaluating the robust… ▽ More

    Submitted 14 July, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: Accepted at NeurIPS 2020 Preregistration Workshop and included in PMLR v148. 19 pages, 9 figures

    Journal ref: Proceedings of Machine Learning Research. 148 (2021) 14-32

  43. arXiv:2010.12852  [pdf, other

    cs.CV cs.AI

    Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions

    Authors: Radhika Dua, Sai Srinivas Kancheti, Vineeth N Balasubramanian

    Abstract: Visual Question Answering is a multi-modal task that aims to measure high-level visual understanding. Contemporary VQA models are restrictive in the sense that answers are obtained via classification over a limited vocabulary (in the case of open-ended VQA), or via classification over a set of multiple-choice-type answers. In this work, we present a completely generative formulation where a multi-… ▽ More

    Submitted 17 June, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: MULA Workshop, CVPR 2021

  44. arXiv:2010.00352  [pdf, other

    cs.CV cs.AI

    Meta-Consolidation for Continual Learning

    Authors: K J Joseph, Vineeth N Balasubramanian

    Abstract: The ability to continuously learn and adapt itself to new tasks, without losing grasp of already acquired knowledge is a hallmark of biological learning systems, which current deep learning systems fall short of. In this work, we present a novel methodology for continual learning called MERLIN: Meta-Consolidation for Continual Learning. We assume that weights of a neural network $\boldsymbol ψ$,… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS 2020

  45. arXiv:2009.08270  [pdf, other

    cs.CV cs.LG

    Evaluating and Mitigating Bias in Image Classifiers: A Causal Perspective Using Counterfactuals

    Authors: Saloni Dash, Vineeth N Balasubramanian, Amit Sharma

    Abstract: Counterfactual examples for an input -- perturbations that change specific features but not others -- have been shown to be useful for evaluating bias of machine learning models, e.g., against specific demographic groups. However, generating counterfactual examples for images is non-trivial due to the underlying causal structure on the various features of an image. To be meaningful, generated pert… ▽ More

    Submitted 6 January, 2022; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted for Publication at WACV 2022

  46. arXiv:2008.07443  [pdf, other

    cs.CV cs.LG

    Zero Shot Domain Generalization

    Authors: Udit Maniyar, Joseph K J, Aniket Anand Deshmukh, Urun Dogan, Vineeth N Balasubramanian

    Abstract: Standard supervised learning setting assumes that training data and test data come from the same distribution (domain). Domain generalization (DG) methods try to learn a model that when trained on data from multiple domains, would generalize to a new unseen domain. We extend DG to an even more challenging setting, where the label space of the unseen domain could also change. We introduce this prob… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted to BMVC 2020

  47. arXiv:2008.03555  [pdf, other

    cs.CV

    Assisting Scene Graph Generation with Self-Supervision

    Authors: Sandeep Inuganti, Vineeth N Balasubramanian

    Abstract: Research in scene graph generation has quickly gained traction in the past few years because of its potential to help in downstream tasks like visual question answering, image captioning, etc. Many interesting approaches have been proposed to tackle this problem. Most of these works have a pre-trained object detection model as a preliminary feature extractor. Therefore, getting object bounding box… ▽ More

    Submitted 8 August, 2020; originally announced August 2020.

  48. arXiv:2007.08428  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    On Adversarial Robustness: A Neural Architecture Search perspective

    Authors: Chaitanya Devaguptapu, Devansh Agarwal, Gaurav Mittal, Pulkit Gopalani, Vineeth N Balasubramanian

    Abstract: Adversarial robustness of deep learning models has gained much traction in the last few years. Various attacks and defenses are proposed to improve the adversarial robustness of modern-day deep learning architectures. While all these approaches help improve the robustness, one promising direction for improving adversarial robustness is unexplored, i.e., the complex topology of the neural network a… ▽ More

    Submitted 26 August, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted at the Workshop on Adversarial Robustness in Real-World, ICCV-2021 (previous version accepted at four ICLR-21 Workshops)

  49. arXiv:2007.07757  [pdf, other

    cs.CV cs.LG

    Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot Learning

    Authors: Shivam Chandhok, Vineeth N Balasubramanian

    Abstract: The performance of generative zero-shot methods mainly depends on the quality of generated features and how well the model facilitates knowledge transfer between visual and semantic domains. The quality of generated features is a direct consequence of the ability of the model to capture the several modes of the underlying data distribution. To address these issues, we propose a new two-level joint… ▽ More

    Submitted 30 November, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: Under Submission

  50. Computer Vision with Deep Learning for Plant Phenoty** in Agriculture: A Survey

    Authors: Akshay L Chandra, Sai Vikas Desai, Wei Guo, Vineeth N Balasubramanian

    Abstract: In light of growing challenges in agriculture with ever growing food demand across the world, efficient crop management techniques are necessary to increase crop yield. Precision agriculture techniques allow the stakeholders to make effective and customized crop management decisions based on data gathered from monitoring crop environments. Plant phenoty** techniques play a major role in accurate… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: Featured as an article at Journal of Advanced Computing and Communications, April 2020. arXiv admin note: text overlap with arXiv:1805.00881 by other authors