Skip to main content

Showing 1–8 of 8 results for author: Kalibhat, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16401  [pdf, other

    cs.CV cs.LG

    Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning

    Authors: Neha Kalibhat, Priyatham Kattakinda, Arman Zarei, Nikita Seleznev, Samuel Sharpe, Senthil Kumar, Soheil Feizi

    Abstract: Vision transformers have established a precedent of patchifying images into uniformly-sized chunks before processing. We hypothesize that this design choice may limit models in learning comprehensive and compositional representations from visual data. This paper explores the notion of providing semantically-meaningful visual tokens to transformer encoders within a vision-language pre-training fram… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2403.05726  [pdf, other

    cs.LG cs.CV

    Augmentations vs Algorithms: What Works in Self-Supervised Learning

    Authors: Warren Morningstar, Alex Bijamov, Chris Duvarney, Luke Friedman, Neha Kalibhat, Luyang Liu, Philip Mansfield, Renan Rojas-Gomez, Karan Singhal, Bradley Green, Sushant Prakash

    Abstract: We study the relative effects of data augmentations, pretraining algorithms, and model architectures in Self-Supervised Learning (SSL). While the recent literature in this space leaves the impression that the pretraining algorithm is of critical importance to performance, understanding its effect is complicated by the difficulty in making objective and direct comparisons between methods. We propos… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 18 pages, 1 figure

  3. arXiv:2312.02205  [pdf, other

    cs.CV cs.LG

    Disentangling the Effects of Data Augmentation and Format Transform in Self-Supervised Learning of Image Representations

    Authors: Neha Kalibhat, Warren Morningstar, Alex Bijamov, Luyang Liu, Karan Singhal, Philip Mansfield

    Abstract: Self-Supervised Learning (SSL) enables training performant models using limited labeled data. One of the pillars underlying vision SSL is the use of data augmentations/perturbations of the input which do not significantly alter its semantic content. For audio and other temporal signals, augmentations are commonly used alongside format transforms such as Fourier transforms or wavelet transforms. Un… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  4. arXiv:2309.03999  [pdf, other

    cs.CV cs.LG

    Adapting Self-Supervised Representations to Multi-Domain Setups

    Authors: Neha Kalibhat, Sam Sharpe, Jeremy Goodsitt, Bayan Bruss, Soheil Feizi

    Abstract: Current state-of-the-art self-supervised approaches, are effective when trained on individual domains but show limited generalization on unseen domains. We observe that these models poorly generalize even when trained on a mixture of domains, making them unsuitable to be deployed under diverse real-world setups. We therefore propose a general-purpose, lightweight Domain Disentanglement Module (DDM… ▽ More

    Submitted 12 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Published at BMVC 2023

  5. arXiv:2307.10504  [pdf, other

    cs.CV cs.LG

    Identifying Interpretable Subspaces in Image Representations

    Authors: Neha Kalibhat, Shweta Bhardwaj, Bayan Bruss, Hamed Firooz, Maziar Sanjabi, Soheil Feizi

    Abstract: We propose Automatic Feature Explanation using Contrasting Concepts (FALCON), an interpretability framework to explain features of image representations. For a target feature, FALCON captions its highly activating cropped images using a large captioning dataset (like LAION-400m) and a pre-trained vision-language model like CLIP. Each word among the captions is scored and ranked leading to a small… ▽ More

    Submitted 7 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Published at ICML 2023 Code: https://github.com/NehaKalibhat/falcon-explain

  6. arXiv:2203.01881  [pdf, other

    cs.LG cs.AI cs.CV

    Measuring Self-Supervised Representation Quality for Downstream Classification using Discriminative Features

    Authors: Neha Kalibhat, Kanika Narang, Hamed Firooz, Maziar Sanjabi, Soheil Feizi

    Abstract: Self-supervised learning (SSL) has shown impressive results in downstream classification tasks. However, there is limited work in understanding their failure modes and interpreting their learned representations. In this paper, we study the representation space of state-of-the-art self-supervised models including SimCLR, SwaV, MoCo, BYOL, DINO, SimSiam, VICReg and Barlow Twins. Without the use of c… ▽ More

    Submitted 12 December, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Published at AAAI 2024

  7. arXiv:2104.05605  [pdf, other

    cs.LG stat.ML

    Understanding Overparameterization in Generative Adversarial Networks

    Authors: Yogesh Balaji, Mohammadmahdi Sajedi, Neha Mukund Kalibhat, Mucong Ding, Dominik Stöger, Mahdi Soltanolkotabi, Soheil Feizi

    Abstract: A broad class of unsupervised deep learning methods such as Generative Adversarial Networks (GANs) involve training of overparameterized models where the number of parameters of the model exceeds a certain threshold. A large body of work in supervised learning have shown the importance of model overparameterization in the convergence of the gradient descent (GD) to globally optimal solutions. In c… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted in ICLR 2021

  8. arXiv:2010.02350  [pdf, other

    cs.LG cs.CV

    Winning Lottery Tickets in Deep Generative Models

    Authors: Neha Mukund Kalibhat, Yogesh Balaji, Soheil Feizi

    Abstract: The lottery ticket hypothesis suggests that sparse, sub-networks of a given neural network, if initialized properly, can be trained to reach comparable or even better performance to that of the original network. Prior works in lottery tickets have primarily focused on the supervised learning setup, with several papers proposing effective ways of finding "winning tickets" in classification problems… ▽ More

    Submitted 29 January, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Published at AAAI 2021