Skip to main content

Showing 1–10 of 10 results for author: Mukhoti, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.13320  [pdf, other

    cs.LG cs.CV

    Fine-tuning can cripple your foundation model; preserving features may be the solution

    Authors: Jishnu Mukhoti, Yarin Gal, Philip H. S. Torr, Puneet K. Dokania

    Abstract: Pre-trained foundation models, due to their enormous capacity and exposure to vast amounts of data during pre-training, are known to have learned plenty of real-world concepts. An important step in making these pre-trained models effective on downstream tasks is to fine-tune them on related datasets. While various fine-tuning methods have been devised and have been shown to be highly effective, we… ▽ More

    Submitted 1 July, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Published in TMLR: https://openreview.net/forum?id=kfhoeZCeW7

  2. arXiv:2212.04994  [pdf, other

    cs.CV

    Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

    Authors: Jishnu Mukhoti, Tsung-Yu Lin, Omid Poursaeed, Rui Wang, Ashish Shah, Philip H. S. Torr, Ser-Nam Lim

    Abstract: We introduce Patch Aligned Contrastive Learning (PACL), a modified compatibility function for CLIP's contrastive loss, intending to train an alignment between the patch tokens of the vision encoder and the CLS token of the text encoder. With such an alignment, a model can identify regions of an image corresponding to a given text input, and therefore transfer seamlessly to the task of open vocabul… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

  3. arXiv:2209.11960  [pdf, other

    cs.CV cs.LG

    Raising the Bar on the Evaluation of Out-of-Distribution Detection

    Authors: Jishnu Mukhoti, Tsung-Yu Lin, Bor-Chun Chen, Ashish Shah, Philip H. S. Torr, Puneet K. Dokania, Ser-Nam Lim

    Abstract: In image classification, a lot of development has happened in detecting out-of-distribution (OoD) data. However, most OoD detection methods are evaluated on a standard set of datasets, arbitrarily different from training data. There is no clear definition of what forms a ``good" OoD dataset. Furthermore, the state-of-the-art OoD detection methods already achieve near perfect results on these stand… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

  4. arXiv:2111.00079  [pdf, other

    cs.CV cs.LG

    Deep Deterministic Uncertainty for Semantic Segmentation

    Authors: Jishnu Mukhoti, Joost van Amersfoort, Philip H. S. Torr, Yarin Gal

    Abstract: We extend Deep Deterministic Uncertainty (DDU), a method for uncertainty estimation using feature space densities, to semantic segmentation. DDU enables quantifying and disentangling epistemic and aleatoric uncertainty in a single forward pass through the model. We study the similarity of feature representations of pixels at different locations for the same class and conclude that it is feasible t… ▽ More

    Submitted 29 October, 2021; originally announced November 2021.

  5. arXiv:2102.11582  [pdf, other

    cs.LG stat.ML

    Deep Deterministic Uncertainty: A Simple Baseline

    Authors: Jishnu Mukhoti, Andreas Kirsch, Joost van Amersfoort, Philip H. S. Torr, Yarin Gal

    Abstract: Reliable uncertainty from deterministic single-forward pass models is sought after because conventional methods of uncertainty quantification are computationally expensive. We take two complex single-forward-pass uncertainty approaches, DUQ and SNGP, and examine whether they mainly rely on a well-regularized feature space. Crucially, without using their more complex methods for estimating uncertai… ▽ More

    Submitted 28 January, 2022; v1 submitted 23 February, 2021; originally announced February 2021.

  6. arXiv:2012.13220  [pdf, other

    cs.LG stat.ML

    On Batch Normalisation for Approximate Bayesian Inference

    Authors: Jishnu Mukhoti, Puneet K. Dokania, Philip H. S. Torr, Yarin Gal

    Abstract: We study batch normalisation in the context of variational inference methods in Bayesian neural networks, such as mean-field or MC Dropout. We show that batch-normalisation does not affect the optimum of the evidence lower bound (ELBO). Furthermore, we study the Monte Carlo Batch Normalisation (MCBN) algorithm, proposed as an approximate inference technique parallel to MC Dropout, and show that fo… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

  7. arXiv:2002.09437  [pdf, other

    cs.LG cs.CV stat.ML

    Calibrating Deep Neural Networks using Focal Loss

    Authors: Jishnu Mukhoti, Viveka Kulharia, Amartya Sanyal, Stuart Golodetz, Philip H. S. Torr, Puneet K. Dokania

    Abstract: Miscalibration - a mismatch between a model's confidence and its correctness - of Deep Neural Networks (DNNs) makes their predictions hard to rely on. Ideally, we want networks to be accurate, calibrated and confident. We show that, as opposed to the standard cross-entropy loss, focal loss [Lin et. al., 2017] allows us to learn models that are already very well calibrated. When combined with tempe… ▽ More

    Submitted 26 October, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: This paper was accepted at NeurIPS 2020

  8. arXiv:1906.08744  [pdf, other

    cs.CV cs.LG cs.RO

    Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation

    Authors: Tommaso Cavallari, Luca Bertinetto, Jishnu Mukhoti, Philip Torr, Stuart Golodetz

    Abstract: Many applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regions. By contrast, scene coordinate regression (SCoRe) methods generalise to novel poses and can lever… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: Tommaso Cavallari and Stuart Golodetz contributed equally to this paper

  9. arXiv:1811.12709  [pdf, other

    cs.CV

    Evaluating Bayesian Deep Learning Methods for Semantic Segmentation

    Authors: Jishnu Mukhoti, Yarin Gal

    Abstract: Deep learning has been revolutionary for computer vision and semantic segmentation in particular, with Bayesian Deep Learning (BDL) used to obtain uncertainty maps from deep models when predicting semantic classes. This information is critical when using semantic segmentation for autonomous driving for example. Standard semantic segmentation systems have well-established evaluation metrics. Howeve… ▽ More

    Submitted 23 March, 2019; v1 submitted 30 November, 2018; originally announced November 2018.

    Comments: Updated baselines and numbers on concrete dropout

  10. arXiv:1811.09385  [pdf, ps, other

    cs.LG stat.ML

    On the Importance of Strong Baselines in Bayesian Deep Learning

    Authors: Jishnu Mukhoti, Pontus Stenetorp, Yarin Gal

    Abstract: Like all sub-fields of machine learning Bayesian Deep Learning is driven by empirical validation of its theoretical proposals. Given the many aspects of an experiment it is always possible that minor or even major experimental flaws can slip by both authors and reviewers. One of the most popular experiments used to evaluate approximate inference techniques is the regression experiment on UCI datas… ▽ More

    Submitted 30 November, 2018; v1 submitted 23 November, 2018; originally announced November 2018.

    Comments: Bayesian Deep Learning Workshop, NeurIPS 2018