Showing 1–5 of 5 results for author: Saranrittichai, P

  1. arXiv:2309.16414  [pdf, other]

    cs.CV cs.AI cs.LG

    AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models

    Authors: Jan Hendrik Metzen, Piyapat Saranrittichai, Chaithanya Kumar Mummadi

    Abstract: Classifiers built upon vision-language models such as CLIP have shown remarkable zero-shot performance across a broad range of image classification tasks. Prior work has studied different ways of automatically creating descriptor sets for every class based on prompt templates, ranging from manually engineered templates over templates obtained from a large language model to templates built from ran…

    Submitted 29 September, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

  2. arXiv:2309.06581  [pdf, other]

    cs.CV

    Zero-Shot Visual Classification with Guided Cropping

    Authors: Piyapat Saranrittichai, Mauricio Munoz, Volker Fischer, Chaithanya Kumar Mummadi

    Abstract: Pretrained vision-language models, such as CLIP, show promising zero-shot performance across a wide variety of datasets. For closed-set classification tasks, however, there is an inherent limitation: CLIP image encoders are typically designed to extract generic image-level features that summarize superfluous or confounding information for the target tasks. This results in degradation of classifica…

    Submitted 12 September, 2023; originally announced September 2023.

  3. arXiv:2208.06809  [pdf, other]

    cs.CV

    Multi-Attribute Open Set Recognition

    Authors: Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Claudia Blaiotta, Mauricio Munoz, Volker Fischer

    Abstract: Open Set Recognition (OSR) extends image classification to an open-world setting, by simultaneously classifying known classes and identifying unknown ones. While conventional OSR approaches can detect Out-of-Distribution (OOD) samples, they cannot provide explanations indicating which underlying visual attribute(s) (e.g., shape, color or background) cause a specific sample to be unknown. In this w…

    Submitted 14 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at German Conference for Pattern Recognition (GCPR) 2022

  4. arXiv:2207.10002  [pdf, other]

    cs.CV cs.AI

    Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain

    Authors: Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Claudia Blaiotta, Mauricio Munoz, Volker Fischer

    Abstract: Shortcut learning occurs when a deep neural network overly relies on spurious correlations in the training dataset in order to solve downstream tasks. Prior works have shown how this impairs the compositional generalization capability of deep learning models. To address this problem, we propose a novel approach to mitigate shortcut learning in uncontrolled target domains. Our approach extends the…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted for publication at European Conference on Computer Vision (ECCV) 2022

  5. arXiv:2108.05779  [pdf, other]

    cs.CV

    DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities

    Authors: Elias Eulig, Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Kilian Rambach, William Beluch, Xiahan Shi, Volker Fischer

    Abstract: Common deep neural networks (DNNs) for image classification have been shown to rely on shortcut opportunities (SO) in the form of predictive and easy-to-represent visual factors. This is known as shortcut learning and leads to impaired generalization. In this work, we show that common DNNs also suffer from shortcut learning when predicting only basic visual object factors of variation (FoV) such a…

    Submitted 8 October, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at IEEE International Conference on Computer Vision (ICCV) 2021; updated affiliations & corrected typo

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 10655-10664