Skip to main content

Showing 1–6 of 6 results for author: Stegmüller, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16085  [pdf, other

    cs.CV

    A Simple Framework for Open-Vocabulary Zero-Shot Segmentation

    Authors: Thomas Stegmüller, Tim Lebailly, Nikola Dukic, Behzad Bozorgtabar, Tinne Tuytelaars, Jean-Philippe Thiran

    Abstract: Zero-shot classification capabilities naturally arise in models trained within a vision-language contrastive framework. Despite their classification prowess, these models struggle in dense tasks like zero-shot open-vocabulary segmentation. This deficiency is often attributed to the absence of localization cues in captions and the intertwined nature of the learning process, which encompasses both i… ▽ More

    Submitted 1 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2310.07855  [pdf, other

    cs.CV cs.LG

    CrIBo: Self-Supervised Learning via Cross-Image Object-Level Bootstrap**

    Authors: Tim Lebailly, Thomas Stegmüller, Behzad Bozorgtabar, Jean-Philippe Thiran, Tinne Tuytelaars

    Abstract: Leveraging nearest neighbor retrieval for self-supervised representation learning has proven beneficial with object-centric images. However, this approach faces limitations when applied to scene-centric datasets, where multiple objects within an image are only implicitly captured in the global representation. Such global bootstrap** can lead to undesirable entanglement of object representations.… ▽ More

    Submitted 3 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 (spotlight)

  3. arXiv:2303.13606  [pdf, other

    cs.CV

    Adaptive Similarity Bootstrap** for Self-Distillation based Representation Learning

    Authors: Tim Lebailly, Thomas Stegmüller, Behzad Bozorgtabar, Jean-Philippe Thiran, Tinne Tuytelaars

    Abstract: Most self-supervised methods for representation learning leverage a cross-view consistency objective i.e., they maximize the representation similarity of a given image's augmented views. Recent work NNCLR goes beyond the cross-view paradigm and uses positive pairs from different images obtained via nearest neighbor bootstrap** in a contrastive setting. We empirically show that as opposed to the… ▽ More

    Submitted 7 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: ICCV 2023. * denotes equal contribution

  4. arXiv:2303.13245  [pdf, other

    cs.CV

    CrOC: Cross-View Online Clustering for Dense Visual Representation Learning

    Authors: Thomas Stegmüller, Tim Lebailly, Behzad Bozorgtabar, Tinne Tuytelaars, Jean-Philippe Thiran

    Abstract: Learning dense visual representations without labels is an arduous task and more so from scene-centric data. We propose to tackle this challenging problem by proposing a Cross-view consistency objective with an Online Clustering mechanism (CrOC) to discover and segment the semantics of the views. In the absence of hand-crafted priors, the resulting method is more generalizable and does not require… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023, * denotes equal contribution

  5. arXiv:2302.05195  [pdf, other

    eess.IV cs.CV

    Self-supervised learning-based cervical cytology for the triage of HPV-positive women in resource-limited settings and low-data regime

    Authors: Thomas Stegmüller, Christian Abbet, Behzad Bozorgtabar, Holly Clarke, Patrick Petignat, Pierre Vassilakos, Jean-Philippe Thiran

    Abstract: Screening Papanicolaou test samples has proven to be highly effective in reducing cervical cancer-related mortality. However, the lack of trained cytopathologists hinders its widespread implementation in low-resource settings. Deep learning-based telecytology diagnosis emerges as an appealing alternative, but it requires the collection of large annotated training datasets, which is costly and time… ▽ More

    Submitted 7 June, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

  6. arXiv:2202.07570  [pdf, other

    cs.CV

    ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Classification

    Authors: Thomas Stegmüller, Behzad Bozorgtabar, Antoine Spahr, Jean-Philippe Thiran

    Abstract: Progress in digital pathology is hindered by high-resolution images and the prohibitive cost of exhaustive localized annotations. The commonly used paradigm to categorize pathology images is patch-based processing, which often incorporates multiple instance learning (MIL) to aggregate local patch-level representations yielding image-level prediction. Nonetheless, diagnostically relevant regions ma… ▽ More

    Submitted 18 July, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: 19 pages, 7 figures