Skip to main content

Showing 1–2 of 2 results for author: Golbert, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.00287  [pdf, other

    cs.CV

    Vision Transformers with Mixed-Resolution Tokenization

    Authors: Tomer Ronen, Omer Levy, Avram Golbert

    Abstract: Vision Transformer models process input images by dividing them into a spatially regular grid of equal-size patches. Conversely, Transformers were originally introduced over natural language sequences, where each token represents a subword - a chunk of raw data of arbitrary size. In this work, we apply this approach to Vision Transformers by introducing a novel image tokenization scheme, replacing… ▽ More

    Submitted 27 April, 2023; v1 submitted 1 April, 2023; originally announced April 2023.

  2. arXiv:2204.09134  [pdf, other

    cs.CV cs.LG

    Diverse Imagenet Models Transfer Better

    Authors: Niv Nayman, Avram Golbert, Asaf Noy, Tan **, Lihi Zelnik-Manor

    Abstract: A commonly accepted hypothesis is that models with higher accuracy on Imagenet perform better on other downstream tasks, leading to much research dedicated to optimizing Imagenet accuracy. Recently this hypothesis has been challenged by evidence showing that self-supervised models transfer better than their supervised counterparts, despite their inferior Imagenet accuracy. This calls for identifyi… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    MSC Class: 68T07; 68T10; 68T45 ACM Class: I.2.10; I.2.6; I.4.10