Showing 1–1 of 1 results for author: Kasera, J

  1. arXiv:2306.17848  [pdf, other]

    cs.CV

    Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

    Authors: Ariel N. Lee, Sarah Adel Bargal, Janavi Kasera, Stan Sclaroff, Kate Saenko, Nataniel Ruiz

    Abstract: Vision transformers (ViTs) have significantly changed the computer vision landscape and have periodically exhibited superior performance in vision tasks compared to convolutional neural networks (CNNs). Although the jury is still out on which model type is superior, each has unique inductive biases that shape their learning and generalization performance. For example, ViTs have interesting propert…

    Submitted 30 June, 2023; originally announced June 2023.