Skip to main content

Showing 1–1 of 1 results for author: Lin, W S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2112.09260  [pdf, other

    cs.CV

    How to augment your ViTs? Consistency loss and StyleAug, a random style transfer augmentation

    Authors: Akash Umakantha, Joao D. Semedo, S. Alireza Golestaneh, Wan-Yi S. Lin

    Abstract: The Vision Transformer (ViT) architecture has recently achieved competitive performance across a variety of computer vision tasks. One of the motivations behind ViTs is weaker inductive biases, when compared to convolutional neural networks (CNNs). However this also makes ViTs more difficult to train. They require very large training datasets, heavy regularization, and strong data augmentations. T… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.