Showing 1–6 of 6 results for author: Rojas-Gomez, R

  1. arXiv:2403.05726  [pdf, other]

    cs.LG cs.CV

    Augmentations vs Algorithms: What Works in Self-Supervised Learning

    Authors: Warren Morningstar, Alex Bijamov, Chris Duvarney, Luke Friedman, Neha Kalibhat, Luyang Liu, Philip Mansfield, Renan Rojas-Gomez, Karan Singhal, Bradley Green, Sushant Prakash

    Abstract: We study the relative effects of data augmentations, pretraining algorithms, and model architectures in Self-Supervised Learning (SSL). While the recent literature in this space leaves the impression that the pretraining algorithm is of critical importance to performance, understanding its effect is complicated by the difficulty in making objective and direct comparisons between methods. We propos…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 18 pages, 1 figure

  2. arXiv:2312.01187  [pdf, other]

    cs.CV cs.LG stat.ML

    SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer

    Authors: Renan A. Rojas-Gomez, Karan Singhal, Ali Etemad, Alex Bijamov, Warren R. Morningstar, Philip Andrew Mansfield

    Abstract: Existing data augmentation in self-supervised learning, while diverse, fails to preserve the inherent structure of natural images. This results in distorted augmented samples with compromised semantic information, ultimately impacting downstream performance. To overcome this, we propose SASSL: Style Augmentations for Self Supervised Learning, a novel augmentation technique based on Neural Style Tr…

    Submitted 3 February, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  3. arXiv:2305.16316  [pdf, other]

    cs.CV

    Making Vision Transformers Truly Shift-Equivariant

    Authors: Renan A. Rojas-Gomez, Teck-Yian Lim, Minh N. Do, Raymond A. Yeh

    Abstract: For computer vision, Vision Transformers (ViTs) have become one of the go-to deep net architectures. Despite being inspired by Convolutional Neural Networks (CNNs), ViTs' output remains sensitive to small spatial shifts in the input, i.e., not shift invariant. To address this shortcoming, we introduce novel data-adaptive designs for each of the modules in ViTs, such as tokenization, self-attention…

    Submitted 28 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  4. arXiv:2210.08001  [pdf, other]

    cs.CV cs.AI cs.LG

    Learnable Polyphase Sampling for Shift Invariant and Equivariant Convolutional Networks

    Authors: Renan A. Rojas-Gomez, Teck-Yian Lim, Alexander G. Schwing, Minh N. Do, Raymond A. Yeh

    Abstract: We propose learnable polyphase sampling (LPS), a pair of learnable down/upsampling layers that enable truly shift-invariant and equivariant convolutional networks. LPS can be trained end-to-end from data and generalizes existing handcrafted downsampling layers. It is widely applicable as it can be integrated into any convolutional network by replacing down/upsampling layers. We evaluate LPS on ima…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted at the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  5. arXiv:2106.06927  [pdf, other]

    cs.CV cs.LG cs.NE

    Inverting Adversarially Robust Networks for Image Synthesis

    Authors: Renan A. Rojas-Gomez, Raymond A. Yeh, Minh N. Do, Anh Nguyen

    Abstract: Despite unconditional feature inversion being the foundation of many image synthesis applications, training an inverter demands a high computational budget, large decoding capacity and imposing conditions such as autoregressive priors. To address these limitations, we propose the use of adversarially robust representations as a perceptual primitive for feature inversion. We train an adversarially…

    Submitted 21 October, 2022; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: Accepted at the 16th Asian Conference on Computer Vision (ACCV 2022)

  6. arXiv:2009.01807  [pdf, other]

    cs.LG eess.IV stat.ML

    Physics-Consistent Data-driven Waveform Inversion with Adaptive Data Augmentation

    Authors: Renán Rojas-Gómez, Jihyun Yang, Youzuo Lin, James Theiler, Brendt Wohlberg

    Abstract: Seismic full-waveform inversion (FWI) is a nonlinear computational imaging technique that can provide detailed estimates of subsurface geophysical properties. Solving the FWI problem can be challenging due to its ill-posedness and high computational cost. In this work, we develop a new hybrid computational approach to solve FWI that combines physics-based models with data-driven methodologies. In…

    Submitted 3 September, 2020; originally announced September 2020.