Skip to main content

Showing 1–4 of 4 results for author: Vijaykumar, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.00868  [pdf, other

    cs.CV

    We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline

    Authors: Simar Kareer, Vivek Vijaykumar, Harsh Maheshwari, Prithvijit Chattopadhyay, Judy Hoffman, Viraj Prabhu

    Abstract: There has been abundant work in unsupervised domain adaptation for semantic segmentation (DAS) seeking to adapt a model trained on images from a labeled source domain to an unlabeled target domain. While the vast majority of prior work has studied this as a frame-level Image-DAS problem, a few Video-DAS works have sought to additionally leverage the temporal signal present in adjacent frames. Howe… ▽ More

    Submitted 27 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: TMLR 2024

  2. arXiv:2212.00979  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain Generalization

    Authors: Prithvijit Chattopadhyay, Kartik Sarangmath, Vivek Vijaykumar, Judy Hoffman

    Abstract: Synthetic data offers the promise of cheap and bountiful training data for settings where labeled real-world data is scarce. However, models trained on synthetic data significantly underperform when evaluated on real-world data. In this paper, we propose Proportional Amplitude Spectrum Training Augmentation (PASTA), a simple and effective augmentation strategy to improve out-of-the-box synthetic-t… ▽ More

    Submitted 22 September, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted at ICCV 2023, Code: https://github.com/prithv1/PASTA

  3. arXiv:2103.12718  [pdf, other

    cs.CV

    Self-Supervised Pretraining Improves Self-Supervised Pretraining

    Authors: Colorado J. Reed, Xiangyu Yue, Ani Nrusimha, Sayna Ebrahimi, Vivek Vijaykumar, Richard Mao, Bo Li, Shanghang Zhang, Devin Guillory, Sean Metzger, Kurt Keutzer, Trevor Darrell

    Abstract: While self-supervised pretraining has proven beneficial for many computer vision tasks, it requires expensive and lengthy computation, large amounts of data, and is sensitive to data augmentation. Prior work demonstrates that models pretrained on datasets dissimilar to their target data, such as chest X-ray models trained on ImageNet, underperform models trained from scratch. Users that lack the r… ▽ More

    Submitted 24 March, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  4. arXiv:2009.06494  [pdf, other

    cs.HC

    Play Music An HCI Oriented Evaluation of Googles Default Music Player Interface

    Authors: Venkatesh Vijaykumar

    Abstract: The work embodied in this paper attempts to suggest a few improvements to the playlist creation task interface of the Google Play Music Android application based on recommended practices encountered in the Human-Computer Interaction discipline. The improvements are largely centered on intuitive navigation and selection actions, in order to facilitate a smoother experience in creating, ordering, an… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.