Showing 1–17 of 17 results for author: Balaji, Y

  1. arXiv:2305.10474  [pdf, other]

    cs.CV cs.GR cs.LG

    Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

    Authors: Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji

    Abstract: Despite tremendous progress in generating high-quality images using diffusion models, synthesizing a sequence of animated frames that are both photorealistic and temporally coherent is still in its infancy. While off-the-shelf billion-scale datasets for image generation are available, collecting similar video data of the same scale is still challenging. Also, training a video diffusion model is co…

    Submitted 25 March, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: ICCV 2023. Project webpage: https://research.nvidia.com/labs/dir/pyoco

  2. arXiv:2211.01324  [pdf, other]

    cs.CV cs.LG

    eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

    Authors: Yogesh Balaji, Seungjun Nah, Xun Huang, Arash Vahdat, Jiaming Song, Qinsheng Zhang, Karsten Kreis, Miika Aittala, Timo Aila, Samuli Laine, Bryan Catanzaro, Tero Karras, Ming-Yu Liu

    Abstract: Large-scale diffusion-based generative models have led to breakthroughs in text-conditioned high-resolution image synthesis. Starting from random noise, such text-to-image diffusion models gradually synthesize images in an iterative fashion while conditioning on text prompts. We find that their synthesis behavior qualitatively changes throughout this process: Early in sampling, generation strongly…

    Submitted 13 March, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  3. arXiv:2201.10766  [pdf, other]

    cs.CV

    A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes

    Authors: Mazda Moayeri, Phillip Pope, Yogesh Balaji, Soheil Feizi

    Abstract: While datasets with single-label supervision have propelled rapid advances in image classification, additional annotations are necessary in order to quantitatively assess how models make predictions. To this end, for a subset of ImageNet samples, we collect segmentation masks for the entire object and $18$ informative attributes. We call this dataset RIVAL10 (RIch Visual Attributes with Localizati…

    Submitted 26 January, 2022; originally announced January 2022.

  4. arXiv:2104.05605  [pdf, other]

    cs.LG stat.ML

    Understanding Overparameterization in Generative Adversarial Networks

    Authors: Yogesh Balaji, Mohammadmahdi Sajedi, Neha Mukund Kalibhat, Mucong Ding, Dominik Stöger, Mahdi Soltanolkotabi, Soheil Feizi

    Abstract: A broad class of unsupervised deep learning methods such as Generative Adversarial Networks (GANs) involve training of overparameterized models where the number of parameters of the model exceeds a certain threshold. A large body of work in supervised learning has shown the importance of model overparameterization in the convergence of gradient descent (GD) to globally optimal solutions. In c…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted in ICLR 2021

  5. arXiv:2010.05862  [pdf, other]

    cs.LG cs.CV

    Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation

    Authors: Yogesh Balaji, Rama Chellappa, Soheil Feizi

    Abstract: Optimal Transport (OT) distances such as Wasserstein have been used in several areas such as GANs and domain adaptation. OT, however, is very sensitive to outliers (samples with large noise) in the data since in its objective function, every sample, including outliers, is weighed similarly due to the marginal constraints. To remedy this issue, robust formulations of OT with unbalanced marginal con…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Accepted in NeurIPS 2020. Code available at https://github.com/yogeshbalaji/robustOT

  6. arXiv:2010.02418  [pdf, other]

    cs.LG cs.AI cs.CV

    The Effectiveness of Memory Replay in Large Scale Continual Learning

    Authors: Yogesh Balaji, Mehrdad Farajtabar, Dong Yin, Alex Mott, Ang Li

    Abstract: We study continual learning in the large scale setting where tasks in the input sequence are not limited to classification, and the outputs can be of high dimension. Among multiple state-of-the-art methods, we found vanilla experience replay (ER) still very competitive in terms of both performance and scalability, despite its simplicity. However, a degraded performance is observed for ER with smal…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 15 pages

  7. arXiv:2010.02350  [pdf, other]

    cs.LG cs.CV

    Winning Lottery Tickets in Deep Generative Models

    Authors: Neha Mukund Kalibhat, Yogesh Balaji, Soheil Feizi

    Abstract: The lottery ticket hypothesis suggests that sparse sub-networks of a given neural network, if initialized properly, can be trained to reach performance comparable to or even better than that of the original network. Prior works in lottery tickets have primarily focused on the supervised learning setup, with several papers proposing effective ways of finding "winning tickets" in classification problems…

    Submitted 29 January, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Published at AAAI 2021

  8. arXiv:2008.12839  [pdf, other]

    cs.CV cs.LG

    Learning to Balance Specificity and Invariance for In and Out of Domain Generalization

    Authors: Prithvijit Chattopadhyay, Yogesh Balaji, Judy Hoffman

    Abstract: We introduce Domain-specific Masks for Generalization, a model for improving both in-domain and out-of-domain generalization performance. For domain generalization, the goal is to learn from a set of source domains to produce a single model that will best generalize to an unseen target domain. As such, many prior approaches focus on learning representations which persist across all source domains…

    Submitted 28 August, 2020; originally announced August 2020.

    Comments: Published at ECCV 2020

  9. arXiv:2007.01261  [pdf, other]

    cs.CV cs.LG eess.IV

    Curriculum Manager for Source Selection in Multi-Source Domain Adaptation

    Authors: Luyu Yang, Yogesh Balaji, Ser-Nam Lim, Abhinav Shrivastava

    Abstract: The performance of Multi-Source Unsupervised Domain Adaptation depends significantly on the effectiveness of transfer from labeled source domain samples. In this paper, we propose an adversarial agent that learns a dynamic curriculum for source samples, called Curriculum Manager for Source Selection (CMSS). The Curriculum Manager, an independent network module, constantly updates the curriculum d…

    Submitted 2 July, 2020; originally announced July 2020.

  10. arXiv:2003.10713  [pdf, other]

    cs.LG stat.ML

    Unsupervised Anomaly Detection with Adversarial Mirrored AutoEncoders

    Authors: Gowthami Somepalli, Yexin Wu, Yogesh Balaji, Bhanukiran Vinzamuri, Soheil Feizi

    Abstract: Detecting out-of-distribution (OOD) samples is of paramount importance in all Machine Learning applications. Deep generative modeling has emerged as a dominant paradigm to model complex data distributions without labels. However, prior work has shown that generative models tend to assign higher likelihoods to OOD samples compared to the data distribution on which they were trained. First, we propo…

    Submitted 3 January, 2021; v1 submitted 24 March, 2020; originally announced March 2020.

    Comments: Updated the paper with more OOD detection baselines. Performed ablation analysis on various components of AMA

  11. arXiv:1911.10291  [pdf, other]

    cs.LG cs.CV stat.ML

    Invert and Defend: Model-based Approximate Inversion of Generative Adversarial Networks for Secure Inference

    Authors: Wei-An Lin, Yogesh Balaji, Pouya Samangouei, Rama Chellappa

    Abstract: Inferring the latent variable generating a given test sample is a challenging problem in Generative Adversarial Networks (GANs). In this paper, we propose InvGAN - a novel framework for solving the inference problem in GANs, which involves training an encoder network capable of inverting a pre-trained generator network without access to any training data. Under mild assumptions, we theoretically s…

    Submitted 22 November, 2019; originally announced November 2019.

  12. arXiv:1911.08654  [pdf, other]

    cs.LG stat.ML

    Adversarial Robustness of Flow-Based Generative Models

    Authors: Phillip Pope, Yogesh Balaji, Soheil Feizi

    Abstract: Flow-based generative models leverage invertible generator functions to fit a distribution to the training data using maximum likelihood. Despite their use in several application domains, robustness of these models to adversarial attacks has hardly been explored. In this paper, we study adversarial robustness of flow-based generative models both theoretically (for some simple models) and empirical…

    Submitted 19 November, 2019; originally announced November 2019.

  13. arXiv:1910.08051  [pdf, other]

    cs.LG cs.CV stat.ML

    Instance adaptive adversarial training: Improved accuracy tradeoffs in neural nets

    Authors: Yogesh Balaji, Tom Goldstein, Judy Hoffman

    Abstract: Adversarial training is by far the most successful strategy for improving robustness of neural networks to adversarial attacks. Despite its success as a defense mechanism, adversarial training fails to generalize well to the unperturbed test set. We hypothesize that this poor generalization is a consequence of adversarial training with a uniform perturbation radius around every training sample. Samples…

    Submitted 17 October, 2019; originally announced October 2019.

  14. arXiv:1902.00415  [pdf, other]

    cs.LG stat.ML

    Normalized Wasserstein Distance for Mixture Distributions with Applications in Adversarial Learning and Domain Adaptation

    Authors: Yogesh Balaji, Rama Chellappa, Soheil Feizi

    Abstract: Understanding proper distance measures between distributions is at the core of several learning tasks such as generative models, domain adaptation, clustering, etc. In this work, we focus on mixture distributions that arise naturally in several application domains where the data contains different sub-populations. For mixture distributions, established distance measures such as the Wasserstein dis…

    Submitted 29 October, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: Accepted at ICCV 2019

  15. arXiv:1810.04147  [pdf, other]

    cs.LG stat.ML

    Entropic GANs meet VAEs: A Statistical Approach to Compute Sample Likelihoods in GANs

    Authors: Yogesh Balaji, Hamed Hassani, Rama Chellappa, Soheil Feizi

    Abstract: Building on the success of deep learning, two modern approaches to learn a probability model from the data are Generative Adversarial Networks (GANs) and Variational AutoEncoders (VAEs). VAEs consider an explicit probability model for the data and compute a generative distribution by maximizing a variational lower-bound on the log-likelihood function. GANs, however, compute a generative model by m…

    Submitted 5 June, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

  16. arXiv:1711.06969  [pdf, other]

    cs.CV cs.LG stat.ML

    Learning from Synthetic Data: Addressing Domain Shift for Semantic Segmentation

    Authors: Swami Sankaranarayanan, Yogesh Balaji, Arpit Jain, Ser Nam Lim, Rama Chellappa

    Abstract: Visual Domain Adaptation is a problem of immense importance in computer vision. Previous approaches showcase the inability of even deep neural networks to learn informative representations across domain shift. This problem is more severe for tasks where acquiring hand-labeled data is extremely hard and tedious. In this work, we focus on adapting the representations learned by segmentation networks…

    Submitted 1 April, 2018; v1 submitted 19 November, 2017; originally announced November 2017.

    Comments: Accepted as spotlight talk at CVPR 2018. Code available here: https://github.com/swamiviv/LSD-seg

  17. arXiv:1704.01705  [pdf, other]

    cs.CV

    Generate To Adapt: Aligning Domains using Generative Adversarial Networks

    Authors: Swami Sankaranarayanan, Yogesh Balaji, Carlos D. Castillo, Rama Chellappa

    Abstract: Domain Adaptation is an actively researched problem in Computer Vision. In this work, we propose an approach that leverages unsupervised data to bring the source and target distributions closer in a learned joint feature space. We accomplish this by inducing a symbiotic relationship between the learned embedding and a generative adversarial network. This is in contrast to methods which use the adv…

    Submitted 12 April, 2018; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: Accepted as spotlight talk at CVPR 2018. Code available here: https://github.com/yogeshbalaji/Generate_To_Adapt