Skip to main content

Showing 1–23 of 23 results for author: Vahdat, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2305.04391  [pdf, other

    cs.LG cs.CV math.NA stat.ML

    A Variational Perspective on Solving Inverse Problems with Diffusion Models

    Authors: Morteza Mardani, Jiaming Song, Jan Kautz, Arash Vahdat

    Abstract: Diffusion models have emerged as a key pillar of foundation models in visual domains. One of their critical applications is to universally solve different downstream inverse tasks via a single diffusion prior without re-training for each task. Most inverse tasks can be formulated as inferring a posterior distribution over data (e.g., a full image) given a measurement (e.g., a masked image). This i… ▽ More

    Submitted 29 September, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

  2. arXiv:2302.07400  [pdf, other

    cs.LG math.FA stat.ML

    Score-based Diffusion Models in Function Space

    Authors: Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar

    Abstract: Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many… ▽ More

    Submitted 22 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages

    MSC Class: 46B09 (Primary); 60J22 (Secondary) ACM Class: I.2.6; J.2

  3. arXiv:2302.05872  [pdf, other

    cs.CV cs.LG stat.ML

    I$^2$SB: Image-to-Image Schrödinger Bridge

    Authors: Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos A. Theodorou, Weili Nie, Anima Anandkumar

    Abstract: We propose Image-to-Image Schrödinger Bridge (I$^2$SB), a new class of conditional diffusion models that directly learn the nonlinear diffusion processes between two given distributions. These diffusion bridges are particularly useful for image restoration, as the degraded images are structurally informative priors for reconstructing the clean images. I$^2$SB belongs to a tractable class of Schröd… ▽ More

    Submitted 25 May, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: ICML camera ready (high-resolution figures)

  4. arXiv:2210.09929  [pdf, other

    stat.ML cs.CR cs.LG

    Differentially Private Diffusion Models

    Authors: Tim Dockhorn, Tianshi Cao, Arash Vahdat, Karsten Kreis

    Abstract: While modern machine learning models rely on increasingly large training datasets, data is often limited in privacy-sensitive domains. Generative models trained with differential privacy (DP) on sensitive data can sidestep this challenge, providing access to synthetic data instead. We build on the recent success of diffusion models (DMs) and introduce Differentially Private Diffusion Models (DPDMs… ▽ More

    Submitted 30 December, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Accepted at TMLR (https://openreview.net/forum?id=ZPpQk7FJXF)

  5. arXiv:2210.06978  [pdf, other

    cs.CV cs.LG stat.ML

    LION: Latent Point Diffusion Models for 3D Shape Generation

    Authors: Xiaohui Zeng, Arash Vahdat, Francis Williams, Zan Gojcic, Or Litany, Sanja Fidler, Karsten Kreis

    Abstract: Denoising diffusion models (DDMs) have shown promising results in 3D point cloud synthesis. To advance 3D DDMs and make them useful for digital artists, we require (i) high generation quality, (ii) flexibility for manipulation and applications such as conditional synthesis and shape interpolation, and (iii) the ability to output smooth surfaces or meshes. To this end, we introduce the hierarchical… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  6. arXiv:2210.05475  [pdf, other

    stat.ML cs.LG

    GENIE: Higher-Order Denoising Diffusion Solvers

    Authors: Tim Dockhorn, Arash Vahdat, Karsten Kreis

    Abstract: Denoising diffusion models (DDMs) have emerged as a powerful class of generative models. A forward diffusion process slowly perturbs the data, while a deep model learns to gradually denoise. Synthesis amounts to solving a differential equation (DE) defined by the learnt model. Solving the DE requires slow iterative solvers for high-quality generation. In this work, we propose Higher-Order Denoisin… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  7. arXiv:2112.07804  [pdf, other

    cs.LG stat.ML

    Tackling the Generative Learning Trilemma with Denoising Diffusion GANs

    Authors: Zhisheng Xiao, Karsten Kreis, Arash Vahdat

    Abstract: A wide variety of deep generative models has been developed in the past decade. Yet, these models often struggle with simultaneously addressing three key requirements including: high sample quality, mode coverage, and fast sampling. We call the challenge imposed by these requirements the generative learning trilemma, as the existing models often trade some of them for others. Particularly, denoisi… ▽ More

    Submitted 4 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: ICLR 2022 (Spotlight)

  8. arXiv:2112.07068  [pdf, other

    stat.ML cs.LG

    Score-Based Generative Modeling with Critically-Damped Langevin Diffusion

    Authors: Tim Dockhorn, Arash Vahdat, Karsten Kreis

    Abstract: Score-based generative models (SGMs) have demonstrated remarkable synthesis quality. SGMs rely on a diffusion process that gradually perturbs the data towards a tractable distribution, while the generative model learns to denoise. The complexity of this denoising task is, apart from the data distribution itself, uniquely determined by the diffusion process. We argue that current SGMs employ overly… ▽ More

    Submitted 25 March, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: ICLR 2022 (Spotlight)

  9. arXiv:2106.05931  [pdf, other

    stat.ML cs.LG

    Score-based Generative Modeling in Latent Space

    Authors: Arash Vahdat, Karsten Kreis, Jan Kautz

    Abstract: Score-based generative models (SGMs) have recently demonstrated impressive results in terms of both sample quality and distribution coverage. However, they are usually applied directly in data space and often require thousands of network evaluations for sampling. Here, we propose the Latent Score-based Generative Model (LSGM), a novel approach that trains SGMs in a latent space, relying on the var… ▽ More

    Submitted 2 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  10. arXiv:2010.02917  [pdf, other

    cs.LG cs.CV stat.ML

    A Contrastive Learning Approach for Training Variational Autoencoder Priors

    Authors: Jyoti Aneja, Alexander Schwing, Jan Kautz, Arash Vahdat

    Abstract: Variational autoencoders (VAEs) are one of the powerful likelihood-based generative models with applications in many domains. However, they struggle to generate high-quality images, especially when samples are obtained from the prior without any tempering. One explanation for VAEs' poor generative quality is the prior hole problem: the prior distribution fails to match the aggregate approximate po… ▽ More

    Submitted 3 November, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS 2021

  11. arXiv:2010.00654  [pdf, other

    cs.LG cs.CV stat.ML

    VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models

    Authors: Zhisheng Xiao, Karsten Kreis, Jan Kautz, Arash Vahdat

    Abstract: Energy-based models (EBMs) have recently been successful in representing complex distributions of small images. However, sampling from them requires expensive Markov chain Monte Carlo (MCMC) iterations that mix slowly in high dimensional pixel space. Unlike EBMs, variational autoencoders (VAEs) generate samples quickly and are equipped with a latent space that enables fast traversal of the data ma… ▽ More

    Submitted 4 November, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: ICLR 2021 (spotlight)

  12. arXiv:2007.03898  [pdf, other

    stat.ML cs.CV cs.LG

    NVAE: A Deep Hierarchical Variational Autoencoder

    Authors: Arash Vahdat, Jan Kautz

    Abstract: Normalizing flows, autoregressive models, variational autoencoders (VAEs), and deep energy-based models are among competing likelihood-based frameworks for deep generative learning. Among them, VAEs have the advantage of fast and tractable sampling and easy-to-access encoding networks. However, they are currently outperformed by other models such as normalizing flows and autoregressive models. Whi… ▽ More

    Submitted 7 January, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Neural Information Processing Systems (NeurIPS) 2020 (spotlight)

  13. arXiv:2006.09920  [pdf, other

    cs.CV cs.CL cs.LG stat.ML

    Contrastive Learning for Weakly Supervised Phrase Grounding

    Authors: Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem

    Abstract: Phrase grounding, the problem of associating image regions to caption words, is a crucial component of vision-language tasks. We show that phrase grounding can be learned by optimizing word-region attention to maximize a lower bound on mutual information between images and caption words. Given pairs of images and captions, we maximize compatibility of the attention-weighted regions and the words i… ▽ More

    Submitted 5 August, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: ECCV 2020 (spotlight paper), Project page: http://tanmaygupta.info/info-ground

  14. arXiv:2002.03432  [pdf, other

    cs.LG cs.NE math.NA stat.ML

    On the distance between two neural networks and the stability of learning

    Authors: Jeremy Bernstein, Arash Vahdat, Yisong Yue, Ming-Yu Liu

    Abstract: This paper relates parameter distance to gradient breakdown for a broad class of nonlinear compositional functions. The analysis leads to a new distance function called deep relative trust and a descent lemma for neural networks. Since the resulting learning rule seems to require little to no learning rate tuning, it may unlock a simpler workflow for training deeper and more complex neural network… ▽ More

    Submitted 8 January, 2021; v1 submitted 9 February, 2020; originally announced February 2020.

  15. arXiv:1912.07651  [pdf, other

    cs.LG cs.CV stat.ML

    UNAS: Differentiable Architecture Search Meets Reinforcement Learning

    Authors: Arash Vahdat, Arun Mallya, Ming-Yu Liu, Jan Kautz

    Abstract: Neural architecture search (NAS) aims to discover network architectures with desired properties such as high accuracy or low latency. Recently, differentiable NAS (DNAS) has demonstrated promising results while maintaining a search cost orders of magnitude lower than reinforcement learning (RL) based NAS. However, DNAS models can only optimize differentiable loss functions in search, and they requ… ▽ More

    Submitted 27 August, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Accepted to CVPR 2020 (Oral)

  16. arXiv:1904.02361  [pdf, other

    cs.LG cs.CV stat.ML

    A Robust Learning Approach to Domain Adaptive Object Detection

    Authors: Mehran Khodabandeh, Arash Vahdat, Mani Ranjbar, William G. Macready

    Abstract: Domain shift is unavoidable in real-world applications of object detection. For example, in self-driving cars, the target domain consists of unconstrained road environments which cannot all possibly be observed in training data. Similarly, in surveillance applications sufficiently representative training data may be lacking due to privacy regulations. In this paper, we address the domain adaptatio… ▽ More

    Submitted 18 November, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

    Comments: Accepted to ICCV 2019

  17. arXiv:1902.01923  [pdf, other

    cs.LG stat.AP stat.ML

    Active Learning for High-Dimensional Binary Features

    Authors: Ali Vahdat, Mouloud Belbahri, Vahid Partovi Nia

    Abstract: Erbium-doped fiber amplifier (EDFA) is an optical amplifier/repeater device used to boost the intensity of optical signals being carried through a fiber optic communication system. A highly accurate EDFA model is important because of its crucial role in optical network management and optimization. The input channels of an EDFA device are treated as either on or off, hence the input features are bi… ▽ More

    Submitted 11 June, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

  18. arXiv:1901.03440  [pdf, other

    stat.ML cs.LG

    Undirected Graphical Models as Approximate Posteriors

    Authors: Arash Vahdat, Evgeny Andriyash, William G. Macready

    Abstract: The representation of the approximate posterior is a critical aspect of effective variational autoencoders (VAEs). Poor choices for the approximate posterior have a detrimental impact on the generative performance of VAEs due to the mismatch with the true posterior. We extend the class of posterior models that may be learned by using undirected graphical models. We develop an efficient method to t… ▽ More

    Submitted 7 June, 2020; v1 submitted 10 January, 2019; originally announced January 2019.

    Comments: Accepted to ICML 2020

  19. arXiv:1811.07073  [pdf, other

    cs.CV cs.LG stat.ML

    Semi-Supervised Semantic Image Segmentation with Self-correcting Networks

    Authors: Mostafa S. Ibrahim, Arash Vahdat, Mani Ranjbar, William G. Macready

    Abstract: Building a large image dataset with high-quality object masks for semantic segmentation is costly and time consuming. In this paper, we introduce a principled semi-supervised framework that only uses a small set of fully supervised images (having semantic segmentation labels and box labels) and a set of images with only object bounding box labels (we call it the weak set). Our framework trains the… ▽ More

    Submitted 25 February, 2020; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: Accepted to CVPR 2020

  20. arXiv:1810.00116  [pdf, other

    stat.ML cs.LG

    Improved Gradient-Based Optimization Over Discrete Distributions

    Authors: Evgeny Andriyash, Arash Vahdat, Bill Macready

    Abstract: In many applications we seek to maximize an expectation with respect to a distribution over discrete variables. Estimating gradients of such objectives with respect to the distribution parameters is a challenging problem. We analyze existing solutions including finite-difference (FD) estimators and continuous relaxation (CR) estimators in terms of bias and variance. We show that the commonly used… ▽ More

    Submitted 15 June, 2019; v1 submitted 28 September, 2018; originally announced October 2018.

  21. arXiv:1805.07445  [pdf, other

    stat.ML cs.LG

    DVAE#: Discrete Variational Autoencoders with Relaxed Boltzmann Priors

    Authors: Arash Vahdat, Evgeny Andriyash, William G. Macready

    Abstract: Boltzmann machines are powerful distributions that have been shown to be an effective prior over binary latent variables in variational autoencoders (VAEs). However, previous methods for training discrete VAEs have used the evidence lower bound and not the tighter importance-weighted bound. We propose two approaches for relaxing Boltzmann machines to continuous distributions that permit training w… ▽ More

    Submitted 15 October, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: Neural Information Processing Systems (NIPS) 2018

  22. arXiv:1802.04920  [pdf, other

    cs.LG stat.ML

    DVAE++: Discrete Variational Autoencoders with Overlap** Transformations

    Authors: Arash Vahdat, William G. Macready, Zhengbing Bian, Amir Khoshaman, Evgeny Andriyash

    Abstract: Training of discrete latent variable models remains challenging because passing gradient information through discrete units is difficult. We propose a new class of smoothing transformations based on a mixture of two overlap** distributions, and show that the proposed transformation can be used for training binary latent models with either directed or undirected priors. We derive a new variationa… ▽ More

    Submitted 25 May, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at International Conference on Machine Learning (ICML), 2018

  23. arXiv:1706.00038  [pdf, other

    cs.LG stat.ML

    Toward Robustness against Label Noise in Training Deep Discriminative Neural Networks

    Authors: Arash Vahdat

    Abstract: Collecting large training datasets, annotated with high-quality labels, is costly and time-consuming. This paper proposes a novel framework for training deep convolutional neural networks from noisy labeled datasets that can be obtained cheaply. The problem is formulated using an undirected graphical model that represents the relationship between noisy and clean labels, trained in a semi-supervise… ▽ More

    Submitted 2 November, 2017; v1 submitted 31 May, 2017; originally announced June 2017.

    Comments: To appear in Neural Information Processing Systems (NIPS) 2017