Skip to main content

Showing 1–17 of 17 results for author: Kautz, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2305.04391  [pdf, other

    cs.LG cs.CV math.NA stat.ML

    A Variational Perspective on Solving Inverse Problems with Diffusion Models

    Authors: Morteza Mardani, Jiaming Song, Jan Kautz, Arash Vahdat

    Abstract: Diffusion models have emerged as a key pillar of foundation models in visual domains. One of their critical applications is to universally solve different downstream inverse tasks via a single diffusion prior without re-training for each task. Most inverse tasks can be formulated as inferring a posterior distribution over data (e.g., a full image) given a measurement (e.g., a masked image). This i… ▽ More

    Submitted 29 September, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

  2. arXiv:2302.07400  [pdf, other

    cs.LG math.FA stat.ML

    Score-based Diffusion Models in Function Space

    Authors: Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar

    Abstract: Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many… ▽ More

    Submitted 22 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages

    MSC Class: 46B09 (Primary); 60J22 (Secondary) ACM Class: I.2.6; J.2

  3. arXiv:2106.05931  [pdf, other

    stat.ML cs.LG

    Score-based Generative Modeling in Latent Space

    Authors: Arash Vahdat, Karsten Kreis, Jan Kautz

    Abstract: Score-based generative models (SGMs) have recently demonstrated impressive results in terms of both sample quality and distribution coverage. However, they are usually applied directly in data space and often require thousands of network evaluations for sampling. Here, we propose the Latent Score-based Generative Model (LSGM), a novel approach that trains SGMs in a latent space, relying on the var… ▽ More

    Submitted 2 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  4. arXiv:2010.02917  [pdf, other

    cs.LG cs.CV stat.ML

    A Contrastive Learning Approach for Training Variational Autoencoder Priors

    Authors: Jyoti Aneja, Alexander Schwing, Jan Kautz, Arash Vahdat

    Abstract: Variational autoencoders (VAEs) are one of the powerful likelihood-based generative models with applications in many domains. However, they struggle to generate high-quality images, especially when samples are obtained from the prior without any tempering. One explanation for VAEs' poor generative quality is the prior hole problem: the prior distribution fails to match the aggregate approximate po… ▽ More

    Submitted 3 November, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS 2021

  5. arXiv:2010.00654  [pdf, other

    cs.LG cs.CV stat.ML

    VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models

    Authors: Zhisheng Xiao, Karsten Kreis, Jan Kautz, Arash Vahdat

    Abstract: Energy-based models (EBMs) have recently been successful in representing complex distributions of small images. However, sampling from them requires expensive Markov chain Monte Carlo (MCMC) iterations that mix slowly in high dimensional pixel space. Unlike EBMs, variational autoencoders (VAEs) generate samples quickly and are equipped with a latent space that enables fast traversal of the data ma… ▽ More

    Submitted 4 November, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: ICLR 2021 (spotlight)

  6. arXiv:2007.03898  [pdf, other

    stat.ML cs.CV cs.LG

    NVAE: A Deep Hierarchical Variational Autoencoder

    Authors: Arash Vahdat, Jan Kautz

    Abstract: Normalizing flows, autoregressive models, variational autoencoders (VAEs), and deep energy-based models are among competing likelihood-based frameworks for deep generative learning. Among them, VAEs have the advantage of fast and tractable sampling and easy-to-access encoding networks. However, they are currently outperformed by other models such as normalizing flows and autoregressive models. Whi… ▽ More

    Submitted 7 January, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Neural Information Processing Systems (NeurIPS) 2020 (spotlight)

  7. arXiv:2006.09920  [pdf, other

    cs.CV cs.CL cs.LG stat.ML

    Contrastive Learning for Weakly Supervised Phrase Grounding

    Authors: Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem

    Abstract: Phrase grounding, the problem of associating image regions to caption words, is a crucial component of vision-language tasks. We show that phrase grounding can be learned by optimizing word-region attention to maximize a lower bound on mutual information between images and caption words. Given pairs of images and captions, we maximize compatibility of the attention-weighted regions and the words i… ▽ More

    Submitted 5 August, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: ECCV 2020 (spotlight paper), Project page: http://tanmaygupta.info/info-ground

  8. arXiv:2002.09131  [pdf, other

    cs.LG cs.CV stat.ML

    Convolutional Tensor-Train LSTM for Spatio-temporal Learning

    Authors: Jiahao Su, Wonmin Byeon, Jean Kossaifi, Furong Huang, Jan Kautz, Animashree Anandkumar

    Abstract: Learning from spatio-temporal data has numerous applications such as human-behavior analysis, object tracking, video compression, and physics simulation.However, existing methods still perform poorly on challenging video tasks such as long-term forecasting. This is because these kinds of challenging tasks require learning long-term spatio-temporal correlations in the video sequence. In this paper,… ▽ More

    Submitted 4 October, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: Jiahao Su and Wonmin Byeon contributed equally to this work. 22 pages, 14 figures, NeurIPS 2020

  9. arXiv:2001.01885  [pdf, other

    cs.LG stat.ML

    Discovering Nonlinear Relations with Minimum Predictive Information Regularization

    Authors: Tailin Wu, Thomas Breuel, Michael Skuhersky, Jan Kautz

    Abstract: Identifying the underlying directional relations from observational time series with nonlinear interactions and complex relational structures is key to a wide range of applications, yet remains a hard problem. In this work, we introduce a novel minimum predictive information regularization method to infer directional relations from time series, allowing deep learning models to discover nonlinear r… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

    Comments: 26 pages, 11 figures; ICML'19 Time Series Workshop

  10. arXiv:1912.08795  [pdf, other

    cs.LG cs.CV stat.ML

    Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion

    Authors: Hongxu Yin, Pavlo Molchanov, Zhizhong Li, Jose M. Alvarez, Arun Mallya, Derek Hoiem, Niraj K. Jha, Jan Kautz

    Abstract: We introduce DeepInversion, a new method for synthesizing images from the image distribution used to train a deep neural network. We 'invert' a trained network (teacher) to synthesize class-conditional input images starting from random noise, without using any additional information about the training dataset. Kee** the teacher fixed, our method optimizes the input while regularizing the distrib… ▽ More

    Submitted 15 June, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

  11. arXiv:1912.07651  [pdf, other

    cs.LG cs.CV stat.ML

    UNAS: Differentiable Architecture Search Meets Reinforcement Learning

    Authors: Arash Vahdat, Arun Mallya, Ming-Yu Liu, Jan Kautz

    Abstract: Neural architecture search (NAS) aims to discover network architectures with desired properties such as high accuracy or low latency. Recently, differentiable NAS (DNAS) has demonstrated promising results while maintaining a search cost orders of magnitude lower than reinforcement learning (RL) based NAS. However, DNAS models can only optimize differentiable loss functions in search, and they requ… ▽ More

    Submitted 27 August, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Accepted to CVPR 2020 (Oral)

  12. arXiv:1912.02279  [pdf, other

    cs.LG cs.CV stat.ML

    Angular Visual Hardness

    Authors: Beidi Chen, Weiyang Liu, Zhiding Yu, Jan Kautz, Anshumali Shrivastava, Animesh Garg, Anima Anandkumar

    Abstract: Recent convolutional neural networks (CNNs) have led to impressive performance but often suffer from poor calibration. They tend to be overconfident, with the model confidence not always reflecting the underlying true ambiguity and hardness. In this paper, we propose angular visual hardness (AVH), a score given by the normalized angular distance between the sample feature embedding and the target… ▽ More

    Submitted 10 July, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

  13. arXiv:1906.10771  [pdf, other

    cs.LG cs.CV stat.ML

    Importance Estimation for Neural Network Pruning

    Authors: Pavlo Molchanov, Arun Mallya, Stephen Tyree, Iuri Frosio, Jan Kautz

    Abstract: Structural pruning of neural network parameters reduces computation, energy, and memory transfer costs during inference. We propose a novel method that estimates the contribution of a neuron (filter) to the final loss and iteratively removes those with smaller scores. We describe two variations of our method using the first and second-order Taylor expansions to approximate a filter's contribution.… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

  14. arXiv:1905.01723  [pdf, other

    cs.CV cs.AI cs.GR cs.MM stat.ML

    Few-Shot Unsupervised Image-to-Image Translation

    Authors: Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz

    Abstract: Unsupervised image-to-image translation methods learn to map images in a given class to an analogous image in a different class, drawing on unstructured (non-registered) datasets of images. While remarkably successful, current methods require access to many images in both source and destination classes at training time. We argue this greatly limits their use. Drawing inspiration from the human cap… ▽ More

    Submitted 9 September, 2019; v1 submitted 5 May, 2019; originally announced May 2019.

    Comments: The paper will be presented at the International Conference on Computer Vision (ICCV) 2019

    Journal ref: ICCV 2019

  15. arXiv:1804.04732  [pdf, other

    cs.CV cs.LG stat.ML

    Multimodal Unsupervised Image-to-Image Translation

    Authors: Xun Huang, Ming-Yu Liu, Serge Belongie, Jan Kautz

    Abstract: Unsupervised image-to-image translation is an important and challenging problem in computer vision. Given an image in the source domain, the goal is to learn the conditional distribution of corresponding images in the target domain, without seeing any pairs of corresponding images. While this conditional distribution is inherently multimodal, existing approaches make an overly simplified assumptio… ▽ More

    Submitted 14 August, 2018; v1 submitted 12 April, 2018; originally announced April 2018.

    Comments: Accepted by ECCV 2018

  16. arXiv:1611.06440  [pdf, other

    cs.LG stat.ML

    Pruning Convolutional Neural Networks for Resource Efficient Inference

    Authors: Pavlo Molchanov, Stephen Tyree, Tero Karras, Timo Aila, Jan Kautz

    Abstract: We propose a new formulation for pruning convolutional kernels in neural networks to enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by backpropagation - a computationally efficient procedure that maintains good generalization in the pruned network. We propose a new criterion based on Taylor expansion that approximates the change in the cost function induce… ▽ More

    Submitted 8 June, 2017; v1 submitted 19 November, 2016; originally announced November 2016.

    Comments: 17 pages, 14 figures, ICLR 2017 paper

  17. arXiv:1504.08219  [pdf, other

    cs.CV cs.LG stat.ML

    Hierarchical Subquery Evaluation for Active Learning on a Graph

    Authors: Oisin Mac Aodha, Neill D. F. Campbell, Jan Kautz, Gabriel J. Brostow

    Abstract: To train good supervised and semi-supervised object classifiers, it is critical that we not waste the time of the human experts who are providing the training labels. Existing active learning strategies can have uneven performance, being efficient on some datasets but wasteful on others, or inconsistent just between runs on the same dataset. We propose perplexity based graph construction and a new… ▽ More

    Submitted 30 April, 2015; originally announced April 2015.

    Comments: CVPR 2014