Showing 1–6 of 6 results for author: Peebles, W

Searching in archive cs.
  1. arXiv:2212.09748  [pdf, other]

    cs.CV cs.LG

    Scalable Diffusion Models with Transformers

    Authors: William Peebles, Saining Xie

    Abstract: We explore a new class of diffusion models based on the transformer architecture. We train latent diffusion models of images, replacing the commonly-used U-Net backbone with a transformer that operates on latent patches. We analyze the scalability of our Diffusion Transformers (DiTs) through the lens of forward pass complexity as measured by Gflops. We find that DiTs with higher Gflops -- through…

    Submitted 2 March, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Code, project page and videos available at https://www.wpeebles.com/DiT

  2. arXiv:2209.12892  [pdf, other]

    cs.LG cs.CV

    Learning to Learn with Generative Models of Neural Network Checkpoints

    Authors: William Peebles, Ilija Radosavovic, Tim Brooks, Alexei A. Efros, Jitendra Malik

    Abstract: We explore a data-driven approach for learning to optimize neural networks. We construct a dataset of neural network checkpoints and train a generative model on the parameters. In particular, our model is a conditional diffusion transformer that, given an initial input parameter vector and a prompted loss, error, or return, predicts the distribution over parameter updates that achieve the desired…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: Code available at https://www.github.com/wpeebles/G.pt . Project page and videos available at https://www.wpeebles.com/Gpt

  3. arXiv:2112.05143  [pdf, other]

    cs.CV

    GAN-Supervised Dense Visual Alignment

    Authors: William Peebles, Jun-Yan Zhu, Richard Zhang, Antonio Torralba, Alexei A. Efros, Eli Shechtman

    Abstract: We propose GAN-Supervised Learning, a framework for learning discriminative models and their GAN-generated training data jointly end-to-end. We apply our framework to the dense visual alignment problem. Inspired by the classic Congealing method, our GANgealing algorithm trains a Spatial Transformer to map random samples from a GAN trained on unaligned data to a common, jointly-learned target mode.…

    Submitted 4 April, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: An updated version of our CVPR 2022 paper (oral); v2 features additional references and minor text changes. Code available at https://www.github.com/wpeebles/gangealing . Project page and videos available at https://www.wpeebles.com/gangealing

  4. arXiv:2008.10599  [pdf, other]

    cs.CV cs.GR cs.LG cs.NE

    The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement

    Authors: William Peebles, John Peebles, Jun-Yan Zhu, Alexei Efros, Antonio Torralba

    Abstract: Existing disentanglement methods for deep generative models rely on hand-picked priors and complex encoder-based architectures. In this paper, we propose the Hessian Penalty, a simple regularization term that encourages the Hessian of a generative model with respect to its input to be diagonal. We introduce a model-agnostic, unbiased stochastic approximation of this term based on Hutchinson's esti…

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: ECCV 2020 (Spotlight). Code available at https://github.com/wpeebles/hessian_penalty . Project page and videos available at https://www.wpeebles.com/hessian-penalty

  5. arXiv:2005.07727  [pdf, other]

    cs.CV cs.GR cs.LG

    Semantic Photo Manipulation with a Generative Image Prior

    Authors: David Bau, Hendrik Strobelt, William Peebles, Jonas Wulff, Bolei Zhou, Jun-Yan Zhu, Antonio Torralba

    Abstract: Despite the recent success of GANs in synthesizing images conditioned on inputs such as a user sketch, text, or semantic labels, manipulating the high-level attributes of an existing natural photograph with GANs is challenging for two reasons. First, it is hard for GANs to precisely reproduce an input image. Second, after manipulation, the newly synthesized pixels often do not fit the original ima…

    Submitted 12 September, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: SIGGRAPH 2019

    ACM Class: I.2.10; I.4; I.3

    Journal ref: ACM Transactions on Graphics (TOG) 38.4 (2019)

  6. arXiv:1910.11626  [pdf, other]

    cs.CV cs.GR cs.LG eess.IV

    Seeing What a GAN Cannot Generate

    Authors: David Bau, Jun-Yan Zhu, Jonas Wulff, William Peebles, Hendrik Strobelt, Bolei Zhou, Antonio Torralba

    Abstract: Despite the success of Generative Adversarial Networks (GANs), mode collapse remains a serious issue during GAN training. To date, little work has focused on understanding and quantifying which modes have been dropped by a model. In this work, we visualize mode collapse at both the distribution level and the instance level. First, we deploy a semantic segmentation network to compare the distributi…

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: ICCV 2019 oral; http://ganseeing.csail.mit.edu/