
Showing 1–16 of 16 results for author: Chan, E R

Searching in archive cs.
  1. arXiv:2406.04239  [pdf, other]

    cs.LG

    Solving Inverse Problems in Protein Space Using Diffusion-Based Priors

    Authors: Axel Levy, Eric R. Chan, Sara Fridovich-Keil, Frédéric Poitevin, Ellen D. Zhong, Gordon Wetzstein

    Abstract: The interaction of a protein with its environment can be understood and controlled via its 3D structure. Experimental methods for protein structure determination, such as X-ray crystallography or cryogenic electron microscopy, shed light on biological processes but introduce challenging inverse problems. Learning-based approaches have emerged as accurate and efficient methods to solve these invers…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2310.17994  [pdf, other]

    cs.CV cs.GR

    ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image

    Authors: Kyle Sargent, Zizhang Li, Tanmay Shah, Charles Herrmann, Hong-Xing Yu, Yunzhi Zhang, Eric Ryan Chan, Dmitry Lagun, Li Fei-Fei, Deqing Sun, Jiajun Wu

    Abstract: We introduce a 3D-aware diffusion model, ZeroNVS, for single-image novel view synthesis for in-the-wild scenes. While existing methods are designed for single objects with masked backgrounds, we propose new techniques to address challenges introduced by in-the-wild multi-object scenes with complex backgrounds. Specifically, we train a generative prior on a mixture of data sources that capture obje…

    Submitted 23 April, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to CVPR 2024. 12 pages

  3. arXiv:2310.07204  [pdf, other]

    cs.AI cs.CV cs.GR cs.LG

    State of the Art on Diffusion Models for Visual Computing

    Authors: Ryan Po, Wang Yifan, Vladislav Golyanik, Kfir Aberman, Jonathan T. Barron, Amit H. Bermano, Eric Ryan Chan, Tali Dekel, Aleksander Holynski, Angjoo Kanazawa, C. Karen Liu, Lingjie Liu, Ben Mildenhall, Matthias Nießner, Björn Ommer, Christian Theobalt, Peter Wonka, Gordon Wetzstein

    Abstract: The field of visual computing is rapidly advancing due to the emergence of generative artificial intelligence (AI), which unlocks unprecedented capabilities for the generation, editing, and reconstruction of images, videos, and 3D scenes. In these domains, diffusion models are the generative AI architecture of choice. Within the last year alone, the literature on diffusion-based tools and applicat…

    Submitted 11 October, 2023; originally announced October 2023.

  4. Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization

    Authors: Connor Z. Lin, Koki Nagano, Jan Kautz, Eric R. Chan, Umar Iqbal, Leonidas Guibas, Gordon Wetzstein, Sameh Khamis

    Abstract: There is a growing demand for the accessible creation of high-quality 3D avatars that are animatable and customizable. Although 3D morphable models provide intuitive control for editing and animation, and robustness for single-view face reconstruction, they cannot easily capture geometric and appearance details. Methods based on neural implicit representations, such as signed distance functions (S…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH 2023, Project Page: https://research.nvidia.com/labs/toronto-ai/ssif

  5. arXiv:2305.02310  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Real-Time Radiance Fields for Single-Image Portrait View Synthesis

    Authors: Alex Trevithick, Matthew Chan, Michael Stengel, Eric R. Chan, Chao Liu, Zhiding Yu, Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi, Koki Nagano

    Abstract: We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image encoder directly predicts a canonical triplane representation of a neural radiance field for 3D-aware novel view synthesis via volume rendering. Our method is fast (24 fps) on consumer hardware, and produces higher q…

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Project page: https://research.nvidia.com/labs/nxp/lp3d/

  6. arXiv:2304.02602  [pdf, other]

    cs.CV cs.AI cs.GR

    Generative Novel View Synthesis with 3D-Aware Diffusion Models

    Authors: Eric R. Chan, Koki Nagano, Matthew A. Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, Gordon Wetzstein

    Abstract: We present a diffusion-based model for 3D-aware generative novel view synthesis from as few as a single input image. Our model samples from the distribution of possible renderings consistent with the input and, even in the presence of ambiguity, is capable of rendering diverse and plausible novel views. To achieve this, our method makes use of existing 2D diffusion backbones but, crucially, incorp…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Project page: https://nvlabs.github.io/genvs

  7. arXiv:2303.06138  [pdf, other]

    cs.CV cs.GR

    Learning Object-Centric Neural Scattering Functions for Free-Viewpoint Relighting and Scene Composition

    Authors: Hong-Xing Yu, Michelle Guo, Alireza Fathi, Yen-Yu Chang, Eric Ryan Chan, Ruohan Gao, Thomas Funkhouser, Jiajun Wu

    Abstract: Photorealistic object appearance modeling from 2D images is a constant topic in vision and graphics. While neural implicit methods (such as Neural Radiance Fields) have shown high-fidelity view synthesis results, they cannot relight the captured objects. More recent neural inverse rendering approaches have enabled object relighting, but they represent surface properties as simple BRDFs, and theref…

    Submitted 3 October, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Journal extension of arXiv:2012.08503 (TMLR 2023). The first two authors contributed equally to this work. Project page: https://kovenyu.com/osf/

    Journal ref: Transactions on Machine Learning Research (TMLR), 2023

  8. arXiv:2303.04291  [pdf, other]

    eess.IV cs.CV

    Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition

    Authors: Cindy M. Nguyen, Eric R. Chan, Alexander W. Bergman, Gordon Wetzstein

    Abstract: Capturing images is a key part of automation for high-level tasks such as scene text recognition. Low-light conditions pose a challenge for high-level perception stacks, which are often optimized on well-lit, artifact-free images. Reconstruction methods for low-light images can produce well-lit counterparts, but typically at the cost of high-frequency details critical for downstream tasks. We prop…

    Submitted 30 October, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: WACV 2024. Project website: https://ccnguyen.github.io/diffusion-in-the-dark/

  9. arXiv:2211.16677  [pdf, other]

    cs.CV cs.AI cs.GR

    3D Neural Field Generation using Triplane Diffusion

    Authors: J. Ryan Shue, Eric Ryan Chan, Ryan Po, Zachary Ankner, Jiajun Wu, Gordon Wetzstein

    Abstract: Diffusion models have emerged as the state-of-the-art for image generation, among other tasks. Here, we present an efficient diffusion-based model for 3D-aware generation of neural fields. Our approach pre-processes training data, such as ShapeNet meshes, by converting them to continuous occupancy fields and factoring them into a set of axis-aligned triplane feature representations. Thus, our 3D t…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Project page: https://jryanshue.com/nfd

  10. arXiv:2211.12131  [pdf, other]

    cs.CV

    DiffDreamer: Towards Consistent Unsupervised Single-view Scene Extrapolation with Conditional Diffusion Models

    Authors: Shengqu Cai, Eric Ryan Chan, Songyou Peng, Mohamad Shahbazi, Anton Obukhov, Luc Van Gool, Gordon Wetzstein

    Abstract: Scene extrapolation -- the idea of generating novel views by flying into a given image -- is a promising, yet challenging task. For each predicted frame, a joint inpainting and 3D refinement problem has to be solved, which is ill posed and includes a high level of ambiguity. Moreover, training data for long-range scenes is difficult to obtain and usually lacks sufficient views to infer accurate ca…

    Submitted 18 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

  11. arXiv:2206.14314  [pdf, other]

    cs.CV cs.GR

    Generative Neural Articulated Radiance Fields

    Authors: Alexander W. Bergman, Petr Kellnhofer, Wang Yifan, Eric R. Chan, David B. Lindell, Gordon Wetzstein

    Abstract: Unsupervised learning of 3D-aware generative adversarial networks (GANs) using only collections of single-view 2D photographs has very recently made much progress. These 3D GANs, however, have not been demonstrated for human bodies and the generated radiance fields of existing frameworks are not directly editable, limiting their applicability in downstream tasks. We propose a solution to these cha…

    Submitted 9 January, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: Project website: http://www.computationalimaging.org/publications/gnarf/

  12. arXiv:2203.13441  [pdf, other]

    cs.CV cs.GR cs.LG

    3D GAN Inversion for Controllable Portrait Image Animation

    Authors: Connor Z. Lin, David B. Lindell, Eric R. Chan, Gordon Wetzstein

    Abstract: Millions of images of human faces are captured every single day; but these photographs portray the likeness of an individual with a fixed pose, expression, and appearance. Portrait image animation enables the post-capture adjustment of these attributes from a single image while maintaining a photorealistic reconstruction of the subject's likeness or identity. Still, current methods for portrait im…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Project page: https://www.computationalimaging.org/publications/3dganinversion/

  13. arXiv:2112.07945  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Efficient Geometry-aware 3D Generative Adversarial Networks

    Authors: Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, Gordon Wetzstein

    Abstract: Unsupervised generation of high-quality multi-view-consistent images and 3D shapes using only collections of single-view 2D photographs has been a long-standing challenge. Existing 3D GANs are either compute-intensive or make approximations that are not 3D-consistent; the former limits quality and resolution of the generated images and the latter adversely affects multi-view consistency and shape…

    Submitted 27 April, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Project page: https://matthew-a-chan.github.io/EG3D

  14. arXiv:2105.02788  [pdf, other]

    cs.CV cs.GR cs.LG

    ACORN: Adaptive Coordinate Networks for Neural Scene Representation

    Authors: Julien N. P. Martel, David B. Lindell, Connor Z. Lin, Eric R. Chan, Marco Monteiro, Gordon Wetzstein

    Abstract: Neural representations have emerged as a new paradigm for applications in rendering, imaging, geometric modeling, and simulation. Compared to traditional representations such as meshes, point clouds, or volumes they can be flexibly incorporated into differentiable learning-based pipelines. While recent improvements to neural representations now make it possible to represent signals with fine detai…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: J. N. P. Martel and D. B. Lindell equally contributed to this work

  15. arXiv:2012.00926  [pdf, other]

    cs.CV cs.GR

    pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis

    Authors: Eric R. Chan, Marco Monteiro, Petr Kellnhofer, Jiajun Wu, Gordon Wetzstein

    Abstract: We have witnessed rapid progress on 3D-aware image synthesis, leveraging recent advances in generative visual models and neural rendering. Existing approaches however fall short in two ways: first, they may lack an underlying 3D representation or rely on view-inconsistent rendering, hence synthesizing images that are not multi-view consistent; second, they often depend upon representation network…

    Submitted 5 April, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

  16. arXiv:2006.09662  [pdf, other]

    cs.CV cs.GR cs.LG

    MetaSDF: Meta-learning Signed Distance Functions

    Authors: Vincent Sitzmann, Eric R. Chan, Richard Tucker, Noah Snavely, Gordon Wetzstein

    Abstract: Neural implicit shape representations are an emerging paradigm that offers many potential benefits over conventional discrete representations, including memory efficiency at a high spatial resolution. Generalizing across shapes with such neural implicit representations amounts to learning priors over the respective function space and enables geometry reconstruction from partial or noisy observatio…

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: Project website: https://vsitzmann.github.io/metasdf/