Skip to main content

Showing 1–13 of 13 results for author: Lagun, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.20055  [pdf, other

    cs.CV cs.LG

    SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting

    Authors: Sara Sabour, Lily Goli, George Kopanas, Mark Matthews, Dmitry Lagun, Leonidas Guibas, Alec Jacobson, David J. Fleet, Andrea Tagliasacchi

    Abstract: 3D Gaussian Splatting (3DGS) is a promising technique for 3D reconstruction, offering efficient training and rendering speeds, making it suitable for real-time applications.However, current methods require highly controlled environments (no moving people or wind-blown elements, and consistent lighting) to meet the inter-view consistency assumption of 3DGS. This makes reconstruction of real-world c… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2404.04421  [pdf, other

    cs.GR cs.CV

    PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

    Authors: Yang Zheng, Qingqing Zhao, Guandao Yang, Wang Yifan, Donglai Xiang, Florian Dubost, Dmitry Lagun, Thabo Beeler, Federico Tombari, Leonidas Guibas, Gordon Wetzstein

    Abstract: Modeling and rendering photorealistic avatars is of crucial importance in many applications. Existing methods that build a 3D avatar from visual observations, however, struggle to reconstruct clothed humans. We introduce PhysAvatar, a novel framework that combines inverse rendering with inverse physics to automatically estimate the shape and appearance of a human from multi-view video data along w… ▽ More

    Submitted 9 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: Project Page: https://qingqing-zhao.github.io/PhysAvatar

  3. arXiv:2312.02970  [pdf, other

    cs.CV cs.AI cs.GR

    Alchemist: Parametric Control of Material Properties with Diffusion Models

    Authors: Prafull Sharma, Varun Jampani, Yuanzhen Li, Xuhui Jia, Dmitry Lagun, Fredo Durand, William T. Freeman, Mark Matthews

    Abstract: We propose a method to control material attributes of objects like roughness, metallic, albedo, and transparency in real images. Our method capitalizes on the generative prior of text-to-image models known for photorealism, employing a scalar value and instructions to alter low-level material properties. Addressing the lack of datasets with controlled material attributes, we generated an object-ce… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  4. arXiv:2310.17994  [pdf, other

    cs.CV cs.GR

    ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image

    Authors: Kyle Sargent, Zizhang Li, Tanmay Shah, Charles Herrmann, Hong-Xing Yu, Yunzhi Zhang, Eric Ryan Chan, Dmitry Lagun, Li Fei-Fei, Deqing Sun, Jiajun Wu

    Abstract: We introduce a 3D-aware diffusion model, ZeroNVS, for single-image novel view synthesis for in-the-wild scenes. While existing methods are designed for single objects with masked backgrounds, we propose new techniques to address challenges introduced by in-the-wild multi-object scenes with complex backgrounds. Specifically, we train a generative prior on a mixture of data sources that capture obje… ▽ More

    Submitted 23 April, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to CVPR 2024. 12 pages

  5. arXiv:2309.16859  [pdf, other

    cs.CV cs.AI cs.LG

    Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis

    Authors: Marcel C. Bühler, Kripasindhu Sarkar, Tanmay Shah, Gengyan Li, Daoye Wang, Leonhard Helminger, Sergio Orts-Escolano, Dmitry Lagun, Otmar Hilliges, Thabo Beeler, Abhimitra Meka

    Abstract: NeRFs have enabled highly realistic synthesis of human faces including complex appearance and reflectance effects of hair and skin. These methods typically require a large number of multi-view input images, making the process hardware intensive and cumbersome, limiting applicability to unconstrained settings. We propose a novel volumetric human face prior that enables the synthesis of ultra high-r… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

  6. arXiv:2309.05569  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    ITI-GEN: Inclusive Text-to-Image Generation

    Authors: Cheng Zhang, Xuanbai Chen, Siqi Chai, Chen Henry Wu, Dmitry Lagun, Thabo Beeler, Fernando De la Torre

    Abstract: Text-to-image generative models often reflect the biases of the training data, leading to unequal representations of underrepresented groups. This study investigates inclusive text-to-image generative models that generate images based on human-written prompts and ensure the resulting images are uniformly distributed across attributes of interest. Unfortunately, directly expressing the desired attr… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV 2023 (Oral Presentation)

  7. arXiv:2303.13743  [pdf, other

    cs.CV

    TEGLO: High Fidelity Canonical Texture Map** from Single-View Images

    Authors: Vishal Vinod, Tanmay Shah, Dmitry Lagun

    Abstract: Recent work in Neural Fields (NFs) learn 3D representations from class-specific single view image collections. However, they are unable to reconstruct the input data preserving high-frequency details. Further, these methods do not disentangle appearance from geometry and hence are not suitable for tasks such as texture transfer and editing. In this work, we propose TEGLO (Textured EG3D-GLO) for le… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  8. arXiv:2303.08096  [pdf, other

    cs.CV

    MELON: NeRF with Unposed Images in SO(3)

    Authors: Axel Levy, Mark Matthews, Matan Sela, Gordon Wetzstein, Dmitry Lagun

    Abstract: Neural radiance fields enable novel-view synthesis and scene reconstruction with photorealistic quality from a few images, but require known and accurate camera poses. Conventional pose estimation algorithms fail on smooth or self-similar scenes, while methods performing inverse rendering from unposed views require a rough initialization of the camera orientations. The main difficulty of pose esti… ▽ More

    Submitted 19 July, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  9. arXiv:2209.10684  [pdf, other

    cs.CV

    Attention Beats Concatenation for Conditioning Neural Fields

    Authors: Daniel Rebain, Mark J. Matthews, Kwang Moo Yi, Gopal Sharma, Dmitry Lagun, Andrea Tagliasacchi

    Abstract: Neural fields model signals by map** coordinate inputs to sampled values. They are becoming an increasingly important backbone architecture across many fields from vision and graphics to biology and astronomy. In this paper, we explore the differences between common conditioning mechanisms within these networks, an essential ingredient in shifting neural fields from memorization of signals to ge… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

  10. arXiv:2203.03570  [pdf, other

    cs.CV cs.GR cs.LG

    Kubric: A scalable dataset generator

    Authors: Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti, Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi , et al. (10 additional authors not shown)

    Abstract: Data is the driving force of machine learning, with the amount and quality of training data often being more important for the performance of a system than architecture and training details. But collecting, processing and annotating real data at scale is difficult, expensive, and frequently raises additional privacy, fairness and legal concerns. Synthetic data is a powerful tool with the potential… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 21 pages, CVPR2022

  11. arXiv:2112.09061  [pdf, other

    cs.CV cs.AI cs.LG

    Solving Inverse Problems with NerfGANs

    Authors: Giannis Daras, Wen-Sheng Chu, Abhishek Kumar, Dmitry Lagun, Alexandros G. Dimakis

    Abstract: We introduce a novel framework for solving inverse problems using NeRF-style generative models. We are interested in the problem of 3-D scene reconstruction given a single 2-D image and known camera parameters. We show that naively optimizing the latent space leads to artifacts and poor novel view rendering. We attribute this problem to volume obstructions that are clear in the 3-D geometry and be… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 16 pages, 18 figures

  12. arXiv:2111.09996  [pdf, other

    cs.CV

    LOLNeRF: Learn from One Look

    Authors: Daniel Rebain, Mark Matthews, Kwang Moo Yi, Dmitry Lagun, Andrea Tagliasacchi

    Abstract: We present a method for learning a generative 3D model based on neural radiance fields, trained solely from data with only single views of each object. While generating realistic images is no longer a difficult task, producing the corresponding 3D structure such that they can be rendered from different views is non-trivial. We show that, unlike existing methods, one does not need multi-view data t… ▽ More

    Submitted 25 April, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: See https://lolnerf.github.io for additional results

  13. arXiv:1711.09767  [pdf, other

    cs.CV cs.AI cs.HC

    GazeGAN - Unpaired Adversarial Image Generation for Gaze Estimation

    Authors: Matan Sela, **mei Xu, Junfeng He, Vidhya Navalpakkam, Dmitry Lagun

    Abstract: Recent research has demonstrated the ability to estimate gaze on mobile devices by performing inference on the image from the phone's front-facing camera, and without requiring specialized hardware. While this offers wide potential applications such as in human-computer interaction, medical diagnosis and accessibility (e.g., hands free gaze as input for patients with motor disorders), current meth… ▽ More

    Submitted 27 November, 2017; originally announced November 2017.

    Comments: Project was done when the first author was at Google Research