Skip to main content

Showing 51–76 of 76 results for author: Zollhofer, M

.
  1. Real-Time Global Illumination Decomposition of Videos

    Authors: Abhimitra Meka, Mohammad Shafiei, Michael Zollhoefer, Christian Richardt, Christian Theobalt

    Abstract: We propose the first approach for the decomposition of a monocular color video into direct and indirect illumination components in real time. We retrieve, in separate layers, the contribution made to the scene appearance by the scene reflectance, the light sources and the reflections from various coherent scene regions to one another. Existing techniques that invert global light transport require… ▽ More

    Submitted 10 June, 2021; v1 submitted 6 August, 2019; originally announced August 2019.

    Journal ref: ACM Transactions on Graphics, 2021, 40(3), 22:1-16

  2. arXiv:1906.01618  [pdf, other

    cs.CV cs.AI

    Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations

    Authors: Vincent Sitzmann, Michael Zollhöfer, Gordon Wetzstein

    Abstract: Unsupervised learning with generative models has the potential of discovering rich representations of 3D scenes. While geometric deep learning has explored 3D-structure-aware representations of scene geometry, these models typically require explicit 3D supervision. Emerging neural scene representations can be trained only with posed 2D images, but existing methods ignore the three-dimensional stru… ▽ More

    Submitted 28 January, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Video: https://youtu.be/6vMEBWD8O20 Project page: https://vsitzmann.github.io/srns/

    ACM Class: I.2.10; I.4.5; I.4.8; I.4.10

  3. arXiv:1906.01524  [pdf, other

    cs.CV cs.GR cs.LG

    Text-based Editing of Talking-head Video

    Authors: Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B Goldman, Kyle Genova, Zeyu **, Christian Theobalt, Maneesh Agrawala

    Abstract: Editing talking-head video to change the speech content or to remove filler words is challenging. We propose a novel method to edit talking-head video based on its transcript to produce a realistic output video in which the dialogue of the speaker has been modified, while maintaining a seamless audio-visual flow (i.e. no jump cuts). Our method automatically annotates an input talking-head video wi… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: A version with higher resolution images can be downloaded from the authors' website

  4. arXiv:1905.10290  [pdf, other

    cs.CV cs.GR

    DEMEA: Deep Mesh Autoencoders for Non-Rigidly Deforming Objects

    Authors: Edgar Tretschk, Ayush Tewari, Michael Zollhöfer, Vladislav Golyanik, Christian Theobalt

    Abstract: Mesh autoencoders are commonly used for dimensionality reduction, sampling and mesh modeling. We propose a general-purpose DEep MEsh Autoencoder (DEMEA) which adds a novel embedded deformation layer to a graph-convolutional mesh autoencoder. The embedded deformation layer (EDL) is a differentiable deformable geometric proxy which explicitly models point displacements of non-rigid deformations in a… ▽ More

    Submitted 4 August, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: 27 pages, including supplementary material

  5. arXiv:1904.12356  [pdf, other

    cs.CV cs.GR

    Deferred Neural Rendering: Image Synthesis using Neural Textures

    Authors: Justus Thies, Michael Zollhöfer, Matthias Nießner

    Abstract: The modern computer graphics pipeline can synthesize images at remarkable visual quality; however, it requires well-defined, high-quality 3D content as input. In this work, we explore the use of imperfect 3D content, for instance, obtained from photo-metric reconstructions with noisy and incomplete surface geometry, while still aiming to produce photo-realistic (re-)renderings. To address this cha… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.

    Comments: Video: https://youtu.be/z-pVip6WeyY SIGGRAPH 2019

  6. arXiv:1902.06835  [pdf, other

    cs.CV

    Commodity RGB-D Sensors: Data Acquisition

    Authors: Michael Zollhöfer

    Abstract: Over the past ten years we have seen a democratization of range sensing technology. While previously range sensors have been highly expensive and only accessible to a few domain experts, such sensors are nowadays ubiquitous and can even be found in the latest generation of mobile devices, e.g., current smartphones. This democratization of range sensing technology was started with the release of th… ▽ More

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: Contributed chapter to a book on "RGB-D Image Analysis and Processing"

  7. arXiv:1812.07603  [pdf, other

    cs.CV

    FML: Face Model Learning from Videos

    Authors: Ayush Tewari, Florian Bernard, Pablo Garrido, Gaurav Bharaj, Mohamed Elgharib, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt

    Abstract: Monocular image-based 3D reconstruction of faces is a long-standing problem in computer vision. Since image data is a 2D projection of a 3D face, the resulting depth ambiguity makes the problem ill-posed. Most existing methods rely on data-driven priors that are built from limited 3D face scans. In contrast, we propose multi-frame video-based self-supervised training of a deep network that (i) lea… ▽ More

    Submitted 9 April, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: CVPR 2019 (Oral). Video: https://www.youtube.com/watch?v=SG2BwxCw0lQ, Project Page: https://gvv.mpi-inf.mpg.de/projects/FML19/

  8. arXiv:1812.01024  [pdf, other

    cs.CV

    DeepVoxels: Learning Persistent 3D Feature Embeddings

    Authors: Vincent Sitzmann, Justus Thies, Felix Heide, Matthias Nießner, Gordon Wetzstein, Michael Zollhöfer

    Abstract: In this work, we address the lack of 3D understanding of generative neural networks by introducing a persistent 3D feature embedding for view synthesis. To this end, we propose DeepVoxels, a learned representation that encodes the view-dependent appearance of a 3D scene without having to explicitly model its geometry. At its core, our approach is based on a Cartesian 3D grid of persistent embedded… ▽ More

    Submitted 10 April, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: Video: https://www.youtube.com/watch?v=HM_WsZhoGXw Supplemental material: https://drive.google.com/file/d/1BnZRyNcVUty6-LxAstN83H79ktUq8Cjp/view?usp=sharing Code: https://github.com/vsitzmann/deepvoxels Project page: https://vsitzmann.github.io/deepvoxels/

  9. arXiv:1811.10720  [pdf, other

    cs.CV

    IGNOR: Image-guided Neural Object Rendering

    Authors: Justus Thies, Michael Zollhöfer, Christian Theobalt, Marc Stamminger, Matthias Nießner

    Abstract: We propose a learned image-guided rendering technique that combines the benefits of image-based rendering and GAN-based image synthesis. The goal of our method is to generate photo-realistic re-renderings of reconstructed objects for virtual and augmented reality applications (e.g., virtual showrooms, virtual tours \& sightseeing, the digital inspection of historical artifacts). A core component o… ▽ More

    Submitted 15 January, 2020; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: Video: https://youtu.be/s79HG9yn7QM

  10. arXiv:1810.02648  [pdf, other

    cs.CV

    LiveCap: Real-time Human Performance Capture from Monocular Video

    Authors: Marc Habermann, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

    Abstract: We present the first real-time human performance capture approach that reconstructs dense, space-time coherent deforming geometry of entire humans in general everyday clothing from just a single RGB video. We propose a novel two-stage analysis-by-synthesis optimization whose formulation and implementation are designed for high performance. In the first stage, a skinned template model is jointly fi… ▽ More

    Submitted 25 January, 2019; v1 submitted 5 October, 2018; originally announced October 2018.

  11. arXiv:1809.03658  [pdf, other

    cs.CV

    Neural Rendering and Reenactment of Human Actor Videos

    Authors: Lingjie Liu, Weipeng Xu, Michael Zollhoefer, Hyeongwoo Kim, Florian Bernard, Marc Habermann, Wen** Wang, Christian Theobalt

    Abstract: We propose a method for generating video-realistic animations of real humans under user control. In contrast to conventional human character rendering, we do not require the availability of a production-quality photo-realistic 3D model of the human, but instead rely on a video sequence in conjunction with a (medium-quality) controllable 3D template model of the person. With that, our approach sign… ▽ More

    Submitted 9 May, 2019; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: ACM ToG paper. Project page: http://gvv.mpi-inf.mpg.de/projects/wxu/HumanReenactment/

  12. HeadOn: Real-time Reenactment of Human Portrait Videos

    Authors: Justus Thies, Michael Zollhöfer, Christian Theobalt, Marc Stamminger, Matthias Nießner

    Abstract: We propose HeadOn, the first real-time source-to-target reenactment approach for complete human portrait videos that enables transfer of torso and head motion, face expression, and eye gaze. Given a short RGB-D video of the target actor, we automatically construct a personalized geometry proxy that embeds a parametric head, eye, and kinematic torso model. A novel real-time reenactment algorithm em… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: Video: https://www.youtube.com/watch?v=7Dg49wv2c_g Presented at Siggraph'18

  13. arXiv:1805.11714  [pdf, other

    cs.CV cs.AI cs.GR

    Deep Video Portraits

    Authors: Hyeongwoo Kim, Pablo Garrido, Ayush Tewari, Weipeng Xu, Justus Thies, Matthias Nießner, Patrick Pérez, Christian Richardt, Michael Zollhöfer, Christian Theobalt

    Abstract: We present a novel approach that enables photo-realistic re-animation of portrait videos using only an input video. In contrast to existing approaches that are restricted to manipulations of facial expressions only, we are the first to transfer the full 3D head position, head rotation, face expression, eye gaze, and eye blinking from a source actor to a portrait video of a target actor. The core o… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: SIGGRAPH 2018, Video: https://www.youtube.com/watch?v=qc5P2bvfl44

  14. arXiv:1803.05959  [pdf, other

    cs.CV

    Mo2Cap2: Real-time Mobile 3D Motion Capture with a Cap-mounted Fisheye Camera

    Authors: Weipeng Xu, Avishek Chatterjee, Michael Zollhoefer, Helge Rhodin, Pascal Fua, Hans-Peter Seidel, Christian Theobalt

    Abstract: We propose the first real-time approach for the egocentric estimation of 3D human body pose in a wide range of unconstrained everyday activities. This setting has a unique set of challenges, such as mobility of the hardware setup, and robustness to long capture sessions with fast recovery from tracking failures. We tackle these challenges based on a novel lightweight setup that converts a standard… ▽ More

    Submitted 23 January, 2019; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: IEEE TVCG Proc. VR 2019. Webpage: http://gvv.mpi-inf.mpg.de/projects/wxu/Mo2Cap2/

  15. arXiv:1801.01446  [pdf, other

    cs.CV

    IMU2Face: Real-time Gesture-driven Facial Reenactment

    Authors: Justus Thies, Michael Zollhöfer, Matthias Nießner

    Abstract: We present IMU2Face, a gesture-driven facial reenactment system. To this end, we combine recent advances in facial motion capture and inertial measurement units (IMUs) to control the facial expressions of a person in a target video based on intuitive hand gestures. IMUs are omnipresent, since modern smart-phones, smart-watches and drones integrate such sensors, e.g., for changing the orientation o… ▽ More

    Submitted 18 December, 2017; originally announced January 2018.

    Comments: https://youtu.be/UXGodiDAqiE

  16. arXiv:1801.01075  [pdf, other

    cs.CV

    LIME: Live Intrinsic Material Estimation

    Authors: Abhimitra Meka, Maxim Maximov, Michael Zollhoefer, Avishek Chatterjee, Hans-Peter Seidel, Christian Richardt, Christian Theobalt

    Abstract: We present the first end to end approach for real time material estimation for general object shapes with uniform material that only requires a single color image as input. In addition to Lambertian surface properties, our approach fully automatically computes the specular albedo, material shininess, and a foreground segmentation. We tackle this challenging and ill posed inverse rendering problem… ▽ More

    Submitted 4 May, 2018; v1 submitted 3 January, 2018; originally announced January 2018.

    Comments: 17 pages, Spotlight paper in CVPR 2018

  17. arXiv:1712.02859  [pdf, other

    cs.CV

    Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250 Hz

    Authors: Ayush Tewari, Michael Zollhöfer, Pablo Garrido, Florian Bernard, Hyeongwoo Kim, Patrick Pérez, Christian Theobalt

    Abstract: The reconstruction of dense 3D models of face geometry and appearance from a single image is highly challenging and ill-posed. To constrain the problem, many approaches rely on strong priors, such as parametric face models learned from limited 3D scan data. However, prior models restrict generalization of the true diversity in facial geometry, skin reflectance and illumination. To alleviate this p… ▽ More

    Submitted 29 March, 2018; v1 submitted 7 December, 2017; originally announced December 2017.

    Comments: CVPR 2018 (Oral). Project webpage: https://gvv.mpi-inf.mpg.de/projects/FML/

  18. arXiv:1708.02136  [pdf, other

    cs.CV cs.GR

    MonoPerfCap: Human Performance Capture from Monocular Video

    Authors: Weipeng Xu, Avishek Chatterjee, Michael Zollhöfer, Helge Rhodin, Dushyant Mehta, Hans-Peter Seidel, Christian Theobalt

    Abstract: We present the first marker-less approach for temporally coherent 3D performance capture of a human with general clothing from monocular video. Our approach reconstructs articulated human skeleton motion as well as medium-scale non-rigid surface deformations in general scenes. Human performance capture is a challenging problem due to the large range of articulation, potentially fast motion, and co… ▽ More

    Submitted 23 February, 2018; v1 submitted 7 August, 2017; originally announced August 2017.

    Comments: Accepted to ACM TOG 2018, to be presented on SIGGRAPH 2018

  19. arXiv:1703.10956  [pdf, other

    cs.CV

    InverseFaceNet: Deep Monocular Inverse Face Rendering

    Authors: Hyeongwoo Kim, Michael Zollhöfer, Ayush Tewari, Justus Thies, Christian Richardt, Christian Theobalt

    Abstract: We introduce InverseFaceNet, a deep convolutional inverse rendering framework for faces that jointly estimates facial pose, shape, expression, reflectance and illumination from a single input image. By estimating all parameters from just a single image, advanced editing possibilities on a single face image, such as appearance editing and relighting, become feasible in real time. Most previous lear… ▽ More

    Submitted 16 May, 2018; v1 submitted 31 March, 2017; originally announced March 2017.

    Comments: CVPR 2018 (poster) 10 pages (+5 pages)

  20. arXiv:1703.10580  [pdf, other

    cs.CV

    MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

    Authors: Ayush Tewari, Michael Zollhöfer, Hyeongwoo Kim, Pablo Garrido, Florian Bernard, Patrick Pérez, Christian Theobalt

    Abstract: In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as decoder. The core innovation is our new differentiable parametric decoder that encapsulates im… ▽ More

    Submitted 7 December, 2017; v1 submitted 30 March, 2017; originally announced March 2017.

    Comments: International Conference on Computer Vision (ICCV) 2017 (Oral), 13 pages

  21. arXiv:1610.07159  [pdf, other

    cs.CV

    Real-time Halfway Domain Reconstruction of Motion and Geometry

    Authors: Lucas Thies, Michael Zollhöfer, Christian Richardt, Christian Theobalt, Günther Greiner

    Abstract: We present a novel approach for real-time joint reconstruction of 3D scene motion and geometry from binocular stereo videos. Our approach is based on a novel variational halfway-domain scene flow formulation, which allows us to obtain highly accurate spatiotemporal reconstructions of shape and motion. We solve the underlying optimization problem at real-time frame rates using a novel data-parallel… ▽ More

    Submitted 23 October, 2016; originally announced October 2016.

    Comments: Proc. of the International Conference on 3D Vision 2016 (3DV 2016)

    ACM Class: I.4.8

  22. arXiv:1610.04889  [pdf, other

    cs.CV

    Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

    Authors: Srinath Sridhar, Franziska Mueller, Michael Zollhöfer, Dan Casas, Antti Oulasvirta, Christian Theobalt

    Abstract: Real-time simultaneous tracking of hands manipulating and interacting with external objects has many potential applications in augmented reality, tangible computing, and wearable computing. However, due to difficult occlusions, fast motions, and uniform hand appearance, jointly tracking hand and object pose is more challenging than tracking either of the two separately. Many previous approaches re… ▽ More

    Submitted 16 October, 2016; originally announced October 2016.

    Comments: Proceedings of ECCV 2016

  23. arXiv:1610.03151  [pdf, other

    cs.CV

    FaceVR: Real-Time Facial Reenactment and Eye Gaze Control in Virtual Reality

    Authors: Justus Thies, Michael Zollhöfer, Marc Stamminger, Christian Theobalt, Matthias Nießner

    Abstract: We propose FaceVR, a novel image-based method that enables video teleconferencing in VR based on self-reenactment. State-of-the-art face tracking methods in the VR context are focused on the animation of rigged 3d avatars. While they achieve good tracking performance the results look cartoonish and not real. In contrast to these model-based approaches, FaceVR enables VR teleconferencing using an i… ▽ More

    Submitted 21 March, 2018; v1 submitted 10 October, 2016; originally announced October 2016.

    Comments: Video: https://youtu.be/jIlujM5avU8 Presented at Siggraph'18

  24. arXiv:1604.06525  [pdf, other

    cs.GR cs.CV cs.PL

    Opt: A Domain Specific Language for Non-linear Least Squares Optimization in Graphics and Imaging

    Authors: Zachary DeVito, Michael Mara, Michael Zollhöfer, Gilbert Bernstein, Jonathan Ragan-Kelley, Christian Theobalt, Pat Hanrahan, Matthew Fisher, Matthias Nießner

    Abstract: Many graphics and vision problems can be expressed as non-linear least squares optimizations of objective functions over visual data, such as images and meshes. The mathematical descriptions of these functions are extremely concise, but their implementation in real code is tedious, especially when optimized for real-time performance on modern GPUs in interactive applications. In this work, we prop… ▽ More

    Submitted 9 September, 2017; v1 submitted 21 April, 2016; originally announced April 2016.

  25. arXiv:1604.01093  [pdf, other

    cs.GR cs.CV

    BundleFusion: Real-time Globally Consistent 3D Reconstruction using On-the-fly Surface Re-integration

    Authors: Angela Dai, Matthias Nießner, Michael Zollhöfer, Shahram Izadi, Christian Theobalt

    Abstract: Real-time, high-quality, 3D scanning of large-scale scenes is key to mixed reality and robotic applications. However, scalability brings challenges of drift in pose estimation, introducing significant errors in the accumulated model. Approaches often require hours of offline processing to globally correct model errors. Recent online methods demonstrate compelling results, but suffer from: (1) need… ▽ More

    Submitted 7 February, 2017; v1 submitted 4 April, 2016; originally announced April 2016.

  26. arXiv:1603.08161  [pdf, other

    cs.CV

    VolumeDeform: Real-time Volumetric Non-rigid Reconstruction

    Authors: Matthias Innmann, Michael Zollhöfer, Matthias Nießner, Christian Theobalt, Marc Stamminger

    Abstract: We present a novel approach for the reconstruction of dynamic geometric shapes using a single hand-held consumer-grade RGB-D sensor at real-time rates. Our method does not require a pre-defined shape template to start with and builds up the scene model from scratch during the scanning process. Geometry and motion are parameterized in a unified manner by a volumetric representation that encodes a d… ▽ More

    Submitted 30 July, 2016; v1 submitted 26 March, 2016; originally announced March 2016.