Skip to main content

Showing 1–21 of 21 results for author: Beeler, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.13807  [pdf, other

    cs.CV cs.GR

    FaceFolds: Meshed Radiance Manifolds for Efficient Volumetric Rendering of Dynamic Faces

    Authors: Safa C. Medin, Gengyan Li, Ruofei Du, Stephan Garbin, Philip Davidson, Gregory W. Wornell, Thabo Beeler, Abhimitra Meka

    Abstract: 3D rendering of dynamic face captures is a challenging problem, and it demands improvements on several fronts$\unicode{x2014}$photorealism, efficiency, compatibility, and configurability. We present a novel representation that enables high-quality volumetric rendering of an actor's dynamic facial performances with minimal compute and memory footprint. It runs natively on commodity graphics soft- a… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: In Proceedings of the ACM in Computer Graphics and Interactive Techniques, 2024

  2. arXiv:2404.04421  [pdf, other

    cs.GR cs.CV

    PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

    Authors: Yang Zheng, Qingqing Zhao, Guandao Yang, Wang Yifan, Donglai Xiang, Florian Dubost, Dmitry Lagun, Thabo Beeler, Federico Tombari, Leonidas Guibas, Gordon Wetzstein

    Abstract: Modeling and rendering photorealistic avatars is of crucial importance in many applications. Existing methods that build a 3D avatar from visual observations, however, struggle to reconstruct clothed humans. We introduce PhysAvatar, a novel framework that combines inverse rendering with inverse physics to automatically estimate the shape and appearance of a human from multi-view video data along w… ▽ More

    Submitted 9 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: Project Page: https://qingqing-zhao.github.io/PhysAvatar

  3. arXiv:2404.02225  [pdf, other

    cs.CV cs.AI

    CHOSEN: Contrastive Hypothesis Selection for Multi-View Depth Refinement

    Authors: Di Qiu, Yinda Zhang, Thabo Beeler, Vladimir Tankovich, Christian Häne, Sean Fanello, Christoph Rhemann, Sergio Orts Escolano

    Abstract: We propose CHOSEN, a simple yet flexible, robust and effective multi-view depth refinement framework. It can be employed in any existing multi-view stereo pipeline, with straightforward generalization capability for different multi-view capture systems such as camera relative positioning and lenses. Given an initial depth estimation, CHOSEN iteratively re-samples and selects the best hypotheses, a… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  4. arXiv:2404.01296  [pdf, other

    cs.CV

    MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space

    Authors: Armand Comas-Massagué, Di Qiu, Menglei Chai, Marcel Bühler, Amit Raj, Ruiqi Gao, Qiangeng Xu, Mark Matthews, Paulo Gotardo, Octavia Camps, Sergio Orts-Escolano, Thabo Beeler

    Abstract: We introduce a novel framework for 3D human avatar generation and personalization, leveraging text prompts to enhance user engagement and customization. Central to our approach are key innovations aimed at overcoming the challenges in photo-realistic avatar synthesis. Firstly, we utilize a conditional Neural Radiance Fields (NeRF) model, trained on a large-scale unannotated multi-view dataset, to… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  5. arXiv:2312.14929  [pdf, other

    cs.CV cs.GR

    MACS: Mass Conditioned 3D Hand and Object Motion Synthesis

    Authors: Soshi Shimada, Franziska Mueller, Jan Bednarik, Bardia Doosti, Bernd Bickel, Danhang Tang, Vladislav Golyanik, Jonathan Taylor, Christian Theobalt, Thabo Beeler

    Abstract: The physical properties of an object, such as mass, significantly affect how we manipulate it with our hands. Surprisingly, this aspect has so far been neglected in prior work on 3D motion synthesis. To improve the naturalness of the synthesized 3D hand object motions, this work proposes MACS the first MAss Conditioned 3D hand and object motion Synthesis approach. Our approach is based on cascaded… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  6. arXiv:2312.11994  [pdf, other

    cs.CV

    Optimizing Diffusion Noise Can Serve As Universal Motion Priors

    Authors: Korrawe Karunratanakul, Konpat Preechakul, Emre Aksan, Thabo Beeler, Supasorn Suwajanakorn, Siyu Tang

    Abstract: We propose Diffusion Noise Optimization (DNO), a new method that effectively leverages existing motion diffusion models as motion priors for a wide range of motion-related tasks. Instead of training a task-specific diffusion model for each new task, DNO operates by optimizing the diffusion latent noise of an existing pre-trained text-to-motion model. Given the corresponding latent noise of a human… ▽ More

    Submitted 3 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project page: https://korrawe.github.io/dno-project/

  7. arXiv:2311.16495  [pdf, other

    cs.CV

    Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement

    Authors: Jian Wang, Zhe Cao, Diogo Luvizon, Lingjie Liu, Kripasindhu Sarkar, Danhang Tang, Thabo Beeler, Christian Theobalt

    Abstract: In this work, we explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion. This task presents significant challenges due to three factors: the lack of high-quality datasets, fisheye camera distortion, and human body self-occlusion. To address these challenges, we propose a novel approach that leverages FisheyeViT to extra… ▽ More

    Submitted 2 December, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

  8. GroomGen: A High-Quality Generative Hair Model Using Hierarchical Latent Representations

    Authors: Yuxiao Zhou, Menglei Chai, Alessandro Pepe, Markus Gross, Thabo Beeler

    Abstract: Despite recent successes in hair acquisition that fits a high-dimensional hair model to a specific input subject, generative hair models, which establish general embedding spaces for encoding, editing, and sampling diverse hairstyles, are way less explored. In this paper, we present GroomGen, the first generative model designed for hair geometry composed of highly-detailed dense strands. Our appro… ▽ More

    Submitted 16 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: SIGGRAPH Asia 2023

    Journal ref: ACM Trans. Graph. 42, 6, Article 267 (December 2023)

  9. arXiv:2309.16859  [pdf, other

    cs.CV cs.AI cs.LG

    Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis

    Authors: Marcel C. Bühler, Kripasindhu Sarkar, Tanmay Shah, Gengyan Li, Daoye Wang, Leonhard Helminger, Sergio Orts-Escolano, Dmitry Lagun, Otmar Hilliges, Thabo Beeler, Abhimitra Meka

    Abstract: NeRFs have enabled highly realistic synthesis of human faces including complex appearance and reflectance effects of hair and skin. These methods typically require a large number of multi-view input images, making the process hardware intensive and cumbersome, limiting applicability to unconstrained settings. We propose a novel volumetric human face prior that enables the synthesis of ultra high-r… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

  10. arXiv:2309.05569  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    ITI-GEN: Inclusive Text-to-Image Generation

    Authors: Cheng Zhang, Xuanbai Chen, Siqi Chai, Chen Henry Wu, Dmitry Lagun, Thabo Beeler, Fernando De la Torre

    Abstract: Text-to-image generative models often reflect the biases of the training data, leading to unequal representations of underrepresented groups. This study investigates inclusive text-to-image generative models that generate images based on human-written prompts and ensure the resulting images are uniformly distributed across attributes of interest. Unfortunately, directly expressing the desired attr… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV 2023 (Oral Presentation)

  11. arXiv:2308.11015  [pdf, other

    cs.CV

    Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images

    Authors: Tze Ho Elden Tse, Franziska Mueller, Zhengyang Shen, Danhang Tang, Thabo Beeler, Mingsong Dou, Yinda Zhang, Sasa Petrovic, Hyung ** Chang, Jonathan Taylor, Bardia Doosti

    Abstract: We propose a novel transformer-based framework that reconstructs two high fidelity hands from multi-view RGB images. Unlike existing hand pose estimation methods, where one typically trains a deep network to regress hand model parameters from single RGB image, we consider a more challenging problem setting where we directly regress the absolute root poses of two-hands with extended forearm at high… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  12. arXiv:2305.12411  [pdf, other

    cs.CV

    Synthesizing Diverse Human Motions in 3D Indoor Scenes

    Authors: Kaifeng Zhao, Yan Zhang, Shaofei Wang, Thabo Beeler, Siyu Tang

    Abstract: We present a novel method for populating 3D indoor scenes with virtual humans that can navigate in the environment and interact with objects in a realistic manner. Existing approaches rely on training sequences that contain captured human motions and the 3D scenes they interact with. However, such interaction data are costly, difficult to capture, and can hardly cover all plausible human-scene int… ▽ More

    Submitted 21 August, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  13. arXiv:2304.01436  [pdf, other

    cs.CV cs.GR

    Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

    Authors: Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts-Escolano, Rohit Pandey, ** Tan, Thabo Beeler, Sean Fanello, Yinda Zhang

    Abstract: We propose a method to learn a high-quality implicit 3D head avatar from a monocular RGB video captured in the wild. The learnt avatar is driven by a parametric face model to achieve user-controlled facial expressions and head poses. Our hybrid pipeline combines the geometry prior and dynamic tracking of a 3DMM with a neural radiance field to achieve fine-grained control and photorealism. To reduc… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: In CVPR2023. Project page: https://augmentedperception.github.io/monoavatar/

  14. arXiv:2207.12824  [pdf, other

    cs.CV

    Compositional Human-Scene Interaction Synthesis with Semantic Control

    Authors: Kaifeng Zhao, Shaofei Wang, Yan Zhang, Thabo Beeler, Siyu Tang

    Abstract: Synthesizing natural interactions between virtual humans and their 3D environments is critical for numerous applications, such as computer games and AR/VR experiences. Our goal is to synthesize humans interacting with a given 3D scene controlled by high-level semantic specifications as pairs of action categories and object instances, e.g., "sit on the chair". The key challenge of incorporating int… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: To appear at ECCV 2022. The project page is available at https://zkf1997.github.io/COINS/index.html

  15. EyeNeRF: A Hybrid Representation for Photorealistic Synthesis, Animation and Relighting of Human Eyes

    Authors: Gengyan Li, Abhimitra Meka, Franziska Müller, Marcel C. Bühler, Otmar Hilliges, Thabo Beeler

    Abstract: A unique challenge in creating high-quality animatable and relightable 3D avatars of people is modeling human eyes. The challenge of synthesizing eyes is multifold as it requires 1) appropriate representations for the various components of the eye and the periocular region for coherent viewpoint synthesis, capable of representing diffuse, refractive and highly reflective surfaces, 2) disentangling… ▽ More

    Submitted 12 July, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 16 pages, 16 figures, 1 table, to be published in ACM Transactions on Graphics (TOG) (Volume: 41, Issue: 4), 2022

    ACM Class: I.4.5; I.3

  16. arXiv:2104.05988  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    VariTex: Variational Neural Face Textures

    Authors: Marcel C. Bühler, Abhimitra Meka, Gengyan Li, Thabo Beeler, Otmar Hilliges

    Abstract: Deep generative models can synthesize photorealistic images of human faces with novel identities. However, a key challenge to the wide applicability of such techniques is to provide independent control over semantically meaningful parameters: appearance, head pose, face shape, and facial expressions. In this paper, we propose VariTex - to the best of our knowledge the first method that learns a va… ▽ More

    Submitted 18 August, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021

  17. arXiv:2011.12866  [pdf, other

    cs.CV

    Deep Physics-aware Inference of Cloth Deformation for Monocular Human Performance Capture

    Authors: Yue Li, Marc Habermann, Bernhard Thomaszewski, Stelian Coros, Thabo Beeler, Christian Theobalt

    Abstract: Recent monocular human performance capture approaches have shown compelling dense tracking results of the full body from a single RGB camera. However, existing methods either do not estimate clothing at all or model cloth deformation with simple geometric priors instead of taking into account the underlying physical principles. This leads to noticeable artifacts in their reconstructions, e.g. bake… ▽ More

    Submitted 14 October, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  18. arXiv:2007.15837  [pdf, other

    cs.CV

    ETH-XGaze: A Large Scale Dataset for Gaze Estimation under Extreme Head Pose and Gaze Variation

    Authors: Xucong Zhang, Seonwook Park, Thabo Beeler, Derek Bradley, Siyu Tang, Otmar Hilliges

    Abstract: Gaze estimation is a fundamental task in many applications of computer vision, human computer interaction and robotics. Many state-of-the-art methods are trained and tested on custom datasets, making comparison across methods challenging. Furthermore, existing gaze estimation datasets have limited head pose and gaze variations, and the evaluations are conducted using different protocols and metric… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: Accepted at ECCV 2020 (Spotlight)

  19. arXiv:1909.02518  [pdf, other

    cs.CV cs.GR cs.LG

    Neural Style-Preserving Visual Dubbing

    Authors: Hyeongwoo Kim, Mohamed Elgharib, Michael Zollhöfer, Hans-Peter Seidel, Thabo Beeler, Christian Richardt, Christian Theobalt

    Abstract: Dubbing is a technique for translating video content from one language to another. However, state-of-the-art visual dubbing techniques directly copy facial expressions from source to target actors without considering identity-specific idiosyncrasies such as a unique type of smile. We present a style-preserving visual dubbing approach from single video inputs, which maintains the signature style of… ▽ More

    Submitted 6 September, 2019; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: SIGGRAPH Asia 2019

  20. arXiv:1909.01815  [pdf, other

    cs.CV cs.GR cs.LG

    3D Morphable Face Models -- Past, Present and Future

    Authors: Bernhard Egger, William A. P. Smith, Ayush Tewari, Stefanie Wuhrer, Michael Zollhoefer, Thabo Beeler, Florian Bernard, Timo Bolkart, Adam Kortylewski, Sami Romdhani, Christian Theobalt, Volker Blanz, Thomas Vetter

    Abstract: In this paper, we provide a detailed survey of 3D Morphable Face Models over the 20 years since they were first proposed. The challenges in building and applying these models, namely capture, modeling, image formation, and image analysis, are still active research topics, and we review the state-of-the-art in each of these areas. We also look ahead, identifying unsolved challenges, proposing direc… ▽ More

    Submitted 16 April, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: ACM Transactions on Graphics (TOG)

  21. arXiv:1907.10402  [pdf, other

    cs.GR

    Data-Driven Physical Face Inversion

    Authors: Yeara Kozlov, Hongyi Xu, Moritz Bächer, Derek Bradley, Markus Gross, Thabo Beeler

    Abstract: Facial animation is one of the most challenging problems in computer graphics, and it is often solved using linear heuristics like blend-shape rigging. More expressive approaches like physical simulation have emerged, but these methods are very difficult to tune, especially when simulating a real actor's face. We propose to use a simple finite element simulation approach for face animation, and pr… ▽ More

    Submitted 24 July, 2019; originally announced July 2019.

    ACM Class: I.3.5; I.3.7