Showing 1–2 of 2 results for author: Gafni, G

Search v0.5.6 released 2020-02-24

arXiv:2308.07415 [pdf, other]

cs.CV cs.GR

Semantify: Simplifying the Control of 3D Morphable Models using CLIP

Authors: Omer Gralnik, Guy Gafni, Ariel Shamir

Abstract: We present Semantify: a self-supervised method that utilizes the semantic power of CLIP language-vision foundation model to simplify the control of 3D morphable models. Given a parametric model, training data is created by randomly sampling the model's parameters, creating various shapes and rendering them. The similarity between the output images and a set of word descriptors is calculated in CLI… ▽ More We present Semantify: a self-supervised method that utilizes the semantic power of CLIP language-vision foundation model to simplify the control of 3D morphable models. Given a parametric model, training data is created by randomly sampling the model's parameters, creating various shapes and rendering them. The similarity between the output images and a set of word descriptors is calculated in CLIP's latent space. Our key idea is first to choose a small set of semantically meaningful and disentangled descriptors that characterize the 3DMM, and then learn a non-linear map** from scores across this set to the parametric coefficients of the given 3DMM. The non-linear map** is defined by training a neural network without a human-in-the-loop. We present results on numerous 3DMMs: body shape models, face shape and expression models, as well as animal shapes. We demonstrate how our method defines a simple slider interface for intuitive modeling, and show how the map** can be used to instantly fit a 3D parametric body shape to in-the-wild images. △ Less

Submitted 14 August, 2023; originally announced August 2023.
arXiv:2012.03065 [pdf, other]

cs.CV cs.GR

Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction

Authors: Guy Gafni, Justus Thies, Michael Zollhöfer, Matthias Nießner

Abstract: We present dynamic neural radiance fields for modeling the appearance and dynamics of a human face. Digitally modeling and reconstructing a talking human is a key building-block for a variety of applications. Especially, for telepresence applications in AR or VR, a faithful reproduction of the appearance including novel viewpoints or head-poses is required. In contrast to state-of-the-art approach… ▽ More We present dynamic neural radiance fields for modeling the appearance and dynamics of a human face. Digitally modeling and reconstructing a talking human is a key building-block for a variety of applications. Especially, for telepresence applications in AR or VR, a faithful reproduction of the appearance including novel viewpoints or head-poses is required. In contrast to state-of-the-art approaches that model the geometry and material properties explicitly, or are purely image-based, we introduce an implicit representation of the head based on scene representation networks. To handle the dynamics of the face, we combine our scene representation network with a low-dimensional morphable model which provides explicit control over pose and expressions. We use volumetric rendering to generate images from this hybrid representation and demonstrate that such a dynamic neural scene representation can be learned from monocular input data only, without the need of a specialized capture setup. In our experiments, we show that this learned volumetric representation allows for photo-realistic image generation that surpasses the quality of state-of-the-art video-based reenactment methods. △ Less

Submitted 5 December, 2020; originally announced December 2020.

Comments: Video: https://youtu.be/m7oROLdQnjk | Project page: https://gafniguy.github.io/4D-Facial-Avatars/

Search v0.5.6 released 2020-02-24