Skip to main content

Showing 1–19 of 19 results for author: Bagautdinov, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15891  [pdf, other

    cs.CV cs.GR cs.LG

    Score Distillation via Reparametrized DDIM

    Authors: Artem Lukoianov, Haitz Sáez de Ocáriz Borde, Kristjan Greenewald, Vitor Campagnolo Guizilini, Timur Bagautdinov, Vincent Sitzmann, Justin Solomon

    Abstract: While 2D diffusion models generate realistic, high-detail images, 3D shape generation methods like Score Distillation Sampling (SDS) built on these 2D diffusion models produce cartoon-like, over-smoothed shapes. To help explain this discrepancy, we show that the image guidance used in Score Distillation can be understood as the velocity field of a 2D denoising generative process, up to the choice… ▽ More

    Submitted 13 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Preprint. 25 pages, 26 figures. Revision : added missed comparisons, fixed typos, fixed PDF compatibility issues

  2. arXiv:2401.01885  [pdf, other

    cs.CV

    From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

    Authors: Evonne Ng, Javier Romero, Timur Bagautdinov, Shaojie Bai, Trevor Darrell, Angjoo Kanazawa, Alexander Richard

    Abstract: We present a framework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction. Given speech audio, we output multiple possibilities of gestural motion for an individual, including face, body, and hands. The key behind our method is in combining the benefits of sample diversity from vector quantization with the high-frequency… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  3. arXiv:2311.08581  [pdf, other

    cs.CV

    Drivable 3D Gaussian Avatars

    Authors: Wojciech Zielonka, Timur Bagautdinov, Shunsuke Saito, Michael Zollhöfer, Justus Thies, Javier Romero

    Abstract: We present Drivable 3D Gaussian Avatars (D3GA), the first 3D controllable model for human bodies rendered with Gaussian splats. Current photorealistic drivable avatars require either accurate 3D registrations during training, dense input images during testing, or both. The ones based on neural radiance fields also tend to be prohibitively slow for telepresence applications. This work uses the rece… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Website: https://zielon.github.io/d3ga/

  4. Drivable Avatar Clothing: Faithful Full-Body Telepresence with Dynamic Clothing Driven by Sparse RGB-D Input

    Authors: Donglai Xiang, Fabian Prada, Zhe Cao, Kaiwen Guo, Chenglei Wu, Jessica Hodgins, Timur Bagautdinov

    Abstract: Clothing is an important part of human appearance but challenging to model in photorealistic avatars. In this work we present avatars with dynamically moving loose clothing that can be faithfully driven by sparse RGB-D inputs as well as body and face motion. We propose a Neural Iterative Closest Point (N-ICP) algorithm that can efficiently track the coarse garment shape given sparse depth input. G… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: SIGGRAPH Asia 2023 Conference Paper. Project website: https://xiangdonglai.github.io/www-sa23-drivable-clothing/

  5. arXiv:2304.02013  [pdf, other

    cs.CV

    NPC: Neural Point Characters from Video

    Authors: Shih-Yang Su, Timur Bagautdinov, Helge Rhodin

    Abstract: High-fidelity human 3D models can now be learned directly from videos, typically by combining a template-based surface model with neural representations. However, obtaining a template surface requires expensive multi-view capture systems, laser scans, or strictly controlled conditions. Previous methods avoid using a template but rely on a costly or ill-posed map** from observation to canonical s… ▽ More

    Submitted 1 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Project website: https://lemonatsu.github.io/npc/

  6. arXiv:2302.04866  [pdf, other

    cs.CV cs.GR

    RelightableHands: Efficient Neural Relighting of Articulated Hand Models

    Authors: Shun Iwase, Shunsuke Saito, Tomas Simon, Stephen Lombardi, Timur Bagautdinov, Rohan Joshi, Fabian Prada, Takaaki Shiratori, Yaser Sheikh, Jason Saragih

    Abstract: We present the first neural relighting approach for rendering high-fidelity personalized hands that can be animated in real-time under novel illumination. Our approach adopts a teacher-student framework, where the teacher learns appearance under a single point light from images captured in a light-stage, allowing us to synthesize hands in arbitrary illuminations but with heavy compute. Using image… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 8 pages, 16 figures, Website: https://sh8.io/#/relightable_hands

  7. arXiv:2207.09774  [pdf, other

    cs.CV

    Drivable Volumetric Avatars using Texel-Aligned Features

    Authors: Edoardo Remelli, Timur Bagautdinov, Shunsuke Saito, Tomas Simon, Chenglei Wu, Shih-En Wei, Kaiwen Guo, Zhe Cao, Fabian Prada, Jason Saragih, Yaser Sheikh

    Abstract: Photorealistic telepresence requires both high-fidelity body modeling and faithful driving to enable dynamically synthesized appearance that is indistinguishable from reality. In this work, we propose an end-to-end framework that addresses two core challenges in modeling and driving full-body avatars of real people. One challenge is driving an avatar while staying faithful to details and dynamics… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Journal ref: SIGGRAPH 2022 Conference Proceedings

  8. Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated Clothing

    Authors: Donglai Xiang, Timur Bagautdinov, Tuur Stuyck, Fabian Prada, Javier Romero, Weipeng Xu, Shunsuke Saito, **gfan Guo, Breannan Smith, Takaaki Shiratori, Yaser Sheikh, Jessica Hodgins, Chenglei Wu

    Abstract: Despite recent progress in develo** animatable full-body avatars, realistic modeling of clothing - one of the core aspects of human self-expression - remains an open challenge. State-of-the-art physical simulation methods can generate realistically behaving clothing geometry at interactive rates. Modeling photorealistic appearance, however, usually requires physically-based rendering which is to… ▽ More

    Submitted 19 September, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: SIGGRAPH Asia 2022 (ACM ToG) camera ready. The supplementary video can be found on https://research.facebook.com/publications/dressing-avatars-deep-photorealistic-appearance-for-physically-simulated-clothing/

  9. arXiv:2206.03373  [pdf, other

    cs.CV

    Garment Avatars: Realistic Cloth Driving using Pattern Registration

    Authors: Oshri Halimi, Fabian Prada, Tuur Stuyck, Donglai Xiang, Timur Bagautdinov, He Wen, Ron Kimmel, Takaaki Shiratori, Chenglei Wu, Yaser Sheikh

    Abstract: Virtual telepresence is the future of online communication. Clothing is an essential part of a person's identity and self-expression. Yet, ground truth data of registered clothes is currently unavailable in the required resolution and accuracy for training telepresence models for realistic cloth animation. Here, we propose an end-to-end pipeline for building drivable representations for clothing.… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  10. arXiv:2205.01666  [pdf, other

    cs.CV

    DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks

    Authors: Shih-Yang Su, Timur Bagautdinov, Helge Rhodin

    Abstract: Deep learning greatly improved the realism of animatable human models by learning geometry and appearance from collections of 3D scans, template meshes, and multi-view imagery. High-resolution models enable photo-realistic avatars but at the cost of requiring studio settings not available to end users. Our goal is to create avatars directly from raw images without relying on expensive studio setup… ▽ More

    Submitted 11 October, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: ECCV 2022. Project website: https://lemonatsu.github.io/danbo

  11. arXiv:2203.13817  [pdf, other

    cs.CV cs.GR

    AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling

    Authors: Ziqian Bai, Timur Bagautdinov, Javier Romero, Michael Zollhöfer, ** Tan, Shunsuke Saito

    Abstract: Neural fields such as implicit surfaces have recently enabled avatar modeling from raw scans without explicit temporal correspondences. In this work, we exploit autoregressive modeling to further extend this notion to capture dynamic effects, such as soft-tissue deformations. Although autoregressive models are naturally capable of handling dynamics, it is non-trivial to apply them to implicit repr… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Project page: https://zqbai-jeremy.github.io/autoavatar

  12. Modeling Clothing as a Separate Layer for an Animatable Human Avatar

    Authors: Donglai Xiang, Fabian Prada, Timur Bagautdinov, Weipeng Xu, Yuan Dong, He Wen, Jessica Hodgins, Chenglei Wu

    Abstract: We have recently seen great progress in building photorealistic animatable full-body codec avatars, but generating high-fidelity animation of clothing is still difficult. To address these difficulties, we propose a method to build an animatable clothed body avatar with an explicit representation of the clothing on the upper body from multi-view captured videos. We use a two-layer mesh representati… ▽ More

    Submitted 4 October, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Camera ready for SIGGRAPH Asia 2021 Technical Papers. https://research.fb.com/publications/modeling-clothing-as-a-separate-layer-for-an-animatable-human-avatar/

  13. arXiv:2106.11795  [pdf, other

    cs.CV

    DeepMesh: Differentiable Iso-Surface Extraction

    Authors: Benoit Guillard, Edoardo Remelli, Artem Lukoianov, Stephan R. Richter, Timur Bagautdinov, Pierre Baque, Pascal Fua

    Abstract: Geometric Deep Learning has recently made striking progress with the advent of continuous deep implicit fields. They allow for detailed modeling of watertight surfaces of arbitrary topology while not relying on a 3D Euclidean grid, resulting in a learnable parameterization that is unlimited in resolution. Unfortunately, these methods are often unsuitable for applications that require an explicit… ▽ More

    Submitted 24 March, 2022; v1 submitted 20 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2006.03997

  14. arXiv:2105.10441  [pdf, other

    cs.CV cs.AI cs.GR

    Driving-Signal Aware Full-Body Avatars

    Authors: Timur Bagautdinov, Chenglei Wu, Tomas Simon, Fabian Prada, Takaaki Shiratori, Shih-En Wei, Weipeng Xu, Yaser Sheikh, Jason Saragih

    Abstract: We present a learning-based method for building driving-signal aware full-body avatars. Our model is a conditional variational autoencoder that can be animated with incomplete driving signals, such as human pose and facial keypoints, and produces a high-quality representation of human geometry and view-dependent appearance. The core intuition behind our method is that better drivability and genera… ▽ More

    Submitted 25 June, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

  15. arXiv:2012.09955  [pdf, other

    cs.CV cs.GR

    Learning Compositional Radiance Fields of Dynamic Human Heads

    Authors: Ziyan Wang, Timur Bagautdinov, Stephen Lombardi, Tomas Simon, Jason Saragih, Jessica Hodgins, Michael Zollhöfer

    Abstract: Photorealistic rendering of dynamic humans is an important ability for telepresence systems, virtual shop**, synthetic data generation, and more. Recently, neural rendering methods, which combine techniques from computer graphics and machine learning, have created high-fidelity models of humans and objects. Some of these methods do not produce results with high-enough fidelity for driveable huma… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  16. arXiv:2012.08334  [pdf, other

    cs.LG cs.CV

    Masksembles for Uncertainty Estimation

    Authors: Nikita Durasov, Timur Bagautdinov, Pierre Baque, Pascal Fua

    Abstract: Deep neural networks have amply demonstrated their prowess but estimating the reliability of their predictions remains challenging. Deep Ensembles are widely considered as being one of the best methods for generating uncertainty estimates but are very expensive to train and evaluate. MC-Dropout is another popular alternative, which is less expensive, but also less reliable. Our central intuition i… ▽ More

    Submitted 25 June, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 13539-13548

  17. arXiv:2006.03997  [pdf, other

    cs.CV

    MeshSDF: Differentiable Iso-Surface Extraction

    Authors: Edoardo Remelli, Artem Lukoianov, Stephan R. Richter, Benoît Guillard, Timur Bagautdinov, Pierre Baque, Pascal Fua

    Abstract: Geometric Deep Learning has recently made striking progress with the advent of continuous Deep Implicit Fields. They allow for detailed modeling of watertight surfaces of arbitrary topology while not relying on a 3D Euclidean grid, resulting in a learnable parameterization that is not limited in resolution. Unfortunately, these methods are often not suitable for applications that require an expl… ▽ More

    Submitted 31 October, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: 22 pages, 16 figures, Neural Information Processing Systems (NeurIPS 2020)

    MSC Class: 30L05 ACM Class: I.2.10; I.4.8; J.6

  18. arXiv:1611.09078  [pdf, other

    cs.CV

    Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition

    Authors: Timur Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese

    Abstract: We present a unified framework for understanding human social behaviors in raw image sequences. Our model jointly detects multiple individuals, infers their social actions, and estimates the collective actions with a single feed-forward pass through a neural network. We propose a single architecture that does not rely on external detection algorithms but rather is trained end-to-end to generate de… ▽ More

    Submitted 28 November, 2016; originally announced November 2016.

  19. arXiv:1511.06103  [pdf, other

    cs.CV cs.LG

    Principled Parallel Mean-Field Inference for Discrete Random Fields

    Authors: Pierre Baqué, Timur Bagautdinov, François Fleuret, Pascal Fua

    Abstract: Mean-field variational inference is one of the most popular approaches to inference in discrete random fields. Standard mean-field optimization is based on coordinate descent and in many situations can be impractical. Thus, in practice, various parallel techniques are used, which either rely on ad-hoc smoothing with heuristically set parameters, or put strong constraints on the type of models. In… ▽ More

    Submitted 3 December, 2015; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: The first two authors contributed equally