Skip to main content

Showing 1–13 of 13 results for author: Zanfir, A

.
  1. arXiv:2404.00485  [pdf, other

    cs.CV

    DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans

    Authors: Akash Sengupta, Thiemo Alldieck, Nikos Kolotouros, Enric Corona, Andrei Zanfir, Cristian Sminchisescu

    Abstract: We present DiffHuman, a probabilistic method for photorealistic 3D human reconstruction from a single RGB image. Despite the ill-posed nature of this problem, most methods are deterministic and output a single solution, often resulting in a lack of geometric detail and blurriness in unseen or uncertain regions. In contrast, DiffHuman predicts a probability distribution over 3D reconstructions cond… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: CVPR 2024

  2. arXiv:2403.08764  [pdf, other

    cs.CV

    VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

    Authors: Enric Corona, Andrei Zanfir, Eduard Gabriel Bazavan, Nikos Kolotouros, Thiemo Alldieck, Cristian Sminchisescu

    Abstract: We propose VLOGGER, a method for audio-driven human video generation from a single input image of a person, which builds on the success of recent generative diffusion models. Our method consists of 1) a stochastic human-to-3d-motion diffusion model, and 2) a novel diffusion-based architecture that augments text-to-image models with both spatial and temporal controls. This supports the generation o… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Project web: https://enriccorona.github.io/vlogger/

  3. arXiv:2311.02461  [pdf, other

    cs.CV

    SPHEAR: Spherical Head Registration for Complete Statistical 3D Modeling

    Authors: Eduard Gabriel Bazavan, Andrei Zanfir, Thiemo Alldieck, Teodor Alexandru Szente, Mihai Zanfir, Cristian Sminchisescu

    Abstract: We present \emph{SPHEAR}, an accurate, differentiable parametric statistical 3D human head model, enabled by a novel 3D registration method based on spherical embeddings. We shift the paradigm away from the classical Non-Rigid Registration methods, which operate under various surface priors, increasing reconstruction fidelity and minimizing required human intervention. Additionally, SPHEAR is a \e… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: To be published at the International Conference on 3D Vision 2024

  4. arXiv:2309.05782  [pdf, other

    cs.CV

    Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction

    Authors: Ivan Grishchenko, Geng Yan, Eduard Gabriel Bazavan, Andrei Zanfir, Nikolai Chinaev, Karthik Raveendran, Matthias Grundmann, Cristian Sminchisescu

    Abstract: We present Blendshapes GHUM, an on-device ML pipeline that predicts 52 facial blendshape coefficients at 30+ FPS on modern mobile phones, from a single monocular RGB image and enables facial motion capture applications like virtual avatars. Our main contributions are: i) an annotation-free offline method for obtaining blendshape coefficients from real-world human scans, ii) a lightweight real-time… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 4 pages, 3 figures

  5. arXiv:2306.09329  [pdf, other

    cs.CV

    DreamHuman: Animatable 3D Avatars from Text

    Authors: Nikos Kolotouros, Thiemo Alldieck, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Fieraru, Cristian Sminchisescu

    Abstract: We present DreamHuman, a method to generate realistic animatable 3D human avatar models solely from textual descriptions. Recent text-to-3D methods have made considerable strides in generation, but are still lacking in important aspects. Control and often spatial resolution remain limited, existing methods produce fixed rather than animated 3D human models, and anthropometric consistency for compl… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Project website at https://dream-human.github.io/

  6. arXiv:2212.07729  [pdf, other

    cs.CV

    HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving

    Authors: Andrei Zanfir, Mihai Zanfir, Alexander Gorban, **gwei Ji, Yin Zhou, Dragomir Anguelov, Cristian Sminchisescu

    Abstract: Autonomous driving is an exciting new industry, posing important research questions. Within the perception module, 3D human pose estimation is an emerging technology, which can enable the autonomous vehicle to perceive and understand the subtle and complex behaviors of pedestrians. While hardware systems and sensors have dramatically improved over the decades -- with cars potentially boasting comp… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Published at the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand

  7. arXiv:2212.06820  [pdf, other

    cs.CV

    Structured 3D Features for Reconstructing Controllable Avatars

    Authors: Enric Corona, Mihai Zanfir, Thiemo Alldieck, Eduard Gabriel Bazavan, Andrei Zanfir, Cristian Sminchisescu

    Abstract: We introduce Structured 3D Features, a model based on a novel implicit 3D representation that pools pixel-aligned image features onto dense 3D points sampled from a parametric, statistical human mesh surface. The 3D points have associated semantics and can move freely in 3D space. This allows for optimal coverage of the person of interest, beyond just the body shape, which in turn, additionally he… ▽ More

    Submitted 15 April, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted at CVPR 2023. Project page: https://enriccorona.github.io/s3f/, Video: https://www.youtube.com/watch?v=mcZGcQ6L-2s

  8. arXiv:2206.11678  [pdf, other

    cs.CV

    BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose Estimation

    Authors: Ivan Grishchenko, Valentin Bazarevsky, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, Richard Yee, Karthik Raveendran, Matsvei Zhdanovich, Matthias Grundmann, Cristian Sminchisescu

    Abstract: We present BlazePose GHUM Holistic, a lightweight neural network pipeline for 3D human body landmarks and pose estimation, specifically tailored to real-time on-device inference. BlazePose GHUM Holistic enables motion capture from a single RGB image including avatar control, fitness tracking and AR/VR effects. Our main contributions include i) a novel method for 3D ground truth data acquisition, i… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 4 pages, 4 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022

  9. arXiv:2112.12867  [pdf, other

    cs.CV

    HSPACE: Synthetic Parametric Humans Animated in Complex Environments

    Authors: Eduard Gabriel Bazavan, Andrei Zanfir, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: Advances in the state of the art for 3d human sensing are currently limited by the lack of visual datasets with 3d ground truth, including multiple people, in motion, operating in real-world environments, with complex illumination or occlusion, and potentially observed by a moving camera. Sophisticated scene understanding would require estimating human pose and shape as well as gestures, towards r… ▽ More

    Submitted 6 January, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

  10. arXiv:2106.09336  [pdf, other

    cs.CV

    THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers

    Authors: Mihai Zanfir, Andrei Zanfir, Eduard Gabriel Bazavan, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: We present THUNDR, a transformer-based deep neural network methodology to reconstruct the 3d pose and shape of people, given monocular RGB images. Key to our methodology is an intermediate 3d marker representation, where we aim to combine the predictive power of model-free-output architectures and the regularizing, anthropometrically-preserving properties of a statistical human surface model like… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  11. arXiv:2008.06910  [pdf, other

    cs.CV

    Neural Descent for Visual 3D Human Pose and Shape

    Authors: Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: We present deep neural network methodology to reconstruct the 3d pose and shape of people, given an input RGB image. We rely on a recently introduced, expressivefull body statistical 3d human model, GHUM, trained end-to-end, and learn to reconstruct its pose and shape state in a self-supervised regime. Central to our methodology, is a learning to learn and optimize approach, referred to as HUmanNe… ▽ More

    Submitted 14 June, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: CVPR 2021

  12. arXiv:2003.10350  [pdf, other

    cs.CV

    Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows

    Authors: Andrei Zanfir, Eduard Gabriel Bazavan, Hongyi Xu, Bill Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: Monocular 3D human pose and shape estimation is challenging due to the many degrees of freedom of the human body and thedifficulty to acquire training data for large-scale supervised learning in complex visual scenes. In this paper we present practical semi-supervised and self-supervised models that support training and good generalization in real-world images and video. Our formulation is based o… ▽ More

    Submitted 22 August, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Journal ref: ECCV 2020

  13. arXiv:1909.10307  [pdf, other

    cs.CV

    Human Synthesis and Scene Compositing

    Authors: Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu

    Abstract: Generating good quality and geometrically plausible synthetic images of humans with the ability to control appearance, pose and shape parameters, has become increasingly important for a variety of tasks ranging from photo editing, fashion virtual try-on, to special effects and image compression. In this paper, we propose HUSC, a HUman Synthesis and Scene Compositing framework for the realistic syn… ▽ More

    Submitted 18 October, 2019; v1 submitted 23 September, 2019; originally announced September 2019.