Showing 1–14 of 14 results for author: Zanfir, M

  1. arXiv:2311.02461  [pdf, other]

    cs.CV

    SPHEAR: Spherical Head Registration for Complete Statistical 3D Modeling

    Authors: Eduard Gabriel Bazavan, Andrei Zanfir, Thiemo Alldieck, Teodor Alexandru Szente, Mihai Zanfir, Cristian Sminchisescu

    Abstract: We present SPHEAR, an accurate, differentiable parametric statistical 3D human head model, enabled by a novel 3D registration method based on spherical embeddings. We shift the paradigm away from the classical Non-Rigid Registration methods, which operate under various surface priors, increasing reconstruction fidelity and minimizing required human intervention. Additionally, SPHEAR is a …

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: To be published at the International Conference on 3D Vision 2024

  2. arXiv:2308.01854  [pdf, other]

    cs.CV

    Reconstructing Three-Dimensional Models of Interacting Humans

    Authors: Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

    Abstract: Understanding 3d human interactions is fundamental for fine-grained scene analysis and behavioural modeling. However, most of the existing models predict incorrect, lifeless 3d estimates, that miss the subtle human contact aspects--the essence of the event--and are of little use for detailed behavioral understanding. This paper addresses such issues with several contributions: (1) we introduce mod…

    Submitted 4 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

  3. arXiv:2212.07729  [pdf, other]

    cs.CV

    HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving

    Authors: Andrei Zanfir, Mihai Zanfir, Alexander Gorban, Jingwei Ji, Yin Zhou, Dragomir Anguelov, Cristian Sminchisescu

    Abstract: Autonomous driving is an exciting new industry, posing important research questions. Within the perception module, 3D human pose estimation is an emerging technology, which can enable the autonomous vehicle to perceive and understand the subtle and complex behaviors of pedestrians. While hardware systems and sensors have dramatically improved over the decades -- with cars potentially boasting comp…

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Published at the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand

  4. arXiv:2212.07275  [pdf, other]

    cs.CV

    PhoMoH: Implicit Photorealistic 3D Models of Human Heads

    Authors: Mihai Zanfir, Thiemo Alldieck, Cristian Sminchisescu

    Abstract: We present PhoMoH, a neural network methodology to construct generative models of photo-realistic 3D geometry and appearance of human heads including hair, beards, an oral cavity, and clothing. In contrast to prior work, PhoMoH models the human head using neural fields, thus supporting complex topology. Instead of learning a head model from scratch, we propose to augment an existing expressive hea…

    Submitted 24 October, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: To be published at the International Conference on 3D Vision 2024

  5. arXiv:2212.06820  [pdf, other]

    cs.CV

    Structured 3D Features for Reconstructing Controllable Avatars

    Authors: Enric Corona, Mihai Zanfir, Thiemo Alldieck, Eduard Gabriel Bazavan, Andrei Zanfir, Cristian Sminchisescu

    Abstract: We introduce Structured 3D Features, a model based on a novel implicit 3D representation that pools pixel-aligned image features onto dense 3D points sampled from a parametric, statistical human mesh surface. The 3D points have associated semantics and can move freely in 3D space. This allows for optimal coverage of the person of interest, beyond just the body shape, which in turn, additionally he…

    Submitted 15 April, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted at CVPR 2023. Project page: https://enriccorona.github.io/s3f/, Video: https://www.youtube.com/watch?v=mcZGcQ6L-2s

  6. arXiv:2206.11678  [pdf, other]

    cs.CV

    BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose Estimation

    Authors: Ivan Grishchenko, Valentin Bazarevsky, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, Richard Yee, Karthik Raveendran, Matsvei Zhdanovich, Matthias Grundmann, Cristian Sminchisescu

    Abstract: We present BlazePose GHUM Holistic, a lightweight neural network pipeline for 3D human body landmarks and pose estimation, specifically tailored to real-time on-device inference. BlazePose GHUM Holistic enables motion capture from a single RGB image including avatar control, fitness tracking and AR/VR effects. Our main contributions include i) a novel method for 3D ground truth data acquisition, i…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 4 pages, 4 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022

  7. arXiv:2204.08906  [pdf, other]

    cs.CV

    Photorealistic Monocular 3D Reconstruction of Humans Wearing Clothing

    Authors: Thiemo Alldieck, Mihai Zanfir, Cristian Sminchisescu

    Abstract: We present PHORHUM, a novel, end-to-end trainable, deep neural network methodology for photorealistic 3D human reconstruction given just a monocular RGB image. Our pixel-aligned method estimates detailed 3D geometry and, for the first time, the unshaded surface color together with the scene illumination. Observing that 3D supervision alone is not sufficient for high fidelity color reconstruction,…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: https://phorhum.github.io/

  8. arXiv:2112.12867  [pdf, other]

    cs.CV

    HSPACE: Synthetic Parametric Humans Animated in Complex Environments

    Authors: Eduard Gabriel Bazavan, Andrei Zanfir, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: Advances in the state of the art for 3d human sensing are currently limited by the lack of visual datasets with 3d ground truth, including multiple people, in motion, operating in real-world environments, with complex illumination or occlusion, and potentially observed by a moving camera. Sophisticated scene understanding would require estimating human pose and shape as well as gestures, towards r…

    Submitted 6 January, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

  9. arXiv:2106.09336  [pdf, other]

    cs.CV

    THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers

    Authors: Mihai Zanfir, Andrei Zanfir, Eduard Gabriel Bazavan, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: We present THUNDR, a transformer-based deep neural network methodology to reconstruct the 3d pose and shape of people, given monocular RGB images. Key to our methodology is an intermediate 3d marker representation, where we aim to combine the predictive power of model-free-output architectures and the regularizing, anthropometrically-preserving properties of a statistical human surface model like…

    Submitted 17 June, 2021; originally announced June 2021.

  10. arXiv:2012.10366  [pdf, other]

    cs.CV

    Learning Complex 3D Human Self-Contact

    Authors: Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

    Abstract: Monocular estimation of three dimensional human self-contact is fundamental for detailed scene analysis including body language understanding and behaviour modeling. Existing 3d reconstruction methods do not focus on body regions in self-contact and consequently recover configurations that are either far from each other or self-intersecting, when they should just touch. This leads to perceptually…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: To be published in the Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI-2021)

  11. arXiv:2008.06910  [pdf, other]

    cs.CV

    Neural Descent for Visual 3D Human Pose and Shape

    Authors: Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: We present deep neural network methodology to reconstruct the 3d pose and shape of people, given an input RGB image. We rely on a recently introduced, expressive full body statistical 3d human model, GHUM, trained end-to-end, and learn to reconstruct its pose and shape state in a self-supervised regime. Central to our methodology, is a learning to learn and optimize approach, referred to as HUmanNe…

    Submitted 14 June, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: CVPR 2021

  12. arXiv:1909.10307  [pdf, other]

    cs.CV

    Human Synthesis and Scene Compositing

    Authors: Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu

    Abstract: Generating good quality and geometrically plausible synthetic images of humans with the ability to control appearance, pose and shape parameters, has become increasingly important for a variety of tasks ranging from photo editing, fashion virtual try-on, to special effects and image compression. In this paper, we propose HUSC, a HUman Synthesis and Scene Compositing framework for the realistic syn…

    Submitted 18 October, 2019; v1 submitted 23 September, 2019; originally announced September 2019.

  13. arXiv:1701.08985  [pdf, other]

    cs.CV

    Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

    Authors: Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu

    Abstract: We propose a deep multitask architecture for fully automatic 2d and 3d human sensing (DMHS), including recognition and reconstruction, in monocular images. The system computes the figure-ground segmentation, semantically identifies the human body parts at pixel level, and estimates the 2d and 3d pose of the person. The model supports the joint training of all components by mea…

    Submitted 31 January, 2017; originally announced January 2017.

  14. arXiv:1610.04997  [pdf, other]

    cs.CV

    Spatio-Temporal Attention Models for Grounded Video Captioning

    Authors: Mihai Zanfir, Elisabeta Marinoiu, Cristian Sminchisescu

    Abstract: Automatic video captioning is challenging due to the complex interactions in dynamic real scenes. A comprehensive system would ultimately localize and track the objects, actions and interactions present in a video and generate a description that relies on temporal localization in order to ground the visual concepts. However, most existing automatic video captioning systems map from raw video data…

    Submitted 18 October, 2016; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: To appear in Asian Conference on Computer Vision (ACCV), Taipei, Taiwan, 2016