Showing 1–19 of 19 results for author: Bhatnagar, B L

Searching in archive cs.
  1. arXiv:2404.01758  [pdf, other]

    cs.CV

    GEARS: Local Geometry-aware Hand-object Interaction Synthesis

    Authors: Keyang Zhou, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll

    Abstract: Generating realistic hand motion sequences in interaction with objects has gained increasing attention with the growing interest in digital humans. Prior work has illustrated the effectiveness of employing occupancy-based or distance-based virtual sensors to extract hand-object interaction features. Nonetheless, these methods show limited generalizability across object categories, shapes and sizes…

    Submitted 11 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  2. arXiv:2403.11237  [pdf, other]

    cs.CV

    FORCE: Dataset and Method for Intuitive Physics Guided Human-object Interaction

    Authors: Xiaohan Zhang, Bharat Lal Bhatnagar, Sebastian Starke, Ilya Petrov, Vladimir Guzov, Helisa Dhamo, Eduardo Pérez-Pellitero, Gerard Pons-Moll

    Abstract: Interactions between human and objects are influenced not only by the object's pose and shape, but also by physical attributes such as object mass and surface friction. They introduce important motion nuances that are essential for diversity and realism. Despite advancements in recent kinematics-based methods, this aspect has been overlooked. Generating nuanced human motion presents two challenges…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 24 pages, 9 figures

  3. arXiv:2401.08570  [pdf, other]

    cs.CV

    RoHM: Robust Human Motion Reconstruction via Diffusion

    Authors: Siwei Zhang, Bharat Lal Bhatnagar, Yuanlu Xu, Alexander Winkler, Petr Kadlecek, Siyu Tang, Federica Bogo

    Abstract: We propose RoHM, an approach for robust 3D human motion reconstruction from monocular RGB(-D) videos in the presence of noise and occlusions. Most previous approaches either train neural networks to directly regress motion in 3D or learn data-driven motion priors and combine them with optimization at test time. The former do not recover globally coherent motion and fail under occlusions; the latte…

    Submitted 15 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: With the appendix included

  4. arXiv:2401.04143  [pdf, other]

    cs.CV

    RHOBIN Challenge: Reconstruction of Human Object Interaction

    Authors: Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeong** Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

    Abstract: Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 tables, 7 figures. Technical report of the CVPR'23 workshop: RHOBIN challenge (https://rhobin-challenge.github.io/)

  5. arXiv:2312.07063  [pdf, other]

    cs.CV

    Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation

    Authors: Xianghui Xie, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll

    Abstract: Reconstructing human-object interaction in 3D from a single RGB image is a challenging task and existing data driven methods do not generalize beyond the objects present in the carefully curated 3D interaction datasets. Capturing large-scale real data to learn strong interaction and 3D shape priors is very expensive due to the combinatorial nature of human-object interactions. In this paper, we pr…

    Submitted 6 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: CVPR'24 camera ready version. 25 pages, 20 figures. Project page: https://virtualhumans.mpi-inf.mpg.de/procigen-hdm

  6. arXiv:2311.13655  [pdf, other]

    cs.CV

    GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

    Authors: Berna Kabadayi, Wojciech Zielonka, Bharat Lal Bhatnagar, Gerard Pons-Moll, Justus Thies

    Abstract: Digital humans and, especially, 3D facial avatars have raised a lot of attention in the past years, as they are the backbone of several applications like immersive telepresence in AR or VR. Despite the progress, facial avatars reconstructed from commodity hardware are incomplete and miss out on parts of the side and back of the head, severely limiting the usability of the avatar. This limitation i…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Website: https://ganavatar.github.io/, Video: https://www.youtube.com/watch?v=uAi5IVrzzZY&ab_channel=JustusThies, Accepted to 3DV 2024

  7. arXiv:2308.14847  [pdf, other]

    cs.CV

    NSF: Neural Surface Fields for Human Modeling from Monocular Depth

    Authors: Yuxuan Xue, Bharat Lal Bhatnagar, Riccardo Marin, Nikolaos Sarafianos, Yuanlu Xu, Gerard Pons-Moll, Tony Tung

    Abstract: Obtaining personalized 3D animatable avatars from a monocular camera has several real world applications in gaming, virtual try-on, animation, and VR/XR, etc. However, it is very challenging to model dynamic and fine-grained clothing deformations from such sparse data. Existing methods for modeling 3D humans from depth data have limitations in terms of computational efficiency, mesh coherency, and…

    Submitted 27 October, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023; Homepage at: https://yuxuan-xue.com/nsf

  8. arXiv:2303.16479  [pdf, other]

    cs.CV

    Visibility Aware Human-Object Interaction Tracking from Single RGB Camera

    Authors: Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll

    Abstract: Capturing the interactions between humans and their environment in 3D is important for many applications in robotics, graphics, and vision. Recent works to reconstruct the 3D human and object from a single RGB image do not have consistent relative translation across frames because they assume a fixed depth. Moreover, their performance drops significantly when the object is occluded. In this work,…

    Submitted 31 October, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023, edited acknowledgement

  9. arXiv:2205.07982  [pdf, other]

    cs.CV

    TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement

    Authors: Keyang Zhou, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll

    Abstract: We present TOCH, a method for refining incorrect 3D hand-object interaction sequences using a data prior. Existing hand trackers, especially those that rely on very few cameras, often produce visually unrealistic results with hand-object intersection or missing contacts. Although correcting such errors requires reasoning about temporal aspects of interaction, most previous works focus on static gr…

    Submitted 27 October, 2023; v1 submitted 16 May, 2022; originally announced May 2022.

  10. arXiv:2205.00541  [pdf, other]

    cs.CV

    COUCH: Towards Controllable Human-Chair Interactions

    Authors: Xiaohan Zhang, Bharat Lal Bhatnagar, Vladimir Guzov, Sebastian Starke, Gerard Pons-Moll

    Abstract: Humans interact with an object in many different ways by making contact at different locations, creating a highly complex motion space that can be difficult to learn, particularly when synthesizing such human interactions in a controllable manner. Existing works on synthesizing human scene interaction focus on the high-level control of action but do not consider the fine-grained control of motion.…

    Submitted 1 May, 2022; originally announced May 2022.

  11. arXiv:2204.06950  [pdf, other]

    cs.CV

    BEHAVE: Dataset and Method for Tracking Human Object Interactions

    Authors: Bharat Lal Bhatnagar, Xianghui Xie, Ilya A. Petrov, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

    Abstract: Modelling interactions between humans and objects in natural environments is central to many applications including gaming, virtual and mixed reality, as well as human behavior analysis and human-robot collaboration. This challenging operation scenario requires generalization to vast number of objects, scenes, and human actions. Unfortunately, there exist no such dataset. Moreover, this data needs…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted at CVPR'22

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022

  12. arXiv:2204.02445  [pdf, other]

    cs.CV

    CHORE: Contact, Human and Object REconstruction from a single RGB image

    Authors: Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll

    Abstract: Most prior works in perceiving 3D humans from images reason human in isolation without their surroundings. However, humans are constantly interacting with the surrounding objects, thus calling for models that can reason about not only the human but also the object and their interaction. The problem is extremely challenging due to heavy occlusions between humans and objects, diverse interaction typ…

    Submitted 31 October, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted at ECCV 2022, edited the acknowledgement

  13. arXiv:2102.01161  [pdf, other]

    cs.CV

    Adjoint Rigid Transform Network: Task-conditioned Alignment of 3D Shapes

    Authors: Keyang Zhou, Bharat Lal Bhatnagar, Bernt Schiele, Gerard Pons-Moll

    Abstract: Most learning methods for 3D data (point clouds, meshes) suffer significant performance drops when the data is not carefully aligned to a canonical orientation. Aligning real world 3D data collected from different sources is non-trivial and requires manual intervention. In this paper, we propose the Adjoint Rigid Transform (ART) Network, a neural module which can be integrated with a variety of 3D…

    Submitted 27 October, 2023; v1 submitted 1 February, 2021; originally announced February 2021.

  14. arXiv:2010.12447  [pdf, other]

    cs.CV

    LoopReg: Self-supervised Learning of Implicit Surface Correspondences, Pose and Shape for 3D Human Mesh Registration

    Authors: Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

    Abstract: We address the problem of fitting 3D human models to 3D scans of dressed humans. Classical methods optimize both the data-to-model correspondences and the human model parameters (pose and shape), but are reliable only when initialized close to the solution. Some methods initialize the optimization based on fully supervised correspondence predictors, which is not differentiable end-to-end, and can…

    Submitted 26 November, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: NeurIPS'20 (Oral)

    Journal ref: NeurIPS 2020

  15. arXiv:2007.11610  [pdf, other]

    cs.CV

    SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing

    Authors: Garvita Tiwari, Bharat Lal Bhatnagar, Tony Tung, Gerard Pons-Moll

    Abstract: While models of 3D clothing learned from real data exist, no method can predict clothing deformation as a function of garment size. In this paper, we introduce SizerNet to predict 3D clothing conditioned on human body shape and garment size parameters, and ParserNet to infer garment meshes and shape under clothing with personal details in a single pass from an input mesh. SizerNet allows to estima…

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: European Conference on Computer Vision 2020

  16. arXiv:2007.11432  [pdf, other]

    cs.CV

    Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction

    Authors: Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

    Abstract: Implicit functions represented as deep learning approximations are powerful for reconstructing 3D surfaces. However, they can only produce static surfaces that are not controllable, which provides limited ability to modify the resulting model by editing its pose or shape parameters. Nevertheless, such features are essential in building flexible models for both computer graphics and computer vision…

    Submitted 26 November, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: Accepted at ECCV'20 (Oral)

  17. arXiv:2007.11341  [pdf, other]

    cs.CV

    Unsupervised Shape and Pose Disentanglement for 3D Meshes

    Authors: Keyang Zhou, Bharat Lal Bhatnagar, Gerard Pons-Moll

    Abstract: Parametric models of humans, faces, hands and animals have been widely used for a range of tasks such as image-based reconstruction, shape correspondence estimation, and animation. Their key strength is the ability to factor surface variations into shape and pose dependent components. Learning such models requires lots of expert knowledge and hand-defined object-specific constraints, making the le…

    Submitted 22 July, 2020; originally announced July 2020.

  18. arXiv:1908.06903  [pdf, other]

    cs.CV

    Multi-Garment Net: Learning to Dress 3D People from Images

    Authors: Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll

    Abstract: We present Multi-Garment Network (MGN), a method to predict body shape and clothing, layered on top of the SMPL model from a few frames (1-8) of a video. Several experiments demonstrate that this representation allows higher level of control when compared to single mesh or voxel representations of shape. Our model allows to predict garment geometry, relate it to the body shape, and transfer it to…

    Submitted 20 August, 2019; v1 submitted 19 August, 2019; originally announced August 2019.

    Comments: International Conference on Computer Vision (ICCV), 2019

  19. arXiv:1903.05885  [pdf, other]

    cs.CV

    Learning to Reconstruct People in Clothing from a Single RGB Camera

    Authors: Thiemo Alldieck, Marcus Magnor, Bharat Lal Bhatnagar, Christian Theobalt, Gerard Pons-Moll

    Abstract: We present a learning-based model to infer the personalized 3D shape of people from a few frames (1-8) of a monocular video in which the person is moving, in less than 10 seconds with a reconstruction accuracy of 5mm. Our model learns to predict the parameters of a statistical body model and instance displacements that add clothing and hair to the shape. The model achieves fast and accurate predic…

    Submitted 8 April, 2019; v1 submitted 14 March, 2019; originally announced March 2019.