Skip to main content

Showing 1–15 of 15 results for author: Bogo, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.08570  [pdf, other

    cs.CV

    RoHM: Robust Human Motion Reconstruction via Diffusion

    Authors: Siwei Zhang, Bharat Lal Bhatnagar, Yuanlu Xu, Alexander Winkler, Petr Kadlecek, Siyu Tang, Federica Bogo

    Abstract: We propose RoHM, an approach for robust 3D human motion reconstruction from monocular RGB(-D) videos in the presence of noise and occlusions. Most previous approaches either train neural networks to directly regress motion in 3D or learn data-driven motion priors and combine them with optimization at test time. The former do not recover globally coherent motion and fail under occlusions; the latte… ▽ More

    Submitted 15 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: With the appendix included

  2. arXiv:2203.05789  [pdf, other

    cs.CV cs.LG

    FLAG: Flow-based 3D Avatar Generation from Sparse Observations

    Authors: Sadegh Aliakbarian, Pashmina Cameron, Federica Bogo, Andrew Fitzgibbon, Thomas J. Cashman

    Abstract: To represent people in mixed reality applications for collaboration and communication, we need to generate realistic and faithful avatar poses. However, the signal streams that can be applied for this task from head-mounted devices (HMDs) are typically limited to head pose and hand pose estimates. While these signals are valuable, they are an incomplete representation of the human body, making it… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  3. arXiv:2202.01493  [pdf, other

    cs.RO cs.CV cs.HC

    Spatial Computing and Intuitive Interaction: Bringing Mixed Reality and Robotics Together

    Authors: Jeffrey Delmerico, Roi Poranne, Federica Bogo, Helen Oleynikova, Eric Vollenweider, Stelian Coros, Juan Nieto, Marc Pollefeys

    Abstract: Spatial computing -- the ability of devices to be aware of their surroundings and to represent this digitally -- offers novel capabilities in human-robot interaction. In particular, the combination of spatial computing and egocentric sensing on mixed reality devices enables them to capture and understand human actions and translate these to actions with spatial meaning, which offers exciting new p… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  4. arXiv:2112.07642  [pdf, other

    cs.CV cs.AI

    EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices

    Authors: Siwei Zhang, Qianli Ma, Yan Zhang, Zhiyin Qian, Taein Kwon, Marc Pollefeys, Federica Bogo, Siyu Tang

    Abstract: Understanding social interactions from egocentric views is crucial for many applications, ranging from assistive robotics to AR/VR. Key to reasoning about interactions is to understand the body pose and motion of the interaction partner from the egocentric view. However, research in this area is severely hindered by the lack of datasets. Existing datasets are limited in terms of either size, captu… ▽ More

    Submitted 16 August, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Camera ready version for ECCV 2022, appendix included

  5. arXiv:2111.14824  [pdf, other

    cs.CV

    Learning to Fit Morphable Models

    Authors: Vasileios Choutas, Federica Bogo, **g**g Shen, Julien Valentin

    Abstract: Fitting parametric models of human bodies, hands or faces to sparse input signals in an accurate, robust, and fast manner has the promise of significantly improving immersion in AR and VR scenarios. A common first step in systems that tackle these problems is to regress the parameters of the parametric model directly from the input data. This approach is fast, robust, and is a good starting point… ▽ More

    Submitted 20 July, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  6. arXiv:2108.10399  [pdf, other

    cs.CV cs.AI

    Learning Motion Priors for 4D Human Body Capture in 3D Scenes

    Authors: Siwei Zhang, Yan Zhang, Federica Bogo, Marc Pollefeys, Siyu Tang

    Abstract: Recovering high-quality 3D human motion in complex scenes from monocular videos is important for many applications, ranging from AR/VR to robotics. However, capturing realistic human-scene interactions, while dealing with occlusions and partial views, is challenging; current approaches are still far from achieving compelling results. We address this problem by proposing LEMO: LEarning human MOtion… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV 2021, camera ready version with appendix

  7. arXiv:2104.11181  [pdf, other

    cs.CV

    H2O: Two Hands Manipulating Objects for First Person Interaction Recognition

    Authors: Taein Kwon, Bugra Tekin, Jan Stuhmer, Federica Bogo, Marc Pollefeys

    Abstract: We present a comprehensive framework for egocentric interaction recognition using markerless 3D annotations of two hands manipulating objects. To this end, we propose a method to create a unified dataset for egocentric 3D interaction recognition. Our method produces annotations of the 3D pose of two hands and the 6D pose of the manipulated objects, along with their interaction labels for each fram… ▽ More

    Submitted 24 August, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to ICCV 2021

  8. arXiv:2008.11239  [pdf, other

    cs.CV

    HoloLens 2 Research Mode as a Tool for Computer Vision Research

    Authors: Dorin Ungureanu, Federica Bogo, Silvano Galliani, Pooja Sama, Xin Duan, Casey Meekhof, Jan Stühmer, Thomas J. Cashman, Bugra Tekin, Johannes L. Schönberger, Pawel Olszta, Marc Pollefeys

    Abstract: Mixed reality headsets, such as the Microsoft HoloLens 2, are powerful sensing devices with integrated compute capabilities, which makes it an ideal platform for computer vision research. In this technical report, we present HoloLens 2 Research Mode, an API and a set of tools enabling access to the raw sensor streams. We provide an overview of the API and explain how it can be used to build mixed… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

  9. arXiv:2007.04940  [pdf, other

    cs.CV

    The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization

    Authors: **g**g Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew William Fitzgibbon, Jamie Shotton

    Abstract: Realtime perceptual and interaction capabilities in mixed reality require a range of 3D tracking problems to be solved at low latency on resource-constrained hardware such as head-mounted devices. Indeed, for devices such as HoloLens 2 where the CPU and GPU are left available for applications, multiple tracking subsystems are required to run on a continuous, real-time basis while sharing a single… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Journal ref: ECCV2020

  10. arXiv:2004.13449  [pdf, other

    cs.CV

    Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction

    Authors: Yana Hasson, Bugra Tekin, Federica Bogo, Ivan Laptev, Marc Pollefeys, Cordelia Schmid

    Abstract: Modeling hand-object manipulations is essential for understanding how humans interact with their environment. While of practical importance, estimating the pose of hands and objects during interactions is challenging due to the large mutual occlusions that occur during manipulation. Recent efforts have been directed towards fully-supervised methods that require large amounts of labeled training sa… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: CVPR 2020. See the project webpage at https://hassony2.github.io/handobjectconsist.html

  11. Domain-Specific Priors and Meta Learning for Few-Shot First-Person Action Recognition

    Authors: Huseyin Coskun, Zeeshan Zia, Bugra Tekin, Federica Bogo, Nassir Navab, Federico Tombari, Harpreet Sawhney

    Abstract: The lack of large-scale real datasets with annotations makes transfer learning a necessity for video activity understanding. We aim to develop an effective method for few-shot transfer learning for first-person action classification. We leverage independently trained local visual cues to learn representations that can be transferred from a source domain, which provides primitive action labels, to… ▽ More

    Submitted 7 December, 2021; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: Paper has been accepted in Transactions on Pattern Analysis and Machine Intelligence

    Journal ref: year = {5555}, volume = {}, number = {01}, issn = {1939-3539}, pages = {1-1},

  12. arXiv:1904.05349  [pdf, other

    cs.CV

    H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions

    Authors: Bugra Tekin, Federica Bogo, Marc Pollefeys

    Abstract: We present a unified framework for understanding 3D hand and object interactions in raw image sequences from egocentric RGB cameras. Given a single RGB image, our model jointly estimates the 3D hand and object poses, models their interactions, and recognizes the object and action classes with a single feed-forward pass through a neural network. We propose a single architecture that does not rely o… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: CVPR 2019 (Oral)

  13. arXiv:1707.07548  [pdf, other

    cs.CV

    Towards Accurate Markerless Human Shape and Pose Estimation over Time

    Authors: Yinghao Huang, Federica Bogo, Christoph Lassner, Angjoo Kanazawa, Peter V. Gehler, Ijaz Akhter, Michael J. Black

    Abstract: Existing marker-less motion capture methods often assume known backgrounds, static cameras, and sequence specific motion priors, which narrows its application scenarios. Here we propose a fully automatic method that given multi-view video, estimates 3D human motion and body shape. We take recent SMPLify \cite{bogo2016keep} as the base method, and extend it in several ways. First we fit the body to… ▽ More

    Submitted 30 April, 2018; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: 10 pages, 6 figures, 5 tables, published in 3DV-2017

  14. arXiv:1701.02468  [pdf, other

    cs.CV

    Unite the People: Closing the Loop Between 3D and 2D Human Representations

    Authors: Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, Michael J. Black, Peter V. Gehler

    Abstract: 3D models provide a common ground for different representations of human bodies. In turn, robust 2D estimation has proven to be a powerful tool to obtain 3D fits "in-the- wild". However, depending on the level of detail, it can be hard to impossible to acquire labeled data for training 2D estimators on large scale. We propose a hybrid approach to this problem: with an extended version of the recen… ▽ More

    Submitted 24 July, 2017; v1 submitted 10 January, 2017; originally announced January 2017.

  15. arXiv:1607.08128  [pdf, other

    cs.CV

    Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image

    Authors: Federica Bogo, Angjoo Kanazawa, Christoph Lassner, Peter Gehler, Javier Romero, Michael J. Black

    Abstract: We describe the first method to automatically estimate the 3D pose of the human body as well as its 3D shape from a single unconstrained image. We estimate a full 3D mesh and show that 2D joints alone carry a surprising amount of information about body shape. The problem is challenging because of the complexity of the human body, articulation, occlusion, clothing, lighting, and the inherent ambigu… ▽ More

    Submitted 27 July, 2016; originally announced July 2016.

    Comments: To appear in ECCV 2016