Showing 1–16 of 16 results for author: Garcia-Hernando, G

  1. arXiv:2406.18387 [pdf, other]

    cs.CV cs.LG

    DoubleTake: Geometry Guided Depth Estimation

    Authors: Mohamed Sayed, Filippo Aleotti, Jamie Watson, Zawar Qureshi, Guillermo Garcia-Hernando, Gabriel Brostow, Sara Vicente, Michael Firman

    Abstract: Estimating depth from a sequence of posed RGB images is a fundamental computer vision task, with applications in augmented reality, path planning etc. Prior work typically makes use of previous frames in a multi view stereo framework, relying on matching textures in a local neighborhood. In contrast, our model leverages historical predictions by giving the latest 3D geometry data as an extra input…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2212.11966 [pdf, other]

    cs.CV

    Removing Objects From Neural Radiance Fields

    Authors: Silvan Weder, Guillermo Garcia-Hernando, Aron Monszpart, Marc Pollefeys, Gabriel Brostow, Michael Firman, Sara Vicente

    Abstract: Neural Radiance Fields (NeRFs) are emerging as a ubiquitous scene representation that allows for novel view synthesis. Increasingly, NeRFs will be shareable with other people. Before sharing a NeRF, though, it might be desirable to remove personal information or unsightly objects. Such removal is not easily achieved with the current NeRF editing frameworks. We propose a framework to remove objects…

    Submitted 22 December, 2022; originally announced December 2022.

  3. arXiv:2210.05494 [pdf, other]

    cs.CV

    Map-free Visual Relocalization: Metric Pose Relative to a Single Image

    Authors: Eduardo Arnold, Jamie Wynn, Sara Vicente, Guillermo Garcia-Hernando, Áron Monszpart, Victor Adrian Prisacariu, Daniyar Turmukhambetov, Eric Brachmann

    Abstract: Can we relocalize in a scene represented by a single reference image? Standard visual relocalization requires hundreds of images and scale calibration to build a scene-specific 3D map. In contrast, we propose Map-free Relocalization, i.e., using only one photo of a scene to enable instant, metric scaled relocalization. Existing datasets are not suitable to benchmark map-free relocalization, due to…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: ECCV2022 camera-ready. 14 pages + 4 reference pages

  4. arXiv:2008.05785 [pdf, other]

    cs.CV cs.LG

    Predicting Visual Overlap of Images Through Interpretable Non-Metric Box Embeddings

    Authors: Anita Rau, Guillermo Garcia-Hernando, Danail Stoyanov, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: To what extent are two images picturing the same 3D surfaces? Even when this is a known scene, the answer typically requires an expensive search across scale space, with matching and geometric verification of large sets of local features. This expense is further multiplied when a query image is evaluated against a gallery, e.g. in visual relocalization. While we don't obviate the need for geometri…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  5. arXiv:2008.03285 [pdf, other]

    cs.CV cs.RO

    Physics-Based Dexterous Manipulations with Estimated Hand Poses and Residual Reinforcement Learning

    Authors: Guillermo Garcia-Hernando, Edward Johns, Tae-Kyun Kim

    Abstract: Dexterous manipulation of objects in virtual environments with our bare hands, by using only a depth sensor and a state-of-the-art 3D hand pose estimator (HPE), is challenging. While virtual environments are ruled by physics, e.g. object weights and surface frictions, the absence of force feedback makes the task challenging, as even slight inaccuracies on finger tips or contact points from HPE may…

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: To appear in IROS2020

  6. arXiv:2003.13764 [pdf, other]

    cs.CV

    Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

    Authors: Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, et al. (10 additional authors not shown)

    Abstract: We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole…

    Submitted 10 September, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  7. arXiv:2003.12344 [pdf, other]

    cs.CV

    Introducing Pose Consistency and Warp-Alignment for Self-Supervised 6D Object Pose Estimation in Color Images

    Authors: Juil Sock, Guillermo Garcia-Hernando, Anil Armagan, Tae-Kyun Kim

    Abstract: Most successful approaches to estimate the 6D pose of an object typically train a neural network by supervising the learning with annotated poses in real world images. These annotations are generally expensive to obtain and a common workaround is to generate and train on synthetic scenes, with the drawback of limited generalisation when the model is deployed in the real world. In this work, a two-…

    Submitted 16 October, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

    Comments: Accepted to 3DV'2020 as Oral

  8. arXiv:2001.10609 [pdf, other]

    cs.CV

    A Review on Object Pose Recovery: from 3D Bounding Box Detectors to Full 6D Pose Estimators

    Authors: Caner Sahin, Guillermo Garcia-Hernando, Juil Sock, Tae-Kyun Kim

    Abstract: Object pose recovery has gained increasing attention in the computer vision field as it has become an important problem in rapidly evolving technological areas related to autonomous driving, robotics, and augmented reality. Existing review-related studies have addressed the problem at visual level in 2D, going through the methods which produce 2D bounding boxes of objects of interest in RGB images…

    Submitted 19 April, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: Accepted to the journal of Image and Vision Computing (IVC). arXiv admin note: text overlap with arXiv:1903.04229

  9. arXiv:1910.08811 [pdf, other]

    cs.CV cs.RO

    Active 6D Multi-Object Pose Estimation in Cluttered Scenarios with Deep Reinforcement Learning

    Authors: Juil Sock, Guillermo Garcia-Hernando, Tae-Kyun Kim

    Abstract: In this work, we explore how a strategic selection of camera movements can facilitate the task of 6D multi-object pose estimation in cluttered scenarios while respecting real-world constraints important in robotics and augmented reality applications, such as time and distance traveled. In the proposed framework, a set of multiple object hypotheses is given to an agent, which is inferred by an obje…

    Submitted 19 October, 2019; originally announced October 2019.

  10. arXiv:1903.04229 [pdf, other]

    cs.CV

    Instance- and Category-level 6D Object Pose Estimation

    Authors: Caner Sahin, Guillermo Garcia-Hernando, Juil Sock, Tae-Kyun Kim

    Abstract: 6D object pose estimation is an important task that determines the 3D position and 3D rotation of an object in camera-centred coordinates. By utilizing such a task, one can propose promising solutions for various problems related to scene understanding, augmented reality, control and navigation of robotics. Recent developments on visual depth sensors and low-cost availability of depth data signifi…

    Submitted 11 March, 2019; originally announced March 2019.

    Comments: Book Chapter Submission. arXiv admin note: substantial text overlap with arXiv:1706.03285

  11. arXiv:1810.10818 [pdf, other]

    cs.CV

    HANDS18: Methods, Techniques and Applications for Hand Observation

    Authors: Iason Oikonomidis, Guillermo Garcia-Hernando, Angela Yao, Antonis Argyros, Vincent Lepetit, Tae-Kyun Kim

    Abstract: This report outlines the proceedings of the Fourth International Workshop on Observing and Understanding Hands in Action (HANDS 2018). The fourth instantiation of this workshop attracted significant interest from both academia and the industry. The program of the workshop included regular papers that are published as the workshop's proceedings, extended abstracts, invited posters, and invited talk…

    Submitted 25 October, 2018; originally announced October 2018.

    Comments: 11 pages, 1 figure, Discussion of the HANDS 2018 workshop held in conjunction with ECCV 2018

  12. arXiv:1810.01845 [pdf, other]

    cs.CV

    Task-Oriented Hand Motion Retargeting for Dexterous Manipulation Imitation

    Authors: Dafni Antotsiou, Guillermo Garcia-Hernando, Tae-Kyun Kim

    Abstract: Human hand actions are quite complex, especially when they involve object manipulation, mainly due to the high dimensionality of the hand and the vast action space that entails. Imitating those actions with dexterous hand models involves different important and challenging steps: acquiring human hand information, retargeting it to a hand model, and learning a policy from acquired data. In this wor…

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: ECCV 2018 workshop paper

  13. arXiv:1712.03917 [pdf, other]

    cs.CV

    Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals

    Authors: Shanxin Yuan, Guillermo Garcia-Hernando, Bjorn Stenger, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee, Pavlo Molchanov, Jan Kautz, Sina Honari, Liuhao Ge, Junsong Yuan, Xinghao Chen, Guijin Wang, Fan Yang, Kai Akiyama, Yang Wu, Qingfu Wan, Meysam Madadi, Sergio Escalera, Shile Li, Dongheui Lee, Iason Oikonomidis, Antonis Argyros, Tae-Kyun Kim

    Abstract: In this paper, we strive to answer two questions: What is the current state of 3D hand pose estimation from depth images? And, what are the next challenges that need to be tackled? Following the successful Hands In the Million Challenge (HIM2017), we investigate the top 10 state-of-the-art methods on three tasks: single frame 3D pose estimation, 3D hand tracking, and hand pose estimation during ob…

    Submitted 29 March, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

  14. arXiv:1707.02237 [pdf, other]

    cs.CV

    The 2017 Hands in the Million Challenge on 3D Hand Pose Estimation

    Authors: Shanxin Yuan, Qi Ye, Guillermo Garcia-Hernando, Tae-Kyun Kim

    Abstract: We present the 2017 Hands in the Million Challenge, a public competition designed for the evaluation of the task of 3D hand pose estimation. The goal of this challenge is to assess how far is the state of the art in terms of solving the problem of 3D hand pose estimation as well as detect major failure and strength modes of both systems and evaluation metrics that can help to identify future resea…

    Submitted 7 July, 2017; originally announced July 2017.

  15. arXiv:1704.02463 [pdf, other]

    cs.CV

    First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations

    Authors: Guillermo Garcia-Hernando, Shanxin Yuan, Seungryul Baek, Tae-Kyun Kim

    Abstract: In this work we study the use of 3D hand poses to recognize first-person dynamic hand actions interacting with 3D objects. Towards this goal, we collected RGB-D video sequences comprised of more than 100K frames of 45 daily hand action categories, involving 26 different objects in several hand configurations. To obtain hand pose annotations, we used our own mo-cap system that automatically infers…

    Submitted 10 April, 2018; v1 submitted 8 April, 2017; originally announced April 2017.

    Comments: Accepted to CVPR 2018

  16. arXiv:1607.02737 [pdf, other]

    cs.CV

    Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

    Authors: Guillermo Garcia-Hernando, Tae-Kyun Kim

    Abstract: A human action can be seen as transitions between one's body poses over time, where the transition depicts a temporal relation between two poses. Recognizing actions thus involves learning a classifier sensitive to these pose transitions as well as to static poses. In this paper, we introduce a novel method called transitions forests, an ensemble of decision trees that both learn to discriminate s…

    Submitted 31 March, 2017; v1 submitted 10 July, 2016; originally announced July 2016.

    Comments: to appear in CVPR 2017