Skip to main content

Showing 1–37 of 37 results for author: Rogez, G

.
  1. arXiv:2406.00636  [pdf, other

    cs.CV

    T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences

    Authors: Taeryung Lee, Fabien Baradel, Thomas Lucas, Kyoung Mu Lee, Gregory Rogez

    Abstract: In this paper, we address the challenging problem of long-term 3D human motion generation. Specifically, we aim to generate a long sequence of smoothly connected actions from a stream of multiple sentences (i.e., paragraph). Previous long-term motion generating approaches were mostly based on recurrent methods, using previously generated motion chunks as input for the next step. However, this appr… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 HuMoGen Workshop

  2. arXiv:2404.12942  [pdf, other

    cs.CV

    Purposer: Putting Human Motion Generation in Context

    Authors: Nicolas Ugrinovic, Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Gregory Rogez, Francesc Moreno-Noguer

    Abstract: We present a novel method to generate human motion to populate 3D indoor scenes. It can be controlled with various combinations of conditioning signals such as a path in a scene, target poses, past motions, and scenes represented as 3D point clouds. State-of-the-art methods are either models specialized to one single setting, require vast amounts of high-quality and diverse training data, or are u… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  3. arXiv:2402.16392  [pdf, other

    cs.CV

    Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

    Authors: Pau de Jorge, Riccardo Volpi, Puneet K. Dokania, Philip H. S. Torr, Gregory Rogez

    Abstract: When deploying a semantic segmentation model into the real world, it will inevitably be confronted with semantic classes unseen during training. Thus, to safely deploy such systems, it is crucial to accurately evaluate and improve their anomaly segmentation capabilities. However, acquiring and labelling semantic segmentation data is expensive and unanticipated conditions are long-tail and potentia… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  4. arXiv:2402.14654  [pdf, other

    cs.CV

    Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot

    Authors: Fabien Baradel, Matthieu Armando, Salma Galaaoui, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez, Thomas Lucas

    Abstract: We present Multi-HMR, a strong single-shot model for multi-person 3D human mesh recovery from a single RGB image. Predictions encompass the whole body, i.e, including hands and facial expressions, using the SMPL-X parametric model and spatial location in the camera coordinate system. Our model detects people by predicting coarse 2D heatmaps of person centers, using features produced by a standard… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: https://github.com/naver/multi-hmr

  5. arXiv:2311.09104  [pdf, other

    cs.CV

    Cross-view and Cross-pose Completion for 3D Human Understanding

    Authors: Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez

    Abstract: Human perception and understanding is a major domain of computer vision which, like many other vision subdomains recently, stands to gain from the use of large models pre-trained on large datasets. We hypothesize that the most common pre-training strategy of relying on general purpose, object-centric image datasets such as ImageNet, is limited by an important domain shift. On the other hand, colle… ▽ More

    Submitted 18 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  6. arXiv:2309.10748  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.RO

    SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction

    Authors: Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Bregier, Matthieu Armando, Jean-Sebastien Franco, Gregory Rogez

    Abstract: Recent hand-object interaction datasets show limited real object variability and rely on fitting the MANO parametric model to obtain groundtruth hand shapes. To go beyond these limitations and spur further research, we introduce the SHOWMe dataset which consists of 96 videos, annotated with real and detailed hand-object 3D textured meshes. Following recent work, we consider a rigid hand-object sce… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Paper and Appendix, Accepted in ACVR workshop at ICCV conference

  7. arXiv:2309.08480  [pdf, other

    cs.CV

    PoseFix: Correcting 3D Human Poses with Natural Language

    Authors: Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez

    Abstract: Automatically producing instructions to modify one's posture could open the door to endless applications, such as personalized coaching and in-home physical therapy. Tackling the reverse problem (i.e., refining a 3D pose based on some natural language feedback) could help for assisted 3D character animation or robot teaching, for instance. Although a few recent works explore the connections betwee… ▽ More

    Submitted 17 January, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Published in ICCV 2023

  8. arXiv:2306.07399  [pdf, other

    cs.CV

    4DHumanOutfit: a multi-subject 4D dataset of human motion sequences in varying outfits exhibiting large displacements

    Authors: Matthieu Armando, Laurence Boissieux, Edmond Boyer, Jean-Sebastien Franco, Martin Humenberger, Christophe Legras, Vincent Leroy, Mathieu Marsot, Julien Pansiot, Sergi Pujades, Rim Rekik, Gregory Rogez, Anilkumar Swamy, Stefanie Wuhrer

    Abstract: This work presents 4DHumanOutfit, a new dataset of densely sampled spatio-temporal 4D human motion data of different actors, outfits and motions. The dataset is designed to contain different actors wearing different outfits while performing different motions in each outfit. In this way, the dataset can be seen as a cube of data containing 4D motion sequences along 3 axes with identity, outfit and… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  9. arXiv:2303.11298  [pdf, other

    cs.CV

    Reliability in Semantic Segmentation: Are We on the Right Track?

    Authors: Pau de Jorge, Riccardo Volpi, Philip Torr, Gregory Rogez

    Abstract: Motivated by the increasing popularity of transformers in computer vision, in recent times there has been a rapid development of novel architectures. While in-domain performance follows a constant, upward trend, properties like robustness or uncertainty estimation are less explored -leaving doubts about advances in model reliability. Studies along these axes exist, but they are mainly limited to c… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  10. arXiv:2210.11795  [pdf, other

    cs.CV

    PoseScript: Linking 3D Human Poses and Natural Language

    Authors: Ginger Delmas, Philippe Weinzaepfel, Thomas Lucas, Francesc Moreno-Noguer, Grégory Rogez

    Abstract: Natural language plays a critical role in many computer vision applications, such as image captioning, visual question answering, and cross-modal retrieval, to provide fine-grained semantic information. Unfortunately, while human pose is key to human understanding, current 3D human pose datasets lack detailed language descriptions. To address this issue, we have introduced the PoseScript dataset.… ▽ More

    Submitted 19 January, 2024; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Extended version of the ECCV 2022 paper

  11. arXiv:2210.10542  [pdf, other

    cs.CV

    PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting

    Authors: Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Grégory Rogez

    Abstract: We address the problem of action-conditioned generation of human motion sequences. Existing work falls into two categories: forecast models conditioned on observed past motions, or generative models conditioned on action labels and duration only. In contrast, we generate motion conditioned on observations of arbitrary length, including none. To solve this generalized problem, we propose PoseGPT, a… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: ECCV'22 Conference paper

  12. arXiv:2210.00627  [pdf, other

    cs.CV

    MonoNHR: Monocular Neural Human Renderer

    Authors: Hongsuk Choi, Gyeongsik Moon, Matthieu Armando, Vincent Leroy, Kyoung Mu Lee, Gregory Rogez

    Abstract: Existing neural human rendering methods struggle with a single image input due to the lack of information in invisible areas and the depth ambiguity of pixels in visible areas. In this regard, we propose Monocular Neural Human Renderer (MonoNHR), a novel approach that renders robust free-viewpoint images of an arbitrary human given only a single image. MonoNHR is the first method that (i) renders… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: Hongsuk Choi and Gyeongsik Moon contributed equally, 15 pages including the reference and supplementary material

  13. arXiv:2208.10211  [pdf, other

    cs.CV

    PoseBERT: A Generic Transformer Module for Temporal 3D Human Modeling

    Authors: Fabien Baradel, Romain Brégier, Thibault Groueix, Philippe Weinzaepfel, Yannis Kalantidis, Grégory Rogez

    Abstract: Training state-of-the-art models for human pose estimation in videos requires datasets with annotations that are really hard and expensive to obtain. Although transformers have been recently utilized for body pose sequence modeling, related methods rely on pseudo-ground truth to augment the currently limited training data available for learning such models. In this paper, we introduce PoseBERT, a… ▽ More

    Submitted 19 October, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted to TPAMI 2022

  14. arXiv:2206.08242  [pdf, other

    cs.LG cs.AI cs.CV

    Catastrophic overfitting can be induced with discriminative non-robust features

    Authors: Guillermo Ortiz-Jiménez, Pau de Jorge, Amartya Sanyal, Adel Bibi, Puneet K. Dokania, Pascal Frossard, Gregory Rogéz, Philip H. S. Torr

    Abstract: Adversarial training (AT) is the de facto method for building robust neural networks, but it can be computationally expensive. To mitigate this, fast single-step attacks can be used, but this may lead to catastrophic overfitting (CO). This phenomenon appears when networks gain non-trivial robustness during the first stages of AT, but then reach a breaking point where they become vulnerable in just… ▽ More

    Submitted 15 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  15. arXiv:2202.01181  [pdf, other

    cs.LG cs.CV

    Make Some Noise: Reliable and Efficient Single-Step Adversarial Training

    Authors: Pau de Jorge, Adel Bibi, Riccardo Volpi, Amartya Sanyal, Philip H. S. Torr, Grégory Rogez, Puneet K. Dokania

    Abstract: Recently, Wong et al. showed that adversarial training with single-step FGSM leads to a characteristic failure mode named Catastrophic Overfitting (CO), in which a model becomes suddenly vulnerable to multi-step attacks. Experimentally they showed that simply adding a random perturbation prior to FGSM (RS-FGSM) could prevent CO. However, Andriushchenko and Flammarion observed that RS-FGSM still le… ▽ More

    Submitted 17 October, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: Published in NeurIPS 2022

  16. arXiv:2112.12004  [pdf, other

    cs.CV

    Barely-Supervised Learning: Semi-Supervised Learning with very few labeled images

    Authors: Thomas Lucas, Philippe Weinzaepfel, Gregory Rogez

    Abstract: This paper tackles the problem of semi-supervised learning when the set of labeled samples is limited to a small number of images per class, typically less than 10, problem that we refer to as barely-supervised learning. We analyze in depth the behavior of a state-of-the-art semi-supervised method, FixMatch, which relies on a weakly-augmented version of an image to obtain supervision signal for a… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  17. arXiv:2110.09243  [pdf, other

    cs.CV

    Leveraging MoCap Data for Human Mesh Recovery

    Authors: Fabien Baradel, Thibault Groueix, Philippe Weinzaepfel, Romain Brégier, Yannis Kalantidis, Grégory Rogez

    Abstract: Training state-of-the-art models for human body pose and shape recovery from images or videos requires datasets with corresponding annotations that are really hard and expensive to obtain. Our goal in this paper is to study whether poses from 3D Motion Capture (MoCap) data can be used to improve image-based and video-based human mesh recovery methods. We find that fine-tune image-based models with… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 3DV 2021

  18. Large Enhancement of Ferro-Magnetism under Collective Strong Coupling of YBCO Nanoparticles

    Authors: Anoop Thomas, Eloise Devaux, Kalaivanan Nagarajan, Guillaume Rogez, Marcus Seidel, Fanny Richard, Cyriaque Genet, Marc Drillon, Thomas W. Ebbesen

    Abstract: Light-matter strong coupling in the vacuum limit has been shown to enhance material properties over the past decade. Oxide nanoparticles are known to exhibit weak ferromagnetism due to vacancies in the lattice. Here we report the 700-fold enhancement of the ferromagnetism of YBa$_2$Cu$_3$O$_{7-x}$ nanoparticles under cooperative strong coupling at room temperature. The magnetic moment reaches 0.90… ▽ More

    Submitted 23 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: 24 pages, 4 figures - difference with v1 version: revised Supplementary Information file

  19. arXiv:2012.09696  [pdf, other

    cs.RO cs.LG

    Multi-FinGAN: Generative Coarse-To-Fine Sampling of Multi-Finger Grasps

    Authors: Jens Lundell, Enric Corona, Tran Nguyen Le, Francesco Verdoja, Philippe Weinzaepfel, Gregory Rogez, Francesc Moreno-Noguer, Ville Kyrki

    Abstract: While there exists many methods for manipulating rigid objects with parallel-jaw grippers, gras** with multi-finger robotic hands remains a quite unexplored research topic. Reasoning and planning collision-free trajectories on the additional degrees of freedom of several fingers represents an important challenge that, so far, involves computationally costly and slow processes. In this work, we p… ▽ More

    Submitted 15 March, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Accepted to IEEE Conference on Robotics and Automation 2021 (ICRA). Code is available at https://irobotics.aalto.fi/multi-fingan/

  20. arXiv:2012.04324  [pdf, other

    cs.CV cs.AI cs.LG

    Continual Adaptation of Visual Representations via Domain Randomization and Meta-learning

    Authors: Riccardo Volpi, Diane Larlus, Grégory Rogez

    Abstract: Most standard learning approaches lead to fragile models which are prone to drift when sequentially trained on samples of a different nature - the well-known "catastrophic forgetting" issue. In particular, when a model consecutively learns from different visual domains, it tends to forget the past domains in favor of the most recent ones. In this context, we show that one way to learn models that… ▽ More

    Submitted 8 April, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR 2021

  21. arXiv:2012.02743  [pdf, other

    cs.CV

    SMPLy Benchmarking 3D Human Pose Estimation in the Wild

    Authors: Vincent Leroy, Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Grégory Rogez

    Abstract: Predicting 3D human pose from images has seen great recent improvements. Novel approaches that can even predict both pose and shape from a single input image have been introduced, often relying on a parametric model of the human body such as SMPL. While qualitative results for such methods are often shown for images captured in-the-wild, a proper benchmark in such conditions is still missing, as i… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 3DV 2020 Oral presentation

  22. arXiv:2008.09457  [pdf, other

    cs.CV

    DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild

    Authors: Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Grégory Rogez

    Abstract: We introduce DOPE, the first method to detect and estimate whole-body 3D human poses, including bodies, hands and faces, in the wild. Achieving this level of details is key for a number of applications that require understanding the interactions of the people with each other or with the environment. The main challenge is the lack of in-the-wild data with labeled whole-body 3D poses. In previous wo… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  23. arXiv:2006.09081  [pdf, other

    cs.CV cs.LG

    Progressive Skeletonization: Trimming more fat from a network at initialization

    Authors: Pau de Jorge, Amartya Sanyal, Harkirat S. Behl, Philip H. S. Torr, Gregory Rogez, Puneet K. Dokania

    Abstract: Recent studies have shown that skeletonization (pruning parameters) of networks \textit{at initialization} provides all the practical benefits of sparsity both at inference and training time, while only marginally degrading their performance. However, we observe that beyond a certain level of sparsity (approx $95\%$), these approaches fail to preserve the network performance, and to our surprise,… ▽ More

    Submitted 19 March, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  24. arXiv:2003.13764  [pdf, other

    cs.CV

    Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

    Authors: Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou , et al. (10 additional authors not shown)

    Abstract: We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole… ▽ More

    Submitted 10 September, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  25. arXiv:1912.07249  [pdf, other

    cs.CV

    Mimetics: Towards Understanding Human Actions Out of Context

    Authors: Philippe Weinzaepfel, Grégory Rogez

    Abstract: Recent methods for video action recognition have reached outstanding performances on existing benchmarks. However, they tend to leverage context such as scenes or objects instead of focusing on understanding the human action itself. For instance, a tennis field leads to the prediction playing tennis irrespectively of the actions performed in the video. In contrast, humans have a more complete unde… ▽ More

    Submitted 2 February, 2021; v1 submitted 16 December, 2019; originally announced December 2019.

  26. arXiv:1908.00439  [pdf, other

    cs.CV

    Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images

    Authors: Valentin Gabeur, Jean-Sebastien Franco, Xavier Martin, Cordelia Schmid, Gregory Rogez

    Abstract: In this paper, we tackle the problem of 3D human shape estimation from single RGB images. While the recent progress in convolutional neural networks has allowed impressive results for 3D human pose estimation, estimating the full 3D shape of a person is still an open issue. Model-based approaches can output precise meshes of naked under-cloth human bodies but fail to estimate details and un-modell… ▽ More

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: Accepted at ICCV 2019

  27. arXiv:1905.07487  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Designing of a magnetodielectric system in hybrid organic-inorganic framework, a perovskite layered phosphonate MnO3PC6H4-m-Br.H2O

    Authors: Tathamay Basu, Clarisse Bloyet, Felicien Beaubras, Vincent Caignaert, Olivier Perez, Jean-Michel Rueff, Alain Pautrat, Bernard Raveau, Jean-François Lohier, Paul-Alain Jaffrès, Hélène Couthon, Guillaume Rogez, Grégory Taupier, Honorat Dorkenoo

    Abstract: The research on multiferrocity and magnetoelectric coupling in metal-organic system is rare. Very few hybrid organic-inorganic frameworks (HOIF) exhibit direct magnetoelectric coupling (coupling between spins and dipoles) and also restricted to particular COOH-based system. We show how one can design a hybrid system to obtain such coupling based on the rational design of the organic ligands. The l… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: accepted in Advanced Functional Materials

    Journal ref: Adv. Funct. Mater.2019, 1901878

  28. arXiv:1809.04809  [pdf

    cond-mat.mtrl-sci physics.chem-ph

    Incipient spin-dipole coupling in a 1D helical-chain metal-organic hybrid

    Authors: Tathamay Basu, Clarisse Bloyet, Jean-Michel Rueff, Vincent Caignaert, Alain Pautrat, Bernard Raveau, Guillaume Rogez, Paul-Alain Jaffrès

    Abstract: Low dimensional magnetic systems (such as spin-chain) are extensively studied due to their exotic magnetic properties. Here, we would like to address that such systems should also be interesting in the field of dielectric, ferroelectricity and magnetodielectric coupling. As a prototype example, we have investigated a one-dimensional (1D) helical-chain metal-organic hybrid system with a chiral stru… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: Manuscript is accepted in J. Mat. Chem. C as a Communication

    Journal ref: Journal of Materials Chemistry C 2018

  29. LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images

    Authors: Gregory Rogez, Philippe Weinzaepfel, Cordelia Schmid

    Abstract: We propose an end-to-end architecture for joint 2D and 3D human pose estimation in natural images. Key to our approach is the generation and scoring of a number of pose proposals per image, which allows us to predict 2D and 3D poses of multiple people simultaneously. Hence, our approach does not require an approximate localization of the humans for initialization. Our Localization-Classification-R… ▽ More

    Submitted 13 January, 2019; v1 submitted 1 March, 2018; originally announced March 2018.

    Comments: journal version of the CVPR 2017 paper, accepted to appear in IEEE Trans. PAMI

  30. arXiv:1802.04216  [pdf, other

    cs.CV

    Image-based Synthesis for Deep 3D Human Pose Estimation

    Authors: Grégory Rogez, Cordelia Schmid

    Abstract: This paper addresses the problem of 3D human pose estimation in the wild. A significant challenge is the lack of training data, i.e., 2D images of humans annotated with 3D poses. Such data is necessary to train state-of-the-art CNN architectures. Here, we propose a solution to generate a large set of photorealistic synthetic images of humans with 3D pose annotations. We introduce an image-based sy… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

    Comments: accepted to appear in IJCV (with minor revisions). Follow-up to NIPS 2016 arXiv:1607.02046

  31. arXiv:1707.06005  [pdf, other

    cs.CV

    Detecting Parts for Action Localization

    Authors: Nicolas Chesneau, Grégory Rogez, Karteek Alahari, Cordelia Schmid

    Abstract: In this paper, we propose a new framework for action localization that tracks people in videos and extracts full-body human tubes, i.e., spatio-temporal regions localizing actions, even in the case of occlusions or truncations. This is achieved by training a novel human part detector that scores visible parts while regressing full-body bounding boxes. The core of our method is a convolutional neur… ▽ More

    Submitted 21 July, 2017; v1 submitted 19 July, 2017; originally announced July 2017.

    Comments: BMVC 2017

  32. arXiv:1607.02046  [pdf, other

    cs.CV

    MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild

    Authors: Grégory Rogez, Cordelia Schmid

    Abstract: This paper addresses the problem of 3D human pose estimation in the wild. A significant challenge is the lack of training data, i.e., 2D images of humans annotated with 3D poses. Such data is necessary to train state-of-the-art CNN architectures. Here, we propose a solution to generate a large set of photorealistic synthetic images of humans with 3D pose annotations. We introduce an image-based sy… ▽ More

    Submitted 28 October, 2016; v1 submitted 7 July, 2016; originally announced July 2016.

    Comments: 9 pages, accepted to appear in NIPS 2016

  33. arXiv:1603.09439  [pdf, other

    cs.CV

    The Open World of Micro-Videos

    Authors: Phuc Xuan Nguyen, Gregory Rogez, Charless Fowlkes, Deva Ramanan

    Abstract: Micro-videos are six-second videos popular on social media networks with several unique properties. Firstly, because of the authoring process, they contain significantly more diversity and narrative structure than existing collections of video "snippets". Secondly, because they are often captured by hand-held mobile cameras, they contain specialized viewpoints including third-person, egocentric, a… ▽ More

    Submitted 31 March, 2016; v1 submitted 30 March, 2016; originally announced March 2016.

  34. arXiv:1504.06378  [pdf, other

    cs.CV

    Depth-based hand pose estimation: methods, data, and challenges

    Authors: James Steven Supancic III, Gregory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan

    Abstract: Hand pose estimation has matured rapidly in recent years. The introduction of commodity depth sensors and a multitude of practical applications have spurred new advances. We provide an extensive analysis of the state-of-the-art, focusing on hand pose estimation from a single depth frame. To do so, we have implemented a considerable number of systems, and will release all software and evaluation co… ▽ More

    Submitted 6 May, 2015; v1 submitted 23 April, 2015; originally announced April 2015.

  35. arXiv:1412.0065  [pdf, other

    cs.CV

    3D Hand Pose Detection in Egocentric RGB-D Images

    Authors: Gregory Rogez, James S. Supancic III, Maryam Khademi, Jose Maria Martinez Montiel, Deva Ramanan

    Abstract: We focus on the task of everyday hand pose estimation from egocentric viewpoints. For this task, we show that depth sensors are particularly informative for extracting near-field interactions of the camera wearer with his/her environment. Despite the recent advances in full-body pose estimation using Kinect-like sensors, reliable monocular hand pose estimation in RGB-D images is still an unsolved… ▽ More

    Submitted 28 November, 2014; originally announced December 2014.

    Comments: 14 pages, 15 figures, extended version of the corresponding ECCV workshop paper, submitted to International Journal of Computer Vision

  36. arXiv:1412.0060  [pdf, other

    cs.CV

    Egocentric Pose Recognition in Four Lines of Code

    Authors: Gregory Rogez, James S. Supancic III, Deva Ramanan

    Abstract: We tackle the problem of estimating the 3D pose of an individual's upper limbs (arms+hands) from a chest mounted depth-camera. Importantly, we consider pose estimation during everyday interactions with objects. Past work shows that strong pose+viewpoint priors and depth-based features are crucial for robust performance. In egocentric views, hands and arms are observable within a well defined volum… ▽ More

    Submitted 28 November, 2014; originally announced December 2014.

    Comments: 9 pages, 10 figures

  37. arXiv:0908.0607  [pdf

    cond-mat.mtrl-sci

    Study of molecular spin-crossover complex Fe(phen)2(NCS)2 thin films

    Authors: Shengwei Shi, G. Schmerber, J. Arabski, J. -B. Beaufrand, D. J. Kim, S. Boukari, M. Bowen, N. T. Kemp, N. Viart, G. Rogez, E. Beaurepaire, H. Aubriet, J. Petersen, C. Becker, D. Ruch

    Abstract: We report on the growth by evaporation under high vacuum of high-quality thin films of Fe(phen)2(NCS)2 (phen=1,10-phenanthroline) that maintain the expected electronic structure down to a thickness of 10 nm and that exhibit a temperature-driven spin transition. We have investigated the current-voltage characteristics of a device based on such films. From the space charge-limited current regime,… ▽ More

    Submitted 5 August, 2009; originally announced August 2009.

    Journal ref: Appl. Phys. Lett. 95, 043303 (2009)