Skip to main content

Showing 1–50 of 66 results for author: Moreno-Noguer, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19852  [pdf, other

    cs.CV cs.MA

    FootBots: A Transformer-based Architecture for Motion Prediction in Soccer

    Authors: Guillem Capellera, Luis Ferraz, Antonio Rubio, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Motion prediction in soccer involves capturing complex dynamics from player and ball interactions. We present FootBots, an encoder-decoder transformer-based architecture addressing motion prediction and conditioned motion prediction through equivariance properties. FootBots captures temporal and social dynamics using set attention blocks and multi-attention block decoder. Our evaluation utilizes t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Published as a conference paper at IEEE ICIP 2024

  2. arXiv:2405.19876  [pdf, other

    cs.CV

    IReNe: Instant Recoloring of Neural Radiance Fields

    Authors: Alessio Mazzucchelli, Adrian Garcia-Garcia, Elena Garces, Fernando Rivas-Manzaneque, Francesc Moreno-Noguer, Adrian Penate-Sanchez

    Abstract: Advances in NERFs have allowed for 3D scene reconstructions and novel view synthesis. Yet, efficiently editing these representations while retaining photorealism is an emerging challenge. Recent methods face three primary limitations: they're slow for interactive use, lack precision at object boundaries, and struggle to ensure multi-view consistency. We introduce IReNe to address these limitations… ▽ More

    Submitted 10 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.18839  [pdf, other

    cs.CV

    MEGA: Masked Generative Autoencoder for Human Mesh Recovery

    Authors: Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda, Francesc Moreno-Noguer

    Abstract: Human Mesh Recovery (HMR) from a single RGB image is a highly ambiguous problem, as similar 2D projections can correspond to multiple 3D interpretations. Nevertheless, most HMR methods overlook this ambiguity and make a single prediction without accounting for the associated uncertainty. A few approaches generate a distribution of human meshes, enabling the sampling of multiple predictions; howeve… ▽ More

    Submitted 31 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2404.12942  [pdf, other

    cs.CV

    Purposer: Putting Human Motion Generation in Context

    Authors: Nicolas Ugrinovic, Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Gregory Rogez, Francesc Moreno-Noguer

    Abstract: We present a novel method to generate human motion to populate 3D indoor scenes. It can be controlled with various combinations of conditioning signals such as a path in a scene, target poses, past motions, and scenes represented as 3D point clouds. State-of-the-art methods are either models specialized to one single setting, require vast amounts of high-quality and diverse training data, or are u… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  5. arXiv:2404.11987  [pdf, other

    cs.CV

    MultiPhys: Multi-Person Physics-aware 3D Motion Estimation

    Authors: Nicolas Ugrinovic, Boxiao Pan, Georgios Pavlakos, Despoina Paschalidou, Bokui Shen, Jordi Sanchez-Riera, Francesc Moreno-Noguer, Leonidas Guibas

    Abstract: We introduce MultiPhys, a method designed for recovering multi-person motion from monocular videos. Our focus lies in capturing coherent spatial placement between pairs of individuals across varying degrees of engagement. MultiPhys, being physically aware, exhibits robustness to jittering and occlusions, and effectively eliminates penetration issues between the two individuals. We devise a pipelin… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  6. arXiv:2402.15552  [pdf, other

    cs.RO cs.AI eess.SY

    Morphological Symmetries in Robotics

    Authors: Daniel Ordoñez-Apraez, Giulio Turrisi, Vladimir Kostic, Mario Martin, Antonio Agudo, Francesc Moreno-Noguer, Massimiliano Pontil, Claudio Semini, Carlos Mastalli

    Abstract: We present a comprehensive framework for studying and leveraging morphological symmetries in robotic systems. These are intrinsic properties of the robot's morphology, frequently observed in animal biology and robotics, which stem from the replication of kinematic structures and the symmetrical distribution of mass. We illustrate how these symmetries extend to the robot's state space and both prop… ▽ More

    Submitted 4 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 18 pages, 11 figures

  7. arXiv:2312.08291  [pdf, other

    cs.CV

    VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

    Authors: Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Previous works on Human Pose and Shape Estimation (HPSE) from RGB images can be broadly categorized into two main groups: parametric and non-parametric approaches. Parametric techniques leverage a low-dimensional statistical body model for realistic results, whereas recent non-parametric methods achieve higher precision by directly regressing the 3D coordinates of the human body mesh. This work in… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  8. arXiv:2311.01815  [pdf, other

    cs.CV cs.RO

    Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields

    Authors: Jianxiong Shen, Ruijie Ren, Adria Ruiz, Francesc Moreno-Noguer

    Abstract: Current methods based on Neural Radiance Fields (NeRF) significantly lack the capacity to quantify uncertainty in their predictions, particularly on the unseen space including the occluded and outside scene content. This limitation hinders their extensive applications in robotics, where the reliability of model predictions has to be considered for tasks such as robotic exploration and planning in… ▽ More

    Submitted 25 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

  9. arXiv:2310.08784  [pdf, other

    cs.CV

    Implicit Shape and Appearance Priors for Few-Shot Full Head Reconstruction

    Authors: Pol Caselles, Eduard Ramon, Jaime Garcia, Gil Triginer, Francesc Moreno-Noguer

    Abstract: Recent advancements in learning techniques that employ coordinate-based neural representations have yielded remarkable results in multi-view 3D reconstruction tasks. However, these approaches often require a substantial number of input views (typically several tens) and computationally intensive optimization procedures to achieve their effectiveness. In this paper, we address these limitations spe… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  10. arXiv:2309.08480  [pdf, other

    cs.CV

    PoseFix: Correcting 3D Human Poses with Natural Language

    Authors: Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez

    Abstract: Automatically producing instructions to modify one's posture could open the door to endless applications, such as personalized coaching and in-home physical therapy. Tackling the reverse problem (i.e., refining a 3D pose based on some natural language feedback) could help for assisted 3D character animation or robot teaching, for instance. Although a few recent works explore the connections betwee… ▽ More

    Submitted 17 January, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Published in ICCV 2023

  11. arXiv:2308.04868  [pdf, other

    cs.CV

    InstantAvatar: Efficient 3D Head Reconstruction via Surface Rendering

    Authors: Antonio Canela, Pol Caselles, Ibrar Malik, Eduard Ramon, Jaime García, Jordi Sánchez-Riera, Gil Triginer, Francesc Moreno-Noguer

    Abstract: Recent advances in full-head reconstruction have been obtained by optimizing a neural field through differentiable surface or volume rendering to represent a single scene. While these techniques achieve an unprecedented accuracy, they take several minutes, or even hours, due to the expensive optimization process required. In this work, we introduce InstantAvatar, a method that recovers full-head a… ▽ More

    Submitted 5 April, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  12. arXiv:2302.10433  [pdf, other

    cs.RO cs.LG eess.SY

    On discrete symmetries of robotics systems: A group-theoretic and data-driven analysis

    Authors: Daniel Ordonez-Apraez, Mario Martin, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: We present a comprehensive study on discrete morphological symmetries of dynamical systems, which are commonly observed in biological and artificial locomoting systems, such as legged, swimming, and flying animals/robots/virtual characters. These symmetries arise from the presence of one or more planes/axis of symmetry in the system's morphology, resulting in harmonious duplication and distributio… ▽ More

    Submitted 7 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 8 pages, 4 figures, 7 optional appendix pages, 4 appendix figures

    MSC Class: 37J15; ACM Class: J.2

    Journal ref: Robotics: Science and System 2023

  13. arXiv:2301.08784  [pdf, other

    cs.CL cs.CV

    Visual Semantic Relatedness Dataset for Image Captioning

    Authors: Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

    Abstract: Modern image captioning system relies heavily on extracting knowledge from images to capture the concept of a static story. In this paper, we propose a textual visual context dataset for captioning, in which the publicly available dataset COCO Captions (Lin et al., 2014) has been extended with information about the scene (such as objects in the image). Since this information has a textual form, it… ▽ More

    Submitted 30 April, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

    Comments: Project Page: bit.ly/project-page-paper

  14. arXiv:2210.11795  [pdf, other

    cs.CV

    PoseScript: Linking 3D Human Poses and Natural Language

    Authors: Ginger Delmas, Philippe Weinzaepfel, Thomas Lucas, Francesc Moreno-Noguer, Grégory Rogez

    Abstract: Natural language plays a critical role in many computer vision applications, such as image captioning, visual question answering, and cross-modal retrieval, to provide fine-grained semantic information. Unfortunately, while human pose is key to human understanding, current 3D human pose datasets lack detailed language descriptions. To address this issue, we have introduced the PoseScript dataset.… ▽ More

    Submitted 19 January, 2024; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Extended version of the ECCV 2022 paper

  15. arXiv:2209.08163  [pdf, other

    cs.CV cs.CL

    Belief Revision based Caption Re-ranker with Visual Semantic Information

    Authors: Ahmed Sabir, Francesc Moreno-Noguer, Pranava Madhyastha, Lluís Padró

    Abstract: In this work, we focus on improving the captions generated by image-caption generation systems. We propose a novel re-ranking approach that leverages visual-semantic measures to identify the ideal caption that maximally captures the visual information in the image. Our re-ranker utilizes the Belief Revision framework (Blok et al., 2003) to calibrate the original likelihood of the top-n captions by… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: COLING 2022

  16. arXiv:2209.03027  [pdf, other

    cs.CV cs.AI eess.IV

    SIRA: Relightable Avatars from a Single Image

    Authors: Pol Caselles, Eduard Ramon, Jaime Garcia, Xavier Giro-i-Nieto, Francesc Moreno-Noguer, Gil Triginer

    Abstract: Recovering the geometry of a human head from a single image, while factorizing the materials and illumination is a severely ill-posed problem that requires prior information to be solved. Methods based on 3D Morphable Models (3DMM), and their combination with differentiable renderers, have shown promising results. However, the expressiveness of 3DMMs is limited, and they typically yield over-smoot… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  17. arXiv:2209.02402  [pdf, other

    cs.CV cs.AI

    Topic Detection in Continuous Sign Language Videos

    Authors: Alvaro Budria, Laia Tarres, Gerard I. Gallego, Francesc Moreno-Noguer, Jordi Torres, Xavier Giro-i-Nieto

    Abstract: Significant progress has been made recently on challenging tasks in automatic sign language understanding, such as sign language recognition, translation and production. However, these works have focused on datasets with relatively few samples, short recordings and limited vocabulary and signing space. In this work, we introduce the novel task of sign language topic detection. We base our experime… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: Presented as an extended abstract in the "AVA: Accessibility, Vision, and Autonomy Meet" CVPR 2022 Workshop

    Journal ref: "AVA: Accessibility, Vision, and Autonomy Meet" CVPR 2022 Workshop

  18. arXiv:2207.01567  [pdf, other

    cs.CV cs.AI

    Back to MLP: A Simple Baseline for Human Motion Prediction

    Authors: Wen Guo, Yuming Du, Xi Shen, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer

    Abstract: This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences. State-of-the-art approaches provide good results, however, they rely on deep learning architectures of arbitrary complexity, such as Recurrent Neural Networks(RNN), Transformers or Graph Convolutional Networks(GCN), typically requiring multiple training stage… ▽ More

    Submitted 5 October, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted to WACV 2023; Code available at https://github.com/dulucas/siMLPe

  19. arXiv:2205.06254  [pdf, other

    cs.CV

    Learned Vertex Descent: A New Direction for 3D Human Model Fitting

    Authors: Enric Corona, Gerard Pons-Moll, Guillem Alenyà, Francesc Moreno-Noguer

    Abstract: We propose a novel optimization-based paradigm for 3D human model fitting on images and scans. In contrast to existing approaches that directly regress the parameters of a low-dimensional statistical body model (e.g. SMPL) from input images, we train an ensemble of per-vertex neural fields network. The network predicts, in a distributed manner, the vertex descent direction towards the ground truth… ▽ More

    Submitted 19 July, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Project page: https://www.iri.upc.edu/people/ecorona/lvd/

    Journal ref: ECCV 2022

  20. Single-view 3D Body and Cloth Reconstruction under Complex Poses

    Authors: Nicolas Ugrinovic, Albert Pumarola, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: Recent advances in 3D human shape reconstruction from single images have shown impressive results, leveraging on deep networks that model the so-called implicit function to learn the occupancy status of arbitrarily dense 3D points in space. However, while current algorithms based on this paradigm, like PiFuHD, are able to estimate accurate geometry of the human shape and clothes, they require high… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

  21. arXiv:2204.04913  [pdf, other

    cs.CV

    Permutation-Invariant Relational Network for Multi-person 3D Pose Estimation

    Authors: Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: The recovery of multi-person 3D poses from a single RGB image is a severely ill-conditioned problem due to the inherent 2D-3D depth ambiguity, inter-person occlusions, and body truncations. To tackle these issues, recent works have shown promising results by simultaneously reasoning for different people. However, in most cases this is done by only considering pairwise person interactions, hinderin… ▽ More

    Submitted 31 May, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  22. arXiv:2204.01695  [pdf, other

    cs.CV

    LISA: Learning Implicit Shape and Appearance of Hands

    Authors: Enric Corona, Tomas Hodan, Minh Vo, Francesc Moreno-Noguer, Chris Sweeney, Richard Newcombe, Lingni Ma

    Abstract: This paper proposes a do-it-all neural model of human hands, named LISA. The model can capture accurate hand shape and appearance, generalize to arbitrary hand subjects, provide dense surface correspondences, be reconstructed from images in the wild and easily animated. We train LISA by minimizing the shape and appearance losses on a large set of multi-view RGB image sequences annotated with coars… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Published at CVPR 2022

  23. arXiv:2204.01565  [pdf, other

    cs.CV

    HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE

    Authors: Xiaoyu Bie, Wen Guo, Simon Leglaive, Lauren Girin, Francesc Moreno-Noguer, Xavier Alameda-Pineda

    Abstract: Studies on the automatic processing of 3D human pose data have flourished in the recent past. In this paper, we are interested in the generation of plausible and diverse future human poses following an observed 3D pose sequence. Current methods address this problem by injecting random variables from a single latent space into a deterministic motion prediction framework, which precludes the inheren… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  24. arXiv:2203.10192  [pdf, other

    cs.CV

    Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification

    Authors: Jianxiong Shen, Antonio Agudo, Francesc Moreno-Noguer, Adria Ruiz

    Abstract: A critical limitation of current methods based on Neural Radiance Fields (NeRF) is that they are unable to quantify the uncertainty associated with the learned appearance and geometry of the scene. This information is paramount in real applications such as medical diagnosis or autonomous driving where, to reduce potentially catastrophic failures, the confidence on the model outputs must be include… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  25. arXiv:2201.02017  [pdf, other

    cs.CV

    Enhancing Egocentric 3D Pose Estimation with Third Person Views

    Authors: Ameya Dhamanaskar, Mariella Dimiccoli, Enric Corona, Albert Pumarola, Francesc Moreno-Noguer

    Abstract: In this paper, we propose a novel approach to enhance the 3D body pose estimation of a person computed from videos captured from a single wearable camera. The key idea is to leverage high-level features linking first- and third-views in a joint embedding space. To learn such embedding space we introduce First2Third-Pose, a new paired synchronized dataset of nearly 2,000 videos depicting human acti… ▽ More

    Submitted 15 June, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

  26. arXiv:2111.07195  [pdf, other

    cs.CV

    PhysXNet: A Customizable Approach for LearningCloth Dynamics on Dressed People

    Authors: Jordi Sanchez-Riera, Albert Pumarola, Francesc Moreno-Noguer

    Abstract: We introduce PhysXNet, a learning-based approach to predict the dynamics of deformable clothes given 3D skeleton motion sequences of humans wearing these clothes. The proposed model is adaptable to a large variety of garments and changing topologies, without need of being retrained. Such simulations are typically carried out by physics engines that require manual human expertise and are subjectto… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

  27. arXiv:2111.01884  [pdf, other

    cs.CV

    Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images

    Authors: Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: We address the problem of multi-person 3D body pose and shape estimation from a single image. While this problem can be addressed by applying single-person approaches multiple times for the same scene, recent works have shown the advantages of building upon deep architectures that simultaneously reason about all people in the scene in a holistic manner by enforcing, e.g., depth order constraints o… ▽ More

    Submitted 8 December, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

  28. arXiv:2110.14998  [pdf, other

    cs.RO cs.AI cs.GR eess.SY

    An Adaptable Approach to Learn Realistic Legged Locomotion without Examples

    Authors: Daniel Ordonez-Apraez, Antonio Agudo, Francesc Moreno-Noguer, Mario Martin

    Abstract: Learning controllers that reproduce legged locomotion in nature has been a long-time goal in robotics and computer graphics. While yielding promising results, recent approaches are not yet flexible enough to be applicable to legged systems of different morphologies. This is partly because they often rely on precise motion capture references or elaborate learning environments that ensure the natura… ▽ More

    Submitted 8 February, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted to ICRA 2022

    ACM Class: I.2.9

    Journal ref: 2022 International Conference on Robotics and Automation (ICRA)

  29. arXiv:2110.02903  [pdf, other

    cs.CV

    Grasp-Oriented Fine-grained Cloth Segmentation without Real Supervision

    Authors: Ruijie Ren, Mohit Gurnani Rajesh, Jordi Sanchez-Riera, Fan Zhang, Yurun Tian, Antonio Agudo, Yiannis Demiris, Krystian Mikolajczyk, Francesc Moreno-Noguer

    Abstract: Automatically detecting graspable regions from a single depth image is a key ingredient in cloth manipulation. The large variability of cloth deformations has motivated most of the current approaches to focus on identifying specific gras** points rather than semantic parts, as the appearance and depth variations of local regions are smaller and easier to model than the larger ones. However, task… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: 6 pages, 4 figures. Submitted to International Conference on Robotics and Automation (ICRA)

  30. arXiv:2109.02123  [pdf, other

    cs.CV

    Stochastic Neural Radiance Fields: Quantifying Uncertainty in Implicit 3D Representations

    Authors: Jianxiong Shen, Adria Ruiz, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Neural Radiance Fields (NeRF) has become a popular framework for learning implicit 3D representations and addressing different tasks such as novel-view synthesis or depth-map estimation. However, in downstream applications where decisions need to be made based on automatic predictions, it is critical to leverage the confidence associated with the model estimations. Whereas uncertainty quantificati… ▽ More

    Submitted 28 September, 2021; v1 submitted 5 September, 2021; originally announced September 2021.

  31. arXiv:2108.05465  [pdf, other

    cs.CV

    SIDER: Single-Image Neural Optimization for Facial Geometric Detail Recovery

    Authors: Aggelina Chatziagapi, ShahRukh Athar, Francesc Moreno-Noguer, Dimitris Samaras

    Abstract: We present SIDER(Single-Image neural optimization for facial geometric DEtail Recovery), a novel photometric optimization method that recovers detailed facial geometry from a single image in an unsupervised manner. Inspired by classical techniques of coarse-to-fine optimization and recent advances in implicit neural representations of 3D shape, SIDER combines a geometry prior based on statistical… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: version 1.0.0

  32. arXiv:2107.12512  [pdf, other

    cs.CV cs.AI

    H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

    Authors: Eduard Ramon, Gil Triginer, Janna Escur, Albert Pumarola, Jaime Garcia, Xavier Giro-i-Nieto, Francesc Moreno-Noguer

    Abstract: Recent learning approaches that implicitly represent surface geometry using coordinate-based neural representations have shown impressive results in the problem of multi-view 3D reconstruction. The effectiveness of these techniques is, however, subject to the availability of a large number (several tens) of input views of the scene, and computationally demanding optimizations. In this paper, we ta… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  33. arXiv:2107.03890  [pdf, other

    cs.CV

    Uncertainty-Aware Camera Pose Estimation from Points and Lines

    Authors: Alexander Vakhitov, Luis Ferraz Colomina, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Perspective-n-Point-and-Line (P$n$PL) algorithms aim at fast, accurate, and robust camera localization with respect to a 3D model from 2D-3D feature correspondences, being a major part of modern robotic and AR/VR systems. Current point-based pose estimation methods use only 2D feature detection uncertainties, and the line-based methods do not take uncertainties into account. In our setup, both 3D… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: CVPR 2021

  34. arXiv:2105.08825  [pdf, other

    cs.CV

    Multi-Person Extreme Motion Prediction

    Authors: Wen Guo, Xiaoyu Bie, Xavier Alameda-Pineda, Francesc Moreno-Noguer

    Abstract: Human motion prediction aims to forecast future poses given a sequence of past 3D skeletons. While this problem has recently received increasing attention, it has mostly been tackled for single humans in isolation. In this paper, we explore this problem when dealing with humans performing collaborative tasks, we seek to predict the future motion of two interacted persons given two sequences of the… ▽ More

    Submitted 19 June, 2022; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: CVPR 2022, update results of MSR in Table 3

  35. arXiv:2103.14507  [pdf, other

    cs.GR

    AVATAR: Blender add-on for fast creation of 3D human models

    Authors: Jordi Sanchez-Riera, Aniol Civit, Marta Altarriba, Francesc Moreno-Noguer

    Abstract: Create an articulated and realistic human 3D model is a complicated task, not only get a model with the right body proportions but also to the whole process of rigging the model with correct articulation points and vertices weights. Having a tool that can create such a model with just a few clicks will be very advantageous for amateurs developers to use in their projects, researchers to easily gen… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 7 pages, 2 figures, software description

  36. arXiv:2103.06871  [pdf, other

    cs.CV

    SMPLicit: Topology-aware Generative Model for Clothed People

    Authors: Enric Corona, Albert Pumarola, Guillem Alenyà, Gerard Pons-Moll, Francesc Moreno-Noguer

    Abstract: In this paper we introduce SMPLicit, a novel generative model to jointly represent body pose, shape and clothing geometry. In contrast to existing learning-based approaches that require training specific models for each type of garment, SMPLicit can represent in a unified manner different garment topologies (e.g. from sleeveless tops to hoodies and to open jackets), while controlling other propert… ▽ More

    Submitted 2 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021

  37. arXiv:2103.06498  [pdf, other

    cs.CV cs.AI

    3D Human Pose, Shape and Texture from Low-Resolution Images and Videos

    Authors: Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, Laszlo A. Jeni, Fernando De la Torre

    Abstract: 3D human pose and shape estimation from monocular images has been an active research area in computer vision. Existing deep learning methods for this task rely on high-resolution input, which however, is not always available in many scenarios such as video surveillance and sports broadcasting. Two common approaches to deal with low-resolution images are applying super-resolution techniques to the… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.13666

  38. arXiv:2012.09696  [pdf, other

    cs.RO cs.LG

    Multi-FinGAN: Generative Coarse-To-Fine Sampling of Multi-Finger Grasps

    Authors: Jens Lundell, Enric Corona, Tran Nguyen Le, Francesco Verdoja, Philippe Weinzaepfel, Gregory Rogez, Francesc Moreno-Noguer, Ville Kyrki

    Abstract: While there exists many methods for manipulating rigid objects with parallel-jaw grippers, gras** with multi-finger robotic hands remains a quite unexplored research topic. Reasoning and planning collision-free trajectories on the additional degrees of freedom of several fingers represents an important challenge that, so far, involves computationally costly and slow processes. In this work, we p… ▽ More

    Submitted 15 March, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Accepted to IEEE Conference on Robotics and Automation 2021 (ICRA). Code is available at https://irobotics.aalto.fi/multi-fingan/

  39. arXiv:2012.07999  [pdf, other

    cs.CV

    FaceDet3D: Facial Expressions with 3D Geometric Detail Prediction

    Authors: ShahRukh Athar, Albert Pumarola, Francesc Moreno-Noguer, Dimitris Samaras

    Abstract: Facial Expressions induce a variety of high-level details on the 3D face geometry. For example, a smile causes the wrinkling of cheeks or the formation of dimples, while being angry often causes wrinkling of the forehead. Morphable Models (3DMMs) of the human face fail to capture such fine details in their PCA-based representations and consequently cannot generate such details when used to edit ex… ▽ More

    Submitted 23 December, 2020; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: Fixed errors in acknowledgements

  40. arXiv:2011.13961  [pdf, other

    cs.CV

    D-NeRF: Neural Radiance Fields for Dynamic Scenes

    Authors: Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer

    Abstract: Neural rendering techniques combining machine learning with geometric reasoning have arisen as one of the most promising approaches for synthesizing novel views of a scene from a sparse set of images. Among these, stands out the Neural radiance fields (NeRF), which trains a deep network to map 5D input coordinates (representing spatial location and viewing direction) into a volume density and view… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

  41. arXiv:2010.05302  [pdf, other

    cs.CV

    PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation

    Authors: Wen Guo, Enric Corona, Francesc Moreno-Noguer, Xavier Alameda-Pineda

    Abstract: Recent literature addressed the monocular 3D pose estimation task very satisfactorily. In these studies, different persons are usually treated as independent pose instances to estimate. However, in many every-day situations, people are interacting, and the pose of an individual depends on the pose of his/her interactees. In this paper, we investigate how to exploit this dependency to enhance curre… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted at WACV 2021

  42. arXiv:2009.10521  [pdf, other

    cs.CV

    A survey on Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

    Authors: E. Riba, D. Mishkin, J. Shi, D. Ponsa, F. Moreno-Noguer, G. Bradski

    Abstract: This work presents Kornia, an open source computer vision library built upon a set of differentiable routines and modules that aims to solve generic computer vision problems. The package uses PyTorch as its main backend, not only for efficiency but also to take advantage of the reverse auto-differentiation engine to define and compute the gradient of complex functions. Inspired by OpenCV, Kornia i… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.02190

  43. arXiv:2007.13666  [pdf, other

    cs.CV cs.LG eess.IV

    3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning

    Authors: Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, Laszlo A. Jeni, Fernando De la Torre

    Abstract: 3D human shape and pose estimation from monocular images has been an active area of research in computer vision, having a substantial impact on the development of new applications, from activity recognition to creating virtual avatars. Existing deep learning methods for 3D human shape and pose estimation rely on relatively high-resolution input images; however, high-resolution visual content is no… ▽ More

    Submitted 9 August, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: ECCV 2020, project page: https://sites.google.com/view/xiangyuxu/3d_eccv20

  44. arXiv:2006.12155  [pdf, other

    cs.NE cs.CV cs.LG

    Neural Cellular Automata Manifold

    Authors: Alejandro Hernandez Ruiz, Armand Vilalta, Francesc Moreno-Noguer

    Abstract: Very recently, the Neural Cellular Automata (NCA) has been proposed to simulate the morphogenesis process with deep networks. NCA learns to grow an image starting from a fixed single pixel. In this work, we show that the neural network (NN) architecture of the NCA can be encapsulated in a larger NN. This allows us to propose a new model that encodes a manifold of NCA, each of them capable of gener… ▽ More

    Submitted 2 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

  45. arXiv:2004.10349  [pdf, other

    cs.CV cs.CL

    Textual Visual Semantic Dataset for Text Spotting

    Authors: Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

    Abstract: Text Spotting in the wild consists of detecting and recognizing text appearing in images (e.g. signboards, traffic signals or brands in clothing or objects). This is a challenging problem due to the complexity of the context where texts appear (uneven backgrounds, shading, occlusions, perspective distortions, etc.). Only a few approaches try to exploit the relation between text and its surrounding… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

  46. arXiv:1912.07009  [pdf, other

    cs.CV

    C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds

    Authors: Albert Pumarola, Stefan Popov, Francesc Moreno-Noguer, Vittorio Ferrari

    Abstract: Flow-based generative models have highly desirable properties like exact log-likelihood evaluation and exact latent-variable inference, however they are still in their infancy and have not received as much attention as alternative generative models. In this paper, we introduce C-Flow, a novel conditioning scheme that brings normalizing flows to an entirely new scenario with great possibilities for… ▽ More

    Submitted 3 April, 2020; v1 submitted 15 December, 2019; originally announced December 2019.

  47. arXiv:1910.03336  [pdf, other

    cs.CV cs.LG cs.RO

    Improving Map Re-localization with Deep 'Movable' Objects Segmentation on 3D LiDAR Point Clouds

    Authors: Victor Vaquero, Kai Fischer, Francesc Moreno-Noguer, Alberto Sanfeliu, Stefan Milz

    Abstract: Localization and Map** is an essential component to enable Autonomous Vehicles navigation, and requires an accuracy exceeding that of commercial GPS-based systems. Current odometry and map** algorithms are able to provide this accurate information. However, the lack of robustness of these algorithms against dynamic obstacles and environmental changes, even for short time periods, forces the ge… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

  48. arXiv:1909.07950  [pdf, other

    cs.CL cs.CV

    Semantic Relatedness Based Re-ranker for Text Spotting

    Authors: Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

    Abstract: Applications such as textual entailment, plagiarism detection or document clustering rely on the notion of semantic similarity, and are usually approached with dimension reduction techniques like LDA or with embedding-based neural approaches. We present a scenario where semantic similarity is not enough, and we devise a neural approach to learn semantic relatedness. The scenario is text spotting i… ▽ More

    Submitted 19 September, 2019; v1 submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted by EMNLP 2019

  49. arXiv:1904.04571  [pdf, other

    cs.CV

    3DPeople: Modeling the Geometry of Dressed Humans

    Authors: Albert Pumarola, Jordi Sanchez, Gary P. T. Choi, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: Recent advances in 3D human shape estimation build upon parametric representations that model very well the shape of the naked body, but are not appropriate to represent the clothing geometry. In this paper, we present an approach to model dressed humans and predict their geometry from single images. We contribute in three fundamental aspects of the problem, namely, a new dataset, a novel shape pa… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  50. arXiv:1904.03419  [pdf, other

    cs.CV

    Context-aware Human Motion Prediction

    Authors: Enric Corona, Albert Pumarola, Guillem Alenyà, Francesc Moreno-Noguer

    Abstract: The problem of predicting human motion given a sequence of past observations is at the core of many applications in robotics and computer vision. Current state-of-the-art formulate this problem as a sequence-to-sequence task, in which a historical of 3D skeletons feeds a Recurrent Neural Network (RNN) that predicts future movements, typically in the order of 1 to 2 seconds. However, one aspect tha… ▽ More

    Submitted 23 March, 2020; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: Accepted at CVPR20