Skip to main content

Showing 1–18 of 18 results for author: Agudo, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19852  [pdf, other

    cs.CV cs.MA

    FootBots: A Transformer-based Architecture for Motion Prediction in Soccer

    Authors: Guillem Capellera, Luis Ferraz, Antonio Rubio, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Motion prediction in soccer involves capturing complex dynamics from player and ball interactions. We present FootBots, an encoder-decoder transformer-based architecture addressing motion prediction and conditioned motion prediction through equivariance properties. FootBots captures temporal and social dynamics using set attention blocks and multi-attention block decoder. Our evaluation utilizes t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Published as a conference paper at IEEE ICIP 2024

  2. arXiv:2406.06165  [pdf, other

    cs.CV cs.AI cs.IT cs.LG

    Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios

    Authors: Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo

    Abstract: Rate-distortion optimization through neural networks has accomplished competitive results in compression efficiency and image quality. This learning-based approach seeks to minimize the compromise between compression rate and reconstructed image quality by automatically extracting and retaining crucial information, while discarding less critical details. A successful technique consists in introduc… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to ICIP 2024

  3. arXiv:2404.08401  [pdf, other

    cs.CV cs.AI

    No Bells, Just Whistles: Sports Field Registration by Leveraging Geometric Properties

    Authors: Marc Gutiérrez-Pérez, Antonio Agudo

    Abstract: Broadcast sports field registration is traditionally addressed as a homography estimation task, map** the visible image area to a planar field model, predominantly focusing on the main camera shot. Addressing the shortcomings of previous approaches, we propose a novel calibration pipeline enabling camera calibration using a 3D soccer field model and extending the process to assess the multiple-v… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted in CVPRW 2024

  4. arXiv:2402.15552  [pdf, other

    cs.RO cs.AI eess.SY

    Morphological Symmetries in Robotics

    Authors: Daniel Ordoñez-Apraez, Giulio Turrisi, Vladimir Kostic, Mario Martin, Antonio Agudo, Francesc Moreno-Noguer, Massimiliano Pontil, Claudio Semini, Carlos Mastalli

    Abstract: We present a comprehensive framework for studying and leveraging morphological symmetries in robotic systems. These are intrinsic properties of the robot's morphology, frequently observed in animal biology and robotics, which stem from the replication of kinematic structures and the symmetrical distribution of mass. We illustrate how these symmetries extend to the robot's state space and both prop… ▽ More

    Submitted 4 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 18 pages, 11 figures

  5. arXiv:2312.08291  [pdf, other

    cs.CV

    VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

    Authors: Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Previous works on Human Pose and Shape Estimation (HPSE) from RGB images can be broadly categorized into two main groups: parametric and non-parametric approaches. Parametric techniques leverage a low-dimensional statistical body model for realistic results, whereas recent non-parametric methods achieve higher precision by directly regressing the 3D coordinates of the human body mesh. This work in… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  6. arXiv:2306.14810  [pdf, other

    cs.CV cs.LG eess.IV

    Robust Wind Turbine Blade Segmentation from RGB Images in the Wild

    Authors: Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo

    Abstract: With the relentless growth of the wind industry, there is an imperious need to design automatic data-driven solutions for wind turbine maintenance. As structural health monitoring mainly relies on visual inspections, the first stage in any automatic solution is to identify the blade region on the image. Thus, we propose a novel segmentation algorithm that strengthens the U-Net results by a tailore… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted to ICIP 2023

  7. arXiv:2302.10433  [pdf, other

    cs.RO cs.LG eess.SY

    On discrete symmetries of robotics systems: A group-theoretic and data-driven analysis

    Authors: Daniel Ordonez-Apraez, Mario Martin, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: We present a comprehensive study on discrete morphological symmetries of dynamical systems, which are commonly observed in biological and artificial locomoting systems, such as legged, swimming, and flying animals/robots/virtual characters. These symmetries arise from the presence of one or more planes/axis of symmetry in the system's morphology, resulting in harmonious duplication and distributio… ▽ More

    Submitted 7 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 8 pages, 4 figures, 7 optional appendix pages, 4 appendix figures

    MSC Class: 37J15; ACM Class: J.2

    Journal ref: Robotics: Science and System 2023

  8. arXiv:2204.04913  [pdf, other

    cs.CV

    Permutation-Invariant Relational Network for Multi-person 3D Pose Estimation

    Authors: Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: The recovery of multi-person 3D poses from a single RGB image is a severely ill-conditioned problem due to the inherent 2D-3D depth ambiguity, inter-person occlusions, and body truncations. To tackle these issues, recent works have shown promising results by simultaneously reasoning for different people. However, in most cases this is done by only considering pairwise person interactions, hinderin… ▽ More

    Submitted 31 May, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  9. arXiv:2203.10192  [pdf, other

    cs.CV

    Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification

    Authors: Jianxiong Shen, Antonio Agudo, Francesc Moreno-Noguer, Adria Ruiz

    Abstract: A critical limitation of current methods based on Neural Radiance Fields (NeRF) is that they are unable to quantify the uncertainty associated with the learned appearance and geometry of the scene. This information is paramount in real applications such as medical diagnosis or autonomous driving where, to reduce potentially catastrophic failures, the confidence on the model outputs must be include… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  10. arXiv:2111.01884  [pdf, other

    cs.CV

    Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images

    Authors: Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: We address the problem of multi-person 3D body pose and shape estimation from a single image. While this problem can be addressed by applying single-person approaches multiple times for the same scene, recent works have shown the advantages of building upon deep architectures that simultaneously reason about all people in the scene in a holistic manner by enforcing, e.g., depth order constraints o… ▽ More

    Submitted 8 December, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

  11. arXiv:2110.14998  [pdf, other

    cs.RO cs.AI cs.GR eess.SY

    An Adaptable Approach to Learn Realistic Legged Locomotion without Examples

    Authors: Daniel Ordonez-Apraez, Antonio Agudo, Francesc Moreno-Noguer, Mario Martin

    Abstract: Learning controllers that reproduce legged locomotion in nature has been a long-time goal in robotics and computer graphics. While yielding promising results, recent approaches are not yet flexible enough to be applicable to legged systems of different morphologies. This is partly because they often rely on precise motion capture references or elaborate learning environments that ensure the natura… ▽ More

    Submitted 8 February, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted to ICRA 2022

    ACM Class: I.2.9

    Journal ref: 2022 International Conference on Robotics and Automation (ICRA)

  12. arXiv:2110.02903  [pdf, other

    cs.CV

    Grasp-Oriented Fine-grained Cloth Segmentation without Real Supervision

    Authors: Ruijie Ren, Mohit Gurnani Rajesh, Jordi Sanchez-Riera, Fan Zhang, Yurun Tian, Antonio Agudo, Yiannis Demiris, Krystian Mikolajczyk, Francesc Moreno-Noguer

    Abstract: Automatically detecting graspable regions from a single depth image is a key ingredient in cloth manipulation. The large variability of cloth deformations has motivated most of the current approaches to focus on identifying specific gras** points rather than semantic parts, as the appearance and depth variations of local regions are smaller and easier to model than the larger ones. However, task… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: 6 pages, 4 figures. Submitted to International Conference on Robotics and Automation (ICRA)

  13. arXiv:2109.02123  [pdf, other

    cs.CV

    Stochastic Neural Radiance Fields: Quantifying Uncertainty in Implicit 3D Representations

    Authors: Jianxiong Shen, Adria Ruiz, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Neural Radiance Fields (NeRF) has become a popular framework for learning implicit 3D representations and addressing different tasks such as novel-view synthesis or depth-map estimation. However, in downstream applications where decisions need to be made based on automatic predictions, it is critical to leverage the confidence associated with the model estimations. Whereas uncertainty quantificati… ▽ More

    Submitted 28 September, 2021; v1 submitted 5 September, 2021; originally announced September 2021.

  14. arXiv:2107.03890  [pdf, other

    cs.CV

    Uncertainty-Aware Camera Pose Estimation from Points and Lines

    Authors: Alexander Vakhitov, Luis Ferraz Colomina, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Perspective-n-Point-and-Line (P$n$PL) algorithms aim at fast, accurate, and robust camera localization with respect to a 3D model from 2D-3D feature correspondences, being a major part of modern robotic and AR/VR systems. Current point-based pose estimation methods use only 2D feature detection uncertainties, and the line-based methods do not take uncertainties into account. In our setup, both 3D… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: CVPR 2021

  15. arXiv:2101.06773  [pdf, other

    cs.CV

    Generating Attribution Maps with Disentangled Masked Backpropagation

    Authors: Adria Ruiz, Antonio Agudo, Francesc Moreno

    Abstract: Attribution map visualization has arisen as one of the most effective techniques to understand the underlying inference process of Convolutional Neural Networks. In this task, the goal is to compute an score for each image pixel related with its contribution to the final network output. In this paper, we introduce Disentangled Masked Backpropagation (DMBP), a novel gradient-based method that lever… ▽ More

    Submitted 30 August, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Journal ref: Internation Conference on Computer Vision (ICCV), 2021

  16. arXiv:1809.10305  [pdf, other

    cs.CV

    Geometry-Aware Network for Non-Rigid Shape Prediction from a Single View

    Authors: Albert Pumarola, Antonio Agudo, Lorenzo Porzi, Alberto Sanfeliu, Vincent Lepetit, Francesc Moreno-Noguer

    Abstract: We propose a method for predicting the 3D shape of a deformable surface from a single view. By contrast with previous approaches, we do not need a pre-registered template of the surface, and our method is robust to the lack of texture and partial occlusions. At the core of our approach is a {\it geometry-aware} deep architecture that tackles the problem as usually done in analytic solutions: first… ▽ More

    Submitted 26 September, 2018; originally announced September 2018.

    Comments: Accepted at CVPR 2018

  17. arXiv:1809.10280  [pdf, other

    cs.CV

    Unsupervised Person Image Synthesis in Arbitrary Poses

    Authors: Albert Pumarola, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: We present a novel approach for synthesizing photo-realistic images of people in arbitrary poses using generative adversarial learning. Given an input image of a person and a desired pose represented by a 2D skeleton, our model renders the image of the same person under the new pose, synthesizing novel views of the parts visible in the input image and hallucinating those that are not seen. This pr… ▽ More

    Submitted 26 September, 2018; originally announced September 2018.

    Comments: Accepted as Spotlight at CVPR 2018

  18. arXiv:1807.09251  [pdf, other

    cs.CV

    GANimation: Anatomically-aware Facial Animation from a Single Image

    Authors: Albert Pumarola, Antonio Agudo, Aleix M. Martinez, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: Recent advances in Generative Adversarial Networks (GANs) have shown impressive results for task of facial expression synthesis. The most successful architecture is StarGAN, that conditions GANs generation process with images of a specific domain, namely a set of images of persons sharing the same expression. While effective, this approach can only generate a discrete number of expressions, determ… ▽ More

    Submitted 28 August, 2018; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: Accepted as oral at ECCV 2018. Code available at https://github.com/albertpumarola/GANimation. Added minor updates