Skip to main content

Showing 1–9 of 9 results for author: Villar-Corrales, A

.
  1. arXiv:2405.19921  [pdf, other

    cs.CV cs.RO

    MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion

    Authors: Angel Villar-Corrales, Moritz Austermann, Sven Behnke

    Abstract: Autonomous systems, such as self-driving cars, rely on reliable semantic environment perception for decision making. Despite great advances in video semantic segmentation, existing approaches ignore important inductive biases and lack structured and interpretable internal representations. In this work, we propose MCDS-VSS, a structured filter model that learns in a self-supervised manner to estima… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2401.05909  [pdf, other

    cs.RO

    RoboCup 2023 Humanoid AdultSize Winner NimbRo: NimbRoNet3 Visual Perception and Responsive Gait with Waveform In-walk Kicks

    Authors: Dmytro Pavlichenko, Grzegorz Ficht, Angel Villar-Corrales, Luis Denninger, Julia Brocker, Tim Sinen, Michael Schreiber, Sven Behnke

    Abstract: The RoboCup Humanoid League holds annual soccer robot world championships towards the long-term objective of winning against the FIFA world champions by 2050. The participating teams continuously improve their systems. This paper presents the upgrades to our humanoid soccer system, leading our team NimbRo to win the Soccer Tournament in the Humanoid AdultSize League at RoboCup 2023 in Bordeaux, Fr… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted for: RoboCup 2023: Robot World Cup XXVI, LNCS, Springer, to appear 2024

  3. arXiv:2302.11850  [pdf, other

    cs.CV

    Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions

    Authors: Angel Villar-Corrales, Ismail Wahdan, Sven Behnke

    Abstract: We propose a novel framework for the task of object-centric video prediction, i.e., extracting the compositional structure of a video sequence, as well as modeling objects dynamics and interactions from visual observations in order to predict the future object states, from which we can then generate subsequent video frames. With the goal of learning meaningful spatio-temporal object representation… ▽ More

    Submitted 31 July, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted for publication at IEEE International Conference on Image Processing (ICIP) 2023

  4. arXiv:2302.02956  [pdf, other

    cs.RO

    RoboCup 2022 AdultSize Winner NimbRo: Upgraded Perception, Capture Steps Gait and Phase-based In-walk Kicks

    Authors: Dmytro Pavlichenko, Grzegorz Ficht, Arash Amini, Mojtaba Hosseini, Raphael Memmesheimer, Angel Villar-Corrales, Stefan M. Schulz, Marcell Missura, Maren Bennewitz, Sven Behnke

    Abstract: Beating the human world champions by 2050 is an ambitious goal of the Humanoid League that provides a strong incentive for RoboCup teams to further improve and develop their systems. In this paper, we present upgrades of our system which enabled our team NimbRo to win the Soccer Tournament, the Drop-in Games, and the Technical Challenges in the Humanoid AdultSize League of RoboCup 2022. Strong per… ▽ More

    Submitted 7 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Journal ref: In: RoboCup 2022: Robot World Cup XXV. LNCS 13561, Springer, May 2023

  5. arXiv:2203.09303  [pdf, other

    cs.CV cs.RO

    MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks

    Authors: Angel Villar-Corrales, Ani Karapetyan, Andreas Boltres, Sven Behnke

    Abstract: Autonomous systems not only need to understand their current environment, but should also be able to predict future actions conditioned on past states, for instance based on captured camera frames. However, existing models mainly focus on forecasting future video frames for short time-horizons, hence being of limited use for long-term action planning. We propose Multi-Scale Hierarchical Prediction… ▽ More

    Submitted 9 November, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

  6. arXiv:2110.03473  [pdf, other

    cs.CV cs.LG

    Unsupervised Image Decomposition with Phase-Correlation Networks

    Authors: Angel Villar-Corrales, Sven Behnke

    Abstract: The ability to decompose scenes into their object components is a desired property for autonomous agents, allowing them to reason and act in their surroundings. Recently, different methods have been proposed to learn object-centric representations from data in an unsupervised manner. These methods often rely on latent representations learned by deep neural networks, hence requiring high computatio… ▽ More

    Submitted 10 January, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

  7. arXiv:2102.05105  [pdf, other

    cs.CV cs.MM eess.IV

    Deep learning architectural designs for super-resolution of noisy images

    Authors: Angel Villar-Corrales, Franziska Schirrmacher, Christian Riess

    Abstract: Recent advances in deep learning have led to significant improvements in single image super-resolution (SR) research. However, due to the amplification of noise during the upsampling steps, state-of-the-art methods often fail at reconstructing high-resolution images from noisy versions of their low-resolution counterparts. However, this is especially important for images from unknown cameras with… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

  8. Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning

    Authors: Prathmesh Madhu, Angel Villar-Corrales, Ronak Kosti, Torsten Bendschus, Corinna Reinhardt, Peter Bell, Andreas Maier, Vincent Christlein

    Abstract: Human pose estimation (HPE) is a central part of understanding the visual narration and body movements of characters depicted in artwork collections, such as Greek vase paintings. Unfortunately, existing HPE methods do not generalise well across domains resulting in poorly recognized poses. Therefore, we propose a two step approach: (1) adapting a dataset of natural images of known person and pose… ▽ More

    Submitted 25 February, 2024; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: Link to the repository containing the code to reproduce the experiments. For further details, please read the README. Link: https://anonymous.4open.science/r/3b1bd8ac-bd3a-4df6-8671-56d4f9bdbd8d/

    Journal ref: J. Comput. Cult. Herit. 16, 1, Article 16 (March 2023), 17 pages

  9. Scattering Transform Based Image Clustering using Projection onto Orthogonal Complement

    Authors: Angel Villar-Corrales, Veniamin I. Morgenshtern

    Abstract: In the last few years, large improvements in image clustering have been driven by the recent advances in deep learning. However, due to the architectural complexity of deep neural networks, there is no mathematical theory that explains the success of deep clustering techniques. In this work we introduce Projected-Scattering Spectral Clustering (PSSC), a state-of-the-art, stable, and fast algorithm… ▽ More

    Submitted 24 November, 2020; v1 submitted 23 November, 2020; originally announced November 2020.