
Showing 1–27 of 27 results for author: Ferrer, G

  1. arXiv:2406.00885

    cs.CV cs.RO

    Visual place recognition for aerial imagery: A survey

    Authors: Ivan Moskalenko, Anastasiia Kornilova, Gonzalo Ferrer

    Abstract: Aerial imagery and its direct application to visual localization is an essential problem for many Robotics and Computer Vision tasks. While Global Navigation Satellite Systems (GNSS) are the standard default solution for solving the aerial localization problem, they are subject to a number of limitations, such as signal instability or solution unreliability, that make this option less desirable. Co…

    Submitted 2 June, 2024; originally announced June 2024.

  2. arXiv:2405.02162

    cs.CV cs.AI cs.RO

    Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models

    Authors: Mohamad Al Mdfaa, Raghad Salameh, Sergey Zagoruyko, Gonzalo Ferrer

    Abstract: In the field of robotics and computer vision, efficient and accurate semantic mapping remains a significant challenge due to the growing demand for intelligent machines that can comprehend and interact with complex environments. Conventional panoptic mapping methods, however, are limited by predefined semantic classes, thus making them ineffective for handling novel or unforeseen objects. In respo…

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2307.01069

    cs.CV

    Shi-NeSS: Detecting Good and Stable Keypoints with a Neural Stability Score

    Authors: Konstantin Pakulev, Alexander Vakhitov, Gonzalo Ferrer

    Abstract: Learning a feature point detector presents a challenge both due to the ambiguity of the definition of a keypoint and, correspondingly, the need for specially prepared ground truth labels for such points. In our work, we address both of these issues by utilizing a combination of a hand-crafted Shi detector and a neural network. We build on the principled and localized keypoints provided by the Shi…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 10 pages, 4 figures

  4. arXiv:2305.02859

    cs.RO

    Social Robot Navigation through Constrained Optimization: a Comparative Study of Uncertainty-based Objectives and Constraints

    Authors: Timur Akhtyamov, Aleksandr Kashirin, Aleksey Postnikov, Gonzalo Ferrer

    Abstract: This work is dedicated to the study of how uncertainty estimation of human motion prediction can be embedded into constrained optimization techniques, such as Model Predictive Control (MPC), for social robot navigation. We propose several cost objectives and constraint functions obtained from the uncertainty of predicting pedestrian positions and related to the probability of the collision…

    Submitted 17 July, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  5. arXiv:2304.05342

    cs.RO

    TT-SDF2PC: Registration of Point Cloud and Compressed SDF Directly in the Memory-Efficient Tensor Train Domain

    Authors: Alexey I. Boyko, Anastasiia Kornilova, Rahim Tariverdizadeh, Mirfarid Musavian, Larisa Markeeva, Ivan Oseledets, Gonzalo Ferrer

    Abstract: This paper addresses the following research question: "can one compress a detailed 3D representation and use it directly for point cloud registration?". Map compression of the scene can be achieved by the tensor train (TT) decomposition of the signed distance function (SDF) representation. It regulates the amount of data reduced by the so-called TT-ranks. Using this representation we have prop…

    Submitted 11 April, 2023; originally announced April 2023.

  6. arXiv:2304.01055

    cs.RO

    Eigen-Factors an Alternating Optimization for Back-end Plane SLAM of 3D Point Clouds

    Authors: Gonzalo Ferrer, Dmitrii Iarosh, Anastasiia Kornilova

    Abstract: Modern depth sensors can generate a huge number of 3D points in a few seconds to be later processed by Localization and Mapping algorithms. Ideally, these algorithms should efficiently handle large Point Clouds under the assumption that using more points implies more information available. The Eigen Factors (EF) is a new algorithm that solves SLAM by using planes as the main geometric prim…

    Submitted 4 September, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  7. arXiv:2303.05162

    cs.CV cs.RO

    EVOLIN Benchmark: Evaluation of Line Detection and Association

    Authors: Kirill Ivanov, Gonzalo Ferrer, Anastasiia Kornilova

    Abstract: Lines are interesting geometrical features commonly seen in indoor and urban environments. A complete benchmark is missing where one can evaluate lines from a sequential stream of images in all its stages: Line Detection, Line Association and Pose Error. To do so, we present a complete and exhaustive benchmark for visual lines in a SLAM front-end, both for RGB and RGBD, by providing a pletho…

    Submitted 31 July, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  8. arXiv:2303.05123

    cs.CV cs.RO

    Dominating Set Database Selection for Visual Place Recognition

    Authors: Anastasiia Kornilova, Ivan Moskalenko, Timofei Pushkin, Fakhriddin Tojiboev, Rahim Tariverdizadeh, Gonzalo Ferrer

    Abstract: This paper presents an approach for creating a visual place recognition (VPR) database for localization in indoor environments from RGBD scanning sequences. The proposed approach is formulated as a minimization problem in terms of a dominating set algorithm for a graph constructed from spatial information, and is referred to as DominatingSet. Our algorithm shows better scene coverage in comparison to other…

    Submitted 21 January, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

  9. arXiv:2301.07433

    cs.RO cs.LG

    DDPEN: Trajectory Optimisation With Sub Goal Generation Model

    Authors: Aleksander Gamayunov, Aleksey Postnikov, Gonzalo Ferrer

    Abstract: Differential dynamic programming (DDP) is a widely used and powerful trajectory optimization technique; however, due to its internal structure, it is not exempt from local minima. In this paper, we present Differential Dynamic Programming with Escape Network (DDPEN) - a novel approach to avoid DDP local minima by utilising an additional term used in the optimization criteria pointing towards the d…

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: 4 pages, 6 figures, IROS2022 Workshop: Artificial Intelligence for Social Robots Interacting with Humans in the Real World [intellect4hri]

  10. arXiv:2209.08895

    cs.RO

    Best Axes Composition Extended: Multiple Gyroscopes and Accelerometers Data Fusion to Reduce Systematic Error

    Authors: Marsel Faizullin, Gonzalo Ferrer

    Abstract: Multiple rigidly attached Inertial Measurement Unit (IMU) sensors provide a richer flow of data compared to a single IMU. State-of-the-art methods follow a probabilistic model of IMU measurements based on the random nature of errors combined under a Bayesian framework. However, affordable low-grade IMUs, in addition, suffer from systematic errors due to their imperfections not covered by their cor…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted to Robotics and Autonomous Systems journal. arXiv admin note: substantial text overlap with arXiv:2107.02632

  11. arXiv:2208.01421

    cs.CV

    T4DT: Tensorizing Time for Learning Temporal 3D Visual Data

    Authors: Mikhail Usvyatsov, Rafael Ballester-Rippoll, Lina Bashaeva, Konrad Schindler, Gonzalo Ferrer, Ivan Oseledets

    Abstract: Unlike 2D raster images, there is no single dominant representation for 3D visual data processing. Different formats like point clouds, meshes, or implicit functions each have their strengths and weaknesses. Still, grid representations such as signed distance functions have attractive properties also in 3D. In particular, they offer constant-time random access and are eminently suitable for modern…

    Submitted 5 October, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

  12. arXiv:2206.14442

    cs.RO cs.AI

    Conditioned Human Trajectory Prediction using Iterative Attention Blocks

    Authors: Aleksey Postnikov, Aleksander Gamayunov, Gonzalo Ferrer

    Abstract: Human motion prediction is key to understanding social environments, with direct applications in robotics, surveillance, etc. We present a simple yet effective pedestrian trajectory prediction model aimed at predicting pedestrian positions in urban-like environments, conditioned on the environment: the map and surrounding agents. Our model is a neural-based architecture that can run several layers of attent…

    Submitted 29 June, 2022; originally announced June 2022.

  13. arXiv:2204.10211

    cs.CV

    SmartPortraits: Depth Powered Handheld Smartphone Dataset of Human Portraits for State Estimation, Reconstruction and Synthesis

    Authors: Anastasiia Kornilova, Marsel Faizullin, Konstantin Pakulev, Andrey Sadkov, Denis Kukushkin, Azat Akhmetyanov, Timur Akhtyamov, Hekmat Taherinejad, Gonzalo Ferrer

    Abstract: We present a dataset of 1000 video sequences of human portraits recorded in real and uncontrolled conditions by using a handheld smartphone accompanied by an external high-quality depth camera. The collected dataset contains 200 people captured in different poses and locations and its main purpose is to bridge the gap between raw measurements obtained from a smartphone and downstream applications,…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR'2022

  14. arXiv:2204.05799

    cs.CV cs.RO

    EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data

    Authors: Anastasiia Kornilova, Dmitrii Iarosh, Denis Kukushkin, Nikolai Goncharov, Pavel Mokeev, Arthur Saliou, Gonzalo Ferrer

    Abstract: This paper provides the EVOPS dataset for plane segmentation from 3D data, both from RGBD images and LiDAR point clouds. We have designed two annotation methodologies (RGBD and LiDAR) running on well-known and widely-used datasets for SLAM evaluation and we have provided a complete set of benchmarking tools including point, planes and segmentation metrics. The data includes a total number of 10k R…

    Submitted 24 August, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted to IROS'2022

  15. arXiv:2112.04350

    cs.RO cs.CV

    Transformer based trajectory prediction

    Authors: Aleksey Postnikov, Aleksander Gamayunov, Gonzalo Ferrer

    Abstract: To plan a safe and efficient route, an autonomous vehicle should anticipate future motions of other agents around it. Motion prediction is an extremely challenging task which recently gained significant attention of the research community. In this work, we present a simple and yet strong baseline for uncertainty aware motion prediction based purely on transformer neural networks, which has shown i…

    Submitted 8 December, 2021; originally announced December 2021.

  16. SmartDepthSync: Open Source Synchronized Video Recording System of Smartphone RGB and Depth Camera Range Image Frames with Sub-millisecond Precision

    Authors: Marsel Faizullin, Anastasiia Kornilova, Azat Akhmetyanov, Konstantin Pakulev, Andrey Sadkov, Gonzalo Ferrer

    Abstract: Nowadays, smartphones can produce a synchronized (synced) stream of high-quality data, including RGB images, inertial measurements, and other data. Therefore, smartphones are becoming appealing sensor systems in the robotics community. Unfortunately, there is still the need for external supporting sensing hardware, such as a depth camera precisely synced with the smartphone sensors. In this pape…

    Submitted 13 September, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: IEEE Sensors Journal paper

  17. arXiv:2109.02965

    cs.CV cs.RO

    CovarianceNet: Conditional Generative Model for Correct Covariance Prediction in Human Motion Prediction

    Authors: Aleksey Postnikov, Aleksander Gamayunov, Gonzalo Ferrer

    Abstract: The correct characterization of uncertainty when predicting human motion is equally important as the accuracy of this prediction. We present a new method to correctly predict the uncertainty associated with the predicted distribution of future trajectories. Our approach, CovarianceNet, is based on a Conditional Generative Model with Gaussian latent variables in order to predict the parameters of a…

    Submitted 7 September, 2021; originally announced September 2021.

  18. arXiv:2108.01654

    cs.RO cs.CV

    Comparison of modern open-source visual SLAM approaches

    Authors: Dinar Sharafutdinov, Mark Griguletskii, Pavel Kopanev, Mikhail Kurenkov, Gonzalo Ferrer, Aleksey Burkov, Aleksei Gonnochenko, Dzmitry Tsetserukou

    Abstract: SLAM is one of the most fundamental areas of research in robotics and computer vision. State-of-the-art solutions have advanced significantly in terms of accuracy and stability. Unfortunately, not all the approaches are available as open-source solutions and free to use. The results of some of them are difficult to reproduce, and there is a lack of comparison on common datasets. In our work, we mak…

    Submitted 4 February, 2023; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: Preprint, 19 pages

  19. arXiv:2107.02632

    cs.RO

    Best Axes Composition: Multiple Gyroscopes IMU Sensor Fusion to Reduce Systematic Error

    Authors: Marsel Faizullin, Gonzalo Ferrer

    Abstract: In this paper, we propose an algorithm to combine multiple cheap Inertial Measurement Unit (IMU) sensors to calculate 3D-orientations accurately. Our approach takes into account the inherent and non-negligible systematic error in the gyroscope model and provides a solution based on the error observed during previous instants of time. Our algorithm, the Best Axes Composition (BAC), chooses dynamica…

    Submitted 22 July, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: Accepted for the 10th European Conference on Mobile Robots (ECMR 2021)

  20. arXiv:2107.02625

    cs.RO

    Open-Source LiDAR Time Synchronization System by Mimicking GNSS-clock

    Authors: Marsel Faizullin, Anastasiia Kornilova, Gonzalo Ferrer

    Abstract: Data fusion algorithms that employ LiDAR measurements, such as Visual-LiDAR, LiDAR-Inertial, or Multiple LiDAR Odometry and simultaneous localization and mapping (SLAM) rely on precise timestamping schemes that grant synchronicity to data from LiDAR and other sensors. Poor synchronization performance, due to incorrect timestamping procedure, may negatively affect the algorithms' state estimation r…

    Submitted 13 September, 2022; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: Accepted to IEEE ISPCS 2022 Conference (International Symposium on Precision Clock Synchronization for Measurement, Control and Communication)

  21. arXiv:2107.00987

    cs.CV

    Sub-millisecond Video Synchronization of Multiple Android Smartphones

    Authors: Azat Akhmetyanov, Anastasiia Kornilova, Marsel Faizullin, David Pozo, Gonzalo Ferrer

    Abstract: This paper addresses the problem of building an affordable easy-to-setup synchronized multi-view camera system, which is in demand for many Computer Vision and Robotics applications in high-dynamic environments. In our work, we propose a solution for this problem -- a publicly-available Android application for synchronized video recording on multiple smartphones with sub-millisecond accuracy. We p…

    Submitted 26 August, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: Accepted to conference IEEE Sensors'2021 as Lecture presentation

  22. Be your own Benchmark: No-Reference Trajectory Metric on Registered Point Clouds

    Authors: Anastasiia Kornilova, Gonzalo Ferrer

    Abstract: This paper addresses the problem of assessing trajectory quality in conditions when no ground truth poses are available or when their accuracy is not enough for the specific task - for example, small-scale mapping in outdoor scenes. In our work, we propose a no-reference metric, Mutually Orthogonal Metric (MOM), that estimates the quality of the map from registered point clouds via the trajectory…

    Submitted 12 August, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Accepted for the 10th European Conference on Mobile Robots (ECMR 2021)

  23. arXiv:2012.09963

    cs.CV

    Relightable 3D Head Portraits from a Smartphone Video

    Authors: Artem Sevastopolsky, Savva Ignatiev, Gonzalo Ferrer, Evgeny Burnaev, Victor Lempitsky

    Abstract: In this work, a system for creating a relightable 3D portrait of a human head is presented. Our neural pipeline operates on a sequence of frames captured by a smartphone camera with the flash blinking (flash-no flash sequence). A coarse point cloud reconstructed via structure-from-motion software and multi-view denoising is then used as a geometric proxy. Afterwards, a deep rendering network is tr…

    Submitted 17 December, 2020; originally announced December 2020.

  24. arXiv:2011.00594

    cs.RO

    Random Fourier Features based SLAM

    Authors: Yermek Kapushev, Anastasia Kishkun, Gonzalo Ferrer, Evgeny Burnaev

    Abstract: This work is dedicated to simultaneous continuous-time trajectory estimation and mapping based on Gaussian Processes (GP). State-of-the-art GP-based models for Simultaneous Localization and Mapping (SLAM) are computationally efficient but can only be used with a restricted class of kernel functions. This paper provides the algorithm based on GP with Random Fourier Features (RFF) approximation for…

    Submitted 6 September, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

  25. arXiv:2009.04299

    cs.CV cs.RO

    HSFM-$Σ$nn: Combining a Feedforward Motion Prediction Network and Covariance Prediction

    Authors: A. Postnikov, A. Gamayunov, G. Ferrer

    Abstract: In this paper, we propose a new method for motion prediction: HSFM-$Σ$nn. Our proposed method combines two different approaches: a feedforward network whose layers are model-based transition functions using the HSFM and a Neural Network (NN), on each of these layers, for covariance prediction. We will compare our method with classical methods for covariance estimation showing their limitations. We…

    Submitted 9 September, 2020; originally announced September 2020.

  26. arXiv:1609.01176

    cs.LG stat.AP

    The Player Kernel: Learning Team Strengths Based on Implicit Player Contributions

    Authors: Lucas Maystre, Victor Kristof, Antonio J. González Ferrer, Matthias Grossglauser

    Abstract: In this work, we draw attention to a connection between skill-based models of game outcomes and Gaussian process classification models. The Gaussian process perspective enables a) a principled way of dealing with uncertainty and b) rich models, specified through kernel functions. Using this connection, we tackle the problem of predicting outcomes of football matches between national teams. We deve…

    Submitted 5 September, 2016; originally announced September 2016.

  27. arXiv:1602.08158

    cs.RO

    Associative Memories and Human-Robot Social Interaction

    Authors: Gabriel J. Ferrer

    Abstract: In this position paper, we discuss how the use of a cognitive architecture based on unsupervised clustering (the Kohonen Self-Organizing Map) enables us to meet our goals of efficient action selection in a mobile robot. This architecture provides several opportunities for human-robot interaction, and we discuss how its features facilitate these interactions.

    Submitted 25 February, 2016; originally announced February 2016.

    Comments: Presented at "2nd Workshop on Cognitive Architectures for Social Human-Robot Interaction 2016" (arXiv:1602.01868)

    Report number: CogArch4sHRI/2016/03