Showing 1–33 of 33 results for author: Tardos, J

  1. arXiv:2405.16932  [pdf, other]

    cs.RO cs.CV

    CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy

    Authors: Richard Elvira, Juan D. Tardós, José M. M. Montiel

    Abstract: Monocular visual simultaneous localization and mapping (V-SLAM) is nowadays an irreplaceable tool in mobile robotics and augmented reality, where it performs robustly. However, human colonoscopies pose formidable challenges like occlusions, blur, light changes, lack of texture, deformation, water jets or tool interaction, which result in very frequent tracking losses. ORB-SLAM3, the top performing…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 10 pages, 10 figures, 6 tables, under revision

    ACM Class: I.4.9

  2. arXiv:2309.02777  [pdf, other]

    cs.CV

    LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline

    Authors: Víctor M. Batlle, José M. M. Montiel, Pascal Fua, Juan D. Tardós

    Abstract: We propose a new approach to 3D reconstruction from sequences of images acquired by monocular endoscopes. It is based on two key insights. First, endoluminal cavities are watertight, a property naturally enforced by modeling them in terms of a signed distance function. Second, the scene illumination is variable. It comes from the endoscope's light sources and decays with the inverse of the squared…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 12 pages, 7 figures, 1 table, submitted to MICCAI 2023

  3. arXiv:2308.10525  [pdf, other]

    cs.CV

    LightDepth: Single-View Depth Self-Supervision from Illumination Decline

    Authors: Javier Rodríguez-Puigvert, Víctor M. Batlle, J. M. M. Montiel, Ruben Martinez-Cantin, Pascal Fua, Juan D. Tardós, Javier Civera

    Abstract: Single-view depth estimation can be remarkably effective if there is enough ground-truth depth data for supervised training. However, there are scenarios, especially in medicine in the case of endoscopies, where such data cannot be obtained. In such cases, multi-view self-supervision and synthetic-to-real transfer serve as alternative approaches, however, with a considerable performance reduction…

    Submitted 19 September, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  4. arXiv:2308.04036  [pdf, other]

    cs.RO

    NR-SLAM: Non-Rigid Monocular SLAM

    Authors: Juan J. Gomez Rodriguez, J. M. M Montiel, Juan D. Tardos

    Abstract: In this paper we present NR-SLAM, a novel non-rigid monocular SLAM system founded on the combination of a Dynamic Deformation Graph with a Visco-Elastic deformation model. The former enables our system to represent the dynamics of the deforming environment as the camera explores, while the latter allows us to model general deformations in a simple way. The presented system is able to automatically…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 12 pages, 7 figures, submitted to the IEEE Transactions on Robotics (T-RO)

  5. arXiv:2305.15118  [pdf, other]

    cs.LG cs.CY cs.DS

    Fairness in Streaming Submodular Maximization over a Matroid Constraint

    Authors: Marwa El Halabi, Federico Fusco, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski

    Abstract: Streaming submodular maximization is a natural model for the task of selecting a representative subset from a large-scale dataset. If datapoints have sensitive attributes such as gender or race, it becomes important to enforce fairness to avoid bias and discrimination. This has spurred significant interest in developing fair machine learning algorithms. Recently, such algorithms have been develope…

    Submitted 19 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to ICML 23

  6. arXiv:2305.05546  [pdf, other]

    cs.CV

    ColonMapper: topological mapping and localization for colonoscopy

    Authors: Javier Morlana, Juan D. Tardós, J. M. M. Montiel

    Abstract: We propose a topological mapping and localization system able to operate on real human colonoscopies, despite significant shape and illumination changes. The map is a graph where each node codes a colon location by a set of real images, while edges represent traversability between nodes. For close-in-time images, where scene changes are minor, place recognition can be successfully managed with the…

    Submitted 21 November, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Under review. ICRA 2024

  7. arXiv:2209.03693  [pdf, other]

    cs.RO

    ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology

    Authors: Julio A. Placed, Juan J. Gómez Rodríguez, Juan D. Tardós, José A. Castellanos

    Abstract: Deploying autonomous robots capable of exploring unknown environments has long been a topic of great relevance to the robotics community. In this work, we take a further step in that direction by presenting an open-source active visual SLAM framework that leverages the accuracy of a state-of-the-art graph-SLAM system and takes advantage of the fast utility computation that exploiting the structure…

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 12 pages. To be presented in 5th Iberian Robotics Conference

  8. EndoMapper dataset of complete calibrated endoscopy procedures

    Authors: Pablo Azagra, Carlos Sostres, Ángel Ferrandez, Luis Riazuelo, Clara Tomasini, Oscar León Barbed, Javier Morlana, David Recasens, Victor M. Batlle, Juan J. Gómez-Rodríguez, Richard Elvira, Julia López, Cristina Oriol, Javier Civera, Juan D. Tardós, Ana Cristina Murillo, Angel Lanas, José M. M. Montiel

    Abstract: Computer-assisted systems are becoming broadly used in medicine. In endoscopy, most research focuses on the automatic detection of polyps or other pathologies, but localization and navigation of the endoscope are completely performed manually by physicians. To broaden this research and bring spatial Artificial Intelligence to endoscopies, data from complete procedures is needed. This paper introdu…

    Submitted 10 October, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 17 pages, 14 figures, 8 tables

    Journal ref: Sci Data 10, 671 (2023)

  9. arXiv:2204.09951  [pdf, other]

    cs.DS

    Motif Cut Sparsifiers

    Authors: Michael Kapralov, Mikhail Makarov, Sandeep Silwal, Christian Sohler, Jakab Tardos

    Abstract: A motif is a frequently occurring subgraph of a given directed or undirected graph $G$. Motifs capture higher order organizational structure of $G$ beyond edge relationships, and, therefore, have found wide applications such as in graph clustering, community detection, and analysis of biological and physical networks to name a few. In these applications, the cut structure of motifs plays a crucial…

    Submitted 12 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: 48 pages, 3 figures

  10. arXiv:2204.09083  [pdf, other]

    cs.CV cs.RO eess.IV

    Photometric single-view dense 3D reconstruction in endoscopy

    Authors: Victor M. Batlle, J. M. M. Montiel, Juan D. Tardos

    Abstract: Visual SLAM inside the human body will open the way to computer-assisted navigation in endoscopy. However, due to space limitations, medical endoscopes only provide monocular images, leading to systems lacking true scale. In this paper, we exploit the controlled lighting in colonoscopy to achieve the first in-vivo 3D reconstruction of the human colon using photometric stereo on a calibrated monocu…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: 7 pages, 7 figures, submitted to IROS 2022

  11. arXiv:2204.08309  [pdf, other]

    cs.CV

    Tracking monocular camera pose and deformation for SLAM inside the human body

    Authors: Juan J. Gomez Rodriguez, J. M. M Montiel, Juan D. Tardos

    Abstract: Monocular SLAM in deformable scenes will open the way to multiple medical applications like computer-assisted navigation in endoscopy, automatic drug delivery or autonomous robotic surgery. In this paper we propose a novel method to simultaneously track the camera pose and the 3D scene deformation, without any assumption about environment topology or shape. The method uses an illumination-invarian…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 8 pages, 3 figures, submitted to IROS 2022

  12. arXiv:2112.00655  [pdf, ps, other]

    cs.DC cs.DS cs.LG

    Efficient and Local Parallel Random Walks

    Authors: Michael Kapralov, Silvio Lattanzi, Navid Nouri, Jakab Tardos

    Abstract: Random walks are a fundamental primitive used in many machine learning algorithms with several applications in clustering and semi-supervised learning. Despite their relevance, the first efficient parallel algorithm to compute random walks has been introduced very recently (Lacki et al.). Unfortunately their method has a fundamental shortcoming: their algorithm is non-local in that it heavily reli…

    Submitted 1 December, 2021; originally announced December 2021.

  13. arXiv:2109.10077  [pdf, other]

    cs.RO cs.CV

    Scale-aware direct monocular odometry

    Authors: Carlos Campos, Juan D. Tardós

    Abstract: We present a generic framework for scale-aware direct monocular odometry based on depth prediction from a deep neural network. In contrast with previous methods where depth information is only partially exploited, we formulate a novel depth prediction residual which allows us to incorporate multi-view depth information. In addition, we propose to use a truncated robust cost function which prevents…

    Submitted 22 July, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: This paper has been accepted for publication in the IROS2022 conference

  14. arXiv:2109.07370  [pdf, other]

    cs.CV

    Direct and Sparse Deformable Tracking

    Authors: Jose Lamarca, Juan J. Gomez Rodriguez, Juan D. Tardos, J. M. M. Montiel

    Abstract: Deformable Monocular SLAM algorithms recover the localization of a camera in an unknown deformable environment. Current approaches use a template-based deformable tracking to recover the camera pose and the deformation of the map. These template-based methods use an underlying global deformation model. In this paper, we introduce a novel deformable camera tracking method with a local deformation m…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: 8 pages, 5 figures, submitted to RAL with ICRA

  15. arXiv:2107.02578  [pdf, ps, other]

    cs.DS

    Noisy Boolean Hidden Matching with Applications

    Authors: Michael Kapralov, Amulya Musipatla, Jakab Tardos, David P. Woodruff, Samson Zhou

    Abstract: The Boolean Hidden Matching (BHM) problem, introduced in a seminal paper of Gavinsky et al. [STOC'07], has played an important role in the streaming lower bounds for graph problems such as triangle and subgraph counting, maximum matching, MAX-CUT, Schatten $p$-norm approximation, maximum acyclic subgraph, testing bipartiteness, $k$-connectivity, and cycle-freeness. The one-way communication compl…

    Submitted 28 January, 2022; v1 submitted 6 July, 2021; originally announced July 2021.

  16. arXiv:2106.04805  [pdf, other]

    stat.ML cs.LG cs.SI math.PR

    Streaming Belief Propagation for Community Detection

    Authors: Yuchen Wu, MohammadHossein Bateni, Andre Linhares, Filipe Miguel Goncalves de Almeida, Andrea Montanari, Ashkan Norouzi-Fard, Jakab Tardos

    Abstract: The community detection problem requires clustering the nodes of a network into a small number of well-connected "communities". There has been substantial recent progress in characterizing the fundamental statistical limits of community detection under simple stochastic block models. However, in real-world applications, the network structure is typically dynamic, with nodes that join over time. In…

    Submitted 10 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 36 pages, 13 figures

  17. arXiv:2106.02353  [pdf, ps, other]

    cs.DS

    Spectral Hypergraph Sparsifiers of Nearly Linear Size

    Authors: Michael Kapralov, Robert Krauthgamer, Jakab Tardos, Yuichi Yoshida

    Abstract: Graph sparsification has been studied extensively over the past two decades, culminating in spectral sparsifiers of optimal size (up to constant factors). Spectral hypergraph sparsification is a natural analogue of this problem, for which optimal bounds on the sparsifier size are not known, mainly because the hypergraph Laplacian is non-linear, and thus lacks the linear-algebraic structure and too…

    Submitted 4 June, 2021; originally announced June 2021.

  18. arXiv:2011.06530  [pdf, ps, other]

    cs.DS

    Towards Tight Bounds for Spectral Sparsification of Hypergraphs

    Authors: Michael Kapralov, Robert Krauthgamer, Jakab Tardos, Yuichi Yoshida

    Abstract: Cut and spectral sparsification of graphs have numerous applications, including e.g. speeding up algorithms for cuts and Laplacian solvers. These powerful notions have recently been extended to hypergraphs, which are much richer and may offer new applications. However, the current bounds on the size of hypergraph sparsifiers are not as tight as the corresponding bounds for graphs. Our first resu…

    Submitted 12 April, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

  19. arXiv:2011.06481  [pdf, ps, other]

    cs.DS

    Communication Efficient Coresets for Maximum Matching

    Authors: Michael Kapralov, Gilbert Maystre, Jakab Tardos

    Abstract: In this paper we revisit the problem of constructing randomized composable coresets for bipartite matching. In this problem the input graph is randomly partitioned across $k$ players, each of which sends a single message to a coordinator, who then must output a good approximation to the maximum matching in the input graph. Assadi and Khanna gave the first such coreset, achieving a $1/9$-approximat…

    Submitted 12 November, 2020; originally announced November 2020.

  20. arXiv:2010.09409  [pdf, other]

    cs.CV

    SD-DefSLAM: Semi-Direct Monocular SLAM for Deformable and Intracorporeal Scenes

    Authors: Juan J. Gómez Rodríguez, José Lamarca, Javier Morlana, Juan D. Tardós, José M. M. Montiel

    Abstract: Conventional SLAM techniques strongly rely on scene rigidity to solve data association, ignoring dynamic parts of the scene. In this work we present Semi-Direct DefSLAM (SD-DefSLAM), a novel monocular deformable SLAM method able to map highly deforming environments, built on top of DefSLAM. To robustly solve data association in challenging deforming scenes, SD-DefSLAM combines direct and indirect…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: 10 pages, 8 figures. Submitted to RA-L with option to ICRA 2021. Associated video: https://youtu.be/gkcC0IR3X6A

    ACM Class: I.4.5; I.4.6; I.4.8

  21. arXiv:2010.07820  [pdf, other]

    cs.RO cs.CV

    DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM

    Authors: Berta Bescos, Carlos Campos, Juan D. Tardós, José Neira

    Abstract: The assumption of scene rigidity is common in visual SLAM algorithms. However, it limits their applicability in populated real-world environments. Furthermore, most scenarios including autonomous driving, multi-robot collaboration and augmented/virtual reality, require explicit motion information of the surroundings to help with decision making and scene understanding. We present in this paper Dyn…

    Submitted 15 October, 2020; originally announced October 2020.

  22. arXiv:2010.07431  [pdf, other]

    cs.LG cs.DS

    Fairness in Streaming Submodular Maximization: Algorithms and Hardness

    Authors: Marwa El Halabi, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski

    Abstract: Submodular maximization has become established as the method of choice for the task of selecting representative and diverse summaries of data. However, if datapoints have sensitive attributes such as gender or age, such machine learning algorithms, left unchecked, are known to exhibit bias: under- or over-representation of particular groups. This has made the design of fair machine learning algori…

    Submitted 18 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS 2020

  23. ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM

    Authors: Carlos Campos, Richard Elvira, Juan J. Gómez Rodríguez, José M. M. Montiel, Juan D. Tardós

    Abstract: This paper presents ORB-SLAM3, the first system able to perform visual, visual-inertial and multi-map SLAM with monocular, stereo and RGB-D cameras, using pin-hole and fisheye lens models. The first main novelty is a feature-based tightly-integrated visual-inertial SLAM system that fully relies on Maximum-a-Posteriori (MAP) estimation, even during the IMU initialization phase. The result is a syst…

    Submitted 23 April, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

  24. arXiv:2003.05766  [pdf, other]

    cs.RO

    Inertial-Only Optimization for Visual-Inertial Initialization

    Authors: Carlos Campos, José M. M. Montiel, Juan D. Tardós

    Abstract: We formulate for the first time visual-inertial initialization as an optimal estimation problem, in the sense of maximum-a-posteriori (MAP) estimation. This allows us to properly take into account IMU measurement uncertainty, which was neglected in previous methods that either solved sets of algebraic equations, or minimized ad-hoc cost functions using least squares. Our exhaustive initialization…

    Submitted 12 March, 2020; originally announced March 2020.

    Comments: 2020 International Conference on Robotics and Automation

  25. arXiv:1908.11585  [pdf, other]

    cs.CV

    ORBSLAM-Atlas: a robust and accurate multi-map system

    Authors: Richard Elvira, Juan D. Tardós, J. M. M. Montiel

    Abstract: We propose ORBSLAM-Atlas, a system able to handle an unlimited number of disconnected sub-maps, that includes a robust map merging algorithm able to detect sub-maps with common regions and seamlessly fuse them. The outstanding robustness and accuracy of ORBSLAM are due to its ability to detect wide-baseline matches between keyframes, and to exploit them by means of non-linear optimization, however…

    Submitted 30 August, 2019; originally announced August 2019.

    Comments: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  26. Fast and Robust Initialization for Visual-Inertial SLAM

    Authors: Carlos Campos, J. M. M. Montiel, Juan D. Tardós

    Abstract: Visual-inertial SLAM (VI-SLAM) requires a good initial estimation of the initial velocity, orientation with respect to gravity and gyroscope and accelerometer biases. In this paper we build on the initialization method proposed by Martinelli and extended by Kaiser et al., modifying it to be more general and efficient. We improve accuracy with several rounds of visual-inertial bundle adjustment, a…

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: 2019 International Conference on Robotics and Automation

    Journal ref: C. Campos, J. M. M. Montiel and J. D. Tardós, "Fast and Robust Initialization for Visual-Inertial SLAM," 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 2019, pp. 1288-1294

  27. arXiv:1907.05725  [pdf, other]

    cs.DS

    Space Efficient Approximation to Maximum Matching Size from Uniform Edge Samples

    Authors: Michael Kapralov, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakab Tardos

    Abstract: Given a source of iid samples of edges of an input graph $G$ with $n$ vertices and $m$ edges, how many samples does one need to compute a constant factor approximation to the maximum matching size in $G$? Moreover, is it possible to obtain such an estimate in a small amount of space? We show that, on the one hand, this problem cannot be solved using a nontrivially sublinear (in $m$) number of samp…

    Submitted 12 July, 2019; originally announced July 2019.

  28. arXiv:1903.12150  [pdf, ps, other]

    cs.DS

    Dynamic Streaming Spectral Sparsification in Nearly Linear Time and Space

    Authors: Michael Kapralov, Navid Nouri, Aaron Sidford, Jakab Tardos

    Abstract: In this paper we consider the problem of computing spectral approximations to graphs in the single pass dynamic streaming model. We provide a linear sketching based solution that given a stream of edge insertions and deletions to an $n$-node undirected graph, uses $\tilde O(n)$ space, processes each update in $\tilde O(1)$ time, and with high probability recovers a spectral sparsifier in…

    Submitted 28 March, 2019; originally announced March 2019.

  29. arXiv:1801.10043  [pdf, other]

    cs.RO

    Simultaneous Deployment and Tracking Multi-Robot Strategies with Connectivity Maintenance

    Authors: Javier Tardós, Rosario Aragues, Carlos Sagüés, Carlos Rubio

    Abstract: Multi-robot teams composed of ground and aerial vehicles have gained attention in recent years. We present a scenario where both types of robots must monitor the same area from different viewpoints. In this paper we propose two Lloyd-based tracking strategies to allow the ground robots (agents) to follow the aerial ones (targets), keeping the connectivity between the agents. The first strategy…

    Submitted 19 March, 2018; v1 submitted 30 January, 2018; originally announced January 2018.

  30. ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras

    Authors: Raul Mur-Artal, Juan D. Tardos

    Abstract: We present ORB-SLAM2, a complete SLAM system for monocular, stereo and RGB-D cameras, including map reuse, loop closing and relocalization capabilities. The system works in real-time on standard CPUs in a wide variety of environments from small hand-held indoor sequences, to drones flying in industrial environments and cars driving around a city. Our back-end based on bundle adjustment with monocu…

    Submitted 19 June, 2017; v1 submitted 20 October, 2016; originally announced October 2016.

    Comments: Accepted for publication in IEEE Transactions on Robotics

  31. Visual-Inertial Monocular SLAM with Map Reuse

    Authors: Raul Mur-Artal, Juan D. Tardos

    Abstract: In recent years there have been excellent results in Visual-Inertial Odometry techniques, which aim to compute the incremental motion of the sensor with high accuracy and robustness. However these approaches lack the capability to close loops, and trajectory estimation accumulates drift even if the sensor is continually revisiting the same place. In this work we present a novel tightly-coupled Vis…

    Submitted 17 January, 2017; v1 submitted 19 October, 2016; originally announced October 2016.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters

  32. arXiv:1504.02398  [pdf, other]

    cs.RO cs.CV

    Real-time Monocular Object SLAM

    Authors: Dorian Gálvez-López, Marta Salas, Juan D. Tardós, J. M. M. Montiel

    Abstract: We present a real-time object-based SLAM system that leverages the largest object database to date. Our approach comprises two main components: 1) a monocular SLAM algorithm that exploits object rigidity constraints to improve the map and find its real scale, and 2) a novel object recognition algorithm based on bags of binary words, which provides live detections with a database of 500 3D objects.…

    Submitted 9 April, 2015; originally announced April 2015.

  33. ORB-SLAM: a Versatile and Accurate Monocular SLAM System

    Authors: Raul Mur-Artal, J. M. M. Montiel, Juan D. Tardos

    Abstract: This paper presents ORB-SLAM, a feature-based monocular SLAM system that operates in real time, in small and large, indoor and outdoor environments. The system is robust to severe motion clutter, allows wide baseline loop closing and relocalization, and includes full automatic initialization. Building on excellent algorithms of recent years, we designed from scratch a novel system that uses the sa…

    Submitted 18 September, 2015; v1 submitted 3 February, 2015; originally announced February 2015.

    Comments: 17 pages. 13 figures. IEEE Transactions on Robotics, 2015. Project webpage (videos, code): http://webdiis.unizar.es/~raulmur/orbslam/