Skip to main content

Showing 1–50 of 96 results for author: Carlone, L

.
  1. arXiv:2407.06141  [pdf, other

    cs.CV

    CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators

    Authors: Harry Zhang, Luca Carlone

    Abstract: We introduce CHAMP, a novel method for learning sequence-to-sequence, multi-hypothesis 3D human poses from 2D keypoints by leveraging a conditional distribution with a diffusion model. To predict a single output 3D pose sequence, we generate and aggregate multiple 3D pose hypotheses. For better aggregation results, we develop a method to score these hypotheses during training, effectively integrat… ▽ More

    Submitted 26 May, 2024; originally announced July 2024.

  2. arXiv:2406.16837  [pdf, other

    cs.RO

    A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking

    Authors: Lorenzo Shaikewitz, Samuel Ubellacker, Luca Carlone

    Abstract: Applications from manipulation to autonomous vehicles rely on robust and general object tracking to safely perform tasks in dynamic environments. We propose the first certifiably optimal category-level approach for simultaneous shape estimation and pose tracking of an object of known category (e.g. a car). Our approach uses 3D semantic keypoint measurements extracted from an RGB-D image sequence,… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 11 pages, 6 figures (with appendix). Code released at https://github.com/MIT-SPARK/certifiable_tracking. Video available at https://youtu.be/eTIlVD9pDtc

  3. arXiv:2405.21066  [pdf, other

    cs.CV

    Mixed Diffusion for 3D Indoor Scene Synthesis

    Authors: Siyi Hu, Diego Martin Arroyo, Stephanie Debats, Fabian Manhardt, Luca Carlone, Federico Tombari

    Abstract: Realistic conditional 3D scene synthesis significantly enhances and accelerates the creation of virtual environments, which can also provide extensive training data for computer vision and robotics research among other applications. Diffusion models have shown great performance in related applications, e.g., making precise arrangements of unordered sets. However, these models have not been fully e… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 19 pages, 14 figures. Under review. Code to be released at: https://github.com/MIT-SPARK/MiDiffusion

  4. arXiv:2405.00552  [pdf, other

    cs.RO cs.HC

    Long-Term Human Trajectory Prediction using 3D Dynamic Scene Graphs

    Authors: Nicolas Gorlo, Lukas Schmid, Luca Carlone

    Abstract: We present a novel approach for long-term human trajectory prediction, which is essential for long-horizon robot planning in human-populated environments. State-of-the-art human trajectory prediction methods are limited by their focus on collision avoidance and short-term planning, and their inability to model complex interactions of humans with the environment. In contrast, our approach overcomes… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 8 pages, 6 figures. Code to be released at: https://github.com/MIT-SPARK/LP2

  5. arXiv:2404.13696  [pdf, other

    cs.RO

    Clio: Real-time Task-Driven Open-Set 3D Scene Graphs

    Authors: Dominic Maggio, Yun Chang, Nathan Hughes, Matthew Trang, Dan Griffith, Carlyn Dougherty, Eric Cristofalo, Lukas Schmid, Luca Carlone

    Abstract: Modern tools for class-agnostic image segmentation (e.g., SegmentAnything) and open-set semantic understanding (e.g., CLIP) provide unprecedented opportunities for robot perception and map**. While traditional closed-set metric-semantic maps were restricted to tens or hundreds of semantic classes, we can now build maps with a plethora of objects and countless semantic variations. This leaves us… ▽ More

    Submitted 29 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  6. arXiv:2403.08094  [pdf, other

    cs.RO

    Task and Motion Planning in Hierarchical 3D Scene Graphs

    Authors: Aaron Ray, Christopher Bradley, Luca Carlone, Nicholas Roy

    Abstract: Recent work in the construction of 3D scene graphs has enabled mobile robots to build large-scale hybrid metric-semantic hierarchical representations of the world. These detailed models contain information that is useful for planning, however how to derive a planning domain from a 3D scene graph that enables efficient computation of executable plans is an open question. In this work, we present a… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    MSC Class: 68T40; 68T20 ACM Class: I.2.9; I.2.4; I.2.8

  7. arXiv:2403.04712  [pdf, other

    cs.RO eess.SY

    GMKF: Generalized Moment Kalman Filter for Polynomial Systems with Arbitrary Noise

    Authors: Sangli Teng, Harry Zhang, David **, Ashkan Jasour, Maani Ghaffari, Luca Carlone

    Abstract: This paper develops a new filtering approach for state estimation in polynomial systems corrupted by arbitrary noise, which commonly arise in robotics. We first consider a batch setup where we perform state estimation using all data collected from the initial to the current time. We formulate the batch state estimation problem as a Polynomial Optimization Problem (POP) and relax the assumption of… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  8. arXiv:2402.13817  [pdf, other

    cs.RO

    Khronos: A Unified Approach for Spatio-Temporal Metric-Semantic SLAM in Dynamic Environments

    Authors: Lukas Schmid, Marcus Abate, Yun Chang, Luca Carlone

    Abstract: Perceiving and understanding highly dynamic and changing environments is a crucial capability for robot autonomy. While large strides have been made towards develo** dynamic SLAM approaches that estimate the robot pose accurately, a lesser emphasis has been put on the construction of dense spatio-temporal representations of the robot environment. A detailed understanding of the scene and its evo… ▽ More

    Submitted 20 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to Robotics: Science and Systems, 2024. Code to be released at https://github.com/MIT-SPARK/Khronos Video available at https://www.youtube.com/watch?v=YsH6YIL5_kc

  9. arXiv:2402.10865  [pdf, other

    cs.RO cs.CV

    Multi-Model 3D Registration: Finding Multiple Moving Objects in Cluttered Point Clouds

    Authors: David **, Sushrut Karmalkar, Harry Zhang, Luca Carlone

    Abstract: We investigate a variation of the 3D registration problem, named multi-model 3D registration. In the multi-model registration problem, we are given two point clouds picturing a set of objects at different poses (and possibly including points belonging to the background) and we want to simultaneously reconstruct how all objects moved between the two point clouds. This setup generalizes standard 3D… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 8 pages, Accepted by ICRA 2024

  10. arXiv:2401.06323  [pdf, other

    cs.RO

    Kimera2: Robust and Accurate Metric-Semantic SLAM in the Real World

    Authors: Marcus Abate, Yun Chang, Nathan Hughes, Luca Carlone

    Abstract: We present improvements to Kimera, an open-source metric-semantic visual-inertial SLAM library. In particular, we enhance Kimera-VIO, the visual-inertial odometry pipeline powering Kimera, to support better feature tracking, more efficient keyframe selection, and various input modalities (eg monocular, stereo, and RGB-D images, as well as wheel odometry). Additionally, Kimera-RPGO and Kimera-PGMO,… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Presented at ISER 2023

  11. arXiv:2312.11713  [pdf, other

    cs.RO cs.AI

    Indoor and Outdoor 3D Scene Graph Generation via Language-Enabled Spatial Ontologies

    Authors: Jared Strader, Nathan Hughes, William Chen, Alberto Speranzon, Luca Carlone

    Abstract: This paper proposes an approach to build 3D scene graphs in arbitrary indoor and outdoor environments. Such extension is challenging; the hierarchy of concepts that describe an outdoor environment is more complex than for indoors, and manually defining such hierarchy is time-consuming and does not scale. Furthermore, the lack of training data prevents the straightforward application of learning-ba… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 10 pages, 6 figures, accepted to Robotics and Automation Letters

  12. arXiv:2309.13035  [pdf, other

    cs.RO

    PyPose v0.6: The Imperative Programming Interface for Robotics

    Authors: Zitong Zhan, Xiangfu Li, Qihang Li, Haonan He, Abhinav Pandey, Haitao Xiao, Yangmengfei Xu, Xiangyu Chen, Kuan Xu, Kun Cao, Zhipeng Zhao, Zihan Wang, Huan Xu, Zihang Fang, Yutian Chen, Wentao Wang, Xu Fang, Yi Du, Tianhao Wu, Xiao Lin, Yuheng Qiu, Fan Yang, **gnan Shi, Shaoshu Su, Yiren Lu , et al. (11 additional authors not shown)

    Abstract: PyPose is an open-source library for robot learning. It combines a learning-based approach with physics-based optimization, which enables seamless end-to-end robot learning. It has been used in many tasks due to its meticulously designed application programming interface (API) and efficient implementation. From its initial launch in early 2022, PyPose has experienced significant enhancements, inco… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  13. arXiv:2308.06351  [pdf, other

    cs.RO

    Aggressive Aerial Gras** using a Soft Drone with Onboard Perception

    Authors: Samuel Ubellacker, Aaron Ray, James Bern, Jared Strader, Luca Carlone

    Abstract: Contrary to the stunning feats observed in birds of prey, aerial manipulation and gras** with flying robots still lack versatility and agility. Conventional approaches using rigid manipulators require precise positioning and are subject to large reaction forces at grasp, which limit performance at high speeds. The few reported examples of aggressive aerial gras** rely on motion capture systems… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    MSC Class: 68T40; 70B15; 70E60; 74Pxx; 65D19 ACM Class: I.2.9; G.1.6; I.5.2; I.4.5

  14. arXiv:2308.05939  [pdf, other

    cs.RO

    VERF: Runtime Monitoring of Pose Estimation with Neural Radiance Fields

    Authors: Dominic Maggio, Courtney Mario, Luca Carlone

    Abstract: We present VERF, a collection of two methods (VERF-PnP and VERF-Light) for providing runtime assurance on the correctness of a camera pose estimate of a monocular camera without relying on direct depth measurements. We leverage the ability of NeRF (Neural Radiance Fields) to render novel RGB perspectives of a scene. We only require as input the camera image whose pose is being estimated, an estima… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

  15. arXiv:2305.07154  [pdf, other

    cs.RO

    Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems

    Authors: Nathan Hughes, Yun Chang, Siyi Hu, Rajat Talak, Rumaisa Abdulhai, Jared Strader, Luca Carlone

    Abstract: 3D spatial perception is the problem of building and maintaining an actionable and persistent representation of the environment in real-time using sensor data and prior knowledge. Despite the fast-paced progress in robot perception, most existing methods either build purely geometric maps (as in traditional SLAM) or flat metric-semantic maps that do not scale to large environments or large diction… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 38 pages, 19 figures; submitted to International Journal of Robotics Research

  16. arXiv:2305.01870  [pdf, other

    cs.RO

    Task-Aware Risk Estimation of Perception Failures for Autonomous Vehicles

    Authors: Pasquale Antonante, Sushant Veer, Karen Leung, Xinshuo Weng, Luca Carlone, Marco Pavone

    Abstract: Safety and performance are key enablers for autonomous driving: on the one hand we want our autonomous vehicles (AVs) to be safe, while at the same time their performance (e.g., comfort or progression) is key to adoption. To effectively walk the tight-rope between safety and performance, AVs need to be risk-averse, but not entirely risk-avoidant. To facilitate safe-yet-performant driving, in this… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  17. arXiv:2304.13487  [pdf, other

    cs.RO

    Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams

    Authors: Yun Chang, Nathan Hughes, Aaron Ray, Luca Carlone

    Abstract: 3D scene graphs have recently emerged as an expressive high-level map representation that describes a 3D environment as a layered graph where nodes represent spatial concepts at multiple levels of abstraction (e.g., objects, rooms, buildings) and edges represent relations between concepts (e.g., inclusion, adjacency). This paper describes Hydra-Multi, the first multi-robot spatial perception syste… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 8 pages, 10 figures

  18. arXiv:2304.13182  [pdf, other

    cs.RO

    Multi-Camera Visual-Inertial Simultaneous Localization and Map** for Autonomous Valet Parking

    Authors: Marcus Abate, Ariel Schwartz, Xue Iuan Wong, Wangdong Luo, Rotem Littman, Marc Klinger, Lars Kuhnert, Douglas Blue, Luca Carlone

    Abstract: Localization and map** are key capabilities for self-driving vehicles. In this paper, we build on Kimera and extend it to use multiple cameras as well as external (eg wheel) odometry sensors, to obtain accurate and robust odometry estimates in real-world problems. Additionally, we propose an effective scheme for closing loops that circumvents the drawbacks of common alternatives based on the Per… ▽ More

    Submitted 11 January, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

  19. arXiv:2304.04362  [pdf, other

    cs.RO

    Resilient and Distributed Multi-Robot Visual SLAM: Datasets, Experiments, and Lessons Learned

    Authors: Yulun Tian, Yun Chang, Long Quang, Arthur Schang, Carlos Nieto-Granda, Jonathan P. How, Luca Carlone

    Abstract: This paper revisits Kimera-Multi, a distributed multi-robot Simultaneous Localization and Map** (SLAM) system, towards the goal of deployment in the real world. In particular, this paper has three main contributions. First, we describe improvements to Kimera-Multi to make it resilient to large-scale real-world deployments, with particular emphasis on handling intermittent and unreliable communic… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: 8 pages, 9 figures

  20. arXiv:2302.13264  [pdf, other

    cs.RO

    Data-Association-Free Landmark-based SLAM

    Authors: Yihao Zhang, Odin A. Severinsen, John J. Leonard, Luca Carlone, Kasra Khosoussi

    Abstract: We study landmark-based SLAM with unknown data association: our robot navigates in a completely unknown environment and has to simultaneously reason over its own trajectory, the positions of an unknown number of landmarks in the environment, and potential data associations between measurements and landmarks. This setup is interesting since: (i) it arises when recovering from data association failu… ▽ More

    Submitted 4 May, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

    Comments: Accepted at ICRA 2023. Correcting a typo (missing parentheses) in eq. (1) and following equations

  21. arXiv:2302.08011  [pdf, other

    cs.CV

    Vision-Based Terrain Relative Navigation on High-Altitude Balloon and Sub-Orbital Rocket

    Authors: Dominic Maggio, Courtney Mario, Brett Streetman, Ted Steiner, Luca Carlone

    Abstract: We present an experimental analysis on the use of a camera-based approach for high-altitude navigation by associating mapped landmarks from a satellite image database to camera images, and by leveraging inertial sensors between camera frames. We evaluate performance of both a sideways-tilted and downward-facing camera on data collected from a World View Enterprises high-altitude balloon with data… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: Published in 2023 AIAA SciTech

    Journal ref: 2023 AIAA SciTech

  22. arXiv:2302.06019  [pdf, other

    cs.CV cs.LG cs.RO

    A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training

    Authors: **gnan Shi, Rajat Talak, Dominic Maggio, Luca Carlone

    Abstract: Real-world robotics applications demand object pose estimation methods that work reliably across a variety of scenarios. Modern learning-based approaches require large labeled datasets and tend to perform poorly outside the training domain. Our first contribution is to develop a robust corrector module that corrects pose estimates using depth information, thus enabling existing methods to better g… ▽ More

    Submitted 11 May, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

  23. arXiv:2210.13641  [pdf, other

    cs.CV

    NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields

    Authors: Antoni Rosinol, John J. Leonard, Luca Carlone

    Abstract: We propose a novel geometric and photometric 3D map** pipeline for accurate and real-time scene reconstruction from monocular images. To achieve this, we leverage recent advances in dense monocular SLAM and real-time hierarchical volumetric neural radiance fields. Our insight is that dense monocular SLAM provides the right information to fit a neural radiance field of the scene in real-time, by… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures

  24. arXiv:2210.01276  [pdf, other

    cs.CV

    Probabilistic Volumetric Fusion for Dense Monocular SLAM

    Authors: Antoni Rosinol, John J. Leonard, Luca Carlone

    Abstract: We present a novel method to reconstruct 3D scenes from images by leveraging deep dense monocular SLAM and fast uncertainty propagation. The proposed approach is able to 3D reconstruct scenes densely, accurately, and in real-time while being robust to extremely noisy depth estimates coming from dense monocular SLAM. Differently from previous approaches, that either use ad-hoc depth filters, or tha… ▽ More

    Submitted 16 October, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 9 pages, 6 figures, 2 tables

  25. arXiv:2209.15428  [pdf, other

    cs.RO

    PyPose: A Library for Robot Learning with Physics-based Optimization

    Authors: Chen Wang, Dasong Gao, Kuan Xu, Junyi Geng, Yaoyu Hu, Yuheng Qiu, Bowen Li, Fan Yang, Brady Moon, Abhinav Pandey, Aryan, Jiahe Xu, Tianhao Wu, Haonan He, Daning Huang, Zhongqiang Ren, Shibo Zhao, Taimeng Fu, Pranay Reddy, Xiao Lin, Wenshan Wang, **gnan Shi, Rajat Talak, Kun Cao, Yi Du , et al. (12 additional authors not shown)

    Abstract: Deep learning has had remarkable success in robotic perception, but its data-centric nature suffers when it comes to generalizing to ever-changing environments. By contrast, physics-based optimization generalizes better, but it does not perform as well in complicated tasks due to the lack of high-level semantic information and reliance on manual parametric tuning. To take advantage of these two co… ▽ More

    Submitted 24 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Project Website: https://pypose.org Documentation: https://pypose.org/docs/ Tutorial: https://pypose.org/tutorials/ Source code: https://github.com/pypose/pypose

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

  26. arXiv:2209.09050  [pdf, other

    cs.RO

    Loc-NeRF: Monte Carlo Localization using Neural Radiance Fields

    Authors: Dominic Maggio, Marcus Abate, **gnan Shi, Courtney Mario, Luca Carlone

    Abstract: We present Loc-NeRF, a real-time vision-based robot localization approach that combines Monte Carlo localization and Neural Radiance Fields (NeRF). Our system uses a pre-trained NeRF model as the map of an environment and can localize itself in real-time using an RGB camera as the only exteroceptive sensor onboard the robot. While neural radiance fields have seen significant applications for visua… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  27. arXiv:2209.06111  [pdf, other

    cs.RO cs.DM cs.MA

    D-Lite: Navigation-Oriented Compression of 3D Scene Graphs for Multi-Robot Collaboration

    Authors: Yun Chang, Luca Ballotta, Luca Carlone

    Abstract: For a multi-robot team that collaboratively explores an unknown environment, it is of vital importance that collected information is efficiently shared among robots in order to support exploration and navigation tasks. Practical constraints of wireless channels, such as limited bandwidth, urge robots to carefully select information to be transmitted. In this paper, we consider the case where envir… ▽ More

    Submitted 23 September, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: 18 pages, 18 figures; accepted at IEEE RA-L 2023

    MSC Class: 05C85 (Primary); 65D19; 91B32 (Secondary) ACM Class: I.2.9; I.2.10; I.2.11; G.2.2; C.2.4

  28. arXiv:2209.05629  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    Leveraging Large (Visual) Language Models for Robot 3D Scene Understanding

    Authors: William Chen, Siyi Hu, Rajat Talak, Luca Carlone

    Abstract: Abstract semantic 3D scene understanding is a problem of critical importance in robotics. As robots still lack the common-sense knowledge about household objects and locations of an average human, we investigate the use of pre-trained language models to impart common sense for scene understanding. We introduce and compare a wide range of scene classification paradigms that leverage language only (… ▽ More

    Submitted 8 November, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2206.04585

  29. arXiv:2208.10521  [pdf, other

    stat.ML cs.CV cs.DS cs.LG cs.RO

    Estimation Contracts for Outlier-Robust Geometric Perception

    Authors: Luca Carlone

    Abstract: Outlier-robust estimation is a fundamental problem and has been extensively investigated by statisticians and practitioners. The last few years have seen a convergence across research fields towards "algorithmic robust statistics", which focuses on develo** tractable outlier-robust techniques for high-dimensional estimation problems. Despite this convergence, research efforts across fields have… ▽ More

    Submitted 16 December, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: 95 pages, 12 figures

    MSC Class: 68T40; 74Pxx; 46N10; 65D19 ACM Class: I.2.9; G.1.6; I.4.5

  30. arXiv:2208.01787  [pdf, other

    cs.RO

    Present and Future of SLAM in Extreme Underground Environments

    Authors: Kamak Ebadi, Lukas Bernreiter, Harel Biggie, Gavin Catt, Yun Chang, Arghya Chatterjee, Christopher E. Denniston, Simon-Pierre Deschênes, Kyle Harlow, Shehryar Khattak, Lucas Nogueira, Matteo Palieri, Pavel Petráček, Matěj Petrlík, Andrzej Reinke, Vít Krátký, Shibo Zhao, Ali-akbar Agha-mohammadi, Kostas Alexis, Christoffer Heckman, Kasra Khosoussi, Navinda Kottege, Benjamin Morrell, Marco Hutter, Fred Pauling , et al. (6 additional authors not shown)

    Abstract: This paper reports on the state of the art in underground SLAM by discussing different SLAM strategies and results across six teams that participated in the three-year-long SubT competition. In particular, the paper has four main goals. First, we review the algorithms, architectures, and systems adopted by the teams; particular emphasis is put on lidar-centric SLAM solutions (the go-to approach fo… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 21 pages including references. This survey paper is submitted to IEEE Transactions on Robotics for pre-approval

  31. arXiv:2207.00254  [pdf, ps, other

    cs.RO

    A Survey on Active Simultaneous Localization and Map**: State of the Art and New Frontiers

    Authors: Julio A. Placed, Jared Strader, Henry Carrillo, Nikolay Atanasov, Vadim Indelman, Luca Carlone, José A. Castellanos

    Abstract: Active Simultaneous Localization and Map** (SLAM) is the problem of planning and controlling the motion of a robot to build the most accurate and complete model of the surrounding environment. Since the first foundational work in active perception appeared, more than three decades ago, this field has received increasing attention across different scientific communities. This has brought about ma… ▽ More

    Submitted 13 February, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE Transactions on Robotics (T-RO). 20 pages

    Journal ref: IEEE Transactions on Robotics (T-RO), 2023

  32. Optimal and Robust Category-level Perception: Object Pose and Shape Estimation from 2D and 3D Semantic Keypoints

    Authors: **gnan Shi, Heng Yang, Luca Carlone

    Abstract: We consider a category-level perception problem, where one is given 2D or 3D sensor data picturing an object of a given category (e.g., a car), and has to reconstruct the 3D pose and shape of the object despite intra-class variability (i.e., different car models have different shapes). We consider an active shape model, where -- for an object category -- we are given a library of potential CAD mod… ▽ More

    Submitted 16 September, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2104.08383

  33. arXiv:2206.11215  [pdf, other

    cs.CV cs.LG cs.RO

    Certifiable 3D Object Pose Estimation: Foundations, Learning Models, and Self-Training

    Authors: Rajat Talak, Lisa Peng, Luca Carlone

    Abstract: We consider a certifiable object pose estimation problem, where -- given a partial point cloud of an object -- the goal is to not only estimate the object pose, but also to provide a certificate of correctness for the resulting estimate. Our first contribution is a general theory of certification for end-to-end perception models. In particular, we introduce the notion of $ζ$-correctness, which bou… ▽ More

    Submitted 28 April, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

  34. arXiv:2206.04585  [pdf, other

    cs.RO cs.CL

    Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding

    Authors: William Chen, Siyi Hu, Rajat Talak, Luca Carlone

    Abstract: Semantic 3D scene understanding is a problem of critical importance in robotics. While significant advances have been made in simultaneous localization and map** algorithms, robots are still far from having the common sense knowledge about household objects and their locations of an average human. We introduce a novel method for leveraging common sense embedded within large language models for l… ▽ More

    Submitted 18 June, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 4 pages (excluding references and appendix), 2 figures, 2 tables. Submitted to Robotics: Science and Systems 2022 2nd Workshop on Scaling Robot Learning. Corrected typos and notation

  35. arXiv:2206.00777  [pdf

    cs.RO

    Visual Navigation for Autonomous Vehicles: An Open-source Hands-on Robotics Course at MIT

    Authors: Luca Carlone, Kasra Khosoussi, Vasileios Tzoumas, Golnaz Habibi, Markus Ryll, Rajat Talak, **gnan Shi, Pasquale Antonante

    Abstract: This paper reports on the development, execution, and open-sourcing of a new robotics course at MIT. The course is a modern take on "Visual Navigation for Autonomous Vehicles" (VNAV) and targets first-year graduate students and senior undergraduates with prior exposure to robotics. VNAV has the goal of preparing the students to perform research in robotics and vision-based navigation, with emphasi… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: This paper has been accepted for publication at the IEEE Integrated STEM Education Conference

  36. arXiv:2205.13135  [pdf, other

    cs.RO cs.MA

    LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments

    Authors: Yun Chang, Kamak Ebadi, Christopher E. Denniston, Muhammad Fadhil Ginting, Antoni Rosinol, Andrzej Reinke, Matteo Palieri, **gnan Shi, Arghya Chatterjee, Benjamin Morrell, Ali-akbar Agha-mohammadi, Luca Carlone

    Abstract: Search and rescue with a team of heterogeneous mobile robots in unknown and large-scale underground environments requires high-precision localization and map**. This crucial requirement is faced with many challenges in complex and perceptually-degraded subterranean environments, as the onboard perception system is required to operate in off-nominal conditions (poor visibility due to darkness and… ▽ More

    Submitted 8 July, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  37. arXiv:2205.12402  [pdf, other

    cs.RO

    Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM

    Authors: Christopher E. Denniston, Yun Chang, Andrzej Reinke, Kamak Ebadi, Gaurav S. Sukhatme, Luca Carlone, Benjamin Morrell, Ali-akbar Agha-mohammadi

    Abstract: Multi-robot SLAM systems in GPS-denied environments require loop closures to maintain a drift-free centralized map. With an increasing number of robots and size of the environment, checking and computing the transformation for all the loop closure candidates becomes computationally infeasible. In this work, we describe a loop closure module that is able to prioritize which loop closures to compute… ▽ More

    Submitted 8 July, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 8 pages, Accepted to RA-L/IROS 2022

  38. LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Map**

    Authors: Andrzej Reinke, Matteo Palieri, Benjamin Morrell, Yun Chang, Kamak Ebadi, Luca Carlone, Ali-akbar Agha-mohammadi

    Abstract: Lidar odometry has attracted considerable attention as a robust localization method for autonomous robots operating in complex GNSS-denied environments. However, achieving reliable and efficient performance on heterogeneous platforms in large-scale environments remains an open challenge due to the limitations of onboard computation and memory resources needed for autonomous operation. In this work… ▽ More

    Submitted 13 June, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  39. arXiv:2205.10906  [pdf, other

    cs.RO cs.AI

    Monitoring of Perception Systems: Deterministic, Probabilistic, and Learning-based Fault Detection and Identification

    Authors: Pasquale Antonante, Heath Nilsen, Luca Carlone

    Abstract: This paper investigates runtime monitoring of perception systems. Perception is a critical component of high-integrity applications of robotics and autonomous systems, such as self-driving cars. In these applications, failure of perception systems may put human life at risk, and a broad adoption of these technologies requires the development of methodologies to guarantee and monitor safe operation… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

  40. arXiv:2201.13360  [pdf, other

    cs.RO

    Hydra: A Real-time Spatial Perception System for 3D Scene Graph Construction and Optimization

    Authors: Nathan Hughes, Yun Chang, Luca Carlone

    Abstract: 3D scene graphs have recently emerged as a powerful high-level representation of 3D environments. A 3D scene graph describes the environment as a layered graph where nodes represent spatial concepts at multiple levels of abstraction and edges represent relations between concepts. While 3D scene graphs can serve as an advanced "mental model" for robots, how to build such a rich representation in re… ▽ More

    Submitted 20 June, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: 13 pages, 10 figures, published in Robotics Science and Systems (RSS) 2022 proceedings

  41. arXiv:2109.12343  [pdf, other

    cs.RO cs.LG cs.MA eess.SY

    Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

    Authors: Amanda Prorok, Matthew Malencia, Luca Carlone, Gaurav S. Sukhatme, Brian M. Sadler, Vijay Kumar

    Abstract: Robustness is key to engineering, automation, and science as a whole. However, the property of robustness is often underpinned by costly requirements such as over-provisioning, known uncertainty and predictive models, and known adversaries. These conditions are idealistic, and often not satisfiable. Resilience on the other hand is the capability to endure unexpected disruptions, to recover swiftly… ▽ More

    Submitted 25 September, 2021; originally announced September 2021.

  42. arXiv:2109.03349  [pdf, other

    cs.CV cs.RO math.OC

    Certifiably Optimal Outlier-Robust Geometric Perception: Semidefinite Relaxations and Scalable Global Optimization

    Authors: Heng Yang, Luca Carlone

    Abstract: We propose the first general and scalable framework to design certifiable algorithms for robust geometric perception in the presence of outliers. Our first contribution is to show that estimation using common robust costs, such as truncated least squares (TLS), maximum consensus, Geman-McClure, Tukey's biweight, among others, can be reformulated as polynomial optimization problems (POPs). By focus… ▽ More

    Submitted 29 May, 2022; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022

  43. Computation and Communication Co-Design for Real-Time Monitoring and Control in Multi-Agent Systems

    Authors: Vishrant Tripathi, Luca Ballotta, Luca Carlone, Eytan Modiano

    Abstract: We investigate the problem of co-designing computation and communication in a multi-agent system (e.g. a sensor network or a multi-robot team). We consider the realistic setting where each agent acquires sensor data and is capable of local processing before sending updates to a base station, which is in charge of making decisions or monitoring phenomena of interest in real time. Longer processing… ▽ More

    Submitted 8 August, 2021; v1 submitted 6 August, 2021; originally announced August 2021.

    Comments: accepted at WiOpt 2021

  44. arXiv:2108.02957  [pdf, other

    cs.CV cs.RO

    Smooth Mesh Estimation from Depth Data using Non-Smooth Convex Optimization

    Authors: Antoni Rosinol, Luca Carlone

    Abstract: Meshes are commonly used as 3D maps since they encode the topology of the scene while being lightweight. Unfortunately, 3D meshes are mathematically difficult to handle directly because of their combinatorial and discrete nature. Therefore, most approaches generate 3D meshes of a scene after fusing depth data using volumetric or other representations. Nevertheless, volumetric fusion remains… ▽ More

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: 8 pages

  45. arXiv:2108.01176  [pdf, other

    cs.RO cs.AI

    Hierarchical Representations and Explicit Memory: Learning Effective Navigation Policies on 3D Scene Graphs using Graph Neural Networks

    Authors: Zachary Ravichandran, Lisa Peng, Nathan Hughes, J. Daniel Griffith, Luca Carlone

    Abstract: Representations are crucial for a robot to learn effective navigation policies. Recent work has shown that mid-level perceptual abstractions, such as depth estimates or 2D semantic segmentation, lead to more effective policies when provided as observations in place of raw sensor data (e.g., RGB images). However, such policies must still learn latent three-dimensional scene properties from mid-leve… ▽ More

    Submitted 5 May, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted by the International Conference on Robotics and Automation (ICRA) 2022

  46. arXiv:2106.14386  [pdf, other

    cs.RO cs.CV cs.MA

    Kimera-Multi: Robust, Distributed, Dense Metric-Semantic SLAM for Multi-Robot Systems

    Authors: Yulun Tian, Yun Chang, Fernando Herrera Arias, Carlos Nieto-Granda, Jonathan P. How, Luca Carlone

    Abstract: This paper presents Kimera-Multi, the first multi-robot system that (i) is robust and capable of identifying and rejecting incorrect inter and intra-robot loop closures resulting from perceptual aliasing, (ii) is fully distributed and only relies on local (peer-to-peer) communication to achieve distributed localization and map**, and (iii) builds a globally consistent metric-semantic 3D mesh mod… ▽ More

    Submitted 17 December, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted by IEEE Transactions on Robotics (18 pages, 15 figures)

  47. arXiv:2105.14033  [pdf, other

    math.OC cs.CV cs.LG

    An Inexact Projected Gradient Method with Rounding and Lifting by Nonlinear Programming for Solving Rank-One Semidefinite Relaxation of Polynomial Optimization

    Authors: Heng Yang, Ling Liang, Luca Carlone, Kim-Chuan Toh

    Abstract: We consider solving high-order semidefinite programming (SDP) relaxations of nonconvex polynomial optimization problems (POPs) that often admit degenerate rank-one optimal solutions. Instead of solving the SDP alone, we propose a new algorithmic framework that blends local search using the nonconvex POP into global descent using the convex SDP. In particular, we first design a globally convergent… ▽ More

    Submitted 26 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: Code available at https://github.com/MIT-SPARK/STRIDE

    MSC Class: 90C06; 90C22; 90C23; 90C55

  48. arXiv:2105.07264  [pdf, other

    cs.LG cs.CV cs.RO

    Neural Trees for Learning on Graphs

    Authors: Rajat Talak, Siyi Hu, Lisa Peng, Luca Carlone

    Abstract: Graph Neural Networks (GNNs) have emerged as a flexible and powerful approach for learning over graphs. Despite this success, existing GNNs are constrained by their local message-passing architecture and are provably limited in their expressive power. In this work, we propose a new GNN architecture -- the Neural Tree. The neural tree architecture does not perform message passing on the input graph… ▽ More

    Submitted 27 October, 2021; v1 submitted 15 May, 2021; originally announced May 2021.

  49. Optimal Pose and Shape Estimation for Category-level 3D Object Perception

    Authors: **gnan Shi, Heng Yang, Luca Carlone

    Abstract: We consider a category-level perception problem, where one is given 3D sensor data picturing an object of a given category (e.g. a car), and has to reconstruct the pose and shape of the object despite intra-class variability (i.e. different car models have different shapes). We consider an active shape model, where -- for an object category -- we are given a library of potential CAD models describ… ▽ More

    Submitted 16 September, 2023; v1 submitted 16 April, 2021; originally announced April 2021.

  50. arXiv:2103.11470  [pdf, other

    cs.RO cs.AI

    NeBula: Quest for Robotic Autonomy in Challenging Environments; TEAM CoSTAR at the DARPA Subterranean Challenge

    Authors: Ali Agha, Kyohei Otsu, Benjamin Morrell, David D. Fan, Rohan Thakker, Angel Santamaria-Navarro, Sung-Kyun Kim, Amanda Bouman, Xianmei Lei, Jeffrey Edlund, Muhammad Fadhil Ginting, Kamak Ebadi, Matthew Anderson, Torkom Pailevanian, Edward Terry, Michael Wolf, Andrea Tagliabue, Tiago Stegun Vaquero, Matteo Palieri, Scott Tepsuporn, Yun Chang, Arash Kalantari, Fernando Chavez, Brett Lopez, Nobuhiro Funabiki , et al. (47 additional authors not shown)

    Abstract: This paper presents and discusses algorithms, hardware, and software architecture developed by the TEAM CoSTAR (Collaborative SubTerranean Autonomous Robots), competing in the DARPA Subterranean Challenge. Specifically, it presents the techniques utilized within the Tunnel (2019) and Urban (2020) competitions, where CoSTAR achieved 2nd and 1st place, respectively. We also discuss CoSTAR's demonstr… ▽ More

    Submitted 18 October, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: For team website, see https://costar.jpl.nasa.gov/. Accepted for publication in the Journal of Field Robotics, 2021