Showing 51–100 of 154 results for author: Rus, D

  1. arXiv:2210.06650  [pdf, other]

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Interpreting Neural Policies with Disentangled Tree Representations

    Authors: Tsun-Hsuan Wang, Wei Xiao, Tim Seyde, Ramin Hasani, Daniela Rus

    Abstract: The advancement of robots, particularly those functioning in complex human-centric environments, relies on control solutions that are driven by machine learning. Understanding how learning-based controllers make decisions is crucial since robots are often safety-critical systems. This urges a formal and quantitative understanding of the explanatory factors in the interpretability of robot learning…

    Submitted 12 November, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  2. arXiv:2210.04763  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    On the Forward Invariance of Neural ODEs

    Authors: Wei Xiao, Tsun-Hsuan Wang, Ramin Hasani, Mathias Lechner, Yutong Ban, Chuang Gan, Daniela Rus

    Abstract: We propose a new method to ensure neural ordinary differential equations (ODEs) satisfy output specifications by using invariance set propagation. Our approach uses a class of control barrier functions to transform output specifications into constraints on the parameters and inputs of the learning system. This setup allows us to achieve output specification guarantees simply by changing the constr…

    Submitted 31 May, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 25 pages, accepted in ICML2023, website: https://weixy21.github.io/invariance/

  3. arXiv:2210.04728  [pdf, other]

    cs.LG

    PyHopper -- Hyperparameter optimization

    Authors: Mathias Lechner, Ramin Hasani, Philipp Neubauer, Sophie Neubauer, Daniela Rus

    Abstract: Hyperparameter tuning is a fundamental aspect of machine learning research. Setting up the infrastructure for systematic optimization of hyperparameters can take a significant amount of time. Here, we present PyHopper, a black-box optimization platform designed to streamline the hyperparameter tuning workflow of machine learning researchers. PyHopper's goal is to integrate with existing code with…

    Submitted 10 October, 2022; originally announced October 2022.

  4. arXiv:2210.04303  [pdf, other]

    cs.CV cs.AI cs.LG cs.NE cs.RO

    Are All Vision Models Created Equal? A Study of the Open-Loop to Closed-Loop Causality Gap

    Authors: Mathias Lechner, Ramin Hasani, Alexander Amini, Tsun-Hsuan Wang, Thomas A. Henzinger, Daniela Rus

    Abstract: There is an ever-growing zoo of modern neural network models that can efficiently learn end-to-end control from visual observations. These advanced deep models, ranging from convolutional to patch-based networks, have been extensively tested on offline image classification and regression tasks. In this paper, we study these vision architectures with respect to the open-loop to closed-loop causalit…

    Submitted 9 October, 2022; originally announced October 2022.

  5. arXiv:2209.12968  [pdf, ps, other]

    cs.RO cs.GT eess.SY

    Intention Communication and Hypothesis Likelihood in Game-Theoretic Motion Planning

    Authors: Makram Chahine, Roya Firoozi, Wei Xiao, Mac Schwager, Daniela Rus

    Abstract: Game-theoretic motion planners are a potent solution for controlling systems of multiple highly interactive robots. Most existing game-theoretic planners unrealistically assume a priori objective function knowledge is available to all agents. To address this, we propose a fault-tolerant receding horizon game-theoretic motion planner that leverages inter-agent communication with intention hypothesi…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.8; I.2.9; I.2.11

  6. arXiv:2209.12951  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.NE

    Liquid Structural State-Space Models

    Authors: Ramin Hasani, Mathias Lechner, Tsun-Hsuan Wang, Makram Chahine, Alexander Amini, Daniela Rus

    Abstract: A proper parametrization of state transition matrices of linear state-space models (SSMs) followed by standard nonlinearities enables them to efficiently learn representations from sequential data, establishing the state-of-the-art on a large series of long-range sequence modeling benchmarks. In this paper, we show that we can improve further when the structural SSM such as S4 is given by a linear…

    Submitted 26 September, 2022; originally announced September 2022.

  7. arXiv:2209.11064  [pdf, other]

    cs.CV cs.LG cs.RO

    Deep Learning on Home Drone: Searching for the Optimal Architecture

    Authors: Alaa Maalouf, Yotam Gurfinkel, Barak Diker, Oren Gal, Daniela Rus, Dan Feldman

    Abstract: We suggest the first system that runs real-time semantic segmentation via deep learning on a weak micro-computer such as the Raspberry Pi Zero v2 (whose price was $15) attached to a toy-drone. In particular, since the Raspberry Pi weighs less than 16 grams, and its size is half of a credit card, we could easily attach it to the common commercial DJI Tello toy-drone (<$100, <90 grams, 98…

    Submitted 21 September, 2022; originally announced September 2022.

  8. BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling

    Authors: Yechao Bai, Xiaogang Wang, Marcelo H. Ang Jr, Daniela Rus

    Abstract: The learning and aggregation of multi-scale features are essential in empowering neural networks to capture the fine-grained geometric details in the point cloud upsampling task. Most existing approaches extract multi-scale features from a point cloud of a fixed resolution, hence obtain only a limited level of details. Though an existing approach aggregates a feature hierarchy of different resolut…

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: Accepted to RA-L 2022. in IEEE Robotics and Automation Letters

    Journal ref: in IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7447-7454, July 2022

  9. arXiv:2206.01261  [pdf, other]

    cs.LG cs.AI cs.NE

    Entangled Residual Mappings

    Authors: Mathias Lechner, Ramin Hasani, Zahra Babaiee, Radu Grosu, Daniela Rus, Thomas A. Henzinger, Sepp Hochreiter

    Abstract: Residual mappings have been shown to perform representation learning in the first layers and iterative feature refinement in higher layers. This interplay, combined with their stabilizing effect on the gradient norms, enables them to train very deep networks. In this paper, we take a step further and introduce entangled residual mappings to generalize the structure of the residual connections and…

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 21 Pages

  10. Multi-robot Task Assignment for Aerial Tracking with Viewpoint Constraints

    Authors: Aaron Ray, Alyssa Pierson, Hai Zhu, Javier Alonso-Mora, Daniela Rus

    Abstract: We address the problem of assigning a team of drones to autonomously capture a set of desired shots of a dynamic target in the presence of obstacles. We present a two-stage planning pipeline that generates offline an assignment of drones to shots and locally optimizes online the viewpoint. Given desired shot parameters, the high-level planner uses a visibility heuristic to predict good times for captu…

    Submitted 30 May, 2022; originally announced May 2022.

    Journal ref: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 1515-1522

  11. arXiv:2205.15473  [pdf, other]

    cs.RO

    Free-Space Ellipsoid Graphs for Multi-Agent Target Monitoring

    Authors: Aaron Ray, Alyssa Pierson, Daniela Rus

    Abstract: We apply a novel framework for decomposing and reasoning about free space in an environment to a multi-agent persistent monitoring problem. Our decomposition method represents free space as a collection of ellipsoids associated with a weighted connectivity graph. The same ellipsoids used for reasoning about connectivity and distance during high level planning can be used as state constraints in a…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: IEEE Intl. Conf. on Robotics and Automation (ICRA) 2022

  12. arXiv:2205.13542  [pdf, other]

    cs.CV

    BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Authors: Zhijian Liu, Haotian Tang, Alexander Amini, Xinyu Yang, Huizi Mao, Daniela Rus, Song Han

    Abstract: Multi-sensor fusion is essential for an accurate and reliable autonomous driving system. Recent approaches are based on point-level fusion: augmenting the LiDAR point cloud with camera features. However, the camera-to-LiDAR projection throws away the semantic density of camera features, hindering the effectiveness of such methods, especially for semantic-oriented tasks (such as 3D scene segmentati…

    Submitted 16 June, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: The first two authors contributed equally to this work. Project page: https://bevfusion.mit.edu

  13. arXiv:2205.09117  [pdf, other]

    cs.LG cs.RO eess.SY

    Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks

    Authors: Ryan Sander, Wilko Schwarting, Tim Seyde, Igor Gilitschenski, Sertac Karaman, Daniela Rus

    Abstract: Experience replay plays a crucial role in improving the sample efficiency of deep reinforcement learning agents. Recent advances in experience replay propose using Mixup (Zhang et al., 2018) to further improve sample efficiency via synthetic sample generation. We build upon this technique with Neighborhood Mixup Experience Replay (NMER), a geometrically-grounded replay buffer that interpolates tra…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: Accepted to L4DC 2022

  14. arXiv:2204.07412  [pdf, other]

    cs.CV cs.AI cs.LG

    End-to-End Sensitivity-Based Filter Pruning

    Authors: Zahra Babaiee, Lucas Liebenwein, Ramin Hasani, Daniela Rus, Radu Grosu

    Abstract: In this paper, we present a novel sensitivity-based filter pruning algorithm (SbF-Pruner) to learn the importance scores of filters of each layer end-to-end. Our method learns the scores from the filter weights, enabling it to account for the correlations between the filters of each layer. Moreover, by training the pruning scores of all layers simultaneously our method can account for layer interd…

    Submitted 15 April, 2022; originally announced April 2022.

  15. arXiv:2204.07373  [pdf, other]

    cs.RO cs.CV cs.LG

    Revisiting the Adversarial Robustness-Accuracy Tradeoff in Robot Learning

    Authors: Mathias Lechner, Alexander Amini, Daniela Rus, Thomas A. Henzinger

    Abstract: Adversarial training (i.e., training on adversarially perturbed input data) is a well-studied method for making neural networks robust to potential adversarial attacks during inference. However, the improved robustness does not come for free but rather is accompanied by a decrease in overall model accuracy and performance. Recent work has shown that, in practical robot learning applications, the e…

    Submitted 25 January, 2023; v1 submitted 15 April, 2022; originally announced April 2022.

  16. arXiv:2204.02392  [pdf, other]

    cs.RO cs.AI cs.MA

    Deep Interactive Motion Prediction and Planning: Playing Games with Motion Prediction Models

    Authors: Jose L. Vazquez, Alexander Liniger, Wilko Schwarting, Daniela Rus, Luc Van Gool

    Abstract: In most classical Autonomous Vehicle (AV) stacks, the prediction and planning layers are separated, limiting the planner to react to predictions that are not informed by the planned trajectory of the AV. This work presents a module that tightly couples these layers via a game-theoretic Model Predictive Controller (MPC) that uses a novel interactive multi-agent neural network policy as part of its…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: accepted to L4DC

  17. arXiv:2203.07978  [pdf, other]

    eess.SY cs.RO

    Control Barrier Functions for Systems with Multiple Control Inputs

    Authors: Wei Xiao, Christos G. Cassandras, Calin A. Belta, Daniela Rus

    Abstract: Control Barrier Functions (CBFs) are becoming popular tools in guaranteeing safety for nonlinear systems and constraints, and they can reduce a constrained optimal control problem into a sequence of Quadratic Programs (QPs) for affine control systems. The recently proposed High Order Control Barrier Functions (HOCBFs) work for arbitrary relative degree constraints. One of the challenges in a HOCBF…

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: To appear in ACC2022

  18. arXiv:2203.02401  [pdf, other]

    cs.RO cs.CV cs.LG

    Differentiable Control Barrier Functions for Vision-based End-to-End Autonomous Driving

    Authors: Wei Xiao, Tsun-Hsuan Wang, Makram Chahine, Alexander Amini, Ramin Hasani, Daniela Rus

    Abstract: Guaranteeing safety of perception-based learning systems is challenging due to the absence of ground-truth state information unlike in state-aware control scenarios. In this paper, we introduce a safety guaranteed learning framework for vision-based end-to-end autonomous driving. To this end, we design a learning system equipped with differentiable control barrier functions (dCBFs) that is trained…

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 11 pages, Wei Xiao and Tsun-Hsuan Wang are with equal contributions

  19. arXiv:2202.13402  [pdf, other]

    cs.CV

    Concept Graph Neural Networks for Surgical Video Understanding

    Authors: Yutong Ban, Jennifer A. Eckhoff, Thomas M. Ward, Daniel A. Hashimoto, Ozanan R. Meireles, Daniela Rus, Guy Rosman

    Abstract: We constantly integrate our knowledge and understanding of the world to enhance our interpretation of what we see. This ability is crucial in application domains which entail reasoning about multiple entities and concepts, such as AI-augmented surgery. In this paper, we propose a novel way of integrating conceptual knowledge into temporal analysis tasks via temporal concept graph networks. In th…

    Submitted 25 April, 2023; v1 submitted 27 February, 2022; originally announced February 2022.

  20. arXiv:2111.12137  [pdf, other]

    cs.RO cs.CV cs.LG

    Learning Interactive Driving Policies via Data-driven Simulation

    Authors: Tsun-Hsuan Wang, Alexander Amini, Wilko Schwarting, Igor Gilitschenski, Sertac Karaman, Daniela Rus

    Abstract: Data-driven simulators promise high data-efficiency for driving policy learning. When used for modelling interactions, this data-efficiency becomes a bottleneck: Small underlying datasets often lack interesting and challenging edge cases for learning interactive driving. We address this challenge by proposing a simulation method that uses in-painted ado vehicles for learning robust driving policie…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: The first two authors contributed equally to this work. Code is available here: http://vista.csail.mit.edu/

  21. arXiv:2111.12083  [pdf, other]

    cs.RO cs.CV cs.LG

    VISTA 2.0: An Open, Data-driven Simulator for Multimodal Sensing and Policy Learning for Autonomous Vehicles

    Authors: Alexander Amini, Tsun-Hsuan Wang, Igor Gilitschenski, Wilko Schwarting, Zhijian Liu, Song Han, Sertac Karaman, Daniela Rus

    Abstract: Simulation has the potential to transform the development of robust algorithms for mobile agents deployed in safety-critical scenarios. However, the poor photorealism and lack of diverse sensor modalities of existing simulation engines remain key hurdles towards realizing this potential. Here, we present VISTA, an open source, data-driven simulator that integrates multiple types of sensors for aut…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: First two authors contributed equally. Code and project website is available here: https://vista.csail.mit.edu

  22. arXiv:2111.11277  [pdf, other]

    cs.LG cs.RO eess.SY

    BarrierNet: A Safety-Guaranteed Layer for Neural Networks

    Authors: Wei Xiao, Ramin Hasani, Xiao Li, Daniela Rus

    Abstract: This paper introduces differentiable higher-order control barrier functions (CBF) that are end-to-end trainable together with learning systems. CBFs are usually overly conservative, while guaranteeing safety. Here, we address their conservativeness by softening their definitions using environmental dependencies without losing safety guarantees, and embed them into differentiable quadratic program…

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 23 pages

  23. arXiv:2111.02552  [pdf, other]

    cs.LG cs.AI cs.RO

    Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

    Authors: Tim Seyde, Igor Gilitschenski, Wilko Schwarting, Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

    Abstract: Reinforcement learning (RL) for continuous control typically employs distributions whose support covers the entire action space. In this work, we investigate the colloquially known phenomenon that trained agents often prefer actions at the boundaries of that space. We draw theoretical connections to the emergence of bang-bang behavior in optimal control, and provide extensive empirical evaluation…

    Submitted 3 November, 2021; originally announced November 2021.

  24. Model Based Control of Soft Robots: A Survey of the State of the Art and Open Challenges

    Authors: Cosimo Della Santina, Christian Duriez, Daniela Rus

    Abstract: Continuum soft robots are mechanical systems entirely made of continuously deformable elements. This design solution aims to bring robots closer to invertebrate animals and soft appendices of vertebrate animals (e.g., an elephant's trunk, a monkey's tail). This work aims to introduce the control theorist perspective to this novel development in robotics. We aim to remove the barriers to entry into…

    Submitted 3 September, 2023; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: 69 pages, 13 figures

  25. arXiv:2107.11442  [pdf, other]

    cs.LG cs.AI cs.CV

    Compressing Neural Networks: Towards Determining the Optimal Layer-wise Decomposition

    Authors: Lucas Liebenwein, Alaa Maalouf, Oren Gal, Dan Feldman, Daniela Rus

    Abstract: We present a novel global compression framework for deep neural networks that automatically analyzes each layer to identify the optimal per-layer compression ratio, while simultaneously achieving the desired overall compression. Our algorithm hinges on the idea of compressing each convolutional (or fully-connected) layer by slicing its channels into multiple groups and decomposing each group via l…

    Submitted 18 November, 2021; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021

  26. arXiv:2107.08467  [pdf, other]

    cs.LG cs.AI cs.NE math.DS stat.ML

    GoTube: Scalable Stochastic Verification of Continuous-Depth Models

    Authors: Sophie Gruenbacher, Mathias Lechner, Ramin Hasani, Daniela Rus, Thomas A. Henzinger, Scott Smolka, Radu Grosu

    Abstract: We introduce a new stochastic verification algorithm that formally quantifies the behavioral robustness of any time-continuous process formulated as a continuous-depth model. Our algorithm solves a set of global optimization (Go) problems over a given time horizon to construct a tight enclosure (Tube) of the set of all process executions starting from a ball of initial states. We call our algorith…

    Submitted 2 December, 2021; v1 submitted 18 July, 2021; originally announced July 2021.

    Comments: Accepted to the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  27. arXiv:2107.05858  [pdf, other]

    cs.RO

    Multi-Objective Graph Heuristic Search for Terrestrial Robot Design

    Authors: Jie Xu, Andrew Spielberg, Allan Zhao, Daniela Rus, Wojciech Matusik

    Abstract: We present methods for co-designing rigid robots over control and morphology (including discrete topology) over multiple objectives. Previous work has addressed problems in single-objective robot co-design or multi-objective control. However, the joint multi-objective co-design problem is extremely important for generating capable, versatile, algorithmically designed robots. In this work, we prese…

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: IEEE International Conference on Robotics and Automation (ICRA 2021)

  28. arXiv:2106.13898  [pdf, other]

    cs.LG cs.AI cs.NE cs.RO math.DS

    Closed-form Continuous-time Neural Models

    Authors: Ramin Hasani, Mathias Lechner, Alexander Amini, Lucas Liebenwein, Aaron Ray, Max Tschaikowski, Gerald Teschl, Daniela Rus

    Abstract: Continuous-time neural processes are performant sequential decision-makers that are built by differential equations (DE). However, their expressive power when they are deployed on computers is bottlenecked by numerical DE solvers. This limitation has significantly slowed down the scaling and understanding of numerous natural physical phenomena such as the dynamics of nervous systems. Ideally, we w…

    Submitted 2 March, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: 40 pages

    Journal ref: Nature Machine Intelligence 4, 992--1003 (2022)

  29. arXiv:2106.12718  [pdf, other]

    cs.LG cs.AI

    Sparse Flows: Pruning Continuous-depth Models

    Authors: Lucas Liebenwein, Ramin Hasani, Alexander Amini, Daniela Rus

    Abstract: Continuous deep learning architectures enable learning of flexible probabilistic models for predictive modeling as neural ordinary differential equations (ODEs), and for generative modeling as continuous normalizing flows. In this work, we design a framework to decipher the internal dynamics of these continuous depth models by pruning their network architectures. Our empirical results suggest that…

    Submitted 18 November, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  30. arXiv:2106.08314  [pdf, other]

    cs.LG cs.AI cs.NE cs.RO

    Causal Navigation by Continuous-time Neural Networks

    Authors: Charles Vorbach, Ramin Hasani, Alexander Amini, Mathias Lechner, Daniela Rus

    Abstract: Imitation learning enables high-fidelity, vision-based learning of policies within rich, photorealistic environments. However, such techniques often rely on traditional discrete-time neural models and face difficulties in generalizing to domain shifts by failing to account for the causal relationships between the agent and the environment. In this paper, we propose a theoretical and experimental f…

    Submitted 16 August, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 24 Pages

  31. arXiv:2106.07091  [pdf, other]

    cs.CV cs.AI cs.LG cs.NE

    On-Off Center-Surround Receptive Fields for Accurate and Robust Image Classification

    Authors: Zahra Babaiee, Ramin Hasani, Mathias Lechner, Daniela Rus, Radu Grosu

    Abstract: Robustness to variations in lighting conditions is a key objective for any deep vision system. To this end, our paper extends the receptive field of convolutional neural networks with two residual components, ubiquitous in the visual processing system of vertebrates: On-center and off-center pathways, with excitatory center and inhibitory surround; OOCS for short. The on-center pathway is excited…

    Submitted 13 June, 2021; originally announced June 2021.

    Comments: 21 Pages. Accepted for publication in the proceedings of the 38th International Conference on Machine Learning (ICML) 2021

  32. Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation

    Authors: Yechao Bai, Ziyuan Huang, Lyuyu Shen, Hongliang Guo, Marcelo H. Ang Jr, Daniela Rus

    Abstract: Exploiting multi-scale features has shown great potential in tackling semantic segmentation problems. The aggregation is commonly done with sum or concatenation (concat) followed by convolutional (conv) layers. However, it fully passes down the high-level context to the following hierarchy without considering their interrelation. In this work, we aim to enable the low-level feature to aggregate th…

    Submitted 25 June, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted to RA-L 2021. in IEEE Robotics and Automation Letters. The contents of this paper were also selected by the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021) Program Committee for presentation at the Conference

  33. arXiv:2105.09932  [pdf, other]

    cs.RO cs.CV

    Efficient and Robust LiDAR-Based End-to-End Navigation

    Authors: Zhijian Liu, Alexander Amini, Sibo Zhu, Sertac Karaman, Song Han, Daniela Rus

    Abstract: Deep learning has been used to demonstrate end-to-end neural network learning for autonomous vehicle control from raw sensory input. While LiDAR sensors provide reliably accurate information, existing end-to-end driving solutions are mainly based on cameras since processing 3D data requires a large memory footprint and computation cost. On the other hand, increasing the robustness of these systems…

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: ICRA 2021. The first two authors contributed equally to this work. Project page: https://le2ed.mit.edu/

  34. arXiv:2105.05060  [pdf, other]

    q-bio.PE cs.LG physics.soc-ph

    Estimating the State of Epidemics Spreading with Graph Neural Networks

    Authors: Abhishek Tomy, Matteo Razzanelli, Francesco Di Lauro, Daniela Rus, Cosimo Della Santina

    Abstract: When an epidemic spreads into a population, it is often unpractical or impossible to have a continuous monitoring of all subjects involved. As an alternative, algorithmic solutions can be used to infer the state of the whole population from a limited amount of measures. We analyze the capability of deep neural networks to solve this challenging task. Our proposed architecture is based on Graph Con…

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: 15 pages, 7 figures

  35. arXiv:2105.04642  [pdf, other]

    cs.CV

    SUPR-GAN: SUrgical PRediction GAN for Event Anticipation in Laparoscopic and Robotic Surgery

    Authors: Yutong Ban, Guy Rosman, Jennifer A. Eckhoff, Thomas M. Ward, Daniel A. Hashimoto, Taisei Kondo, Hidekazu Iwaki, Ozanan R. Meireles, Daniela Rus

    Abstract: Comprehension of surgical workflow is the foundation upon which artificial intelligence (AI) and machine learning (ML) hold the potential to assist intraoperative decision-making and risk mitigation. In this work, we move beyond mere identification of past surgical phases, into the prediction of future surgical steps and specification of the transitions between them. We use a novel Generative Adv…

    Submitted 9 March, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: RA-L ICRA 2022

  36. arXiv:2104.10831  [pdf, other]

    cs.RO

    LVI-SAM: Tightly-coupled Lidar-Visual-Inertial Odometry via Smoothing and Mapping

    Authors: Tixiao Shan, Brendan Englot, Carlo Ratti, Daniela Rus

    Abstract: We propose a framework for tightly-coupled lidar-visual-inertial odometry via smoothing and mapping, LVI-SAM, that achieves real-time state estimation and map-building with high accuracy and robustness. LVI-SAM is built atop a factor graph and is composed of two sub-systems: a visual-inertial system (VIS) and a lidar-inertial system (LIS). The two sub-systems are designed in a tightly-coupled mann…

    Submitted 30 May, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  37. arXiv:2104.08614  [pdf]

    cs.SD cs.AI cs.CL cs.LG cs.RO eess.AS

    Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales

    Authors: Jacob Andreas, Gašper Beguš, Michael M. Bronstein, Roee Diamant, Denley Delaney, Shane Gero, Shafi Goldwasser, David F. Gruber, Sarah de Haas, Peter Malkin, Roger Payne, Giovanni Petri, Daniela Rus, Pratyusha Sharma, Dan Tchernov, Pernille Tønnesen, Antonio Torralba, Daniel Vogt, Robert J. Wood

    Abstract: The past decade has witnessed a groundbreaking rise of machine learning for human language analysis, with current methods capable of automatically accurately recovering various aspects of syntax and semantics - including sentence structure and grounded word meaning - from large data collections. Recent research showed the promise of such tools for analyzing acoustic communication in nonhuman speci…

    Submitted 17 April, 2021; originally announced April 2021.

  38. arXiv:2104.02822  [pdf, other]

    cs.LG

    Low-Regret Active learning

    Authors: Cenk Baykal, Lucas Liebenwein, Dan Feldman, Daniela Rus

    Abstract: We develop an online learning algorithm for identifying unlabeled data points that are most informative for training (i.e., active learning). By formulating the active learning problem as the prediction with sleeping experts problem, we provide a regret minimization framework for identifying relevant data with respect to any given definition of informativeness. Motivated by the successes of ensemb…

    Submitted 22 February, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

  39. arXiv:2103.15145  [pdf, other]

    cs.CV

    TransCenter: Transformers with Dense Representations for Multiple-Object Tracking

    Authors: Yihong Xu, Yutong Ban, Guillaume Delorme, Chuang Gan, Daniela Rus, Xavier Alameda-Pineda

    Abstract: Transformers have proven superior performance for a wide variety of tasks since they were introduced. In recent years, they have drawn attention from the vision community in tasks such as image classification and object detection. Despite this wave, an accurate and efficient multiple-object tracking (MOT) method based on transformers is yet to be designed. We argue that the direct application of a…

    Submitted 30 September, 2022; v1 submitted 28 March, 2021; originally announced March 2021.

    Comments: 17 pages, 10 figures, updated results and add comparisons

  40. arXiv:2103.10888  [pdf, other]

    eess.SY

    Feedback from Pixels: Output Regulation via Learning-Based Scene View Synthesis

    Authors: Murad Abu-Khalaf, Sertac Karaman, Daniela Rus

    Abstract: We propose a novel controller synthesis involving feedback from pixels, whereby the measurement is a high dimensional signal representing a pixelated image with Red-Green-Blue (RGB) values. The approach neither requires feature extraction, nor object detection, nor visual correspondence. The control policy does not involve the estimation of states or similar latent representations. Instead, tracki…

    Submitted 23 April, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: Submitted to L4DC on November-20-2020; Accepted on March-5-2021

  41. arXiv:2103.08187  [pdf, other]

    cs.LG

    Adversarial Training is Not Ready for Robot Learning

    Authors: Mathias Lechner, Ramin Hasani, Radu Grosu, Daniela Rus, Thomas A. Henzinger

    Abstract: Adversarial training is an effective method to train deep learning models that are resilient to norm-bounded perturbations, with the cost of nominal performance drop. While adversarial training appears to enhance the robustness and safety of a deep model deployed in open-world decision-critical applications, counterintuitively, it induces undesired behaviors in robot learning settings. In this pap…

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: Accepted at the IEEE International Conference on Robotics and Automation (ICRA) 2021

  42. arXiv:2103.04909  [pdf, other]

    cs.LG cs.AI cs.NE cs.RO

    Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing

    Authors: Axel Brunnbauer, Luigi Berducci, Andreas Brandstätter, Mathias Lechner, Ramin Hasani, Daniela Rus, Radu Grosu

    Abstract: World models learn behaviors in a latent imagination space to enhance the sample-efficiency of deep reinforcement learning (RL) algorithms. While learning world models for high-dimensional observations (e.g., pixel inputs) has become practicable on standard RL benchmarks and some games, their effectiveness in real-world robotics applications has not been explored. In this paper, we investigate how…

    Submitted 28 February, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: This paper is accepted for presentation at the International Conference on Robotics and Automation (ICRA), 2022

  43. arXiv:2103.03014  [pdf, other]

    cs.LG cs.AI cs.CV

    Lost in Pruning: The Effects of Pruning Neural Networks beyond Test Accuracy

    Authors: Lucas Liebenwein, Cenk Baykal, Brandon Carter, David Gifford, Daniela Rus

    Abstract: Neural network pruning is a popular technique used to reduce the inference costs of modern, potentially overparameterized, networks. Starting from a pre-trained network, the process is as follows: remove redundant parameters, retrain, and repeat while maintaining the same test accuracy. The result is a model that is a fraction of the size of the original with comparable predictive performance (tes…

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: Published in MLSys 2021

  44. arXiv:2103.02111  [pdf, other]

    cs.CV cs.RO

    Robust Place Recognition using an Imaging Lidar

    Authors: Tixiao Shan, Brendan Englot, Fabio Duarte, Carlo Ratti, Daniela Rus

    Abstract: We propose a methodology for robust, real-time place recognition using an imaging lidar, which yields image-quality high-resolution 3D point clouds. Utilizing the intensity readings of an imaging lidar, we project the point cloud and obtain an intensity image. ORB feature descriptors are extracted from the image and encoded into a bag-of-words vector. The vector, used to identify the point cloud,…

    Submitted 21 April, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: ICRA 2021

  45. arXiv:2102.12571  [pdf, other]

    cs.AI cs.LG cs.RO

    The Logical Options Framework

    Authors: Brandon Araki, Xiao Li, Kiran Vodrahalli, Jonathan DeCastro, Micah J. Fry, Daniela Rus

    Abstract: Learning composable policies for environments with complex rules and tasks is a challenging problem. We introduce a hierarchical reinforcement learning framework called the Logical Options Framework (LOF) that learns policies that are satisfying, optimal, and composable. LOF efficiently learns policies that satisfy tasks by representing the task as an automaton and integrating it into learning and…

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: 23 pages, 19 figures

    ACM Class: I.2.9; I.2.6; G.3; I.5.1

  46. arXiv:2102.09812  [pdf, other]

    cs.LG cs.AI cs.RO

    Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space

    Authors: Wilko Schwarting, Tim Seyde, Igor Gilitschenski, Lucas Liebenwein, Ryan Sander, Sertac Karaman, Daniela Rus

    Abstract: Learning competitive behaviors in multi-agent settings such as racing requires long-term reasoning about potential adversarial interactions. This paper presents Deep Latent Competition (DLC), a novel reinforcement learning algorithm that learns competitive visual control policies through self-play in imagination. The DLC agent imagines multi-agent interaction sequences in the compact latent space…

    Submitted 19 February, 2021; originally announced February 2021.

    Comments: Wilko, Tim, and Igor contributed equally to this work; published in Conference on Robot Learning 2020

  47. arXiv:2101.05917  [pdf, other]

    cs.LG cs.GR

    DiffPD: Differentiable Projective Dynamics

    Authors: Tao Du, Kui Wu, Pingchuan Ma, Sebastien Wah, Andrew Spielberg, Daniela Rus, Wojciech Matusik

    Abstract: We present a novel, fast differentiable simulator for soft-body learning and control applications. Existing differentiable soft-body simulators can be classified into two categories based on their time integration methods: Simulators using explicit time-stepping schemes require tiny time steps to avoid numerical instabilities in gradient computation, and simulators using implicit time integration…

    Submitted 10 October, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: ACM Transactions on Graphics, 2021. Code: https://github.com/dut09/diff_pd

  48. arXiv:2010.14641  [pdf, other]

    cs.LG cs.AI cs.RO

    Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles

    Authors: Tim Seyde, Wilko Schwarting, Sertac Karaman, Daniela Rus

    Abstract: Learning complex robot behaviors through interaction requires structured exploration. Planning should target interactions with the potential to optimize long-term performance, while only reducing uncertainty where conducive to this objective. This paper presents Latent Optimistic Value Exploration (LOVE), a strategy that enables deep exploration through optimism in the face of uncertain long-term…

    Submitted 11 December, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

  49. arXiv:2010.09909  [pdf]

    cs.RO cs.CY

    The Role of Robotics in Infectious Disease Crises

    Authors: Gregory Hager, Vijay Kumar, Robin Murphy, Daniela Rus, Russell Taylor

    Abstract: The recent coronavirus pandemic has highlighted the many challenges faced by the healthcare, public safety, and economic systems when confronted with a surge in patients that require intensive treatment and a population that must be quarantined or shelter in place. The most obvious and pressing challenge is taking care of acutely ill patients while managing spread of infection within the care faci…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: 25 pages (including title page)

  50. arXiv:2010.04290  [pdf, other]

    cs.LG stat.ML

    Deep Learning Meets Projective Clustering

    Authors: Alaa Maalouf, Harry Lang, Daniela Rus, Dan Feldman

    Abstract: A common approach for compressing NLP networks is to encode the embedding layer as a matrix $A\in\mathbb{R}^{n\times d}$, compute its rank-$j$ approximation $A_j$ via SVD, and then factor $A_j$ into a pair of matrices that correspond to smaller fully-connected layers to replace the original embedding layer. Geometrically, the rows of $A$ represent points in $\mathbb{R}^d$, and the rows of $A_j$ re…

    Submitted 8 October, 2020; originally announced October 2020.