Skip to main content

Showing 1–44 of 44 results for author: Hasani, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15149  [pdf, other

    cs.RO cs.AI cs.CV

    Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks

    Authors: Alex Quach, Makram Chahine, Alexander Amini, Ramin Hasani, Daniela Rus

    Abstract: Simulators are powerful tools for autonomous robot learning as they offer scalable data generation, flexible design, and optimization of trajectories. However, transferring behavior learned from simulation data into the real world proves to be difficult, usually mitigated with compute-heavy domain randomization methods or further model fine-tuning. We present a method to improve generalization and… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.06147  [pdf, other

    cs.LG eess.SY

    State-Free Inference of State-Space Models: The Transfer Function Approach

    Authors: Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Atsushi Yamashita, Michael Poli

    Abstract: We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of… ▽ More

    Submitted 1 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Resubmission 02/06/2024: Fixed minor typo of recurrent form RTF

  3. arXiv:2404.01750  [pdf, other

    cs.CV

    Exploring Latent Pathways: Enhancing the Interpretability of Autonomous Driving with a Variational Autoencoder

    Authors: Anass Bairouk, Mirjana Maras, Simon Herlin, Alexander Amini, Marc Blanchon, Ramin Hasani, Patrick Chareyre, Daniela Rus

    Abstract: Autonomous driving presents a complex challenge, which is usually addressed with artificial intelligence models that are end-to-end or modular in nature. Within the landscape of modular approaches, a bio-inspired neural circuit policy model has emerged as an innovative control module, offering a compact and inherently interpretable system to infer a steering wheel command from abstract visual feat… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Submitted to 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  4. arXiv:2401.08602  [pdf, other

    cs.NE cs.LG

    Learning with Chemical versus Electrical Synapses -- Does it Make a Difference?

    Authors: Mónika Farsang, Mathias Lechner, David Lung, Ramin Hasani, Daniela Rus, Radu Grosu

    Abstract: Bio-inspired neural networks have the potential to advance our understanding of neural computation and improve the state-of-the-art of AI systems. Bio-electrical synapses directly transmit neural signals, by enabling fast current flow between neurons. In contrast, bio-chemical synapses transmit neural signals indirectly, through neurotransmitters. Prior work showed that interpretable dynamics for… ▽ More

    Submitted 21 November, 2023; originally announced January 2024.

  5. arXiv:2310.03915  [pdf, other

    cs.LG

    Leveraging Low-Rank and Sparse Recurrent Connectivity for Robust Closed-Loop Control

    Authors: Neehal Tumma, Mathias Lechner, Noel Loo, Ramin Hasani, Daniela Rus

    Abstract: Develo** autonomous agents that can interact with changing environments is an open challenge in machine learning. Robustness is particularly important in these settings as agents are often fit offline on expert demonstrations but deployed online where they must generalize to the closed feedback loop within the environment. In this work, we explore the application of recurrent neural networks to… ▽ More

    Submitted 30 November, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

  6. arXiv:2305.14113  [pdf, other

    cs.LG

    On the Size and Approximation Error of Distilled Sets

    Authors: Alaa Maalouf, Murad Tukan, Noel Loo, Ramin Hasani, Mathias Lechner, Daniela Rus

    Abstract: Dataset Distillation is the task of synthesizing small datasets from large ones while still retaining comparable predictive accuracy to the original uncompressed dataset. Despite significant empirical progress in recent years, there is little understanding of the theoretical limitations/guarantees of dataset distillation, specifically, what excess risk is achieved by distillation compared to the o… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  7. arXiv:2304.02733  [pdf, other

    cs.RO cs.LG eess.SY

    Learning Stability Attention in Vision-based End-to-end Driving Policies

    Authors: Tsun-Hsuan Wang, Wei Xiao, Makram Chahine, Alexander Amini, Ramin Hasani, Daniela Rus

    Abstract: Modern end-to-end learning systems can learn to explicitly infer control from perception. However, it is difficult to guarantee stability and robustness for these systems since they are often exposed to unstructured, high-dimensional, and complex observation spaces (e.g., autonomous driving from a stream of pixel inputs). We propose to leverage control Lyapunov functions (CLFs) to equip end-to-end… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: First two authors contributed equally; L4DC 2023

  8. Infrastructure-based End-to-End Learning and Prevention of Driver Failure

    Authors: Noam Buckman, Shiva Sreeram, Mathias Lechner, Yutong Ban, Ramin Hasani, Sertac Karaman, Daniela Rus

    Abstract: Intelligent intersection managers can improve safety by detecting dangerous drivers or failure modes in autonomous vehicles, warning oncoming vehicles as they approach an intersection. In this work, we present FailureNet, a recurrent neural network trained end-to-end on trajectories of both nominal and reckless drivers in a scaled miniature city. FailureNet observes the poses of vehicles as they a… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 8 pages. Accepted to ICRA 2023

  9. arXiv:2302.06755  [pdf, other

    cs.LG cs.CV stat.ML

    Dataset Distillation with Convexified Implicit Gradients

    Authors: Noel Loo, Ramin Hasani, Mathias Lechner, Daniela Rus

    Abstract: We propose a new dataset distillation algorithm using reparameterization and convexification of implicit gradients (RCIG), that substantially improves the state-of-the-art. To this end, we first formulate dataset distillation as a bi-level optimization problem. Then, we show how implicit gradients can be effectively used to compute meta-gradient updates. We further equip the algorithm with a conve… ▽ More

    Submitted 9 November, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

  10. arXiv:2302.01428  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation

    Authors: Noel Loo, Ramin Hasani, Mathias Lechner, Alexander Amini, Daniela Rus

    Abstract: Modern deep learning requires large volumes of data, which could contain sensitive or private information that cannot be leaked. Recent work has shown for homogeneous neural networks a large portion of this training data could be reconstructed with only access to the trained network parameters. While the attack was shown to work empirically, there exists little formal understanding of its effectiv… ▽ More

    Submitted 9 November, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  11. arXiv:2212.11084  [pdf, other

    cs.RO cs.AI

    Towards Cooperative Flight Control Using Visual-Attention

    Authors: Lianhao Yin, Makram Chahine, Tsun-Hsuan Wang, Tim Seyde, Chao Liu, Mathias Lechner, Ramin Hasani, Daniela Rus

    Abstract: The cooperation of a human pilot with an autonomous agent during flight control realizes parallel autonomy. We propose an air-guardian system that facilitates cooperation between a pilot with eye tracking and a parallel end-to-end neural control system. Our vision-based air-guardian system combines a causal continuous-depth neural network model with a cooperation layer to enable parallel autonomy… ▽ More

    Submitted 20 September, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

  12. arXiv:2210.12067  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Efficient Dataset Distillation Using Random Feature Approximation

    Authors: Noel Loo, Ramin Hasani, Alexander Amini, Daniela Rus

    Abstract: Dataset distillation compresses large datasets into smaller synthetic coresets which retain performance with the aim of reducing the storage and computational burden of processing the entire dataset. Today's best-performing algorithm, \textit{Kernel Inducing Points} (KIP), which makes use of the correspondence between infinite-width neural networks and kernel-ridge regression, is prohibitively slo… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted to the Conference on the Advances in Neural Information Processing Systems (NeurIPS) 2022

  13. arXiv:2210.12030  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Evolution of Neural Tangent Kernels under Benign and Adversarial Training

    Authors: Noel Loo, Ramin Hasani, Alexander Amini, Daniela Rus

    Abstract: Two key challenges facing modern deep learning are mitigating deep networks' vulnerability to adversarial attacks and understanding deep learning's generalization capabilities. Towards the first issue, many defense strategies have been developed, with the most common being Adversarial Training (AT). Towards the second challenge, one of the dominant theories that has emerged is the Neural Tangent K… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted to the Conference on Advances in Neural Information Processing Systems (NeurIPS) 2022

  14. arXiv:2210.11114  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Pruning by Active Attention Manipulation

    Authors: Zahra Babaiee, Lucas Liebenwein, Ramin Hasani, Daniela Rus, Radu Grosu

    Abstract: Filter pruning of a CNN is typically achieved by applying discrete masks on the CNN's filter weights or activation maps, post-training. Here, we present a new filter-importance-scoring concept named pruning by active attention manipulation (PAAM), that sparsifies the CNN's set of filters through a particular attention mechanism, during-training. PAAM learns analog filter scores from the filter wei… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.07412

  15. arXiv:2210.06650  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Interpreting Neural Policies with Disentangled Tree Representations

    Authors: Tsun-Hsuan Wang, Wei Xiao, Tim Seyde, Ramin Hasani, Daniela Rus

    Abstract: The advancement of robots, particularly those functioning in complex human-centric environments, relies on control solutions that are driven by machine learning. Understanding how learning-based controllers make decisions is crucial since robots are often safety-critical systems. This urges a formal and quantitative understanding of the explanatory factors in the interpretability of robot learning… ▽ More

    Submitted 12 November, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  16. arXiv:2210.04763  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    On the Forward Invariance of Neural ODEs

    Authors: Wei Xiao, Tsun-Hsuan Wang, Ramin Hasani, Mathias Lechner, Yutong Ban, Chuang Gan, Daniela Rus

    Abstract: We propose a new method to ensure neural ordinary differential equations (ODEs) satisfy output specifications by using invariance set propagation. Our approach uses a class of control barrier functions to transform output specifications into constraints on the parameters and inputs of the learning system. This setup allows us to achieve output specification guarantees simply by changing the constr… ▽ More

    Submitted 31 May, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 25 pages, accepted in ICML2023, website: https://weixy21.github.io/invariance/

  17. arXiv:2210.04728  [pdf, other

    cs.LG

    PyHopper -- Hyperparameter optimization

    Authors: Mathias Lechner, Ramin Hasani, Philipp Neubauer, Sophie Neubauer, Daniela Rus

    Abstract: Hyperparameter tuning is a fundamental aspect of machine learning research. Setting up the infrastructure for systematic optimization of hyperparameters can take a significant amount of time. Here, we present PyHopper, a black-box optimization platform designed to streamline the hyperparameter tuning workflow of machine learning researchers. PyHopper's goal is to integrate with existing code with… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

  18. arXiv:2210.04303  [pdf, other

    cs.CV cs.AI cs.LG cs.NE cs.RO

    Are All Vision Models Created Equal? A Study of the Open-Loop to Closed-Loop Causality Gap

    Authors: Mathias Lechner, Ramin Hasani, Alexander Amini, Tsun-Hsuan Wang, Thomas A. Henzinger, Daniela Rus

    Abstract: There is an ever-growing zoo of modern neural network models that can efficiently learn end-to-end control from visual observations. These advanced deep models, ranging from convolutional to patch-based networks, have been extensively tested on offline image classification and regression tasks. In this paper, we study these vision architectures with respect to the open-loop to closed-loop causalit… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  19. arXiv:2209.12951  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.NE

    Liquid Structural State-Space Models

    Authors: Ramin Hasani, Mathias Lechner, Tsun-Hsuan Wang, Makram Chahine, Alexander Amini, Daniela Rus

    Abstract: A proper parametrization of state transition matrices of linear state-space models (SSMs) followed by standard nonlinearities enables them to efficiently learn representations from sequential data, establishing the state-of-the-art on a large series of long-range sequence modeling benchmarks. In this paper, we show that we can improve further when the structural SSM such as S4 is given by a linear… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  20. arXiv:2206.01261  [pdf, other

    cs.LG cs.AI cs.NE

    Entangled Residual Map**s

    Authors: Mathias Lechner, Ramin Hasani, Zahra Babaiee, Radu Grosu, Daniela Rus, Thomas A. Henzinger, Sepp Hochreiter

    Abstract: Residual map**s have been shown to perform representation learning in the first layers and iterative feature refinement in higher layers. This interplay, combined with their stabilizing effect on the gradient norms, enables them to train very deep networks. In this paper, we take a step further and introduce entangled residual map**s to generalize the structure of the residual connections and… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 21 Pages

  21. arXiv:2204.07412  [pdf, other

    cs.CV cs.AI cs.LG

    End-to-End Sensitivity-Based Filter Pruning

    Authors: Zahra Babaiee, Lucas Liebenwein, Ramin Hasani, Daniela Rus, Radu Grosu

    Abstract: In this paper, we present a novel sensitivity-based filter pruning algorithm (SbF-Pruner) to learn the importance scores of filters of each layer end-to-end. Our method learns the scores from the filter weights, enabling it to account for the correlations between the filters of each layer. Moreover, by training the pruning scores of all layers simultaneously our method can account for layer interd… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

  22. arXiv:2203.02401  [pdf, other

    cs.RO cs.CV cs.LG

    Differentiable Control Barrier Functions for Vision-based End-to-End Autonomous Driving

    Authors: Wei Xiao, Tsun-Hsuan Wang, Makram Chahine, Alexander Amini, Ramin Hasani, Daniela Rus

    Abstract: Guaranteeing safety of perception-based learning systems is challenging due to the absence of ground-truth state information unlike in state-aware control scenarios. In this paper, we introduce a safety guaranteed learning framework for vision-based end-to-end autonomous driving. To this end, we design a learning system equipped with differentiable control barrier functions (dCBFs) that is trained… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 11 pages, Wei Xiao and Tsun-Hsuan Wang are with equal contributions

  23. arXiv:2111.11277  [pdf, other

    cs.LG cs.RO eess.SY

    BarrierNet: A Safety-Guaranteed Layer for Neural Networks

    Authors: Wei Xiao, Ramin Hasani, Xiao Li, Daniela Rus

    Abstract: This paper introduces differentiable higher-order control barrier functions (CBF) that are end-to-end trainable together with learning systems. CBFs are usually overly conservative, while guaranteeing safety. Here, we address their conservativeness by softening their definitions using environmental dependencies without loosing safety guarantees, and embed them into differentiable quadratic program… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 23 pages

  24. arXiv:2110.07667  [pdf, other

    cs.CV cs.AI cs.HC

    Interactive Analysis of CNN Robustness

    Authors: Stefan Sietzen, Mathias Lechner, Judy Borowski, Ramin Hasani, Manuela Waldner

    Abstract: While convolutional neural networks (CNNs) have found wide adoption as state-of-the-art models for image-related tasks, their predictions are often highly sensitive to small input perturbations, which the human vision is robust against. This paper presents Perturber, a web-based application that allows users to instantaneously explore how CNN activations and predictions evolve when a 3D input scen… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted at Pacific Graphics 2021

  25. arXiv:2107.08467  [pdf, other

    cs.LG cs.AI cs.NE math.DS stat.ML

    GoTube: Scalable Stochastic Verification of Continuous-Depth Models

    Authors: Sophie Gruenbacher, Mathias Lechner, Ramin Hasani, Daniela Rus, Thomas A. Henzinger, Scott Smolka, Radu Grosu

    Abstract: We introduce a new stochastic verification algorithm that formally quantifies the behavioral robustness of any time-continuous process formulated as a continuous-depth model. Our algorithm solves a set of global optimization (Go) problems over a given time horizon to construct a tight enclosure (Tube) of the set of all process executions starting from a ball of initial states. We call our algorith… ▽ More

    Submitted 2 December, 2021; v1 submitted 18 July, 2021; originally announced July 2021.

    Comments: Accepted to the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  26. arXiv:2106.13898  [pdf, other

    cs.LG cs.AI cs.NE cs.RO math.DS

    Closed-form Continuous-time Neural Models

    Authors: Ramin Hasani, Mathias Lechner, Alexander Amini, Lucas Liebenwein, Aaron Ray, Max Tschaikowski, Gerald Teschl, Daniela Rus

    Abstract: Continuous-time neural processes are performant sequential decision-makers that are built by differential equations (DE). However, their expressive power when they are deployed on computers is bottlenecked by numerical DE solvers. This limitation has significantly slowed down the scaling and understanding of numerous natural physical phenomena such as the dynamics of nervous systems. Ideally, we w… ▽ More

    Submitted 2 March, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: 40 pages

    Journal ref: Nature Machine Intelligence 4, 992--1003 (2022)

  27. arXiv:2106.12718  [pdf, other

    cs.LG cs.AI

    Sparse Flows: Pruning Continuous-depth Models

    Authors: Lucas Liebenwein, Ramin Hasani, Alexander Amini, Daniela Rus

    Abstract: Continuous deep learning architectures enable learning of flexible probabilistic models for predictive modeling as neural ordinary differential equations (ODEs), and for generative modeling as continuous normalizing flows. In this work, we design a framework to decipher the internal dynamics of these continuous depth models by pruning their network architectures. Our empirical results suggest that… ▽ More

    Submitted 18 November, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  28. arXiv:2106.08314  [pdf, other

    cs.LG cs.AI cs.NE cs.RO

    Causal Navigation by Continuous-time Neural Networks

    Authors: Charles Vorbach, Ramin Hasani, Alexander Amini, Mathias Lechner, Daniela Rus

    Abstract: Imitation learning enables high-fidelity, vision-based learning of policies within rich, photorealistic environments. However, such techniques often rely on traditional discrete-time neural models and face difficulties in generalizing to domain shifts by failing to account for the causal relationships between the agent and the environment. In this paper, we propose a theoretical and experimental f… ▽ More

    Submitted 16 August, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 24 Pages

  29. arXiv:2106.07091  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    On-Off Center-Surround Receptive Fields for Accurate and Robust Image Classification

    Authors: Zahra Babaiee, Ramin Hasani, Mathias Lechner, Daniela Rus, Radu Grosu

    Abstract: Robustness to variations in lighting conditions is a key objective for any deep vision system. To this end, our paper extends the receptive field of convolutional neural networks with two residual components, ubiquitous in the visual processing system of vertebrates: On-center and off-center pathways, with excitatory center and inhibitory surround; OOCS for short. The on-center pathway is excited… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

    Comments: 21 Pages. Accepted for publication in the proceedings of the 38th International Conference on Machine Learning (ICML) 2021

  30. arXiv:2103.08187  [pdf, other

    cs.LG

    Adversarial Training is Not Ready for Robot Learning

    Authors: Mathias Lechner, Ramin Hasani, Radu Grosu, Daniela Rus, Thomas A. Henzinger

    Abstract: Adversarial training is an effective method to train deep learning models that are resilient to norm-bounded perturbations, with the cost of nominal performance drop. While adversarial training appears to enhance the robustness and safety of a deep model deployed in open-world decision-critical applications, counterintuitively, it induces undesired behaviors in robot learning settings. In this pap… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: Accepted at the IEEE International Conference on Robotics and Automation (ICRA) 2021

  31. arXiv:2103.04909  [pdf, other

    cs.LG cs.AI cs.NE cs.RO

    Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing

    Authors: Axel Brunnbauer, Luigi Berducci, Andreas Brandstätter, Mathias Lechner, Ramin Hasani, Daniela Rus, Radu Grosu

    Abstract: World models learn behaviors in a latent imagination space to enhance the sample-efficiency of deep reinforcement learning (RL) algorithms. While learning world models for high-dimensional observations (e.g., pixel inputs) has become practicable on standard RL benchmarks and some games, their effectiveness in real-world robotics applications has not been explored. In this paper, we investigate how… ▽ More

    Submitted 28 February, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: This paper is accepted for presentation at the International Conference on Robotics and Automation (ICRA), 2022

  32. arXiv:2012.08863  [pdf, other

    cs.LG cs.NE eess.SY

    On The Verification of Neural ODEs with Stochastic Guarantees

    Authors: Sophie Gruenbacher, Ramin Hasani, Mathias Lechner, Jacek Cyranka, Scott A. Smolka, Radu Grosu

    Abstract: We show that Neural ODEs, an emerging class of time-continuous neural networks, can be verified by solving a set of global-optimization problems. For this purpose, we introduce Stochastic Lagrangian Reachability (SLR), an abstraction-based technique for constructing a tight Reachtube (an over-approximation of the set of reachable states over a given time-horizon), and provide stochastic guarantees… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: 12 pages, 2 figures

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 35(13), 2021, pages 11525-11535

  33. arXiv:2006.04439  [pdf, other

    cs.LG cs.NE stat.ML

    Liquid Time-constant Networks

    Authors: Ramin Hasani, Mathias Lechner, Alexander Amini, Daniela Rus, Radu Grosu

    Abstract: We introduce a new class of time-continuous recurrent neural network models. Instead of declaring a learning system's dynamics by implicit nonlinearities, we construct networks of linear first-order dynamical systems modulated via nonlinear interlinked gates. The resulting models represent dynamical systems with varying (i.e., liquid) time-constants coupled to their hidden state, with outputs bein… ▽ More

    Submitted 14 December, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted to the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21)

  34. arXiv:2006.04418  [pdf, other

    cs.LG stat.ML

    Learning Long-Term Dependencies in Irregularly-Sampled Time Series

    Authors: Mathias Lechner, Ramin Hasani

    Abstract: Recurrent neural networks (RNNs) with continuous-time hidden states are a natural fit for modeling irregularly-sampled time series. These models, however, face difficulties when the input data possess long-term dependencies. We prove that similar to standard RNNs, the underlying reason for this issue is the vanishing or exploding of the gradient during training. This phenomenon is expressed by the… ▽ More

    Submitted 4 December, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

  35. arXiv:1811.00321  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Liquid Time-constant Recurrent Neural Networks as Universal Approximators

    Authors: Ramin M. Hasani, Mathias Lechner, Alexander Amini, Daniela Rus, Radu Grosu

    Abstract: In this paper, we introduce the notion of liquid time-constant (LTC) recurrent neural networks (RNN)s, a subclass of continuous-time RNNs, with varying neuronal time-constant realized by their nonlinear synaptic transmission model. This feature is inspired by the communication principles in the nervous system of small species. It enables the model to approximate continuous map** with a small num… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: This short report introduces the universal approximation capabilities of liquid time-constant (LTC) recurrent neural networks, and provides theoretical bounds for its dynamics

  36. arXiv:1809.04423  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Can a Compact Neuronal Circuit Policy be Re-purposed to Learn Simple Robotic Control?

    Authors: Ramin Hasani, Mathias Lechner, Alexander Amini, Daniela Rus, Radu Grosu

    Abstract: We propose a neural information processing system which is obtained by re-purposing the function of a biological neural circuit model, to govern simulated and real-world control tasks. Inspired by the structure of the nervous system of the soil-worm, C. elegans, we introduce Neuronal Circuit Policies (NCPs), defined as the model of biological neural circuits reparameterized for the control of an a… ▽ More

    Submitted 16 November, 2019; v1 submitted 11 September, 2018; originally announced September 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1803.08554

  37. arXiv:1809.03864  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Response Characterization for Auditing Cell Dynamics in Long Short-term Memory Networks

    Authors: Ramin M. Hasani, Alexander Amini, Mathias Lechner, Felix Naser, Radu Grosu, Daniela Rus

    Abstract: In this paper, we introduce a novel method to interpret recurrent neural networks (RNNs), particularly long short-term memory networks (LSTMs) at the cellular level. We propose a systematic pipeline for interpreting individual hidden state dynamics within the network using response characterization methods. The ranked contribution of individual cells to the network's output is computed by analyzin… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

  38. arXiv:1803.08554  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE

    Neuronal Circuit Policies

    Authors: Mathias Lechner, Ramin M. Hasani, Radu Grosu

    Abstract: We propose an effective way to create interpretable control agents, by re-purposing the function of a biological neural circuit model, to govern simulated and real world reinforcement learning (RL) test-beds. We model the tap-withdrawal (TW) neural circuit of the nematode, C. elegans, a circuit responsible for the worm's reflexive response to external mechanical touch stimulations, and learn its s… ▽ More

    Submitted 22 March, 2018; originally announced March 2018.

  39. arXiv:1711.03467  [pdf, other

    cs.NE cs.AI

    Worm-level Control through Search-based Reinforcement Learning

    Authors: Mathias Lechner, Radu Grosu, Ramin M. Hasani

    Abstract: Through natural evolution, nervous systems of organisms formed near-optimal structures to express behavior. Here, we propose an effective way to create control agents, by \textit{re-purposing} the function of biological neural circuit models, to govern similar real world applications. We model the tap-withdrawal (TW) neural circuit of the nematode, \textit{C. elegans}, a circuit responsible for th… ▽ More

    Submitted 9 November, 2017; originally announced November 2017.

  40. arXiv:1711.01436  [pdf, other

    q-bio.QM cs.AI cs.NE

    Searching for Biophysically Realistic Parameters for Dynamic Neuron Models by Genetic Algorithms from Calcium Imaging Recording

    Authors: Magdalena Fuchs, Manuel Zimmer, Radu Grosu, Ramin M. Hasani

    Abstract: Individual Neurons in the nervous systems exploit various dynamics. To capture these dynamics for single neurons, we tune the parameters of an electrophysiological model of nerve cells, to fit experimental data obtained by calcium imaging. A search for the biophysical parameters of this model is performed by means of a genetic algorithm, where the model neuron is exposed to a predefined input curr… ▽ More

    Submitted 4 November, 2017; originally announced November 2017.

  41. arXiv:1703.06272  [pdf, other

    cs.LG cs.NE stat.ML

    An Automated Auto-encoder Correlation-based Health-Monitoring and Prognostic Method for Machine Bearings

    Authors: Ramin M. Hasani, Guodong Wang, Radu Grosu

    Abstract: This paper studies an intelligent ultimate technique for health-monitoring and prognostic of common rotary machine components, particularly bearings. During a run-to-failure experiment, rich unsupervised features from vibration sensory data are extracted by a trained sparse auto-encoder. Then, the correlation of the extracted attributes of the initial samples (presumably healthy at the beginning o… ▽ More

    Submitted 18 March, 2017; originally announced March 2017.

  42. arXiv:1703.06270  [pdf, other

    q-bio.NC cs.NE q-bio.QM stat.ML

    SIM-CE: An Advanced Simulink Platform for Studying the Brain of Caenorhabditis elegans

    Authors: Ramin M. Hasani, Victoria Beneder, Magdalena Fuchs, David Lung, Radu Grosu

    Abstract: We introduce SIM-CE, an advanced, user-friendly modeling and simulation environment in Simulink for performing multi-scale behavioral analysis of the nervous system of Caenorhabditis elegans (C. elegans). SIM-CE contains an implementation of the mathematical models of C. elegans's neurons and synapses, in Simulink, which can be easily extended and particularized by the user. The Simulink model is… ▽ More

    Submitted 25 March, 2017; v1 submitted 18 March, 2017; originally announced March 2017.

  43. arXiv:1703.06264  [pdf, other

    q-bio.NC cs.NE q-bio.QM

    Non-Associative Learning Representation in the Nervous System of the Nematode Caenorhabditis elegans

    Authors: Ramin M. Hasani, Magdalena Fuchs, Victoria Beneder, Radu Grosu

    Abstract: Caenorhabditis elegans (C. elegans) illustrated remarkable behavioral plasticities including complex non-associative and associative learning representations. Understanding the principles of such mechanisms presumably leads to constructive inspirations for the design of efficient learning algorithms. In the present study, we postulate a novel approach on modeling single neurons and synapses to stu… ▽ More

    Submitted 25 March, 2017; v1 submitted 18 March, 2017; originally announced March 2017.

  44. arXiv:1702.07426  [pdf, other

    cs.NE q-bio.NC q-bio.QM

    Control of the Correlation of Spontaneous Neuron Activity in Biological and Noise-activated CMOS Artificial Neural Microcircuits

    Authors: Ramin M. Hasani, Giorgio Ferrari, Hideaki Yamamoto, Sho Kono, Koji Ishihara, Soya Fujimori, Takashi Tanii, Enrico Prati

    Abstract: There are several indications that brain is organized not on a basis of individual unreliable neurons, but on a micro-circuital scale providing Lego blocks employed to create complex architectures. At such an intermediate scale, the firing activity in the microcircuits is governed by collective effects emerging by the background noise soliciting spontaneous firing, the degree of mutual connections… ▽ More

    Submitted 23 February, 2017; originally announced February 2017.