Showing 1–8 of 8 results for author: Shaj, V

  1. arXiv:2404.16078  [pdf, other]

    cs.AI cs.LG

    Learning World Models With Hierarchical Temporal Abstractions: A Probabilistic Perspective

    Authors: Vaisakh Shaj

    Abstract: Machines that can replicate human intelligence with type 2 reasoning capabilities should be able to reason at multiple levels of spatio-temporal abstractions and scales using internal world models. Devising formalisms to develop such internal world models, which accurately reflect the causal hierarchies inherent in the dynamics of the real world, is a critical research challenge in the domains of…

    Submitted 26 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Doctoral Dissertation Preprint, Department of Computer Science, Karlsruhe Institute Of Technology, 2024

  2. arXiv:2310.18534  [pdf, other]

    cs.LG cs.AI

    Multi Time Scale World Models

    Authors: Vaisakh Shaj, Saleh Gholam Zadeh, Ozan Demir, Luiz Ricardo Douat, Gerhard Neumann

    Abstract: Intelligent agents use internal world models to reason and make predictions about different courses of their actions at many scales. Devising learning paradigms and architectures that allow machines to learn world models that operate at multiple levels of temporal abstractions while dealing with complex uncertainty predictions is a major technical hurdle. In this work, we propose a probabilistic f…

    Submitted 4 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted as spotlight at NeurIPS 2023

  3. arXiv:2206.14697  [pdf, other]

    cs.LG eess.SY

    Hidden Parameter Recurrent State Space Models For Changing Dynamics Scenarios

    Authors: Vaisakh Shaj, Dieter Buchler, Rohit Sonker, Philipp Becker, Gerhard Neumann

    Abstract: Recurrent State-space models (RSSMs) are highly expressive models for learning patterns in time series data and system identification. However, these models assume that the dynamics are fixed and unchanging, which is rarely the case in real-world scenarios. Many control applications often exhibit tasks with similar but not identical dynamics which can be modeled as a latent variable. We introduce…

    Submitted 12 October, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Published at the International Conference on Learning Representations, ICLR 2022

  4. arXiv:2205.13804  [pdf, other]

    cs.RO cs.LG

    End-to-End Learning of Hybrid Inverse Dynamics Models for Precise and Compliant Impedance Control

    Authors: Moritz Reuss, Niels van Duijkeren, Robert Krug, Philipp Becker, Vaisakh Shaj, Gerhard Neumann

    Abstract: It is well-known that inverse dynamics models can improve tracking performance in robot control. These models need to precisely capture the robot dynamics, which consist of well-understood components, e.g., rigid body dynamics, and effects that remain challenging to capture, e.g., stick-slip friction and mechanical flexibilities. Such effects exhibit hysteresis and partial observability, rendering…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at Robotics: Science and Systems XVIII (RSS), 2022. Paper length is 13 pages (i.e., 9 pages of technical content, 1 page of Bibliography/References, and 3 pages of Appendix)

  5. arXiv:2010.10201  [pdf, other]

    cs.RO cs.LG

    Action-Conditional Recurrent Kalman Networks For Forward and Inverse Dynamics Learning

    Authors: Vaisakh Shaj, Philipp Becker, Dieter Buchler, Harit Pandya, Niels van Duijkeren, C. James Taylor, Marc Hanheide, Gerhard Neumann

    Abstract: Estimating accurate forward and inverse dynamics models is a crucial component of model-based control for sophisticated robots such as robots driven by hydraulics, artificial muscles, or robots dealing with different contact situations. Analytic models of such processes are often unavailable or inaccurate due to complex hysteresis effects, unmodelled friction and stiction phenomena, and unknown eff…

    Submitted 5 November, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Accepted to Conference on Robot Learning (CoRL), 2020

  6. arXiv:2004.12771  [pdf, other]

    cs.CV

    Adversarial Fooling Beyond "Flipping the Label"

    Authors: Konda Reddy Mopuri, Vaisakh Shaj, R. Venkatesh Babu

    Abstract: Recent advancements in CNNs have shown remarkable achievements in various CV/AI applications. Though CNNs show near human or better than human performance in many critical tasks, they are quite vulnerable to adversarial attacks. These attacks are potentially dangerous in real-life deployments. Though there have been many adversarial attacks proposed in recent years, there is no proper way of quant…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: CVPR-AMLCV-2020

  7. arXiv:1905.08114  [pdf, other]

    cs.LG cs.CV stat.ML

    Zero-Shot Knowledge Distillation in Deep Networks

    Authors: Gaurav Kumar Nayak, Konda Reddy Mopuri, Vaisakh Shaj, R. Venkatesh Babu, Anirban Chakraborty

    Abstract: Knowledge distillation deals with the problem of training a smaller model (Student) from a high capacity source model (Teacher) so as to retain most of its performance. Existing approaches use either the training data or meta-data extracted from it in order to train the Student. However, accessing the dataset on which the Teacher has been trained may not always be feasible if the dataset is very l…

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: Accepted in ICML 2019, codes will be available at https://github.com/vcl-iisc/ZSKD

  8. arXiv:1712.00640  [pdf, other]

    cs.LG

    Learning Sparse Adversarial Dictionaries For Multi-Class Audio Classification

    Authors: Vaisakh Shaj, Puranjoy Bhattacharya

    Abstract: Audio events are quite often overlapping in nature, and more prone to noise than visual signals. There has been increasing evidence for the superior performance of representations learned using sparse dictionaries for applications like audio denoising and speech enhancement. This paper concentrates on modifying the traditional reconstructive dictionary learning algorithms, by incorporating a discr…

    Submitted 2 December, 2017; originally announced December 2017.

    Comments: Accepted in Asian Conference of Pattern Recognition (ACPR-2017)