Skip to main content

Showing 1–13 of 13 results for author: Santara, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.19111  [pdf, other

    cs.RO cs.AI cs.LG

    GAN-MPC: Training Model Predictive Controllers with Parameterized Cost Functions using Demonstrations from Non-identical Experts

    Authors: Returaj Burnwal, Anirban Santara, Nirav P. Bhatt, Balaraman Ravindran, Gaurav Aggarwal

    Abstract: Model predictive control (MPC) is a popular approach for trajectory optimization in practical robotics applications. MPC policies can optimize trajectory parameters under kinodynamic and safety constraints and provide guarantees on safety, optimality, generalizability, interpretability, and explainability. However, some behaviors are complex and it is difficult to hand-craft an MPC objective funct… ▽ More

    Submitted 7 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Recipient of the best paper award at RBCDSAI-DAI 2023, IIT Madras (https://rbcdsai.iitm.ac.in/DAI-2023/)

  2. arXiv:2211.16309  [pdf, other

    cs.RO cs.LG stat.AP

    A Contextual Bandit Approach for Learning to Plan in Environments with Probabilistic Goal Configurations

    Authors: Sohan Rudra, Saksham Goel, Anirban Santara, Claudio Gentile, Laurent Perron, Fei Xia, Vikas Sindhwani, Carolina Parada, Gaurav Aggarwal

    Abstract: Object-goal navigation (Object-nav) entails searching, recognizing and navigating to a target object. Object-nav has been extensively studied by the Embodied-AI community, but most solutions are often restricted to considering static objects (e.g., television, fridge, etc.). We propose a modular framework for object-nav that is able to efficiently search indoor environments for not just static obj… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Shorter version accepted at NeurIPS 2022 Workshop on Robot Learning: Trustworthy Robotics

  3. arXiv:2106.03546  [pdf, other

    cs.LG cs.AI

    On Learning to Rank Long Sequences with Contextual Bandits

    Authors: Anirban Santara, Claudio Gentile, Gaurav Aggarwal, Shuai Li

    Abstract: Motivated by problems of learning to rank long item sequences, we introduce a variant of the cascading bandit model that considers flexible length sequences with varying rewards and losses. We formulate two generative models for this problem within the generalized linear setting, and design and analyze upper confidence algorithms for it. Our analysis delivers tight regret bounds which, when specia… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Report number: PMLR 151:767-797

    Journal ref: Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:767-797, 2022

  4. arXiv:2102.04353  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Unlocking Pixels for Reinforcement Learning via Implicit Attention

    Authors: Krzysztof Marcin Choromanski, Deepali Jain, Wenhao Yu, Xingyou Song, Jack Parker-Holder, Tingnan Zhang, Valerii Likhosherstov, Aldo Pacchiano, Anirban Santara, Yunhao Tang, Jie Tan, Adrian Weller

    Abstract: There has recently been significant interest in training reinforcement learning (RL) agents in vision-based environments. This poses many challenges, such as high dimensionality and the potential for observational overfitting through spurious correlations. A promising approach to solve both of these problems is an attention bottleneck, which provides a simple and effective framework for learning h… ▽ More

    Submitted 1 October, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

  5. arXiv:2010.00993  [pdf, other

    cs.RO cs.LG cs.MA

    MADRaS : Multi Agent Driving Simulator

    Authors: Anirban Santara, Sohan Rudra, Sree Aditya Buridi, Meha Kaushik, Abhishek Naik, Bharat Kaul, Balaraman Ravindran

    Abstract: In this work, we present MADRaS, an open-source multi-agent driving simulator for use in the design and evaluation of motion planning algorithms for autonomous driving. MADRaS provides a platform for constructing a wide variety of highway and track driving scenarios where multiple driving agents can train for motion planning tasks using reinforcement learning and other machine learning algorithms.… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

  6. arXiv:1906.11785  [pdf, other

    cs.LG stat.ML

    ExTra: Transfer-guided Exploration

    Authors: Anirban Santara, Rishabh Madan, Balaraman Ravindran, Pabitra Mitra

    Abstract: In this work we present a novel approach for transfer-guided exploration in reinforcement learning that is inspired by the human tendency to leverage experiences from similar encounters in the past while navigating a new task. Given an optimal policy in a related task-environment, we show that its bisimulation distance from the current task-environment gives a lower bound on the optimal advantage… ▽ More

    Submitted 27 May, 2020; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: Published as an extended abstract at AAMAS 2020

  7. arXiv:1904.04547  [pdf, other

    cs.IR cs.CV

    PUNCH: Positive UNlabelled Classification based information retrieval in Hyperspectral images

    Authors: Anirban Santara, Jayeeta Datta, Sourav Sarkar, Ankur Garg, Kirti Padia, Pabitra Mitra

    Abstract: Hyperspectral images of land-cover captured by airborne or satellite-mounted sensors provide a rich source of information about the chemical composition of the materials present in a given place. This makes hyperspectral imaging an important tool for earth sciences, land-cover studies, and military and strategic applications. However, the scarcity of labeled training examples and spatial variabili… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: 9 pages, under review at ACMMM-2019

  8. arXiv:1707.06658  [pdf, other

    cs.LG cs.AI

    RAIL: Risk-Averse Imitation Learning

    Authors: Anirban Santara, Abhishek Naik, Balaraman Ravindran, Dipankar Das, Dheevatsa Mudigere, Sasikanth Avancha, Bharat Kaul

    Abstract: Imitation learning algorithms learn viable policies by imitating an expert's behavior when reward signals are not available. Generative Adversarial Imitation Learning (GAIL) is a state-of-the-art algorithm for learning policies when the expert's behavior is available as a fixed set of trajectories. We evaluate in terms of the expert's cost function and observe that the distribution of trajectory-c… ▽ More

    Submitted 29 November, 2017; v1 submitted 20 July, 2017; originally announced July 2017.

    Comments: Accepted for presentation in Deep Reinforcement Learning Symposium at NIPS 2017

  9. BASS Net: Band-Adaptive Spectral-Spatial Feature Learning Neural Network for Hyperspectral Image Classification

    Authors: Anirban Santara, Kaustubh Mani, Pranoot Hatwar, Ankit Singh, Ankur Garg, Kirti Padia, Pabitra Mitra

    Abstract: Deep learning based landcover classification algorithms have recently been proposed in literature. In hyperspectral images (HSI) they face the challenges of large dimensionality, spatial variability of spectral signatures and scarcity of labeled data. In this article we propose an end-to-end deep learning architecture that extracts band specific spectral-spatial features and performs landcover cla… ▽ More

    Submitted 2 December, 2016; v1 submitted 1 December, 2016; originally announced December 2016.

    Comments: 8 pages, 10 figures, Submitted to IEEE TGRS, Code available at: https://github.com/kaustubh0mani/BASS-Net

  10. arXiv:1605.01101  [pdf, other

    cs.CV

    WEPSAM: Weakly Pre-Learnt Saliency Model

    Authors: Avisek Lahiri, Sourya Roy, Anirban Santara, Pabitra Mitra, Prabir Kumar Biswas

    Abstract: Visual saliency detection tries to mimic human vision psychology which concentrates on sparse, important areas in natural image. Saliency prediction research has been traditionally based on low level features such as contrast, edge, etc. Recent thrust in saliency prediction research is to learn high level semantics using ground truth eye fixation datasets. In this paper we present, WEPSAM : Weakly… ▽ More

    Submitted 3 May, 2016; originally announced May 2016.

  11. arXiv:1604.02646  [pdf, other

    cs.LG cs.CV cs.NE

    Visualization Regularizers for Neural Network based Image Recognition

    Authors: Biswajit Paria, Vikas Reddy, Anirban Santara, Pabitra Mitra

    Abstract: The success of deep neural networks is mostly due their ability to learn meaningful features from the data. Features learned in the hidden layers of deep neural networks trained in computer vision tasks have been shown to be similar to mid-level vision features. We leverage this fact in this work and propose the visualization regularizer for image tasks. The proposed regularization technique enfor… ▽ More

    Submitted 3 January, 2017; v1 submitted 10 April, 2016; originally announced April 2016.

  12. arXiv:1603.04833  [pdf, other

    cs.LG cs.CV stat.ML

    Ensemble of Deep Convolutional Neural Networks for Learning to Detect Retinal Vessels in Fundus Images

    Authors: Debapriya Maji, Anirban Santara, Pabitra Mitra, Debdoot Sheet

    Abstract: Vision impairment due to pathological damage of the retina can largely be prevented through periodic screening using fundus color imaging. However the challenge with large scale screening is the inability to exhaustively detect fine blood vessels crucial to disease diagnosis. In this work we present a computational imaging framework using deep and ensemble learning for reliable detection of blood… ▽ More

    Submitted 15 March, 2016; originally announced March 2016.

  13. arXiv:1603.02836  [pdf, other

    cs.LG

    Faster learning of deep stacked autoencoders on multi-core systems using synchronized layer-wise pre-training

    Authors: Anirban Santara, Debapriya Maji, DP Tejas, Pabitra Mitra, Arobinda Gupta

    Abstract: Deep neural networks are capable of modelling highly non-linear functions by capturing different levels of abstraction of data hierarchically. While training deep networks, first the system is initialized near a good optimum by greedy layer-wise unsupervised pre-training. However, with burgeoning data and increasing dimensions of the architecture, the time complexity of this approach becomes enorm… ▽ More

    Submitted 9 March, 2016; originally announced March 2016.