Skip to main content

Showing 1–40 of 40 results for author: Rajeswaran, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.04798  [pdf, other

    cs.RO cs.AI

    From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control

    Authors: Yide Shentu, Philipp Wu, Aravind Rajeswaran, Pieter Abbeel

    Abstract: Hierarchical control for robotics has long been plagued by the need to have a well defined interface layer to communicate between high-level task planners and low-level policies. With the advent of LLMs, language has been emerging as a prospective interface layer. However, this has several limitations. Not all tasks can be decomposed into steps that are easily expressible in natural language (e.g.… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  2. arXiv:2310.06828  [pdf, other

    cs.RO

    RoboHive: A Unified Framework for Robot Learning

    Authors: Vikash Kumar, Rutav Shah, Gaoyue Zhou, Vincent Moens, Vittorio Caggiano, Jay Vakil, Abhishek Gupta, Aravind Rajeswaran

    Abstract: We present RoboHive, a comprehensive software platform and ecosystem for research in the field of Robot Learning and Embodied Artificial Intelligence. Our platform encompasses a diverse range of pre-existing and novel environments, including dexterous manipulation with the Shadow Hand, whole-arm manipulation tasks with Franka and Fetch robots, quadruped locomotion, among others. Included environme… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted at 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks

  3. arXiv:2310.02219  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?

    Authors: Sneha Silwal, Karmesh Yadav, Tingfan Wu, Jay Vakil, Arjun Majumdar, Sergio Arnaud, Claire Chen, Vincent-Pierre Berges, Dhruv Batra, Aravind Rajeswaran, Mrinal Kalakrishnan, Franziska Meier, Oleksandr Maksymets

    Abstract: We present a large empirical investigation on the use of pre-trained visual representations (PVRs) for training downstream policies that execute real-world tasks. Our study spans five different PVRs, two different policy-learning paradigms (imitation and reinforcement learning), and three different robots for 5 distinct manipulation and indoor navigation tasks. From this effort, we can arrive at t… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Project website https://pvrs-sim2real.github.io/

    MSC Class: 68T45 (Primary) 68T40; 68T05(Secondary) ACM Class: I.2.9; I.2.6; I.4.8; I.5.4

  4. arXiv:2309.14236  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation

    Authors: Patrick Lancaster, Nicklas Hansen, Aravind Rajeswaran, Vikash Kumar

    Abstract: Robotic systems that aspire to operate in uninstrumented real-world environments must perceive the world directly via onboard sensing. Vision-based learning systems aim to eliminate the need for environment instrumentation by building an implicit understanding of the world based on raw pixels, but navigating the contact-rich high-dimensional search space from solely sparse visual reward signals si… ▽ More

    Submitted 12 May, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: 10 pages, 8 figures

  5. arXiv:2306.00942  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Train Offline, Test Online: A Real Robot Learning Benchmark

    Authors: Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta

    Abstract: Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data. We take on these challenges via a new benchmark: Train Offline, Test Online (TOTO). TOTO provides remote users with access to shared robotic hardware for evaluating methods… ▽ More

    Submitted 30 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICRA 2023

  6. arXiv:2305.02968  [pdf, other

    cs.LG cs.AI

    Masked Trajectory Models for Prediction, Representation, and Control

    Authors: Philipp Wu, Arjun Majumdar, Kevin Stone, Yixin Lin, Igor Mordatch, Pieter Abbeel, Aravind Rajeswaran

    Abstract: We introduce Masked Trajectory Models (MTM) as a generic abstraction for sequential decision making. MTM takes a trajectory, such as a state-action sequence, and aims to reconstruct the trajectory conditioned on random subsets of the same trajectory. By training with a highly randomized masking pattern, MTM learns versatile networks that can take on different roles or capabilities, by simply choos… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted for publication at ICML 2023. Project webpage: https://wuphilipp.github.io/mtm/

  7. arXiv:2303.18240  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?

    Authors: Arjun Majumdar, Karmesh Yadav, Sergio Arnaud, Yecheng Jason Ma, Claire Chen, Sneha Silwal, Aryan Jain, Vincent-Pierre Berges, Pieter Abbeel, Jitendra Malik, Dhruv Batra, Yixin Lin, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier

    Abstract: We present the largest and most comprehensive empirical study of pre-trained visual representations (PVRs) or visual 'foundation models' for Embodied AI. First, we curate CortexBench, consisting of 17 different tasks spanning locomotion, navigation, dexterous, and mobile manipulation. Next, we systematically evaluate existing PVRs and find that none are universally dominant. To study the effect of… ▽ More

    Submitted 1 February, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Project website: https://eai-vc.github.io

  8. arXiv:2212.05749  [pdf, other

    cs.LG cs.CV cs.RO

    On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline

    Authors: Nicklas Hansen, Zhecheng Yuan, Yanjie Ze, Tongzhou Mu, Aravind Rajeswaran, Hao Su, Huazhe Xu, Xiaolong Wang

    Abstract: In this paper, we examine the effectiveness of pre-training for visuo-motor control tasks. We revisit a simple Learning-from-Scratch (LfS) baseline that incorporates data augmentation and a shallow ConvNet, and find that this baseline is surprisingly competitive with recent approaches (PVR, MVP, R3M) that leverage frozen visual representations trained on large-scale vision datasets -- across a var… ▽ More

    Submitted 15 June, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: Code: https://github.com/gemcollector/learning-from-scratch

  9. arXiv:2212.05711  [pdf, other

    cs.RO cs.AI cs.LG

    CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning

    Authors: Zhao Mandi, Homanga Bharadhwaj, Vincent Moens, Shuran Song, Aravind Rajeswaran, Vikash Kumar

    Abstract: Large-scale training have propelled significant progress in various sub-fields of AI such as computer vision and natural language processing. However, building robot learning systems at a comparable scale remains challenging. To develop robots that can perform a wide range of skills and adapt to new scenarios, efficient methods for collecting vast and diverse amounts of data on physical robot syst… ▽ More

    Submitted 16 February, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

  10. arXiv:2212.05698  [pdf, other

    cs.LG cs.AI cs.RO

    MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations

    Authors: Nicklas Hansen, Yixin Lin, Hao Su, Xiaolong Wang, Vikash Kumar, Aravind Rajeswaran

    Abstract: Poor sample efficiency continues to be the primary challenge for deployment of deep Reinforcement Learning (RL) algorithms for real-world applications, and in particular for visuo-motor control. Model-based RL has the potential to be highly sample efficient by concurrently learning a world model and using synthetic rollouts for planning and policy improvement. However, in practice, sample-efficien… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  11. arXiv:2210.06479  [pdf, other

    cs.RO cs.LG

    Real World Offline Reinforcement Learning with Realistic Data Source

    Authors: Gaoyue Zhou, Liyiming Ke, Siddhartha Srinivasa, Abhinav Gupta, Aravind Rajeswaran, Vikash Kumar

    Abstract: Offline reinforcement learning (ORL) holds great promise for robot learning due to its ability to learn from arbitrary pre-generated experience. However, current ORL benchmarks are almost entirely in simulation and utilize contrived datasets like replay buffers of online RL agents or sub-optimal trajectories, and thus hold limited relevance for real-world robotics. In this work (Real-ORL), we posi… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Project website: https://sites.google.com/view/real-orl

  12. arXiv:2204.11134  [pdf, other

    cs.RO cs.AI

    Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?

    Authors: Yuchen Cui, Scott Niekum, Abhinav Gupta, Vikash Kumar, Aravind Rajeswaran

    Abstract: Task specification is at the core of programming autonomous robots. A low-effort modality for task specification is critical for engagement of non-expert end-users and ultimate adoption of personalized robot agents. A widely studied approach to task specification is through goals, using either compact state vectors or goal images from the same robot scene. The former is hard to interpret for non-e… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: 30 pages with appendix, published as a conference paper at L4DC 2022

  13. arXiv:2203.12601  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    R3M: A Universal Visual Representation for Robot Manipulation

    Authors: Suraj Nair, Aravind Rajeswaran, Vikash Kumar, Chelsea Finn, Abhinav Gupta

    Abstract: We study how visual representations pre-trained on diverse human video data can enable data-efficient learning of downstream robotic manipulation tasks. Concretely, we pre-train a visual representation using the Ego4D human video dataset using a combination of time-contrastive learning, video-language alignment, and an L1 penalty to encourage sparse and compact representations. The resulting repre… ▽ More

    Submitted 18 November, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Conference on Robot Learning (CoRL) 2022

  14. arXiv:2203.05960  [pdf, other

    cs.LG cs.AI cs.RO

    Policy Architectures for Compositional Generalization in Control

    Authors: Allan Zhou, Vikash Kumar, Chelsea Finn, Aravind Rajeswaran

    Abstract: Many tasks in control, robotics, and planning can be specified using desired goal configurations for various entities in the environment. Learning goal-conditioned policies is a natural paradigm to solve such tasks. However, current approaches struggle to learn and generalize as task complexity increases, such as variations in number of environment entities or compositions of goals. In this work,… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

  15. arXiv:2203.03580  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    The Unsurprising Effectiveness of Pre-Trained Vision Models for Control

    Authors: Simone Parisi, Aravind Rajeswaran, Senthil Purushwalkam, Abhinav Gupta

    Abstract: Recent years have seen the emergence of pre-trained representations as a powerful abstraction for AI applications in computer vision, natural language, and speech. However, policy learning for control is still dominated by a tabula-rasa learning paradigm, with visuo-motor policies often trained from scratch using data from deployment environments. In this context, we revisit and study the role of… ▽ More

    Submitted 8 August, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: First two authors contributed equally

    Journal ref: International Conference on Machine Learning (ICML), 2022, 162:17359-17371

  16. arXiv:2202.00161  [pdf, other

    cs.LG cs.AI

    CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

    Authors: Michael Laskin, Hao Liu, Xue Bin Peng, Denis Yarats, Aravind Rajeswaran, Pieter Abbeel

    Abstract: We introduce Contrastive Intrinsic Control (CIC), an algorithm for unsupervised skill discovery that maximizes the mutual information between state-transitions and latent skill vectors. CIC utilizes contrastive learning between state-transitions and skills to learn behavior embeddings and maximizes the entropy of these embeddings as an intrinsic reward to encourage behavioral diversity. We evaluat… ▽ More

    Submitted 29 March, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Project website: https://sites.google.com/view/cicrl/

  17. arXiv:2107.08829  [pdf, other

    cs.LG cs.AI cs.RO

    Visual Adversarial Imitation Learning using Variational Models

    Authors: Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn

    Abstract: Reward function specification, which requires considerable human effort and iteration, remains a major impediment for learning behaviors through deep reinforcement learning. In contrast, providing visual demonstrations of desired behaviors often presents an easier and more natural way to teach agents. We consider a setting where an agent is provided a fixed dataset of visual demonstrations illustr… ▽ More

    Submitted 27 June, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

  18. arXiv:2106.09119  [pdf, other

    cs.LG

    Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL

    Authors: Catherine Cang, Aravind Rajeswaran, Pieter Abbeel, Michael Laskin

    Abstract: Offline Reinforcement Learning (RL) aims to extract near-optimal policies from imperfect offline data without additional environment interactions. Extracting policies from diverse offline datasets has the potential to expand the range of applicability of RL by making the training process safer, faster, and more streamlined. We investigate how to improve the performance of offline RL algorithms, it… ▽ More

    Submitted 18 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

  19. arXiv:2106.01345  [pdf, other

    cs.LG cs.AI

    Decision Transformer: Reinforcement Learning via Sequence Modeling

    Authors: Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch

    Abstract: We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence modeling problem. This allows us to draw upon the simplicity and scalability of the Transformer architecture, and associated advances in language modeling such as GPT-x and BERT. In particular, we present Decision Transformer, an architecture that casts the problem of RL as conditional sequence modeling. Unlike prior… ▽ More

    Submitted 24 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: First two authors contributed equally. Last two authors advised equally

  20. arXiv:2102.08363  [pdf, other

    cs.LG cs.AI cs.RO

    COMBO: Conservative Offline Model-Based Policy Optimization

    Authors: Tianhe Yu, Aviral Kumar, Rafael Rafailov, Aravind Rajeswaran, Sergey Levine, Chelsea Finn

    Abstract: Model-based algorithms, which learn a dynamics model from logged experience and perform some sort of pessimistic planning under the learned model, have emerged as a promising paradigm for offline reinforcement learning (offline RL). However, practical variants of such model-based algorithms rely on explicit uncertainty quantification for incorporating pessimism. Uncertainty estimation with complex… ▽ More

    Submitted 26 January, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021

  21. arXiv:2101.01857  [pdf, other

    cs.LG

    Reinforcement Learning with Latent Flow

    Authors: Wenling Shang, Xiaofei Wang, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin

    Abstract: Temporal information is essential to learning effective policies with Reinforcement Learning (RL). However, current state-of-the-art RL algorithms either assume that such information is given as part of the state space or, when learning from pixels, use the simple heuristic of frame-stacking to implicitly capture temporal information present in the image observations. This heuristic is in contrast… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

  22. arXiv:2012.11547  [pdf, other

    cs.LG cs.AI cs.RO

    Offline Reinforcement Learning from Images with Latent Space Models

    Authors: Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn

    Abstract: Offline reinforcement learning (RL) refers to the problem of learning policies from a static dataset of environment interactions. Offline RL enables extensive use and re-use of historical datasets, while also alleviating safety concerns associated with online exploration, thereby expanding the real-world applicability of RL. Most prior work in offline RL has focused on tasks with compact state rep… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

  23. arXiv:2005.05951  [pdf, other

    cs.LG cs.AI stat.ML

    MOReL : Model-Based Offline Reinforcement Learning

    Authors: Rahul Kidambi, Aravind Rajeswaran, Praneeth Netrapalli, Thorsten Joachims

    Abstract: In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based solely on a dataset of historical interactions with the environment. The ability to train RL policies offline can greatly expand the applicability of RL, its data efficiency, and its experimental velocity. Prior work in offline RL has been confined almost exclusively to model-free RL approaches. In this wo… ▽ More

    Submitted 1 March, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: First two authors contributed equally. Published at NeurIPS 2020. After publication at NeurIPS 2020, (1) D4RL benchmark results have been added; (2) hyper-parameter ablation studies have been added; (3) scope of Lemma 3 has been extended

  24. arXiv:2004.07804  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    A Game Theoretic Framework for Model Based Reinforcement Learning

    Authors: Aravind Rajeswaran, Igor Mordatch, Vikash Kumar

    Abstract: Model-based reinforcement learning (MBRL) has recently gained immense interest due to its potential for sample efficiency and ability to incorporate off-policy data. However, designing stable and efficient MBRL algorithms using rich function approximators have remained challenging. To help expose the practical challenges in MBRL and simplify algorithm design from the lens of abstraction, we develo… ▽ More

    Submitted 11 March, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: ICML 2020. This version contains expanded discussion, hyperparameter configurations, and ablation studies

  25. arXiv:2001.07343  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Lyceum: An efficient and scalable ecosystem for robot learning

    Authors: Colin Summers, Kendall Lowrey, Aravind Rajeswaran, Siddhartha Srinivasa, Emanuel Todorov

    Abstract: We introduce Lyceum, a high-performance computational ecosystem for robot learning. Lyceum is built on top of the Julia programming language and the MuJoCo physics simulator, combining the ease-of-use of a high-level programming language with the performance of native C. In addition, Lyceum has a straightforward API to support parallel computation across multiple cores and machines. Overall, depen… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

  26. arXiv:1909.04630  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Meta-Learning with Implicit Gradients

    Authors: Aravind Rajeswaran, Chelsea Finn, Sham Kakade, Sergey Levine

    Abstract: A core capability of intelligent systems is the ability to quickly learn new tasks by drawing on prior experience. Gradient (or optimization) based meta-learning has recently emerged as an effective approach for few-shot learning. In this formulation, meta-parameters are learned in the outer loop, while task-specific models are learned in the inner-loop, by using only a small amount of data from t… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: NeurIPS 2019. First two authors contributed equally

  27. arXiv:1902.08438  [pdf, other

    cs.LG cs.AI stat.ML

    Online Meta-Learning

    Authors: Chelsea Finn, Aravind Rajeswaran, Sham Kakade, Sergey Levine

    Abstract: A central capability of intelligent systems is the ability to continuously build upon previous experiences to speed up and enhance learning of new tasks. Two distinct research paradigms have studied this question. Meta-learning views this problem as learning a prior over model parameters that is amenable for fast adaptation on a new task, but typically assumes the set of tasks are available togeth… ▽ More

    Submitted 3 July, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: ICML 2019. The first two authors contributed equally. Expanded Appendix

  28. arXiv:1811.01848  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control

    Authors: Kendall Lowrey, Aravind Rajeswaran, Sham Kakade, Emanuel Todorov, Igor Mordatch

    Abstract: We propose a plan online and learn offline (POLO) framework for the setting where an agent, with an internal model, needs to continually act and learn in the world. Our work builds on the synergistic relationship between local model-based control, global value function learning, and exploration. We study how local trajectory optimization can cope with approximation errors in the value function, an… ▽ More

    Submitted 28 January, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The first two authors contributed equally. Accepted at ICLR 2019. Supplementary videos available at: https://sites.google.com/view/polo-mpc

  29. arXiv:1810.06045  [pdf, other

    cs.AI cs.RO

    Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost

    Authors: Henry Zhu, Abhishek Gupta, Aravind Rajeswaran, Sergey Levine, Vikash Kumar

    Abstract: Dexterous multi-fingered robotic hands can perform a wide range of manipulation skills, making them an appealing component for general-purpose robotic manipulators. However, such hands pose a major challenge for autonomous control, due to the high dimensionality of their configuration space and complex intermittent contact interactions. In this work, we propose deep reinforcement learning (deep RL… ▽ More

    Submitted 14 October, 2018; originally announced October 2018.

    Comments: https://sites.google.com/view/deeprl-handmanipulation

  30. arXiv:1803.10371  [pdf, other

    cs.RO cs.LG eess.SY

    Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system

    Authors: Kendall Lowrey, Svetoslav Kolev, Jeremy Dao, Aravind Rajeswaran, Emanuel Todorov

    Abstract: Reinforcement learning has emerged as a promising methodology for training robot controllers. However, most results have been limited to simulation due to the need for a large number of samples and the lack of automated-yet-safe data collection methods. Model-based reinforcement learning methods provide an avenue to circumvent these challenges, but the traditional concern has been the mismatch bet… ▽ More

    Submitted 27 March, 2018; originally announced March 2018.

    Comments: Accepted at IEEE SIMPAR 2018. Project page: https://sites.google.com/view/phantomsim2real

  31. arXiv:1803.07246  [pdf, other

    cs.LG cs.AI stat.ML

    Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

    Authors: Cathy Wu, Aravind Rajeswaran, Yan Duan, Vikash Kumar, Alexandre M Bayen, Sham Kakade, Igor Mordatch, Pieter Abbeel

    Abstract: Policy gradient methods have enjoyed great success in deep reinforcement learning but suffer from high variance of gradient estimates. The high variance problem is particularly exasperated in problems with long horizons or high-dimensional action spaces. To mitigate this issue, we derive a bias-free action-dependent baseline for variance reduction which fully exploits the structural form of the st… ▽ More

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: Accepted to ICLR 2018, Oral (2%)

  32. arXiv:1711.09874  [pdf, other

    cs.LG cs.RO

    Divide-and-Conquer Reinforcement Learning

    Authors: Dibya Ghosh, Avi Singh, Aravind Rajeswaran, Vikash Kumar, Sergey Levine

    Abstract: Standard model-free deep reinforcement learning (RL) algorithms sample a new initial state for each trial, allowing them to optimize policies that can perform well even in highly stochastic environments. However, problems that exhibit considerable initial state variation typically produce high-variance gradient estimates for model-free RL, making direct policy or value function optimization challe… ▽ More

    Submitted 27 April, 2018; v1 submitted 27 November, 2017; originally announced November 2017.

    Comments: Presented at ICLR 2018. Videos and supporting materials are located at http://bit.ly/dnc-rl

  33. arXiv:1709.10087  [pdf, other

    cs.LG cs.AI cs.RO

    Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations

    Authors: Aravind Rajeswaran, Vikash Kumar, Abhishek Gupta, Giulia Vezzani, John Schulman, Emanuel Todorov, Sergey Levine

    Abstract: Dexterous multi-fingered hands are extremely versatile and provide a generic way to perform a multitude of tasks in human-centric environments. However, effectively controlling them remains challenging due to their high dimensionality and large number of potential contacts. Deep reinforcement learning (DRL) provides a model-agnostic approach to control complex dynamical systems, but has not been s… ▽ More

    Submitted 26 June, 2018; v1 submitted 28 September, 2017; originally announced September 2017.

    Comments: Accepted for presentation at Robotics: Science and Systems (RSS) 2018. Project page: https://sites.google.com/view/deeprl-dexterous-manipulation

  34. arXiv:1703.02660  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Towards Generalization and Simplicity in Continuous Control

    Authors: Aravind Rajeswaran, Kendall Lowrey, Emanuel Todorov, Sham Kakade

    Abstract: This work shows that policies with simple linear and RBF parameterizations can be trained to solve a variety of continuous control tasks, including the OpenAI gym benchmarks. The performance of these trained policies are competitive with state of the art results, obtained with more elaborate parameterizations such as fully connected neural networks. Furthermore, existing training and testing scena… ▽ More

    Submitted 20 March, 2018; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: NIPS 2017, Project page: https://sites.google.com/view/simple-pol

  35. arXiv:1610.01283  [pdf, other

    cs.LG cs.AI cs.RO

    EPOpt: Learning Robust Neural Network Policies Using Model Ensembles

    Authors: Aravind Rajeswaran, Sarvjeet Ghotra, Balaraman Ravindran, Sergey Levine

    Abstract: Sample complexity and safety are major challenges when learning policies with reinforcement learning for real-world tasks, especially when the policies are represented using rich function approximators like deep neural networks. Model-based methods where the real-world target domain is approximated using a simulated source domain provide an avenue to tackle the above challenges by augmenting real… ▽ More

    Submitted 3 March, 2017; v1 submitted 5 October, 2016; originally announced October 2016.

    Comments: Accepted for publication at the International Conference on Learning Representations (ICLR) 2017. Supplementary video: https://youtu.be/w1YJ9vwaoto

  36. arXiv:1609.02678  [pdf, ps, other

    eess.SY cs.LG

    Identifying Topology of Power Distribution Networks Based on Smart Meter Data

    Authors: Jayadev P Satya, Nirav Bhatt, Ramkrishna Pasumarthy, Aravind Rajeswaran

    Abstract: In a power distribution network, the network topology information is essential for an efficient operation of the network. This information of network connectivity is not accurately available, at the low voltage level, due to uninformed changes that happen from time to time. In this paper, we propose a novel data--driven approach to identify the underlying network topology including the load phase… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

    Comments: Submitted to IEEE transaction on smart grid

  37. arXiv:1606.01754  [pdf, other

    cs.DS cs.DM cs.SI math.OC

    A Graph Partitioning Algorithm for Leak Detection in Water Distribution Networks

    Authors: Aravind Rajeswaran, Sridharakumar Narasimhan, Shankar Narasimhan

    Abstract: Leak detection in urban water distribution networks (WDNs) is challenging given their scale, complexity, and limited instrumentation. We present an algorithm for leak detection in WDNs, which involves making additional flow measurements on-demand, and repeated use of water balance. Graph partitioning is used to determine the location of flow measurements, with the objective to minimize the measure… ▽ More

    Submitted 3 June, 2016; originally announced June 2016.

  38. arXiv:1511.06063  [pdf, ps, other

    cs.LG stat.AP stat.ML

    A Novel Approach for Phase Identification in Smart Grids Using Graph Theory and Principal Component Analysis

    Authors: P Satya Jayadev, Aravind Rajeswaran, Nirav P Bhatt, Ramkrishna Pasumarthy

    Abstract: Consumers with low demand, like households, are generally supplied single-phase power by connecting their service mains to one of the phases of a distribution transformer. The distribution companies face the problem of kee** a record of consumer connectivity to a phase due to uninformed changes that happen. The exact phase connectivity information is important for the efficient operation and con… ▽ More

    Submitted 7 June, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Accepted for the presentation at ACC 16

  39. arXiv:1506.00438  [pdf, other

    cs.LG cs.DM eess.SY stat.ME

    Network Topology Identification using PCA and its Graph Theoretic Interpretations

    Authors: Aravind Rajeswaran, Shankar Narasimhan

    Abstract: We solve the problem of identifying (reconstructing) network topology from steady state network measurements. Concretely, given only a data matrix $\mathbf{X}$ where the $X_{ij}$ entry corresponds to flow in edge $i$ in configuration (steady-state) $j$, we wish to find a network structure for which flow conservation is obeyed at all the nodes. This models many network problems involving conserved… ▽ More

    Submitted 21 January, 2016; v1 submitted 1 June, 2015; originally announced June 2015.

    Comments: Structure of paper is changed to improve presentation. Methods and results are unchanged. A more detailed literature survey has been added

  40. arXiv:cs/0508095  [pdf, ps, other

    cs.IT cs.NI

    Capacity of Ultra Wide Band Wireless Ad Hoc Networks

    Authors: Rohit Negi, Arjunan Rajeswaran

    Abstract: Throughput capacity is a critical parameter for the design and evaluation of ad-hoc wireless networks. Consider n identical randomly located nodes, on a unit area, forming an ad-hoc wireless network. Assuming a fixed per node transmission capability of T bits per second at a fixed range, it has been shown that the uniform throughput capacity per node r(n) is Theta((T)/(sqrt{n log n})), a decreas… ▽ More

    Submitted 22 August, 2005; originally announced August 2005.

    Comments: 12 pages,2 figures