Skip to main content

Showing 1–50 of 51 results for author: Sindhwani, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19800  [pdf, other

    cs.LG cs.RO

    Modeling the Real World with High-Density Visual Particle Dynamics

    Authors: William F. Whitney, Jacob Varley, Deepali Jain, Krzysztof Choromanski, Sumeet Singh, Vikas Sindhwani

    Abstract: We present High-Density Visual Particle Dynamics (HD-VPD), a learned world model that can emulate the physical dynamics of real scenes by processing massive latent point clouds containing 100K+ particles. To enable efficiency at this scale, we introduce a novel family of Point Cloud Transformers (PCTs) called Interlacers leveraging intertwined linear-attention Performer layers and graph-based neig… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.17740  [pdf, other

    cs.LG cs.AI cs.CV

    Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning

    Authors: Arijit Sehanobish, Avinava Dubey, Krzysztof Choromanski, Somnath Basu Roy Chowdhury, Deepali Jain, Vikas Sindhwani, Snigdha Chaturvedi

    Abstract: Recent efforts to scale Transformer models have demonstrated rapid progress across a wide range of tasks (Wei et al., 2022). However, fine-tuning these models for downstream tasks is expensive due to their large parameter counts. Parameter-efficient fine-tuning (PEFT) approaches have emerged as a viable alternative by allowing us to fine-tune models by updating only a small number of parameters. I… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Work in progress

  3. arXiv:2404.03570  [pdf, other

    cs.RO

    Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity

    Authors: Jake Varley, Sumeet Singh, Deepali Jain, Krzysztof Choromanski, Andy Zeng, Somnath Basu Roy Chowdhury, Avinava Dubey, Vikas Sindhwani

    Abstract: We present an embodied AI system which receives open-ended natural language instructions from a human, and controls two arms to collaboratively accomplish potentially long-horizon tasks over a large workspace. Our system is modular: it deploys state of the art Large Language Models for task planning,Vision-Language models for semantic perception, and Point Cloud transformers for gras**. With sem… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  4. arXiv:2312.01990  [pdf, other

    cs.RO cs.AI

    SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention

    Authors: Isabel Leal, Krzysztof Choromanski, Deepali Jain, Avinava Dubey, Jake Varley, Michael Ryoo, Yao Lu, Frederick Liu, Vikas Sindhwani, Quan Vuong, Tamas Sarlos, Ken Oslund, Karol Hausman, Kanishka Rao

    Abstract: We present Self-Adaptive Robust Attention for Robotics Transformers (SARA-RT): a new paradigm for addressing the emerging challenge of scaling up Robotics Transformers (RT) for on-robot deployment. SARA-RT relies on the new method of fine-tuning proposed by us, called up-training. It converts pre-trained or already fine-tuned Transformer-based robotic policies of quadratic time complexity (includi… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  5. arXiv:2309.05803  [pdf, other

    cs.RO cs.LG

    Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models

    Authors: Sumeet Singh, Stephen Tu, Vikas Sindhwani

    Abstract: A crucial design decision for any robot learning pipeline is the choice of policy representation: what type of model should be used to generate the next set of robot actions? Owing to the inherent multi-modal nature of many robotic tasks, combined with the recent successes in generative modeling, researchers have turned to state-of-the-art probabilistic models such as diffusion models for policy r… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  6. Robotic Table Tennis: A Case Study into a High Speed Learning System

    Authors: David B. D'Ambrosio, Jonathan Abelian, Saminda Abeyruwan, Michael Ahn, Alex Bewley, Justin Boyd, Krzysztof Choromanski, Omar Cortes, Erwin Coumans, Tianli Ding, Wenbo Gao, Laura Graesser, Atil Iscen, Navdeep Jaitly, Deepali Jain, Juhana Kangaspunta, Satoshi Kataoka, Gus Kouretas, Yuheng Kuang, Nevena Lazic, Corey Lynch, Reza Mahjourian, Sherry Q. Moore, Thinh Nguyen, Ken Oslund , et al. (10 additional authors not shown)

    Abstract: We present a deep-dive into a real-world robotic learning system that, in previous work, was shown to be capable of hundreds of table tennis rallies with a human and has the ability to precisely return the ball to desired targets. This system puts together a highly optimized perception subsystem, a high-speed low-latency robot controller, a simulation paradigm that can prevent damage in the real w… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Published and presented at Robotics: Science and Systems (RSS2023)

  7. arXiv:2306.08205  [pdf, other

    cs.RO

    Agile Catching with Whole-Body MPC and Blackbox Policy Learning

    Authors: Saminda Abeyruwan, Alex Bewley, Nicholas M. Boffi, Krzysztof Choromanski, David D'Ambrosio, Deepali Jain, Pannag Sanketi, Anish Shankar, Vikas Sindhwani, Sumeet Singh, Jean-Jacques Slotine, Stephen Tu

    Abstract: We address a benchmark task in agile robotics: catching objects thrown at high-speed. This is a challenging task that involves tracking, intercepting, and cradling a thrown object with access only to visual observations of the object and the proprioceptive state of the robot, all within a fraction of a second. We present the relative merits of two fundamentally different solution strategies: (i) M… ▽ More

    Submitted 19 October, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: L4DC 2023

  8. arXiv:2305.14654  [pdf, other

    cs.RO cs.AI

    Barkour: Benchmarking Animal-level Agility with Quadruped Robots

    Authors: Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee , et al. (19 additional authors not shown)

    Abstract: Animals have evolved various agile locomotion strategies, such as sprinting, lea**, and jum**. There is a growing interest in develo** legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agili… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 19 figures

  9. arXiv:2305.12284  [pdf, other

    math.OC cs.LG eess.SY math.DS

    Safely Learning Dynamical Systems

    Authors: Amir Ali Ahmadi, Abraar Chaudhry, Vikas Sindhwani, Stephen Tu

    Abstract: A fundamental challenge in learning an unknown dynamical system is to reduce model uncertainty by making measurements while maintaining safety. We formulate a mathematical definition of what it means to safely learn a dynamical system by sequentially deciding where to initialize trajectories. The state of the system must stay within a safety region for a horizon of $T$ time steps under the action… ▽ More

    Submitted 8 June, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: 49 pages. arXiv admin note: text overlap with arXiv:2011.12257

  10. arXiv:2302.01128  [pdf, other

    cs.LG cs.AI

    Mnemosyne: Learning to Train Transformers with Transformers

    Authors: Deepali Jain, Krzysztof Marcin Choromanski, Avinava Dubey, Sumeet Singh, Vikas Sindhwani, Tingnan Zhang, Jie Tan

    Abstract: In this work, we propose a new class of learnable optimizers, called \textit{Mnemosyne}. It is based on the novel spatio-temporal low-rank implicit attention Transformers that can learn to train entire neural network architectures, including other Transformers, without any task-specific optimizer tuning. We show that Mnemosyne: (a) outperforms popular LSTM optimizers (also with new feature enginee… ▽ More

    Submitted 16 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  11. arXiv:2212.06764  [pdf, other

    cs.RO

    Single-Level Differentiable Contact Simulation

    Authors: Simon Le Cleac'h, Mac Schwager, Zachary Manchester, Vikas Sindhwani, Pete Florence, Sumeet Singh

    Abstract: We present a differentiable formulation of rigid-body contact dynamics for objects and robots represented as compositions of convex primitives. Existing optimization-based approaches simulating contact between convex primitives rely on a bilevel formulation that separates collision detection and contact simulation. These approaches are unreliable in realistic contact simulation scenarios because i… ▽ More

    Submitted 3 January, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  12. arXiv:2211.16309  [pdf, other

    cs.RO cs.LG stat.AP

    A Contextual Bandit Approach for Learning to Plan in Environments with Probabilistic Goal Configurations

    Authors: Sohan Rudra, Saksham Goel, Anirban Santara, Claudio Gentile, Laurent Perron, Fei Xia, Vikas Sindhwani, Carolina Parada, Gaurav Aggarwal

    Abstract: Object-goal navigation (Object-nav) entails searching, recognizing and navigating to a target object. Object-nav has been extensively studied by the Embodied-AI community, but most solutions are often restricted to considering static objects (e.g., television, fridge, etc.). We propose a modular framework for object-nav that is able to efficiently search indoor environments for not just static obj… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Shorter version accepted at NeurIPS 2022 Workshop on Robot Learning: Trustworthy Robotics

  13. arXiv:2210.10865  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Robotic Table Wi** via Reinforcement Learning and Whole-body Trajectory Optimization

    Authors: Thomas Lew, Sumeet Singh, Mario Prats, Jeffrey Bingham, Jonathan Weisz, Benjie Holson, Xiaohan Zhang, Vikas Sindhwani, Yao Lu, Fei Xia, Peng Xu, Tingnan Zhang, Jie Tan, Montserrat Gonzalez

    Abstract: We propose a framework to enable multipurpose assistive mobile robots to autonomously wipe tables to clean spills and crumbs. This problem is challenging, as it requires planning wi** actions while reasoning over uncertain latent dynamics of crumbs and spills captured via high-dimensional visual observations. Simultaneously, we must guarantee constraints satisfaction to enable safe deployment in… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  14. arXiv:2209.10780  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

    Authors: Xuesu Xiao, Tingnan Zhang, Krzysztof Choromanski, Edward Lee, Anthony Francis, Jake Varley, Stephen Tu, Sumeet Singh, Peng Xu, Fei Xia, Sven Mikael Persson, Dmitry Kalashnikov, Leila Takayama, Roy Frostig, Jie Tan, Carolina Parada, Vikas Sindhwani

    Abstract: Despite decades of research, existing navigation systems still face real-world challenges when deployed in the wild, e.g., in cluttered home environments or in human-occupied public spaces. To address this, we present a new class of implicit control policies combining the benefits of imitation learning with the robust handling of system constraints from Model Predictive Control (MPC). Our approach… ▽ More

    Submitted 23 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  15. arXiv:2208.01191  [pdf, other

    cs.LG cs.AI cs.NE

    Implicit Two-Tower Policies

    Authors: Yunfan Zhao, Qingkai Pan, Krzysztof Choromanski, Deepali Jain, Vikas Sindhwani

    Abstract: We present a new class of structured reinforcement learning policy-architectures, Implicit Two-Tower (ITT) policies, where the actions are chosen based on the attention scores of their learnable latent representations with those of the input states. By explicitly disentangling action from state processing in the policy stack, we achieve two main goals: substantial computational gains and better pe… ▽ More

    Submitted 25 October, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

  16. arXiv:2204.00598  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language

    Authors: Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence

    Abstract: Large pretrained (e.g., "foundation") models exhibit distinct capabilities depending on the domain of data they are trained on. While these domains are generic, they may only barely overlap. For example, visual-language models (VLMs) are trained on Internet-scale image captions, but large language models (LMs) are further trained on Internet-scale text with no images (e.g., spreadsheets, SAT quest… ▽ More

    Submitted 27 May, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: https://socraticmodels.github.io/

  17. arXiv:2203.08715  [pdf, other

    cs.RO cs.AI cs.LG eess.SY math.DS

    Multiscale Sensor Fusion and Continuous Control with Neural CDEs

    Authors: Sumeet Singh, Francis McCann Ramirez, Jacob Varley, Andy Zeng, Vikas Sindhwani

    Abstract: Though robot learning is often formulated in terms of discrete-time Markov decision processes (MDPs), physical robots require near-continuous multiscale feedback control. Machines operate on multiple asynchronous sensing modalities, each with different frequencies, e.g., video frames at 30Hz, proprioceptive state at 100Hz, force-torque data at 500Hz, etc. While the classic approach is to batch obs… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Submitted to IEEE IROS 2022

  18. arXiv:2110.04367  [pdf, other

    cs.LG stat.ML

    Hybrid Random Features

    Authors: Krzysztof Choromanski, Haoxian Chen, Han Lin, Yuanzhe Ma, Arijit Sehanobish, Deepali Jain, Michael S Ryoo, Jake Varley, Andy Zeng, Valerii Likhosherstov, Dmitry Kalashnikov, Vikas Sindhwani, Adrian Weller

    Abstract: We propose a new class of random feature methods for linearizing softmax and Gaussian kernels called hybrid random features (HRFs) that automatically adapt the quality of kernel estimation to provide most accurate approximation in the defined regions of interest. Special instantiations of HRFs lead to well-known methods such as trigonometric (Rahimi and Recht, 2007) or (recently introduced in the… ▽ More

    Submitted 30 January, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICLR 2022

  19. arXiv:2109.04928  [pdf, other

    cs.RO eess.SY

    Trajectory Optimization with Optimization-Based Dynamics

    Authors: Taylor A. Howell, Simon Le Cleac'h, Sumeet Singh, Pete Florence, Zachary Manchester, Vikas Sindhwani

    Abstract: We present a framework for bi-level trajectory optimization in which a system's dynamics are encoded as the solution to a constrained optimization problem and smooth gradients of this lower-level problem are passed to an upper-level trajectory optimizer. This optimization-based dynamics representation enables constraint handling, additional variables, and non-smooth behavior to be abstracted away… ▽ More

    Submitted 11 January, 2023; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: Minor fixes. Table formatting. Terminology modifications

  20. arXiv:2012.03385  [pdf, other

    cs.RO cs.LG

    Learning to Rearrange Deformable Cables, Fabrics, and Bags with Goal-Conditioned Transporter Networks

    Authors: Daniel Seita, Pete Florence, Jonathan Tompson, Erwin Coumans, Vikas Sindhwani, Ken Goldberg, Andy Zeng

    Abstract: Rearranging and manipulating deformable objects such as cables, fabrics, and bags is a long-standing challenge in robotic manipulation. The complex dynamics and high-dimensional configuration spaces of deformables, compared to rigid objects, make manipulation difficult not only for multi-step planning, but even for goal specification. Goals cannot be as easily specified as rigid object poses, and… ▽ More

    Submitted 18 June, 2023; v1 submitted 6 December, 2020; originally announced December 2020.

    Comments: See https://berkeleyautomation.github.io/bags/ for project website and code; v3 is ICRA 2021 version and v4 adds physical experiments and improves simulation results

  21. arXiv:2011.12257  [pdf, other

    math.OC cs.LG eess.SY math.DS

    Safely Learning Dynamical Systems from Short Trajectories

    Authors: Amir Ali Ahmadi, Abraar Chaudhry, Vikas Sindhwani, Stephen Tu

    Abstract: A fundamental challenge in learning to control an unknown dynamical system is to reduce model uncertainty by making measurements while maintaining safety. In this work, we formulate a mathematical definition of what it means to safely learn a dynamical system by sequentially deciding where to initialize the next trajectory. In our framework, the state of the system is required to stay within a giv… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  22. arXiv:2010.14406  [pdf, other

    cs.RO

    Transporter Networks: Rearranging the Visual World for Robotic Manipulation

    Authors: Andy Zeng, Pete Florence, Jonathan Tompson, Stefan Welker, Jonathan Chien, Maria Attarian, Travis Armstrong, Ivan Krasin, Dan Duong, Ayzaan Wahid, Vikas Sindhwani, Johnny Lee

    Abstract: Robotic manipulation can be formulated as inducing a sequence of spatial displacements: where the space being moved can encompass an object, part of an object, or end effector. In this work, we propose the Transporter Network, a simple model architecture that rearranges deep features to infer spatial displacements from visual input - which can parameterize robot actions. It makes no assumptions of… ▽ More

    Submitted 5 January, 2022; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Project webpage: https://transporternets.github.io Summary video: https://youtu.be/8afHfReCfPo?t=12214

  23. arXiv:2010.08167  [pdf, other

    cs.RO math.OC

    Piecewise-Linear Motion Planning amidst Static, Moving, or Morphing Obstacles

    Authors: Bachir El Khadir, Jean Bernard Lasserre, Vikas Sindhwani

    Abstract: We propose a novel method for planning shortest length piecewise-linear motions through complex environments punctured with static, moving, or even morphing obstacles. Using a moment optimization approach, we formulate a hierarchy of semidefinite programs that yield increasingly refined lower bounds converging monotonically to the optimal path length. For computational tractability, our global m… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  24. arXiv:2008.05952  [pdf, other

    cs.LG eess.SY stat.ML

    Learning Stability Certificates from Data

    Authors: Nicholas M. Boffi, Stephen Tu, Nikolai Matni, Jean-Jacques E. Slotine, Vikas Sindhwani

    Abstract: Many existing tools in nonlinear control theory for establishing stability or safety of a dynamical system can be distilled to the construction of a certificate function that guarantees a desired property. However, algorithms for synthesizing certificate functions typically require a closed-form analytical expression of the underlying dynamics, which rules out their use on many modern robotic plat… ▽ More

    Submitted 14 September, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: Fixes an error in the statement and proof of Theorem 5.1, Theorem 5.2, and Proposition D.1

  25. arXiv:2006.11421  [pdf, other

    cs.LG math.CA math.DS math.OC stat.ML

    An Ode to an ODE

    Authors: Krzysztof Choromanski, Jared Quincy Davis, Valerii Likhosherstov, Xingyou Song, Jean-Jacques Slotine, Jacob Varley, Honglak Lee, Adrian Weller, Vikas Sindhwani

    Abstract: We present a new paradigm for Neural ODE algorithms, called ODEtoODE, where time-dependent parameters of the main flow evolve according to a matrix flow on the orthogonal group O(d). This nested system of two flows, where the parameter-flow is constrained to lie on the compact manifold, provides stability and effectiveness of training and provably solves the gradient vanishing-explosion problem wh… ▽ More

    Submitted 22 June, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 20 pages, 9 figures

  26. arXiv:2005.01906  [pdf, other

    cs.LG stat.ML

    Time Dependence in Non-Autonomous Neural ODEs

    Authors: Jared Quincy Davis, Krzysztof Choromanski, Jake Varley, Honglak Lee, Jean-Jacques Slotine, Valerii Likhosterov, Adrian Weller, Ameesh Makadia, Vikas Sindhwani

    Abstract: Neural Ordinary Differential Equations (ODEs) are elegant reinterpretations of deep networks where continuous time can replace the discrete notion of depth, ODE solvers perform forward propagation, and the adjoint method enables efficient, constant memory backpropagation. Neural ODEs are universal approximators only when they are non-autonomous, that is, the dynamics depends explicitly on time. We… ▽ More

    Submitted 6 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

  27. arXiv:2003.14398  [pdf, other

    cs.LG cs.RO stat.ML

    Robotic Table Tennis with Model-Free Reinforcement Learning

    Authors: Wenbo Gao, Laura Graesser, Krzysztof Choromanski, Xingyou Song, Nevena Lazic, Pannag Sanketi, Vikas Sindhwani, Navdeep Jaitly

    Abstract: We propose a model-free algorithm for learning efficient policies capable of returning table tennis balls by controlling robot joints at a rate of 100Hz. We demonstrate that evolutionary search (ES) methods acting on CNN-based policy architectures for non-visual inputs and convolving across time learn compact controllers leading to smooth motions. Furthermore, we show that with appropriately tuned… ▽ More

    Submitted 27 May, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

    Comments: V2: new URL of supplementary video. 8 pages, 4 figures

    ACM Class: I.2.6; I.2.9

  28. arXiv:2003.13563  [pdf, other

    cs.LG stat.ML

    Stochastic Flows and Geometric Optimization on the Orthogonal Group

    Authors: Krzysztof Choromanski, David Cheikhi, Jared Davis, Valerii Likhosherstov, Achille Nazaret, Achraf Bahamou, Xingyou Song, Mrugank Akarte, Jack Parker-Holder, Jacob Bergquist, Yuan Gao, Aldo Pacchiano, Tamas Sarlos, Adrian Weller, Vikas Sindhwani

    Abstract: We present a new class of stochastic, geometrically-driven optimization algorithms on the orthogonal group $O(d)$ and naturally reductive homogeneous manifolds obtained from the action of the rotation group $SO(d)$. We theoretically and experimentally demonstrate that our methods can be applied in various fields of machine learning including deep, convolutional and recurrent neural networks, reinf… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

  29. arXiv:1910.02812  [pdf, other

    cs.RO cs.AI cs.LG

    Policies Modulating Trajectory Generators

    Authors: Atil Iscen, Ken Caluwaerts, Jie Tan, Tingnan Zhang, Erwin Coumans, Vikas Sindhwani, Vincent Vanhoucke

    Abstract: We propose an architecture for learning complex controllable behaviors by having simple Policies Modulate Trajectory Generators (PMTG), a powerful combination that can provide both memory and prior knowledge to the controller. The result is a flexible architecture that is applicable to a class of problems with periodic motion for which one has an insight into the class of trajectories that might l… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Journal ref: In Proceedings of The 2nd Conference on Robot Learning, volume 87 of Proceedings of Machine Learning Research, pages 916-926. PMLR, 29-31 Oct 2018

  30. arXiv:1907.13122  [pdf, other

    math.OC cs.LG cs.RO eess.SY

    Learning Stabilizable Nonlinear Dynamics with Contraction-Based Regularization

    Authors: Sumeet Singh, Spencer M. Richards, Vikas Sindhwani, Jean-Jacques E. Slotine, Marco Pavone

    Abstract: We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key contribution is a control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, a constraint which guarantees the existence of robust tracking controllers for arbitrary open-loop trajectories generated with the learned system. Leveraging… ▽ More

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: Invited submission for IJRR; under review. arXiv admin note: text overlap with arXiv:1808.00113

  31. arXiv:1907.03613  [pdf, other

    cs.LG cs.AI cs.RO

    Data Efficient Reinforcement Learning for Legged Robots

    Authors: Yuxiang Yang, Ken Caluwaerts, Atil Iscen, Tingnan Zhang, Jie Tan, Vikas Sindhwani

    Abstract: We present a model-based framework for robot locomotion that achieves walking based on only 4.5 minutes (45,000 control steps) of data collected on a quadruped robot. To accurately model the robot's dynamics over a long horizon, we introduce a loss function that tracks the model's prediction over multiple timesteps. We adapt model predictive control to account for planning latency, which allows th… ▽ More

    Submitted 6 October, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

  32. arXiv:1905.09499  [pdf, other

    cs.RO math.OC

    Teleoperator Imitation with Continuous-time Safety

    Authors: Bachir El Khadir, Jake Varley, Vikas Sindhwani

    Abstract: Learning to effectively imitate human teleoperators, with generalization to unseen and dynamic environments, is a promising path to greater autonomy enabling robots to steadily acquire complex skills from supervision. We propose a new motion learning technique rooted in contraction theory and sum-of-squares programming for estimating a control law in the form of a polynomial vector field from a gi… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

  33. arXiv:1903.02993  [pdf, other

    cs.LG stat.ML

    Provably Robust Blackbox Optimization for Reinforcement Learning

    Authors: Krzysztof Choromanski, Aldo Pacchiano, Jack Parker-Holder, Yunhao Tang, Deepali Jain, Yuxiang Yang, Atil Iscen, Jasmine Hsu, Vikas Sindhwani

    Abstract: Interest in derivative-free optimization (DFO) and "evolutionary strategies" (ES) has recently surged in the Reinforcement Learning (RL) community, with growing evidence that they can match state of the art methods for policy optimization problems in Robotics. However, it is well known that DFO methods suffer from prohibitively high sampling complexity. They can also be very sensitive to noisy rew… ▽ More

    Submitted 8 July, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

  34. arXiv:1808.00113  [pdf, other

    eess.SY cs.LG cs.RO math.OC

    Learning Stabilizable Dynamical Systems via Control Contraction Metrics

    Authors: Sumeet Singh, Vikas Sindhwani, Jean-Jacques E. Slotine, Marco Pavone

    Abstract: We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key idea is to develop a new control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, which guarantees that the learned system can be accompanied by a robust controller capable of stabilizing any open-loop trajectory that the system may… ▽ More

    Submitted 10 November, 2018; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: To appear at WAFR 2018. v2: re-structured Sections 3 & 4 to improve clarity; expanded discussion on limitations & future work in Section 5; added details on training & validation, significantly expanded experiments

  35. arXiv:1805.07831  [pdf, other

    cs.RO

    Optimizing Simulations with Noise-Tolerant Structured Exploration

    Authors: Krzysztof Choromanski, Atil Iscen, Vikas Sindhwani, Jie Tan, Erwin Coumans

    Abstract: We propose a simple drop-in noise-tolerant replacement for the standard finite difference procedure used ubiquitously in blackbox optimization. In our approach, parameter perturbation directions are defined by a family of structured orthogonal matrices. We show that at the small cost of computing a Fast Walsh-Hadamard/Fourier Transform (FWHT/FFT), such structured finite differences consistently gi… ▽ More

    Submitted 20 May, 2018; originally announced May 2018.

  36. arXiv:1804.04878  [pdf, other

    cs.RO cs.LG stat.ML

    Learning Contracting Vector Fields For Stable Imitation Learning

    Authors: Vikas Sindhwani, Stephen Tu, Mohi Khansari

    Abstract: We propose a new non-parametric framework for learning incrementally stable dynamical systems x' = f(x) from a set of sampled trajectories. We construct a rich family of smooth vector fields induced by certain classes of matrix-valued kernels, whose equilibria are placed exactly at a desired set of locations and whose local contraction and curvature properties at various points can be explicitly c… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.

  37. arXiv:1804.02395  [pdf, other

    cs.LG cs.RO stat.ML

    Structured Evolution with Compact Architectures for Scalable Policy Optimization

    Authors: Krzysztof Choromanski, Mark Rowland, Vikas Sindhwani, Richard E. Turner, Adrian Weller

    Abstract: We present a new method of blackbox optimization via gradient approximation with the use of structured random orthogonal matrices, providing more accurate estimators than baselines and with provable theoretical guarantees. We show that this algorithm can be successfully applied to learn better quality compact policies than those using standard gradient estimation techniques. The compact policies w… ▽ More

    Submitted 12 June, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

  38. arXiv:1710.05387  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Manifold Regularization for Kernelized LSTD

    Authors: Xinyan Yan, Krzysztof Choromanski, Byron Boots, Vikas Sindhwani

    Abstract: Policy evaluation or value function or Q-function approximation is a key procedure in reinforcement learning (RL). It is a necessary component of policy iteration and can be used for variance reduction in policy gradient methods. Therefore its quality has a significant impact on most RL algorithms. Motivated by manifold regularized learning, we propose a novel kernelized policy evaluation method t… ▽ More

    Submitted 15 October, 2017; originally announced October 2017.

    Comments: 6 pages, CoRL 2017 non-archival track

  39. arXiv:1611.07369  [pdf, other

    math.OC cs.CG cs.CV cs.GR

    Geometry of 3D Environments and Sum of Squares Polynomials

    Authors: Amir Ali Ahmadi, Georgina Hall, Ameesh Makadia, Vikas Sindhwani

    Abstract: Motivated by applications in robotics and computer vision, we study problems related to spatial reasoning of a 3D environment using sublevel sets of polynomials. These include: tightly containing a cloud of points (e.g., representing an obstacle) with convex or nearly-convex basic semialgebraic sets, computation of Euclidean distances between two such sets, separation of two convex basic semalgebr… ▽ More

    Submitted 7 March, 2017; v1 submitted 22 November, 2016; originally announced November 2016.

  40. arXiv:1608.00860  [pdf, other

    cs.LG stat.ML

    Hierarchically Compositional Kernels for Scalable Nonparametric Learning

    Authors: Jie Chen, Haim Avron, Vikas Sindhwani

    Abstract: We propose a novel class of kernels to alleviate the high computational cost of large-scale nonparametric learning with kernel methods. The proposed kernel is defined based on a hierarchical partitioning of the underlying data domain, where the Nyström method (a globally low-rank approximation) is married with a locally lossless approximation in a hierarchical fashion. The kernel maintains (strict… ▽ More

    Submitted 14 August, 2017; v1 submitted 2 August, 2016; originally announced August 2016.

    Comments: Journal of Machine Learning Research, vol 18, 2017

  41. arXiv:1605.09049  [pdf, other

    cs.LG math.NA stat.ML

    Recycling Randomness with Structure for Sublinear time Kernel Expansions

    Authors: Krzysztof Choromanski, Vikas Sindhwani

    Abstract: We propose a scheme for recycling Gaussian random vectors into structured matrices to approximate various kernel functions in sublinear time via random embeddings. Our framework includes the Fastfood construction as a special case, but also extends to Circulant, Toeplitz and Hankel matrices, and the broader family of structured matrices that are characterized by the concept of low-displacement ran… ▽ More

    Submitted 29 May, 2016; originally announced May 2016.

  42. arXiv:1604.02594  [pdf, ps, other

    cs.LG cs.CL cs.NE

    Learning Compact Recurrent Neural Networks

    Authors: Zhiyun Lu, Vikas Sindhwani, Tara N. Sainath

    Abstract: Recurrent neural networks (RNNs), including long short-term memory (LSTM) RNNs, have produced state-of-the-art results on a variety of speech recognition tasks. However, these models are often too large in size for deployment on mobile devices with memory and latency constraints. In this work, we study mechanisms for learning compact RNNs and LSTMs via low-rank factorizations and parameter sharing… ▽ More

    Submitted 9 April, 2016; originally announced April 2016.

  43. arXiv:1510.01722  [pdf, other

    stat.ML cs.CV cs.LG

    Structured Transforms for Small-Footprint Deep Learning

    Authors: Vikas Sindhwani, Tara N. Sainath, Sanjiv Kumar

    Abstract: We consider the task of building compact deep learning pipelines suitable for deployment on storage and power constrained mobile devices. We propose a unified framework to learn a broad family of structured parameter matrices that are characterized by the notion of low displacement rank. Our structured transforms admit fast function and gradient evaluation, and span a rich range of parameter shari… ▽ More

    Submitted 6 October, 2015; originally announced October 2015.

    Comments: To appear in NIPS 2015; 9 pages

  44. arXiv:1412.8293  [pdf, ps, other

    stat.ML cs.LG math.NA stat.CO

    Quasi-Monte Carlo Feature Maps for Shift-Invariant Kernels

    Authors: Haim Avron, Vikas Sindhwani, Jiyan Yang, Michael Mahoney

    Abstract: We consider the problem of improving the efficiency of randomized Fourier feature maps to accelerate training and testing speed of kernel methods on large datasets. These approximate feature maps arise as Monte Carlo approximations to integral representations of shift-invariant kernel functions (e.g., Gaussian kernel). In this paper, we propose to use Quasi-Monte Carlo (QMC) approximations instead… ▽ More

    Submitted 9 August, 2015; v1 submitted 29 December, 2014; originally announced December 2014.

    Comments: A short version of this paper has been presented in ICML 2014

  45. arXiv:1409.2620  [pdf, other

    cs.LG stat.ML

    Learning Machines Implemented on Non-Deterministic Hardware

    Authors: Suyog Gupta, Vikas Sindhwani, Kailash Gopalakrishnan

    Abstract: This paper highlights new opportunities for designing large-scale machine learning systems as a consequence of blurring traditional boundaries that have allowed algorithm designers and application-level practitioners to stay -- for the most part -- oblivious to the details of the underlying hardware-level implementations. The hardware/software co-design methodology advocated here hinges on the dep… ▽ More

    Submitted 9 September, 2014; originally announced September 2014.

  46. arXiv:1409.0940  [pdf, other

    stat.ML cs.DC cs.LG

    High-performance Kernel Machines with Implicit Distributed Optimization and Randomization

    Authors: Vikas Sindhwani, Haim Avron

    Abstract: In order to fully utilize "big data", it is often required to use "big models". Such models tend to grow with the complexity and size of the training data, and do not make strong parametric assumptions upfront on the nature of the underlying statistical dependencies. Kernel methods fit this need well, as they constitute a versatile and principled statistical methodology for solving a wide range of… ▽ More

    Submitted 16 April, 2015; v1 submitted 2 September, 2014; originally announced September 2014.

    Comments: Work presented at MMDS 2014 (June 2014) and JSM 2014

  47. arXiv:1408.2066  [pdf

    cs.LG stat.ML

    Scalable Matrix-valued Kernel Learning for High-dimensional Nonlinear Multivariate Regression and Granger Causality

    Authors: Vikas Sindhwani, Ha Quang Minh, Aurelie Lozano

    Abstract: We propose a general matrix-valued multiple kernel learning framework for high-dimensional nonlinear multivariate regression problems. This framework allows a broad class of mixed norm regularizers, including those that induce sparsity, to be imposed on a dictionary of vector-valued Reproducing Kernel Hilbert Spaces. We develop a highly scalable and eigendecomposition-free algorithm that orchestra… ▽ More

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-586-595

  48. arXiv:1312.7167  [pdf, ps, other

    stat.ML cs.CV cs.LG

    Near-separable Non-negative Matrix Factorization with $\ell_1$- and Bregman Loss Functions

    Authors: Abhishek Kumar, Vikas Sindhwani

    Abstract: Recently, a family of tractable NMF algorithms have been proposed under the assumption that the data matrix satisfies a separability condition Donoho & Stodden (2003); Arora et al. (2012). Geometrically, this condition reformulates the NMF problem as that of finding the extreme rays of the conical hull of a finite set of vectors. In this paper, we develop several extensions of the conical hull pro… ▽ More

    Submitted 26 December, 2013; originally announced December 2013.

  49. arXiv:1210.4792  [pdf, ps, other

    stat.ML cs.LG

    Scalable Matrix-valued Kernel Learning for High-dimensional Nonlinear Multivariate Regression and Granger Causality

    Authors: Vikas Sindhwani, Minh Ha Quang, Aurelie C. Lozano

    Abstract: We propose a general matrix-valued multiple kernel learning framework for high-dimensional nonlinear multivariate regression problems. This framework allows a broad class of mixed norm regularizers, including those that induce sparsity, to be imposed on a dictionary of vector-valued Reproducing Kernel Hilbert Spaces. We develop a highly scalable and eigendecomposition-free algorithm that orchestra… ▽ More

    Submitted 7 March, 2013; v1 submitted 17 October, 2012; originally announced October 2012.

    Comments: 22 pages. Presentation changes; Corrections made to Theorem 2 (section 6.2) in this version

  50. arXiv:1210.1190  [pdf, ps, other

    stat.ML cs.LG

    Fast Conical Hull Algorithms for Near-separable Non-negative Matrix Factorization

    Authors: Abhishek Kumar, Vikas Sindhwani, Prabhanjan Kambadur

    Abstract: The separability assumption (Donoho & Stodden, 2003; Arora et al., 2012) turns non-negative matrix factorization (NMF) into a tractable problem. Recently, a new class of provably-correct NMF algorithms have emerged under this assumption. In this paper, we reformulate the separable NMF problem as that of finding the extreme rays of the conical hull of a finite set of vectors. From this geometric pe… ▽ More

    Submitted 3 October, 2012; originally announced October 2012.

    Comments: 15 pages, 6 figures