Skip to main content

Showing 1–24 of 24 results for author: Varley, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19800  [pdf, other

    cs.LG cs.RO

    Modeling the Real World with High-Density Visual Particle Dynamics

    Authors: William F. Whitney, Jacob Varley, Deepali Jain, Krzysztof Choromanski, Sumeet Singh, Vikas Sindhwani

    Abstract: We present High-Density Visual Particle Dynamics (HD-VPD), a learned world model that can emulate the physical dynamics of real scenes by processing massive latent point clouds containing 100K+ particles. To enable efficiency at this scale, we introduce a novel family of Point Cloud Transformers (PCTs) called Interlacers leveraging intertwined linear-attention Performer layers and graph-based neig… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2404.03570  [pdf, other

    cs.RO

    Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity

    Authors: Jake Varley, Sumeet Singh, Deepali Jain, Krzysztof Choromanski, Andy Zeng, Somnath Basu Roy Chowdhury, Avinava Dubey, Vikas Sindhwani

    Abstract: We present an embodied AI system which receives open-ended natural language instructions from a human, and controls two arms to collaboratively accomplish potentially long-horizon tasks over a large workspace. Our system is modular: it deploys state of the art Large Language Models for task planning,Vision-Language models for semantic perception, and Point Cloud transformers for gras**. With sem… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2312.01990  [pdf, other

    cs.RO cs.AI

    SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention

    Authors: Isabel Leal, Krzysztof Choromanski, Deepali Jain, Avinava Dubey, Jake Varley, Michael Ryoo, Yao Lu, Frederick Liu, Vikas Sindhwani, Quan Vuong, Tamas Sarlos, Ken Oslund, Karol Hausman, Kanishka Rao

    Abstract: We present Self-Adaptive Robust Attention for Robotics Transformers (SARA-RT): a new paradigm for addressing the emerging challenge of scaling up Robotics Transformers (RT) for on-robot deployment. SARA-RT relies on the new method of fine-tuning proposed by us, called up-training. It converts pre-trained or already fine-tuned Transformer-based robotic policies of quadratic time complexity (includi… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  4. arXiv:2307.01928  [pdf, other

    cs.RO cs.AI stat.AP

    Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

    Authors: Allen Z. Ren, Anushri Dixit, Alexandra Bodrova, Sumeet Singh, Stephen Tu, Noah Brown, Peng Xu, Leila Takayama, Fei Xia, Jake Varley, Zhenjia Xu, Dorsa Sadigh, Andy Zeng, Anirudha Majumdar

    Abstract: Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, which is a framework for measuring and aligning the uncertainty of LLM-based planners such that they know when they don't know and ask for… ▽ More

    Submitted 4 September, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: Conference on Robot Learning (CoRL) 2023, Oral Presentation

  5. arXiv:2210.02343  [pdf, other

    cs.RO cs.LG

    Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning

    Authors: David Brandfonbrener, Stephen Tu, Avi Singh, Stefan Welker, Chad Boodoo, Nikolai Matni, Jake Varley

    Abstract: We consider how to most efficiently leverage teleoperator time to collect data for learning robust image-based value functions and policies for sparse reward robotic tasks. To accomplish this goal, we modify the process of data collection to include more than just successful demonstrations of the desired task. Instead we develop a novel protocol that we call Visual Backtracking Teleoperation (VBT)… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  6. arXiv:2209.10780  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

    Authors: Xuesu Xiao, Tingnan Zhang, Krzysztof Choromanski, Edward Lee, Anthony Francis, Jake Varley, Stephen Tu, Sumeet Singh, Peng Xu, Fei Xia, Sven Mikael Persson, Dmitry Kalashnikov, Leila Takayama, Roy Frostig, Jie Tan, Carolina Parada, Vikas Sindhwani

    Abstract: Despite decades of research, existing navigation systems still face real-world challenges when deployed in the wild, e.g., in cluttered home environments or in human-occupied public spaces. To address this, we present a new class of implicit control policies combining the benefits of imitation learning with the robust handling of system constraints from Model Predictive Control (MPC). Our approach… ▽ More

    Submitted 23 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  7. arXiv:2209.06291  [pdf, other

    cs.CV cs.RO

    Multiple View Performers for Shape Completion

    Authors: David Watkins, Peter Allen, Krzysztof Choromanski, Jacob Varley, Nicholas Waytowich

    Abstract: We propose the Multiple View Performer (MVP) - a new architecture for 3D shape completion from a series of temporally sequential views. MVP accomplishes this task by using linear-attention Transformers called Performers. Our model allows the current observation of the scene to attend to the previous ones for more accurate infilling. The history of past observations is compressed via the compact as… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 6 pages, 2 pages of references, 6 figures, 3 tables

  8. arXiv:2203.08715  [pdf, other

    cs.RO cs.AI cs.LG eess.SY math.DS

    Multiscale Sensor Fusion and Continuous Control with Neural CDEs

    Authors: Sumeet Singh, Francis McCann Ramirez, Jacob Varley, Andy Zeng, Vikas Sindhwani

    Abstract: Though robot learning is often formulated in terms of discrete-time Markov decision processes (MDPs), physical robots require near-continuous multiscale feedback control. Machines operate on multiple asynchronous sensing modalities, each with different frequencies, e.g., video frames at 30Hz, proprioceptive state at 100Hz, force-torque data at 500Hz, etc. While the classic approach is to batch obs… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Submitted to IEEE IROS 2022

  9. arXiv:2203.01983  [pdf, other

    cs.RO

    Implicit Kinematic Policies: Unifying Joint and Cartesian Action Spaces in End-to-End Robot Learning

    Authors: Aditya Ganapathi, Pete Florence, Jake Varley, Kaylee Burns, Ken Goldberg, Andy Zeng

    Abstract: Action representation is an important yet often overlooked aspect in end-to-end robot learning with deep networks. Choosing one action space over another (e.g. target joint positions, or Cartesian end-effector poses) can result in surprisingly stark performance differences between various downstream tasks -- and as a result, considerable research has been devoted to finding the right action space… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: International Conference on Robotics and Automation (ICRA) 2022

  10. arXiv:2110.04367  [pdf, other

    cs.LG stat.ML

    Hybrid Random Features

    Authors: Krzysztof Choromanski, Haoxian Chen, Han Lin, Yuanzhe Ma, Arijit Sehanobish, Deepali Jain, Michael S Ryoo, Jake Varley, Andy Zeng, Valerii Likhosherstov, Dmitry Kalashnikov, Vikas Sindhwani, Adrian Weller

    Abstract: We propose a new class of random feature methods for linearizing softmax and Gaussian kernels called hybrid random features (HRFs) that automatically adapt the quality of kernel estimation to provide most accurate approximation in the defined regions of interest. Special instantiations of HRFs lead to well-known methods such as trigonometric (Rahimi and Recht, 2007) or (recently introduced in the… ▽ More

    Submitted 30 January, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICLR 2022

  11. arXiv:2110.00717  [pdf, other

    cs.RO

    Mobile Manipulation Leveraging Multiple Views

    Authors: David Watkins, Peter K Allen, Henrique Maia, Madhavan Seshadri, Jonathan Sanabria, Nicholas Waytowich, Jacob Varley

    Abstract: While both navigation and manipulation are challenging topics in isolation, many tasks require the ability to both navigate and manipulate in concert. To this end, we propose a mobile manipulation system that leverages novel navigation and shape completion methods to manipulate an object with a mobile robot. Our system utilizes uncertainty in the initial estimation of a manipulation target to calc… ▽ More

    Submitted 7 March, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: 6 pages, 2 pages of references, 5 figures, 5 tables

  12. arXiv:2104.08212  [pdf, other

    cs.RO cs.LG

    MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale

    Authors: Dmitry Kalashnikov, Jacob Varley, Yevgen Chebotar, Benjamin Swanson, Rico Jonschkowski, Chelsea Finn, Sergey Levine, Karol Hausman

    Abstract: General-purpose robotic systems must master a large repertoire of diverse skills to be useful in a range of daily tasks. While reinforcement learning provides a powerful framework for acquiring individual behaviors, the time needed to acquire each skill makes the prospect of a generalist robot trained with RL daunting. In this paper, we study how a large-scale collective robotic learning system ca… ▽ More

    Submitted 27 April, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

  13. arXiv:2104.07749  [pdf, other

    cs.RO cs.LG

    Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills

    Authors: Yevgen Chebotar, Karol Hausman, Yao Lu, Ted Xiao, Dmitry Kalashnikov, Jake Varley, Alex Irpan, Benjamin Eysenbach, Ryan Julian, Chelsea Finn, Sergey Levine

    Abstract: We consider the problem of learning useful robotic skills from previously collected offline data without access to manually specified rewards or additional online exploration, a setting that is becoming increasingly important for scaling robot learning by reusing past robotic data. In particular, we propose the objective of learning a functional understanding of the environment by learning to reac… ▽ More

    Submitted 10 June, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

  14. arXiv:2103.14633  [pdf, other

    cs.RO cs.CV cs.LG cs.NE

    Visionary: Vision architecture discovery for robot learning

    Authors: Iretiayo Akinola, Anelia Angelova, Yao Lu, Yevgen Chebotar, Dmitry Kalashnikov, Jacob Varley, Julian Ibarz, Michael S. Ryoo

    Abstract: We propose a vision-based architecture search algorithm for robot manipulation learning, which discovers interactions between low dimension action inputs and high dimensional visual inputs. Our approach automatically designs architectures while training on the task - discovering novel ways of combining and attending image feature representations with actions as well as features from previous layer… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Journal ref: ICRA 2021

  15. arXiv:2012.14464  [pdf, other

    cs.RO cs.AI

    Disentangled Planning and Control in Vision Based Robotics via Reward Machines

    Authors: Alberto Camacho, Jacob Varley, Deepali Jain, Atil Iscen, Dmitry Kalashnikov

    Abstract: In this work we augment a Deep Q-Learning agent with a Reward Machine (DQRM) to increase speed of learning vision-based policies for robot tasks, and overcome some of the limitations of DQN that prevent it from converging to good-quality policies. A reward machine (RM) is a finite state machine that decomposes a task into a discrete planning graph and equips the agent with a reward function to gui… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

    Comments: Accepted to the Deep Reinforcement Learning Workshop at Neural Information Processing Systems (2020)

  16. arXiv:2006.11421  [pdf, other

    cs.LG math.CA math.DS math.OC stat.ML

    An Ode to an ODE

    Authors: Krzysztof Choromanski, Jared Quincy Davis, Valerii Likhosherstov, Xingyou Song, Jean-Jacques Slotine, Jacob Varley, Honglak Lee, Adrian Weller, Vikas Sindhwani

    Abstract: We present a new paradigm for Neural ODE algorithms, called ODEtoODE, where time-dependent parameters of the main flow evolve according to a matrix flow on the orthogonal group O(d). This nested system of two flows, where the parameter-flow is constrained to lie on the compact manifold, provides stability and effectiveness of training and provably solves the gradient vanishing-explosion problem wh… ▽ More

    Submitted 22 June, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 20 pages, 9 figures

  17. arXiv:2005.01906  [pdf, other

    cs.LG stat.ML

    Time Dependence in Non-Autonomous Neural ODEs

    Authors: Jared Quincy Davis, Krzysztof Choromanski, Jake Varley, Honglak Lee, Jean-Jacques Slotine, Valerii Likhosterov, Adrian Weller, Ameesh Makadia, Vikas Sindhwani

    Abstract: Neural Ordinary Differential Equations (ODEs) are elegant reinterpretations of deep networks where continuous time can replace the discrete notion of depth, ODE solvers perform forward propagation, and the adjoint method enables efficient, constant memory backpropagation. Neural ODEs are universal approximators only when they are non-autonomous, that is, the dynamics depends explicitly on time. We… ▽ More

    Submitted 6 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

  18. arXiv:2002.09107  [pdf, other

    cs.RO cs.CV cs.LG

    Learning Precise 3D Manipulation from Multiple Uncalibrated Cameras

    Authors: Iretiayo Akinola, Jacob Varley, Dmitry Kalashnikov

    Abstract: In this work, we present an effective multi-view approach to closed-loop end-to-end learning of precise manipulation tasks that are 3D in nature. Our method learns to accomplish these tasks using multiple statically placed but uncalibrated RGB camera views without building an explicit 3D representation such as a pointcloud or voxel grid. This multi-camera approach achieves superior task performanc… ▽ More

    Submitted 31 March, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted at International Conference on Robotics and Automation (ICRA 2020)

  19. arXiv:1909.04787  [pdf, other

    cs.RO cs.AI cs.LG

    MAT: Multi-Fingered Adaptive Tactile Gras** via Deep Reinforcement Learning

    Authors: Bohan Wu, Iretiayo Akinola, Jacob Varley, Peter Allen

    Abstract: Vision-based gras** systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of… ▽ More

    Submitted 9 October, 2019; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: Accepted at 3rd Conference on Robot Learning (CoRL 2019). Oral Presentation

  20. arXiv:1905.09499  [pdf, other

    cs.RO math.OC

    Teleoperator Imitation with Continuous-time Safety

    Authors: Bachir El Khadir, Jake Varley, Vikas Sindhwani

    Abstract: Learning to effectively imitate human teleoperators, with generalization to unseen and dynamic environments, is a promising path to greater autonomy enabling robots to steadily acquire complex skills from supervision. We propose a new motion learning technique rooted in contraction theory and sum-of-squares programming for estimating a control law in the form of a polynomial vector field from a gi… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

  21. arXiv:1806.11402  [pdf, other

    cs.RO

    Workspace Aware Online Grasp Planning

    Authors: Iretiayo Akinola, Jacob Varley, Boyuan Chen, Peter K. Allen

    Abstract: This work provides a framework for a workspace aware online grasp planner. This framework greatly improves the performance of standard online grasp planning algorithms by incorporating a notion of reachability into the online grasp planning process. Offline, a database of hundreds of thousands of unique end-effector poses were queried for feasability. At runtime, our grasp planner uses this databa… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

    Comments: 8 pages, Submitted to IROS 2018

  22. arXiv:1804.02462  [pdf, other

    cs.HC cs.RO

    Human Robot Interface for Assistive Gras**

    Authors: David Watkins, Chaiwen Chou, Caroline Weinberg, Jacob Varley, Kenneth Lyons, Sanjay Joshi, Lynne Weber, Joel Stein, Peter Allen

    Abstract: This work describes a new human-in-the-loop (HitL) assistive gras** system for individuals with varying levels of physical capabilities. We investigated the feasibility of using four potential input devices with our assistive gras** system interface, using able-bodied individuals to define a set of quantitative metrics that could be used to assess an assistive gras** system. We then took the… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: 8 pages, 21 figures

  23. arXiv:1803.07671  [pdf, other

    cs.RO

    Multi-Modal Geometric Learning for Gras** and Manipulation

    Authors: David Watkins, Jacob Varley, Peter Allen

    Abstract: This work provides an architecture that incorporates depth and tactile information to create rich and accurate 3D models useful for robotic manipulation tasks. This is accomplished through the use of a 3D convolutional neural network (CNN). Offline, the network is provided with both depth and tactile information and trained to predict the object's geometry, thus filling in regions of occlusion. At… ▽ More

    Submitted 27 February, 2019; v1 submitted 20 March, 2018; originally announced March 2018.

  24. arXiv:1609.08546  [pdf, other

    cs.RO

    Shape Completion Enabled Robotic Gras**

    Authors: Jacob Varley, Chad DeChant, Adam Richardson, JoaquĆ­n Ruales, Peter Allen

    Abstract: This work provides an architecture to enable robotic grasp planning via shape completion. Shape completion is accomplished through the use of a 3D convolutional neural network (CNN). The network is trained on our own new open source dataset of over 440,000 3D exemplars captured from varying viewpoints. At runtime, a 2.5D pointcloud captured from a single point of view is fed into the CNN, which fi… ▽ More

    Submitted 2 March, 2017; v1 submitted 27 September, 2016; originally announced September 2016.

    Comments: Under review at IEEE/RSJ International Conference on Intelligent Robots and Systems(IROS) 2017