-
ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation
Authors:
ALOHA 2 Team,
Jorge Aldaco,
Travis Armstrong,
Robert Baruch,
Jeff Bingham,
Sanky Chan,
Kenneth Draper,
Debidatta Dwibedi,
Chelsea Finn,
Pete Florence,
Spencer Goodrich,
Wayne Gramlich,
Torr Hage,
Alexander Herzog,
Jonathan Hoech,
Thinh Nguyen,
Ian Storz,
Baruch Tabanpour,
Leila Takayama,
Jonathan Tompson,
Ayzaan Wahid,
Ted Wahrburg,
Sichun Xu,
Sergey Yaroshenko,
Kevin Zakka
, et al. (1 additional author not shown)
Abstract:
Diverse demonstration datasets have powered significant advances in robot learning, but the dexterity and scale of such data can be limited by the hardware cost, the hardware robustness, and the ease of teleoperation. We introduce ALOHA 2, an enhanced version of ALOHA that has greater performance, ergonomics, and robustness compared to the original design. To accelerate research in large-scale bimanual manipulation, we open source all hardware designs of ALOHA 2 with a detailed tutorial, together with a MuJoCo model of ALOHA 2 with system identification. See the project website at aloha-2.github.io.
Submitted 7 February, 2024;
originally announced May 2024.
-
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Authors:
Open X-Embodiment Collaboration,
Abby O'Neill,
Abdul Rehman,
Abhinav Gupta,
Abhiram Maddukuri,
Abhishek Gupta,
Abhishek Padalkar,
Abraham Lee,
Acorn Pooley,
Agrim Gupta,
Ajay Mandlekar,
Ajinkya Jain,
Albert Tung,
Alex Bewley,
Alex Herzog,
Alex Irpan,
Alexander Khazatsky,
Anant Rai,
Anchit Gupta,
Andrew Wang,
Andrey Kolobov,
Anikait Singh,
Animesh Garg,
Aniruddha Kembhavi,
Annie Xie
, et al. (267 additional authors not shown)
Abstract:
Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning methods train a separate model for every application, every robot, and even every environment. Can we instead train a generalist X-robot policy that can be adapted efficiently to new robots, tasks, and environments? In this paper, we provide datasets in standardized data formats and models to make it possible to explore this possibility in the context of robotic manipulation, alongside experimental results that provide an example of effective X-robot policies. We assemble a dataset from 22 different robots collected through a collaboration between 21 institutions, demonstrating 527 skills (160266 tasks). We show that a high-capacity model trained on this data, which we call RT-X, exhibits positive transfer and improves the capabilities of multiple robots by leveraging experience from other platforms. More details can be found on the project website https://robotics-transformer-x.github.io.
Submitted 1 June, 2024; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Authors:
Alexander Herzog,
Kanishka Rao,
Karol Hausman,
Yao Lu,
Paul Wohlhart,
Mengyuan Yan,
Jessica Lin,
Montserrat Gonzalez Arenas,
Ted Xiao,
Daniel Kappler,
Daniel Ho,
Jarek Rettinghouse,
Yevgen Chebotar,
Kuang-Huei Lee,
Keerthana Gopalakrishnan,
Ryan Julian,
Adrian Li,
Chuyuan Kelly Fu,
Bob Wei,
Sangeetha Ramesh,
Khem Holden,
Kim Kleiven,
David Rendleman,
Sean Kirmani,
Jeff Bingham
, et al. (15 additional authors not shown)
Abstract:
We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL from real-world data with bootstrapping from training in simulation, and incorporates auxiliary inputs from existing computer vision systems as a way to boost generalization to novel objects, while retaining the benefits of end-to-end training. We analyze the tradeoffs of different design decisions in our system, and present a large-scale empirical validation that includes training on real-world data gathered over the course of 24 months of experimentation, across a fleet of 23 robots in three office buildings, with a total training set of 9527 hours of robotic experience. Our final validation also consists of 4800 evaluation trials across 240 waste station configurations, in order to evaluate in detail the impact of the design decisions in our system, the scaling effects of including more real-world data, and the performance of the method on novel objects. The project website and videos can be found at http://rl-at-scale.github.io.
Submitted 5 May, 2023;
originally announced May 2023.
-
Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization
Authors:
Thomas Lew,
Sumeet Singh,
Mario Prats,
Jeffrey Bingham,
Jonathan Weisz,
Benjie Holson,
Xiaohan Zhang,
Vikas Sindhwani,
Yao Lu,
Fei Xia,
Peng Xu,
Tingnan Zhang,
Jie Tan,
Montserrat Gonzalez
Abstract:
We propose a framework to enable multipurpose assistive mobile robots to autonomously wipe tables to clean spills and crumbs. This problem is challenging, as it requires planning wiping actions while reasoning over uncertain latent dynamics of crumbs and spills captured via high-dimensional visual observations. Simultaneously, we must guarantee constraints satisfaction to enable safe deployment in unstructured cluttered environments. To tackle this problem, we first propose a stochastic differential equation to model crumbs and spill dynamics and absorption with a robot wiper. Using this model, we train a vision-based policy for planning wiping actions in simulation using reinforcement learning (RL). To enable zero-shot sim-to-real deployment, we dovetail the RL policy with a whole-body trajectory optimization framework to compute base and arm joint trajectories that execute the desired wiping motions while guaranteeing constraints satisfaction. We extensively validate our approach in simulation and on hardware. Video: https://youtu.be/inORKP4F3EI
Submitted 19 October, 2022;
originally announced October 2022.
-
Trajectory Splitting: A Distributed Formulation for Collision Avoiding Trajectory Optimization
Authors:
Changhao Wang,
Jeffrey Bingham,
Masayoshi Tomizuka
Abstract:
Efficient trajectory optimization is essential for avoiding collisions in unstructured environments, but it remains challenging to have both speed and quality in the solutions. One reason is that second-order optimality requires calculating Hessian matrices that can grow as $O(N^2)$ in the number of waypoints. Decreasing the waypoints can quadratically decrease computation time. Unfortunately, fewer waypoints result in lower quality trajectories that may not avoid the collision. To have both dense waypoints and reduced computation time, we took inspiration from recent studies on consensus optimization and propose a distributed formulation of collocated trajectory optimization. It breaks a long trajectory into several segments, where each segment becomes a subproblem of a few waypoints. These subproblems are solved classically, but in parallel, and the solutions are fused into a single trajectory with a consensus constraint that enforces continuity of the segments through a consensus update. With this scheme, the quadratic complexity is distributed to each segment and enables solving for higher-quality trajectories with denser waypoints. Furthermore, the proposed formulation is amenable to using any existing trajectory optimizer for solving the subproblems. We compare the performance of our implementation of trajectory splitting against leading motion planning algorithms and demonstrate the improved computational efficiency of our method.
Submitted 2 November, 2021;
originally announced November 2021.
-
An age-structured model of hepatitis B viral infection highlights the potential of different therapeutic strategies
Authors:
Farzad Fatehi,
Richard J. Bingham,
Eric C. Dykeman,
Peter G. Stockley,
Reidun Twarock
Abstract:
Hepatitis B virus is a global health threat, and its elimination by 2030 has been prioritised by the World Health Organisation. Here we present an age-structured model for the immune response to an HBV infection, which takes into account contributions from both cell-mediated and humoral immunity. The model has been validated using published patient data recorded during acute infection. It has been adapted to the scenarios of chronic infection, clearance of infection, and flare-ups via variation of the immune response parameters. The impacts of immune response exhaustion and non-infectious subviral particles on the immune response dynamics are analysed. A comparison of different treatment options in the context of this model reveals that drugs targeting aspects of the viral life cycle are more effective than exhaustion therapy, a form of therapy mitigating immune response exhaustion. Our results suggest that antiviral treatment is best started when viral load is declining rather than in a flare-up. The model suggests that a fast antibody production rate always leads to viral clearance, highlighting the promise of antibody therapies currently in clinical trials.
Submitted 4 August, 2021;
originally announced August 2021.
-
Therapeutic Interfering Particles Exploiting Viral Replication and Assembly Mechanisms Show Promising Performance: A Modelling Study
Authors:
Farzad Fatehi,
Richard J. Bingham,
Pierre-Philippe Dechant,
Peter G. Stockley,
Reidun Twarock
Abstract:
Defective interfering particles arise spontaneously during a viral infection as mutants lacking essential parts of the viral genome. Their ability to replicate in the presence of the wild-type (WT) virus (at the expense of viable viral particles) is mimicked and exploited by therapeutic interfering particles. We propose a strategy for the design of therapeutic interfering RNAs (tiRNAs) against positive-sense single-stranded RNA viruses that assemble via packaging signal-mediated assembly. These tiRNAs contain both an optimised version of the virus assembly manual that is encoded by multiple dispersed RNA packaging signals and a replication signal for viral polymerase, but lack any protein coding information. We use an intracellular model for hepatitis C viral (HCV) infection that captures key aspects of the competition dynamics between tiRNAs and viral genomes for virally produced capsid protein and polymerase. We show that only a small increase in the assembly and replication efficiency of the tiRNAs compared with WT virus is required in order to achieve a treatment efficacy greater than 99%. This demonstrates that the proposed tiRNA design could be a promising treatment option for RNA viral infections.
Submitted 15 December, 2021; v1 submitted 4 August, 2021;
originally announced August 2021.
-
Comparing antiviral strategies against COVID-19 via multiscale within-host modelling
Authors:
Farzad Fatehi,
Richard J Bingham,
Eric C Dykeman,
Peter G Stockley,
Reidun Twarock
Abstract:
Within-host models of COVID-19 infection dynamics enable the merits of different forms of antiviral therapy to be assessed in individual patients. A stochastic agent-based model of COVID-19 intracellular dynamics is introduced here, that incorporates essential steps of the viral life cycle targeted by treatment options. Integration of model predictions with an intercellular ODE model of within-host infection dynamics, fitted to patient data, generates a generic profile of disease progression in patients that have recovered in the absence of treatment. This is contrasted with the profiles obtained after variation of model parameters pertinent to the immune response, such as effector cell and antibody proliferation rates, mimicking disease progression in immunocompromised patients. These profiles are then compared with disease progression in the presence of antiviral and convalescent plasma therapy against COVID-19 infections. The model reveals that using both therapies in combination can be very effective in reducing the length of infection, but these synergistic effects decline with a delayed treatment start. Conversely, early treatment with either therapy alone can actually increase the duration of infection, with infectious virions still present after the decline of other markers of infection. This suggests that usage of these treatments should remain carefully controlled in a clinical environment.
Submitted 21 December, 2021; v1 submitted 18 October, 2020;
originally announced October 2020.
-
Action Image Representation: Learning Scalable Deep Grasping Policies with Zero Real World Data
Authors:
Mohi Khansari,
Daniel Kappler,
Jianlan Luo,
Jeff Bingham,
Mrinal Kalakrishnan
Abstract:
This paper introduces Action Image, a new grasp proposal representation that allows learning an end-to-end deep-grasping policy. Our model achieves $84\%$ grasp success on $172$ real world objects while being trained only in simulation on $48$ objects with just naive domain randomization. Similar to computer vision problems, such as object detection, Action Image builds on the idea that object features are invariant to translation in image space. Therefore, grasp quality is invariant when evaluating the object-gripper relationship; a successful grasp for an object depends on its local context, but is independent of the surrounding environment. Action Image represents a grasp proposal as an image and uses a deep convolutional network to infer grasp quality. We show that by using an Action Image representation, trained networks are able to extract local, salient features of grasping tasks that generalize across different objects and environments. We show that this representation works on a variety of inputs, including color images (RGB), depth images (D), and combined color-depth (RGB-D). Our experimental results demonstrate that networks utilizing an Action Image representation exhibit strong domain transfer between training on simulated data and inference on real-world sensor streams. Finally, our experiments show that a network trained with Action Image improves grasp success ($84\%$ vs. $53\%$) over a baseline model with the same structure, but using actions encoded as vectors.
Submitted 13 May, 2020;
originally announced May 2020.
-
Environment-aware Reconfigurable Noise Suppression
Authors:
Jun Yang,
Joshua Bingham
Abstract:
The paper proposes an efficient, robust, and reconfigurable technique to suppress various types of noises for any sampling rate. The theoretical analyses, subjective and objective test results show that the proposed noise suppression (NS) solution significantly enhances the speech transmission index (STI), speech intelligibility (SI), signal-to-noise ratio (SNR), and subjective listening experience. The STI and SI consist of 5 levels, i.e., bad, poor, fair, good, and excellent. The most common noisy condition is of SNR ranging from -5 to 8 dB. For the input SNR between -5 and 2.5 dB, the proposed NS improves the STI and SI from "fair" to "good". For the input SNR between 2.5 and 8 dB, the STI and SI are improved from "good" to "excellent" by the proposed NS. The proposed NS can be adopted in both capture and playback paths for voice over internet protocol, voice-trigger, and automatic speech recognition applications.
Submitted 29 January, 2020;
originally announced January 2020.
-
Digit Serial Methods with Applications to Division and Square Root (with mechanically checked correctness proofs)
Authors:
Warren E. Ferguson Jr,
Jesse Bingham,
Levent Erkök,
John R. Harrison,
Joe Leslie-Hurd
Abstract:
We present a generic digit serial method (DSM) to compute the digits of a real number $V$. Bounds on these digits, and on the errors in the associated estimates of $V$ formed from these digits, are derived. To illustrate our results, we derive such bounds for a parameterized family of high-radix algorithms for division and square root. These bounds enable a DSM designer to determine, for example, whether a given choice of parameters allows rapid formation and rounding of its approximation to $V$. All our claims are mechanically verified using the HOL-Light theorem prover, and are included in the appendix with commentary.
Submitted 31 July, 2017;
originally announced August 2017.
-
Dynamics of an asymmetric bilayer lipid membrane in a viscous solvent
Authors:
R. J. Bingham,
S. W. Smye,
P. D. Olmsted
Abstract:
Bilayer lipid membranes (BLMs) are an essential component of many biological systems, forming a functional barrier between the cell and the surrounding environment. When the membrane relaxes from a structural perturbation, the dynamics of the relaxation depends on the bilayer structure. We present a model of a BLM in a viscous solvent, including an explicit description of a 'thick' membrane, where the fluctuations in the thickness of a monolayer leaflet are coupled to changes in the lipid density within that monolayer. We find dispersion relations describing three intuitive forms of bilayer motion, including a mode describing motion of the intermonolayer surface not noted previously in the literature. Two intrinsic length scales emerge that help characterise the dynamics; the well known Saffman-Delbruck length and another, $\ell_r$, resulting from the intermonolayer friction. The framework also allows for asymmetry in the BLM parameters between the monolayer leaflets, which is found to couple dynamic modes of bilayer motion.
Submitted 30 June, 2015;
originally announced July 2015.
-
Undulation instability in a bilayer lipid membrane due to electric field interaction with lipid dipoles
Authors:
Richard J. Bingham,
Peter D. Olmsted,
Stephen W. Smye
Abstract:
Bilayer lipid membranes [BLMs] are an essential component of all biological systems, forming a functional barrier for cells and organelles from the surrounding environment. The lipid molecules that form membranes contain both permanent and induced dipoles, and an electric field can induce the formation of pores when the transverse field is sufficiently strong (electroporation). Here, a phenomenological free energy is constructed to model the response of a BLM to a transverse static electric field. The model contains a continuum description of the membrane dipoles and a coupling between the headgroup dipoles and the membrane tilt. The membrane is found to become unstable through buckling modes, which are weakly coupled to thickness fluctuations in the membrane. The thickness fluctuations, along with the increase in interfacial area produced by membrane buckling, increase the probability of localized membrane breakdown, which may lead to pore formation. The instability is found to depend strongly on the strength of the coupling between the dipolar headgroups and the membrane tilt as well as the degree of dipolar ordering in the membrane.
Submitted 11 May, 2010; v1 submitted 5 May, 2010;
originally announced May 2010.
-
The origin of the red luminescence in Mg-doped GaN
Authors:
S. Zeng,
G. N. Aliev,
J. J. Davies,
D. Wolverson,
S. J. Bingham,
D. A. Adbulmalik,
P. G. Coleman,
P. J. Parbrook,
T. Wang
Abstract:
Optically-detected magnetic resonance (ODMR) and positron annihilation spectroscopy (PAS) experiments have been employed to study magnesium-doped GaN layers grown by metal-organic vapor phase epitaxy. As the Mg doping level is changed, the combined experiments reveal a strong correlation between the vacancy concentrations and the intensity of the red photoluminescence band at 1.8 eV. The analysis provides strong evidence that the emission is due to recombination in which electrons both from effective mass donors and from deeper donors recombine with deep centers, the deep centers being vacancy-related defects.
Submitted 4 February, 2006;
originally announced February 2006.