-
Modeling the Real World with High-Density Visual Particle Dynamics
Authors:
William F. Whitney,
Jacob Varley,
Deepali Jain,
Krzysztof Choromanski,
Sumeet Singh,
Vikas Sindhwani
Abstract:
We present High-Density Visual Particle Dynamics (HD-VPD), a learned world model that can emulate the physical dynamics of real scenes by processing massive latent point clouds containing 100K+ particles. To enable efficiency at this scale, we introduce a novel family of Point Cloud Transformers (PCTs) called Interlacers leveraging intertwined linear-attention Performer layers and graph-based neighbour attention layers. We demonstrate the capabilities of HD-VPD by modeling the dynamics of high degree-of-freedom bi-manual robots with two RGB-D cameras. Compared to the previous graph neural network approach, our Interlacer dynamics is twice as fast with the same prediction quality, and can achieve higher quality using 4x as many particles. We illustrate how HD-VPD can evaluate motion plan quality with robotic box pushing and can grasping tasks. See videos and particle dynamics rendered by HD-VPD at https://sites.google.com/view/hd-vpd.
Submitted 28 June, 2024;
originally announced June 2024.
-
Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity
Authors:
Jake Varley,
Sumeet Singh,
Deepali Jain,
Krzysztof Choromanski,
Andy Zeng,
Somnath Basu Roy Chowdhury,
Avinava Dubey,
Vikas Sindhwani
Abstract:
We present an embodied AI system which receives open-ended natural language instructions from a human, and controls two arms to collaboratively accomplish potentially long-horizon tasks over a large workspace. Our system is modular: it deploys state-of-the-art Large Language Models for task planning, Vision-Language models for semantic perception, and Point Cloud transformers for grasping. With semantic and physical safety in mind, these modules are interfaced with a real-time trajectory optimizer and a compliant tracking controller to enable human-robot proximity. We demonstrate performance for the following tasks: bi-arm sorting, bottle opening, and trash disposal tasks. These are done zero-shot where the models used have not been trained with any real world data from this bi-arm robot, scenes or workspace. Composing both learning- and non-learning-based components in a modular fashion with interpretable inputs and outputs allows the user to easily debug points of failures and fragilities. One may also in-place swap modules to improve the robustness of the overall platform, for instance with imitation-learned policies.
Submitted 4 April, 2024;
originally announced April 2024.
-
Simulating Charged Defects at Database Scale
Authors:
Jimmy-Xuan Shen,
Lars F. Voss,
Joel Basile Varley
Abstract:
Point defects have a strong influence on the physical properties of materials, often dominating the electronic and optical behavior in semiconductors and insulators. The simulation and analysis of point defects is therefore crucial for understanding the growth and operation of materials, especially for optoelectronics applications. In this work, we present a general-purpose Python framework for the analysis of point defects in crystalline materials, as well as a generalized workflow for their treatment with high-throughput simulations. The distinguishing feature of our approach is an emphasis on a unique, unit-cell, structure-only definition of point defects which decouples the defect definition and the specific supercell representation used to simulate the defect. This allows the results of first-principles calculations to be aggregated into a database without extensive provenance information and is a crucial step in building a persistent database of point defects that can grow over time, a key component towards realizing the idea of a "defect genome" that can yield more complex relationships governing the behavior of defects in materials. We demonstrate several examples of the approach for three technologically relevant materials and highlight current pitfalls that must be considered when employing these methodologies, as well as their potential solutions.
Submitted 8 March, 2024;
originally announced March 2024.
-
SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention
Authors:
Isabel Leal,
Krzysztof Choromanski,
Deepali Jain,
Avinava Dubey,
Jake Varley,
Michael Ryoo,
Yao Lu,
Frederick Liu,
Vikas Sindhwani,
Quan Vuong,
Tamas Sarlos,
Ken Oslund,
Karol Hausman,
Kanishka Rao
Abstract:
We present Self-Adaptive Robust Attention for Robotics Transformers (SARA-RT): a new paradigm for addressing the emerging challenge of scaling up Robotics Transformers (RT) for on-robot deployment. SARA-RT relies on the new method of fine-tuning proposed by us, called up-training. It converts pre-trained or already fine-tuned Transformer-based robotic policies of quadratic time complexity (including massive billion-parameter vision-language-action models or VLAs), into their efficient linear-attention counterparts maintaining high quality. We demonstrate the effectiveness of SARA-RT by speeding up: (a) the class of recently introduced RT-2 models, the first VLA robotic policies pre-trained on internet-scale data, as well as (b) Point Cloud Transformer (PCT) robotic policies operating on large point clouds. We complement our results with the rigorous mathematical analysis providing deeper insight into the phenomenon of SARA.
Submitted 4 December, 2023;
originally announced December 2023.
-
Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners
Authors:
Allen Z. Ren,
Anushri Dixit,
Alexandra Bodrova,
Sumeet Singh,
Stephen Tu,
Noah Brown,
Peng Xu,
Leila Takayama,
Fei Xia,
Jake Varley,
Zhenjia Xu,
Dorsa Sadigh,
Andy Zeng,
Anirudha Majumdar
Abstract:
Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, which is a framework for measuring and aligning the uncertainty of LLM-based planners such that they know when they don't know and ask for help when needed. KnowNo builds on the theory of conformal prediction to provide statistical guarantees on task completion while minimizing human help in complex multi-step planning settings. Experiments across a variety of simulated and real robot setups that involve tasks with different modes of ambiguity (e.g., from spatial to numeric uncertainties, from human preferences to Winograd schemas) show that KnowNo performs favorably over modern baselines (which may involve ensembles or extensive prompt tuning) in terms of improving efficiency and autonomy, while providing formal assurances. KnowNo can be used with LLMs out of the box without model-finetuning, and suggests a promising lightweight approach to modeling uncertainty that can complement and scale with the growing capabilities of foundation models. Website: https://robot-help.github.io
Submitted 4 September, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Dangling bonds as possible contributors to charge noise in silicon and silicon-germanium quantum dot qubits
Authors:
Joel B. Varley,
Keith G. Ray,
Vincenzo Lordi
Abstract:
Spin qubits based on Si and Si$_{1-x}$Ge$_{x}$ quantum dot architectures exhibit among the best coherence times of competing quantum computing technologies, yet they still suffer from charge noise that limits their qubit gate fidelities. Identifying the origins of these charge fluctuations is therefore a critical step toward improving Si quantum-dot-based qubits. Here we use hybrid functional calculations to investigate possible atomistic sources of charge noise, focusing on charge trapping at Si and Ge dangling bonds (DBs). We evaluate the role of global and local environment in the defect levels associated with DBs in Si, Ge, and Si$_{1-x}$Ge$_{x}$ alloys, and consider their trapping and excitation energies within the framework of configuration coordinate diagrams. We additionally consider the influence of strain and oxidation in charge-trapping energetics by analyzing Si and Ge$_{\rm Si}$ DBs in SiO$_2$ and strained Si layers in typical Si$_{1-x}$Ge$_{x}$ quantum dot heterostructures. Our results identify that Ge dangling bonds are more problematic charge-trapping centers both in typical Si$_{1-x}$Ge$_{x}$ alloys and associated oxidation layers, and they may be exacerbated by compositional inhomogeneities. These results suggest the importance of alloy homogeneity and possible passivation schemes for DBs in Si-based quantum dot qubits and are of general relevance to mitigating possible trap levels in other Si, Ge, and Si$_{1-x}$Ge$_{x}$-based metal-oxide-semiconductor stacks and related devices.
Submitted 9 June, 2023;
originally announced June 2023.
-
Acceptor and compensating donor doping of single crystalline SnO (001) films grown by molecular beam epitaxy and its perspectives for optoelectronics and gas-sensing
Authors:
Kingsley Egbo,
Jonas Lähnemann,
Andreas Falkenstein,
Joel Varley,
Oliver Bierwagen
Abstract:
(La and Ga)-doped tin monoxide (stannous oxide, tin (II) oxide, SnO) thin films were grown by plasma-assisted and suboxide molecular beam epitaxy with dopant concentrations ranging from $\approx5\times10^{18}$cm$^{-3}$ to $2\times10^{21}$cm$^{-3}$. In this concentration range, the incorporation of Ga into SnO was limited by the formation of secondary phases observed at $1.2\times10^{21}$cm$^{-3}$ Ga, while the incorporation of La showed a lower solubility limit. Transport measurements on the doped samples reveal that Ga acts as an acceptor and La as a compensating donor. While Ga doping led to an increase of the hole concentration from $1\times10^{18}$cm$^{-3}-1\times10^{19}$cm$^{-3}$ for unintentionally doped (UID) SnO up to $5\times10^{19}$cm$^{-3}$, La-concentrations well in excess of the UID acceptor concentration resulted in semi-insulating films without detectable $n$-type conductivity. Ab-initio calculations qualitatively agree with our dopant assignment of Ga and La, and further predict In$_\text{Sn}$ to act as an acceptor as well as Al$_\text{Sn}$ and B$_\text{Sn}$ as donor. These results show the possibilities of controlling the hole concentration in $p$-type SnO, which can be useful for a range of optoelectronic and gas-sensing applications.
Submitted 12 December, 2022;
originally announced December 2022.
-
Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning
Authors:
David Brandfonbrener,
Stephen Tu,
Avi Singh,
Stefan Welker,
Chad Boodoo,
Nikolai Matni,
Jake Varley
Abstract:
We consider how to most efficiently leverage teleoperator time to collect data for learning robust image-based value functions and policies for sparse reward robotic tasks. To accomplish this goal, we modify the process of data collection to include more than just successful demonstrations of the desired task. Instead we develop a novel protocol that we call Visual Backtracking Teleoperation (VBT), which deliberately collects a dataset of visually similar failures, recoveries, and successes. VBT data collection is particularly useful for efficiently learning accurate value functions from small datasets of image-based observations. We demonstrate VBT on a real robot to perform continuous control from image observations for the deformable manipulation task of T-shirt grasping. We find that by adjusting the data collection process we improve the quality of both the learned value functions and policies over a variety of baseline methods for data collection. Specifically, we find that offline reinforcement learning on VBT data outperforms standard behavior cloning on successful demonstration data by 13% when both methods are given equal-sized datasets of 60 minutes of data from the real robot.
Submitted 5 October, 2022;
originally announced October 2022.
-
Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation
Authors:
Xuesu Xiao,
Tingnan Zhang,
Krzysztof Choromanski,
Edward Lee,
Anthony Francis,
Jake Varley,
Stephen Tu,
Sumeet Singh,
Peng Xu,
Fei Xia,
Sven Mikael Persson,
Dmitry Kalashnikov,
Leila Takayama,
Roy Frostig,
Jie Tan,
Carolina Parada,
Vikas Sindhwani
Abstract:
Despite decades of research, existing navigation systems still face real-world challenges when deployed in the wild, e.g., in cluttered home environments or in human-occupied public spaces. To address this, we present a new class of implicit control policies combining the benefits of imitation learning with the robust handling of system constraints from Model Predictive Control (MPC). Our approach, called Performer-MPC, uses a learned cost function parameterized by vision context embeddings provided by Performers -- a low-rank implicit-attention Transformer. We jointly train the cost function and construct the controller relying on it, effectively solving end-to-end the corresponding bi-level optimization problem. We show that the resulting policy improves standard MPC performance by leveraging a few expert demonstrations of the desired navigation behavior in different challenging real-world scenarios. Compared with a standard MPC policy, Performer-MPC achieves >40% better goal reached in cluttered environments and >65% better on social metrics when navigating around humans.
Submitted 23 September, 2022; v1 submitted 22 September, 2022;
originally announced September 2022.
-
Multiple View Performers for Shape Completion
Authors:
David Watkins,
Peter Allen,
Krzysztof Choromanski,
Jacob Varley,
Nicholas Waytowich
Abstract:
We propose the Multiple View Performer (MVP) - a new architecture for 3D shape completion from a series of temporally sequential views. MVP accomplishes this task by using linear-attention Transformers called Performers. Our model allows the current observation of the scene to attend to the previous ones for more accurate infilling. The history of past observations is compressed via the compact associative memory approximating modern continuous Hopfield memory, but crucially of size independent from the history length. We compare our model with several baselines for shape completion over time, demonstrating the generalization gains that MVP provides. To the best of our knowledge, MVP is the first multiple view voxel reconstruction method that does not require registration of multiple depth views and the first causal Transformer based model for 3D shape completion.
Submitted 13 September, 2022;
originally announced September 2022.
-
Tackling Disorder in $γ$-Ga$_2$O$_3$
Authors:
Laura E. Ratcliff,
Takayoshi Oshima,
Felix Nippert,
Benjamin M. Janzen,
Elias Kluth,
Rüdiger Goldhahn,
Martin Feneberg,
Piero Mazzolini,
Oliver Bierwagen,
Charlotte Wouters,
Musbah Nofal,
Martin Albrecht,
Jack E. N. Swallow,
Leanne A. H. Jones,
Pardeep K. Thakur,
Tien-Lin Lee,
Curran Kalha,
Christoph Schlueter,
Tim D. Veal,
Joel B. Varley,
Markus R. Wagner,
Anna Regoutz
Abstract:
Ga$_2$O$_3$ and its polymorphs are attracting increasing attention. The rich structural space of polymorphic oxide systems such as Ga$_2$O$_3$ offers potential for electronic structure engineering, which is of particular interest for a range of applications, such as power electronics. $γ$-Ga$_2$O$_3$ presents a particular challenge across synthesis, characterisation, and theory due to its inherent disorder and resulting complex structure -- electronic structure relationship. Here, density functional theory is used in combination with a machine learning approach to screen nearly one million potential structures, thereby developing a robust atomistic model of the $γ$-phase. Theoretical results are compared with surface and bulk sensitive soft and hard X-ray photoelectron spectroscopy, X-ray absorption spectroscopy, spectroscopic ellipsometry, and photoluminescence excitation spectroscopy experiments representative of the occupied and unoccupied states of $γ$-Ga$_2$O$_3$. The first onset of strong absorption at room temperature is found at 5.1 eV from spectroscopic ellipsometry, which agrees well with the excitation maximum at 5.17 eV obtained by PLE spectroscopy, where the latter shifts to 5.33 eV at 5 K. This work presents a leap forward in the treatment of complex, disordered oxides and is a crucial step towards exploring how their electronic structure can be understood in terms of local coordination and overall structure.
Submitted 9 May, 2022;
originally announced May 2022.
-
Multiscale Sensor Fusion and Continuous Control with Neural CDEs
Authors:
Sumeet Singh,
Francis McCann Ramirez,
Jacob Varley,
Andy Zeng,
Vikas Sindhwani
Abstract:
Though robot learning is often formulated in terms of discrete-time Markov decision processes (MDPs), physical robots require near-continuous multiscale feedback control. Machines operate on multiple asynchronous sensing modalities, each with different frequencies, e.g., video frames at 30Hz, proprioceptive state at 100Hz, force-torque data at 500Hz, etc. While the classic approach is to batch observations into fixed-time windows then pass them through feed-forward encoders (e.g., with deep networks), we show that there exists a more elegant approach -- one that treats policy learning as modeling latent state dynamics in continuous-time. Specifically, we present 'InFuser', a unified architecture that trains continuous time-policies with Neural Controlled Differential Equations (CDEs). InFuser evolves a single latent state representation over time by (In)tegrating and (Fus)ing multi-sensory observations (arriving at different frequencies), and inferring actions in continuous-time. This enables policies that can react to multi-frequency multi sensory feedback for truly end-to-end visuomotor control, without discrete-time assumptions. Behavior cloning experiments demonstrate that InFuser learns robust policies for dynamic tasks (e.g., swinging a ball into a cup) notably outperforming several baselines in settings where observations from one sensing modality can arrive at much sparser intervals than others.
Submitted 16 March, 2022;
originally announced March 2022.
-
Implicit Kinematic Policies: Unifying Joint and Cartesian Action Spaces in End-to-End Robot Learning
Authors:
Aditya Ganapathi,
Pete Florence,
Jake Varley,
Kaylee Burns,
Ken Goldberg,
Andy Zeng
Abstract:
Action representation is an important yet often overlooked aspect in end-to-end robot learning with deep networks. Choosing one action space over another (e.g. target joint positions, or Cartesian end-effector poses) can result in surprisingly stark performance differences between various downstream tasks -- and as a result, considerable research has been devoted to finding the right action space for a given application. However, in this work, we instead investigate how our models can discover and learn for themselves which action space to use. Leveraging recent work on implicit behavioral cloning, which takes both observations and actions as input, we demonstrate that it is possible to present the same action in multiple different spaces to the same policy -- allowing it to learn inductive patterns from each space. Specifically, we study the benefits of combining Cartesian and joint action spaces in the context of learning manipulation skills. To this end, we present Implicit Kinematic Policies (IKP), which incorporates the kinematic chain as a differentiable module within the deep network. Quantitative experiments across several simulated continuous control tasks -- from scooping piles of small objects, to lifting boxes with elbows, to precise block insertion with miscalibrated robots -- suggest IKP not only learns complex prehensile and non-prehensile manipulation from pixels better than baseline alternatives, but also can learn to compensate for small joint encoder offset errors. Finally, we also run qualitative experiments on a real UR5e to demonstrate the feasibility of our algorithm on a physical robotic system with real data. See https://tinyurl.com/4wz3nf86 for code and supplementary material.
Submitted 3 March, 2022;
originally announced March 2022.
-
Persistent room temperature photodarkening in Cu-doped β-Ga2O3
Authors:
J. Jesenovec,
C. Pansegrau,
M. D. McCluskey,
J. S. McCloy,
T. D. Gustafson,
L. E. Halliburton,
J. B. Varley
Abstract:
Beta-Ga2O3 is an ultra-wide bandgap semiconductor with emerging applications in power electronics. The introduction of acceptor dopants yields semi-insulating substrates necessary for thin-film devices. In the present work, exposure of Cu-doped Ga2O3 to UV light > 4 eV is shown to cause large, persistent photo-induced darkening at room temperature. Electron paramagnetic resonance spectroscopy indicates that light exposure converts Cu2+ to Cu3+, a rare oxidation state that is responsible for the optical absorption. The photodarkening is accompanied by the appearance of O-H vibrational modes in the infrared spectrum. Hybrid functional calculations show that Cu acceptors can favorably complex with hydrogen donors incorporated as interstitial (H_i) or substitutional (H_O) defects. When Cu_Ga-H_O complexes absorb light, hydrogen is released, contributing to the observed Cu3+ species and O-H modes.
Submitted 5 January, 2022;
originally announced January 2022.
-
Role of carbon and hydrogen in limiting $n$-type doping of monoclinic (Al$_x$Ga$_{1-x}$)$_2$O$_3$
Authors:
Sai Mu,
Mengen Wang,
Joel B. Varley,
John L. Lyons,
Darshana Wickramaratne,
Chris G. Van de Walle
Abstract:
We use hybrid density functional calculations to assess n-type doping in monoclinic (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys. We focus on Si, the most promising donor dopant, and study the structural properties, formation energies and charge-state transition levels of its various configurations. We also explore the impact of C and H, which are common impurities in metal-organic chemical vapor deposition (MOCVD). In Ga$_2$O$_3$, Si$_{Ga}$ is an effective shallow donor, but in Al$_2$O$_3$ Si$_{Al}$ acts as a DX center with a (+/-) transition level in the band gap. Interstitial H acts as a shallow donor in Ga$_2$O$_3$, but behaves as a compensating acceptor in n-type Al$_2$O$_3$. Interpolation indicates that Si is an effective donor in (Al$_x$Ga$_{1-x}$)$_2$O$_3$ up to 70% Al, but it can be compensated by H already at 1% Al. We also assess the diffusivity of H and study complex formation. Si$_{cation}$-H complexes have relatively low binding energies. Substitutional C on a cation site acts as a shallow donor in Ga$_2$O$_3$, but can be stable in a negative charge state in (Al$_x$Ga$_{1-x}$)$_2$O$_3$ when x>5%. Substitutional C on an O site (C$_O$) always acts as an acceptor in n-type (Al$_x$Ga$_{1-x}$)$_2$O$_3$, but will incorporate only under relatively O-poor conditions. C$_O$-H complexes can actually incorporate more easily, explaining observations of C-related compensation in Ga$_2$O$_3$ grown by MOCVD. We also investigate C$_{cation}$-H complexes, finding they have high binding energies and act as compensating acceptors when x>56%; otherwise the H just passivates the unintentional C donors. C-H complex formation explains why MOCVD grown Ga$_2$O$_3$ can exhibit record-low free-carrier concentrations, in spite of the unavoidable incorporation of C. Our study highlights that, while Si is a suitable shallow donor in (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys, control of unintentional impurities is essential to avoid compensation.
Submitted 23 January, 2022; v1 submitted 13 November, 2021;
originally announced November 2021.
-
Hybrid Random Features
Authors:
Krzysztof Choromanski,
Haoxian Chen,
Han Lin,
Yuanzhe Ma,
Arijit Sehanobish,
Deepali Jain,
Michael S Ryoo,
Jake Varley,
Andy Zeng,
Valerii Likhosherstov,
Dmitry Kalashnikov,
Vikas Sindhwani,
Adrian Weller
Abstract:
We propose a new class of random feature methods for linearizing softmax and Gaussian kernels called hybrid random features (HRFs) that automatically adapt the quality of kernel estimation to provide the most accurate approximation in the defined regions of interest. Special instantiations of HRFs lead to well-known methods such as trigonometric random features (Rahimi and Recht, 2007) or positive random features (Choromanski et al., 2021), the latter recently introduced in the context of linear-attention Transformers. By generalizing Bochner's Theorem for softmax/Gaussian kernels and leveraging random features for compositional kernels, the HRF-mechanism provides strong theoretical guarantees - unbiased approximation and strictly smaller worst-case relative errors than its counterparts. We conduct exhaustive empirical evaluation of HRF ranging from pointwise kernel estimation experiments, through tests on data admitting clustering structure to benchmarking implicit-attention Transformers (also for downstream Robotics applications), demonstrating its quality in a wide spectrum of machine learning problems.
Submitted 30 January, 2022; v1 submitted 8 October, 2021;
originally announced October 2021.
-
Mobile Manipulation Leveraging Multiple Views
Authors:
David Watkins,
Peter K Allen,
Henrique Maia,
Madhavan Seshadri,
Jonathan Sanabria,
Nicholas Waytowich,
Jacob Varley
Abstract:
While both navigation and manipulation are challenging topics in isolation, many tasks require the ability to both navigate and manipulate in concert. To this end, we propose a mobile manipulation system that leverages novel navigation and shape completion methods to manipulate an object with a mobile robot. Our system utilizes uncertainty in the initial estimation of a manipulation target to calculate a predicted next-best-view. Without the need for localization, the robot then uses the predicted panoramic view at the next-best-view location to navigate to the desired location, captures a second view of the object, creates a new model that predicts the shape of the object more accurately than a single image alone, and uses this model for grasp planning. We show that the system is highly effective for mobile manipulation tasks through simulation experiments using real world data, as well as ablations on each component of our system.
Submitted 7 March, 2022; v1 submitted 1 October, 2021;
originally announced October 2021.
-
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Authors:
Dmitry Kalashnikov,
Jacob Varley,
Yevgen Chebotar,
Benjamin Swanson,
Rico Jonschkowski,
Chelsea Finn,
Sergey Levine,
Karol Hausman
Abstract:
General-purpose robotic systems must master a large repertoire of diverse skills to be useful in a range of daily tasks. While reinforcement learning provides a powerful framework for acquiring individual behaviors, the time needed to acquire each skill makes the prospect of a generalist robot trained with RL daunting. In this paper, we study how a large-scale collective robotic learning system can acquire a repertoire of behaviors simultaneously, sharing exploration, experience, and representations across tasks. In this framework new tasks can be continuously instantiated from previously learned tasks improving overall performance and capabilities of the system. To instantiate this system, we develop a scalable and intuitive framework for specifying new tasks through user-provided examples of desired outcomes, devise a multi-robot collective learning system for data collection that simultaneously collects experience for multiple tasks, and develop a scalable and generalizable multi-task deep reinforcement learning method, which we call MT-Opt. We demonstrate how MT-Opt can learn a wide range of skills, including semantic picking (i.e., picking an object from a particular category), placing into various fixtures (e.g., placing a food item onto a plate), covering, aligning, and rearranging. We train and evaluate our system on a set of 12 real-world tasks with data collected from 7 robots, and demonstrate the performance of our system both in terms of its ability to generalize to structurally similar new tasks, and acquire distinct new tasks more quickly by leveraging past experience. We recommend viewing the videos at https://karolhausman.github.io/mt-opt/
Submitted 27 April, 2021; v1 submitted 16 April, 2021;
originally announced April 2021.
-
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
Authors:
Yevgen Chebotar,
Karol Hausman,
Yao Lu,
Ted Xiao,
Dmitry Kalashnikov,
Jake Varley,
Alex Irpan,
Benjamin Eysenbach,
Ryan Julian,
Chelsea Finn,
Sergey Levine
Abstract:
We consider the problem of learning useful robotic skills from previously collected offline data without access to manually specified rewards or additional online exploration, a setting that is becoming increasingly important for scaling robot learning by reusing past robotic data. In particular, we propose the objective of learning a functional understanding of the environment by learning to reach any goal state in a given dataset. We employ goal-conditioned Q-learning with hindsight relabeling and develop several techniques that enable training in a particularly challenging offline setting. We find that our method can operate on high-dimensional camera images and learn a variety of skills on real robots that generalize to previously unseen scenes and objects. We also show that our method can learn to reach long-horizon goals across multiple episodes through goal chaining, and learn rich representations that can help with downstream tasks through pre-training or auxiliary objectives. The videos of our experiments can be found at https://actionable-models.github.io
Submitted 10 June, 2021; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Visionary: Vision architecture discovery for robot learning
Authors:
Iretiayo Akinola,
Anelia Angelova,
Yao Lu,
Yevgen Chebotar,
Dmitry Kalashnikov,
Jacob Varley,
Julian Ibarz,
Michael S. Ryoo
Abstract:
We propose a vision-based architecture search algorithm for robot manipulation learning, which discovers interactions between low dimension action inputs and high dimensional visual inputs. Our approach automatically designs architectures while training on the task - discovering novel ways of combining and attending image feature representations with actions as well as features from previous layers. The obtained new architectures demonstrate better task success rates, in some cases with a large margin, compared to a recent high performing baseline. Our real robot experiments also confirm that it improves grasping performance by 6%. This is the first approach to demonstrate a successful neural architecture search and attention connectivity search for a real-robot task.
Submitted 26 March, 2021;
originally announced March 2021.
-
Disentangled Planning and Control in Vision Based Robotics via Reward Machines
Authors:
Alberto Camacho,
Jacob Varley,
Deepali Jain,
Atil Iscen,
Dmitry Kalashnikov
Abstract:
In this work we augment a Deep Q-Learning agent with a Reward Machine (DQRM) to increase speed of learning vision-based policies for robot tasks, and overcome some of the limitations of DQN that prevent it from converging to good-quality policies. A reward machine (RM) is a finite state machine that decomposes a task into a discrete planning graph and equips the agent with a reward function to guide it toward task completion. The reward machine can be used for both reward shaping, and informing the policy what abstract state it is currently at. An abstract state is a high level simplification of the current state, defined in terms of task relevant features. These two supervisory signals of reward shaping and knowledge of current abstract state coming from the reward machine complement each other and can both be used to improve policy performance as demonstrated on several vision based robotic pick and place tasks. Particularly for vision based robotics applications, it is often easier to build a reward machine than to try and get a policy to learn the task without this structure.
Submitted 28 December, 2020;
originally announced December 2020.
-
An Ode to an ODE
Authors:
Krzysztof Choromanski,
Jared Quincy Davis,
Valerii Likhosherstov,
Xingyou Song,
Jean-Jacques Slotine,
Jacob Varley,
Honglak Lee,
Adrian Weller,
Vikas Sindhwani
Abstract:
We present a new paradigm for Neural ODE algorithms, called ODEtoODE, where time-dependent parameters of the main flow evolve according to a matrix flow on the orthogonal group O(d). This nested system of two flows, where the parameter-flow is constrained to lie on the compact manifold, provides stability and effectiveness of training and provably solves the gradient vanishing-explosion problem which is intrinsically related to training deep neural network architectures such as Neural ODEs. Consequently, it leads to better downstream models, as we show on the example of training reinforcement learning policies with evolution strategies, and in the supervised learning setting, by comparing with previous SOTA baselines. We provide strong convergence results for our proposed mechanism that are independent of the depth of the network, supporting our empirical studies. Our results show an intriguing connection between the theory of deep neural networks and the field of matrix flows on compact manifolds.
Submitted 22 June, 2020; v1 submitted 19 June, 2020;
originally announced June 2020.
-
Influence of Polymorphism on the Electronic Structure of Ga$_2$O$_3$
Authors:
Jack E. N. Swallow,
Christian Vorwerk,
Piero Mazzolini,
Patrick Vogt,
Oliver Bierwagen,
Alexander Karg,
Martin Eickhoff,
Jörg Schörmann,
Markus R. Wagner,
Joseph W. Roberts,
Paul R. Chalker,
Matthew J. Smiles,
Philip A. E. Murgatroyd,
Sara A. Razek,
Zachary W. Lebens-Higgins,
Louis F. J. Piper,
Leanne A. H. Jones,
Pardeep Kumar Thakur,
Tien-Lin Lee,
Joel B. Varley,
Jürgen Furthmüller,
Claudia Draxl,
Tim D. Veal,
Anna Regoutz
Abstract:
The search for new wide band gap materials is intensifying to satisfy the need for more advanced and energy efficient power electronic devices. Ga$_2$O$_3$ has emerged as an alternative to SiC and GaN, sparking a renewed interest in its fundamental properties beyond the main $β$-phase. Here, three polymorphs of Ga$_2$O$_3$, $α$, $β$ and $\varepsilon$, are investigated using X-ray diffraction, X-ray photoelectron and absorption spectroscopy, and ab initio theoretical approaches to gain insights into their structure - electronic structure relationships. Valence and conduction electronic structure as well as semi-core and core states are probed, providing a complete picture of the influence of local coordination environments on the electronic structure. State-of-the-art electronic structure theory, including all-electron density functional theory and many-body perturbation theory, provide detailed understanding of the spectroscopic results. The calculated spectra provide very accurate descriptions of all experimental spectra and additionally illuminate the origin of observed spectral features. This work provides a strong basis for the exploration of the Ga$_2$O$_3$ polymorphs as materials at the heart of future electronic device generations.
Submitted 22 September, 2020; v1 submitted 27 May, 2020;
originally announced May 2020.
-
Split Ga vacancies and the unusually strong anisotropy of positron annihilation spectra in $\boldsymbolβ$-Ga$_2$O$_3$
Authors:
Antti Karjalainen,
Vera Prozheeva,
Kristoffer Simula,
Ilja Makkonen,
Vincent Callewaert,
Joel B. Varley,
Filip Tuomisto
Abstract:
We report a systematic first principles study on positron annihilation parameters in the $β$-Ga$_2$O$_3$ lattice and Ga mono-vacancy defects complemented with orientation-dependent experiments of the Doppler broadening of the positron-electron annihilation. We find that both the $β$-Ga$_2$O$_3$ lattice and the considered defects exhibit unusually strong anisotropy in their Doppler broadening signals. This anisotropy is associated with low symmetry of the $β$-Ga$_2$O$_3$ crystal structure that leads to unusual kind of one-dimensional confinement of positrons even in the delocalized state in the lattice. In particular, the split Ga vacancies recently observed by scanning transmission electron microscopy produce unusually anisotropic positron annihilation signals. We show that in experiments, the positron annihilation signals in $β$-Ga$_2$O$_3$ samples seem to be often dominated by split Ga vacancies.
Submitted 4 November, 2020; v1 submitted 13 May, 2020;
originally announced May 2020.
-
Time Dependence in Non-Autonomous Neural ODEs
Authors:
Jared Quincy Davis,
Krzysztof Choromanski,
Jake Varley,
Honglak Lee,
Jean-Jacques Slotine,
Valerii Likhosterov,
Adrian Weller,
Ameesh Makadia,
Vikas Sindhwani
Abstract:
Neural Ordinary Differential Equations (ODEs) are elegant reinterpretations of deep networks where continuous time can replace the discrete notion of depth, ODE solvers perform forward propagation, and the adjoint method enables efficient, constant memory backpropagation. Neural ODEs are universal approximators only when they are non-autonomous, that is, the dynamics depends explicitly on time. We propose a novel family of Neural ODEs with time-varying weights, where time-dependence is non-parametric, and the smoothness of weight trajectories can be explicitly controlled to allow a tradeoff between expressiveness and efficiency. Using this enhanced expressiveness, we outperform previous Neural ODE variants in both speed and representational capacity, ultimately outperforming standard ResNet and CNN models on select image classification and video prediction tasks.
Submitted 6 May, 2020; v1 submitted 4 May, 2020;
originally announced May 2020.
-
Boron phosphide as a \emph{p}-type transparent conductor: optical absorption and transport through electron-phonon coupling
Authors:
Viet-Anh Ha,
Bora Karasulu,
Ryo Maezono,
Guillaume Brunin,
Joel Basile Varley,
Gian-Marco Rignanese,
Bartomeu Monserrat,
Geoffroy Hautier
Abstract:
Boron phosphide has recently been identified as a potential high hole mobility transparent conducting material. This promise arises from its low hole effective masses. However, BP has a relatively small 2 eV indirect band gap which will affect its transparency. In this work, we computationally study both optical absorption across the indirect gap and phonon-limited electronic transport to quantify the potential of boron phosphide as a \emph{p}-type transparent conductor. We find that phonon-mediated indirect optical absorption is weak in the visible spectrum and that the phonon-limited hole mobility is very high (around 900 cm$^2$/Vs) at room temperature. This exceptional mobility comes from a combination of low hole effective mass and very weak scattering by polar phonon modes. We rationalize the weak scattering by the less ionic bonding in boron phosphide compared to oxides. We suggest this could be a general advantage of non-oxides for \emph{p}-type transparent conducting applications. Using our computed properties, we assess the transparent conductor figure of merit of boron phosphide and show that it exceeds by one order of magnitude that of established \emph{p}-type transparent conductors, confirming the potential of this material.
Submitted 11 April, 2020;
originally announced April 2020.
-
Learning Precise 3D Manipulation from Multiple Uncalibrated Cameras
Authors:
Iretiayo Akinola,
Jacob Varley,
Dmitry Kalashnikov
Abstract:
In this work, we present an effective multi-view approach to closed-loop end-to-end learning of precise manipulation tasks that are 3D in nature. Our method learns to accomplish these tasks using multiple statically placed but uncalibrated RGB camera views without building an explicit 3D representation such as a pointcloud or voxel grid. This multi-camera approach achieves superior task performance on difficult stacking and insertion tasks compared to single-view baselines. Single view robotic agents struggle with occlusion and with estimating relative poses between points of interest. While full 3D scene representations (voxels or pointclouds) are obtainable from registered output of multiple depth sensors, several challenges complicate operating off such explicit 3D representations. These challenges include imperfect camera calibration, poor depth maps due to object properties such as reflective surfaces, and slower inference speeds over 3D representations compared to 2D images. Our use of static but uncalibrated cameras does not require camera-robot or camera-camera calibration, making the proposed approach easy to set up, and our use of \textit{sensor dropout} during training makes it resilient to the loss of camera-views after deployment.
Submitted 31 March, 2021; v1 submitted 20 February, 2020;
originally announced February 2020.
-
Degenerate doping in β-Ga2O3 Single Crystals through Hf-doping
Authors:
Muad Saleh,
Joel B. Varley,
Jani Jesenovec,
Arkka Bhattacharyya,
Sriram Krishnamoorthy,
Santosh Swain,
Kelvin Lynn
Abstract:
N type conductivity of β-Ga2O3 grown from the melt is typically achieved using Sn and Si. In this paper, we experimentally and computationally investigate Hf doping of β-Ga2O3 single crystals using UV-Vis-NIR absorption and Hall Effect measurements and hybrid functional calculations. Unintentionally-doped and Hf-doped samples with a nominal concentration of 0.5at% were grown from the melt using vertical gradient freeze (VGF) and Czochralski method in mixed Ar+O2 atmosphere. We demonstrate Hf dopants, predicted to incorporate on the octahedral GaII site as a shallow donor, achieve degenerate doping in β-Ga2O3 with a measured electron concentration 2 x 10^19 cm^-3, mobility 80-65 cm^2/Vs, and resistivity down to 5 mOhm-cm in our samples. The concentration of Hf was measured to be 1.3 x 10^19 atoms/cm^3 using glow discharge mass spectroscopy (GDMS) on doped samples, confirming Hf to be the cause of n-type conductivity (electron concentration ~2 x 10^19 cm^-3).
Submitted 30 January, 2020;
originally announced January 2020.
-
On Quantifying Large Lattice Relaxations in Photovoltaic Devices
Authors:
Marco Nardone,
Yasas Patikirige,
Kyoung E. Kweon,
Curtis Walkons,
Theresa Magorian Friedlmeier,
Joel B. Varley,
Vincenzo Lordi,
Shubhra Bansal
Abstract:
Temporal variations of Cu(In,Ga)Se$_2$ photovoltaic device properties during light exposure at various temperatures and voltage biases for times up to 100 h were analyzed using the kinetic theory of large lattice relaxations. Open-circuit voltage and p-type doping increased with charge injection and decreased with temperature at low injection conditions. Lattice relaxation can account for both trends and activation energies extracted from the data were approximately 0.9 and 1.2 eV for devices with lower and higher sodium content, respectively. In these devices, increased sodium content resulted in higher initial p-type doping with greater stability. First principles calculations providing revised activation energies for the ($V_{Se}-V_{Cu}$) complex suggest that this defect does not account for the metastability observed here.
Submitted 11 November, 2019;
originally announced November 2019.
-
MOVPE-grown Si-doped β-(Al0.26Ga0.74)2O3 thin films and heterostructures
Authors:
Praneeth Ranga,
Ashwin Rishinaramangalam,
Joel Varley,
Arkka Bhattacharyya,
Daniel Feezell,
Sriram Krishnamoorthy
Abstract:
We report on n-type degenerate doping in MOVPE grown β-(Al0.26Ga0.74)2O3 epitaxial thin films and modulation doping in β-(Al0.26Ga0.74)2O3/β-Ga2O3 heterostructure. Alloy composition is confirmed using HRXRD measurements. Carrier concentration in the thin films is proportional to the silane molar flow. Room temperature Hall measurements showed a high carrier concentration of 6x10^18-7.3x10^19 cm^-3 with a corresponding electron mobility of 53-27 cm^2/V.s in uniformly-doped β-(Al0.26Ga0.74)2O3 layers. Modulation doping is used to realize a total electron sheet charge of 2.3x10^12 cm^-2 in a β-(Al0.26Ga0.74)2O3/β-Ga2O3 heterostructure using a uniformly-doped β-(Al0.26Ga0.74)2O3 barrier layer and a thin spacer layer.
Submitted 11 September, 2019;
originally announced September 2019.
-
MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning
Authors:
Bohan Wu,
Iretiayo Akinola,
Jacob Varley,
Peter Allen
Abstract:
Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.
Submitted 9 October, 2019; v1 submitted 10 September, 2019;
originally announced September 2019.
-
Unusual Formation of Point Defect Complexes in the Ultra-wide Band Gap Semiconductor beta-Ga2O3
Authors:
Jared M. Johnson,
Zhen Chen,
Joel B. Varley,
Christine M. Jackson,
Esmat Farzana,
Zeng Zhang,
Aaron R. Arehart,
Hsien-Lien Huang,
Arda Genc,
Steven A. Ringel,
Chris G. Van de Walle,
David A. Muller,
Jinwoo Hwang
Abstract:
Understanding the unique properties of ultra-wide band gap semiconductors requires detailed information about the exact nature of point defects and their role in determining the properties. Here, we report the first direct microscopic observation of an unusual formation of point defect complexes within the atomic scale structure of beta-Ga2O3 using high resolution scanning transmission electron microscopy (STEM). Each complex involves one cation interstitial atom paired with two cation vacancies. These divacancy - interstitial complexes correlate directly with structures obtained by density functional theory, which predicts them to be compensating acceptors in beta-Ga2O3. This prediction is confirmed by a comparison between STEM data and deep level optical spectroscopy results, which reveals that these complexes correspond to a deep trap within the band gap, and that the development of the complexes is facilitated by Sn doping through the increase in vacancy concentration. These findings provide new insight on this emerging material's unique response to the incorporation of impurities that can critically influence their properties.
Submitted 1 July, 2019;
originally announced July 2019.
-
Teleoperator Imitation with Continuous-time Safety
Authors:
Bachir El Khadir,
Jake Varley,
Vikas Sindhwani
Abstract:
Learning to effectively imitate human teleoperators, with generalization to unseen and dynamic environments, is a promising path to greater autonomy enabling robots to steadily acquire complex skills from supervision. We propose a new motion learning technique rooted in contraction theory and sum-of-squares programming for estimating a control law in the form of a polynomial vector field from a given set of demonstrations. Notably, this vector field is provably optimal for the problem of minimizing imitation loss while providing continuous-time guarantees on the induced imitation behavior. Our method generalizes to new initial and goal poses of the robot and can adapt in real-time to dynamic obstacles during execution, with convergence to teleoperator behavior within a well-defined safety tube. We present an application of our framework for pick-and-place tasks in the presence of moving obstacles on a 7-DOF KUKA IIWA arm. The method compares favorably to other learning-from-demonstration approaches on benchmark handwriting imitation tasks.
Submitted 23 May, 2019;
originally announced May 2019.
-
Workspace Aware Online Grasp Planning
Authors:
Iretiayo Akinola,
Jacob Varley,
Boyuan Chen,
Peter K. Allen
Abstract:
This work provides a framework for a workspace aware online grasp planner. This framework greatly improves the performance of standard online grasp planning algorithms by incorporating a notion of reachability into the online grasp planning process. Offline, a database of hundreds of thousands of unique end-effector poses was queried for feasibility. At runtime, our grasp planner uses this database to bias the hand towards reachable end-effector configurations.
The bias keeps the grasp planner in accessible regions of the planning scene so that the resulting grasps are tailored to the situation at hand. This results in a higher percentage of reachable grasps, a higher percentage of successful grasp executions, and a reduced planning time. We also present experimental results using simulated and real environments.
Submitted 29 June, 2018;
originally announced June 2018.
-
Human Robot Interface for Assistive Grasping
Authors:
David Watkins,
Chaiwen Chou,
Caroline Weinberg,
Jacob Varley,
Kenneth Lyons,
Sanjay Joshi,
Lynne Weber,
Joel Stein,
Peter Allen
Abstract:
This work describes a new human-in-the-loop (HitL) assistive grasping system for individuals with varying levels of physical capabilities. We investigated the feasibility of using four potential input devices with our assistive grasping system interface, using able-bodied individuals to define a set of quantitative metrics that could be used to assess an assistive grasping system. We then took these measurements and created a generalized benchmark for evaluating the effectiveness of any arbitrary input device in a HitL grasping system. The four input devices were a mouse, a speech recognition device, an assistive switch, and a novel sEMG device developed by our group that was connected either to the forearm or behind the ear of the subject. These preliminary results provide insight into how different interface devices perform for generalized assistive grasping tasks and also highlight the potential of sEMG based control for severely disabled individuals.
Submitted 6 April, 2018;
originally announced April 2018.
-
Multi-Modal Geometric Learning for Grasping and Manipulation
Authors:
David Watkins,
Jacob Varley,
Peter Allen
Abstract:
This work provides an architecture that incorporates depth and tactile information to create rich and accurate 3D models useful for robotic manipulation tasks. This is accomplished through the use of a 3D convolutional neural network (CNN). Offline, the network is provided with both depth and tactile information and trained to predict the object's geometry, thus filling in regions of occlusion. At runtime, the network is provided a partial view of an object. Tactile information is acquired to augment the captured depth information. The network can then reason about the object's geometry by utilizing both the collected tactile and depth information. We demonstrate that even small amounts of additional tactile information can be incredibly helpful in reasoning about object geometry. This is particularly true when information from depth alone fails to produce an accurate geometric prediction. Our method is benchmarked against and outperforms other visual-tactile approaches to general geometric reasoning. We also provide experimental results comparing grasping success with our method.
Submitted 27 February, 2019; v1 submitted 20 March, 2018;
originally announced March 2018.
-
Structure of exciton condensates in imbalanced electron-hole bilayers
Authors:
J. R. Varley,
D. K. K. Lee
Abstract:
We investigate the possibility of excitonic superfluidity in electron-hole bilayers. We calculate the phase diagram of the system for the whole range of electron-hole density imbalance and for different degrees of electrostatic screening, using mean-field theory and a Ginzburg-Landau expansion. We are able to resolve discrepancies between previous works in the literature, which concentrated on restricted regions of the parameter space. We also give detailed descriptions of the pairing wavefunction in the Fulde-Ferrell-Larkin-Ovchinnikov paired state. The Ginzburg-Landau treatment allows us to investigate the energy scales involved in the pairing state and discuss the possible spontaneous breaking of two-dimensional translation symmetry in the ground state.
Submitted 29 November, 2016;
originally announced November 2016.
-
Shape Completion Enabled Robotic Grasping
Authors:
Jacob Varley,
Chad DeChant,
Adam Richardson,
Joaquín Ruales,
Peter Allen
Abstract:
This work provides an architecture to enable robotic grasp planning via shape completion. Shape completion is accomplished through the use of a 3D convolutional neural network (CNN). The network is trained on our own new open source dataset of over 440,000 3D exemplars captured from varying viewpoints. At runtime, a 2.5D pointcloud captured from a single point of view is fed into the CNN, which fills in the occluded regions of the scene, allowing grasps to be planned and executed on the completed object. Runtime shape completion is very rapid because most of the computational costs of shape completion are borne during offline training. We explore how the quality of completions varies based on several factors. These include whether or not the object being completed existed in the training data and how many object models were used to train the network. We also look at the ability of the network to generalize to novel objects, allowing the system to complete previously unseen objects at runtime. Finally, experimentation is done both in simulation and on actual robotic hardware to explore the relationship between completion quality and the utility of the completed mesh model for grasping.
Submitted 2 March, 2017; v1 submitted 27 September, 2016;
originally announced September 2016.
-
Dual behavior of excess electrons in rutile TiO2
Authors:
A. Janotti,
C. Franchini,
J. B. Varley,
G. Kresse,
C. G. Van de Walle
Abstract:
The behavior of electrons in the conduction band of TiO2 and other transition-metal oxides is key to the many applications of these materials. Experiments seem to produce conflicting results: optical and spin-resonance techniques reveal strongly localized small polarons, while electrical measurements show high mobilities that can only be explained by delocalized free electrons. By means of hybrid functional calculations we resolve this apparent contradiction and show that small polarons can actually coexist with delocalized electrons in the conduction band of TiO2, the former being energetically only slightly more favorable. We also find that small polarons can form complexes with oxygen vacancies and ionized shallow-donor impurities, explaining the rich spectrum of Ti$^{3+}$ species observed in electron spin resonance experiments.
Submitted 24 December, 2012;
originally announced December 2012.
-
Quantum computing with defects
Authors:
J. R. Weber,
W. F. Koehl,
J. B. Varley,
A. Janotti,
B. B. Buckley,
C. G. Van de Walle,
D. D. Awschalom
Abstract:
Identifying and designing physical systems for use as qubits, the basic units of quantum information, are critical steps in the development of a quantum computer. Among the possibilities in the solid state, a defect in diamond known as the nitrogen-vacancy (NV$^{-1}$) center stands out for its robustness: its quantum state can be initialized, manipulated, and measured with high fidelity at room temperature. Here we describe how to systematically identify other deep center defects with similar quantum-mechanical properties. We present a list of physical criteria that these centers and their hosts should meet and explain how these requirements can be used in conjunction with electronic structure theory to intelligently sort through candidate defect systems. To illustrate these points in detail, we compare electronic structure calculations of the NV$^{-1}$ center in diamond with those of several deep centers in 4H silicon carbide (SiC). We then discuss the proposed criteria for similar defects in other tetrahedrally-coordinated semiconductors.
Submitted 8 March, 2010;
originally announced March 2010.
-
Friction in nanoelectromechanical systems: Clamping loss in the GHz regime
Authors:
Michael R. Geller,
Joel B. Varley
Abstract:
The performance of a wide variety of ultra-sensitive devices employing nanoelectromechanical resonators is determined by their mechanical quality factor, yet energy dissipation in these systems remains poorly understood. Here we develop a comprehensive theory of friction in high frequency resonators caused by the radiation of elastic energy into the support substrate, referred to as clamping loss. The elastic radiation rate is found to be a strong increasing function of resonator frequency, and we argue that this mechanism will play an important role in future microwave-frequency devices.
Submitted 29 December, 2005;
originally announced December 2005.