Search | arXiv e-print repository

Constructing 100 MΩ and 1 GΩ Resistance Standards via Star-Mesh Transformations

Authors: Dean G. Jarrett, Albert F. Rigosi, Dominick S. Scaletta, Ngoc Thanh Mai Tran, Heather M. Hill, Alireza R. Panna, Cheng Hsueh Yang, Yanfei Yang, Randolph E. Elmquist, David B. Newell

Abstract: A recent mathematical framework for optimizing resistor networks to achieve values in the MΩ through GΩ levels was employed for two specific cases. Objectives here include proof of concept and identification of possible apparatus limitations for future experiments involving graphene-based quantum Hall array resistance standards. Using fractal-like, or recursive, features of the framework allows one to calculate and implement network designs with substantially lower-valued resistors. The cases of 100 MΩ and 1 GΩ demonstrate that, theoretically, one would not need more than 100 quantum Hall elements to achieve these high resistances.

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2311.01489 [pdf, other]

Invariant Causal Imitation Learning for Generalizable Policies

Authors: Ioana Bica, Daniel Jarrett, Mihaela van der Schaar

Abstract: Consider learning an imitation policy on the basis of demonstrated behavior from multiple environments, with an eye towards deployment in an unseen environment. Since the observable features from each setting may be different, directly learning individual policies as mappings from features to actions is prone to spurious correlations -- and may not generalize well. However, the expert's policy is often a function of a shared latent structure underlying those observable features that is invariant across settings. By leveraging data from multiple environments, we propose Invariant Causal Imitation Learning (ICIL), a novel technique in which we learn a feature representation that is invariant across domains, on the basis of which we learn an imitation policy that matches expert behavior. To cope with transition dynamics mismatch, ICIL learns a shared representation of causal features (for all training environments), that is disentangled from the specific representations of noise variables (for each of those environments). Moreover, to ensure that the learned policy matches the observation distribution of the expert's policy, ICIL estimates the energy of the expert's observations and uses a regularization term that minimizes the imitator policy's next state energy. Experimentally, we compare our methods against several benchmarks in control and healthcare tasks and show its effectiveness in learning imitation policies capable of generalizing to unseen environments.

Submitted 2 November, 2023; originally announced November 2023.

Journal ref: In Proc. 35th International Conference on Neural Information Processing Systems (NeurIPS 2021)

arXiv:2311.01388 [pdf, other]

Time-series Generation by Contrastive Imitation

Authors: Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

Abstract: Consider learning a generative model for time-series data. The sequential setting poses a unique challenge: Not only should the generator capture the conditional dynamics of (stepwise) transitions, but its open-loop rollouts should also preserve the joint distribution of (multi-step) trajectories. On one hand, autoregressive models trained by MLE allow learning and computing explicit transition distributions, but suffer from compounding error during rollouts. On the other hand, adversarial models based on GAN training alleviate such exposure bias, but transitions are implicit and hard to assess. In this work, we study a generative framework that seeks to combine the strengths of both: Motivated by a moment-matching objective to mitigate compounding error, we optimize a local (but forward-looking) transition policy, where the reinforcement signal is provided by a global (but stepwise-decomposable) energy model trained by contrastive estimation. At training, the two components are learned cooperatively, avoiding the instabilities typical of adversarial objectives. At inference, the learned policy serves as the generator for iterative sampling, and the learned energy serves as a trajectory-level measure for evaluating sample quality. By expressly training a policy to imitate sequential behavior of time-series features in a dataset, this approach embodies "generation by imitation". Theoretically, we illustrate the correctness of this formulation and the consistency of the algorithm. Empirically, we evaluate its ability to generate predictively useful samples from real-world datasets, verifying that it performs at the standard of existing benchmarks.

Submitted 2 November, 2023; originally announced November 2023.

Journal ref: In Proc. 35th International Conference on Neural Information Processing Systems (NeurIPS 2021)

arXiv:2310.19831 [pdf, other]

Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning

Authors: Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

Abstract: Understanding human behavior from observed data is critical for transparency and accountability in decision-making. Consider real-world settings such as healthcare, in which modeling a decision-maker's policy is challenging -- with no access to underlying states, no knowledge of environment dynamics, and no allowance for live experimentation. We desire learning a data-driven representation of decision-making behavior that (1) inheres transparency by design, (2) accommodates partial observability, and (3) operates completely offline. To satisfy these key criteria, we propose a novel model-based Bayesian method for interpretable policy learning ("Interpole") that jointly estimates an agent's (possibly biased) belief-update process together with their (possibly suboptimal) belief-action mapping. Through experiments on both simulated and real-world data for the problem of Alzheimer's disease diagnosis, we illustrate the potential of our approach as an investigative device for auditing, quantifying, and understanding human decision-making behavior.

Submitted 28 October, 2023; originally announced October 2023.

Journal ref: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

arXiv:2310.18688 [pdf, other]

Clairvoyance: A Pipeline Toolkit for Medical Time Series

Authors: Daniel Jarrett, Jinsung Yoon, Ioana Bica, Zhaozhi Qian, Ari Ercole, Mihaela van der Schaar

Abstract: Time-series learning is the bread and butter of data-driven *clinical decision support*, and the recent explosion in ML research has demonstrated great potential in various healthcare settings. At the same time, medical time-series problems in the wild are challenging due to their highly *composite* nature: They entail design choices and interactions among components that preprocess data, impute missing values, select features, issue predictions, estimate uncertainty, and interpret models. Despite exponential growth in electronic patient data, there is a remarkable gap between the potential and realized utilization of ML for clinical research and decision support. In particular, orchestrating a real-world project lifecycle poses challenges in engineering (i.e. hard to build), evaluation (i.e. hard to assess), and efficiency (i.e. hard to optimize). Designed to address these issues simultaneously, Clairvoyance proposes a unified, end-to-end, autoML-friendly pipeline that serves as a (i) software toolkit, (ii) empirical standard, and (iii) interface for optimization. Our ultimate goal lies in facilitating transparent and reproducible experimentation with complex inference workflows, providing integrated pathways for (1) personalized prediction, (2) treatment-effect estimation, and (3) information acquisition. Through illustrative examples on real-world data in outpatient, general wards, and intensive-care settings, we illustrate the applicability of the pipeline paradigm on core tasks in the healthcare journey. To the best of our knowledge, Clairvoyance is the first to demonstrate viability of a comprehensive and automatable pipeline for clinical time-series ML.

Submitted 28 October, 2023; originally announced October 2023.

Journal ref: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

arXiv:2310.18601 [pdf, other]

Online Decision Mediation

Authors: Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Abstract: Consider learning a decision support assistant to serve as an intermediary between (oracle) expert behavior and (imperfect) human behavior: At each time, the algorithm observes an action chosen by a fallible agent, and decides whether to *accept* that agent's decision, *intervene* with an alternative, or *request* the expert's opinion. For instance, in clinical diagnosis, fully-autonomous machine behavior is often beyond ethical affordances, thus real-world decision support is often limited to monitoring and forecasting. Instead, such an intermediary would strike a prudent balance between the former (purely prescriptive) and latter (purely descriptive) approaches, while providing an efficient interface between human mistakes and expert feedback. In this work, we first formalize the sequential problem of *online decision mediation* -- that is, of simultaneously learning and evaluating mediator policies from scratch with *abstentive feedback*: In each round, deferring to the oracle obviates the risk of error, but incurs an upfront penalty, and reveals the otherwise hidden expert action as a new training data point. Second, we motivate and propose a solution that seeks to trade off (immediate) loss terms against (future) improvements in generalization error; in doing so, we identify why conventional bandit algorithms may fail. Finally, through experiments and sensitivities on a variety of datasets, we illustrate consistent gains over applicable benchmarks on performance measures with respect to the mediator policy, the learned model, and the decision-making system as a whole.

Submitted 28 October, 2023; originally announced October 2023.

Journal ref: In Proc. 36th International Conference on Neural Information Processing Systems (NeurIPS 2022)

arXiv:2310.18591 [pdf, other]

Inverse Decision Modeling: Learning Interpretable Representations of Behavior

Authors: Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Abstract: Decision analysis deals with modeling and enhancing decision processes. A principal challenge in improving behavior is in obtaining a transparent description of existing behavior in the first place. In this paper, we develop an expressive, unifying perspective on inverse decision modeling: a framework for learning parameterized representations of sequential decision behavior. First, we formalize the forward problem (as a normative standard), subsuming common classes of control behavior. Second, we use this to formalize the inverse problem (as a descriptive model), generalizing existing work on imitation/reward learning -- while opening up a much broader class of research problems in behavior representation. Finally, we instantiate this approach with an example (inverse bounded rational control), illustrating how this structure enables learning (interpretable) representations of (bounded) rationality -- while naturally capturing intuitive notions of suboptimal actions, biased beliefs, and imperfect knowledge of environments.

Submitted 28 October, 2023; originally announced October 2023.

Journal ref: In Proc. 38th International Conference on Machine Learning (ICML 2021)

arXiv:2310.07747 [pdf, other]

Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Authors: Hao Sun, Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

Abstract: Learning controllers with offline data in decision-making systems is an essential area of research due to its potential to reduce the risk of applications in real-world systems. However, in responsibility-sensitive settings such as healthcare, decision accountability is of paramount importance, yet has not been adequately addressed by the literature. This paper introduces the Accountable Offline Controller (AOC) that employs the offline dataset as the Decision Corpus and performs accountable control based on a tailored selection of examples, referred to as the Corpus Subset. AOC operates effectively in low-data scenarios, can be extended to the strictly offline imitation setting, and displays qualities of both conservation and adaptability. We assess AOC's performance in both simulated and real-world healthcare scenarios, emphasizing its capability to manage offline control tasks with high levels of performance while maintaining accountability.

Submitted 27 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

arXiv:2309.15813 [pdf]

Fractal-like star-mesh transformations using graphene quantum Hall arrays

Authors: Dominick S. Scaletta, Swapnil M. Mhatre, Ngoc Thanh Mai Tran, Cheng-Hsueh Yang, Heather M. Hill, Yanfei Yang, Linli Meng, Alireza R. Panna, Shamith U. Payagala, Randolph E. Elmquist, Dean G. Jarrett, David B. Newell, Albert F. Rigosi

Abstract: A mathematical approach is adopted for optimizing the number of total device elements required for obtaining high effective quantized resistances in graphene-based quantum Hall array devices. This work explores an analytical extension to the use of star-mesh transformations such that fractal-like, or recursive, device designs can yield high enough resistances (like 1 EΩ, arguably the highest resistance with meaningful applicability) while still being feasible to build with modern fabrication techniques. Epitaxial graphene elements are tested, whose quantized Hall resistance at the nu=2 plateau (R_H = 12906.4 Ω) becomes the building block for larger effective, quantized resistances. It is demonstrated that, mathematically, one would not need more than 200 elements to achieve the highest pertinent resistances.
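For a sense of scale, the nu=2 plateau value quoted in this abstract follows from the exact SI values of h and e, and a naive series chain of such elements would need an enormous count to reach 1 EΩ. The sketch below is illustrative only: the function names are hypothetical, and the plain series chain is an assumed baseline for contrast, not the design method of the paper.

```python
import math

# Exact SI defining constants (2019 redefinition of the SI).
PLANCK_H = 6.62607015e-34       # J s
ELEMENTARY_E = 1.602176634e-19  # C


def hall_resistance(nu: int) -> float:
    """Quantized Hall resistance h / (nu * e^2), in ohms."""
    return PLANCK_H / (nu * ELEMENTARY_E ** 2)


def series_elements_needed(target_ohms: float, element_ohms: float) -> int:
    """Element count if identical elements were simply chained in series.

    This is an assumed baseline for comparison, not a star-mesh design.
    """
    return math.ceil(target_ohms / element_ohms)


r_h = hall_resistance(2)
print(f"R_H at nu=2: {r_h:.1f} ohm")
print(f"Series elements for 1 Eohm: {series_elements_needed(1e18, r_h):.3e}")
```

Under that baseline the count is on the order of 10^13 elements, which is the gap the recursive star-mesh designs are meant to close (the abstract reports no more than 200 elements).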

Submitted 27 September, 2023; originally announced September 2023.

arXiv:2309.04865 [pdf, other]

doi 10.1103/PhysRevB.108.L121404

Observation of flat and weakly dispersing bands in a van der Waals semiconductor Nb3Br8 with breathing kagome lattice

Authors: Sabin Regmi, Anup Pradhan Sakhya, Tharindu Fernando, Yuzhou Zhao, Dylan Jeff, Milo Sprague, Favian Gonzalez, Iftakhar Bin Elius, Mazharul Islam Mondal, Nathan Valadez, Damani Jarrett, Alexis Agosto, Jihui Yang, Jiun-Haw Chu, Saiful I. Khondaker, Xiaodong Xu, Ting Cao, Madhab Neupane

Abstract: Niobium halides, Nb3X8 (X = Cl,Br,I), which are predicted two-dimensional magnets, have recently gotten attention due to their breathing kagome geometry. Here, we have studied the electronic structure of Nb3Br8 by using angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations. ARPES results depict the presence of multiple flat and weakly dispersing bands. These bands are well explained by the theoretical calculations, which show they have Nb d character indicating their origination from the Nb atoms forming the breathing kagome plane. This van der Waals material can be easily thinned down via mechanical exfoliation to the ultrathin limit and such ultrathin samples are stable as depicted from the time-dependent Raman spectroscopy measurements at room temperature. These results demonstrate that Nb3Br8 is an excellent material not only for studying breathing kagome induced flat band physics and its connection with magnetism, but also for heterostructure fabrication for application purposes.

Submitted 9 September, 2023; originally announced September 2023.

Comments: 24 pages, 12 figures, Supplemental Material included

Journal ref: Phys. Rev. B 108, L121404 (2023)

arXiv:2308.00200 [pdf, other]

Realization of the quantum ampere using the quantum anomalous Hall and Josephson effects

Authors: Linsey K. Rodenbach, Ngoc Thanh Mai Tran, Jason M. Underwood, Alireza R. Panna, Molly P. Andersen, Zachary S. Barcikowski, Shamith U. Payagala, Peng Zhang, Lixuan Tai, Kang L. Wang, Randolph E. Elmquist, Dean G. Jarrett, David B. Newell, Albert F. Rigosi, David Goldhaber-Gordon

Abstract: By directly coupling a quantum anomalous Hall resistor to a programmable Josephson voltage standard, we have implemented a quantum current sensor (QCS) that operates within a single cryostat in zero magnetic field. Using this QCS we determine values of current within the range 9.33 nA - 252 nA, providing a realization of the ampere based on fundamental constants and quantum phenomena. The relative Type A uncertainty is lowest, 2.30 $\times$10$^{-6}$ A/A, at the highest current studied, 252 nA. The total root-sum-square combined relative uncertainty ranges from 3.91 $\times$10$^{-6}$ A/A at 252 nA to 41.2 $\times$10$^{-6}$ A/A at 9.33 nA. No DC current standard is available in the nanoampere range with relative uncertainty comparable to this, so we assess our QCS accuracy by comparison to a traditional Ohm's law measurement of the same current source. We find closest agreement (1.46 $\pm$ 4.28)$\times$10$^{-6}$ A/A for currents near 83.9 nA, for which the highest number of measurements were made.

Submitted 31 July, 2023; originally announced August 2023.

Comments: 12 pages, 5 figures, 15 pages of supplemental information

arXiv:2306.04447 [pdf, other]

doi 10.1038/s41598-023-44851-8

Observation of momentum-dependent charge density wave gap in a layered antiferromagnet GdTe3

Authors: Sabin Regmi, Iftakhar Bin Elius, Anup Pradhan Sakhya, Dylan Jeff, Milo Sprague, Mazharul Islam Mondal, Damani Jarrett, Nathan Valadez, Alexis Agosto, Tetiana Romanova, Jiun-Haw Chu, Saiful I. Khondaker, Andrzej Ptok, Dariusz Kaczorowski, Madhab Neupane

Abstract: Charge density wave (CDW) ordering has been an important topic of study for a long time owing to its connection with other exotic phases such as superconductivity and magnetism. The RTe3 (R = rare-earth elements) family of materials provides a fertile ground to study the dynamics of CDW in van der Waals layered materials, and the presence of magnetism in these materials allows one to explore the interplay between CDW and long-range magnetic ordering. Here, we have carried out a high-resolution angle-resolved photoemission spectroscopy (ARPES) study of a CDW material GdTe3, which is antiferromagnetic below 12 K, along with thermodynamic, electrical transport, magnetic, and Raman measurements. Our Raman spectroscopy measurements show the presence of a CDW amplitude mode at room temperature, which remains prominent when the sample is thinned down to 4 layers by exfoliation. Our ARPES data show a two-fold symmetric Fermi surface with both gapped and ungapped regions, indicative of partial nesting. The gap is momentum dependent: it is maximum along G-Z and gradually decreases going towards G-M. Our study provides a platform to study the dynamics of CDW and its interaction with other physical orders in two and three dimensions.

Submitted 1 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: 30 pages, 12 figures, Supplementary Information included

Journal ref: Scientific Reports volume 13, Article number: 18618 (2023)

arXiv:2304.11243 [pdf]

Star-Mesh Quantized Hall Array Resistance Devices

Authors: Dean G. Jarrett, Ching-Chen Yeh, Shamith U. Payagala, Alireza R. Panna, Yanfei Yang, Linli Meng, Swapnil M. Mhatre, Ngoc Thanh Mai Tran, Heather M. Hill, Dipanjan Saha, Randolph E. Elmquist, David B. Newell, Albert F. Rigosi

Abstract: Advances in the development of graphene-based technology have enabled improvements in DC resistance metrology. Devices made from epitaxially grown graphene have replaced the GaAs-based counterparts, leading to an easier and more accessible realization of the ohm. By optimizing the scale of the growth, it has become possible to fabricate quantized Hall array resistance standards (QHARS) with nominal values between 1 kΩ and 1.29 MΩ. One of these QHARS device designs accommodates a value of about 1.01 MΩ, which made it an ideal candidate to pursue a proof-of-concept that graphene-based QHARS devices are suitable for forming wye-delta resistance networks. In this work, the 1.01 MΩ array output nearly 20.6 MΩ due to the wye-delta transformation, which itself is a special case of star-mesh transformations. These mathematical equivalence principles allow one to extend the QHR to the 100 MΩ and 10 GΩ resistance levels with fewer array elements than would be necessary for a single array with many more elements in series. The 1.01 MΩ device shows promise that the wye-delta transformation can shorten the calibration chain, and, more importantly, provide a chain with a more direct line to the quantum SI.
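The wye-delta relation mentioned in this abstract is the three-node case of the general star-mesh formula, R_ij = R_i R_j Σ_k (1/R_k). The sketch below is a minimal illustration of that textbook formula; the function name and the resistor values are hypothetical and are not taken from the paper's device.

```python
def star_to_mesh(star: dict[str, float]) -> dict[tuple[str, str], float]:
    """Star-mesh transformation.

    `star` maps each outer node to the resistance (ohms) between that node
    and the common central node. The result maps each node pair (i, j) to the
    equivalent mesh resistance R_ij = R_i * R_j * sum_k(1 / R_k).
    """
    total_conductance = sum(1.0 / r for r in star.values())
    nodes = sorted(star)
    return {
        (a, b): star[a] * star[b] * total_conductance
        for i, a in enumerate(nodes)
        for b in nodes[i + 1:]
    }


# Hypothetical wye network: two 1 Mohm arms and a 100 kohm shunt arm.
wye = {"a": 1.0e6, "b": 1.0e6, "c": 1.0e5}
delta = star_to_mesh(wye)
for pair, resistance in delta.items():
    print(pair, f"{resistance / 1e6:.2f} Mohm")
```

With these assumed values the a-b branch of the equivalent delta is several times larger than either arm, which is the general mechanism by which a small shunt arm multiplies the effective resistance.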

Submitted 21 April, 2023; originally announced April 2023.

arXiv:2211.10515 [pdf, other]

Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments

Authors: Daniel Jarrett, Corentin Tallec, Florent Altché, Thomas Mesnard, Rémi Munos, Michal Valko

Abstract: Consider the problem of exploration in sparse-reward or reward-free environments, such as in Montezuma's Revenge. In the curiosity-driven paradigm, the agent is rewarded for how much each realized outcome differs from their predicted outcome. But using predictive error as intrinsic motivation is fragile in stochastic environments, as the agent may become trapped by high-entropy areas of the state-action space, such as a "noisy TV". In this work, we study a natural solution derived from structural causal models of the world: Our key idea is to learn representations of the future that capture precisely the unpredictable aspects of each outcome -- which we use as additional input for predictions, such that intrinsic rewards only reflect the predictable aspects of world dynamics. First, we propose incorporating such hindsight representations into models to disentangle "noise" from "novelty", yielding Curiosity in Hindsight: a simple and scalable generalization of curiosity that is robust to stochasticity. Second, we instantiate this framework for the recently introduced BYOL-Explore algorithm as our prime example, resulting in the noise-robust BYOL-Hindsight. Third, we illustrate its behavior under a variety of different stochasticities in a grid world, and find improvements over BYOL-Explore in hard-exploration Atari games with sticky actions. Notably, we show state-of-the-art results in exploring Montezuma's Revenge with sticky actions, while preserving performance in the non-sticky setting.

Submitted 14 July, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

Journal ref: In Proc. 40th International Conference on Machine Learning (ICML 2023)

arXiv:2206.07769 [pdf, other]

HyperImpute: Generalized Iterative Imputation with Automatic Model Selection

Authors: Daniel Jarrett, Bogdan Cebere, Tennison Liu, Alicia Curth, Mihaela van der Schaar

Abstract: Consider the problem of imputing missing values in a dataset. On the one hand, conventional approaches using iterative imputation benefit from the simplicity and customizability of learning conditional distributions directly, but suffer from the practical requirement for appropriate model specification of each and every variable. On the other hand, recent methods using deep generative modeling benefit from the capacity and efficiency of learning with neural network function approximators, but are often difficult to optimize and rely on stronger data assumptions. In this work, we study an approach that marries the advantages of both: We propose *HyperImpute*, a generalized iterative imputation framework for adaptively and automatically configuring column-wise models and their hyperparameters. Practically, we provide a concrete implementation with out-of-the-box learners, optimizers, simulators, and extensible interfaces. Empirically, we investigate this framework via comprehensive experiments and sensitivities on a variety of public datasets, and demonstrate its ability to generate accurate imputations relative to a strong suite of benchmarks. Contrary to recent work, we believe our findings constitute a strong defense of the iterative imputation paradigm.

Submitted 15 June, 2022; originally announced June 2022.

Journal ref: In Proc. 39th International Conference on Machine Learning (ICML 2022)

arXiv:2205.06077 [pdf]

Chromium-Doped Bismuth Antimony Telluride for Future Quantum Hall Resistance Standards

Authors: Albert F. Rigosi, Linsey K. Rodenbach, Alireza R. Panna, Shamith U. Payagala, Ilan T. Rosen, Joseph A. Hagmann, Peng Zhang, Lixuan Tai, Kang L. Wang, Dean G. Jarrett, Randolph E. Elmquist, Jason M. Underwood, David B. Newell, David Goldhaber-Gordon

Abstract: Since 2017, epitaxial graphene has been the base material for the US national standard for resistance. A future avenue of research within electrical metrology is to remove the need for strong magnetic fields, as is currently the case for devices exhibiting the quantum Hall effect. The quantum Hall effect is just one of many research endeavours that revolve around recent quantum physical phenomena like composite fermions, charge density waves, and topological properties [1-2]. New materials, like magnetically doped topological insulators (MTIs), offer access to the quantum anomalous Hall effect, which in its ideal form, could become a future resistance standard needing only a small permanent magnet to activate a quantized resistance value [3-5]. Furthermore, these devices could operate at zero-field for measurements, making the dissemination of the ohm more economical and portable. Here we present results on precision measurements of the h/e^2 quantized plateau of Cr-doped (Bi_xSb_{1-x})_2Te_3 and give them context by comparing them to modern graphene-based resistance standards. Ultimately, MTI-based devices could be combined in a single system with magnetic-field-averse Josephson voltage standards to obtain an alternative quantum current standard.

Submitted 12 May, 2022; originally announced May 2022.

arXiv:2201.03621 [pdf]

Designs for programmable quantum resistance standards based on epitaxial graphene p-n junctions

Authors: Jiuning Hu, Albert F. Rigosi, Mattias Kruskopf, Yanfei Yang, Bi-Yi Wu, Jifa Tian, Alireza R. Panna, Hsin-Yen Lee, Shamith U. Payagala, George R. Jones, Marlin E. Kraft, Dean G. Jarrett, Kenji Watanabe, Takashi Taniguchi, Randolph E. Elmquist, David B. Newell

Abstract: We report the fabrication and measurement of top gated epitaxial graphene p-n junctions where exfoliated hexagonal boron nitride (h-BN) is used as the gate dielectric. The four-terminal longitudinal resistance across a single junction is well quantized at the von Klitzing constant R_K with a relative uncertainty of 10^{-7}. After the exploration of numerous parameter spaces, we summarize the conditions upon which these devices could function as potential resistance standards. Furthermore, we offer designs of programmable electrical resistance standards over six orders of magnitude by using external gating.

Submitted 10 January, 2022; originally announced January 2022.

arXiv:2107.06317 [pdf, other]

Inverse Contextual Bandits: Learning How Behavior Evolves over Time

Authors: Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

Abstract: Understanding a decision-maker's priorities by observing their behavior is critical for transparency and accountability in decision processes, such as in healthcare. Though conventional approaches to policy learning almost invariably assume stationarity in behavior, this is hardly true in practice: Medical practice is constantly evolving as clinical professionals fine-tune their knowledge over time. For instance, as the medical community's understanding of organ transplantations has progressed over the years, a pertinent question is: How have actual organ allocation policies been evolving? To give an answer, we desire a policy learning method that provides interpretable representations of decision-making, in particular capturing an agent's non-stationary knowledge of the world, as well as operating in an offline manner. First, we model the evolving behavior of decision-makers in terms of contextual bandits, and formalize the problem of Inverse Contextual Bandits (ICB). Second, we propose two concrete algorithms as solutions, learning parametric and nonparametric representations of an agent's behavior. Finally, using both real and simulated data for liver transplantations, we illustrate the applicability and explainability of our method, as well as benchmarking and validating its accuracy.

Submitted 8 June, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

Comments: In Proceedings of the 39th International Conference on Machine Learning

arXiv:2106.04240 [pdf, other]

The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation

Authors: Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, Mihaela van der Schaar

Abstract: Understanding decision-making in clinical environments is of paramount importance if we are to bring the strengths of machine learning to ultimately improve patient outcomes. Several factors, including the availability of public data, the intrinsically offline nature of the problem, and the complexity of human decision making, have meant that the mainstream development of algorithms is often geared towards optimal performance in tasks that do not necessarily translate well into the medical regime, often overlooking more niche issues commonly associated with the area. We therefore present a new benchmarking suite designed specifically for medical sequential decision making: the Medkit-Learn(ing) Environment, a publicly available Python package providing simple and easy access to high-fidelity synthetic medical data. While providing a standardised way to compare algorithms in a realistic medical setting, we employ a generating process that disentangles the policy and environment dynamics to allow for a range of customisations, thus enabling systematic evaluation of algorithms' robustness against specific challenges prevalent in healthcare.

Submitted 14 March, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

arXiv:2007.13531 [pdf, other]

Learning "What-if" Explanations for Sequential Decision-Making

Authors: Ioana Bica, Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Abstract: Building interpretable parameterizations of real-world decision-making on the basis of demonstrated behavior -- i.e. trajectories of observations and actions made by an expert maximizing some unknown reward function -- is essential for introspecting and auditing policies in different institutions. In this paper, we propose learning explanations of expert decisions by modeling their reward function in terms of preferences with respect to "what if" outcomes: Given the current history of observations, what would happen if we took a particular action? To learn these cost-benefit tradeoffs associated with the expert's actions, we integrate counterfactual reasoning into batch inverse reinforcement learning. This offers a principled way of defining reward functions and explaining expert behavior, and also satisfies the constraints of real-world decision-making -- where active experimentation is often impossible (e.g. in healthcare). Additionally, by estimating the effects of different actions, counterfactuals readily tackle the off-policy nature of policy evaluation in the batch setting, and can naturally accommodate settings where the expert policies depend on histories of observations rather than just current states. Through illustrative experiments in both real and simulated medical environments, we highlight the effectiveness of our batch, counterfactual inverse reinforcement learning approach in recovering accurate and interpretable descriptions of behavior.

Submitted 30 March, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

Comments: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

arXiv:2007.12087 [pdf, other]

Hide-and-Seek Privacy Challenge

Authors: James Jordon, Daniel Jarrett, Jinsung Yoon, Tavian Barnes, Paul Elbers, Patrick Thoral, Ari Ercole, Cheng Zhang, Danielle Belgrave, Mihaela van der Schaar

Abstract: The clinical time-series setting poses a unique combination of challenges to data modeling and sharing. Due to the high dimensionality of clinical time series, adequate de-identification to preserve privacy while retaining data utility is difficult to achieve using common de-identification techniques. An innovative approach to this problem is synthetic data generation. From a technical perspective, a good generative model for time-series data should preserve temporal dynamics, in the sense that new sequences respect the original relationships between high-dimensional variables across time. From the privacy perspective, the model should prevent patient re-identification by limiting vulnerability to membership inference attacks. The NeurIPS 2020 Hide-and-Seek Privacy Challenge is a novel two-tracked competition to simultaneously accelerate progress in tackling both problems. In our head-to-head format, participants in the synthetic data generation track (i.e. "hiders") and the patient re-identification track (i.e. "seekers") are directly pitted against each other by way of a new, high-quality intensive care time-series dataset: the AmsterdamUMCdb dataset. Ultimately, we seek to advance generative techniques for dense and high-dimensional temporal data streams that are (1) clinically meaningful in terms of fidelity and predictivity, as well as (2) capable of minimizing membership privacy risks in terms of the concrete notion of patient re-identification.

Submitted 24 July, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

Comments: 19 pages, 5 figures. Part of the NeurIPS 2020 competition track

arXiv:2006.14154 [pdf, other]

Strictly Batch Imitation Learning by Energy-based Distribution Matching

Authors: Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

Abstract: Consider learning a policy purely on the basis of demonstrated behavior -- that is, with no access to reinforcement signals, no knowledge of transition dynamics, and no further interaction with the environment. This *strictly batch imitation learning* problem arises wherever live experimentation is costly, such as in healthcare. One solution is simply to retrofit existing algorithms for apprenticeship learning to work in the offline setting. But such an approach leans heavily on off-policy evaluation or offline model estimation, and can be indirect and inefficient. We argue that a good solution should be able to explicitly parameterize a policy (i.e. respecting action conditionals), implicitly learn from rollout dynamics (i.e. leveraging state marginals), and -- crucially -- operate in an entirely offline fashion. To address this challenge, we propose a novel technique by *energy-based distribution matching* (EDM): By identifying parameterizations of the (discriminative) model of a policy with the (generative) energy function for state distributions, EDM yields a simple but effective solution that equivalently minimizes a divergence between the occupancy measure for the demonstrator and a model thereof for the imitator. Through experiments with application to control and healthcare settings, we illustrate consistent performance gains over existing algorithms for strictly batch imitation learning.

Submitted 14 January, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

Comments: In Proc. 34th International Conference on Neural Information Processing Systems (NeurIPS 2020)

arXiv:2006.14141 [pdf, other]

Inverse Active Sensing: Modeling and Understanding Timely Decision-Making

Authors: Daniel Jarrett, Mihaela van der Schaar

Abstract: Evidence-based decision-making entails collecting (costly) observations about an underlying phenomenon of interest, and subsequently committing to an (informed) decision on the basis of accumulated evidence. In this setting, active sensing is the goal-oriented problem of efficiently selecting which acquisitions to make, and when and what decision to settle on. As its complement, inverse active sensing seeks to uncover an agent's preferences and strategy given their observable decision-making behavior. In this paper, we develop an expressive, unified framework for the general setting of evidence-based decision-making under endogenous, context-dependent time pressure---which requires negotiating (subjective) tradeoffs between accuracy, speediness, and cost of information. Using this language, we demonstrate how it enables modeling intuitive notions of surprise, suspense, and optimality in decision strategies (the forward problem). Finally, we illustrate how this formulation enables understanding decision-making behavior by quantifying preferences implicit in observed decision strategies (the inverse problem).

Submitted 24 June, 2020; originally announced June 2020.

Journal ref: In Proc. 37th International Conference on Machine Learning (ICML 2020)

arXiv:2001.08345 [pdf, other]

Target-Embedding Autoencoders for Supervised Representation Learning

Authors: Daniel Jarrett, Mihaela van der Schaar

Abstract: Autoencoder-based learning has emerged as a staple for disciplining representations in unsupervised and semi-supervised settings. This paper analyzes a framework for improving generalization in a purely supervised setting, where the target space is high-dimensional. We motivate and formalize the general framework of target-embedding autoencoders (TEA) for supervised prediction, learning intermediate latent representations jointly optimized to be both predictable from features as well as predictive of targets---encoding the prior that variations in targets are driven by a compact set of underlying factors. As our theoretical contribution, we provide a guarantee of generalization for linear TEAs by demonstrating uniform stability, interpreting the benefit of the auxiliary reconstruction task as a form of regularization. As our empirical contribution, we extend validation of this approach beyond existing static classification applications to multivariate sequence forecasting, verifying their advantage on both linear and nonlinear recurrent architectures---thereby underscoring the further generality of this framework beyond feedforward instantiations.

Submitted 22 January, 2020; originally announced January 2020.

Journal ref: In Proc. 8th International Conference on Learning Representations (ICLR 2020)

arXiv:2001.03898 [pdf, other]

Stepwise Model Selection for Sequence Prediction via Deep Kernel Learning

Authors: Yao Zhang, Daniel Jarrett, Mihaela van der Schaar

Abstract: An essential problem in automated machine learning (AutoML) is that of model selection. A unique challenge in the sequential setting is the fact that the optimal model itself may vary over time, depending on the distribution of features and labels available up to each point in time. In this paper, we propose a novel Bayesian optimization (BO) algorithm to tackle the challenge of model selection in this setting. This is accomplished by treating the performance at each time step as its own black-box function. In order to solve the resulting multiple black-box function optimization problem jointly and efficiently, we exploit potential correlations among black-box functions using deep kernel learning (DKL). To the best of our knowledge, we are the first to formulate the problem of stepwise model selection (SMS) for sequence prediction, and to design and demonstrate an efficient joint-learning algorithm for this purpose. Using multiple real-world datasets, we verify that our proposed method outperforms both standard BO and multi-objective BO algorithms on a variety of sequence prediction tasks.

Submitted 14 February, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

Journal ref: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

arXiv:1811.10746 [pdf, other]

MATCH-Net: Dynamic Prediction in Survival Analysis using Convolutional Neural Networks

Authors: Daniel Jarrett, Jinsung Yoon, Mihaela van der Schaar

Abstract: Accurate prediction of disease trajectories is critical for early identification and timely treatment of patients at risk. Conventional methods in survival analysis are often constrained by strong parametric assumptions and limited in their ability to learn from high-dimensional data, while existing neural network models are not readily-adapted to the longitudinal setting. This paper develops a novel convolutional approach that addresses these drawbacks. We present MATCH-Net: a Missingness-Aware Temporal Convolutional Hitting-time Network, designed to capture temporal dependencies and heterogeneous interactions in covariate trajectories and patterns of missingness. To the best of our knowledge, this is the first investigation of temporal convolutions in the context of dynamic prediction for personalized risk prognosis. Using real-world data from the Alzheimer's Disease Neuroimaging Initiative, we demonstrate state-of-the-art performance without making any assumptions regarding underlying longitudinal or time-to-event processes attesting to the model's potential utility in clinical decision support.

Submitted 26 November, 2018; originally announced November 2018.

Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

Report number: ML4H/2018/36

Showing 1–26 of 26 results for author: Jarrett, D