-
Noise reduction by bias cooling in gated Si/SixGe1-x quantum dots
Authors:
Julian Ferrero,
Thomas Koch,
Sonja Vogel,
Daniel Schroller,
Viktor Adam,
Ran Xue,
Inga Seidler,
Lars R. Schreiber,
Hendrik Bluhm,
Wolfgang Wernsdorfer
Abstract:
Silicon-Germanium heterostructures are a promising quantum circuit platform, but crucial aspects as the long-term charge dynamics and cooldown-to-cooldown variations are still widely unexplored quantitatively. In this letter we present the results of an extensive bias cooling study performed on gated silicon-germanium quantum dots with an Al2O3-dielectric. Over 80 cooldowns were performed in the c…
▽ More
Silicon-Germanium heterostructures are a promising quantum circuit platform, but crucial aspects as the long-term charge dynamics and cooldown-to-cooldown variations are still widely unexplored quantitatively. In this letter we present the results of an extensive bias cooling study performed on gated silicon-germanium quantum dots with an Al2O3-dielectric. Over 80 cooldowns were performed in the course of our investigations. The performance of the devices is assessed by low-frequency charge noise measurements in the band of 200 micro Hertz to 10 milli Hertz. We measure the total noise power as a function of the applied voltage during cooldown in four different devices and find a minimum in noise at 0.7V bias cooling voltage for all observed samples. We manage to decrease the total noise power median by a factor of 6 and compute a reduced tunneling current density using Schrödinger-Poisson simulations. Furthermore, we show the variation in noise from the same device in the course of eleven different cooldowns performed under the nominally same conditions.
△ Less
Submitted 8 May, 2024; v1 submitted 30 April, 2024;
originally announced May 2024.
-
Variational Gaussian Process Diffusion Processes
Authors:
Prakhar Verma,
Vincent Adam,
Arno Solin
Abstract:
Diffusion processes are a class of stochastic differential equations (SDEs) providing a rich family of expressive models that arise naturally in dynamic modelling tasks. Probabilistic inference and learning under generative models with latent processes endowed with a non-linear diffusion process prior are intractable problems. We build upon work within variational inference, approximating the post…
▽ More
Diffusion processes are a class of stochastic differential equations (SDEs) providing a rich family of expressive models that arise naturally in dynamic modelling tasks. Probabilistic inference and learning under generative models with latent processes endowed with a non-linear diffusion process prior are intractable problems. We build upon work within variational inference, approximating the posterior process as a linear diffusion process, and point out pathologies in the approach. We propose an alternative parameterization of the Gaussian variational process using a site-based exponential family description. This allows us to trade a slow inference algorithm with fixed-point iterations for a fast algorithm for convex optimization akin to natural gradient descent, which also provides a better objective for learning model parameters.
△ Less
Submitted 27 February, 2024; v1 submitted 3 June, 2023;
originally announced June 2023.
-
Dual Parameterization of Sparse Variational Gaussian Processes
Authors:
Vincent Adam,
Paul E. Chang,
Mohammad Emtiyaz Khan,
Arno Solin
Abstract:
Sparse variational Gaussian process (SVGP) methods are a common choice for non-conjugate Gaussian process inference because of their computational benefits. In this paper, we improve their computational efficiency by using a dual parameterization where each data example is assigned dual parameters, similarly to site parameters used in expectation propagation. Our dual parameterization speeds-up in…
▽ More
Sparse variational Gaussian process (SVGP) methods are a common choice for non-conjugate Gaussian process inference because of their computational benefits. In this paper, we improve their computational efficiency by using a dual parameterization where each data example is assigned dual parameters, similarly to site parameters used in expectation propagation. Our dual parameterization speeds-up inference using natural gradient descent, and provides a tighter evidence lower bound for hyperparameter learning. The approach has the same memory cost as the current SVGP methods, but it is faster and more accurate.
△ Less
Submitted 19 January, 2022; v1 submitted 5 November, 2021;
originally announced November 2021.
-
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
Authors:
John McLeod,
Hrvoje Stojic,
Vincent Adam,
Dongho Kim,
Jordi Grau-Moya,
Peter Vrancx,
Felix Leibfried
Abstract:
In the past decade, model-free reinforcement learning (RL) has provided solutions to challenging domains such as robotics. Model-based RL shows the prospect of being more sample-efficient than model-free methods in terms of agent-environment interactions, because the model enables to extrapolate to unseen situations. In the more recent past, model-based methods have shown superior results compared…
▽ More
In the past decade, model-free reinforcement learning (RL) has provided solutions to challenging domains such as robotics. Model-based RL shows the prospect of being more sample-efficient than model-free methods in terms of agent-environment interactions, because the model enables to extrapolate to unseen situations. In the more recent past, model-based methods have shown superior results compared to model-free methods in some challenging domains with non-linear state transitions. At the same time, it has become apparent that RL is not market-ready yet and that many real-world applications are going to require model-based approaches, because model-free methods are too sample-inefficient and show poor performance in early stages of training. The latter is particularly important in industry, e.g. in production systems that directly impact a company's revenue. This demonstrates the necessity for a toolbox to push the boundaries for model-based RL. While there is a plethora of toolboxes for model-free RL, model-based RL has received little attention in terms of toolbox development. Bellman aims to fill this gap and introduces the first thoroughly designed and tested model-based RL toolbox using state-of-the-art software engineering practices. Our modular approach enables to combine a wide range of environment models with generic model-based agent classes that recover state-of-the-art algorithms. We also provide an experiment harness to compare both model-free and model-based agents in a systematic fashion w.r.t. user-defined evaluation metrics (e.g. cumulative reward). This paves the way for new research directions, e.g. investigating uncertainty-aware environment models that are not necessarily neural-network-based, or develo** algorithms to solve industrially-motivated benchmarks that share characteristics with real-world problems.
△ Less
Submitted 13 April, 2021; v1 submitted 26 March, 2021;
originally announced March 2021.
-
Sparse Algorithms for Markovian Gaussian Processes
Authors:
William J. Wilkinson,
Arno Solin,
Vincent Adam
Abstract:
Approximate Bayesian inference methods that scale to very large datasets are crucial in leveraging probabilistic models for real-world time series. Sparse Markovian Gaussian processes combine the use of inducing variables with efficient Kalman filter-like recursions, resulting in algorithms whose computational and memory requirements scale linearly in the number of inducing points, whilst also ena…
▽ More
Approximate Bayesian inference methods that scale to very large datasets are crucial in leveraging probabilistic models for real-world time series. Sparse Markovian Gaussian processes combine the use of inducing variables with efficient Kalman filter-like recursions, resulting in algorithms whose computational and memory requirements scale linearly in the number of inducing points, whilst also enabling parallel parameter updates and stochastic optimisation. Under this paradigm, we derive a general site-based approach to approximate inference, whereby we approximate the non-Gaussian likelihood with local Gaussian terms, called sites. Our approach results in a suite of novel sparse extensions to algorithms from both the machine learning and signal processing literature, including variational inference, expectation propagation, and the classical nonlinear Kalman smoothers. The derived methods are suited to large time series, and we also demonstrate their applicability to spatio-temporal data, where the model has separate inducing points in both time and space.
△ Less
Submitted 9 June, 2021; v1 submitted 19 March, 2021;
originally announced March 2021.
-
A Framework for Interdomain and Multioutput Gaussian Processes
Authors:
Mark van der Wilk,
Vincent Dutordoir,
ST John,
Artem Artemev,
Vincent Adam,
James Hensman
Abstract:
One obstacle to the use of Gaussian processes (GPs) in large-scale problems, and as a component in deep learning system, is the need for bespoke derivations and implementations for small variations in the model or inference. In order to improve the utility of GPs we need a modular system that allows rapid implementation and testing, as seen in the neural network community. We present a mathematica…
▽ More
One obstacle to the use of Gaussian processes (GPs) in large-scale problems, and as a component in deep learning system, is the need for bespoke derivations and implementations for small variations in the model or inference. In order to improve the utility of GPs we need a modular system that allows rapid implementation and testing, as seen in the neural network community. We present a mathematical and software framework for scalable approximate inference in GPs, which combines interdomain approximations and multiple outputs. Our framework, implemented in GPflow, provides a unified interface for many existing multioutput models, as well as more recent convolutional structures. This simplifies the creation of deep models with GPs, and we hope that this work will encourage more interest in this approach.
△ Less
Submitted 2 March, 2020;
originally announced March 2020.
-
Non-linear regression models for behavioral and neural data analysis
Authors:
Vincent Adam,
Alexandre Hyafil
Abstract:
Regression models are popular tools in empirical sciences to infer the influence of a set of variables onto a dependent variable given an experimental dataset. In neuroscience and cognitive psychology, Generalized Linear Models (GLMs) -including linear regression, logistic regression, and Poisson GLM- is the regression model of choice to study the factors that drive participant's choices, reaction…
▽ More
Regression models are popular tools in empirical sciences to infer the influence of a set of variables onto a dependent variable given an experimental dataset. In neuroscience and cognitive psychology, Generalized Linear Models (GLMs) -including linear regression, logistic regression, and Poisson GLM- is the regression model of choice to study the factors that drive participant's choices, reaction times and neural activations. These methods are however limited as they only capture linear contributions of each regressors. Here, we introduce an extension of GLMs called Generalized Unrestricted Models (GUMs), which allows to infer a much richer set of contributions of the regressors to the dependent variable, including possible interactions between the regressors. In a GUM, each regressor is passed through a linear or nonlinear function, and the contribution of the different resulting transformed regressors can be summed or multiplied to generate a predictor for the dependent variable. We propose a Bayesian treatment of these models in which we endow functions with Gaussian Process priors, and we present two methods to compute a posterior over the functions given a dataset: the Laplace method and a sparse variational approach, which scales better for large dataset. For each method, we assess the quality of the model estimation and we detail how the hyperparameters (defining for example the expected smoothness of the function) can be fitted. Finally, we illustrate the power of the method on a behavioral dataset where subjects reported the average perceived orientation of a series of gratings. The method allows to recover the map** of the grating angle onto perceptual evidence for each subject, as well as the impact of the grating based on its position. Overall, GUMs provides a very rich and flexible framework to run nonlinear regression analysis in neuroscience, psychology, and beyond.
△ Less
Submitted 3 February, 2020;
originally announced February 2020.
-
Doubly Sparse Variational Gaussian Processes
Authors:
Vincent Adam,
Stefanos Eleftheriadis,
Nicolas Durrande,
Artem Artemev,
James Hensman
Abstract:
The use of Gaussian process models is typically limited to datasets with a few tens of thousands of observations due to their complexity and memory footprint. The two most commonly used methods to overcome this limitation are 1) the variational sparse approximation which relies on inducing points and 2) the state-space equivalent formulation of Gaussian processes which can be seen as exploiting so…
▽ More
The use of Gaussian process models is typically limited to datasets with a few tens of thousands of observations due to their complexity and memory footprint. The two most commonly used methods to overcome this limitation are 1) the variational sparse approximation which relies on inducing points and 2) the state-space equivalent formulation of Gaussian processes which can be seen as exploiting some sparsity in the precision matrix. We propose to take the best of both worlds: we show that the inducing point framework is still valid for state space models and that it can bring further computational and memory savings. Furthermore, we provide the natural gradient formulation for the proposed variational parameterisation. Finally, this work makes it possible to use the state-space formulation inside deep Gaussian process models as illustrated in one of the experiments.
△ Less
Submitted 15 January, 2020;
originally announced January 2020.
-
Disentangled Skill Embeddings for Reinforcement Learning
Authors:
Janith C. Petangoda,
Sergio Pascual-Diaz,
Vincent Adam,
Peter Vrancx,
Jordi Grau-Moya
Abstract:
We propose a novel framework for multi-task reinforcement learning (MTRL). Using a variational inference formulation, we learn policies that generalize across both changing dynamics and goals. The resulting policies are parametrized by shared parameters that allow for transfer between different dynamics and goal conditions, and by task-specific latent-space embeddings that allow for specialization…
▽ More
We propose a novel framework for multi-task reinforcement learning (MTRL). Using a variational inference formulation, we learn policies that generalize across both changing dynamics and goals. The resulting policies are parametrized by shared parameters that allow for transfer between different dynamics and goal conditions, and by task-specific latent-space embeddings that allow for specialization to particular tasks. We show how the latent-spaces enable generalization to unseen dynamics and goals conditions. Additionally, policies equipped with such embeddings serve as a space of skills (or options) for hierarchical reinforcement learning. Since we can change task dynamics and goals independently, we name our framework Disentangled Skill Embeddings (DSE).
△ Less
Submitted 21 June, 2019;
originally announced June 2019.
-
Banded Matrix Operators for Gaussian Markov Models in the Automatic Differentiation Era
Authors:
Nicolas Durrande,
Vincent Adam,
Lucas Bordeaux,
Stefanos Eleftheriadis,
James Hensman
Abstract:
Banded matrices can be used as precision matrices in several models including linear state-space models, some Gaussian processes, and Gaussian Markov random fields. The aim of the paper is to make modern inference methods (such as variational inference or gradient-based sampling) available for Gaussian models with banded precision. We show that this can efficiently be achieved by equip** an auto…
▽ More
Banded matrices can be used as precision matrices in several models including linear state-space models, some Gaussian processes, and Gaussian Markov random fields. The aim of the paper is to make modern inference methods (such as variational inference or gradient-based sampling) available for Gaussian models with banded precision. We show that this can efficiently be achieved by equip** an automatic differentiation framework, such as TensorFlow or PyTorch, with some linear algebra operators dedicated to banded matrices. This paper studies the algorithmic aspects of the required operators, details their reverse-mode derivatives, and show that their complexity is linear in the number of observations.
△ Less
Submitted 26 February, 2019;
originally announced February 2019.
-
Scalable GAM using sparse variational Gaussian processes
Authors:
Vincent Adam,
Nicolas Durrande,
ST John
Abstract:
Generalized additive models (GAMs) are a widely used class of models of interest to statisticians as they provide a flexible way to design interpretable models of data beyond linear models. We here propose a scalable and well-calibrated Bayesian treatment of GAMs using Gaussian processes (GPs) and leveraging recent advances in variational inference. We use sparse GPs to represent each component an…
▽ More
Generalized additive models (GAMs) are a widely used class of models of interest to statisticians as they provide a flexible way to design interpretable models of data beyond linear models. We here propose a scalable and well-calibrated Bayesian treatment of GAMs using Gaussian processes (GPs) and leveraging recent advances in variational inference. We use sparse GPs to represent each component and exploit the additive structure of the model to efficiently represent a Gaussian a posteriori coupling between the components.
△ Less
Submitted 28 December, 2018;
originally announced December 2018.
-
Discrete flow posteriors for variational inference in discrete dynamical systems
Authors:
Laurence Aitchison,
Vincent Adam,
Srinivas C. Turaga
Abstract:
Each training step for a variational autoencoder (VAE) requires us to sample from the approximate posterior, so we usually choose simple (e.g. factorised) approximate posteriors in which sampling is an efficient computation that fully exploits GPU parallelism. However, such simple approximate posteriors are often insufficient, as they eliminate statistical dependencies in the posterior. While it i…
▽ More
Each training step for a variational autoencoder (VAE) requires us to sample from the approximate posterior, so we usually choose simple (e.g. factorised) approximate posteriors in which sampling is an efficient computation that fully exploits GPU parallelism. However, such simple approximate posteriors are often insufficient, as they eliminate statistical dependencies in the posterior. While it is possible to use normalizing flow approximate posteriors for continuous latents, some problems have discrete latents and strong statistical dependencies. The most natural approach to model these dependencies is an autoregressive distribution, but sampling from such distributions is inherently sequential and thus slow. We develop a fast, parallel sampling procedure for autoregressive distributions based on fixed-point iterations which enables efficient and accurate variational inference in discrete state-space latent variable dynamical systems. To optimize the variational bound, we considered two ways to evaluate probabilities: inserting the relaxed samples directly into the pmf for the discrete distribution, or converting to continuous logistic latent variables and interpreting the K-step fixed-point iterations as a normalizing flow. We found that converting to continuous latent variables gave considerable additional scope for mismatch between the true and approximate posteriors, which resulted in biased inferences, we thus used the former approach. Using our fast sampling procedure, we were able to realize the benefits of correlated posteriors, including accurate uncertainty estimates for one cell, and accurate connectivity estimates for multiple cells, in an order of magnitude less time.
△ Less
Submitted 28 May, 2018;
originally announced May 2018.
-
Structured Variational Inference for Coupled Gaussian Processes
Authors:
Vincent Adam
Abstract:
Sparse variational approximations allow for principled and scalable inference in Gaussian Process (GP) models. In settings where several GPs are part of the generative model, theses GPs are a posteriori coupled. For many applications such as regression where predictive accuracy is the quantity of interest, this coupling is not crucial. Howewer if one is interested in posterior uncertainty, it cann…
▽ More
Sparse variational approximations allow for principled and scalable inference in Gaussian Process (GP) models. In settings where several GPs are part of the generative model, theses GPs are a posteriori coupled. For many applications such as regression where predictive accuracy is the quantity of interest, this coupling is not crucial. Howewer if one is interested in posterior uncertainty, it cannot be ignored. A key element of variational inference schemes is the choice of the approximate posterior parameterization. When the number of latent variables is large, mean field (MF) methods provide fast and accurate posterior means while more structured posterior lead to inference algorithm of greater computational complexity. Here, we extend previous sparse GP approximations and propose a novel parameterization of variational posteriors in the multi-GP setting allowing for fast and scalable inference capturing posterior dependencies.
△ Less
Submitted 29 November, 2017; v1 submitted 3 November, 2017;
originally announced November 2017.
-
Structure of superoxide reductase bound to ferrocyanide and active site expansion upon X-ray-induced photo-reduction
Authors:
Virgile Adam,
Antoine Royant,
Vincent Nivière,
Fernando P Molina-Heredia,
Dominique Bourgeois
Abstract:
Some sulfate-reducing and microaerophilic bacteria rely on the enzyme superoxide reductase (SOR) to eliminate the toxic superoxide anion radical (O2*-). SOR catalyses the one-electron reduction of O2*- to hydrogen peroxide at a nonheme ferrous iron center. The structures of Desulfoarculus baarsii SOR (mutant E47A) alone and in complex with ferrocyanide were solved to 1.15 and 1.7 A resolution, res…
▽ More
Some sulfate-reducing and microaerophilic bacteria rely on the enzyme superoxide reductase (SOR) to eliminate the toxic superoxide anion radical (O2*-). SOR catalyses the one-electron reduction of O2*- to hydrogen peroxide at a nonheme ferrous iron center. The structures of Desulfoarculus baarsii SOR (mutant E47A) alone and in complex with ferrocyanide were solved to 1.15 and 1.7 A resolution, respectively. The latter structure, the first ever reported of a complex between ferrocyanide and a protein, reveals that this organo-metallic compound entirely plugs the SOR active site, coordinating the active iron through a bent cyano bridge. The subtle structural differences between the mixed-valence and the fully reduced SOR-ferrocyanide adducts were investigated by taking advantage of the photoelectrons induced by X-rays. The results reveal that photo-reduction from Fe(III) to Fe(II) of the iron center, a very rapid process under a powerful synchrotron beam, induces an expansion of the SOR active site.
△ Less
Submitted 7 January, 2015;
originally announced January 2015.
-
Detoxification of superoxide without production of H2O2: antioxidant activity of superoxide reductase complexed with ferrocyanide
Authors:
Fernando P Molina-Heredia,
Chantal Houée-Levin,
Catherine Berthomieu,
Danièle Touati,
Emilie Tremey,
Vincent Favaudon,
Virgile Adam,
Vincent Nivière
Abstract:
The superoxide radical O(2)(-.) is a toxic by-product of oxygen metabolism. Two O(2)(-.) detoxifying enzymes have been described so far, superoxide dismutase and superoxide reductase (SOR), both forming H2O2 as a reaction product. Recently, the SOR active site, a ferrous iron in a [Fe(2+) (N-His)(4) (S-Cys)] pentacoordination, was shown to have the ability to form a complex with the organometallic…
▽ More
The superoxide radical O(2)(-.) is a toxic by-product of oxygen metabolism. Two O(2)(-.) detoxifying enzymes have been described so far, superoxide dismutase and superoxide reductase (SOR), both forming H2O2 as a reaction product. Recently, the SOR active site, a ferrous iron in a [Fe(2+) (N-His)(4) (S-Cys)] pentacoordination, was shown to have the ability to form a complex with the organometallic compound ferrocyanide. Here, we have investigated in detail the reactivity of the SOR-ferrocyanide complex with O(2)(-.) by pulse and gamma-ray radiolysis, infrared, and UV-visible spectroscopies. The complex reacts very efficiently with O(2)(-.). However, the presence of the ferrocyanide adduct markedly modifies the reaction mechanism of SOR, with the formation of transient intermediates different from those observed for SOR alone. A one-electron redox chemistry appears to be carried out by the ferrocyanide moiety of the complex, whereas the SOR iron site remains in the reduced state. Surprisingly, the toxic H2O2 species is no longer the reaction product. Accordingly, in vivo experiments showed that formation of the SOR-ferrocyanide complex increased the antioxidant capabilities of SOR expressed in an Escherichia coli sodA sodB recA mutant strain. Altogether, these data describe an unprecedented O(2)(-.) detoxification activity, catalyzed by the SOR-ferrocyanide complex, which does not conduct to the production of the toxic H2O2 species.
△ Less
Submitted 7 January, 2015;
originally announced January 2015.
-
Raman-assisted crystallography reveals end-on peroxide intermediates in a nonheme iron enzyme
Authors:
Gergely Katona,
Philippe Carpentier,
Vincent Nivière,
Patricia Amara,
Virgile Adam,
Jérémy Ohana,
Nikolay Tsanov,
Dominique Bourgeois
Abstract:
Iron-peroxide intermediates are central in the reaction cycle of many iron-containing biomolecules. We trapped iron(III)-(hydro)peroxo species in crystals of superoxide reductase (SOR), a nonheme mononuclear iron enzyme that scavenges superoxide radicals. X-ray diffraction data at 1.95 angstrom resolution and Raman spectra recorded in crystallo revealed iron-(hydro)peroxo intermediates with the (h…
▽ More
Iron-peroxide intermediates are central in the reaction cycle of many iron-containing biomolecules. We trapped iron(III)-(hydro)peroxo species in crystals of superoxide reductase (SOR), a nonheme mononuclear iron enzyme that scavenges superoxide radicals. X-ray diffraction data at 1.95 angstrom resolution and Raman spectra recorded in crystallo revealed iron-(hydro)peroxo intermediates with the (hydro)peroxo group bound end-on. The dynamic SOR active site promotes the formation of transient hydrogen bond networks, which presumably assist the cleavage of the iron-oxygen bond in order to release the reaction product, hydrogen peroxide.
△ Less
Submitted 16 December, 2014;
originally announced December 2014.