-
Geometry dependence of TLS noise and loss in a-SiC:H parallel plate capacitors for superconducting microwave resonators
Authors:
K. Kouwenhoven,
G. P. J. van Doorn,
B. T. Buijtendorp,
S. A. H. de Rooij,
D. Lamers,
D. J. Thoen,
V. Murugesan,
J. J. A. Baselmans,
P. J. de Visser
Abstract:
Parallel plate capacitors (PPC) significantly reduce the size of superconducting microwave resonators, reducing the pixel pitch for arrays of single photon energy-resolving kinetic inductance detectors (KIDs). The frequency noise of KIDs is typically limited by tunneling Two-Level Systems (TLS), which originate from lattice defects in the dielectric materials required for PPCs. How the frequency noise level depends on the PPC's dimensions has not been experimentally addressed. We measure the frequency noise of 56 resonators with a-SiC:H PPCs, which cover a factor 44 in PPC area and a factor 4 in dielectric thickness. To support the noise analysis, we measure the TLS-induced, power-dependent, intrinsic loss and temperature-dependent resonance frequency shift of the resonators. From the TLS models, we expect a geometry-independent microwave loss and resonance frequency shift, set by the TLS properties of the dielectric. However, we observe a thickness-dependent microwave loss and resonance frequency shift, explained by surface layers that limit the performance of PPC-based resonators. For a uniform dielectric, the frequency noise level should scale directly inversely with the PPC area and thickness. We observe that an increase in PPC size reduces the frequency noise, but the exact scaling is, in some cases, weaker than expected. Finally, we derive an engineering guideline for the design of KIDs based on PPC-based resonators.
Submitted 8 May, 2024; v1 submitted 21 November, 2023;
originally announced November 2023.
-
Resolving Power of Visible to Near-Infrared Hybrid $β$-Ta/NbTiN Kinetic Inductance Detectors
Authors:
Kevin Kouwenhoven,
Daniel Fan,
Enrico Biancalani,
Steven A. H. de Rooij,
Tawab Karim,
Carlas S. Smith,
Vignesh Murugesan,
David J. Thoen,
Jochem J. A. Baselmans,
Pieter J. de Visser
Abstract:
Kinetic Inductance Detectors (KIDs) are superconducting energy-resolving detectors, sensitive to single photons from the near-infrared to ultraviolet. We study a hybrid KID design consisting of a beta phase tantalum ($β$-Ta) inductor and a NbTiN interdigitated capacitor (IDC). The devices show an average intrinsic quality factor $Q_i$ of 4.3$\times10^5$ $\pm$ 1.3 $\times10^5$. To increase the power captured by the light sensitive inductor, we 3D-print an array of 150$\times$150 $μ$m resin micro lenses on the backside of the sapphire substrate. The shape deviation between design and printed lenses is smaller than 1$μ$m, and the alignment accuracy of this process is $δ_x = +5.8 \pm 0.5$ $μ$m and $δ_y = +8.3 \pm 3.3$ $μ$m. We measure a resolving power for 1545-402 nm that is limited to 4.9 by saturation in the KID's phase response. We can model the saturation in the phase response with the evolution of the number of quasiparticles generated by a photon event. An alternative coordinate system that has a linear response raises the resolving power to 5.9 at 402 nm. We verify the measured resolving power with a two-line measurement using a laser source and a monochromator. We discuss several improvements that can be made to the devices on a route towards KID arrays with high resolving powers.
Submitted 13 February, 2023; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Phonon-trapping enhanced energy resolution in superconducting single photon detectors
Authors:
Pieter J. de Visser,
Steven A. H. de Rooij,
Vignesh Murugesan,
David J. Thoen,
Jochem J. A. Baselmans
Abstract:
A noiseless, photon counting detector, which resolves the energy of each photon, could radically change astronomy, biophysics and quantum optics. Superconducting detectors promise an intrinsic resolving power at visible wavelengths of $R=E/δE\approx100$ due to their low excitation energy. We study superconducting energy-resolving Microwave Kinetic Inductance Detectors (MKIDs), which hold particular promise for larger cameras. A visible/near-infrared photon absorbed in the superconductor creates a few thousand quasiparticles through several stages of electron-phonon interaction. Here we demonstrate experimentally that the resolving power of MKIDs at visible to near-infrared wavelengths is limited by the loss of hot phonons during this process. We measure the resolving power of our aluminum-based detector as a function of photon energy using four lasers with wavelengths between $1545-402$ nm. For detectors on thick SiN/Si and sapphire substrates the resolving power is limited to $10-21$ for the respective wavelengths, consistent with the loss of hot phonons. When we suspend the sensitive part of the detector on a 110 nm thick SiN membrane, the measured resolving power improves to $19-52$ respectively. The improvement is equivalent to a factor $8\pm2$ stronger phonon trapping on the membrane, which is consistent with a geometrical phonon propagation model for these hot phonons. We discuss a route towards the Fano limit by phonon engineering.
Submitted 11 March, 2021;
originally announced March 2021.
-
Strong Reduction of Quasiparticle Fluctuations in a Superconductor due to Decoupling of the Quasiparticle Number and Lifetime
Authors:
Steven A. H. de Rooij,
Jochem J. A. Baselmans,
Vignesh Murugesan,
David J. Thoen,
Pieter J. de Visser
Abstract:
We measure temperature dependent quasiparticle fluctuations in a small Al volume, embedded in a NbTiN superconducting microwave resonator. The resonator design allows for read-out close to equilibrium. By placing the Al film on a membrane, we enhance the fluctuation level and separate quasiparticle from phonon effects. When lowering the temperature, the recombination time saturates and the fluctuation level reduces a factor $\sim$100. From this we deduce that the number of free quasiparticles is still thermal. Therefore, the theoretical, inverse relation between quasiparticle number and recombination time is invalid in this experiment. This is consistent with quasiparticle trapping, where on-trap recombination limits the observed quasiparticle lifetime.
Submitted 24 November, 2021; v1 submitted 8 March, 2021;
originally announced March 2021.
-
A tutorial on MDL hypothesis testing for graph analysis
Authors:
Peter Bloem,
Steven de Rooij
Abstract:
This document provides a tutorial description of the use of the MDL principle in complex graph analysis. We give a brief summary of the preliminary subjects, and describe the basic principle, using the example of analysing the size of the largest clique in a graph. We also provide a discussion of how to interpret the results of such an analysis, making note of several common pitfalls.
Submitted 31 October, 2018;
originally announced October 2018.
-
An Expectation-Maximization Algorithm for the Fractal Inverse Problem
Authors:
Peter Bloem,
Steven de Rooij
Abstract:
We present an Expectation-Maximization algorithm for the fractal inverse problem: the problem of fitting a fractal model to data. In our setting the fractals are Iterated Function Systems (IFS), with similitudes as the family of transformations. The data is a point cloud in ${\mathbb R}^H$ with arbitrary dimension $H$. Each IFS defines a probability distribution on ${\mathbb R}^H$, so that the fractal inverse problem can be cast as a problem of parameter estimation. We show that the algorithm reconstructs well-known fractals from data, with the model converging to high precision parameters. We also show the utility of the model as an approximation for datasources outside the IFS model class.
Submitted 30 June, 2017; v1 submitted 9 June, 2017;
originally announced June 2017.
-
Large-scale network motif analysis using compression
Authors:
Peter Bloem,
Steven de Rooij
Abstract:
We introduce a new method for finding network motifs: interesting or informative subgraph patterns in a network. Subgraphs are motifs when their frequency in the data is high compared to the expected frequency under a null model. To compute this expectation, a full or approximate count of the occurrences of a motif is normally repeated on as many as 1000 random graphs sampled from the null model; a prohibitively expensive step. We use ideas from the Minimum Description Length (MDL) literature to define a new measure of motif relevance. With our method, samples from the null model are not required. Instead we compute the probability of the data under the null model and compare this to the probability under a specially designed alternative model. With this new relevance test, we can search for motifs by random sampling, rather than requiring an accurate count of all instances of a motif. This allows motif analysis to scale to networks with billions of links.
Submitted 18 May, 2019; v1 submitted 8 January, 2017;
originally announced January 2017.
-
Universal Codes from Switching Strategies
Authors:
Wouter M. Koolen,
Steven de Rooij
Abstract:
We discuss algorithms for combining sequential prediction strategies, a task which can be viewed as a natural generalisation of the concept of universal coding. We describe a graphical language based on Hidden Markov Models for defining prediction strategies, and we provide both existing and new models as examples. The models include efficient, parameterless models for switching between the input strategies over time, including a model for the case where switches tend to occur in clusters, and finally a new model for the scenario where the prediction strategies have a known relationship, and where jumps are typically between strongly related ones. This last model is relevant for coding time series data where parameter drift is expected. As theoretical contributions we introduce an interpolation construction that is useful in the development and analysis of new algorithms, and we establish a new sophisticated lemma for analysing the individual sequence regret of parameterised models.
Submitted 25 November, 2013;
originally announced November 2013.
-
Follow the Leader If You Can, Hedge If You Must
Authors:
Steven de Rooij,
Tim van Erven,
Peter D. Grünwald,
Wouter M. Koolen
Abstract:
Follow-the-Leader (FTL) is an intuitive sequential prediction strategy that guarantees constant regret in the stochastic setting, but has terrible performance for worst-case data. Other hedging strategies have better worst-case guarantees but may perform much worse than FTL if the data are not maximally adversarial. We introduce the FlipFlop algorithm, which is the first method that provably combines the best of both worlds.
As part of our construction, we develop AdaHedge, which is a new way of dynamically tuning the learning rate in Hedge without using the doubling trick. AdaHedge refines a method by Cesa-Bianchi, Mansour and Stoltz (2007), yielding slightly improved worst-case guarantees. By interleaving AdaHedge and FTL, the FlipFlop algorithm achieves regret within a constant factor of the FTL regret, without sacrificing AdaHedge's worst-case guarantees.
AdaHedge and FlipFlop do not need to know the range of the losses in advance; moreover, unlike earlier methods, both have the intuitive property that the issued weights are invariant under rescaling and translation of the losses. The losses are also allowed to be negative, in which case they may be interpreted as gains.
Submitted 17 January, 2013; v1 submitted 3 January, 2013;
originally announced January 2013.
-
Adaptive Hedge
Authors:
Tim van Erven,
Peter Grünwald,
Wouter M. Koolen,
Steven de Rooij
Abstract:
Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case performance, leading to suboptimal performance on easy instances, for example when there exists an action that is significantly better than all others. We propose a new way of setting the learning rate, which adapts to the difficulty of the learning problem: in the worst case our procedure still guarantees optimal performance, but on easy instances it achieves much smaller regret. In particular, our adaptive method achieves constant regret in a probabilistic setting, when there exists an action that on average obtains strictly smaller loss than all other actions. We also provide a simulation study comparing our approach to existing methods.
Submitted 28 October, 2011;
originally announced October 2011.
-
Probability-free pricing of adjusted American lookbacks
Authors:
A. Philip Dawid,
Steven de Rooij,
Peter Grunwald,
Wouter M. Koolen,
Glenn Shafer,
Alexander Shen,
Nikolai Vereshchagin,
Vladimir Vovk
Abstract:
Consider an American option that pays G(X^*_t) when exercised at time t, where G is a positive increasing function, X^*_t := \sup_{s\le t}X_s, and X_s is the price of the underlying security at time s. Assuming zero interest rates, we show that the seller of this option can hedge his position by trading in the underlying security if he begins with initial capital X_0\int_{X_0}^{\infty}G(x)x^{-2}dx (and this is the smallest initial capital that allows him to hedge his position). This leads to strategies for trading that are always competitive both with a given strategy's current performance and, to a somewhat lesser degree, with its best performance so far. It also leads to methods of statistical testing that avoid sacrificing too much of the maximum statistical significance that they achieve in the course of accumulating data.
Submitted 20 August, 2011;
originally announced August 2011.
-
Insuring against loss of evidence in game-theoretic probability
Authors:
A. Philip Dawid,
Steven de Rooij,
Glenn Shafer,
Alexander Shen,
Nikolai Vereshchagin,
Vladimir Vovk
Abstract:
We consider the game-theoretic scenario of testing the performance of Forecaster by Sceptic who gambles against the forecasts. Sceptic's current capital is interpreted as the amount of evidence he has found against Forecaster. Reporting the maximum of Sceptic's capital so far exaggerates the evidence. We characterize the set of all increasing functions that remove the exaggeration. This result can be used for insuring against loss of evidence.
Submitted 21 October, 2010; v1 submitted 11 May, 2010;
originally announced May 2010.
-
Catching Up Faster by Switching Sooner: A Prequential Solution to the AIC-BIC Dilemma
Authors:
Tim van Erven,
Peter Grunwald,
Steven de Rooij
Abstract:
Bayesian model averaging, model selection and its approximations such as BIC are generally statistically consistent, but sometimes achieve slower rates of convergence than other methods such as AIC and leave-one-out cross-validation. On the other hand, these other methods can be inconsistent. We identify the "catch-up phenomenon" as a novel explanation for the slow convergence of Bayesian methods. Based on this analysis we define the switch distribution, a modification of the Bayesian marginal distribution. We show that, under broad conditions, model selection and prediction based on the switch distribution is both consistent and achieves optimal convergence rates, thereby resolving the AIC-BIC dilemma. The method is practical; we give an efficient implementation. The switch distribution has a data compression interpretation, and can thus be viewed as a "prequential" or MDL method; yet it is different from the MDL methods that are usually considered in the literature. We compare the switch distribution to Bayes factor model selection and leave-one-out cross-validation.
Submitted 7 July, 2008;
originally announced July 2008.
-
Combining Expert Advice Efficiently
Authors:
Wouter Koolen,
Steven de Rooij
Abstract:
We show how models for prediction with expert advice can be defined concisely and clearly using hidden Markov models (HMMs); standard HMM algorithms can then be used to efficiently calculate, among other things, how the expert predictions should be weighted according to the model. We cast many existing models as HMMs and recover the best known running times in each case. We also describe two new models: the switch distribution, which was recently developed to improve Bayesian/Minimum Description Length model selection, and a new generalisation of the fixed share algorithm based on run-length coding. We give loss bounds for all models and shed new light on their relationships.
Submitted 15 February, 2008; v1 submitted 14 February, 2008;
originally announced February 2008.
-
Approximating Rate-Distortion Graphs of Individual Data: Experiments in Lossy Compression and Denoising
Authors:
Steven de Rooij,
Paul Vitanyi
Abstract:
Classical rate-distortion theory requires knowledge of an elusive source distribution. Instead, we analyze rate-distortion properties of individual objects using the recently developed algorithmic rate-distortion theory. The latter is based on the noncomputable notion of Kolmogorov complexity. To apply the theory we approximate the Kolmogorov complexity by standard data compression techniques, and perform a number of experiments with lossy compression and denoising of objects from different domains. We also introduce a natural generalization to lossy compression with side information. To maintain full generality we need to address a difficult searching problem. While our solutions are therefore not time efficient, we do observe good denoising and compression performance.
Submitted 21 September, 2006;
originally announced September 2006.
-
Asymptotic Log-loss of Prequential Maximum Likelihood Codes
Authors:
Peter Grunwald,
Steven de Rooij
Abstract:
We analyze the Dawid-Rissanen prequential maximum likelihood codes relative to one-parameter exponential family models M. If data are i.i.d. according to an (essentially) arbitrary P, then the redundancy grows at rate c/2 ln n. We show that c=v1/v2, where v1 is the variance of P, and v2 is the variance of the distribution m* in M that is closest to P in KL divergence. This shows that prequential codes behave quite differently from other important universal codes such as the 2-part MDL, Shtarkov and Bayes codes, for which c=1. This behavior is undesirable in an MDL model selection setting.
Submitted 1 February, 2005;
originally announced February 2005.
-
An Empirical Study of MDL Model Selection with Infinite Parametric Complexity
Authors:
Steven de Rooij,
Peter Grunwald
Abstract:
Parametric complexity is a central concept in MDL model selection. In practice it often turns out to be infinite, even for quite simple models such as the Poisson and Geometric families. In such cases, MDL model selection as based on NML and Bayesian inference based on Jeffreys' prior can not be used. Several ways to resolve this problem have been proposed. We conduct experiments to compare and evaluate their behaviour on small sample sizes.
We find interestingly poor behaviour for the plug-in predictive code; a restricted NML model performs quite well but it is questionable if the results validate its theoretical motivation. The Bayesian model with the improper Jeffreys' prior is the most dependable.
Submitted 14 January, 2005;
originally announced January 2005.