-
SmoQyDQMC.jl: A flexible implementation of determinant quantum Monte Carlo for Hubbard and electron-phonon interactions
Authors:
Benjamin Cohen-Stead,
Sohan Malkaruge Costa,
James Neuhaus,
Andy Tanjaroon Ly,
Yutan Zhang,
Richard Scalettar,
Kipton Barros,
Steven Johnston
Abstract:
We introduce the SmoQyDQMC.jl package, a Julia implementation of the determinant quantum Monte Carlo algorithm. SmoQyDQMC.jl supports generalized tight-binding Hamiltonians with on-site Hubbard and generalized electron-phonon interactions, including non-linear $e$-ph coupling and anharmonic lattice potentials. Our implementations use hybrid Monte Carlo methods with exact forces for sampling the phonon fields, enabling efficient simulation of low-energy phonon branches, including acoustic phonons. The SmoQyDQMC.jl package also uses a flexible scripting interface, allowing users to adapt it to different workflows and interface with other software packages in the Julia ecosystem. The code for this package can be downloaded from our GitHub repository at https://github.com/SmoQySuite/SmoQyDQMC.jl or installed using the Julia package manager. The online documentation, including examples, can be obtained from our document page at https://smoqysuite.github.io/SmoQyDQMC.jl/stable/.
Submitted 17 April, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
AI and ethics in insurance: a new solution to mitigate proxy discrimination in risk modeling
Authors:
Marguerite Sauce,
Antoine Chancel,
Antoine Ly
Abstract:
The development of Machine Learning is experiencing growing interest from the general public, and in recent years there have been numerous press articles questioning its objectivity: racism, sexism, etc. Driven by the growing attention of regulators on the ethical use of data in insurance, the actuarial community must rethink pricing and risk selection practices for fairer insurance. Equity is a philosophical concept with many different definitions across jurisdictions, which influence each other without currently reaching consensus. In Europe, the Charter of Fundamental Rights defines guidelines on discrimination, and the use of sensitive personal data in algorithms is regulated. Although the simple removal of the protected variables prevents any so-called `direct' discrimination, models are still able to `indirectly' discriminate between individuals thanks to latent interactions between variables, which bring better performance (and therefore a better quantification of risk, segmentation of prices, and so on). After introducing the key concepts related to discrimination, we illustrate the complexity of quantifying them. We then propose an innovative method, not yet found in the literature, to reduce the risks of indirect discrimination using mathematical concepts from linear algebra. This technique is illustrated in a concrete case of risk selection in life insurance, demonstrating its simplicity of use and its promising performance.
Submitted 25 July, 2023;
originally announced July 2023.
-
A comparative study of the superconductivity in the Holstein and optical Su-Schrieffer-Heeger models
Authors:
Andy Tanjaroon Ly,
Benjamin Cohen-Stead,
Sohan Malkaruge Costa,
Steven Johnston
Abstract:
Theoretical studies suggest that Su-Schrieffer-Heeger-like electron-phonon ($e$-ph) interactions can mediate high-temperature bipolaronic superconductivity that is robust against repulsive electron-electron interactions. Here we present a comparative analysis of the pairing and competing charge/bond correlations in the two-dimensional Holstein and optical Su-Schrieffer-Heeger (SSH) models using numerically exact determinant quantum Monte Carlo. We find that the SSH interactions support light bipolarons and strong superconducting correlations out to relatively large values of the $e$-ph coupling $λ$ and densities near half-filling, while the Holstein interaction does not due to the formation of heavy bipolarons and competing charge-density-wave order. We further find that the Holstein and SSH models have comparable pairing correlations in the weak coupling limit for carrier concentrations $\langle n \rangle \ll 1$, where competing orders and polaronic effects are absent. These results support the proposal that SSH (bi)polarons can support superconductivity to larger values of $λ$ in comparison to the Holstein polaron, but that the resulting $T_\mathrm{c}$ gains are small in the weak coupling limit. We also find that the SSH model's pairing correlations are suppressed after including a weak on-site Hubbard repulsion. These results have important implications for identifying and engineering bipolaronic superconductivity.
Submitted 20 July, 2023;
originally announced July 2023.
-
A comparative determinant quantum Monte Carlo study of the acoustic and optical variants of the Su-Schrieffer-Heeger model
Authors:
Sohan Malkaruge Costa,
Benjamin Cohen-Stead,
Andy Tanjaroon Ly,
James Neuhaus,
Steven Johnston
Abstract:
We compare the acoustic Su-Schrieffer-Heeger (SSH) model with two of its optical variants where the phonons are defined either on the sites or on the bonds of the system. First, we discuss how to make fair comparisons between these models in any dimension by ensuring their dimensionless coupling $λ$ and relevant phonon energies are the same. We then use determinant quantum Monte Carlo to perform non-perturbative and sign-problem-free simulations of all three models on one-dimensional chains at and away from half-filling. By comparing the results obtained from each model, we demonstrate that the optical and acoustic models produce nearly identical results within error bars for suitably chosen phonon energies and $λ$ at half-filling. In contrast, the bond model has quantitatively different behavior due to its coupling to the $q = 0$ phonon mode. These differences also manifest in the total length of the chain, which shrinks for the bond model but not for the acoustic and optical models when $λ\neq 0$. Our results have important implications for quantum Monte Carlo modeling of SSH-like interactions, where these models are sometimes regarded as being interchangeable.
Submitted 19 July, 2023;
originally announced July 2023.
-
Empirical prior distributions for Bayesian meta-analyses of binary and time to event outcomes
Authors:
František Bartoš,
Willem M. Otte,
Quentin F. Gronau,
Bram Timmers,
Alexander Ly,
Eric-Jan Wagenmakers
Abstract:
Bayesian model-averaged meta-analysis allows quantification of evidence for both treatment effectiveness $μ$ and across-study heterogeneity $τ$. We use the Cochrane Database of Systematic Reviews to develop discipline-wide empirical prior distributions for $μ$ and $τ$ for meta-analyses of binary and time-to-event clinical trial outcomes. First, we use 50% of the database to estimate parameters of different required parametric families. Second, we use the remaining 50% of the database to select the best-performing parametric families and explore essential assumptions about the presence or absence of the treatment effectiveness and across-study heterogeneity in real data. We find that most meta-analyses of binary outcomes are more consistent with the absence of the meta-analytic effect or heterogeneity while meta-analyses of time-to-event outcomes are more consistent with the presence of the meta-analytic effect or heterogeneity. Finally, we use the complete database - with close to half a million trial outcomes - to propose specific empirical prior distributions, both for the field in general and for specific medical subdisciplines. An example from acute respiratory infections demonstrates how the proposed prior distributions can be used to conduct a Bayesian model-averaged meta-analysis in the open-source software R and JASP.
Submitted 20 June, 2023;
originally announced June 2023.
-
Bayesian Learning of Gas Transport in Three-Dimensional Fracture Networks
Authors:
Yingqi Shi,
Donald J. Berry,
John Kath,
Shams Lodhy,
An Ly,
Allon G. Percus,
Jeffrey D. Hyman,
Kelly Moran,
Justin Strait,
Matthew R. Sweeney,
Hari S. Viswanathan,
Philip H. Stauffer
Abstract:
Modeling gas flow through fractures of subsurface rock is a particularly challenging problem because of the heterogeneous nature of the material. High-fidelity simulations using discrete fracture network (DFN) models are one methodology for predicting gas particle breakthrough times at the surface, but are computationally demanding. We propose a Bayesian machine learning method that serves as an efficient surrogate model, or emulator, for these three-dimensional DFN simulations. Our model trains on a small quantity of simulation data and, using a graph/path-based decomposition of the fracture network, rapidly predicts quantiles of the breakthrough time distribution. The approach, based on Gaussian Process Regression (GPR), outputs predictions that are within 20-30% of high-fidelity DFN simulation results. Unlike previously proposed methods, it also provides uncertainty quantification, outputting confidence intervals that are essential given the uncertainty inherent in subsurface modeling. Our trained model runs within a fraction of a second, which is considerably faster than other methods with comparable accuracy and multiple orders of magnitude faster than high-fidelity simulations.
Submitted 6 June, 2023;
originally announced June 2023.
-
Noise2Music: Text-conditioned Music Generation with Diffusion Models
Authors:
Qingqing Huang,
Daniel S. Park,
Tao Wang,
Timo I. Denk,
Andy Ly,
Nanxin Chen,
Zhengdong Zhang,
Zhishuai Zhang,
Jiahui Yu,
Christian Frank,
Jesse Engel,
Quoc V. Le,
William Chan,
Zhifeng Chen,
Wei Han
Abstract:
We introduce Noise2Music, where a series of diffusion models is trained to generate high-quality 30-second music clips from text prompts. Two types of diffusion models, a generator model, which generates an intermediate representation conditioned on text, and a cascader model, which generates high-fidelity audio conditioned on the intermediate representation and possibly the text, are trained and utilized in succession to generate high-fidelity music. We explore two options for the intermediate representation, one using a spectrogram and the other using audio with lower fidelity. We find that the generated audio is not only able to faithfully reflect key elements of the text prompt such as genre, tempo, instruments, mood, and era, but goes beyond to ground fine-grained semantics of the prompt. Pretrained large language models play a key role in this story -- they are used to generate paired text for the audio of the training set and to extract embeddings of the text prompts ingested by the diffusion models.
Generated examples: https://google-research.github.io/noise2music
Submitted 6 March, 2023; v1 submitted 8 February, 2023;
originally announced February 2023.
-
Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworks
Authors:
Adrian Ly,
Richard Dazeley,
Peter Vamplew,
Francisco Cruz,
Sunil Aryal
Abstract:
The Deep Q-Networks (DQN) algorithm was the first reinforcement learning algorithm to use a deep neural network to successfully surpass human-level performance in a number of Atari learning environments. However, divergent and unstable behaviour have been long-standing issues in DQNs. The unstable behaviour is often characterised by overestimation in the $Q$-values, commonly referred to as the overestimation bias. To address the overestimation bias and the divergent behaviour, a number of heuristic extensions have been proposed. Notably, multi-step updates have been shown to drastically reduce unstable behaviour while improving an agent's training performance. However, agents are often highly sensitive to the selection of the multi-step update horizon ($n$), and our empirical experiments show that a poorly chosen static value for $n$ can in many cases lead to worse performance than single-step DQN. Inspired by the success of $n$-step DQN and the effects that multi-step updates have on overestimation bias, this paper proposes a new algorithm that we call `Elastic Step DQN' (ES-DQN). It dynamically varies the step size horizon in multi-step updates based on the similarity of states visited. Our empirical evaluation shows that ES-DQN outperforms $n$-step updates with fixed $n$, Double DQN and Average DQN in several OpenAI Gym environments while at the same time alleviating the overestimation bias.
Submitted 7 October, 2022;
originally announced October 2022.
-
Applying Machine Learning to Life Insurance: some knowledge sharing to master it
Authors:
Antoine Chancel,
Laura Bradier,
Antoine Ly,
Razvan Ionescu,
Laurene Martin,
Marguerite Sauce
Abstract:
Machine Learning permeates many industries, bringing new sources of benefit for companies. However, within the life insurance industry, Machine Learning is not widely used in practice, as over the past years statistical models have shown their efficiency for risk assessment. Thus insurers may face difficulties in assessing the value of artificial intelligence. Focusing on how the life insurance industry has changed over time highlights what is at stake for insurers in using Machine Learning and the benefits it can bring by unleashing data value. This paper reviews traditional actuarial methodologies for survival modeling and extends them with Machine Learning techniques. It points out differences with regular machine learning models and emphasizes the importance of specific implementations to handle censored data with the machine learning family of models. To complement this article, a Python library has been developed. Different open-source Machine Learning algorithms have been adapted to the specificities of life insurance data, namely censoring and truncation. Such models can be easily applied from this SCOR library to accurately model life insurance risks.
Submitted 27 September, 2022; v1 submitted 5 September, 2022;
originally announced September 2022.
-
Model Transparency and Interpretability : Survey and Application to the Insurance Industry
Authors:
Dimitri Delcaillau,
Antoine Ly,
Alize Papp,
Franck Vermet
Abstract:
The use of models, even if efficient, must be accompanied by an understanding at all levels of the process that transforms data (upstream and downstream). Thus, there is a growing need to define the relationships between individual data and the choice that an algorithm could make based on its analysis (e.g. the recommendation of one product or one promotional offer, or an insurance rate representative of the risk). Model users must ensure that models do not discriminate and that it is also possible to explain their results. This paper introduces the importance of model interpretation and tackles the notion of model transparency. Within an insurance context, it specifically illustrates how some tools can be used to enforce the control of actuarial models that can nowadays leverage machine learning. Using a simple example of loss frequency estimation in car insurance, we show the value of some interpretability methods for adapting explanations to the target audience.
△ Less
Submitted 1 September, 2022;
originally announced September 2022.
-
Evidential Calibration of Confidence Intervals
Authors:
Samuel Pawel,
Alexander Ly,
Eric-Jan Wagenmakers
Abstract:
We present a novel and easy-to-use method for calibrating error-rate based confidence intervals to evidence-based support intervals. Support intervals are obtained from inverting Bayes factors based on a parameter estimate and its standard error. A $k$ support interval can be interpreted as "the observed data are at least $k$ times more likely under the included parameter values than under a specified alternative". Support intervals depend on the specification of prior distributions for the parameter under the alternative, and we present several types that allow different forms of external knowledge to be encoded. We also show how prior specification can to some extent be avoided by considering a class of prior distributions and then computing so-called minimum support intervals which, for a given class of priors, have a one-to-one mapping with confidence intervals. We also illustrate how the sample size of a future study can be determined based on the concept of support. Finally, we show how the bound for the type I error rate of Bayes factors leads to a bound for the coverage of support intervals. An application to data from a clinical trial illustrates how support intervals can lead to inferences that are both intuitive and informative.
Submitted 27 June, 2023; v1 submitted 24 June, 2022;
originally announced June 2022.
-
Stripe correlations in the two-dimensional Hubbard-Holstein model
Authors:
Seher Karakuzu,
Andy Tanjaroon Ly,
Peizhi Mai,
James Neuhaus,
Thomas A. Maier,
Steven Johnston
Abstract:
Several state-of-the-art numerical methods have observed static or fluctuating spin and charge stripes in doped two-dimensional Hubbard models, suggesting that these orders play a significant role in shaping the cuprate phase diagram. Many experiments, however, also indicate that the cuprates have strong electron-phonon ($e$-ph) coupling, and it is unclear how this interaction influences stripe correlations. We study static and fluctuating stripe orders in the doped single-band Hubbard-Holstein model using zero temperature variational Monte Carlo and finite temperature determinant quantum Monte Carlo. We find that the lattice couples more strongly with the charge component of the stripes, leading to an enhancement or suppression of stripe correlations, depending on model parameters like the next-nearest-neighbor hopping $t^\prime$ or phonon energy $Ω$. Our results help elucidate how the $e$-ph interaction can tip the delicate balance between stripe and superconducting correlations in the Hubbard-Holstein model with implications for our understanding of the high-$T_\mathrm{c}$ cuprates.
Submitted 30 May, 2022;
originally announced May 2022.
-
History and Nature of the Jeffreys-Lindley Paradox
Authors:
Eric-Jan Wagenmakers,
Alexander Ly
Abstract:
The Jeffreys-Lindley paradox exposes a rift between Bayesian and frequentist hypothesis testing that strikes at the heart of statistical inference. Contrary to what most current literature suggests, the paradox was central to the Bayesian testing methodology developed by Sir Harold Jeffreys in the late 1930s. Jeffreys showed that the evidence against a point-null hypothesis $\mathcal{H}_0$ scales with $\sqrt{n}$ and repeatedly argued that it would therefore be mistaken to set a threshold for rejecting $\mathcal{H}_0$ at a constant multiple of the standard error. Here we summarize Jeffreys's early work on the paradox and clarify his reasons for including the $\sqrt{n}$ term. The prior distribution is seen to play a crucial role; by implicitly correcting for selection, small parameter values are identified as relatively surprising under $\mathcal{H}_1$. We highlight the general nature of the paradox by presenting both a fully frequentist and a fully Bayesian version. We also demonstrate that the paradox does not depend on assigning prior mass to a point hypothesis, as is commonly believed.
Submitted 20 July, 2022; v1 submitted 19 November, 2021;
originally announced November 2021.
-
Bayesian Model-Averaged Meta-Analysis in Medicine
Authors:
František Bartoš,
Quentin F. Gronau,
Bram Timmers,
Willem M. Otte,
Alexander Ly,
Eric-Jan Wagenmakers
Abstract:
We outline a Bayesian model-averaged meta-analysis for standardized mean differences in order to quantify evidence for both treatment effectiveness $δ$ and across-study heterogeneity $τ$. We construct four competing models by orthogonally combining two present-absent assumptions, one for the treatment effect and one for across-study heterogeneity. To inform the choice of prior distributions for the model parameters, we used 50% of the Cochrane Database of Systematic Reviews to specify rival prior distributions for $δ$ and $τ$. The relative predictive performance of the competing models and rival prior distributions was assessed using the remaining 50% of the Cochrane Database. On average, $\mathcal{H}_1^r$ -- the model that assumes the presence of a treatment effect as well as across-study heterogeneity -- outpredicted the other models, but not by a large margin. Within $\mathcal{H}_1^r$, predictive adequacy was relatively constant across the rival prior distributions. We propose specific empirical prior distributions, both for the field in general and for each of 46 specific medical subdisciplines. An example from oral health demonstrates how the proposed prior distributions can be used to conduct a Bayesian model-averaged meta-analysis in the open-source software R and JASP. The preregistered analysis plan is available at https://osf.io/zs3df/.
Submitted 3 October, 2021;
originally announced October 2021.
-
Generic E-Variables for Exact Sequential k-Sample Tests that allow for Optional Stopping
Authors:
Rosanne Turner,
Alexander Ly,
Peter Grünwald
Abstract:
We develop E-variables for testing whether two or more data streams come from the same source or not, and more generally, whether the difference between the sources is larger than some minimal effect size. These E-variables lead to exact, nonasymptotic tests that remain safe, i.e. keep their type-I error guarantees, under flexible sampling scenarios such as optional stopping and continuation. In special cases our E-variables also have an optimal 'growth' property under the alternative. While the construction is generic, we illustrate it through the special case of k x 2 contingency tables, where we also allow for the incorporation of different restrictions on a composite alternative. Comparison to p-value analysis in simulations and a real-world example shows that E-variables, through their flexibility, often allow for early stopping of data collection, thereby retaining similar power as classical methods, while also retaining the option of extending or combining data afterwards.
Submitted 22 June, 2022; v1 submitted 4 June, 2021;
originally announced June 2021.
-
GSPMD: General and Scalable Parallelization for ML Computation Graphs
Authors:
Yuanzhong Xu,
HyoukJoong Lee,
Dehao Chen,
Blake Hechtman,
Yanping Huang,
Rahul Joshi,
Maxim Krikun,
Dmitry Lepikhin,
Andy Ly,
Marcello Maggioni,
Ruoming Pang,
Noam Shazeer,
Shibo Wang,
Tao Wang,
Yonghui Wu,
Zhifeng Chen
Abstract:
We present GSPMD, an automatic, compiler-based parallelization system for common machine learning computations. It allows users to write programs in the same way as for a single device, then give hints through a few annotations on how to distribute tensors, based on which GSPMD will parallelize the computation. Its representation of partitioning is simple yet general, allowing it to express different or mixed paradigms of parallelism on a wide variety of models.
GSPMD infers the partitioning for every operator based on limited user annotations, making it convenient to scale existing single-device programs. It solves several technical challenges for production usage, allowing GSPMD to achieve 50% to 62% compute utilization on up to 2048 Cloud TPUv3 cores for models with up to one trillion parameters.
Submitted 23 December, 2021; v1 submitted 10 May, 2021;
originally announced May 2021.
-
Bayes Factors for Peri-Null Hypotheses
Authors:
Alexander Ly,
Eric-Jan Wagenmakers
Abstract:
A perennial objection against Bayes factor point-null hypothesis tests is that the point-null hypothesis is known to be false from the outset. We examine the consequences of approximating the sharp point-null hypothesis by a hazy `peri-null' hypothesis instantiated as a narrow prior distribution centered on the point of interest. The peri-null Bayes factor then equals the point-null Bayes factor multiplied by a correction term which is itself a Bayes factor. For moderate sample sizes, the correction term is relatively inconsequential; however, for large sample sizes the correction term becomes influential and causes the peri-null Bayes factor to be inconsistent and approach a limit that depends on the ratio of prior ordinates evaluated at the maximum likelihood estimate. We characterize the asymptotic behavior of the peri-null Bayes factor and briefly discuss suggestions on how to construct peri-null Bayes factor hypothesis tests that are also consistent.
Submitted 19 May, 2022; v1 submitted 14 February, 2021;
originally announced February 2021.
-
Learned Indexes for a Google-scale Disk-based Database
Authors:
Hussam Abu-Libdeh,
Deniz Altınbüken,
Alex Beutel,
Ed H. Chi,
Lyric Doshi,
Tim Kraska,
Xiaozhou Li,
Andy Ly,
Christopher Olston
Abstract:
There is great excitement about learned index structures, but understandable skepticism about the practicality of a new method uprooting decades of research on B-Trees. In this paper, we work to remove some of that uncertainty by demonstrating how a learned index can be integrated in a distributed, disk-based database system: Google's Bigtable. We detail several design decisions we made to integrate learned indexes in Bigtable. Our results show that integrating learned index significantly improves the end-to-end read latency and throughput for Bigtable.
△ Less
Submitted 23 December, 2020;
originally announced December 2020.
-
The Anytime-Valid Logrank Test: Error Control Under Continuous Monitoring with Unlimited Horizon
Authors:
J. ter Schure,
M. F. Perez-Ortiz,
A. Ly,
P. Grunwald
Abstract:
We introduce the anytime-valid (AV) logrank test, a version of the logrank test that provides type-I error guarantees under optional stopping and optional continuation. The test is sequential without the need to specify a maximum sample size or stopping rule, and allows for cumulative meta-analysis with type-I error control. The method can be extended to define anytime-valid confidence intervals. The logrank test is an instance of the martingale tests based on E-variables that have been recently developed. We demonstrate type-I error guarantees for the test in a semiparametric setting of proportional hazards and show how to extend it to ties, Cox' regression and confidence sequences. Using a Gaussian approximation on the logrank statistic, we show that the AV logrank test (which itself is always exact) has a similar rejection region to O'Brien-Fleming alpha-spending but with the potential to achieve 100% power by optional continuation. Although our approach to study design requires a larger sample size, the *expected* sample size is competitive by optional stopping.
Submitted 1 May, 2023; v1 submitted 13 November, 2020;
originally announced November 2020.
-
A survey on natural language processing (nlp) and applications in insurance
Authors:
Antoine Ly,
Benno Uthayasooriyar,
Tingting Wang
Abstract:
Text is the most widely used means of communication today. This data is abundant but nevertheless complex to exploit within algorithms. For years, scientists have been trying to implement different techniques that enable computers to replicate some mechanisms of human reading. During the past five years, research has transformed the capacity of algorithms to unleash the value of text data, which today brings many opportunities for the insurance industry. Understanding those methods and, above all, knowing how to apply them is a major challenge and key to unleashing the value of text data that have been stored for many years. Processing language with computers brings many new opportunities, especially in the insurance sector, where reports are central to the information used by insurers. SCOR's Data Analytics team has been working on the implementation of innovative tools or products that enable the use of the latest research on text analysis. Understanding text mining techniques in insurance enhances the monitoring of underwritten risks and many processes that ultimately benefit policyholders. This article explains the opportunities that Natural Language Processing (NLP) provides to insurance. It details different methods used in practice today and traces their history. We also illustrate the implementation of certain methods using open source libraries and Python code that we have developed to facilitate the use of these techniques. After giving a general overview of the evolution of text mining during the past few years, we describe how to conduct a full study with text mining and share some examples of serving those models in insurance products or services. Finally, we explain in more detail every step of a Natural Language Processing study so that the reader can gain a deep understanding of the implementation.
Submitted 1 October, 2020;
originally announced October 2020.
-
Interpretabilité des modèles : état des lieux des méthodes et application à l'assurance
Authors:
Dimitri Delcaillau,
Antoine Ly,
Franck Vermet,
Alizé Papp
Abstract:
Since May 2018, the General Data Protection Regulation (GDPR) has introduced new obligations for industries. By setting a legal framework, it notably imposes strong transparency on the use of personal data. Thus, people must be informed of the use of their data and must consent to it. Data is the raw material of many models which today make it possible to increase the quality and performance of digital services. Transparency on the use of data also requires a good understanding of its use through different models. The use of models, even if efficient, must be accompanied by an understanding at all levels of the process that transforms data (upstream and downstream of a model), thus making it possible to define the relationships between an individual's data and the choice that an algorithm could make based on the analysis of that data (for example, the recommendation of a product or promotional offer, or an insurance rate representative of the risk). Model users must ensure that models do not discriminate and that their results can be explained. The widening of the panel of predictive algorithms, made possible by the evolution of computing capacities, leads scientists to be vigilant about the use of models and to consider new tools to better understand the decisions deduced from them. Recently, the community has been particularly active on model transparency, with a marked intensification of publications over the past three years. The increasingly frequent use of more complex algorithms (deep learning, XGBoost, etc.) with attractive performance is undoubtedly one of the causes of this interest. This article thus presents an inventory of methods for interpreting models and their uses in an insurance context.
Submitted 25 July, 2020;
originally announced July 2020.
-
Default Bayes Factors for Testing the (In)equality of Several Population Variances
Authors:
Fabian Dablander,
Don van den Bergh,
Eric-Jan Wagenmakers,
Alexander Ly
Abstract:
Testing the (in)equality of variances is an important problem in many statistical applications. We develop default Bayes factor tests to assess the (in)equality of two or more population variances, as well as a test for whether the population variances equal a specific value. The resulting test can be used to check assumptions for commonly used procedures such as the $t$-test or ANOVA, or test substantive hypotheses concerning variances directly. We show that our Bayes factor fulfills a number of desiderata. Researchers may have directed hypotheses such as $σ_{1}^{2} > σ_{2}^{2}$, they may want to extend $\mathcal{H}_{0}$ to have a null-region, or wish to combine hypotheses about equality with hypotheses about inequality, for example $σ_{1}^{2} = σ_{2}^{2} > (σ_{3}^{2}, σ_{4}^{2})$. We extend our Bayes factor test to allow for these deviations from our proposed default and illustrate it on a number of practical examples. Our procedure is implemented in the R package $bfvartest$.
Submitted 31 July, 2022; v1 submitted 13 March, 2020;
originally announced March 2020.
-
Photonic devices fabricated from (111) oriented single crystal diamond
Authors:
Blake Regan,
Sejeong Kim,
Anh Tu Huy Ly,
Aleksandra Trycz,
Kerem Bray,
Kumaravelu Ganesan,
Milos Toth,
Igor Aharonovich
Abstract:
Diamond is a material of choice in the pursuit of integrated quantum photonic technologies. So far, the majority of photonic devices fabricated from diamond are made from (100)-oriented crystals. In this work, we demonstrate a methodology for the fabrication of optically-active membranes from (111)-oriented diamond. We use a liftoff technique to generate membranes, followed by chemical vapour deposition of diamond in the presence of silicon to generate homogeneous silicon vacancy colour centers with emission properties that are superior to those in (100)-oriented diamond. We further use the diamond membranes to fabricate high quality microring resonators with quality factors exceeding ~3000. Supported by finite difference time domain calculations, we discuss the advantages of (111)-oriented structures as building blocks for quantum nanophotonic devices.
Submitted 9 September, 2019;
originally announced September 2019.
-
Ultra-low noise dual-frequency VECSEL at telecom wavelength using fully correlated pumping
Authors:
Hui Liu,
Grégory Gredat,
Syamsundar De,
Ihsan Fsaifes,
Aliou Ly,
Rémy Vatré,
Ghaya Baili,
Sophie Bouchoule,
Fabienne Goldfarb,
Fabien Bretenaker
Abstract:
An ultra-low intensity and beatnote phase noise dual-frequency vertical-external-cavity surface-emitting laser is built at telecom wavelength. The pump laser is realized by polarization combining two single-mode fibered laser diodes in a single-mode fiber, leading to a 100 % in-phase correlation of the pump noises for the two modes. The relative intensity noise is lower than -140 dB/Hz, and the beatnote phase noise is suppressed by 30 dB, getting close to the spontaneous emission limit. The role of the imperfect cancellation of the thermal effect resulting from unbalanced pumping of the two modes in the residual phase noise is evidenced.
Submitted 10 December, 2018;
originally announced December 2018.
-
Bayesian Rank-Based Hypothesis Testing for the Rank Sum Test, the Signed Rank Test, and Spearman's $ρ$
Authors:
Johnny van Doorn,
Alexander Ly,
Maarten Marsman,
Eric-Jan Wagenmakers
Abstract:
Bayesian inference for rank-order problems is frustrated by the absence of an explicit likelihood function. This hurdle can be overcome by assuming a latent normal representation that is consistent with the ordinal information in the data: the observed ranks are conceptualized as an impoverished reflection of an underlying continuous scale, and inference concerns the parameters that govern the latent representation. We apply this generic data-augmentation method to obtain Bayes factors for three popular rank-based tests: the rank sum test, the signed rank test, and Spearman's $ρ_s$.
Submitted 17 May, 2019; v1 submitted 19 December, 2017;
originally announced December 2017.
-
Econométrie et Machine Learning
Authors:
Arthur Charpentier,
Emmanuel Flachaire,
Antoine Ly
Abstract:
Econometrics and machine learning seem to have one common goal: to construct a predictive model, for a variable of interest, using explanatory variables (or features). However, these two fields developed in parallel, thus creating two different cultures, to paraphrase Breiman (2001). The first set out to build probabilistic models to describe economic phenomena. The second uses algorithms that learn from their mistakes, most often with the aim of classifying (sounds, images, etc.). Recently, however, learning models have proven to be more effective than traditional econometric techniques (at the price of less explanatory power), and above all, they can handle much larger data sets. In this context, it becomes necessary for econometricians to understand what these two cultures are, what opposes them and especially what brings them closer together, in order to appropriate tools developed by the statistical learning community and integrate them into econometric models.
Submitted 19 March, 2018; v1 submitted 26 July, 2017;
originally announced August 2017.
-
Hydrodynamic interactions in DNA thermophoresis
Authors:
Aboubakry Ly,
Aloïs Würger
Abstract:
We theoretically study the molecular-weight dependence of DNA thermophoresis, which arises from mutual advection of the n repeat units of the molecular chain. As a main result we find that the dominant driving forces, i.e., the thermally induced permittivity gradient and the electrolyte Seebeck effect, result in characteristic hydrodynamic screening. In comparison with recent experimental data on single-stranded DNA (2 $\le$ n $\le$ 80), our theory quantitatively describes the increase of the drift velocity up to n = 30; the slowing-down of longer molecules is well accounted for by a simple model for counterion condensation. It turns out that thermophoresis may change sign as a function of n: For an appropriate choice of the salt-specific Seebeck coefficient, short molecules move to the cold and long ones to the hot; this could be used for separating DNA by molecular weight.
Submitted 22 December, 2017; v1 submitted 29 June, 2017;
originally announced June 2017.
-
A Tutorial on Fisher Information
Authors:
Alexander Ly,
Maarten Marsman,
Josine Verhagen,
Raoul Grasman,
Eric-Jan Wagenmakers
Abstract:
In many statistical applications that concern mathematical psychologists, the concept of Fisher information plays an important role. In this tutorial we clarify the concept of Fisher information as it manifests itself across three different statistical paradigms. First, in the frequentist paradigm, Fisher information is used to construct hypothesis tests and confidence intervals using maximum likelihood estimators; second, in the Bayesian paradigm, Fisher information is used to define a default prior; lastly, in the minimum description length paradigm, Fisher information is used to measure model complexity.
Submitted 17 October, 2017; v1 submitted 2 May, 2017;
originally announced May 2017.
-
Informed Bayesian T-Tests
Authors:
Quentin F. Gronau,
Alexander Ly,
Eric-Jan Wagenmakers
Abstract:
Across the empirical sciences, few statistical procedures rival the popularity of the frequentist t-test. In contrast, the Bayesian versions of the t-test have languished in obscurity. In recent years, however, the theoretical and practical advantages of the Bayesian t-test have become increasingly apparent and various Bayesian t-tests have been proposed, both objective ones (based on general desiderata) and subjective ones (based on expert knowledge). Here we propose a flexible t-prior for standardized effect size that allows computation of the Bayes factor by evaluating a single numerical integral. This specification contains previous objective and subjective t-test Bayes factors as special cases. Furthermore, we propose two measures for informed prior distributions that quantify the departure from the objective Bayes factor desiderata of predictive matching and information consistency. We illustrate the use of informed prior distributions based on an expert prior elicitation effort.
Submitted 14 December, 2018; v1 submitted 8 April, 2017;
originally announced April 2017.
-
A Tutorial on Bridge Sampling
Authors:
Quentin F. Gronau,
Alexandra Sarafoglou,
Dora Matzke,
Alexander Ly,
Udo Boehm,
Maarten Marsman,
David S. Leslie,
Jonathan J. Forster,
Eric-Jan Wagenmakers,
Helen Steingroever
Abstract:
The marginal likelihood plays an important role in many areas of Bayesian statistics such as parameter estimation, model comparison, and model averaging. In most applications, however, the marginal likelihood is not analytically tractable and must be approximated using numerical methods. Here we provide a tutorial on bridge sampling (Bennett, 1976; Meng & Wong, 1996), a reliable and relatively straightforward sampling method that allows researchers to obtain the marginal likelihood for models of varying complexity. First, we introduce bridge sampling and three related sampling methods using the beta-binomial model as a running example. We then apply bridge sampling to estimate the marginal likelihood for the Expectancy Valence (EV) model---a popular model for reinforcement learning. Our results indicate that bridge sampling provides accurate estimates for both a single participant and a hierarchical version of the EV model. We conclude that bridge sampling is an attractive method for mathematical psychologists who typically aim to approximate the marginal likelihood for a limited set of possibly high-dimensional models.
Submitted 11 October, 2017; v1 submitted 17 March, 2017;
originally announced March 2017.
-
Nanoscale Seebeck effect at hot metal nanostructures
Authors:
Aboubakry Ly,
Arghya Majee,
Alois Würger
Abstract:
We theoretically study the electrolyte Seebeck effect in the vicinity of a heated metal nanostructure, such as the cap of an active Janus colloid in an electrolyte, or gold-coated interfaces in optofluidic devices. The thermocharge accumulated at the surface varies with the local temperature, thus modulating the diffuse part of the electric double layer. On a conducting surface with non-uniform temperature, the isopotential condition imposes a significant polarization charge within the metal. Surprisingly, this does not affect the slip velocity, which takes the same value on insulating and conducting surfaces. Our results for specific-ion effects agree qualitatively with recent observations for Janus colloids in different electrolyte solutions. Comparing the thermal, hydrodynamic, and ion diffusion time scales, we expect a rich transient behavior at the onset of thermally powered swimming, extending to microseconds after switching on the heating.
Submitted 26 February, 2018; v1 submitted 9 March, 2017;
originally announced March 2017.
-
Bayesian Estimation of Kendall's tau Using a Latent Normal Approach
Authors:
Johnny van Doorn,
Alexander Ly,
Maarten Marsman,
Eric-Jan Wagenmakers
Abstract:
The rank-based association between two variables can be modeled by introducing a latent normal level to ordinal data. We demonstrate how this approach yields Bayesian inference for Kendall's rank correlation coefficient, improving on a recent Bayesian solution from asymptotic properties of the test statistic.
Submitted 24 May, 2018; v1 submitted 6 March, 2017;
originally announced March 2017.
-
30-Hz relative linewidth watt output power 1.65-$μ$m continuous-wave singly resonant optical parametric oscillator
Authors:
Aliou Ly,
Christophe Siour,
Fabien Bretenaker
Abstract:
We built a 1-watt cw singly resonant optical parametric oscillator operating at an idler wavelength of 1.65~$μ$m for application to quantum interfaces. The non resonant idler is frequency stabilized by side-fringe locking on a relatively high-finesse Fabry-Perot cavity, and the influence of intensity noise is carefully analyzed. A relative linewidth down to the sub-kHz level (about 30 Hz over 2 s) is achieved. A very good long term stability is obtained for both frequency and intensity.
Submitted 12 April, 2017; v1 submitted 29 November, 2016;
originally announced November 2016.
-
Analytic Posteriors for Pearson's Correlation Coefficient
Authors:
Alexander Ly,
Maarten Marsman,
Eric-Jan Wagenmakers
Abstract:
Pearson's correlation is one of the most common measures of linear dependence. Recently, Bernardo (2015) introduced a flexible class of priors to study this measure in a Bayesian setting. For this large class of priors we show that the (marginal) posterior for Pearson's correlation coefficient and all of the posterior moments are analytic. Our results are available in the open-source software package JASP.
Submitted 28 April, 2017; v1 submitted 5 October, 2015;
originally announced October 2015.
-
Frequency stabilization of the non resonant wave of a continuous-wave singly resonant optical parametric oscillator
Authors:
Aliou Ly,
Benjamin Szymanski,
Fabien Bretenaker
Abstract:
We present an experimental technique that stabilizes the frequency of the non resonant wave in a singly resonant optical parametric oscillator (SRO) down to the kHz level, much below the pump frequency noise level. By comparing the frequency of the non resonant wave with a reference cavity, the pump frequency noise is imposed on the frequency of the resonant wave, and is thus subtracted from the frequency of the non resonant wave. This permits the non resonant wave obtained from such an SRO to be simultaneously powerful and frequency stable, which is usually impossible to obtain when the resonant wave frequency is stabilized.
Submitted 20 March, 2015;
originally announced March 2015.