-
Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models
Authors:
Raeid Saqur,
Anastasis Kratsios,
Florian Krach,
Yannick Limmer,
Jacob-Junqi Tian,
John Willes,
Blanka Horvath,
Frank Rudzicz
Abstract:
We propose MoE-F -- a formalised mechanism for combining $N$ pre-trained expert Large Language Models (LLMs) in online time-series prediction tasks by adaptively forecasting the best weighting of LLM predictions at every time step. Our mechanism leverages the conditional information in each expert's running performance to forecast the best combination of LLMs for predicting the time series in its…
▽ More
We propose MoE-F -- a formalised mechanism for combining $N$ pre-trained expert Large Language Models (LLMs) in online time-series prediction tasks by adaptively forecasting the best weighting of LLM predictions at every time step. Our mechanism leverages the conditional information in each expert's running performance to forecast the best combination of LLMs for predicting the time series in its next step. Diverging from static (learned) Mixture of Experts (MoE) methods, MoE-F employs time-adaptive stochastic filtering techniques to combine experts. By framing the expert selection problem as a finite state-space, continuous-time Hidden Markov model (HMM), we can leverage the Wohman-Shiryaev filter. Our approach first constructs $N$ parallel filters corresponding to each of the $N$ individual LLMs. Each filter proposes its best combination of LLMs, given the information that they have access to. Subsequently, the $N$ filter outputs are aggregated to optimize a lower bound for the loss of the aggregated LLMs, which can be optimized in closed-form, thus generating our ensemble predictor. Our contributions here are: (I) the MoE-F algorithm -- deployable as a plug-and-play filtering harness, (II) theoretical optimality guarantees of the proposed filtering-based gating algorithm, and (III) empirical evaluation and ablative results using state of the art foundational and MoE LLMs on a real-world Financial Market Movement task where MoE-F attains a remarkable 17% absolute and 48.5% relative F1 measure improvement over the next best performing individual LLM expert.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Reality Only Happens Once: Single-Path Generalization Bounds for Transformers
Authors:
Yannick Limmer,
Anastasis Kratsios,
Xuwei Yang,
Raeid Saqur,
Blanka Horvath
Abstract:
One of the inherent challenges in deploying transformers on time series is that \emph{reality only happens once}; namely, one typically only has access to a single trajectory of the data-generating process comprised of non-i.i.d. observations. We derive non-asymptotic statistical guarantees in this setting through bounds on the \textit{generalization} of a transformer network at a future-time $t$,…
▽ More
One of the inherent challenges in deploying transformers on time series is that \emph{reality only happens once}; namely, one typically only has access to a single trajectory of the data-generating process comprised of non-i.i.d. observations. We derive non-asymptotic statistical guarantees in this setting through bounds on the \textit{generalization} of a transformer network at a future-time $t$, given that it has been trained using $N\le t$ observations from a single perturbed trajectory of a Markov process. Under the assumption that the Markov process satisfies a log-Sobolev inequality, we obtain a generalization bound which effectively converges at the rate of ${O}(1/\sqrt{N})$. Our bound depends explicitly on the activation function ($\operatorname{Swish}$, $\operatorname{GeLU}$, or $\tanh$ are considered), the number of self-attention heads, depth, width, and norm-bounds defining the transformer architecture. Our bound consists of three components: (I) The first quantifies the gap between the stationary distribution of the data-generating Markov process and its distribution at time $t$, this term converges exponentially to $0$. (II) The next term encodes the complexity of the transformer model and, given enough time, eventually converges to $0$ at the rate ${O}(\log(N)^r/\sqrt{N})$ for any $r>0$. (III) The third term guarantees that the bound holds with probability at least $1$-$δ$, and converges at a rate of ${O}(\sqrt{\log(1/δ)}/\sqrt{N})$.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Improving Kernel-Based Nonasymptotic Simultaneous Confidence Bands
Authors:
Balázs Csanád Csáji,
Bálint Horváth
Abstract:
The paper studies the problem of constructing nonparametric simultaneous confidence bands with nonasymptotic and distribition-free guarantees. The target function is assumed to be band-limited and the approach is based on the theory of Paley-Wiener reproducing kernel Hilbert spaces. The starting point of the paper is a recently developed algorithm to which we propose three types of improvements. F…
▽ More
The paper studies the problem of constructing nonparametric simultaneous confidence bands with nonasymptotic and distribition-free guarantees. The target function is assumed to be band-limited and the approach is based on the theory of Paley-Wiener reproducing kernel Hilbert spaces. The starting point of the paper is a recently developed algorithm to which we propose three types of improvements. First, we relax the assumptions on the noises by replacing the symmetricity assumption with a weaker distributional invariance principle. Then, we propose a more efficient way to estimate the norm of the target function, and finally we enhance the construction of the confidence bands by tightening the constraints of the underlying convex optimization problems. The refinements are also illustrated through numerical experiments.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
Signature Trading: A Path-Dependent Extension of the Mean-Variance Framework with Exogenous Signals
Authors:
Owen Futter,
Blanka Horvath,
Magnus Wiese
Abstract:
In this article we introduce a portfolio optimisation framework, in which the use of rough path signatures (Lyons, 1998) provides a novel method of incorporating path-dependencies in the joint signal-asset dynamics, naturally extending traditional factor models, while kee** the resulting formulas lightweight and easily interpretable. We achieve this by representing a trading strategy as a linear…
▽ More
In this article we introduce a portfolio optimisation framework, in which the use of rough path signatures (Lyons, 1998) provides a novel method of incorporating path-dependencies in the joint signal-asset dynamics, naturally extending traditional factor models, while kee** the resulting formulas lightweight and easily interpretable. We achieve this by representing a trading strategy as a linear functional applied to the signature of a path (which we refer to as "Signature Trading" or "Sig-Trading"). This allows the modeller to efficiently encode the evolution of past time-series observations into the optimisation problem. In particular, we derive a concise formulation of the dynamic mean-variance criterion alongside an explicit solution in our setting, which naturally incorporates a drawdown control in the optimal strategy over a finite time horizon. Secondly, we draw parallels between classical portfolio stategies and Sig-Trading strategies and explain how the latter leads to a pathwise extension of the classical setting via the "Signature Efficient Frontier". Finally, we give examples when trading under an exogenous signal as well as examples for momentum and pair-trading strategies, demonstrated both on synthetic and market data. Our framework combines the best of both worlds between classical theory (whose appeal lies in clear and concise formulae) and between modern, flexible data-driven methods that can handle more realistic datasets. The advantage of the added flexibility of the latter is that one can bypass common issues such as the accumulation of heteroskedastic and asymmetric residuals during the optimisation phase. Overall, Sig-Trading combines the flexibility of data-driven methods without compromising on the clarity of the classical theory and our presented results provide a compelling toolbox that yields superior results for a large class of trading strategies.
△ Less
Submitted 30 August, 2023; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Experimental validation of particle-in-cell/Monte Carlo collisions simulations in low-pressure neon capacitively coupled plasmas
Authors:
Chan-Won Park,
Benedek Horváth,
Aranka Derzsi,
Julian Schulze,
J. H. Kim,
Zoltán Donkó,
Hyo-Chang Lee
Abstract:
Plasma simulations are powerful tools for understanding fundamental plasma science phenomena and for process optimization in applications. To ensure their quantitative accuracy, they must be validated against experiments. In this work, such an experimental validation is performed for a 1d3v particle-in-cell simulation complemented with the Monte Carlo treatment of collision processes of a capaciti…
▽ More
Plasma simulations are powerful tools for understanding fundamental plasma science phenomena and for process optimization in applications. To ensure their quantitative accuracy, they must be validated against experiments. In this work, such an experimental validation is performed for a 1d3v particle-in-cell simulation complemented with the Monte Carlo treatment of collision processes of a capacitively coupled radio frequency plasma driven at 13.56 MHz and operated in neon gas. In a geometrically symmetric reactor the electron density in the discharge center and the spatio-temporal distribution of the electron impact excitation rate from the ground into the Ne 2p$_1$ state are measured by a microwave cutoff probe and phase resolved optical emission spectroscopy, respectively. The measurements are conducted for electrode gaps between 50 mm and 90 mm, neutral gas pressures between 20 mTorr and 50 mTorr, and peak-to-peak values of the driving voltage waveform between 250 V and 650 V. Simulations are performed under identical discharge conditions. In the simulations, various combinations of surface coefficients characterising the interactions of electrons and heavy particles with the anodized aluminium electrode surfaces are adopted. We find, that the simulations using a constant effective heavy particle induced secondary electron emission coefficient of 0.3 and a realistic electron-surface interaction model (which considers energy-dependent and material specific elastic and inelastic electron reflection, as well as the emission of true secondary electrons from the surface) yield results which are in good quantitative agreement with the experimental data.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
Frequency-dependent electron power absorption mode transitions in capacitively coupled argon-oxygen plasmas
Authors:
Aranka Derzsi,
Mate Vass,
Ranna Masheyeva,
Benedek Horvath,
Zoltan Donko,
Peter Hartmann
Abstract:
Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collision (PIC/MCC) simulations are performed to investigate the excitation dynamics in low-pressure capacitively coupled plasmas (CCPs) in argon-oxygen mixtures. The system used for this study is a geometrically symmetric CCP reactor operated in a fixed mixture gas composition, at fix…
▽ More
Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collision (PIC/MCC) simulations are performed to investigate the excitation dynamics in low-pressure capacitively coupled plasmas (CCPs) in argon-oxygen mixtures. The system used for this study is a geometrically symmetric CCP reactor operated in a fixed mixture gas composition, at fixed pressure and voltage amplitude, with a wide range of driving RF frequencies (2$~$MHz$~\le f \le~15~$MHz). The measured and calculated spatio-temporal distributions of the electron impact excitation rates from the Ar ground state to the Ar$~\rm{2p_1}$ state (with a wavelength of 750.4~nm) show good qualitative agreement. The distributions show significant frequency dependence, which is generally considered to be predictive of transitions in the dominant discharge operating mode. Three frequency ranges can be distinguished, showing distinctly different excitation characteristics: (i) in the low frequency range ($f \le~3~$MHz), excitation is strong at the sheaths and weak in the bulk region; (ii) at intermediate frequencies (3.5$~$MHz$~\le f \le~5~$MHz), the excitation rate in the bulk region is enhanced and shows striation formation; (iii) above 6$~$MHz, excitation in the bulk gradually decreases with increasing frequency. Boltzmann term analysis was performed to quantify the frequency dependent contributions of the Ohmic and ambipolar terms to the electron power absorption.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
Robust Hedging GANs
Authors:
Yannick Limmer,
Blanka Horvath
Abstract:
The availability of deep hedging has opened new horizons for solving hedging problems under a large variety of realistic market conditions. At the same time, any model - be it a traditional stochastic model or a market generator - is at best an approximation of market reality, prone to model-misspecification and estimation errors. This raises the question, how to furnish a modelling setup with too…
▽ More
The availability of deep hedging has opened new horizons for solving hedging problems under a large variety of realistic market conditions. At the same time, any model - be it a traditional stochastic model or a market generator - is at best an approximation of market reality, prone to model-misspecification and estimation errors. This raises the question, how to furnish a modelling setup with tools that can address the risk of discrepancy between anticipated distribution and market reality, in an automated way. Automated robustification is currently attracting increased attention in numerous investment problems, but it is a delicate task due to its imminent implications on risk management. Hence, it is beyond doubt that more activity can be anticipated on this topic to converge towards a consensus on best practices.
This paper presents a natural extension of the original deep hedging framework to address uncertainty in the data generating process via an adversarial approach inspired by GANs to automate robustification in our hedging objective. This is achieved through an interplay of three modular components: (i) a (deep) hedging engine, (ii) a data-generating process (that is model agnostic permitting a large variety of classical models as well as machine learning-based market generators), and (iii) a notion of distance on model space to measure deviations between our market prognosis and reality. We do not restrict the ambiguity set to a region around a reference model, but instead penalize deviations from the anticipated distribution. Our suggested choice for each component is motivated by model agnosticism, allowing a seamless transition between settings. Since all individual components are already used in practice, we believe that our framework is easily adaptable to existing functional settings.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Non-parametric online market regime detection and regime clustering for multidimensional and path-dependent data structures
Authors:
Zacharia Issa,
Blanka Horvath
Abstract:
In this work we present a non-parametric online market regime detection method for multidimensional data structures using a path-wise two-sample test derived from a maximum mean discrepancy-based similarity metric on path space that uses rough path signatures as a feature map. The latter similarity metric has been developed and applied as a discriminator in recent generative models for small data…
▽ More
In this work we present a non-parametric online market regime detection method for multidimensional data structures using a path-wise two-sample test derived from a maximum mean discrepancy-based similarity metric on path space that uses rough path signatures as a feature map. The latter similarity metric has been developed and applied as a discriminator in recent generative models for small data environments, and has been optimised here to the setting where the size of new incoming data is particularly small, for faster reactivity.
On the same principles, we also present a path-wise method for regime clustering which extends our previous work. The presented regime clustering techniques were designed as ex-ante market analysis tools that can identify periods of approximatively similar market activity, but the new results also apply to path-wise, high dimensional-, and to non-Markovian settings as well as to data structures that exhibit autocorrelation.
We demonstrate our clustering tools on easily verifiable synthetic datasets of increasing complexity, and also show how the outlined regime detection techniques can be used as fast on-line automatic regime change detectors or as outlier detection tools, including a fully automated pipeline. Finally, we apply the fine-tuned algorithms to real-world historical data including high-dimensional baskets of equities and the recent price evolution of crypto assets, and we show that our methodology swiftly and accurately indicated historical periods of market turmoil.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
The Scientific Investigation of Unidentified Aerial Phenomena (UAP) Using Multimodal Ground-Based Observatories
Authors:
Wesley Andrés Watters,
Abraham Loeb,
Frank Laukien,
Richard Cloete,
Alex Delacroix,
Sergei Dobroshinsky,
Benjamin Horvath,
Ezra Kelderman,
Sarah Little,
Eric Masson,
Andrew Mead,
Mitch Randall,
Forrest Schultz,
Matthew Szenher,
Foteini Vervelidou,
Abigail White,
Angelique Ahlström,
Carol Cleland,
Spencer Dockal,
Natasha Donahue,
Mark Elowitz,
Carson Ezell,
Alex Gersznowicz,
Nicholas Gold,
Michael G. Hercz
, et al. (13 additional authors not shown)
Abstract:
(Abridged) Unidentified Aerial Phenomena (UAP) have resisted explanation and have received little formal scientific attention for 75 years. A primary objective of the Galileo Project is to build an integrated software and instrumentation system designed to conduct a multimodal census of aerial phenomena and to recognize anomalies. Here we present key motivations for the study of UAP and address hi…
▽ More
(Abridged) Unidentified Aerial Phenomena (UAP) have resisted explanation and have received little formal scientific attention for 75 years. A primary objective of the Galileo Project is to build an integrated software and instrumentation system designed to conduct a multimodal census of aerial phenomena and to recognize anomalies. Here we present key motivations for the study of UAP and address historical objections to this research. We describe an approach for highlighting outlier events in the high-dimensional parameter space of our census measurements. We provide a detailed roadmap for deciding measurement requirements, as well as a science traceability matrix (STM) for connecting sought-after physical parameters to observables and instrument requirements. We also discuss potential strategies for deciding where to locate instruments for development, testing, and final deployment. Our instrument package is multimodal and multispectral, consisting of (1) wide-field cameras in multiple bands for targeting and tracking of aerial objects and deriving their positions and kinematics using triangulation; (2) narrow-field instruments including cameras for characterizing morphology, spectra, polarimetry, and photometry; (3) passive multistatic arrays of antennas and receivers for radar-derived range and kinematics; (4) radio spectrum analyzers to measure radio and microwave emissions; (5) microphones for sampling acoustic emissions in the infrasonic through ultrasonic frequency bands; and (6) environmental sensors for characterizing ambient conditions (temperature, pressure, humidity, and wind velocity), as well as quasistatic electric and magnetic fields, and energetic particles. The use of multispectral instruments and multiple sensor modalities will help to ensure that artifacts are recognized and that true detections are corroborated and verifiable.
△ Less
Submitted 31 May, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
-
Non-adversarial training of Neural SDEs with signature kernel scores
Authors:
Zacharia Issa,
Blanka Horvath,
Maud Lemercier,
Cristopher Salvi
Abstract:
Neural SDEs are continuous-time generative models for sequential data. State-of-the-art performance for irregular time series generation has been previously obtained by training these models adversarially as GANs. However, as typical for GAN architectures, training is notoriously unstable, often suffers from mode collapse, and requires specialised techniques such as weight clip** and gradient pe…
▽ More
Neural SDEs are continuous-time generative models for sequential data. State-of-the-art performance for irregular time series generation has been previously obtained by training these models adversarially as GANs. However, as typical for GAN architectures, training is notoriously unstable, often suffers from mode collapse, and requires specialised techniques such as weight clip** and gradient penalty to mitigate these issues. In this paper, we introduce a novel class of scoring rules on pathspace based on signature kernels and use them as objective for training Neural SDEs non-adversarially. By showing strict properness of such kernel scores and consistency of the corresponding estimators, we provide existence and uniqueness guarantees for the minimiser. With this formulation, evaluating the generator-discriminator pair amounts to solving a system of linear path-dependent PDEs which allows for memory-efficient adjoint-based backpropagation. Moreover, because the proposed kernel scores are well-defined for paths with values in infinite dimensional spaces of functions, our framework can be easily extended to generate spatiotemporal data. Our procedure permits conditioning on a rich variety of market conditions and significantly outperforms alternative ways of training Neural SDEs on a variety of tasks including the simulation of rough volatility models, the conditional probabilistic forecasts of real-world forex pairs where the conditioning variable is an observed past trajectory, and the mesh-free generation of limit order book dynamics.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Optimal Stop** via Distribution Regression: a Higher Rank Signature Approach
Authors:
Blanka Horvath,
Maud Lemercier,
Chong Liu,
Terry Lyons,
Cristopher Salvi
Abstract:
Distribution Regression on path-space refers to the task of learning functions map** the law of a stochastic process to a scalar target. The learning procedure based on the notion of path-signature, i.e. a classical transform from rough path theory, was widely used to approximate weakly continuous functionals, such as the pricing functionals of path--dependent options' payoffs. However, this app…
▽ More
Distribution Regression on path-space refers to the task of learning functions map** the law of a stochastic process to a scalar target. The learning procedure based on the notion of path-signature, i.e. a classical transform from rough path theory, was widely used to approximate weakly continuous functionals, such as the pricing functionals of path--dependent options' payoffs. However, this approach fails for Optimal Stop** Problems arising from mathematical finance, such as the pricing of American options, because the corresponding value functions are in general discontinuous with respect to the weak topology. In this paper we develop a rigorous mathematical framework to resolve this issue by recasting an Optimal Stop** Problem as a higher order kernel mean embedding regression based on the notions of higher rank signatures of measure--valued paths and adapted topologies. The core computational component of our algorithm consists in solving a family of two--dimensional hyperbolic PDEs.
△ Less
Submitted 3 April, 2023;
originally announced April 2023.
-
Nonlocal dynamics of secondary electrons in capacitively coupled radio frequency discharges
Authors:
Katharina Noesges,
Maximilian Klich,
Aranka Derzsi,
Benedek Horváth,
Julian Schulze,
Ralf Peter Brinkmann,
Thomas Mussenbrock,
Sebastian Wilczek
Abstract:
In capacitively coupled radio frequency (CCRF) discharges, the interaction of the plasma and the surface boundaries is linked to a variety of highly relevant phenomena for technological processes. One possible plasma-surface interaction is the generation of secondary electrons (SEs), which significantly influence the discharge when accelerated in the sheath electric field. However, SEs, in particu…
▽ More
In capacitively coupled radio frequency (CCRF) discharges, the interaction of the plasma and the surface boundaries is linked to a variety of highly relevant phenomena for technological processes. One possible plasma-surface interaction is the generation of secondary electrons (SEs), which significantly influence the discharge when accelerated in the sheath electric field. However, SEs, in particular electron-induced SEs ($\updelta$-electrons), are frequently neglected in theory and simulations. Due to the relatively high threshold energy for the effective generation of $\updelta$-electrons at surfaces, their dynamics are closely connected and entangled with the dynamics of the ion-induced SEs ($\upgamma$-electrons). Thus, a fundamental understanding of the electron dynamics has to be achieved on a nanosecond timescale, and the effects of the different electron groups have to be segregated. This work utilizes $1d3v$ Particle-in-Cell/Monte Carlo Collisions (PIC/MCC) simulations of a symmetric discharge in the low-pressure regime ($p\,=\, 1\,\rm{Pa}$) with the inclusion of realistic electron-surface interactions for silicon dioxide. A diagnostic framework is introduced that segregates the electrons into three groups ("bulk-electrons", "$\upgamma$-electrons", and "$\updelta$-electrons") in order to analyze and discuss their dynamics. A variation of the electrode gap size $L_\mathrm{gap}$ is then presented as a control tool to alter the dynamics of the discharge significantly. It is demonstrated that this control results in two different regimes of low and high plasma density, respectively. The fundamental electron dynamics of both regimes are explained, which requires a complete analysis starting at global parameters (e.g., densities) down to single electron trajectories.
△ Less
Submitted 27 March, 2023;
originally announced March 2023.
-
Kernels of operators on certain Banach spaces associated with almost disjoint families
Authors:
Bence Horváth,
Niels Jakob Laustsen
Abstract:
Given an infinite set $Γ$ and an almost disjoint family $\mathcal{A}$ on $Γ$, let $Y_{\mathcal{A}}$ denote the closed subspace of $\ell_\infty(Γ)$ spanned by the indicator functions $1_{\bigcap_{j=1}^n A_j}$ for $n\in\mathbb{N}$ and $A_1,\ldots,A_n\in\mathcal{A}$. We show that if $\mathcal{A}$ has cardinality greater than $Γ$, then $Y_{\mathcal{A}}$ contains closed subspaces which cannot be realis…
▽ More
Given an infinite set $Γ$ and an almost disjoint family $\mathcal{A}$ on $Γ$, let $Y_{\mathcal{A}}$ denote the closed subspace of $\ell_\infty(Γ)$ spanned by the indicator functions $1_{\bigcap_{j=1}^n A_j}$ for $n\in\mathbb{N}$ and $A_1,\ldots,A_n\in\mathcal{A}$. We show that if $\mathcal{A}$ has cardinality greater than $Γ$, then $Y_{\mathcal{A}}$ contains closed subspaces which cannot be realised as the kernels of any bounded operators $Y_{\mathcal{A}}\rightarrow \ell_{\infty}(Γ)$. Consequently the spaces $\ell_{\infty}(Γ)$, for any infinite set $Γ$, and $C_0(K_{\mathcal{A}})$, where $\mathcal{A}$ is an uncountable almost disjoint family on $\mathbb{N}$ and $K_{\mathcal{A}}$ the locally compact Mrówka space associated with $\mathcal{A}$, contain closed subspaces which do not arise as the kernels of any bounded operators on them.
△ Less
Submitted 23 November, 2022;
originally announced November 2022.
-
Nonparametric, Nonasymptotic Confidence Bands with Paley-Wiener Kernels for Band-Limited Functions
Authors:
Balázs Csanád Csáji,
Bálint Horváth
Abstract:
The paper introduces a method to construct confidence bands for bounded, band-limited functions based on a finite sample of input-output pairs. The approach is distribution-free w.r.t. the observation noises and only the knowledge of the input distribution is assumed. It is nonparametric, that is, it does not require a parametric model of the regression function and the regions have non-asymptotic…
▽ More
The paper introduces a method to construct confidence bands for bounded, band-limited functions based on a finite sample of input-output pairs. The approach is distribution-free w.r.t. the observation noises and only the knowledge of the input distribution is assumed. It is nonparametric, that is, it does not require a parametric model of the regression function and the regions have non-asymptotic guarantees. The algorithm is based on the theory of Paley-Wiener reproducing kernel Hilbert spaces. The paper first studies the fully observable variant, when there are no noises on the observations and only the inputs are random; then it generalizes the ideas to the noisy case using gradient-perturbation methods. Finally, numerical experiments demonstrating both cases are presented.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Electron power absorption in capacitively coupled neon-oxygen plasmas: a comparison of experimental and computational results
Authors:
A. Derzsi,
P. Hartmann,
M. Vass,
B. Horváth,
M. Gyulai,
I. Korolov,
J. Schulze,
Z. Donkó
Abstract:
Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collisions (PIC/MCC) simulations are used to study the electron power absorption and excitation/ionization dynamics in capacitively coupled plasmas (CCPs) in mixtures of neon and oxygen gases. The study is performed for a geometrically symmetric CCP reactor with a gap length of 2.5 cm…
▽ More
Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collisions (PIC/MCC) simulations are used to study the electron power absorption and excitation/ionization dynamics in capacitively coupled plasmas (CCPs) in mixtures of neon and oxygen gases. The study is performed for a geometrically symmetric CCP reactor with a gap length of 2.5 cm at a driving frequency of 10~MHz and a peak-to-peak voltage of 350 V. The pressure of the gas mixture is varied between 15 Pa and 500 Pa, while the neon/oxygen concentration is tuned between 10% and 90%. For all discharge conditions, the spatio-temporal distribution of the electron-impact excitation rate from the Ne ground state into the Ne $\rm{2p^53p_0}$ state measured by PROES and obtained from PIC/MCC simulations show good qualitative agreement. Based on the emission/excitation patterns, multiple operation regimes are identified. Localized bright emission features at the bulk boundaries, caused by local maxima in the electronegativity are found at high pressures and high O$_2$ concentrations. The relative contributions of the ambipolar and the Ohmic electron power absorption are found to vary strongly with the discharge parameters: the Ohmic power absorption is enhanced by both the high collisionality at high pressures and the high electronegativity at low pressures. In the wide parameter regime covered in this study, the PROES measurements are found to accurately represent the ionization dynamics, i.e., the discharge operation mode. This work represents also a successful experimental validation of the discharge model developed for neon-oxygen CCPs.
△ Less
Submitted 13 May, 2022;
originally announced May 2022.
-
Clustering Market Regimes using the Wasserstein Distance
Authors:
Blanka Horvath,
Zacharia Issa,
Aitor Muguruza
Abstract:
The problem of rapid and automated detection of distinct market regimes is a topic of great interest to financial mathematicians and practitioners alike. In this paper, we outline an unsupervised learning algorithm for clustering financial time-series into a suitable number of temporal segments (market regimes). As a special case of the above, we develop a robust algorithm that automates the proce…
▽ More
The problem of rapid and automated detection of distinct market regimes is a topic of great interest to financial mathematicians and practitioners alike. In this paper, we outline an unsupervised learning algorithm for clustering financial time-series into a suitable number of temporal segments (market regimes). As a special case of the above, we develop a robust algorithm that automates the process of classifying market regimes. The method is robust in the sense that it does not depend on modelling assumptions of the underlying time series as our experiments with real datasets show. This method -- dubbed the Wasserstein $k$-means algorithm -- frames such a problem as one on the space of probability measures with finite $p^\text{th}$ moment, in terms of the $p$-Wasserstein distance between (empirical) distributions. We compare our WK-means approach with a more traditional clustering algorithms by studying the so-called maximum mean discrepancy scores between, and within clusters. In both cases it is shown that the WK-means algorithm vastly outperforms all considered competitor approaches. We demonstrate the performance of all approaches both in a controlled environment on synthetic data, and on real data.
△ Less
Submitted 22 October, 2021;
originally announced October 2021.
-
Approximately multiplicative maps between algebras of bounded operators on Banach spaces
Authors:
Yemon Choi,
Bence Horváth,
Niels Jakob Laustsen
Abstract:
We show that for any separable reflexive Banach space $X$ and a large class of Banach spaces $E$, including those with a subsymmetric shrinking basis but also all spaces $L_p$ for $1\leq p \leq \infty$, every bounded linear map ${\mathcal B}(E)\to {\mathcal B}(X)$ which is approximately multiplicative is necessarily close in the operator norm to some bounded homomorphism…
▽ More
We show that for any separable reflexive Banach space $X$ and a large class of Banach spaces $E$, including those with a subsymmetric shrinking basis but also all spaces $L_p$ for $1\leq p \leq \infty$, every bounded linear map ${\mathcal B}(E)\to {\mathcal B}(X)$ which is approximately multiplicative is necessarily close in the operator norm to some bounded homomorphism ${\mathcal B}(E)\to {\mathcal B}(X)$. That is, the pair $({\mathcal B}(E), {\mathcal B}(X))$ has the AMNM property in the sense of Johnson (\textit{J.~London Math.\ Soc.} 1988). Previously this was only known for $E=X=\ell_p$ with $1<p<\infty$; even for those cases, we improve on the previous methods and obtain better constants in various estimates. A crucial role in our approach is played by a new result, motivated by cohomological techniques, which establishes AMNM properties relative to an amenable subalgebra; this generalizes a theorem of Johnson (\textit{op cit.}).
△ Less
Submitted 5 March, 2022; v1 submitted 8 October, 2021;
originally announced October 2021.
-
Hedging under rough volatility
Authors:
Masaaki Fukasawa,
Blanka Horvath,
Peter Tankov
Abstract:
In this chapter we first briefly review the existing approaches to hedging in rough volatility models. Next, we present a simple but general result which shows that in a one-factor rough stochastic volatility model, any option may be perfectly hedged with a dynamic portfolio containing the underlying and one other asset such as a variance swap. In the final section we report the results of a back-…
▽ More
In this chapter we first briefly review the existing approaches to hedging in rough volatility models. Next, we present a simple but general result which shows that in a one-factor rough stochastic volatility model, any option may be perfectly hedged with a dynamic portfolio containing the underlying and one other asset such as a variance swap. In the final section we report the results of a back-test experiment using real data, where VIX options are hedged with a forward variance swap. In this experiment, using a rough volatility model allows to almost completely remove the bias and reduce the overall hedging error by a factor of 27% compared to traditional diffusion-based models.
△ Less
Submitted 9 May, 2021;
originally announced May 2021.
-
A purely infinite Cuntz-like Banach $*$-algebra with no purely infinite ultrapowers
Authors:
Matthew Daws,
Bence Horváth
Abstract:
We continue our investigation, from \cite{dh}, of the ring-theoretic infiniteness properties of ultrapowers of Banach algebras, studying in this paper the notion of being purely infinite. It is well known that a $C^*$-algebra is purely infinite if and only if any of its ultrapowers are. We find examples of Banach algebras, as algebras of operators on Banach spaces, which do have purely infinite ul…
▽ More
We continue our investigation, from \cite{dh}, of the ring-theoretic infiniteness properties of ultrapowers of Banach algebras, studying in this paper the notion of being purely infinite. It is well known that a $C^*$-algebra is purely infinite if and only if any of its ultrapowers are. We find examples of Banach algebras, as algebras of operators on Banach spaces, which do have purely infinite ultrapowers. Our main contribution is the construction of a "Cuntz-like" Banach $*$-algebra which is purely infinite, but whose ultrapowers are not even simple, and hence not purely infinite. This algebra is a naturally occurring analogue of the Cuntz algebra, and of the $L^p$-analogues introduced by Phillips. However, our proof of being purely infinite is combinatorial, but direct, and so differs from existing proofs. We show that there are non-zero traces on our algebra, which in particular implies that our algebra is not isomorphic to any of the $L^p$-analogues of the Cuntz algebra.
△ Less
Submitted 25 March, 2022; v1 submitted 30 April, 2021;
originally announced April 2021.
-
eduPIC: an introductory particle based code for radio-frequency plasma simulation
Authors:
Zoltan Donko,
Aranka Derzsi,
Mate Vass,
Benedek Horvath,
Sebastian Wilczek,
Botond Hartmann,
Peter Hartmann
Abstract:
For the self-consistent description of various plasma sources operated in the low-pressure (nonlocal, kinetic) regime, the Particle-In-Cell simulation approach, combined with the Monte Carlo treatment of collision processes (PIC/MCC), has become an important tool during the past decades. PIC/MCC simulation codes have been developed and maintained by many research groups, some of these codes are av…
▽ More
For the self-consistent description of various plasma sources operated in the low-pressure (nonlocal, kinetic) regime, the Particle-In-Cell simulation approach, combined with the Monte Carlo treatment of collision processes (PIC/MCC), has become an important tool during the past decades. PIC/MCC simulation codes have been developed and maintained by many research groups, some of these codes are available to the community as freeware resources. While this computational approach has already been present for a number of decades, the rapid evolution of the computing infrastructure makes it increasingly more popular and accessible, as simulations of simple systems can be executed now on personal computers or laptops. During the past few years we have experienced an increasing interest in lectures and courses dealing with the basics of particle simulations, including the PIC/MCC technique. In a response to this, this paper (i) provides a tutorial on the physical basis and the algorithms of the PIC/MCC technique and (ii) presents a basic (spatially one-dimensional) electrostatic PIC/MCC simulation code for Capacitively Coupled Plasmas, whose source is made freely available in various programming languages. We share the code in C/C++ versions, as well as in a version written in Rust, which is a rapidly emerging computational language. Our code intends to be a "starting tool" for those who are interested in learning the details of the PIC/MCC technique and would like to develop the "skeleton" code further, for their research purposes.
△ Less
Submitted 17 March, 2021;
originally announced March 2021.
-
Deep Hedging under Rough Volatility
Authors:
Blanka Horvath,
Josef Teichmann,
Zan Zuric
Abstract:
We investigate the performance of the Deep Hedging framework under training paths beyond the (finite dimensional) Markovian setup. In particular we analyse the hedging performance of the original architecture under rough volatility models with view to existing theoretical results for those. Furthermore, we suggest parsimonious but suitable network architectures capable of capturing the non-Markovi…
▽ More
We investigate the performance of the Deep Hedging framework under training paths beyond the (finite dimensional) Markovian setup. In particular we analyse the hedging performance of the original architecture under rough volatility models with view to existing theoretical results for those. Furthermore, we suggest parsimonious but suitable network architectures capable of capturing the non-Markoviantity of time-series. Secondly, we analyse the hedging behaviour in these models in terms of P\&L distributions and draw comparisons to jump diffusion models if the the rebalancing frequency is realistically small.
△ Less
Submitted 3 February, 2021;
originally announced February 2021.
-
Unital Banach algebras not isomorphic to Calkin algebras of separable Banach spaces
Authors:
Bence Horváth,
Tomasz Kania
Abstract:
Recent developments in Banach space theory provided unexpected examples of unital Banach algebras that are isomorphic to Calkin algebras of Banach spaces, however no example of a unital Banach algebra that cannot be realised as a~Calkin algebra has been found so far. This naturally led to the question of possible limitations of such assignments. In the present note we provide examples of unital Ba…
▽ More
Recent developments in Banach space theory provided unexpected examples of unital Banach algebras that are isomorphic to Calkin algebras of Banach spaces, however no example of a unital Banach algebra that cannot be realised as a~Calkin algebra has been found so far. This naturally led to the question of possible limitations of such assignments. In the present note we provide examples of unital Banach algebras meeting the necessary density condition for being the Calkin algebra of a separable Banach space that are not isomorphic to Calkin algebras of such spaces, nonetheless. The examples may be found of the form $C(X)$ for a compact space $X$, $\ell_1(G)$ for some torsion-free Abelian group, and a~simple, unital AF $C^*$-algebra. Extensions to higher densities are also presented.
△ Less
Submitted 3 March, 2021; v1 submitted 25 January, 2021;
originally announced January 2021.
-
Surjective homomorphisms from algebras of operators on long sequence spaces are automatically injective
Authors:
Bence Horváth,
Tomasz Kania
Abstract:
We study automatic injectivity of surjective algebra homomorphisms from $\mathscr{B}(X)$, the algebra of (bounded, linear) operators on $X$, to $\mathscr{B}(Y)$, where $X$ is one of the following \emph{long} sequence spaces: $c_0(λ)$, $\ell_{\infty}^c(λ)$, and $\ell_p(λ)$ ($1 \leqslant p < \infty$) and $Y$ is arbitrary. \textit{En route} to the proof that these spaces do indeed enjoy such a proper…
▽ More
We study automatic injectivity of surjective algebra homomorphisms from $\mathscr{B}(X)$, the algebra of (bounded, linear) operators on $X$, to $\mathscr{B}(Y)$, where $X$ is one of the following \emph{long} sequence spaces: $c_0(λ)$, $\ell_{\infty}^c(λ)$, and $\ell_p(λ)$ ($1 \leqslant p < \infty$) and $Y$ is arbitrary. \textit{En route} to the proof that these spaces do indeed enjoy such a property, we classify two-sided ideals of the algebra of operators of any of the aforementioned Banach spaces that are closed with respect to the `sequential strong operator topology'.
△ Less
Submitted 30 November, 2020; v1 submitted 28 July, 2020;
originally announced July 2020.
-
A Data-driven Market Simulator for Small Data Environments
Authors:
Hans Bühler,
Blanka Horvath,
Terry Lyons,
Imanol Perez Arribas,
Ben Wood
Abstract:
Neural network based data-driven market simulation unveils a new and flexible way of modelling financial time series without imposing assumptions on the underlying stochastic dynamics. Though in this sense generative market simulation is model-free, the concrete modelling choices are nevertheless decisive for the features of the simulated paths. We give a brief overview of currently used generativ…
▽ More
Neural network based data-driven market simulation unveils a new and flexible way of modelling financial time series without imposing assumptions on the underlying stochastic dynamics. Though in this sense generative market simulation is model-free, the concrete modelling choices are nevertheless decisive for the features of the simulated paths. We give a brief overview of currently used generative modelling approaches and performance evaluation metrics for financial time series, and address some of the challenges to achieve good results in the latter. We also contrast some classical approaches of market simulation with simulation based on generative modelling and highlight some advantages and pitfalls of the new approach. While most generative models tend to rely on large amounts of training data, we present here a generative model that works reliably in environments where the amount of available training data is notoriously small. Furthermore, we show how a rough paths perspective combined with a parsimonious Variational Autoencoder framework provides a powerful way for encoding and evaluating financial time series in such environments where available training data is scarce. Finally, we also propose a suitable performance evaluation metric for financial time series and discuss some connections of our Market Generator to deep hedging.
△ Less
Submitted 21 June, 2020;
originally announced June 2020.
-
Perturbations of surjective homomorphisms between algebras of operators on Banach spaces
Authors:
Bence Horváth,
Zsigmond Tarcsay
Abstract:
A remarkable result of Molnár [Proc. Amer. Math. Soc., 126 (1998), 853-861] states that automorphisms of the algebra of operators acting on a separable Hilbert space is stable under "small" perturbations. More precisely, if $φ,ψ$ are endomorphisms of $\mathcal{B}(\mathcal{H})$ such that $\|φ(A)-ψ(A)\|<\|A\|$ and $ψ$ is surjective then so is $φ$. The aim of this paper is to extend this result to a…
▽ More
A remarkable result of Molnár [Proc. Amer. Math. Soc., 126 (1998), 853-861] states that automorphisms of the algebra of operators acting on a separable Hilbert space is stable under "small" perturbations. More precisely, if $φ,ψ$ are endomorphisms of $\mathcal{B}(\mathcal{H})$ such that $\|φ(A)-ψ(A)\|<\|A\|$ and $ψ$ is surjective then so is $φ$. The aim of this paper is to extend this result to a larger class of Banach spaces including $\ell_p$ and $L_p$ spaces ($1<p<+\infty$). En route to the proof we show that for any Banach space $X$ from the above class all faithful, unital, separable, reflexive representations of $\mathcal B (X)$ which preserve rank one operators are in fact isomorphisms.
△ Less
Submitted 24 May, 2021; v1 submitted 10 March, 2020;
originally announced March 2020.
-
Ring-theoretic (in)finiteness in reduced products of Banach algebras
Authors:
Matthew Daws,
Bence Horváth
Abstract:
We study ring-theoretic (in)finiteness properties -- such as \emph{Dedekind-finiteness} and \emph{proper infiniteness} -- of ultraproducts (and more generally, reduced products) of Banach algebras.
Whilst we characterise when an ultraproduct has these ring-theoretic properties in terms of its underlying sequence of algebras, we find that, contrary to the $C^*$-algebraic setting, it is not true i…
▽ More
We study ring-theoretic (in)finiteness properties -- such as \emph{Dedekind-finiteness} and \emph{proper infiniteness} -- of ultraproducts (and more generally, reduced products) of Banach algebras.
Whilst we characterise when an ultraproduct has these ring-theoretic properties in terms of its underlying sequence of algebras, we find that, contrary to the $C^*$-algebraic setting, it is not true in general that an ultraproduct has a ring-theoretic finiteness property if and only if "ultrafilter many" of the underlying sequence of algebras have the same property. This might appear to violate the continuous model theoretic counterpart of Łoś's Theorem; the reason it does not is that for a general Banach algebra, the ring theoretic properties we consider cannot be verified by considering a bounded subset of the algebra of \emph{fixed} bound. For Banach algebras, we construct counter-examples to show, for example, that each component Banach algebra can fail to be Dedekind-finite while the ultraproduct is Dedekind-finite, and we explain why such a counter-example is not possible for $C^*$-algebras. Finally the related notion of having \textit{stable rank one} is also studied for ultraproducts.
△ Less
Submitted 23 June, 2020; v1 submitted 15 December, 2019;
originally announced December 2019.
-
On deep calibration of (rough) stochastic volatility models
Authors:
Christian Bayer,
Blanka Horvath,
Aitor Muguruza,
Benjamin Stemper,
Mehdi Tomas
Abstract:
Techniques from deep learning play a more and more important role for the important task of calibration of financial models. The pioneering paper by Hernandez [Risk, 2017] was a catalyst for resurfacing interest in research in this area. In this paper we advocate an alternative (two-step) approach using deep learning techniques solely to learn the pricing map -- from model parameters to prices or…
▽ More
Techniques from deep learning play a more and more important role for the important task of calibration of financial models. The pioneering paper by Hernandez [Risk, 2017] was a catalyst for resurfacing interest in research in this area. In this paper we advocate an alternative (two-step) approach using deep learning techniques solely to learn the pricing map -- from model parameters to prices or implied volatilities -- rather than directly the calibrated model parameters as a function of observed market data. Having a fast and accurate neural-network-based approximating pricing map (first step), we can then (second step) use traditional model calibration algorithms. In this work we showcase a direct comparison of different potential approaches to the learning stage and present algorithms that provide a suffcient accuracy for practical use. We provide a first neural network-based calibration method for rough volatility models for which calibration can be done on the y. We demonstrate the method via a hands-on calibration engine on the rough Bergomi model, for which classical calibration techniques are diffcult to apply due to the high cost of all known numerical pricing methods. Furthermore, we display and compare different types of sampling and training methods and elaborate on their advantages under different objectives. As a further application we use the fast pricing method for a Bayesian analysis of the calibrated model.
△ Less
Submitted 22 August, 2019;
originally announced August 2019.
-
Deep Learning Volatility
Authors:
Blanka Horvath,
Aitor Muguruza,
Mehdi Tomas
Abstract:
We present a neural network based calibration method that performs the calibration task within a few milliseconds for the full implied volatility surface. The framework is consistently applicable throughout a range of volatility models -including the rough volatility family- and a range of derivative contracts. The aim of neural networks in this work is an off-line approximation of complex pricing…
▽ More
We present a neural network based calibration method that performs the calibration task within a few milliseconds for the full implied volatility surface. The framework is consistently applicable throughout a range of volatility models -including the rough volatility family- and a range of derivative contracts. The aim of neural networks in this work is an off-line approximation of complex pricing functions, which are difficult to represent or time-consuming to evaluate by other means. We highlight how this perspective opens new horizons for quantitative modelling: The calibration bottleneck posed by a slow pricing of derivative contracts is lifted. This brings several numerical pricers and model families (such as rough volatility models) within the scope of applicability in industry practice. The form in which information from available data is extracted and stored influences network performance: This approach is inspired by representing the implied volatility and option prices as a collection of pixels. In a number of applications we demonstrate the prowess of this modelling approach regarding accuracy, speed, robustness and generality and also its potentials towards model recognition.
△ Less
Submitted 22 August, 2019; v1 submitted 28 January, 2019;
originally announced January 2019.
-
When are full representations of algebras of operators on Banach spaces automatically faithful?
Authors:
Bence Horváth
Abstract:
We examine the phenomenon when surjective algebra homomorphisms between algebras of operators on Banach spaces are automatically injective. In the first part of the paper we shall show that for certain Banach spaces $X$ the following property holds: For every non-zero Banach space $Y$ every surjective algebra homomorphism $ψ: \, \mathcal{B}(X) \rightarrow \mathcal{B}(Y)$ is automatically injective…
▽ More
We examine the phenomenon when surjective algebra homomorphisms between algebras of operators on Banach spaces are automatically injective. In the first part of the paper we shall show that for certain Banach spaces $X$ the following property holds: For every non-zero Banach space $Y$ every surjective algebra homomorphism $ψ: \, \mathcal{B}(X) \rightarrow \mathcal{B}(Y)$ is automatically injective. In the second part of the paper we consider the question in the opposite direction: Building on the work of Kania, Koszmider and Laustsen \textit{(Trans. London Math. Soc., 2014)} we show that for every separable, reflexive Banach space $X$ there is a Banach space $Y_X$ and a surjective but not injective algebra homomorphism $ψ: \, \mathcal{B}(Y_X) \rightarrow \mathcal{B}(X)$.
△ Less
Submitted 30 May, 2019; v1 submitted 16 November, 2018;
originally announced November 2018.
-
A Banach space whose algebra of operators is Dedekind-finite but it does not have stable rank one
Authors:
Bence Horváth
Abstract:
In this note we examine the connection between the stable rank one and Dedekind-finite property of the algebra of operators on a Banach space $X$. We show that for the indecomposable but not hereditarily indecomposable Banach space $X_{\infty}$ constructed by Tarbard (Ph.D. Thesis, University of Oxford, 2013), the algebra of operators $B(X_{\infty})$ is Dedekind-finite but does not have stable ran…
▽ More
In this note we examine the connection between the stable rank one and Dedekind-finite property of the algebra of operators on a Banach space $X$. We show that for the indecomposable but not hereditarily indecomposable Banach space $X_{\infty}$ constructed by Tarbard (Ph.D. Thesis, University of Oxford, 2013), the algebra of operators $B(X_{\infty})$ is Dedekind-finite but does not have stable rank one. While this sheds some light on the Banach space structure of $X_{\infty}$ itself, we observe that the indecomposable but not hereditarily indecomposable Banach space constructed by Gowers and Maurey (Math. Ann., 1997) does not possess this property.
△ Less
Submitted 14 January, 2019; v1 submitted 27 July, 2018;
originally announced July 2018.
-
Volatility options in rough volatility models
Authors:
Blanka Horvath,
Antoine Jacquier,
Peter Tankov
Abstract:
We discuss the pricing and hedging of volatility options in some rough volatility models. First, we develop efficient Monte Carlo methods and asymptotic approximations for computing option prices and hedge ratios in models where log-volatility follows a Gaussian Volterra process. While providing a good fit for European options, these models are unable to reproduce the VIX option smile observed in…
▽ More
We discuss the pricing and hedging of volatility options in some rough volatility models. First, we develop efficient Monte Carlo methods and asymptotic approximations for computing option prices and hedge ratios in models where log-volatility follows a Gaussian Volterra process. While providing a good fit for European options, these models are unable to reproduce the VIX option smile observed in the market, and are thus not suitable for VIX products. To accommodate these, we introduce the class of modulated Volterra processes, and show that they successfully capture the VIX smile.
△ Less
Submitted 30 January, 2019; v1 submitted 5 February, 2018;
originally announced February 2018.
-
Dirichlet Forms and Finite Element Methods for the SABR Model
Authors:
Blanka Horvath,
Oleg Reichmann
Abstract:
We propose a deterministic numerical method for pricing vanilla options under the SABR stochastic volatility model, based on a finite element discretization of the Kolmogorov pricing equations via non-symmetric Dirichlet forms. Our pricing method is valid under mild assumptions on parameter configurations of the process both in moderate interest rate environments and in near-zero interest rate reg…
▽ More
We propose a deterministic numerical method for pricing vanilla options under the SABR stochastic volatility model, based on a finite element discretization of the Kolmogorov pricing equations via non-symmetric Dirichlet forms. Our pricing method is valid under mild assumptions on parameter configurations of the process both in moderate interest rate environments and in near-zero interest rate regimes such as the currently prevalent ones. The parabolic Kolmogorov pricing equations for the SABR model are degenerate at the origin, yielding non-standard partial differential equations, for which conventional pricing methods ---designed for non-degenerate parabolic equations--- potentially break down. We derive here the appropriate analytic setup to handle the degeneracy of the model at the origin. That is, we construct an evolution triple of suitably chosen Sobolev spaces with singular weights, consisting of the domain of the SABR-Dirichlet form, its dual space, and the pivotal Hilbert space. In particular, we show well-posedness of the variational formulation of the SABR-pricing equations for vanilla and barrier options on this triple. Furthermore, we present a finite element discretization scheme based on a (weighted) multiresolution wavelet approximation in space and a $θ$-scheme in time and provide an error analysis for this discretization.
△ Less
Submitted 8 January, 2018;
originally announced January 2018.
-
Functional central limit theorems for rough volatility
Authors:
Blanka Horvath,
Antoine Jacquier,
Aitor Muguruza,
Andreas Sojmark
Abstract:
The non-Markovian nature of rough volatility processes makes Monte Carlo methods challenging and it is in fact a major challenge to develop fast and accurate simulation algorithms. We provide an efficient one for stochastic Volterra processes, based on an extension of Donsker's approximation of Brownian motion to the fractional Brownian case with arbitrary Hurst exponent $H \in (0,1)$. Some of the…
▽ More
The non-Markovian nature of rough volatility processes makes Monte Carlo methods challenging and it is in fact a major challenge to develop fast and accurate simulation algorithms. We provide an efficient one for stochastic Volterra processes, based on an extension of Donsker's approximation of Brownian motion to the fractional Brownian case with arbitrary Hurst exponent $H \in (0,1)$. Some of the most relevant consequences of this `rough Donsker (rDonsker) Theorem' are functional weak convergence results in Skorokhod space for discrete approximations of a large class of rough stochastic volatility models. This justifies the validity of simple and easy-to-implement Monte-Carlo methods, for which we provide detailed numerical recipes. We test these against the current benchmark Hybrid scheme~\cite{BLP17} and find remarkable agreement (for a large range of values of~$H$). This rDonsker Theorem further provides a weak convergence proof for the Hybrid scheme itself, and allows to construct binomial trees for rough volatility models, the first available scheme (in the rough volatility context) for early exercise options such as American or Bermudan options.
△ Less
Submitted 11 November, 2023; v1 submitted 8 November, 2017;
originally announced November 2017.
-
Asymptotic behaviour of randomised fractional volatility models
Authors:
B. Horvath,
A. Jacquier,
C. Lacombe
Abstract:
We study the asymptotic behaviour of a class of small-noise diffusions driven by fractional Brownian motion, with random starting points. Different scalings allow for different asymptotic properties of the process (small-time and tail behaviours in particular). In order to do so, we extend some results on sample path large deviations for such diffusions. As an application, we show how these result…
▽ More
We study the asymptotic behaviour of a class of small-noise diffusions driven by fractional Brownian motion, with random starting points. Different scalings allow for different asymptotic properties of the process (small-time and tail behaviours in particular). In order to do so, we extend some results on sample path large deviations for such diffusions. As an application, we show how these results characterise the small-time and tail estimates of the implied volatility for rough volatility models, recently proposed in mathematical finance.
△ Less
Submitted 20 December, 2018; v1 submitted 3 August, 2017;
originally announced August 2017.
-
Short-time near-the-money skew in rough fractional volatility models
Authors:
Christian Bayer,
Peter K. Friz,
Archil Gulisashvili,
Blanka Horvath,
Benjamin Stemper
Abstract:
We consider rough stochastic volatility models where the driving noise of volatility has fractional scaling, in the "rough" regime of Hurst parameter $H < 1/2$. This regime recently attracted a lot of attention both from the statistical and option pricing point of view. With focus on the latter, we sharpen the large deviation results of Forde-Zhang (2017) in a way that allows us to zoom-in around…
▽ More
We consider rough stochastic volatility models where the driving noise of volatility has fractional scaling, in the "rough" regime of Hurst parameter $H < 1/2$. This regime recently attracted a lot of attention both from the statistical and option pricing point of view. With focus on the latter, we sharpen the large deviation results of Forde-Zhang (2017) in a way that allows us to zoom-in around the money while maintaining full analytical tractability. More precisely, this amounts to proving higher order moderate deviation estimates, only recently introduced in the option pricing context. This in turn allows us to push the applicability range of known at-the-money skew approximation formulae from CLT type log-moneyness deviations of order $t^{1/2}$ (recent works of Alòs, León & Vives and Fukasawa) to the wider moderate deviations regime.
△ Less
Submitted 9 March, 2018; v1 submitted 15 March, 2017;
originally announced March 2017.
-
Functional Analytic (Ir-)Regularity Properties of SABR-type Processes
Authors:
Leif Doering,
Blanka Horvath,
Josef Teichmann
Abstract:
The SABR model is a benchmark stochastic volatility model in interest rate markets, which has received much attention in the past decade. Its popularity arose from a tractable asymptotic expansion for implied volatility, derived by heat kernel methods. As markets moved to historically low rates, this expansion appeared to yield inconsistent prices. Since the model is deeply embedded in market prac…
▽ More
The SABR model is a benchmark stochastic volatility model in interest rate markets, which has received much attention in the past decade. Its popularity arose from a tractable asymptotic expansion for implied volatility, derived by heat kernel methods. As markets moved to historically low rates, this expansion appeared to yield inconsistent prices. Since the model is deeply embedded in market practice, alternative pricing methods for SABR have been addressed in numerous approaches in recent years. All standard option pricing methods make certain regularity assumptions on the underlying model, but for SABR these are rarely satisfied. We examine here regularity properties of the model from this perspective with view to a number of (asymptotic and numerical) option pricing methods. In particular, we highlight delicate degeneracies of the SABR model (and related processes) at the origin, which deem the currently used popular heat kernel methods and all related methods from (sub-) Riemannian geometry ill-suited for SABR-type processes, when interest rates are near zero. We describe a more general semigroup framework, which permits to derive a suitable geometry for SABR-type processes (in certain parameter regimes) via symmetric Dirichlet forms. Furthermore, we derive regularity properties (Feller- properties and strong continuity properties) necessary for the applicability of popular numerical schemes to SABR-semigroups, and identify suitable Banach- and Hilbert spaces for these. Finally, we comment on the short time and large time asymptotic behaviour of SABR-type processes beyond the heat-kernel framework.
△ Less
Submitted 8 January, 2017;
originally announced January 2017.
-
On the probability of hitting the boundary for Brownian motions on the SABR plane
Authors:
Archil Gulisashvili,
Blanka Horvath,
Antoine Jacquier
Abstract:
Starting from the hyperbolic Brownian motion as a time-changed Brownian motion, we explore a set of probabilistic models--related to the SABR model in mathematical finance--which can be obtained by geometry-preserving transformations, and show how to translate the properties of the hyperbolic Brownian motion (density, probability mass, drift) to each particular model. Our main result is an explici…
▽ More
Starting from the hyperbolic Brownian motion as a time-changed Brownian motion, we explore a set of probabilistic models--related to the SABR model in mathematical finance--which can be obtained by geometry-preserving transformations, and show how to translate the properties of the hyperbolic Brownian motion (density, probability mass, drift) to each particular model. Our main result is an explicit expression for the probability of any of these models hitting the boundary of their domains, the proof of which relies on the properties of the aforementioned transformations as well as time-change methods.
△ Less
Submitted 18 October, 2016;
originally announced October 2016.
-
Mass at zero in the uncorrelated SABR model and implied volatility asymptotics
Authors:
Archil Gulisashvili,
Blanka Horvath,
Antoine Jacquier
Abstract:
We study the mass at the origin in the uncorrelated SABR stochastic volatility model, and derive several tractable expressions, in particular when time becomes small or large. As an application--in fact the original motivation for this paper--we derive small-strike expansions for the implied volatility when the maturity becomes short or large. These formulae, by definition arbitrage free, allow us…
▽ More
We study the mass at the origin in the uncorrelated SABR stochastic volatility model, and derive several tractable expressions, in particular when time becomes small or large. As an application--in fact the original motivation for this paper--we derive small-strike expansions for the implied volatility when the maturity becomes short or large. These formulae, by definition arbitrage free, allow us to quantify the impact of the mass at zero on existing implied volatility approximations, and in particular how correct/erroneous these approximations become.
△ Less
Submitted 22 November, 2016; v1 submitted 11 February, 2015;
originally announced February 2015.
-
Dynamic dielectric response of electrorheological fluids in drag flow
Authors:
B. Horváth,
I. Szalai
Abstract:
We have determined the response time of dilute electrorheological fluids (ER) in drag flow from the dynamic dielectric response. On the basis of a kinetic rate equation a new formula was derived to approximate the experimental time-dependent dielectric permittivity during the temporal evolution of the microstructure. The dielectric response time was compared to the standard rheological response ti…
▽ More
We have determined the response time of dilute electrorheological fluids (ER) in drag flow from the dynamic dielectric response. On the basis of a kinetic rate equation a new formula was derived to approximate the experimental time-dependent dielectric permittivity during the temporal evolution of the microstructure. The dielectric response time was compared to the standard rheological response time extracted from the time-dependent shear stress, and a good agreement was obtained. We found that the dielectric method is more sensitive to detect any transient during the chain formation process. The experimental saturation value of the dielectric permittivity corresponding to the equilibrium microstructure was estimated on the basis of formulas derived from the Clausius-Mossotti equation.
△ Less
Submitted 17 December, 2015; v1 submitted 30 December, 2014;
originally announced December 2014.
-
Fluctuation-exchange approximation theory of the non-equilibrium singlet-triplet transition
Authors:
Bertalan Horváth,
Bence Lazarovits,
Gergely Zaránd
Abstract:
As a continuation of a previous work [B. Horváth et al., Phys. Rev. B {\bf 82}, 165129 (2010)], here we extend the so-called Fluctuation Exchange Approximation (FLEX) to study the non-equilibrium singlet-triplet transition. We show that, while being relatively fast and a conserving approximation, FLEX is able to recover all important features of the transition, including the evolution of the linea…
▽ More
As a continuation of a previous work [B. Horváth et al., Phys. Rev. B {\bf 82}, 165129 (2010)], here we extend the so-called Fluctuation Exchange Approximation (FLEX) to study the non-equilibrium singlet-triplet transition. We show that, while being relatively fast and a conserving approximation, FLEX is able to recover all important features of the transition, including the evolution of the linear conductance throughout the transition, the two-stage Kondo effect on the triplet side, and the gradual opening of the singlet-triplet gap on the triplet side of the transition. A comparison with numerical renormalization group calculations also shows that FLEX captures rather well the width of the Kondo resonance. FLEX thus offers a viable route to describe correlated multi-level systems under non-equilibrium conditions, and, in its rather general form, as formulated here, it could find a broad application in molecular electronics calculations.
△ Less
Submitted 28 November, 2011; v1 submitted 23 December, 2010;
originally announced December 2010.
-
FLEX-description of the spectral functions near singlet-triplet transition
Authors:
Bertalan Horváth
Abstract:
In a previous article, we have investigated the non-equilibrium two-level Anderson model with a simple iterative perturbation theory. Here we use here a more sophisticated perturbative method, the fluctuation-exchange (FLEX) approximation. The great advantage of FLEX is its \textit{conserving} nature, and that it can describe well the Kondo energy scale, the Kondo-temperature, $T_{K}$. As it was e…
▽ More
In a previous article, we have investigated the non-equilibrium two-level Anderson model with a simple iterative perturbation theory. Here we use here a more sophisticated perturbative method, the fluctuation-exchange (FLEX) approximation. The great advantage of FLEX is its \textit{conserving} nature, and that it can describe well the Kondo energy scale, the Kondo-temperature, $T_{K}$. As it was expected from the results obtained with iterative perturbation theory, the FLEX description can give back also the relevant features of the spectral properties.
△ Less
Submitted 22 June, 2010;
originally announced June 2010.
-
Non-equilibrium transport theory of the singlet-triplet transition: perturbative approach
Authors:
Bertalan Horváth,
Bence Lazarovits,
Gergely Zaránd
Abstract:
We use a simple iterative perturbation theory to study the singlet-triplet (ST) transition in lateral and vertical quantum dots, modeled by the non-equilibrium two-level Anderson model. To a great surprise, the region of stable perturbation theory extends to relatively strong interactions, and this simple approach is able to reproduce all experimentally-observed features of the ST transition, incl…
▽ More
We use a simple iterative perturbation theory to study the singlet-triplet (ST) transition in lateral and vertical quantum dots, modeled by the non-equilibrium two-level Anderson model. To a great surprise, the region of stable perturbation theory extends to relatively strong interactions, and this simple approach is able to reproduce all experimentally-observed features of the ST transition, including the formation of a dip in the differential conductance of a lateral dot indicative of the two-stage Kondo effect, or the maximum in the linear conductance around the transition point. Choosing the right starting point to the perturbation theory is, however, crucial to obtain reliable and meaningful results.
△ Less
Submitted 5 July, 2010; v1 submitted 22 June, 2010;
originally announced June 2010.
-
Perturbative theory of the non-equilibrium singlet-triplet transition
Authors:
B. Horvath,
B. Lazarovits,
G. Zarand
Abstract:
We study equilibrium and non-equilibrium properties of a two-level quantum dot close to the singlet-triplet transition. We treat the on-site Coulomb interaction and Hund's rule coupling perturbatively within the Keldysh formalism. We compute the spectral functions and the differential conductance of the dot. For moderate interactions our perturbative approach captures the Kondo effect and many o…
▽ More
We study equilibrium and non-equilibrium properties of a two-level quantum dot close to the singlet-triplet transition. We treat the on-site Coulomb interaction and Hund's rule coupling perturbatively within the Keldysh formalism. We compute the spectral functions and the differential conductance of the dot. For moderate interactions our perturbative approach captures the Kondo effect and many of the experimentally observed properties.
△ Less
Submitted 2 September, 2009;
originally announced September 2009.
-
Failure of mean-field approach in out-of-equilibrium Anderson model
Authors:
Bertalan Horváth,
Bence Lazarovits,
Olivier Sauret,
Gergely Zaránd
Abstract:
To explore the limitations of the mean field approximation, frequently used in \textit{ab initio} molecular electronics calculations, we study an out-of-equilibrium Anderson impurity model in a scattering formalism. We find regions in the parameter space where both magnetic and non-magnetic solutions are stable. We also observe a hysteresis in the non-equilibrium magnetization and current as a f…
▽ More
To explore the limitations of the mean field approximation, frequently used in \textit{ab initio} molecular electronics calculations, we study an out-of-equilibrium Anderson impurity model in a scattering formalism. We find regions in the parameter space where both magnetic and non-magnetic solutions are stable. We also observe a hysteresis in the non-equilibrium magnetization and current as a function of the applied bias voltage. The mean field method also predicts incorrectly local moment formation for large biases and a spin polarized current, and unphysical kinks appear in various physical quantities. The mean field approximation thus fails in every region where it predicts local moment formation.
△ Less
Submitted 3 December, 2007;
originally announced December 2007.