Search | arXiv e-print repository

Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models

Authors: Raeid Saqur, Anastasis Kratsios, Florian Krach, Yannick Limmer, Jacob-Junqi Tian, John Willes, Blanka Horvath, Frank Rudzicz

Abstract: We propose MoE-F -- a formalised mechanism for combining $N$ pre-trained expert Large Language Models (LLMs) in online time-series prediction tasks by adaptively forecasting the best weighting of LLM predictions at every time step. Our mechanism leverages the conditional information in each expert's running performance to forecast the best combination of LLMs for predicting the time series in its… ▽ More We propose MoE-F -- a formalised mechanism for combining $N$ pre-trained expert Large Language Models (LLMs) in online time-series prediction tasks by adaptively forecasting the best weighting of LLM predictions at every time step. Our mechanism leverages the conditional information in each expert's running performance to forecast the best combination of LLMs for predicting the time series in its next step. Diverging from static (learned) Mixture of Experts (MoE) methods, MoE-F employs time-adaptive stochastic filtering techniques to combine experts. By framing the expert selection problem as a finite state-space, continuous-time Hidden Markov model (HMM), we can leverage the Wohman-Shiryaev filter. Our approach first constructs $N$ parallel filters corresponding to each of the $N$ individual LLMs. Each filter proposes its best combination of LLMs, given the information that they have access to. Subsequently, the $N$ filter outputs are aggregated to optimize a lower bound for the loss of the aggregated LLMs, which can be optimized in closed-form, thus generating our ensemble predictor. Our contributions here are: (I) the MoE-F algorithm -- deployable as a plug-and-play filtering harness, (II) theoretical optimality guarantees of the proposed filtering-based gating algorithm, and (III) empirical evaluation and ablative results using state of the art foundational and MoE LLMs on a real-world Financial Market Movement task where MoE-F attains a remarkable 17% absolute and 48.5% relative F1 measure improvement over the next best performing individual LLM expert. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: 29 pages, 5 Appendix sections

MSC Class: 60J05; 60G35; 68T20; 68T42; 68T50 ACM Class: I.2.6; I.2.7; G.3

arXiv:2405.16563 [pdf, other]

Reality Only Happens Once: Single-Path Generalization Bounds for Transformers

Authors: Yannick Limmer, Anastasis Kratsios, Xuwei Yang, Raeid Saqur, Blanka Horvath

Abstract: One of the inherent challenges in deploying transformers on time series is that \emph{reality only happens once}; namely, one typically only has access to a single trajectory of the data-generating process comprised of non-i.i.d. observations. We derive non-asymptotic statistical guarantees in this setting through bounds on the \textit{generalization} of a transformer network at a future-time $t$,… ▽ More One of the inherent challenges in deploying transformers on time series is that \emph{reality only happens once}; namely, one typically only has access to a single trajectory of the data-generating process comprised of non-i.i.d. observations. We derive non-asymptotic statistical guarantees in this setting through bounds on the \textit{generalization} of a transformer network at a future-time $t$, given that it has been trained using $N\le t$ observations from a single perturbed trajectory of a Markov process. Under the assumption that the Markov process satisfies a log-Sobolev inequality, we obtain a generalization bound which effectively converges at the rate of ${O}(1/\sqrt{N})$. Our bound depends explicitly on the activation function ($\operatorname{Swish}$, $\operatorname{GeLU}$, or $\tanh$ are considered), the number of self-attention heads, depth, width, and norm-bounds defining the transformer architecture. Our bound consists of three components: (I) The first quantifies the gap between the stationary distribution of the data-generating Markov process and its distribution at time $t$, this term converges exponentially to $0$. (II) The next term encodes the complexity of the transformer model and, given enough time, eventually converges to $0$ at the rate ${O}(\log(N)^r/\sqrt{N})$ for any $r>0$. (III) The third term guarantees that the bound holds with probability at least $1$-$δ$, and converges at a rate of ${O}(\sqrt{\log(1/δ)}/\sqrt{N})$. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: 11 pages (+30 appendix), 3 figures, 6 tables

MSC Class: 60G35; 62M20; 68T07; 41A65

arXiv:2401.15791 [pdf, other]

doi 10.1016/j.ifacol.2023.10.1047

Improving Kernel-Based Nonasymptotic Simultaneous Confidence Bands

Authors: Balázs Csanád Csáji, Bálint Horváth

Abstract: The paper studies the problem of constructing nonparametric simultaneous confidence bands with nonasymptotic and distribition-free guarantees. The target function is assumed to be band-limited and the approach is based on the theory of Paley-Wiener reproducing kernel Hilbert spaces. The starting point of the paper is a recently developed algorithm to which we propose three types of improvements. F… ▽ More The paper studies the problem of constructing nonparametric simultaneous confidence bands with nonasymptotic and distribition-free guarantees. The target function is assumed to be band-limited and the approach is based on the theory of Paley-Wiener reproducing kernel Hilbert spaces. The starting point of the paper is a recently developed algorithm to which we propose three types of improvements. First, we relax the assumptions on the noises by replacing the symmetricity assumption with a weaker distributional invariance principle. Then, we propose a more efficient way to estimate the norm of the target function, and finally we enhance the construction of the confidence bands by tightening the constraints of the underlying convex optimization problems. The refinements are also illustrated through numerical experiments. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Journal ref: 22nd IFAC World Congress, Yokohama, Japan, 2023, 10357-10362

arXiv:2308.15135 [pdf, other]

Signature Trading: A Path-Dependent Extension of the Mean-Variance Framework with Exogenous Signals

Authors: Owen Futter, Blanka Horvath, Magnus Wiese

Abstract: In this article we introduce a portfolio optimisation framework, in which the use of rough path signatures (Lyons, 1998) provides a novel method of incorporating path-dependencies in the joint signal-asset dynamics, naturally extending traditional factor models, while kee** the resulting formulas lightweight and easily interpretable. We achieve this by representing a trading strategy as a linear… ▽ More In this article we introduce a portfolio optimisation framework, in which the use of rough path signatures (Lyons, 1998) provides a novel method of incorporating path-dependencies in the joint signal-asset dynamics, naturally extending traditional factor models, while kee** the resulting formulas lightweight and easily interpretable. We achieve this by representing a trading strategy as a linear functional applied to the signature of a path (which we refer to as "Signature Trading" or "Sig-Trading"). This allows the modeller to efficiently encode the evolution of past time-series observations into the optimisation problem. In particular, we derive a concise formulation of the dynamic mean-variance criterion alongside an explicit solution in our setting, which naturally incorporates a drawdown control in the optimal strategy over a finite time horizon. Secondly, we draw parallels between classical portfolio stategies and Sig-Trading strategies and explain how the latter leads to a pathwise extension of the classical setting via the "Signature Efficient Frontier". Finally, we give examples when trading under an exogenous signal as well as examples for momentum and pair-trading strategies, demonstrated both on synthetic and market data. Our framework combines the best of both worlds between classical theory (whose appeal lies in clear and concise formulae) and between modern, flexible data-driven methods that can handle more realistic datasets. The advantage of the added flexibility of the latter is that one can bypass common issues such as the accumulation of heteroskedastic and asymmetric residuals during the optimisation phase. Overall, Sig-Trading combines the flexibility of data-driven methods without compromising on the clarity of the classical theory and our presented results provide a compelling toolbox that yields superior results for a large class of trading strategies. △ Less

Submitted 30 August, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

arXiv:2307.14821 [pdf, other]

Experimental validation of particle-in-cell/Monte Carlo collisions simulations in low-pressure neon capacitively coupled plasmas

Authors: Chan-Won Park, Benedek Horváth, Aranka Derzsi, Julian Schulze, J. H. Kim, Zoltán Donkó, Hyo-Chang Lee

Abstract: Plasma simulations are powerful tools for understanding fundamental plasma science phenomena and for process optimization in applications. To ensure their quantitative accuracy, they must be validated against experiments. In this work, such an experimental validation is performed for a 1d3v particle-in-cell simulation complemented with the Monte Carlo treatment of collision processes of a capaciti… ▽ More Plasma simulations are powerful tools for understanding fundamental plasma science phenomena and for process optimization in applications. To ensure their quantitative accuracy, they must be validated against experiments. In this work, such an experimental validation is performed for a 1d3v particle-in-cell simulation complemented with the Monte Carlo treatment of collision processes of a capacitively coupled radio frequency plasma driven at 13.56 MHz and operated in neon gas. In a geometrically symmetric reactor the electron density in the discharge center and the spatio-temporal distribution of the electron impact excitation rate from the ground into the Ne 2p$_1$ state are measured by a microwave cutoff probe and phase resolved optical emission spectroscopy, respectively. The measurements are conducted for electrode gaps between 50 mm and 90 mm, neutral gas pressures between 20 mTorr and 50 mTorr, and peak-to-peak values of the driving voltage waveform between 250 V and 650 V. Simulations are performed under identical discharge conditions. In the simulations, various combinations of surface coefficients characterising the interactions of electrons and heavy particles with the anodized aluminium electrode surfaces are adopted. We find, that the simulations using a constant effective heavy particle induced secondary electron emission coefficient of 0.3 and a realistic electron-surface interaction model (which considers energy-dependent and material specific elastic and inelastic electron reflection, as well as the emission of true secondary electrons from the surface) yield results which are in good quantitative agreement with the experimental data. △ Less

Submitted 27 July, 2023; originally announced July 2023.

Comments: 19 pages, 6 figures, submitted to Plasma Sources Science and Technology

arXiv:2307.10319 [pdf, other]

Frequency-dependent electron power absorption mode transitions in capacitively coupled argon-oxygen plasmas

Authors: Aranka Derzsi, Mate Vass, Ranna Masheyeva, Benedek Horvath, Zoltan Donko, Peter Hartmann

Abstract: Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collision (PIC/MCC) simulations are performed to investigate the excitation dynamics in low-pressure capacitively coupled plasmas (CCPs) in argon-oxygen mixtures. The system used for this study is a geometrically symmetric CCP reactor operated in a fixed mixture gas composition, at fix… ▽ More Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collision (PIC/MCC) simulations are performed to investigate the excitation dynamics in low-pressure capacitively coupled plasmas (CCPs) in argon-oxygen mixtures. The system used for this study is a geometrically symmetric CCP reactor operated in a fixed mixture gas composition, at fixed pressure and voltage amplitude, with a wide range of driving RF frequencies (2$~$MHz$~\le f \le~15~$MHz). The measured and calculated spatio-temporal distributions of the electron impact excitation rates from the Ar ground state to the Ar$~\rm{2p_1}$ state (with a wavelength of 750.4~nm) show good qualitative agreement. The distributions show significant frequency dependence, which is generally considered to be predictive of transitions in the dominant discharge operating mode. Three frequency ranges can be distinguished, showing distinctly different excitation characteristics: (i) in the low frequency range ($f \le~3~$MHz), excitation is strong at the sheaths and weak in the bulk region; (ii) at intermediate frequencies (3.5$~$MHz$~\le f \le~5~$MHz), the excitation rate in the bulk region is enhanced and shows striation formation; (iii) above 6$~$MHz, excitation in the bulk gradually decreases with increasing frequency. Boltzmann term analysis was performed to quantify the frequency dependent contributions of the Ohmic and ambipolar terms to the electron power absorption. △ Less

Submitted 19 July, 2023; originally announced July 2023.

Comments: arXiv admin note: text overlap with arXiv:2205.06443

arXiv:2307.02310 [pdf, other]

Robust Hedging GANs

Authors: Yannick Limmer, Blanka Horvath

Abstract: The availability of deep hedging has opened new horizons for solving hedging problems under a large variety of realistic market conditions. At the same time, any model - be it a traditional stochastic model or a market generator - is at best an approximation of market reality, prone to model-misspecification and estimation errors. This raises the question, how to furnish a modelling setup with too… ▽ More The availability of deep hedging has opened new horizons for solving hedging problems under a large variety of realistic market conditions. At the same time, any model - be it a traditional stochastic model or a market generator - is at best an approximation of market reality, prone to model-misspecification and estimation errors. This raises the question, how to furnish a modelling setup with tools that can address the risk of discrepancy between anticipated distribution and market reality, in an automated way. Automated robustification is currently attracting increased attention in numerous investment problems, but it is a delicate task due to its imminent implications on risk management. Hence, it is beyond doubt that more activity can be anticipated on this topic to converge towards a consensus on best practices. This paper presents a natural extension of the original deep hedging framework to address uncertainty in the data generating process via an adversarial approach inspired by GANs to automate robustification in our hedging objective. This is achieved through an interplay of three modular components: (i) a (deep) hedging engine, (ii) a data-generating process (that is model agnostic permitting a large variety of classical models as well as machine learning-based market generators), and (iii) a notion of distance on model space to measure deviations between our market prognosis and reality. We do not restrict the ambiguity set to a region around a reference model, but instead penalize deviations from the anticipated distribution. Our suggested choice for each component is motivated by model agnosticism, allowing a seamless transition between settings. Since all individual components are already used in practice, we believe that our framework is easily adaptable to existing functional settings. △ Less

Submitted 5 July, 2023; originally announced July 2023.

ACM Class: G.3

arXiv:2306.15835 [pdf, other]

Non-parametric online market regime detection and regime clustering for multidimensional and path-dependent data structures

Authors: Zacharia Issa, Blanka Horvath

Abstract: In this work we present a non-parametric online market regime detection method for multidimensional data structures using a path-wise two-sample test derived from a maximum mean discrepancy-based similarity metric on path space that uses rough path signatures as a feature map. The latter similarity metric has been developed and applied as a discriminator in recent generative models for small data… ▽ More In this work we present a non-parametric online market regime detection method for multidimensional data structures using a path-wise two-sample test derived from a maximum mean discrepancy-based similarity metric on path space that uses rough path signatures as a feature map. The latter similarity metric has been developed and applied as a discriminator in recent generative models for small data environments, and has been optimised here to the setting where the size of new incoming data is particularly small, for faster reactivity. On the same principles, we also present a path-wise method for regime clustering which extends our previous work. The presented regime clustering techniques were designed as ex-ante market analysis tools that can identify periods of approximatively similar market activity, but the new results also apply to path-wise, high dimensional-, and to non-Markovian settings as well as to data structures that exhibit autocorrelation. We demonstrate our clustering tools on easily verifiable synthetic datasets of increasing complexity, and also show how the outlined regime detection techniques can be used as fast on-line automatic regime change detectors or as outlier detection tools, including a fully automated pipeline. Finally, we apply the fine-tuned algorithms to real-world historical data including high-dimensional baskets of equities and the recent price evolution of crypto assets, and we show that our methodology swiftly and accurately indicated historical periods of market turmoil. △ Less

Submitted 27 June, 2023; originally announced June 2023.

Comments: 65 pages, 52 figures

MSC Class: 60-08

arXiv:2305.18566 [pdf]

doi 10.1142/S2251171723400068

The Scientific Investigation of Unidentified Aerial Phenomena (UAP) Using Multimodal Ground-Based Observatories

Authors: Wesley Andrés Watters, Abraham Loeb, Frank Laukien, Richard Cloete, Alex Delacroix, Sergei Dobroshinsky, Benjamin Horvath, Ezra Kelderman, Sarah Little, Eric Masson, Andrew Mead, Mitch Randall, Forrest Schultz, Matthew Szenher, Foteini Vervelidou, Abigail White, Angelique Ahlström, Carol Cleland, Spencer Dockal, Natasha Donahue, Mark Elowitz, Carson Ezell, Alex Gersznowicz, Nicholas Gold, Michael G. Hercz , et al. (13 additional authors not shown)

Abstract: (Abridged) Unidentified Aerial Phenomena (UAP) have resisted explanation and have received little formal scientific attention for 75 years. A primary objective of the Galileo Project is to build an integrated software and instrumentation system designed to conduct a multimodal census of aerial phenomena and to recognize anomalies. Here we present key motivations for the study of UAP and address hi… ▽ More (Abridged) Unidentified Aerial Phenomena (UAP) have resisted explanation and have received little formal scientific attention for 75 years. A primary objective of the Galileo Project is to build an integrated software and instrumentation system designed to conduct a multimodal census of aerial phenomena and to recognize anomalies. Here we present key motivations for the study of UAP and address historical objections to this research. We describe an approach for highlighting outlier events in the high-dimensional parameter space of our census measurements. We provide a detailed roadmap for deciding measurement requirements, as well as a science traceability matrix (STM) for connecting sought-after physical parameters to observables and instrument requirements. We also discuss potential strategies for deciding where to locate instruments for development, testing, and final deployment. Our instrument package is multimodal and multispectral, consisting of (1) wide-field cameras in multiple bands for targeting and tracking of aerial objects and deriving their positions and kinematics using triangulation; (2) narrow-field instruments including cameras for characterizing morphology, spectra, polarimetry, and photometry; (3) passive multistatic arrays of antennas and receivers for radar-derived range and kinematics; (4) radio spectrum analyzers to measure radio and microwave emissions; (5) microphones for sampling acoustic emissions in the infrasonic through ultrasonic frequency bands; and (6) environmental sensors for characterizing ambient conditions (temperature, pressure, humidity, and wind velocity), as well as quasistatic electric and magnetic fields, and energetic particles. The use of multispectral instruments and multiple sensor modalities will help to ensure that artifacts are recognized and that true detections are corroborated and verifiable. △ Less

Submitted 31 May, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

Comments: This paper is published in the Journal of Astronomical Instrumentation, 12(1), 2340006 (2023) https://doi.org/10.1142/S2251171723400068

Journal ref: Journal of Astronomical Instrumentation, 12(1), 2340006 (2023)

arXiv:2305.16274 [pdf, other]

Non-adversarial training of Neural SDEs with signature kernel scores

Authors: Zacharia Issa, Blanka Horvath, Maud Lemercier, Cristopher Salvi

Abstract: Neural SDEs are continuous-time generative models for sequential data. State-of-the-art performance for irregular time series generation has been previously obtained by training these models adversarially as GANs. However, as typical for GAN architectures, training is notoriously unstable, often suffers from mode collapse, and requires specialised techniques such as weight clip** and gradient pe… ▽ More Neural SDEs are continuous-time generative models for sequential data. State-of-the-art performance for irregular time series generation has been previously obtained by training these models adversarially as GANs. However, as typical for GAN architectures, training is notoriously unstable, often suffers from mode collapse, and requires specialised techniques such as weight clip** and gradient penalty to mitigate these issues. In this paper, we introduce a novel class of scoring rules on pathspace based on signature kernels and use them as objective for training Neural SDEs non-adversarially. By showing strict properness of such kernel scores and consistency of the corresponding estimators, we provide existence and uniqueness guarantees for the minimiser. With this formulation, evaluating the generator-discriminator pair amounts to solving a system of linear path-dependent PDEs which allows for memory-efficient adjoint-based backpropagation. Moreover, because the proposed kernel scores are well-defined for paths with values in infinite dimensional spaces of functions, our framework can be easily extended to generate spatiotemporal data. Our procedure permits conditioning on a rich variety of market conditions and significantly outperforms alternative ways of training Neural SDEs on a variety of tasks including the simulation of rough volatility models, the conditional probabilistic forecasts of real-world forex pairs where the conditioning variable is an observed past trajectory, and the mesh-free generation of limit order book dynamics. △ Less

Submitted 25 May, 2023; originally announced May 2023.

Comments: Code available at https://github.com/issaz/sigker-nsdes/

arXiv:2304.01479 [pdf, ps, other]

Optimal Stop** via Distribution Regression: a Higher Rank Signature Approach

Authors: Blanka Horvath, Maud Lemercier, Chong Liu, Terry Lyons, Cristopher Salvi

Abstract: Distribution Regression on path-space refers to the task of learning functions map** the law of a stochastic process to a scalar target. The learning procedure based on the notion of path-signature, i.e. a classical transform from rough path theory, was widely used to approximate weakly continuous functionals, such as the pricing functionals of path--dependent options' payoffs. However, this app… ▽ More Distribution Regression on path-space refers to the task of learning functions map** the law of a stochastic process to a scalar target. The learning procedure based on the notion of path-signature, i.e. a classical transform from rough path theory, was widely used to approximate weakly continuous functionals, such as the pricing functionals of path--dependent options' payoffs. However, this approach fails for Optimal Stop** Problems arising from mathematical finance, such as the pricing of American options, because the corresponding value functions are in general discontinuous with respect to the weak topology. In this paper we develop a rigorous mathematical framework to resolve this issue by recasting an Optimal Stop** Problem as a higher order kernel mean embedding regression based on the notions of higher rank signatures of measure--valued paths and adapted topologies. The core computational component of our algorithm consists in solving a family of two--dimensional hyperbolic PDEs. △ Less

Submitted 3 April, 2023; originally announced April 2023.

Comments: 33 pages

MSC Class: Primary 60L10; Secondary 60L20; 60G40; 91G60

arXiv:2303.15063 [pdf, other]

Nonlocal dynamics of secondary electrons in capacitively coupled radio frequency discharges

Authors: Katharina Noesges, Maximilian Klich, Aranka Derzsi, Benedek Horváth, Julian Schulze, Ralf Peter Brinkmann, Thomas Mussenbrock, Sebastian Wilczek

Abstract: In capacitively coupled radio frequency (CCRF) discharges, the interaction of the plasma and the surface boundaries is linked to a variety of highly relevant phenomena for technological processes. One possible plasma-surface interaction is the generation of secondary electrons (SEs), which significantly influence the discharge when accelerated in the sheath electric field. However, SEs, in particu… ▽ More In capacitively coupled radio frequency (CCRF) discharges, the interaction of the plasma and the surface boundaries is linked to a variety of highly relevant phenomena for technological processes. One possible plasma-surface interaction is the generation of secondary electrons (SEs), which significantly influence the discharge when accelerated in the sheath electric field. However, SEs, in particular electron-induced SEs ($\updelta$-electrons), are frequently neglected in theory and simulations. Due to the relatively high threshold energy for the effective generation of $\updelta$-electrons at surfaces, their dynamics are closely connected and entangled with the dynamics of the ion-induced SEs ($\upgamma$-electrons). Thus, a fundamental understanding of the electron dynamics has to be achieved on a nanosecond timescale, and the effects of the different electron groups have to be segregated. This work utilizes $1d3v$ Particle-in-Cell/Monte Carlo Collisions (PIC/MCC) simulations of a symmetric discharge in the low-pressure regime ($p\,=\, 1\,\rm{Pa}$) with the inclusion of realistic electron-surface interactions for silicon dioxide. A diagnostic framework is introduced that segregates the electrons into three groups ("bulk-electrons", "$\upgamma$-electrons", and "$\updelta$-electrons") in order to analyze and discuss their dynamics. A variation of the electrode gap size $L_\mathrm{gap}$ is then presented as a control tool to alter the dynamics of the discharge significantly. It is demonstrated that this control results in two different regimes of low and high plasma density, respectively. The fundamental electron dynamics of both regimes are explained, which requires a complete analysis starting at global parameters (e.g., densities) down to single electron trajectories. △ Less

Submitted 27 March, 2023; originally announced March 2023.

arXiv:2211.12795 [pdf, ps, other]

Kernels of operators on certain Banach spaces associated with almost disjoint families

Authors: Bence Horváth, Niels Jakob Laustsen

Abstract: Given an infinite set $Γ$ and an almost disjoint family $\mathcal{A}$ on $Γ$, let $Y_{\mathcal{A}}$ denote the closed subspace of $\ell_\infty(Γ)$ spanned by the indicator functions $1_{\bigcap_{j=1}^n A_j}$ for $n\in\mathbb{N}$ and $A_1,\ldots,A_n\in\mathcal{A}$. We show that if $\mathcal{A}$ has cardinality greater than $Γ$, then $Y_{\mathcal{A}}$ contains closed subspaces which cannot be realis… ▽ More Given an infinite set $Γ$ and an almost disjoint family $\mathcal{A}$ on $Γ$, let $Y_{\mathcal{A}}$ denote the closed subspace of $\ell_\infty(Γ)$ spanned by the indicator functions $1_{\bigcap_{j=1}^n A_j}$ for $n\in\mathbb{N}$ and $A_1,\ldots,A_n\in\mathcal{A}$. We show that if $\mathcal{A}$ has cardinality greater than $Γ$, then $Y_{\mathcal{A}}$ contains closed subspaces which cannot be realised as the kernels of any bounded operators $Y_{\mathcal{A}}\rightarrow \ell_{\infty}(Γ)$. Consequently the spaces $\ell_{\infty}(Γ)$, for any infinite set $Γ$, and $C_0(K_{\mathcal{A}})$, where $\mathcal{A}$ is an uncountable almost disjoint family on $\mathbb{N}$ and $K_{\mathcal{A}}$ the locally compact Mrówka space associated with $\mathcal{A}$, contain closed subspaces which do not arise as the kernels of any bounded operators on them. △ Less

Submitted 23 November, 2022; originally announced November 2022.

Comments: 10 pages. Submitted

MSC Class: Primary 46B26; 46E15; 47B01; Secondary 18A30; 47B38

arXiv:2206.13629 [pdf, other]

doi 10.1109/LCSYS.2022.3185143

Nonparametric, Nonasymptotic Confidence Bands with Paley-Wiener Kernels for Band-Limited Functions

Authors: Balázs Csanád Csáji, Bálint Horváth

Abstract: The paper introduces a method to construct confidence bands for bounded, band-limited functions based on a finite sample of input-output pairs. The approach is distribution-free w.r.t. the observation noises and only the knowledge of the input distribution is assumed. It is nonparametric, that is, it does not require a parametric model of the regression function and the regions have non-asymptotic… ▽ More The paper introduces a method to construct confidence bands for bounded, band-limited functions based on a finite sample of input-output pairs. The approach is distribution-free w.r.t. the observation noises and only the knowledge of the input distribution is assumed. It is nonparametric, that is, it does not require a parametric model of the regression function and the regions have non-asymptotic guarantees. The algorithm is based on the theory of Paley-Wiener reproducing kernel Hilbert spaces. The paper first studies the fully observable variant, when there are no noises on the observations and only the inputs are random; then it generalizes the ideas to the noisy case using gradient-perturbation methods. Finally, numerical experiments demonstrating both cases are presented. △ Less

Submitted 27 June, 2022; originally announced June 2022.

Journal ref: IEEE Control Systems Letters, Volume 6, 2022, pp. 3355-3360

arXiv:2205.06443 [pdf, other]

doi 10.1088/1361-6595/ac7b45

Electron power absorption in capacitively coupled neon-oxygen plasmas: a comparison of experimental and computational results

Authors: A. Derzsi, P. Hartmann, M. Vass, B. Horváth, M. Gyulai, I. Korolov, J. Schulze, Z. Donkó

Abstract: Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collisions (PIC/MCC) simulations are used to study the electron power absorption and excitation/ionization dynamics in capacitively coupled plasmas (CCPs) in mixtures of neon and oxygen gases. The study is performed for a geometrically symmetric CCP reactor with a gap length of 2.5 cm… ▽ More Phase Resolved Optical Emission Spectroscopy (PROES) measurements combined with 1d3v Particle-in-Cell/Monte Carlo Collisions (PIC/MCC) simulations are used to study the electron power absorption and excitation/ionization dynamics in capacitively coupled plasmas (CCPs) in mixtures of neon and oxygen gases. The study is performed for a geometrically symmetric CCP reactor with a gap length of 2.5 cm at a driving frequency of 10~MHz and a peak-to-peak voltage of 350 V. The pressure of the gas mixture is varied between 15 Pa and 500 Pa, while the neon/oxygen concentration is tuned between 10% and 90%. For all discharge conditions, the spatio-temporal distribution of the electron-impact excitation rate from the Ne ground state into the Ne $\rm{2p^53p_0}$ state measured by PROES and obtained from PIC/MCC simulations show good qualitative agreement. Based on the emission/excitation patterns, multiple operation regimes are identified. Localized bright emission features at the bulk boundaries, caused by local maxima in the electronegativity are found at high pressures and high O$_2$ concentrations. The relative contributions of the ambipolar and the Ohmic electron power absorption are found to vary strongly with the discharge parameters: the Ohmic power absorption is enhanced by both the high collisionality at high pressures and the high electronegativity at low pressures. In the wide parameter regime covered in this study, the PROES measurements are found to accurately represent the ionization dynamics, i.e., the discharge operation mode. This work represents also a successful experimental validation of the discharge model developed for neon-oxygen CCPs. △ Less

Submitted 13 May, 2022; originally announced May 2022.

Comments: 34 pages, 16 figures

arXiv:2110.11848 [pdf, other]

Clustering Market Regimes using the Wasserstein Distance

Authors: Blanka Horvath, Zacharia Issa, Aitor Muguruza

Abstract: The problem of rapid and automated detection of distinct market regimes is a topic of great interest to financial mathematicians and practitioners alike. In this paper, we outline an unsupervised learning algorithm for clustering financial time-series into a suitable number of temporal segments (market regimes). As a special case of the above, we develop a robust algorithm that automates the proce… ▽ More The problem of rapid and automated detection of distinct market regimes is a topic of great interest to financial mathematicians and practitioners alike. In this paper, we outline an unsupervised learning algorithm for clustering financial time-series into a suitable number of temporal segments (market regimes). As a special case of the above, we develop a robust algorithm that automates the process of classifying market regimes. The method is robust in the sense that it does not depend on modelling assumptions of the underlying time series as our experiments with real datasets show. This method -- dubbed the Wasserstein $k$-means algorithm -- frames such a problem as one on the space of probability measures with finite $p^\text{th}$ moment, in terms of the $p$-Wasserstein distance between (empirical) distributions. We compare our WK-means approach with a more traditional clustering algorithms by studying the so-called maximum mean discrepancy scores between, and within clusters. In both cases it is shown that the WK-means algorithm vastly outperforms all considered competitor approaches. We demonstrate the performance of all approaches both in a controlled environment on synthetic data, and on real data. △ Less

Submitted 22 October, 2021; originally announced October 2021.

Comments: 37 pages, 40 figures

MSC Class: 91-08 (Primary); 91G60 (Secondary)

arXiv:2110.04072 [pdf, ps, other]

doi 10.1090/tran/8687

Approximately multiplicative maps between algebras of bounded operators on Banach spaces

Authors: Yemon Choi, Bence Horváth, Niels Jakob Laustsen

Abstract: We show that for any separable reflexive Banach space $X$ and a large class of Banach spaces $E$, including those with a subsymmetric shrinking basis but also all spaces $L_p$ for $1\leq p \leq \infty$, every bounded linear map ${\mathcal B}(E)\to {\mathcal B}(X)$ which is approximately multiplicative is necessarily close in the operator norm to some bounded homomorphism… ▽ More We show that for any separable reflexive Banach space $X$ and a large class of Banach spaces $E$, including those with a subsymmetric shrinking basis but also all spaces $L_p$ for $1\leq p \leq \infty$, every bounded linear map ${\mathcal B}(E)\to {\mathcal B}(X)$ which is approximately multiplicative is necessarily close in the operator norm to some bounded homomorphism ${\mathcal B}(E)\to {\mathcal B}(X)$. That is, the pair $({\mathcal B}(E), {\mathcal B}(X))$ has the AMNM property in the sense of Johnson (\textit{J.~London Math.\ Soc.} 1988). Previously this was only known for $E=X=\ell_p$ with $1<p<\infty$; even for those cases, we improve on the previous methods and obtain better constants in various estimates. A crucial role in our approach is played by a new result, motivated by cohomological techniques, which establishes AMNM properties relative to an amenable subalgebra; this generalizes a theorem of Johnson (\textit{op cit.}). △ Less

Submitted 5 March, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

Comments: v1: AMS-LaTeX, 30 pages. Submitted for publication. v2: incorporates revisions based on feedback from referee; minor updates/corrections to bibliography. Final accepted version, to appear in Trans. Amer. Math. Soc

MSC Class: Primary 39B82; 47L10; Secondary 46B03; 46M18; 47B49

Journal ref: Trans. Amer. Math. Soc. 375 (2022), no. 10, 7121-7147

arXiv:2105.04073 [pdf, other]

Hedging under rough volatility

Authors: Masaaki Fukasawa, Blanka Horvath, Peter Tankov

Abstract: In this chapter we first briefly review the existing approaches to hedging in rough volatility models. Next, we present a simple but general result which shows that in a one-factor rough stochastic volatility model, any option may be perfectly hedged with a dynamic portfolio containing the underlying and one other asset such as a variance swap. In the final section we report the results of a back-… ▽ More In this chapter we first briefly review the existing approaches to hedging in rough volatility models. Next, we present a simple but general result which shows that in a one-factor rough stochastic volatility model, any option may be perfectly hedged with a dynamic portfolio containing the underlying and one other asset such as a variance swap. In the final section we report the results of a back-test experiment using real data, where VIX options are hedged with a forward variance swap. In this experiment, using a rough volatility model allows to almost completely remove the bias and reduce the overall hedging error by a factor of 27% compared to traditional diffusion-based models. △ Less

Submitted 9 May, 2021; originally announced May 2021.

arXiv:2104.14989 [pdf, ps, other]

doi 10.1016/j.jfa.2022.109488

A purely infinite Cuntz-like Banach $*$-algebra with no purely infinite ultrapowers

Authors: Matthew Daws, Bence Horváth

Abstract: We continue our investigation, from \cite{dh}, of the ring-theoretic infiniteness properties of ultrapowers of Banach algebras, studying in this paper the notion of being purely infinite. It is well known that a $C^*$-algebra is purely infinite if and only if any of its ultrapowers are. We find examples of Banach algebras, as algebras of operators on Banach spaces, which do have purely infinite ul… ▽ More We continue our investigation, from \cite{dh}, of the ring-theoretic infiniteness properties of ultrapowers of Banach algebras, studying in this paper the notion of being purely infinite. It is well known that a $C^*$-algebra is purely infinite if and only if any of its ultrapowers are. We find examples of Banach algebras, as algebras of operators on Banach spaces, which do have purely infinite ultrapowers. Our main contribution is the construction of a "Cuntz-like" Banach $*$-algebra which is purely infinite, but whose ultrapowers are not even simple, and hence not purely infinite. This algebra is a naturally occurring analogue of the Cuntz algebra, and of the $L^p$-analogues introduced by Phillips. However, our proof of being purely infinite is combinatorial, but direct, and so differs from existing proofs. We show that there are non-zero traces on our algebra, which in particular implies that our algebra is not isomorphic to any of the $L^p$-analogues of the Cuntz algebra. △ Less

Submitted 25 March, 2022; v1 submitted 30 April, 2021; originally announced April 2021.

Comments: 30 pages. Sections 2 and 3 reorganised. Section 4 about traces and connection with Phillips' $\mathcal{O}_2^p$-algebras added. To appear in the Journal of Functional Analysis

MSC Class: 46M07; 46H10; 46H15 (primary); 43A20 (secondary)

Journal ref: Journal of Functional Analysis (2022)

arXiv:2103.09642 [pdf, other]

doi 10.1088/1361-6595/ac0b55

eduPIC: an introductory particle based code for radio-frequency plasma simulation

Authors: Zoltan Donko, Aranka Derzsi, Mate Vass, Benedek Horvath, Sebastian Wilczek, Botond Hartmann, Peter Hartmann

Abstract: For the self-consistent description of various plasma sources operated in the low-pressure (nonlocal, kinetic) regime, the Particle-In-Cell simulation approach, combined with the Monte Carlo treatment of collision processes (PIC/MCC), has become an important tool during the past decades. PIC/MCC simulation codes have been developed and maintained by many research groups, some of these codes are av… ▽ More For the self-consistent description of various plasma sources operated in the low-pressure (nonlocal, kinetic) regime, the Particle-In-Cell simulation approach, combined with the Monte Carlo treatment of collision processes (PIC/MCC), has become an important tool during the past decades. PIC/MCC simulation codes have been developed and maintained by many research groups, some of these codes are available to the community as freeware resources. While this computational approach has already been present for a number of decades, the rapid evolution of the computing infrastructure makes it increasingly more popular and accessible, as simulations of simple systems can be executed now on personal computers or laptops. During the past few years we have experienced an increasing interest in lectures and courses dealing with the basics of particle simulations, including the PIC/MCC technique. In a response to this, this paper (i) provides a tutorial on the physical basis and the algorithms of the PIC/MCC technique and (ii) presents a basic (spatially one-dimensional) electrostatic PIC/MCC simulation code for Capacitively Coupled Plasmas, whose source is made freely available in various programming languages. We share the code in C/C++ versions, as well as in a version written in Rust, which is a rapidly emerging computational language. Our code intends to be a "starting tool" for those who are interested in learning the details of the PIC/MCC technique and would like to develop the "skeleton" code further, for their research purposes. △ Less

Submitted 17 March, 2021; originally announced March 2021.

arXiv:2102.01962 [pdf, other]

Deep Hedging under Rough Volatility

Authors: Blanka Horvath, Josef Teichmann, Zan Zuric

Abstract: We investigate the performance of the Deep Hedging framework under training paths beyond the (finite dimensional) Markovian setup. In particular we analyse the hedging performance of the original architecture under rough volatility models with view to existing theoretical results for those. Furthermore, we suggest parsimonious but suitable network architectures capable of capturing the non-Markovi… ▽ More We investigate the performance of the Deep Hedging framework under training paths beyond the (finite dimensional) Markovian setup. In particular we analyse the hedging performance of the original architecture under rough volatility models with view to existing theoretical results for those. Furthermore, we suggest parsimonious but suitable network architectures capable of capturing the non-Markoviantity of time-series. Secondly, we analyse the hedging behaviour in these models in terms of P\&L distributions and draw comparisons to jump diffusion models if the the rebalancing frequency is realistically small. △ Less

Submitted 3 February, 2021; originally announced February 2021.

MSC Class: 91-08

arXiv:2101.09950 [pdf, ps, other]

doi 10.1090/proc/15589

Unital Banach algebras not isomorphic to Calkin algebras of separable Banach spaces

Authors: Bence Horváth, Tomasz Kania

Abstract: Recent developments in Banach space theory provided unexpected examples of unital Banach algebras that are isomorphic to Calkin algebras of Banach spaces, however no example of a unital Banach algebra that cannot be realised as a~Calkin algebra has been found so far. This naturally led to the question of possible limitations of such assignments. In the present note we provide examples of unital Ba… ▽ More Recent developments in Banach space theory provided unexpected examples of unital Banach algebras that are isomorphic to Calkin algebras of Banach spaces, however no example of a unital Banach algebra that cannot be realised as a~Calkin algebra has been found so far. This naturally led to the question of possible limitations of such assignments. In the present note we provide examples of unital Banach algebras meeting the necessary density condition for being the Calkin algebra of a separable Banach space that are not isomorphic to Calkin algebras of such spaces, nonetheless. The examples may be found of the form $C(X)$ for a compact space $X$, $\ell_1(G)$ for some torsion-free Abelian group, and a~simple, unital AF $C^*$-algebra. Extensions to higher densities are also presented. △ Less

Submitted 3 March, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

Comments: 7 pp, to appear in Proceedings of the American Mathematical Society

MSC Class: 47L10 (primary); 46B03 (secondary)

Journal ref: Proceedings of the American Mathematical Society 149 (11), (2021), pp 4781--4787

arXiv:2007.14112 [pdf, ps, other]

doi 10.1093/qmath/haaa066

Surjective homomorphisms from algebras of operators on long sequence spaces are automatically injective

Authors: Bence Horváth, Tomasz Kania

Abstract: We study automatic injectivity of surjective algebra homomorphisms from $\mathscr{B}(X)$, the algebra of (bounded, linear) operators on $X$, to $\mathscr{B}(Y)$, where $X$ is one of the following \emph{long} sequence spaces: $c_0(λ)$, $\ell_{\infty}^c(λ)$, and $\ell_p(λ)$ ($1 \leqslant p < \infty$) and $Y$ is arbitrary. \textit{En route} to the proof that these spaces do indeed enjoy such a proper… ▽ More We study automatic injectivity of surjective algebra homomorphisms from $\mathscr{B}(X)$, the algebra of (bounded, linear) operators on $X$, to $\mathscr{B}(Y)$, where $X$ is one of the following \emph{long} sequence spaces: $c_0(λ)$, $\ell_{\infty}^c(λ)$, and $\ell_p(λ)$ ($1 \leqslant p < \infty$) and $Y$ is arbitrary. \textit{En route} to the proof that these spaces do indeed enjoy such a property, we classify two-sided ideals of the algebra of operators of any of the aforementioned Banach spaces that are closed with respect to the `sequential strong operator topology'. △ Less

Submitted 30 November, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

Comments: 22 pp; to appear in Quart. J. Math. (Oxford)

MSC Class: Primary 46H10; 47L10; Secondary 46B03; 46B07; 46B10; 46B26; 47L20

Journal ref: The Quarterly Journal of Mathematics (Oxford) 72, (2021), pp 1167--1189

arXiv:2006.14498 [pdf, other]

A Data-driven Market Simulator for Small Data Environments

Authors: Hans Bühler, Blanka Horvath, Terry Lyons, Imanol Perez Arribas, Ben Wood

Abstract: Neural network based data-driven market simulation unveils a new and flexible way of modelling financial time series without imposing assumptions on the underlying stochastic dynamics. Though in this sense generative market simulation is model-free, the concrete modelling choices are nevertheless decisive for the features of the simulated paths. We give a brief overview of currently used generativ… ▽ More Neural network based data-driven market simulation unveils a new and flexible way of modelling financial time series without imposing assumptions on the underlying stochastic dynamics. Though in this sense generative market simulation is model-free, the concrete modelling choices are nevertheless decisive for the features of the simulated paths. We give a brief overview of currently used generative modelling approaches and performance evaluation metrics for financial time series, and address some of the challenges to achieve good results in the latter. We also contrast some classical approaches of market simulation with simulation based on generative modelling and highlight some advantages and pitfalls of the new approach. While most generative models tend to rely on large amounts of training data, we present here a generative model that works reliably in environments where the amount of available training data is notoriously small. Furthermore, we show how a rough paths perspective combined with a parsimonious Variational Autoencoder framework provides a powerful way for encoding and evaluating financial time series in such environments where available training data is scarce. Finally, we also propose a suitable performance evaluation metric for financial time series and discuss some connections of our Market Generator to deep hedging. △ Less

Submitted 21 June, 2020; originally announced June 2020.

Comments: 27 pages, 9 figures

arXiv:2003.04572 [pdf, ps, other]

doi 10.1090/proc/15666

Perturbations of surjective homomorphisms between algebras of operators on Banach spaces

Authors: Bence Horváth, Zsigmond Tarcsay

Abstract: A remarkable result of Molnár [Proc. Amer. Math. Soc., 126 (1998), 853-861] states that automorphisms of the algebra of operators acting on a separable Hilbert space is stable under "small" perturbations. More precisely, if $φ,ψ$ are endomorphisms of $\mathcal{B}(\mathcal{H})$ such that $\|φ(A)-ψ(A)\|<\|A\|$ and $ψ$ is surjective then so is $φ$. The aim of this paper is to extend this result to a… ▽ More A remarkable result of Molnár [Proc. Amer. Math. Soc., 126 (1998), 853-861] states that automorphisms of the algebra of operators acting on a separable Hilbert space is stable under "small" perturbations. More precisely, if $φ,ψ$ are endomorphisms of $\mathcal{B}(\mathcal{H})$ such that $\|φ(A)-ψ(A)\|<\|A\|$ and $ψ$ is surjective then so is $φ$. The aim of this paper is to extend this result to a larger class of Banach spaces including $\ell_p$ and $L_p$ spaces ($1<p<+\infty$). En route to the proof we show that for any Banach space $X$ from the above class all faithful, unital, separable, reflexive representations of $\mathcal B (X)$ which preserve rank one operators are in fact isomorphisms. △ Less

Submitted 24 May, 2021; v1 submitted 10 March, 2020; originally announced March 2020.

Comments: 14 pp. To appear in Proceedings of the American Mathematical Society

MSC Class: Primary 46H10; 47L10; Secondary 46B03; 46B07; 46B10; 47L20

Journal ref: Proceedings of the American Mathematical Society 150 (2), (2022), pp 747--761

arXiv:1912.07108 [pdf, ps, other]

doi 10.4153/S0008414X20000565

Ring-theoretic (in)finiteness in reduced products of Banach algebras

Authors: Matthew Daws, Bence Horváth

Abstract: We study ring-theoretic (in)finiteness properties -- such as \emph{Dedekind-finiteness} and \emph{proper infiniteness} -- of ultraproducts (and more generally, reduced products) of Banach algebras. Whilst we characterise when an ultraproduct has these ring-theoretic properties in terms of its underlying sequence of algebras, we find that, contrary to the $C^*$-algebraic setting, it is not true i… ▽ More We study ring-theoretic (in)finiteness properties -- such as \emph{Dedekind-finiteness} and \emph{proper infiniteness} -- of ultraproducts (and more generally, reduced products) of Banach algebras. Whilst we characterise when an ultraproduct has these ring-theoretic properties in terms of its underlying sequence of algebras, we find that, contrary to the $C^*$-algebraic setting, it is not true in general that an ultraproduct has a ring-theoretic finiteness property if and only if "ultrafilter many" of the underlying sequence of algebras have the same property. This might appear to violate the continuous model theoretic counterpart of Łoś's Theorem; the reason it does not is that for a general Banach algebra, the ring theoretic properties we consider cannot be verified by considering a bounded subset of the algebra of \emph{fixed} bound. For Banach algebras, we construct counter-examples to show, for example, that each component Banach algebra can fail to be Dedekind-finite while the ultraproduct is Dedekind-finite, and we explain why such a counter-example is not possible for $C^*$-algebras. Finally the related notion of having \textit{stable rank one} is also studied for ultraproducts. △ Less

Submitted 23 June, 2020; v1 submitted 15 December, 2019; originally announced December 2019.

Comments: 35 pages. Abstract and Introduction rewritten, applications of \{L}oś's Theorem added. To appear in the Canadian Journal of Mathematics

MSC Class: 46M07; 46H99 (primary); 16B99; 43A20 (secondary)

Journal ref: Canadian Journal of Mathematics 73 (5), (2021), pp 1423--1458

arXiv:1908.08806 [pdf, other]

On deep calibration of (rough) stochastic volatility models

Authors: Christian Bayer, Blanka Horvath, Aitor Muguruza, Benjamin Stemper, Mehdi Tomas

Abstract: Techniques from deep learning play a more and more important role for the important task of calibration of financial models. The pioneering paper by Hernandez [Risk, 2017] was a catalyst for resurfacing interest in research in this area. In this paper we advocate an alternative (two-step) approach using deep learning techniques solely to learn the pricing map -- from model parameters to prices or… ▽ More Techniques from deep learning play a more and more important role for the important task of calibration of financial models. The pioneering paper by Hernandez [Risk, 2017] was a catalyst for resurfacing interest in research in this area. In this paper we advocate an alternative (two-step) approach using deep learning techniques solely to learn the pricing map -- from model parameters to prices or implied volatilities -- rather than directly the calibrated model parameters as a function of observed market data. Having a fast and accurate neural-network-based approximating pricing map (first step), we can then (second step) use traditional model calibration algorithms. In this work we showcase a direct comparison of different potential approaches to the learning stage and present algorithms that provide a suffcient accuracy for practical use. We provide a first neural network-based calibration method for rough volatility models for which calibration can be done on the y. We demonstrate the method via a hands-on calibration engine on the rough Bergomi model, for which classical calibration techniques are diffcult to apply due to the high cost of all known numerical pricing methods. Furthermore, we display and compare different types of sampling and training methods and elaborate on their advantages under different objectives. As a further application we use the fast pricing method for a Bayesian analysis of the calibrated model. △ Less

Submitted 22 August, 2019; originally announced August 2019.

Comments: arXiv admin note: text overlap with arXiv:1901.09647

MSC Class: 60G15; 60G22; 91G20; 91G60; 91B25

arXiv:1901.09647 [pdf, other]

Deep Learning Volatility

Authors: Blanka Horvath, Aitor Muguruza, Mehdi Tomas

Abstract: We present a neural network based calibration method that performs the calibration task within a few milliseconds for the full implied volatility surface. The framework is consistently applicable throughout a range of volatility models -including the rough volatility family- and a range of derivative contracts. The aim of neural networks in this work is an off-line approximation of complex pricing… ▽ More We present a neural network based calibration method that performs the calibration task within a few milliseconds for the full implied volatility surface. The framework is consistently applicable throughout a range of volatility models -including the rough volatility family- and a range of derivative contracts. The aim of neural networks in this work is an off-line approximation of complex pricing functions, which are difficult to represent or time-consuming to evaluate by other means. We highlight how this perspective opens new horizons for quantitative modelling: The calibration bottleneck posed by a slow pricing of derivative contracts is lifted. This brings several numerical pricers and model families (such as rough volatility models) within the scope of applicability in industry practice. The form in which information from available data is extracted and stored influences network performance: This approach is inspired by representing the implied volatility and option prices as a collection of pixels. In a number of applications we demonstrate the prowess of this modelling approach regarding accuracy, speed, robustness and generality and also its potentials towards model recognition. △ Less

Submitted 22 August, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

arXiv:1811.06865 [pdf, ps, other]

doi 10.4064/sm181116-30-5

When are full representations of algebras of operators on Banach spaces automatically faithful?

Authors: Bence Horváth

Abstract: We examine the phenomenon when surjective algebra homomorphisms between algebras of operators on Banach spaces are automatically injective. In the first part of the paper we shall show that for certain Banach spaces $X$ the following property holds: For every non-zero Banach space $Y$ every surjective algebra homomorphism $ψ: \, \mathcal{B}(X) \rightarrow \mathcal{B}(Y)$ is automatically injective… ▽ More We examine the phenomenon when surjective algebra homomorphisms between algebras of operators on Banach spaces are automatically injective. In the first part of the paper we shall show that for certain Banach spaces $X$ the following property holds: For every non-zero Banach space $Y$ every surjective algebra homomorphism $ψ: \, \mathcal{B}(X) \rightarrow \mathcal{B}(Y)$ is automatically injective. In the second part of the paper we consider the question in the opposite direction: Building on the work of Kania, Koszmider and Laustsen \textit{(Trans. London Math. Soc., 2014)} we show that for every separable, reflexive Banach space $X$ there is a Banach space $Y_X$ and a surjective but not injective algebra homomorphism $ψ: \, \mathcal{B}(Y_X) \rightarrow \mathcal{B}(X)$. △ Less

Submitted 30 May, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

Comments: 28 pp. The SHAI property of both real- and complex Hilbert spaces discussed. To appear in Studia Mathematica

MSC Class: Primary 46H10; 47L10; Secondary 46B03; 46B07; 46B10; 46B26; 47L20

Journal ref: Studia Mathematica 253 (3) (2020), pp 259--282

arXiv:1807.10578 [pdf, other]

doi 10.1515/9783110602418

A Banach space whose algebra of operators is Dedekind-finite but it does not have stable rank one

Authors: Bence Horváth

Abstract: In this note we examine the connection between the stable rank one and Dedekind-finite property of the algebra of operators on a Banach space $X$. We show that for the indecomposable but not hereditarily indecomposable Banach space $X_{\infty}$ constructed by Tarbard (Ph.D. Thesis, University of Oxford, 2013), the algebra of operators $B(X_{\infty})$ is Dedekind-finite but does not have stable ran… ▽ More In this note we examine the connection between the stable rank one and Dedekind-finite property of the algebra of operators on a Banach space $X$. We show that for the indecomposable but not hereditarily indecomposable Banach space $X_{\infty}$ constructed by Tarbard (Ph.D. Thesis, University of Oxford, 2013), the algebra of operators $B(X_{\infty})$ is Dedekind-finite but does not have stable rank one. While this sheds some light on the Banach space structure of $X_{\infty}$ itself, we observe that the indecomposable but not hereditarily indecomposable Banach space constructed by Gowers and Maurey (Math. Ann., 1997) does not possess this property. △ Less

Submitted 14 January, 2019; v1 submitted 27 July, 2018; originally announced July 2018.

Comments: Version 2. References added, Koszmider space example added. To appear in Proc. 24th International Conference on Banach algebras and Applications. 9 pp

MSC Class: 47L10; 46H10; 46B07; 16D25

Journal ref: Banach Algebras and Applications (ed. Mahmoud Filali), De Gruyter Proceedings in Mathematics (2020), pp 165--176

arXiv:1802.01641 [pdf, other]

Volatility options in rough volatility models

Authors: Blanka Horvath, Antoine Jacquier, Peter Tankov

Abstract: We discuss the pricing and hedging of volatility options in some rough volatility models. First, we develop efficient Monte Carlo methods and asymptotic approximations for computing option prices and hedge ratios in models where log-volatility follows a Gaussian Volterra process. While providing a good fit for European options, these models are unable to reproduce the VIX option smile observed in… ▽ More We discuss the pricing and hedging of volatility options in some rough volatility models. First, we develop efficient Monte Carlo methods and asymptotic approximations for computing option prices and hedge ratios in models where log-volatility follows a Gaussian Volterra process. While providing a good fit for European options, these models are unable to reproduce the VIX option smile observed in the market, and are thus not suitable for VIX products. To accommodate these, we introduce the class of modulated Volterra processes, and show that they successfully capture the VIX smile. △ Less

Submitted 30 January, 2019; v1 submitted 5 February, 2018; originally announced February 2018.

Comments: 52 pages, 33 figures

MSC Class: 60G15; 60G22; 91G20; 91G60; 91B25

arXiv:1801.02719 [pdf, other]

Dirichlet Forms and Finite Element Methods for the SABR Model

Authors: Blanka Horvath, Oleg Reichmann

Abstract: We propose a deterministic numerical method for pricing vanilla options under the SABR stochastic volatility model, based on a finite element discretization of the Kolmogorov pricing equations via non-symmetric Dirichlet forms. Our pricing method is valid under mild assumptions on parameter configurations of the process both in moderate interest rate environments and in near-zero interest rate reg… ▽ More We propose a deterministic numerical method for pricing vanilla options under the SABR stochastic volatility model, based on a finite element discretization of the Kolmogorov pricing equations via non-symmetric Dirichlet forms. Our pricing method is valid under mild assumptions on parameter configurations of the process both in moderate interest rate environments and in near-zero interest rate regimes such as the currently prevalent ones. The parabolic Kolmogorov pricing equations for the SABR model are degenerate at the origin, yielding non-standard partial differential equations, for which conventional pricing methods ---designed for non-degenerate parabolic equations--- potentially break down. We derive here the appropriate analytic setup to handle the degeneracy of the model at the origin. That is, we construct an evolution triple of suitably chosen Sobolev spaces with singular weights, consisting of the domain of the SABR-Dirichlet form, its dual space, and the pivotal Hilbert space. In particular, we show well-posedness of the variational formulation of the SABR-pricing equations for vanilla and barrier options on this triple. Furthermore, we present a finite element discretization scheme based on a (weighted) multiresolution wavelet approximation in space and a $θ$-scheme in time and provide an error analysis for this discretization. △ Less

Submitted 8 January, 2018; originally announced January 2018.

arXiv:1711.03078 [pdf, ps, other]

Functional central limit theorems for rough volatility

Authors: Blanka Horvath, Antoine Jacquier, Aitor Muguruza, Andreas Sojmark

Abstract: The non-Markovian nature of rough volatility processes makes Monte Carlo methods challenging and it is in fact a major challenge to develop fast and accurate simulation algorithms. We provide an efficient one for stochastic Volterra processes, based on an extension of Donsker's approximation of Brownian motion to the fractional Brownian case with arbitrary Hurst exponent $H \in (0,1)$. Some of the… ▽ More The non-Markovian nature of rough volatility processes makes Monte Carlo methods challenging and it is in fact a major challenge to develop fast and accurate simulation algorithms. We provide an efficient one for stochastic Volterra processes, based on an extension of Donsker's approximation of Brownian motion to the fractional Brownian case with arbitrary Hurst exponent $H \in (0,1)$. Some of the most relevant consequences of this `rough Donsker (rDonsker) Theorem' are functional weak convergence results in Skorokhod space for discrete approximations of a large class of rough stochastic volatility models. This justifies the validity of simple and easy-to-implement Monte-Carlo methods, for which we provide detailed numerical recipes. We test these against the current benchmark Hybrid scheme~\cite{BLP17} and find remarkable agreement (for a large range of values of~$H$). This rDonsker Theorem further provides a weak convergence proof for the Hybrid scheme itself, and allows to construct binomial trees for rough volatility models, the first available scheme (in the rough volatility context) for early exercise options such as American or Bermudan options. △ Less

Submitted 11 November, 2023; v1 submitted 8 November, 2017; originally announced November 2017.

Comments: 40 pages, 18 figures

MSC Class: 60F17; 60F05; 60G15; 60G22; 91G20; 91G60; 91B25

arXiv:1708.01121 [pdf, ps, other]

Asymptotic behaviour of randomised fractional volatility models

Authors: B. Horvath, A. Jacquier, C. Lacombe

Abstract: We study the asymptotic behaviour of a class of small-noise diffusions driven by fractional Brownian motion, with random starting points. Different scalings allow for different asymptotic properties of the process (small-time and tail behaviours in particular). In order to do so, we extend some results on sample path large deviations for such diffusions. As an application, we show how these result… ▽ More We study the asymptotic behaviour of a class of small-noise diffusions driven by fractional Brownian motion, with random starting points. Different scalings allow for different asymptotic properties of the process (small-time and tail behaviours in particular). In order to do so, we extend some results on sample path large deviations for such diffusions. As an application, we show how these results characterise the small-time and tail estimates of the implied volatility for rough volatility models, recently proposed in mathematical finance. △ Less

Submitted 20 December, 2018; v1 submitted 3 August, 2017; originally announced August 2017.

Comments: 23 pages, main assumptions relaxed, some typos corrected

MSC Class: 41A60; 60F10; 60G15; 60G22

arXiv:1703.05132 [pdf, other]

Short-time near-the-money skew in rough fractional volatility models

Authors: Christian Bayer, Peter K. Friz, Archil Gulisashvili, Blanka Horvath, Benjamin Stemper

Abstract: We consider rough stochastic volatility models where the driving noise of volatility has fractional scaling, in the "rough" regime of Hurst parameter $H < 1/2$. This regime recently attracted a lot of attention both from the statistical and option pricing point of view. With focus on the latter, we sharpen the large deviation results of Forde-Zhang (2017) in a way that allows us to zoom-in around… ▽ More We consider rough stochastic volatility models where the driving noise of volatility has fractional scaling, in the "rough" regime of Hurst parameter $H < 1/2$. This regime recently attracted a lot of attention both from the statistical and option pricing point of view. With focus on the latter, we sharpen the large deviation results of Forde-Zhang (2017) in a way that allows us to zoom-in around the money while maintaining full analytical tractability. More precisely, this amounts to proving higher order moderate deviation estimates, only recently introduced in the option pricing context. This in turn allows us to push the applicability range of known at-the-money skew approximation formulae from CLT type log-moneyness deviations of order $t^{1/2}$ (recent works of Alòs, León & Vives and Fukasawa) to the wider moderate deviations regime. △ Less

Submitted 9 March, 2018; v1 submitted 15 March, 2017; originally announced March 2017.

MSC Class: 91G20; 60H30; 60F10; 60H07; 60G22; 60G18

arXiv:1701.02015 [pdf, other]

Functional Analytic (Ir-)Regularity Properties of SABR-type Processes

Authors: Leif Doering, Blanka Horvath, Josef Teichmann

Abstract: The SABR model is a benchmark stochastic volatility model in interest rate markets, which has received much attention in the past decade. Its popularity arose from a tractable asymptotic expansion for implied volatility, derived by heat kernel methods. As markets moved to historically low rates, this expansion appeared to yield inconsistent prices. Since the model is deeply embedded in market prac… ▽ More The SABR model is a benchmark stochastic volatility model in interest rate markets, which has received much attention in the past decade. Its popularity arose from a tractable asymptotic expansion for implied volatility, derived by heat kernel methods. As markets moved to historically low rates, this expansion appeared to yield inconsistent prices. Since the model is deeply embedded in market practice, alternative pricing methods for SABR have been addressed in numerous approaches in recent years. All standard option pricing methods make certain regularity assumptions on the underlying model, but for SABR these are rarely satisfied. We examine here regularity properties of the model from this perspective with view to a number of (asymptotic and numerical) option pricing methods. In particular, we highlight delicate degeneracies of the SABR model (and related processes) at the origin, which deem the currently used popular heat kernel methods and all related methods from (sub-) Riemannian geometry ill-suited for SABR-type processes, when interest rates are near zero. We describe a more general semigroup framework, which permits to derive a suitable geometry for SABR-type processes (in certain parameter regimes) via symmetric Dirichlet forms. Furthermore, we derive regularity properties (Feller- properties and strong continuity properties) necessary for the applicability of popular numerical schemes to SABR-semigroups, and identify suitable Banach- and Hilbert spaces for these. Finally, we comment on the short time and large time asymptotic behaviour of SABR-type processes beyond the heat-kernel framework. △ Less

Submitted 8 January, 2017; originally announced January 2017.

MSC Class: 60H30; 58J65; 60J55

arXiv:1610.05636 [pdf, ps, other]

On the probability of hitting the boundary for Brownian motions on the SABR plane

Authors: Archil Gulisashvili, Blanka Horvath, Antoine Jacquier

Abstract: Starting from the hyperbolic Brownian motion as a time-changed Brownian motion, we explore a set of probabilistic models--related to the SABR model in mathematical finance--which can be obtained by geometry-preserving transformations, and show how to translate the properties of the hyperbolic Brownian motion (density, probability mass, drift) to each particular model. Our main result is an explici… ▽ More Starting from the hyperbolic Brownian motion as a time-changed Brownian motion, we explore a set of probabilistic models--related to the SABR model in mathematical finance--which can be obtained by geometry-preserving transformations, and show how to translate the properties of the hyperbolic Brownian motion (density, probability mass, drift) to each particular model. Our main result is an explicit expression for the probability of any of these models hitting the boundary of their domains, the proof of which relies on the properties of the aforementioned transformations as well as time-change methods. △ Less

Submitted 18 October, 2016; originally announced October 2016.

Comments: 11 pages. arXiv admin note: substantial text overlap with arXiv:1502.03254

MSC Class: 58J65; 60J60

arXiv:1502.03254 [pdf, ps, other]

Mass at zero in the uncorrelated SABR model and implied volatility asymptotics

Authors: Archil Gulisashvili, Blanka Horvath, Antoine Jacquier

Abstract: We study the mass at the origin in the uncorrelated SABR stochastic volatility model, and derive several tractable expressions, in particular when time becomes small or large. As an application--in fact the original motivation for this paper--we derive small-strike expansions for the implied volatility when the maturity becomes short or large. These formulae, by definition arbitrage free, allow us… ▽ More We study the mass at the origin in the uncorrelated SABR stochastic volatility model, and derive several tractable expressions, in particular when time becomes small or large. As an application--in fact the original motivation for this paper--we derive small-strike expansions for the implied volatility when the maturity becomes short or large. These formulae, by definition arbitrage free, allow us to quantify the impact of the mass at zero on existing implied volatility approximations, and in particular how correct/erroneous these approximations become. △ Less

Submitted 22 November, 2016; v1 submitted 11 February, 2015; originally announced February 2015.

Comments: 15 pages, 2 tables, 8 figures This updated version concentrates on the small- and large-time asymptotic behaviour of the mass at zero in the uncorrelated SABR model. Some geometric considerations regarding the correlated case are provided in a companion paper arXiv:1610.05636

MSC Class: 58J37; 60H30; 58J65

arXiv:1412.8645 [pdf, ps, other]

doi 10.1103/PhysRevE.92.042308

Dynamic dielectric response of electrorheological fluids in drag flow

Authors: B. Horváth, I. Szalai

Abstract: We have determined the response time of dilute electrorheological fluids (ER) in drag flow from the dynamic dielectric response. On the basis of a kinetic rate equation a new formula was derived to approximate the experimental time-dependent dielectric permittivity during the temporal evolution of the microstructure. The dielectric response time was compared to the standard rheological response ti… ▽ More We have determined the response time of dilute electrorheological fluids (ER) in drag flow from the dynamic dielectric response. On the basis of a kinetic rate equation a new formula was derived to approximate the experimental time-dependent dielectric permittivity during the temporal evolution of the microstructure. The dielectric response time was compared to the standard rheological response time extracted from the time-dependent shear stress, and a good agreement was obtained. We found that the dielectric method is more sensitive to detect any transient during the chain formation process. The experimental saturation value of the dielectric permittivity corresponding to the equilibrium microstructure was estimated on the basis of formulas derived from the Clausius-Mossotti equation. △ Less

Submitted 17 December, 2015; v1 submitted 30 December, 2014; originally announced December 2014.

Comments: 8 pages, 8 figures

Journal ref: Phys. Rev. E 92 (2015) 042308

arXiv:1012.5326 [pdf, ps, other]

doi 10.1103/PhysRevB.84.205117

Fluctuation-exchange approximation theory of the non-equilibrium singlet-triplet transition

Authors: Bertalan Horváth, Bence Lazarovits, Gergely Zaránd

Abstract: As a continuation of a previous work [B. Horváth et al., Phys. Rev. B {\bf 82}, 165129 (2010)], here we extend the so-called Fluctuation Exchange Approximation (FLEX) to study the non-equilibrium singlet-triplet transition. We show that, while being relatively fast and a conserving approximation, FLEX is able to recover all important features of the transition, including the evolution of the linea… ▽ More As a continuation of a previous work [B. Horváth et al., Phys. Rev. B {\bf 82}, 165129 (2010)], here we extend the so-called Fluctuation Exchange Approximation (FLEX) to study the non-equilibrium singlet-triplet transition. We show that, while being relatively fast and a conserving approximation, FLEX is able to recover all important features of the transition, including the evolution of the linear conductance throughout the transition, the two-stage Kondo effect on the triplet side, and the gradual opening of the singlet-triplet gap on the triplet side of the transition. A comparison with numerical renormalization group calculations also shows that FLEX captures rather well the width of the Kondo resonance. FLEX thus offers a viable route to describe correlated multi-level systems under non-equilibrium conditions, and, in its rather general form, as formulated here, it could find a broad application in molecular electronics calculations. △ Less

Submitted 28 November, 2011; v1 submitted 23 December, 2010; originally announced December 2010.

Comments: 11 pages, 16 figures, new subsections added

Journal ref: Phys. Rev. B 84, 205117 (2011)

arXiv:1006.4287 [pdf, ps, other]

FLEX-description of the spectral functions near singlet-triplet transition

Authors: Bertalan Horváth

Abstract: In a previous article, we have investigated the non-equilibrium two-level Anderson model with a simple iterative perturbation theory. Here we use here a more sophisticated perturbative method, the fluctuation-exchange (FLEX) approximation. The great advantage of FLEX is its \textit{conserving} nature, and that it can describe well the Kondo energy scale, the Kondo-temperature, $T_{K}$. As it was e… ▽ More In a previous article, we have investigated the non-equilibrium two-level Anderson model with a simple iterative perturbation theory. Here we use here a more sophisticated perturbative method, the fluctuation-exchange (FLEX) approximation. The great advantage of FLEX is its \textit{conserving} nature, and that it can describe well the Kondo energy scale, the Kondo-temperature, $T_{K}$. As it was expected from the results obtained with iterative perturbation theory, the FLEX description can give back also the relevant features of the spectral properties. △ Less

Submitted 22 June, 2010; originally announced June 2010.

arXiv:1006.4286 [pdf, ps, other]

doi 10.1103/PhysRevB.82.165129

Non-equilibrium transport theory of the singlet-triplet transition: perturbative approach

Authors: Bertalan Horváth, Bence Lazarovits, Gergely Zaránd

Abstract: We use a simple iterative perturbation theory to study the singlet-triplet (ST) transition in lateral and vertical quantum dots, modeled by the non-equilibrium two-level Anderson model. To a great surprise, the region of stable perturbation theory extends to relatively strong interactions, and this simple approach is able to reproduce all experimentally-observed features of the ST transition, incl… ▽ More We use a simple iterative perturbation theory to study the singlet-triplet (ST) transition in lateral and vertical quantum dots, modeled by the non-equilibrium two-level Anderson model. To a great surprise, the region of stable perturbation theory extends to relatively strong interactions, and this simple approach is able to reproduce all experimentally-observed features of the ST transition, including the formation of a dip in the differential conductance of a lateral dot indicative of the two-stage Kondo effect, or the maximum in the linear conductance around the transition point. Choosing the right starting point to the perturbation theory is, however, crucial to obtain reliable and meaningful results. △ Less

Submitted 5 July, 2010; v1 submitted 22 June, 2010; originally announced June 2010.

Journal ref: Phys. Rev. B 82, 165129 (2010)

arXiv:0909.0441 [pdf, ps, other]

doi 10.1088/1742-6596/200/1/012063

Perturbative theory of the non-equilibrium singlet-triplet transition

Authors: B. Horvath, B. Lazarovits, G. Zarand

Abstract: We study equilibrium and non-equilibrium properties of a two-level quantum dot close to the singlet-triplet transition. We treat the on-site Coulomb interaction and Hund's rule coupling perturbatively within the Keldysh formalism. We compute the spectral functions and the differential conductance of the dot. For moderate interactions our perturbative approach captures the Kondo effect and many o… ▽ More We study equilibrium and non-equilibrium properties of a two-level quantum dot close to the singlet-triplet transition. We treat the on-site Coulomb interaction and Hund's rule coupling perturbatively within the Keldysh formalism. We compute the spectral functions and the differential conductance of the dot. For moderate interactions our perturbative approach captures the Kondo effect and many of the experimentally observed properties. △ Less

Submitted 2 September, 2009; originally announced September 2009.

Comments: Contribution to the proceedings of ICM2009 (International Conference on Magnetism)

Journal ref: Journal of Physics: Conference Series 200 (2010) 012063

arXiv:0712.0296 [pdf, ps, other]

doi 10.1103/PhysRevB.77.113108

Failure of mean-field approach in out-of-equilibrium Anderson model

Authors: Bertalan Horváth, Bence Lazarovits, Olivier Sauret, Gergely Zaránd

Abstract: To explore the limitations of the mean field approximation, frequently used in \textit{ab initio} molecular electronics calculations, we study an out-of-equilibrium Anderson impurity model in a scattering formalism. We find regions in the parameter space where both magnetic and non-magnetic solutions are stable. We also observe a hysteresis in the non-equilibrium magnetization and current as a f… ▽ More To explore the limitations of the mean field approximation, frequently used in \textit{ab initio} molecular electronics calculations, we study an out-of-equilibrium Anderson impurity model in a scattering formalism. We find regions in the parameter space where both magnetic and non-magnetic solutions are stable. We also observe a hysteresis in the non-equilibrium magnetization and current as a function of the applied bias voltage. The mean field method also predicts incorrectly local moment formation for large biases and a spin polarized current, and unphysical kinks appear in various physical quantities. The mean field approximation thus fails in every region where it predicts local moment formation. △ Less

Submitted 3 December, 2007; originally announced December 2007.

Comments: 5 pages, 5 figures

Journal ref: Phys. Rev. B 77, 113108 (2008)

Showing 1–44 of 44 results for author: Horvath, B