-
Adapted optimal transport between Gaussian processes in discrete time
Authors:
Madhu Gunasingam,
Ting-Kam Leonard Wong
Abstract:
We derive explicitly the adapted $2$-Wasserstein distance between non-degenerate Gaussian distributions on $\mathbb{R}^N$ and characterize the optimal bicausal coupling(s). This leads to an adapted version of the Bures-Wasserstein distance on the space of positive definite matrices.
Submitted 30 April, 2024; v1 submitted 9 April, 2024;
originally announced April 2024.
-
JKO schemes with general transport costs
Authors:
Cale Rankin,
Ting-Kam Leonard Wong
Abstract:
We modify the JKO scheme, which is a time discretization of Wasserstein gradient flows, by replacing the Wasserstein distance with more general transport costs on manifolds. We show that when the cost function has a mixed Hessian which defines a Riemannian metric, our modified JKO scheme converges under suitable conditions to the corresponding Riemannian Fokker--Planck equation. Thus on a Riemannian manifold one may replace the (squared) Riemannian distance with any cost function which induces the metric. Of interest is when the Riemannian distance is computationally intractable, but a suitable cost has a simple analytic expression. We consider the Fokker--Planck equation on compact submanifolds with the Neumann boundary condition and on complete Riemannian manifolds with a finite drift condition. As an application we consider Hessian manifolds, taking as a cost the Bregman divergence.
Submitted 27 February, 2024;
originally announced February 2024.
-
The Asteroseismological Richness of RCB and dLHdC Stars
Authors:
Tin Long Sunny Wong,
Lars Bildsten
Abstract:
RCB stars are $L\approx10^4\,L_{\odot}$ solar-mass objects that can exhibit large periods of extinction from dust ejection episodes. Many exhibit semiregular pulsations in the range of $30-50$ days with semi-amplitudes of $0.05-0.3$ magnitude. Space-based photometry has discovered that solar-like oscillations are ubiquitous in hydrogen-dominated stars that have substantial outer convective envelopes, so we explore the hypothesis that the pulsations in RCB stars and the closely related dustless hydrogen-deficient carbon (dLHdC) stars, which have large convective outer envelopes of nearly pure helium, have a similar origin. Through stellar modeling and pulsation calculations, we find that the observed periods and amplitudes of these pulsations follow the well-measured phenomenology of their H-rich brethren. In particular, we show that the observed modes are likely of angular orders $l=0,1$ and $2$ and predominantly of an acoustic nature (i.e. $p$-modes with low radial order). The modes with largest amplitude are near the acoustic cut-off frequency appropriately rescaled to the helium-dominated envelope, and the observed amplitudes are consistent with those seen in high luminosity ($L>10^3\,L_{\odot}$) H-rich giants. We also find that for $T_{\mathrm{eff}}\gtrsim5400\,\mathrm{K}$, an HdC stellar model exhibits a radiative layer between two outer convective zones, creating a $g$-mode cavity that supports much longer period ($\approx 100$ days) oscillations. Our initial work was focused primarily on the adiabatic modes, but we expect that subsequent space-based observations of these targets (e.g. with TESS or PLATO) are likely to lead to a larger set of detected frequencies that would allow for a deeper study of the interiors of these rare stars.
Submitted 16 November, 2023;
originally announced November 2023.
-
Information Geometry for the Working Information Theorist
Authors:
Kumar Vijay Mishra,
M. Ashok Kumar,
Ting-Kam Leonard Wong
Abstract:
Information geometry is a study of statistical manifolds, that is, spaces of probability distributions from a geometric perspective. Its classical information-theoretic applications relate to statistical concepts such as Fisher information, sufficient statistics, and efficient estimators. Today, information geometry has emerged as an interdisciplinary field that finds applications in diverse areas such as radar sensing, array signal processing, quantum physics, deep learning, and optimal transport. This article presents an overview of essential information geometry to initiate an information theorist, who may be unfamiliar with this exciting area of research. We explain the concepts of divergences on statistical manifolds, generalized notions of distances, orthogonality, and geodesics, thereby paving the way for concrete applications and novel theoretical investigations. We also highlight some recent information-geometric developments, which are of interest to the broader information theory community.
Submitted 5 October, 2023;
originally announced October 2023.
-
Dynamical He Flashes in Double White Dwarf Binaries
Authors:
Tin Long Sunny Wong,
Lars Bildsten
Abstract:
The detonation of an overlying helium layer on a $0.8-1.1\,\mathrm{M}_{\odot}$ carbon-oxygen (CO) white dwarf (WD) can detonate the CO WD and create a thermonuclear supernova (SN). Many authors have recently shown that when the mass of the He layer is low ($\lesssim 0.03\,\mathrm{M}_{\odot}$), the ashes from its detonation minimally impact the spectra and light-curve from the CO detonation, allowing the explosion to appear remarkably similar to Type Ia SNe. These new insights motivate our investigation of dynamical He shell burning, and our search for a binary scenario that stably accumulates thermally unstable He shells in the $0.01-0.08\,\mathrm{M}_{\odot}$ range, thick enough to detonate, but also often thin enough for minimal impact on the observables. We first show that our improved non-adiabatic evolution of convective He shell burning in this shell mass range leads to conditions ripe for a He detonation. We also find that a stable mass-transfer scenario with a high entropy He WD donor of mass $0.15-0.25\,\mathrm{M}_\odot$ yields the He shell masses needed to achieve the double detonations. This scenario also predicts that the surviving He donor leaves with a space velocity consistent with the unusual runaway object, D6-2. We find that hot He WD donors originate in common envelope events when a $1.3-2.0\,\mathrm{M}_\odot$ star fills its Roche lobe at the base of the red giant branch at orbital periods of $1-10$ days with the CO WD.
Submitted 9 May, 2023;
originally announced May 2023.
-
Orbital decay in an accreting and eclipsing 13.7 minute orbital period binary with a luminous donor
Authors:
Kevin B. Burdge,
Kareem El-Badry,
Saul Rappaport,
Tin Long Sunny Wong,
Evan B. Bauer,
Lars Bildsten,
Ilaria Caiazzo,
Deepto Chakrabarty,
Emma Chickles,
Matthew J. Graham,
Erin Kara,
S. R. Kulkarni,
Thomas R. Marsh,
Melania Nynka,
Thomas A. Prince,
Robert A. Simcoe,
Jan van Roestel,
Zach Vanderbosch,
Eric C. Bellm,
Richard G. Dekany,
Andrew J. Drake,
George Helou,
Frank J. Masci,
Jennifer Milburn,
Reed Riddle
, et al. (2 additional authors not shown)
Abstract:
We report the discovery of ZTF J0127+5258, a compact mass-transferring binary with an orbital period of 13.7 minutes. The system contains a white dwarf accretor, which likely originated as a post-common envelope carbon-oxygen (CO) white dwarf, and a warm donor ($T_{\rm eff,\,donor}= 16,400\pm1000\,\rm K$). The donor probably formed during a common envelope phase between the CO white dwarf and an evolving giant which left behind a helium star or helium white dwarf in a close orbit with the CO white dwarf. We measure gravitational wave-driven orbital inspiral with $\sim 35σ$ significance, which yields a joint constraint on the component masses and mass transfer rate. While the accretion disk in the system is dominated by ionized helium emission, the donor exhibits a mixture of hydrogen and helium absorption lines. Phase-resolved spectroscopy yields a donor radial-velocity semi-amplitude of $771\pm27\,\rm km\, s^{-1}$, and high-speed photometry reveals that the system is eclipsing. We detect a Chandra X-ray counterpart with $L_{X}\sim 3\times 10^{31}\,\rm erg\,s^{-1}$. Depending on the mass-transfer rate, the system will likely evolve into either a stably mass-transferring helium CV, merge to become an R CrB star, or explode as a Type Ia supernova in the next million years. We predict that the Laser Interferometer Space Antenna (LISA) will detect the source with a signal-to-noise ratio of $24\pm6$ after 4 years of observations. The system is the first LISA-loud mass-transferring binary with an intrinsically luminous donor, a class of sources that provide the opportunity to leverage the synergy between optical and infrared time domain surveys, X-ray facilities, and gravitational-wave observatories to probe general relativity, accretion physics, and binary evolution.
Submitted 23 March, 2023;
originally announced March 2023.
-
Bregman-Wasserstein divergence: geometry and applications
Authors:
Cale Rankin,
Ting-Kam Leonard Wong
Abstract:
Consider the Monge-Kantorovich optimal transport problem where the cost function is given by a Bregman divergence. The associated transport cost, which we call the Bregman-Wasserstein divergence, presents a natural asymmetric extension of the squared $2$-Wasserstein metric and has recently found applications in statistics and machine learning. On the other hand, Bregman divergence is a fundamental object in information geometry and induces a dually flat geometry on the underlying manifold. Using the Bregman-Wasserstein divergence, we lift this dualistic geometry to the space of probability measures, thus extending Otto's weak Riemannian structure of the Wasserstein space to statistical manifolds. We do so by generalizing Lott's formal geometric computations on the Wasserstein space. In particular, we define primal and dual connections on the space of probability measures and show that they are conjugate with respect to Otto's metric. We also define primal and dual displacement interpolations which satisfy the corresponding geodesic equations. As applications, we study displacement convexity and the Bregman-Wasserstein barycenter.
Submitted 11 February, 2023;
originally announced February 2023.
-
Efficient Convex PCA with applications to Wasserstein geodesic PCA and ranked data
Authors:
Steven Campbell,
Ting-Kam Leonard Wong
Abstract:
Convex PCA, which was introduced by Bigot et al., is a dimension reduction methodology for data with values in a convex subset of a Hilbert space. This setting arises naturally in many applications, including distributional data in the Wasserstein space of an interval, and ranked compositional data under the Aitchison geometry. Our contribution in this paper is threefold. First, we present several new theoretical results including consistency as well as continuity and differentiability of the objective function in the finite dimensional case. Second, we develop a numerical implementation of finite dimensional convex PCA when the convex set is polyhedral, and show that this provides a natural approximation of Wasserstein geodesic PCA. Third, we illustrate our results with two financial applications, namely distributions of stock returns ranked by size and the capital distribution curve, both of which are of independent interest in stochastic portfolio theory.
Submitted 14 August, 2023; v1 submitted 5 November, 2022;
originally announced November 2022.
-
Conformal Mirror Descent with Logarithmic Divergences
Authors:
Amanjit Singh Kainth,
Ting-Kam Leonard Wong,
Frank Rudzicz
Abstract:
The logarithmic divergence is an extension of the Bregman divergence motivated by optimal transport and a generalized convex duality, and satisfies many remarkable properties. Using the geometry induced by the logarithmic divergence, we introduce a generalization of continuous time mirror descent that we term the conformal mirror descent. We derive its dynamics under a generalized mirror map, and show that it is a time change of a corresponding Hessian gradient flow. We also prove convergence results in continuous time. We apply the conformal mirror descent to online estimation of a generalized exponential family, and construct a family of gradient flows on the unit simplex via the Dirichlet optimal transport problem.
Submitted 7 September, 2022;
originally announced September 2022.
-
Modules for Experiments in Stellar Astrophysics (MESA): Time-Dependent Convection, Energy Conservation, Automatic Differentiation, and Infrastructure
Authors:
Adam S. Jermyn,
Evan B. Bauer,
Josiah Schwab,
R. Farmer,
Warrick H. Ball,
Earl P. Bellinger,
Aaron Dotter,
Meridith Joyce,
Pablo Marchant,
Joey S. G. Mombarg,
William M. Wolf,
Tin Long Sunny Wong,
Giulia C. Cinquegrana,
Eoin Farrell,
R. Smolec,
Anne Thoul,
Matteo Cantiello,
Falk Herwig,
Odette Toloza,
Lars Bildsten,
Richard H. D. Townsend,
F. X. Timmes
Abstract:
We update the capabilities of the open-knowledge software instrument Modules for Experiments in Stellar Astrophysics (MESA). The new auto_diff module implements automatic differentiation in MESA, an enabling capability that alleviates the need for hard-coded analytic expressions or finite difference approximations. We significantly enhance the treatment of the growth and decay of convection in MESA with a new model for time-dependent convection, which is particularly important during late-stage nuclear burning in massive stars and electron degenerate ignition events. We strengthen MESA's implementation of the equation of state, and we quantify continued improvements to energy accounting and solver accuracy through a discussion of different energy equation features and enhancements. To improve the modeling of stars in MESA we describe key updates to the treatment of stellar atmospheres, molecular opacities, Compton opacities, conductive opacities, element diffusion coefficients, and nuclear reaction rates. We introduce treatments of starspots, an important consideration for low-mass stars, and modifications for superadiabatic convection in radiation-dominated regions. We describe new approaches for increasing the efficiency of calculating monochromatic opacities and radiative levitation, and for increasing the efficiency of evolving the late stages of massive stars with a new operator split nuclear burning mode. We close by discussing major updates to MESA's software infrastructure that enhance source code development and community engagement.
Submitted 30 December, 2022; v1 submitted 7 August, 2022;
originally announced August 2022.
-
An isomorphism theorem for models of Weak König's Lemma without primitive recursion
Authors:
Marta Fiori-Carones,
Leszek Aleksander Kołodziejczyk,
Tin Lok Wong,
Keita Yokoyama
Abstract:
We prove that if $(M,\mathcal{X})$ and $(M,\mathcal{Y})$ are countable models of the theory $\mathrm{WKL}^*_0$ such that $\mathrm{I}Σ_1(A)$ fails for some $A \in \mathcal{X} \cap \mathcal{Y}$, then $(M,\mathcal{X})$ and $(M,\mathcal{Y})$ are isomorphic. As a consequence, the analytic hierarchy collapses to $Δ^1_1$ provably in $\mathrm{WKL}^*_0 + \neg\mathrm{I}Σ^0_1$, and $\mathrm{WKL}$ is the strongest $Π^1_2$ statement that is $Π^1_1$-conservative over $\mathrm{RCA}^*_0 + \neg\mathrm{I}Σ^0_1$.
Applying our results to the $Δ^0_n$-definable sets in models of $\mathrm{RCA}^*_0 + \mathrm{B}Σ^0_n + \neg\mathrm{I}Σ^0_n$ that also satisfy an appropriate relativization of Weak König's Lemma, we prove that for each $n \ge 1$, the set of $Π^1_2$ sentences that are $Π^1_1$-conservative over $\mathrm{RCA}^*_0 + \mathrm{B}Σ^0_n + \neg\mathrm{I}Σ^0_n$ is c.e. In contrast, we prove that the set of $Π^1_2$ sentences that are $Π^1_1$-conservative over $\mathrm{RCA}^*_0 + \mathrm{B}Σ^0_n$ is $Π_2$-complete. This answers a question of Towsner.
We also show that $\mathrm{RCA}_0 + \mathrm{RT}^2_2$ is $Π^1_1$-conservative over $\mathrm{B}Σ^0_2$ if and only if it is conservative over $\mathrm{B}Σ^0_2$ with respect to $\forall Π^0_5$ sentences.
Submitted 27 August, 2022; v1 submitted 20 December, 2021;
originally announced December 2021.
-
Pre-Explosion Properties of Helium Star Donors to Thermonuclear Supernovae
Authors:
Tin Long Sunny Wong,
Josiah Schwab,
Ylva Götberg
Abstract:
Helium star - carbon-oxygen white dwarf (CO WD) binaries are potential single-degenerate progenitor systems of thermonuclear supernovae. Revisiting a set of binary evolution calculations using the stellar evolution code $\texttt{MESA}$, we refine our previous predictions about which systems can lead to a thermonuclear supernova and then characterize the properties of the helium star donor at the time of explosion. We convert these model properties to NUV/optical magnitudes assuming a blackbody spectrum and support this approach using a matched stellar atmosphere model. These models will be valuable to compare with pre-explosion imaging for future supernovae, though we emphasize the observational difficulty of detecting extremely blue companions. The pre-explosion source detected in association with SN 2012Z has been interpreted as a helium star binary containing an initially ultra-massive WD in a multi-day orbit. However, extending our binary models to initial CO WD masses of up to $1.2\,M_{\odot}$, we find that these systems undergo off-center carbon ignitions and thus are not expected to produce thermonuclear supernovae. This tension suggests that, if SN 2012Z is associated with a helium star - WD binary, then the pre-explosion optical light from the system must be significantly modified by the binary environment and/or the WD does not have a carbon-rich interior composition.
Submitted 29 September, 2021;
originally announced September 2021.
-
Mass Transfer and Stellar Evolution of the White Dwarfs in AM CVn Binaries
Authors:
Tin Long Sunny Wong,
Lars Bildsten
Abstract:
We calculate the stellar evolution of both white dwarfs (WDs) in AM CVn binaries with orbital periods of $P_{\mathrm{orb}} \approx 5-70$ minutes. We focus on the cases where the donor starts as a $M_{\mathrm{He}} < 0.2 \, M_{\odot}$ Helium WD and the accretor is a $M_{\mathrm{WD}} > 0.6 \, M_{\odot}$ WD. Using Modules for Experiments in Stellar Astrophysics (MESA), we simultaneously evolve both WDs assuming conservative mass transfer and angular momentum loss from gravitational radiation. This self-consistent evolution yields the important feedback of the properties of the donor on the mass transfer rate, $\dot{M}$, as well as the thermal evolution of the accreting WD. Consistent with earlier work, we find that the high $\dot{M}$'s at early times force an adiabatic evolution of the donor for $P_{\mathrm{orb}} < 30$ minutes so that its mass-radius relation depends primarily on its initial entropy. As the donor reaches $ M_{\mathrm{He}} \approx 0.02-0.03 \, M_{\odot}$ at $P_{\mathrm{orb}} \simeq 30 $ minutes, it becomes fully convective and could lose entropy and expand much less than expected under further mass loss. However, we show that the lack of reliable opacities for the donor's surface inhibits a secure prediction for this possible cooling. Our calculations capture the core heating that occurs during the first $\approx 10^7$ years of accretion and continue the evolution into the phase of WD cooling that follows. When compared to existing data for accreting WDs, as seen by Cheng and collaborators for isolated WDs, we also find that the accreting WDs are not as cool as we would expect given the amount of time they have had to cool.
Submitted 27 September, 2021;
originally announced September 2021.
-
Improved Dynamical Masses for Six Brown Dwarf Companions Using Hipparcos and Gaia EDR3
Authors:
G. Mirek Brandt,
Trent J. Dupuy,
Yiting Li,
Minghan Chen,
Timothy D. Brandt,
Tin Long Sunny Wong,
Thayne Currie,
Brendan P. Bowler,
Michael C. Liu,
William M. J. Best,
Mark W. Phillips
Abstract:
We present comprehensive orbital analyses and dynamical masses for the substellar companions Gl~229~B, Gl~758~B, HD~13724~B, HD~19467~B, HD~33632~Ab, and HD~72946~B. Our dynamical fits incorporate radial velocities, relative astrometry, and most importantly calibrated Hipparcos-Gaia EDR3 accelerations. For HD~33632~A and HD~72946 we perform three-body fits that account for their outer stellar companions. We present new relative astrometry of Gl~229~B with Keck/NIRC2, extending its observed baseline to 25 years. We obtain a $<$1\% mass measurement of $71.4 \pm 0.6\,M_{\rm Jup}$ for the first T dwarf Gl~229~B and a 1.2\% mass measurement of its host star ($0.579 \pm 0.007\,M_{\odot}$) that agrees with the high-mass-end of the M dwarf mass-luminosity relation. We perform a homogeneous analysis of the host stars' ages and use them, along with the companions' measured masses and luminosities, to test substellar evolutionary models. Gl~229~B is the most discrepant, as models predict that an object this massive cannot cool to such a low luminosity within a Hubble time, implying that it may be an unresolved binary. The other companions are generally consistent with models, except for HD~13724~B that has a host-star activity age 3.8$σ$ older than its substellar cooling age. Examining our results in context with other mass-age-luminosity benchmarks, we find no trend with spectral type but instead note that younger or lower-mass brown dwarfs are over-luminous compared to models, while older or higher-mass brown dwarfs are under-luminous. The presented mass measurements for some companions are so precise that the stellar host ages, not the masses, limit the analysis.
Submitted 30 September, 2021; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Tsallis and Rényi deformations linked via a new $λ$-duality
Authors:
Ting-Kam Leonard Wong,
Jun Zhang
Abstract:
Tsallis and Rényi entropies, which are monotone transformations of each other, are deformations of the celebrated Shannon entropy. Maximization of these deformed entropies, under suitable constraints, leads to the $q$-exponential family which has applications in non-extensive statistical physics, information theory and statistics. In previous information-geometric studies, the $q$-exponential family was analyzed using classical convex duality and Bregman divergence. In this paper, we show that a generalized $λ$-duality, where $λ= 1 - q$ is the constant information-geometric curvature, leads to a generalized exponential family which is essentially equivalent to the $q$-exponential family and has deep connections with Rényi entropy and optimal transport. Using this generalized convex duality and its associated logarithmic divergence, we show that our $λ$-exponential family satisfies properties that parallel and generalize those of the exponential family. Under our framework, the Rényi entropy and divergence arise naturally, and we give a new proof of the Tsallis/Rényi entropy maximizing property of the $q$-exponential family. We also introduce a $λ$-mixture family which may be regarded as the dual of the $λ$-exponential family, and connect it with other mixture-type families. Finally, we discuss a duality between the $λ$-exponential family and the $λ$-logarithmic divergence, and study its statistical consequences.
Submitted 12 January, 2022; v1 submitted 25 July, 2021;
originally announced July 2021.
-
Projections with logarithmic divergences
Authors:
Zhixu Tao,
Ting-Kam Leonard Wong
Abstract:
In information geometry, generalized exponential families and statistical manifolds with curvature have been under active investigation in recent years. In this paper we consider the statistical manifold induced by a logarithmic $L^{(α)}$-divergence which generalizes the Bregman divergence. It is known that such a manifold is dually projectively flat with constant negative sectional curvature, and is closely related to the $\mathcal{F}^{(α)}$-family, a generalized exponential family introduced by the second author. Our main result constructs a dual foliation of the statistical manifold, i.e., an orthogonal decomposition consisting of primal and dual autoparallel submanifolds. This decomposition, which can be naturally interpreted in terms of primal and dual projections with respect to the logarithmic divergence, extends the dual foliation of a dually flat manifold studied by Amari. As an application, we formulate a new $L^{(α)}$-PCA problem which generalizes the exponential family PCA.
Submitted 8 May, 2021;
originally announced May 2021.
-
Functional portfolio optimization in stochastic portfolio theory
Authors:
Steven Campbell,
Ting-Kam Leonard Wong
Abstract:
In this paper we develop a concrete and fully implementable approach to the optimization of functionally generated portfolios in stochastic portfolio theory. The main idea is to optimize over a family of rank-based portfolios parameterized by an exponentially concave function on the unit interval. This choice can be motivated by the long term stability of the capital distribution observed in large equity markets, and allows us to circumvent the curse of dimensionality. The resulting optimization problem, which is convex, allows for various regularizations and constraints to be imposed on the generating function. We prove an existence and uniqueness result for our optimization problem and provide a stability estimate in terms of a Wasserstein metric of the input measure. Then, we formulate a discretization which can be implemented numerically using available software packages and analyze its approximation error. Finally, we present empirical examples using CRSP data from the US stock market, including the performance of the portfolios allowing for dividends, defaults, and transaction costs.
Submitted 9 October, 2021; v1 submitted 19 March, 2021;
originally announced March 2021.
-
An Outburst by AM CVn binary SDSS J113732.32+405458.3
Authors:
Tin Long Sunny Wong,
Jan van Roestel,
Thomas Kupfer,
Lars Bildsten
Abstract:
We report the discovery of a one magnitude increase in the optical brightness of the 59.63 minute orbital period AM CVn binary SDSS J113732.32+405458.3. Public $g$, $r$, and $i$ band data from the Zwicky Transient Facility (ZTF) exhibit a decline over a 300 day period, while a few data points from commissioning show that the peak was likely seen. Such an outburst is likely due to a change in the state of the accretion disk, making this the longest period AM CVn binary to reveal an unstable accretion disk. The object is now back to its previously observed (by SDSS and PS-1) quiescent brightness that is likely set by the accreting white dwarf. Prior observations of this object also imply that the recurrence times for such outbursts are likely more than 12 years.
Submitted 18 December, 2020;
originally announced December 2020.
-
Ramsey's theorem for pairs, collection, and proof size
Authors:
Leszek Aleksander Kołodziejczyk,
Tin Lok Wong,
Keita Yokoyama
Abstract:
We prove that any proof of a $\forall Σ^0_2$ sentence in the theory $\mathrm{WKL}_0 + \mathrm{RT}^2_2$ can be translated into a proof in $\mathrm{RCA}_0$ at the cost of a polynomial increase in size. In fact, the proof in $\mathrm{RCA}_0$ can be found by a polynomial-time algorithm. On the other hand, $\mathrm{RT}^2_2$ has non-elementary speedup over the weaker base theory $\mathrm{RCA}^*_0$ for proofs of $Σ_1$ sentences.
We also show that for $n \ge 0$, proofs of $Π_{n+2}$ sentences in $\mathrm{B}Σ_{n+1}+\exp$ can be translated into proofs in $\mathrm{I}Σ_{n} + \exp$ at polynomial cost. Moreover, the $Π_{n+2}$-conservativity of $\mathrm{B}Σ_{n+1} + \exp$ over $\mathrm{I}Σ_{n} + \exp$ can be proved in $\mathrm{PV}$, a fragment of bounded arithmetic corresponding to polynomial-time computation. For $n \ge 1$, this answers a question of Clote, Hájek, and Paris.
Submitted 16 January, 2021; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Scalable Gradients for Stochastic Differential Equations
Authors:
Xuechen Li,
Ting-Kam Leonard Wong,
Ricky T. Q. Chen,
David Duvenaud
Abstract:
The adjoint sensitivity method scalably computes gradients of solutions to ordinary differential equations. We generalize this method to stochastic differential equations, allowing time-efficient and constant-memory computation of gradients with high-order adaptive solvers. Specifically, we derive a stochastic differential equation whose solution is the gradient, a memory-efficient algorithm for caching noise, and conditions under which numerical solutions converge. In addition, we combine our method with gradient-based stochastic variational inference for latent stochastic differential equations. We use our method to fit stochastic dynamics defined by neural networks, achieving competitive performance on a 50-dimensional motion capture dataset.
Submitted 18 October, 2020; v1 submitted 5 January, 2020;
originally announced January 2020.
-
Where Pigeonhole Principles meet König Lemmas
Authors:
David Belanger,
Chitat Chong,
Wei Wang,
Tin Lok Wong,
Yue Yang
Abstract:
We study the pigeonhole principle for $Σ_2$-definable injections with domain twice as large as the codomain, and the weak König lemma for $Δ^0_2$-definable trees in which every level has at least half of the possible nodes. We show that the latter implies the existence of $2$-random reals, and is conservative over the former. We also show that the former is strictly weaker than the usual pigeonhole principle for $Σ_2$-definable injections.
Submitted 7 December, 2019;
originally announced December 2019.
-
Random concave functions
Authors:
Peter Baxendale,
Ting-Kam Leonard Wong
Abstract:
Spaces of convex and concave functions appear naturally in theory and applications. For example, convex regression and log-concave density estimation are important topics in nonparametric statistics. In stochastic portfolio theory, concave functions on the unit simplex measure the concentration of capital, and their gradient maps define novel investment strategies. The gradient maps may also be regarded as optimal transport maps on the simplex. In this paper we construct and study probability measures supported on spaces of concave functions. These measures may serve as prior distributions in Bayesian statistics and Cover's universal portfolio, and induce distribution-valued random variables via optimal transport. The random concave functions are constructed on the unit simplex by taking a suitably scaled (mollified, or soft) minimum of random hyperplanes. Depending on the regime of the parameters, we show that as the number of hyperplanes tends to infinity there are several possible limiting behaviors. In particular, there is a transition from a deterministic almost sure limit to a non-trivial limiting distribution that can be characterized using convex duality and Poisson point processes.
Submitted 24 May, 2021; v1 submitted 30 October, 2019;
originally announced October 2019.
-
Logarithmic divergences: geometry and interpretation of curvature
Authors:
Ting-Kam Leonard Wong,
Jiaowen Yang
Abstract:
We study the logarithmic $L^{(α)}$-divergence which extrapolates the Bregman divergence and corresponds to solutions to novel optimal transport problems. We show that this logarithmic divergence is equivalent to a conformal transformation of the Bregman divergence, and, via an explicit affine immersion, is equivalent to Kurose's geometric divergence. In particular, the $L^{(α)}$-divergence is a canonical divergence of a statistical manifold with constant sectional curvature $-α$. For such a manifold, we give a geometric interpretation of its sectional curvature in terms of how the divergence between a pair of primal and dual geodesics differs from the dually flat case. Further results can be found in our follow-up paper [27] which uncovers a novel relation between optimal transport and information geometry.
Submitted 17 June, 2019;
originally announced June 2019.
-
Pseudo-Riemannian geometry embeds information geometry in optimal transport
Authors:
Ting-Kam Leonard Wong,
Jiaowen Yang
Abstract:
Optimal transport and information geometry both study geometric structures on spaces of probability distributions. Optimal transport characterizes the cost-minimizing movement from one distribution to another, while information geometry originates from coordinate-invariant properties of statistical inference. Their connections and applications in statistics and machine learning have started to gain more attention. In this paper we give a new differential geometric connection between the two fields. Namely, the pseudo-Riemannian framework of Kim and McCann, a geometric perspective on the fundamental Ma-Trudinger-Wang (MTW) condition in the regularity theory of optimal transport maps, encodes the dualistic structure of a statistical manifold. This general relation is described using the natural framework of $c$-divergence, a divergence defined by an optimal transport map. As a by-product, we obtain a new information-geometric interpretation of the MTW tensor. This connection sheds light on old and new aspects of information geometry. The dually flat geometry of Bregman divergence corresponds to the quadratic cost and the pseudo-Euclidean space, and the $L^{(α)}$-divergence introduced by Pal and the first author has constant sectional curvature in a sense to be made precise. In these cases we give a geometric interpretation of the information-geometric curvature in terms of the divergence between a primal-dual pair of geodesics.
Submitted 5 May, 2021; v1 submitted 31 May, 2019;
originally announced June 2019.
-
Evolution of Helium Star - White Dwarf Binaries Leading up to Thermonuclear Supernovae
Authors:
Tin Long Sunny Wong,
Josiah Schwab
Abstract:
We perform binary evolution calculations on helium star - carbon-oxygen white dwarf (CO WD) binaries using the stellar evolution code MESA. This single degenerate channel may contribute significantly to thermonuclear supernovae at short delay times. We examine the thermal-timescale mass transfer from a 1.1 - 2.0 $M_{\odot}$ helium star to a 0.90 - 1.05 $M_{\odot}$ CO WD for initial orbital periods in the range 0.05 - 1 day. Systems in this range may produce a thermonuclear supernova, helium novae, a helium star - oxygen-neon WD binary, or a detached double CO WD binary. Our time-dependent calculations that resolve the stellar structures of both binary components allow accurate distinction between the eventual formation of a thermonuclear supernova (via central ignition of carbon burning) and that of an ONe WD (in the case of off-center ignition). Furthermore, we investigate the effect of a slow WD wind which implies a specific angular momentum loss from the binary that is larger than typically assumed. We find that this does not significantly alter the region of parameter space over which systems evolve toward thermonuclear supernovae. Our determination of the correspondence between initial binary parameters and the final outcome informs population synthesis studies of the contribution of the helium donor channel to thermonuclear supernovae. In addition, we constrain the orbital properties and observable stellar properties of the progenitor binaries of thermonuclear supernovae and helium novae.
Submitted 7 May, 2019; v1 submitted 14 January, 2019;
originally announced January 2019.
-
Time-consistent conditional expectation under probability distortion
Authors:
** Ma,
Ting-Kam Leonard Wong,
Jianfeng Zhang
Abstract:
We introduce a new notion of conditional nonlinear expectation under probability distortion. Such a distorted nonlinear expectation is not sub-additive in general, so it is beyond the scope of Peng's framework of nonlinear expectations. A more fundamental problem when extending the distorted expectation to a dynamic setting is time-inconsistency, that is, the usual "tower property" fails. By localizing the probability distortion and restricting to a smaller class of random variables, we introduce a so-called distorted probability and construct a conditional expectation in such a way that it coincides with the original nonlinear expectation at time zero, but has a time-consistent dynamics in the sense that the tower property remains valid. Furthermore, we show that in the continuous time model this conditional expectation corresponds to a parabolic differential equation whose coefficient involves the law of the underlying diffusion. This work is the first step towards a new understanding of nonlinear expectations under probability distortion, and will potentially be a helpful tool for solving time-inconsistent stochastic optimization problems.
Submitted 25 June, 2020; v1 submitted 21 September, 2018;
originally announced September 2018.
-
Multiplicative Schrödinger problem and the Dirichlet transport
Authors:
Soumik Pal,
Ting-Kam Leonard Wong
Abstract:
We consider an optimal transport problem on the unit simplex whose solutions are given by gradients of exponentially concave functions and prove two main results. First, we show that the optimal transport is the large deviation limit of a particle system of Dirichlet processes transporting one probability measure on the unit simplex to another by coordinatewise multiplication and normalizing. The structure of our Lagrangian and the appearance of the Dirichlet process relate our problem closely to the entropic measure on the Wasserstein space as defined by von Renesse and Sturm in the context of Wasserstein diffusion. The limiting procedure is a triangular limit where we allow simultaneously the number of particles to grow to infinity while the 'noise' tends to zero. The method, which generalizes easily to many other cost functions, including the squared Euclidean distance, provides a novel combination of the Schrödinger problem approach due to C. Léonard and the related Brownian particle systems by Adams et al. which does not require gamma convergence. Second, we analyze the behavior of entropy along the paths of transport. The reference measure on the simplex is taken to be the Dirichlet measure with all zero parameters which relates to the finite-dimensional distributions of the entropic measure. The interpolating curves are not the usual McCann lines. Nevertheless we show that entropy plus a multiple of the transport cost remains convex, which is reminiscent of the semiconvexity of entropy along lines of McCann interpolations in negative curvature spaces. We also obtain, under suitable conditions, dimension-free bounds of the optimal transport cost in terms of entropy.
Submitted 4 July, 2020; v1 submitted 15 July, 2018;
originally announced July 2018.
-
Random walks and induced Dirichlet forms on compact spaces of homogeneous type
Authors:
Shi-Lei Kong,
Ka-Sing Lau,
Ting-Kam Leonard Wong
Abstract:
We extend our study of random walks and induced Dirichlet forms on self-similar sets [arXiv:1604.05440, 1612.01708] to compact spaces of homogeneous type $(K, ρ,μ)$. A successive partition on $K$ brings a natural augmented tree structure $(X, E)$ that is Gromov hyperbolic, and the hyperbolic boundary is Hölder equivalent to $K$. We then introduce a class of transient reversible random walks on $(X, E)$ with return ratio $λ$. Using Silverstein's theory of Markov chains, we prove that the random walk induces an energy form on $K$ with $$ {\mathcal E}_K [u] \asymp \iint_{K\times K \setminus Δ} \frac{|u(ξ) - u(η)|^2}{V(ξ, η)ρ(ξ, η)^β} dμ(ξ) dμ(η), $$ where $V(ξ, η)$ is the $μ$-volume of the ball centered at $ξ$ with radius $ρ(ξ, η)$, $Δ$ is the diagonal, and $β$ depends on $λ$. In particular, for an $α$-set in ${\mathbb R}^d$, the kernel of the energy form is of order $\frac{1}{|ξ-η|^{α+β}}$. We also discuss conditions for this energy form to be a non-local regular Dirichlet form.
Submitted 8 April, 2018;
originally announced April 2018.
-
Logarithmic divergences from optimal transport and Rényi geometry
Authors:
Ting-Kam Leonard Wong
Abstract:
Divergences, also known as contrast functions, are distance-like quantities defined on manifolds of non-negative or probability measures. Using the duality in optimal transport, we introduce and study the one-parameter family of $L^{(\pm α)}$-divergences. It includes the Bregman divergence corresponding to the Euclidean quadratic cost, and the $L$-divergence introduced by Pal and the author in connection with portfolio theory and a logarithmic cost function. They admit natural generalizations of exponential family that are closely related to the $α$-family and $q$-exponential family. In particular, the $L^{(\pm α)}$-divergences of the corresponding potential functions are Rényi divergences. Using this unified framework we prove that the induced geometries are dually projectively flat with constant sectional curvatures, and a generalized Pythagorean theorem holds true. Conversely, we show that if a statistical manifold is dually projectively flat with constant curvature $\pm α$ with $α> 0$, then it is locally induced by an $L^{(\mp α)}$-divergence. We define in this context a canonical divergence which extends the one for dually flat manifolds.
Submitted 3 September, 2018; v1 submitted 10 December, 2017;
originally announced December 2017.
-
On portfolios generated by optimal transport
Authors:
Ting-Kam Leonard Wong
Abstract:
First introduced by Fernholz in stochastic portfolio theory, functionally generated portfolio allows its investment performance to be attributed to directly observable and easily interpretable market quantities. In previous works we showed that Fernholz's multiplicatively generated portfolio has deep connections with optimal transport and the information geometry of exponentially concave functions. Recently, Karatzas and Ruf introduced a new additive portfolio generation whose relation with optimal transport was studied by Vervuurt. We show that additively generated portfolio can be interpreted in terms of the well-known dually flat information geometry of Bregman divergence. Moreover, we characterize, in a sense to be made precise, all possible forms of functional portfolio constructions that contain additive and multiplicative generations as special cases. Each construction involves a divergence functional on the unit simplex measuring the market volatility captured, and admits a pathwise decomposition for the portfolio value. We illustrate with an empirical example.
Submitted 25 September, 2017; v1 submitted 10 September, 2017;
originally announced September 2017.
-
Some observations on the logical foundations of inductive theorem proving
Authors:
Stefan Hetzl,
Tin Lok Wong
Abstract:
In this paper we study the logical foundations of automated inductive theorem proving. To that aim we first develop a theoretical model that is centered around the difficulty of finding induction axioms which are sufficient for proving a goal.
Based on this model, we then analyze the following aspects: the choice of a proof shape, the choice of an induction rule and the language of the induction formula. In particular, using model-theoretic techniques, we clarify the relationship between notions of inductiveness that have been considered in the literature on automated inductive theorem proving. This is a corrected version of the paper arXiv:1704.01930v5 published originally on Nov. 16, 2017.
Submitted 12 April, 2018; v1 submitted 6 April, 2017;
originally announced April 2017.
-
Cover's universal portfolio, stochastic portfolio theory and the numeraire portfolio
Authors:
Christa Cuchiero,
Walter Schachermayer,
Ting-Kam Leonard Wong
Abstract:
Cover's celebrated theorem states that the long run yield of a properly chosen "universal" portfolio is as good as the long run yield of the best retrospectively chosen constant rebalanced portfolio. The "universality" pertains to the fact that this result is model-free, i.e., not dependent on an underlying stochastic process. We extend Cover's theorem to the setting of stochastic portfolio theory as initiated by R. Fernholz: the rebalancing rule need not be constant anymore but may depend on the present state of the stock market. This model-free result is complemented by a comparison with the log-optimal numeraire portfolio when fixing a stochastic model of the stock market. Roughly speaking, under appropriate assumptions, the optimal long run yield coincides for the three approaches mentioned in the title of this paper. We present our results in discrete and continuous time.
Submitted 29 November, 2016;
originally announced November 2016.
-
Exponentially concave functions and a new information geometry
Authors:
Soumik Pal,
Ting-Kam Leonard Wong
Abstract:
A function is exponentially concave if its exponential is concave. We consider exponentially concave functions on the unit simplex. In a previous paper we showed that gradient maps of exponentially concave functions provide solutions to a Monge-Kantorovich optimal transport problem and give a better gradient approximation than those of ordinary concave functions. The approximation error, called L-divergence, is different from the usual Bregman divergence. Using tools of information geometry and optimal transport, we show that L-divergence induces a new information geometry on the simplex consisting of a Riemannian metric and a pair of dually coupled affine connections which defines two kinds of geodesics. We show that the induced geometry is dually projectively flat but not flat. Nevertheless, we prove an analogue of the celebrated generalized Pythagorean theorem from classical information geometry. On the other hand, we consider displacement interpolation under a Lagrangian integral action that is consistent with the optimal transport problem and show that the action minimizing curves are dual geodesics. The Pythagorean theorem is also shown to have an interesting application of determining the optimal trading frequency in stochastic portfolio theory.
Submitted 31 May, 2017; v1 submitted 19 May, 2016;
originally announced May 2016.
-
Random walks and induced Dirichlet forms on self-similar sets
Authors:
Shi-Lei Kong,
Ka-Sing Lau,
Ting-Kam Leonard Wong
Abstract:
Let $K$ be a self-similar set satisfying the open set condition. Following Kaimanovich's elegant idea, it has been proved that on the symbolic space $X$ of $K$ a natural augmented tree structure ${\mathfrak E}$ exists; it is hyperbolic, and the hyperbolic boundary $\partial_HX$ with the Gromov metric is Hölder equivalent to $K$. In this paper we consider certain reversible random walks with return ratio $0< λ<1$ on $(X, {\mathfrak E})$. We show that the Martin boundary ${\mathcal M}$ can be identified with $\partial_H X$ and $K$. With this setup and a device of Silverstein, we obtain precise estimates of the Martin kernel and the Naïm kernel in terms of the Gromov product. Moreover, the Naïm kernel turns out to be a jump kernel satisfying the estimate $Θ(ξ, η) \asymp |ξ-η|^{-(α+ β)}$, where $α$ is the Hausdorff dimension of $K$ and $β$ depends on $λ$. For suitable $β$, the kernel defines a regular non-local Dirichlet form on $K$. This extends the results of Kigami concerning random walks on certain trees with Cantor-type sets as boundaries.
Submitted 19 October, 2017; v1 submitted 19 April, 2016;
originally announced April 2016.
-
Universal portfolios in stochastic portfolio theory
Authors:
Ting-Kam Leonard Wong
Abstract:
Consider a family of portfolio strategies with the aim of achieving the asymptotic growth rate of the best one. The idea behind Cover's universal portfolio is to build a wealth-weighted average which can be viewed as a buy-and-hold portfolio of portfolios. When an optimal portfolio exists, the wealth-weighted average converges to it by concentration of wealth. Working under a discrete time and pathwise setup, we show under suitable conditions that the distribution of wealth in the family satisfies a pathwise large deviation principle as time tends to infinity. Our main result extends Cover's portfolio to the nonparametric family of functionally generated portfolios in stochastic portfolio theory and establishes its asymptotic universality.
Submitted 12 December, 2016; v1 submitted 9 October, 2015;
originally announced October 2015.
-
Optimization of relative arbitrage
Authors:
Ting-Kam Leonard Wong
Abstract:
In stochastic portfolio theory, a relative arbitrage is an equity portfolio which is guaranteed to outperform a benchmark portfolio over a finite horizon. When the market is diverse and sufficiently volatile, and the benchmark is the market or a buy-and-hold portfolio, functionally generated portfolios introduced by Fernholz provide a systematic way of constructing relative arbitrages. In this paper we show that if the market portfolio is replaced by the equal or entropy weighted portfolio among many others, no relative arbitrages can be constructed under the same conditions using functionally generated portfolios. We also introduce and study a shape-constrained optimization problem for functionally generated portfolios in the spirit of maximum likelihood estimation of a log-concave density.
Submitted 24 November, 2014; v1 submitted 31 July, 2014;
originally announced July 2014.
-
The geometry of relative arbitrage
Authors:
Soumik Pal,
Ting-Kam Leonard Wong
Abstract:
Consider an equity market with $n$ stocks. The vector of proportions of the total market capitalizations that belong to each stock is called the market weight. The market weight defines the market portfolio which is a buy-and-hold portfolio representing the performance of the entire stock market. Consider a function that assigns a portfolio vector to each possible value of the market weight, and we perform self-financing trading using this portfolio function. We study the problem of characterizing functions such that the resulting portfolio will outperform the market portfolio in the long run under the conditions of diversity and sufficient volatility. No other assumption on the future behavior of stock prices is made. We prove that the only solutions are functionally generated portfolios in the sense of Fernholz. A second characterization is given as the optimal maps of a remarkable optimal transport problem. Both characterizations follow from a novel property of portfolios called multiplicative cyclical monotonicity.
Submitted 27 July, 2015; v1 submitted 15 February, 2014;
originally announced February 2014.
-
Energy, entropy, and arbitrage
Authors:
Soumik Pal,
Ting-Kam Leonard Wong
Abstract:
We introduce a pathwise approach to analyze the relative performance of an equity portfolio with respect to a benchmark market portfolio. In this energy-entropy framework, the relative performance is decomposed into three components: a volatility term, a relative entropy term measuring the distance between the portfolio weights and the market capital distribution, and another entropy term that can be controlled by the investor by adopting a suitable rebalancing strategy. This framework leads to a class of portfolio strategies that allows one to outperform, in the long run, a market that is diverse and sufficiently volatile in the sense of stochastic portfolio theory. The framework is illustrated with several empirical examples.
Submitted 1 January, 2016; v1 submitted 25 August, 2013;
originally announced August 2013.
-
Superlattices of Bi2Se3/In2Se3: Growth Characteristics and Structural Properties
Authors:
Z. Y. Wang,
X. Guo,
H. D. Li,
T. L. Wong,
N. Wang,
M. H. Xie
Abstract:
Superlattices (SLs) consisting of alternating Bi2Se3 and In2Se3 layers are grown on Si(111) by molecular-beam epitaxy. Bi2Se3, a three-dimensional topological insulator (TI), shows good chemical and structural compatibility with In2Se3, a normal band insulator with a large energy bandgap. The individual layers in the SLs are very uniform and the hetero-interfaces are sharp. Therefore, such SL structures are potential candidates for explorations of the quantum size effects of TIs.
Submitted 3 June, 2011;
originally announced June 2011.
-
Van der Waals epitaxy of Bi2Se3 on Si(111) vicinal surface: An approach to prepare high-quality thin films of topological insulator
Authors:
H. D. Li,
Z. Y. Wang,
X. Kan,
X. Guo,
H. T. He,
Z. Wang,
J. N. Wang,
T. L. Wong,
N. Wang,
M. H. Xie
Abstract:
Epitaxial growth of topological insulator Bi2Se3 thin films on nominally flat and vicinal Si(111) substrates is studied. In order to achieve a planar growth front and better quality epifilms, a two-step growth method is adopted for the van der Waals epitaxy of Bi2Se3. By employing vicinal Si(111) substrate surfaces, the in-plane growth rate anisotropy of Bi2Se3 is exploited to achieve single crystalline Bi2Se3 epifilms, in which threading defects and twins are effectively suppressed. Optimization of the growth parameters has resulted in vicinal Bi2Se3 films showing a carrier mobility of ~2000 cm$^2$V$^{-1}$s$^{-1}$ and a background doping of ~3 x 10$^{18}$ cm$^{-3}$ in the as-grown layers. Such samples show not only relatively high magnetoresistance but also a linear dependence on magnetic field.
Submitted 11 May, 2010; v1 submitted 4 May, 2010;
originally announced May 2010.