-
Self-distributive structures in physics
Authors:
Tobias Fritz
Abstract:
It is an important feature of our existing physical theories that observables generate one-parameter groups of transformations. In classical Hamiltonian mechanics and quantum mechanics, this is due to the fact that the observables form a Lie algebra, and it manifests itself in Noether's theorem. In this paper, we introduce Lie quandles as the minimal mathematical structure needed to express the idea that observables generate transformations. This is based on the notion of a quandle used most famously in knot theory, whose main defining property is the self-distributivity equation $x \triangleright (y \triangleright z) = (x \triangleright y) \triangleright (x \triangleright z)$. We argue that Lie quandles can be thought of as nonlinear generalizations of Lie algebras.
We also observe that taking convex combinations of points in vector spaces, which physically corresponds to mixing states, satisfies the same form of self-distributivity.
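The self-distributivity of convex combinations can be checked numerically. The following is a minimal sketch, assuming the convention $x \triangleright y = (1-\lambda)x + \lambda y$ for a fixed weight $\lambda$; this convention and the names `mix` and `tri` are illustrative choices, not notation taken from the paper.

```python
from fractions import Fraction

def mix(lam):
    """Return the operation x |> y = (1 - lam) * x + lam * y on vectors (tuples)."""
    def op(x, y):
        return tuple((1 - lam) * a + lam * b for a, b in zip(x, y))
    return op

# Hypothetical sample data: a weight and three points in the plane.
lam = Fraction(1, 3)
tri = mix(lam)
x = (Fraction(1), Fraction(0))
y = (Fraction(0), Fraction(2))
z = (Fraction(5), Fraction(-1))

# Self-distributivity: x |> (y |> z) == (x |> y) |> (x |> z)
lhs = tri(x, tri(y, z))
rhs = tri(tri(x, y), tri(x, z))
self_distributive = lhs == rhs

# Mixing a point with itself returns the same point (idempotence).
idempotent = tri(x, x) == x
```

Exact rational arithmetic is used so that the equality test is not affected by floating-point rounding.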
Submitted 10 April, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Differential geometry and general relativity with algebraifolds
Authors:
Tobias Fritz
Abstract:
It is often noted that many of the basic concepts of differential geometry, such as the definition of connection, are purely algebraic in nature. Here, we review and extend existing work on fully algebraic formulations of differential geometry which eliminate the need for an underlying manifold. While the literature contains various independent approaches to this, we focus on one particular approach that we argue to be the most natural one based on the definition of \emph{algebraifold}, by which we mean a commutative algebra $\mathcal{A}$ for which the module of derivations of $\mathcal{A}$ is finitely generated projective. Over $\mathbb{R}$ as the base ring, this class of algebras includes the algebra $C^\infty(M)$ of smooth functions on a manifold $M$, and similarly for analytic functions. An importantly different example is the Colombeau algebra of generalized functions on $M$, which makes distributional differential geometry an instance of our formalism. Another instance is a fibred version of smooth differential geometry, since any smooth submersion $M \to N$ makes $C^\infty(M)$ into an algebraifold with $C^\infty(N)$ as the base ring. Over any field $k$ of characteristic zero, examples include the algebra of regular functions on a smooth affine variety as well as any function field.
Our development of differential geometry in terms of algebraifolds comprises tensors, connections, curvature and geodesics, and we briefly consider general relativity.
Submitted 11 March, 2024;
originally announced March 2024.
-
Hidden Markov Models and the Bayes Filter in Categorical Probability
Authors:
Tobias Fritz,
Andreas Klingler,
Drew McNeely,
Areeb Shah-Mohammed,
Yuwen Wang
Abstract:
We use Markov categories to develop generalizations of the theory of Markov chains and hidden Markov models in an abstract setting. This comprises characterizations of hidden Markov models in terms of local and global conditional independences as well as existing algorithms for Bayesian filtering and smoothing applicable in all Markov categories with conditionals. We show that these algorithms specialize to existing ones such as the Kalman filter, forward-backward algorithm, and the Rauch-Tung-Striebel smoother when instantiated in appropriate Markov categories. Under slightly stronger assumptions, we also prove that the sequence of outputs of the Bayes filter is itself a Markov chain with a concrete formula for its transition maps.
There are two main features of this categorical framework. The first is its generality, as it can be used in any Markov category with conditionals. In particular, it provides a systematic unified account of hidden Markov models and algorithms for filtering and smoothing in discrete probability, Gaussian probability, measure-theoretic probability, possibilistic nondeterminism and others at the same time. The second feature is the intuitive visual representation of information flow in these algorithms in terms of string diagrams.
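For orientation, the discrete-probability instance of Bayesian filtering can be sketched as the familiar forward recursion for a finite hidden Markov model. This is a minimal sketch, not the categorical construction of the paper; the function name `bayes_filter`, the convention that the first observation is made before any transition, and the sample numbers are all assumptions made for illustration.

```python
from fractions import Fraction as F

def bayes_filter(prior, transition, emission, observations):
    """Forward filtering for a finite hidden Markov model.

    prior[i]         : probability of hidden state i at time 0
    transition[i][j] : probability of moving from state i to state j
    emission[i][o]   : probability of observing o while in state i
    Returns the list of posteriors P(state_t | observations up to time t).
    """
    n = len(prior)
    belief = list(prior)
    posteriors = []
    for t, obs in enumerate(observations):
        if t > 0:
            # Prediction step: push the belief through the transition kernel.
            belief = [sum(belief[i] * transition[i][j] for i in range(n))
                      for j in range(n)]
        # Update step: condition on the new observation via Bayes' rule.
        unnormalized = [belief[i] * emission[i][obs] for i in range(n)]
        total = sum(unnormalized)
        if total == 0:
            raise ValueError("observation has probability zero under the model")
        belief = [u / total for u in unnormalized]
        posteriors.append(belief)
    return posteriors

# Hypothetical two-state model with two possible observations.
prior = [F(1, 2), F(1, 2)]
transition = [[F(9, 10), F(1, 10)], [F(1, 5), F(4, 5)]]
emission = [[F(4, 5), F(1, 5)], [F(3, 10), F(7, 10)]]
posteriors = bayes_filter(prior, transition, emission, [0, 1])
```

Each entry of `posteriors` is a probability vector over hidden states; in this discrete setting the prediction and update steps are plain matrix-vector arithmetic followed by normalization.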
Submitted 25 February, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.
-
Involutive Markov categories and the quantum de Finetti theorem
Authors:
Tobias Fritz,
Antonio Lorenzin
Abstract:
Markov categories have recently emerged as a powerful high-level framework for probability theory and theoretical statistics. Here we study a quantum version of this concept, called involutive Markov categories. First, we show that these are equivalent to Parzygnat's quantum Markov categories but argue that they are simpler to work with. Our main examples of involutive Markov categories involve C*-algebras (of any dimension) as objects and completely positive unital maps as morphisms in the picture of interest. Second, we prove a quantum de Finetti theorem for both the minimal and the maximal C*-tensor norms, and we develop a categorical description of such quantum de Finetti theorems which amounts to a universal property of state spaces.
Submitted 12 January, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
Absolute continuity, supports and idempotent splitting in categorical probability
Authors:
Tobias Fritz,
Tomáš Gonda,
Antonio Lorenzin,
Paolo Perrone,
Dario Stein
Abstract:
Markov categories have recently turned out to be a powerful high-level framework for probability and statistics. They accommodate purely categorical definitions of notions like conditional probability and almost sure equality, as well as proofs of fundamental results such as the Hewitt-Savage 0/1 Law, the de Finetti Theorem and the Ergodic Decomposition Theorem. In this work, we develop additional relevant notions from probability theory in the setting of Markov categories. This comprises improved versions of previously introduced definitions of absolute continuity and supports, as well as a detailed study of idempotents and idempotent splitting in Markov categories. Our main result on idempotent splitting is that every idempotent measurable Markov kernel between standard Borel spaces splits through another standard Borel space, and we derive this as an instance of a general categorical criterion for idempotent splitting in Markov categories.
Submitted 6 September, 2023; v1 submitted 1 August, 2023;
originally announced August 2023.
-
Weakly Markov categories and weakly affine monads
Authors:
Tobias Fritz,
Fabio Gadducci,
Paolo Perrone,
Davide Trotta
Abstract:
Introduced in the 1990s in the context of the algebraic approach to graph rewriting, gs-monoidal categories are symmetric monoidal categories where each object is equipped with the structure of a commutative comonoid. They arise for example as Kleisli categories of commutative monads on cartesian categories, and as such they provide a general framework for effectful computation. Recently proposed in the context of categorical probability, Markov categories are gs-monoidal categories where the monoidal unit is also terminal, and they arise for example as Kleisli categories of commutative affine monads, where affine means that the monad preserves the monoidal unit.
The aim of this paper is to study a new condition on the gs-monoidal structure, resulting in the concept of weakly Markov categories, which is intermediate between gs-monoidal categories and Markov ones. In a weakly Markov category, the morphisms to the monoidal unit are not necessarily unique, but form a group. As we show, these categories exhibit a rich theory of conditional independence for morphisms, generalising the known theory for Markov categories. We also introduce the corresponding notion for commutative monads, which we call weakly affine, and for which we give two equivalent characterisations.
The paper argues that these monads are relevant to the study of categorical probability. A case in point is the monad of finite non-zero measures, which is weakly affine but not affine. Such structures allow one to investigate probability without normalisation within an elegant categorical framework.
Submitted 25 August, 2023; v1 submitted 24 March, 2023;
originally announced March 2023.
-
Matrix majorization in large samples
Authors:
Muhammad Usman Farooq,
Tobias Fritz,
Erkka Haapasalo,
Marco Tomamichel
Abstract:
One tuple of probability vectors is more informative than another tuple when there exists a single stochastic matrix transforming the probability vectors of the first tuple into the probability vectors of the other. This is called matrix majorization. Solving an open problem raised by Mu et al., we show that if certain monotones - namely multivariate extensions of Rényi divergences - are strictly ordered between the two tuples, then for sufficiently large $n$, there exists a stochastic matrix taking the $n$-fold Kronecker power of each input distribution to the $n$-fold Kronecker power of the corresponding output distribution. The same conditions, with non-strict ordering for the monotones, are also necessary for such matrix majorization in large samples.
Our result also gives conditions for the existence of a sequence of statistical maps that asymptotically (with vanishing error) convert a single copy of each input distribution to the corresponding output distribution with the help of a catalyst that is returned unchanged. Allowing for transformation with arbitrarily small error, we find conditions that are both necessary and sufficient for such catalytic matrix majorization.
We derive our results by building on a general algebraic theory of preordered semirings recently developed by one of the authors. This also allows us to recover various existing results on majorization in large samples and in the catalytic regime as well as relative majorization in a unified manner.
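The definition of matrix majorization can be made concrete with a small check. The sketch below only verifies that a given candidate matrix witnesses the relation; it does not search for one and does not implement the large-sample criterion of the paper. It assumes column-stochastic matrices acting on column vectors, and all function names and sample data are hypothetical.

```python
from fractions import Fraction as F

def is_column_stochastic(T):
    """True if all entries are nonnegative and every column sums to 1."""
    columns = range(len(T[0]))
    nonnegative = all(entry >= 0 for row in T for entry in row)
    return nonnegative and all(sum(row[j] for row in T) == 1 for j in columns)

def apply_matrix(T, p):
    """Matrix-vector product T p."""
    return [sum(T[i][j] * p[j] for j in range(len(p))) for i in range(len(T))]

def kron_power(p, n):
    """n-fold Kronecker power of a probability vector (n i.i.d. samples)."""
    out = [F(1)]
    for _ in range(n):
        out = [a * b for a in out for b in p]
    return out

def witnesses_majorization(T, inputs, outputs):
    """True if the single stochastic matrix T maps each input to its output."""
    return is_column_stochastic(T) and all(
        apply_matrix(T, p) == q for p, q in zip(inputs, outputs)
    )

# Hypothetical example: perfectly distinguishable inputs, noisy outputs.
inputs = [[F(1), F(0)], [F(0), F(1)]]
outputs = [[F(3, 4), F(1, 4)], [F(1, 4), F(3, 4)]]
T = [[F(3, 4), F(1, 4)], [F(1, 4), F(3, 4)]]
is_witness = witnesses_majorization(T, inputs, outputs)
```

In the large-sample setting one would compare `kron_power(p, n)` with `kron_power(q, n)` for each pair, with a stochastic matrix acting on the correspondingly larger space.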
Submitted 8 January, 2024; v1 submitted 18 January, 2023;
originally announced January 2023.
-
Dilations and information flow axioms in categorical probability
Authors:
Tobias Fritz,
Tomáš Gonda,
Nicholas Gauguin Houghton-Larsen,
Antonio Lorenzin,
Paolo Perrone,
Dario Stein
Abstract:
We study the positivity and causality axioms for Markov categories as properties of dilations and information flow in Markov categories, and in variations thereof for arbitrary semicartesian monoidal categories. These help us show that being a positive Markov category is merely an additional property of a symmetric monoidal category (rather than extra structure). We also characterize the positivity of representable Markov categories and prove that causality implies positivity, but not conversely. Finally, we note that positivity fails for quasi-Borel spaces and interpret this failure as a privacy property of probabilistic name generation.
Submitted 9 June, 2023; v1 submitted 4 November, 2022;
originally announced November 2022.
-
The d-separation criterion in Categorical Probability
Authors:
Tobias Fritz,
Andreas Klingler
Abstract:
The d-separation criterion detects the compatibility of a joint probability distribution with a directed acyclic graph through certain conditional independences. In this work, we study this problem in the context of categorical probability theory by introducing a categorical definition of causal models, a categorical notion of d-separation, and proving an abstract version of the d-separation criterion. This approach has two main benefits. First, categorical d-separation is a very intuitive criterion based on topological connectedness. Second, our results apply both to measure-theoretic probability (with standard Borel spaces) and beyond probability theory, including to deterministic and possibilistic networks. It therefore provides a clean proof of the equivalence of local and global Markov properties with causal compatibility for continuous and mixed random variables as well as deterministic and possibilistic variables.
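For comparison with the categorical notion, classical d-separation on a directed acyclic graph can itself be tested by a connectedness check: restrict to the ancestors of the variables involved, moralize, delete the conditioning set, and test reachability. The sketch below implements this standard graph-theoretic test, not the categorical definition of the paper; the function name `d_separated` and the encoding of the graph as a dictionary of parent sets are assumptions, and the three node sets are assumed to be pairwise disjoint.

```python
def d_separated(parents, xs, ys, zs):
    """Return True if xs and ys are d-separated given zs in a DAG.

    parents maps each node to the set of its parents (roots may be omitted).
    """
    xs, ys, zs = set(xs), set(ys), set(zs)

    # 1. Restrict to the ancestral closure of all nodes involved.
    relevant = set()
    stack = list(xs | ys | zs)
    while stack:
        v = stack.pop()
        if v not in relevant:
            relevant.add(v)
            stack.extend(parents.get(v, ()))

    # 2. Moralize: connect each node to its parents and co-parents to each other.
    neighbours = {v: set() for v in relevant}
    for v in relevant:
        ps = list(parents.get(v, ()))
        for p in ps:
            neighbours[v].add(p)
            neighbours[p].add(v)
        for i, p in enumerate(ps):
            for q in ps[i + 1:]:
                neighbours[p].add(q)
                neighbours[q].add(p)

    # 3. Delete the conditioning set and test whether xs can still reach ys.
    seen = set()
    stack = list(xs)
    while stack:
        v = stack.pop()
        if v in seen:
            continue
        seen.add(v)
        stack.extend(n for n in neighbours[v] if n not in zs and n not in seen)
    return not (seen & ys)

# Hypothetical examples: a collider a -> c <- b and a chain a -> b -> c.
collider = {"c": {"a", "b"}}
chain = {"b": {"a"}, "c": {"b"}}
```

In the collider, `a` and `b` are separated marginally but become connected once `c` is conditioned on; in the chain, conditioning on the middle node `b` separates the endpoints.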
Submitted 20 February, 2023; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Non-abelian and $\varepsilon$-curved homological algebra with arrow categories
Authors:
Tobias Fritz
Abstract:
Grandis's non-abelian homological algebra generalizes standard homological algebra in abelian categories to \textit{homological categories}, which are a broader class of categories including for example the category of lattices and Galois connections. Here, we prove that if $\mathsf{C}$ is any category with an ideal of null morphisms with respect to which (co)kernels exist, then the arrow category of $\mathsf{C}$ is a homological category. This broadens the applicability of Grandis's framework substantially. In particular, one can form the homology of chain complexes in $\mathsf{C}$ by taking the homology objects to be morphisms of $\mathsf{C}$, which one may think of as maps from an object of cycles to an object of chains modulo boundaries.
One situation to which Grandis's original framework does not apply is \textit{$\varepsilon$-curved homological algebra}. This refers to chain complexes of normed spaces whose differential squares to zero only approximately, in the sense that $\|d^2\| \leq \varepsilon$ for some $\varepsilon > 0$. This is relevant for example in the theory of approximate representations of groups, where Kazhdan has successfully employed $\varepsilon$-curved homological techniques in an ad-hoc manner. We develop some basics of $\varepsilon$-curved homological algebra and note that our result on arrow categories facilitates the application of Grandis's theory.
Submitted 16 September, 2022; v1 submitted 23 June, 2022;
originally announced June 2022.
-
Asymptotic and catalytic containment of representations of $\mathsf{SU}(n)$
Authors:
Tobias Fritz
Abstract:
Given two finite-dimensional representations $ρ$ and $σ$ of $\mathsf{SU}(n)$, when is there $n \in \mathbb{N}$ such that $ρ^{\otimes n}$ is isomorphic to a subrepresentation of $σ^{\otimes n}$? When is there a third representation $η$ such that $ρ\otimes η$ is a subrepresentation of $σ\otimes η$? We call these the questions of asymptotic and catalytic containment, respectively.
We answer both questions in terms of an explicit family of inequalities. These inequalities are almost necessary and sufficient in the following sense. If two representations satisfy all inequalities strictly, then asymptotic and catalytic containment follow (the former in generic cases). Conversely, if asymptotic or catalytic containment holds, then the inequalities must hold non-strictly. These results are an instance of a recent \emph{Vergleichsstellensatz} applied to the representation semiring.
Submitted 10 September, 2022; v1 submitted 22 May, 2022;
originally announced May 2022.
-
From Gs-monoidal to Oplax Cartesian Categories: Constructions and Functorial Completeness
Authors:
Tobias Fritz,
Fabio Gadducci,
Davide Trotta,
Andrea Corradini
Abstract:
Originally introduced in the context of the algebraic approach to term graph rewriting, the notion of gs-monoidal category has surfaced a few times under different monikers in the last decades. They can be thought of as symmetric monoidal categories whose arrows are generalised relations, with enough structure to talk about domains and partial functions, but less structure than cartesian bicategories. The aim of this paper is threefold. The first goal is to extend the original definition of gs-monoidality by enriching it with a preorder on arrows, giving rise to what we call oplax cartesian categories. Second, we show that (preorder-enriched) gs-monoidal categories naturally arise both as Kleisli categories and as span categories, and the relation between the resulting formalisms is explored. Finally, we present two theorems concerning Yoneda embeddings on the one hand and functorial completeness on the other, the latter inducing a completeness result also for lax functors from oplax cartesian categories to $\mathbf{Rel}$.
Submitted 29 September, 2023; v1 submitted 13 May, 2022;
originally announced May 2022.
-
Free gs-monoidal categories and free Markov categories
Authors:
Tobias Fritz,
Wendong Liang
Abstract:
Categorical probability has recently seen significant advances through the formalism of Markov categories, within which several classical theorems have been proven in entirely abstract categorical terms. Closely related to Markov categories are gs-monoidal categories, also known as CD categories. These omit a condition that implements the normalization of probability. Extending work of Corradini and Gadducci, we construct free gs-monoidal and free Markov categories generated by a collection of morphisms of arbitrary arity and coarity. For free gs-monoidal categories, this comes in the form of an explicit combinatorial description of their morphisms as structured cospans of labeled hypergraphs. These can be thought of as a formalization of gs-monoidal string diagrams ($=$term graphs) as a combinatorial data structure. We formulate the appropriate $2$-categorical universal property based on ideas of Walters and prove that our categories satisfy it.
We expect our free categories to be relevant for computer implementations and we also argue that they can be used as statistical causal models generalizing Bayesian networks.
Submitted 8 February, 2023; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Abstract Vergleichsstellensätze for preordered semifields and semirings II
Authors:
Tobias Fritz
Abstract:
This paper continues our foundational work on real algebra with preordered semifields and semirings. We prove two abstract Vergleichsstellensätze for commutative preordered semirings of polynomial growth. These generalize the results of Part I in that we now no longer assume $1 \ge 0$. This adds substantial technical complications: our Vergleichsstellensätze now also need to take into account infinitesimal information encoded in the form of monotone derivations in addition to the monotone homomorphisms to the nonnegative reals and tropical reals. The proof relies on a number of technical results that we develop along the way, including surprising implications between inequalities in preordered semifields and a type classification for multiplicatively Archimedean fully preordered semifields.
A companion paper uses these results in order to derive a limit theorem for random walks on topological abelian groups which provides sufficient and close to necessary conditions for when one random walk will dominate another at late times.
Submitted 8 November, 2022; v1 submitted 11 December, 2021;
originally announced December 2021.
-
Weak cartesian properties of simplicial sets
Authors:
Carmen Constantin,
Tobias Fritz,
Paolo Perrone,
Brandon Shapiro
Abstract:
Many special classes of simplicial sets, such as the nerves of categories or groupoids, the 2-Segal sets of Dyckerhoff and Kapranov, and the (discrete) decomposition spaces of Gálvez, Kock, and Tonks, are characterized by the property of sending certain commuting squares in the simplex category $Δ$ to pullback squares of sets. We introduce weaker analogues of these properties called completeness conditions, which require squares in $Δ$ to be sent to weak pullbacks of sets, defined similarly to pullback squares but without the uniqueness property of induced maps. We show that some of these completeness conditions provide a simplicial set with lifts against certain subsets of simplices first introduced in the theory of database design. We also provide reduced criteria for checking these properties using factorization results for pushout squares in $Δ$, which we characterize completely, along with several other classes of squares in $Δ$. Examples of simplicial sets with completeness conditions include quasicategories, many of the compositories and gleaves of Flori and Fritz, and bar constructions for algebras of certain classes of monads. The latter is our motivating example.
Submitted 24 October, 2023; v1 submitted 10 May, 2021;
originally announced May 2021.
-
De Finetti's Theorem in Categorical Probability
Authors:
Tobias Fritz,
Tomáš Gonda,
Paolo Perrone
Abstract:
We present a novel proof of de Finetti's Theorem characterizing permutation-invariant probability measures of infinite sequences of variables, so-called exchangeable measures. The proof is phrased in the language of Markov categories, which provide an abstract categorical framework for probability and information flow. The diagrammatic and abstract nature of the arguments makes the proof intuitive and easy to follow. We also show how the usual measure-theoretic version of de Finetti's Theorem for standard Borel spaces is an instance of this result.
Submitted 16 September, 2021; v1 submitted 6 May, 2021;
originally announced May 2021.
-
Amenability of semigroups and common multiples in $\ell^1_+$
Authors:
Tobias Fritz
Abstract:
In this note, we prove that a semigroup $S$ is left amenable if and only if every two nonzero elements of $\ell^1_+(S)$ have a common nonzero right multiple, where $\ell^1_+(S)$ is the positive part of the Banach algebra $\ell^1(S)$, or equivalently the semiring of finite measures on $S$. This characterization of amenability is new even for groups.
Submitted 27 January, 2021; v1 submitted 26 January, 2021;
originally announced January 2021.
-
Representable Markov Categories and Comparison of Statistical Experiments in Categorical Probability
Authors:
Tobias Fritz,
Tomáš Gonda,
Paolo Perrone,
Eigil Fjeldgren Rischel
Abstract:
Markov categories are a recent categorical approach to the mathematical foundations of probability and statistics. Here, this approach is advanced by stating and proving equivalent conditions for second-order stochastic dominance, a widely used way of comparing probability distributions by their spread. Furthermore, we lay the foundation for the theory of comparing statistical experiments within Markov categories by stating and proving the classical Blackwell-Sherman-Stein Theorem. Our version not only offers new insight into the proof, but its abstract nature also makes the result more general, automatically specializing to the standard Blackwell-Sherman-Stein Theorem in measure-theoretic probability as well as a Bayesian version that involves prior-dependent garbling. Along the way, we define and characterize representable Markov categories, within which one can talk about Markov kernels to or from spaces of distributions. We do so by exploring the relation between Markov categories and Kleisli categories of probability monads.
Submitted 8 May, 2023; v1 submitted 14 October, 2020;
originally announced October 2020.
-
Partial Evaluations and the Compositional Structure of the Bar Construction
Authors:
Carmen Constantin,
Paolo Perrone,
Tobias Fritz,
Brandon T. Shapiro
Abstract:
The algebraic expression $3 + 2 + 6$ can be evaluated to $11$, but it can also be partially evaluated to $5 + 6$. In categorical algebra, such partial evaluations can be defined in terms of the $1$-skeleton of the bar construction for algebras of a monad. We show that this partial evaluation relation can be seen as the relation internal to the category of algebras generated by relating a formal expression to its total evaluation. The relation is transitive for many monads which describe commonly encountered algebraic structures, and more generally for BC monads on $\mathsf{Set}$ (which are those monads for which the underlying functor and the multiplication are weakly cartesian). We find that this is not true for all monads: we describe a finitary monad on $\mathsf{Set}$ for which the partial evaluation relation on the terminal algebra is not transitive. With the perspective of higher algebraic rewriting in mind, we then investigate the compositional structure of the bar construction in all dimensions. We show that for algebras of BC monads, the bar construction has fillers for all directed acyclic configurations in $Δ^n$, but generally not all inner horns.
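The opening example can be spelled out for one specific monad. The sketch below assumes the list monad on $\mathsf{Set}$ with the integers under addition as the algebra; a nested list plays the role of a 1-simplex of the bar construction, whose two faces are the unevaluated and the partially evaluated expression. The function names `flatten` and `evaluate_inner` are illustrative and do not come from the paper.

```python
def flatten(nested):
    """Monad multiplication for lists: remove one level of brackets."""
    return [x for inner in nested for x in inner]

def evaluate_inner(nested):
    """Apply the algebra map (here: integer sum) inside each bracket."""
    return [sum(inner) for inner in nested]

# The bracketing (3 + 2) + (6) as a formal expression of formal expressions.
nested = [[3, 2], [6]]

source = flatten(nested)          # the formal sum 3 + 2 + 6
target = evaluate_inner(nested)   # the formal sum 5 + 6

# Both faces have the same total evaluation.
total_source = sum(source)
total_target = sum(target)
```

Here `[3, 2, 6]` partially evaluates to `[5, 6]` because a single nested expression maps to both; choosing the bracketing `[[3, 2, 6]]` instead would give the total evaluation `[11]`.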
△ Less
Submitted 14 March, 2023; v1 submitted 15 September, 2020;
originally announced September 2020.
-
Characterizing the asymptotic and catalytic stochastic orders on topological abelian groups
Authors:
Tobias Fritz
Abstract:
We study the usual stochastic order between probability measures on preordered topological abelian groups, focusing on asymptotic and catalytic versions of the order. In the asymptotic version, a measure $μ$ dominates a measure $ν$ if the i.i.d. random walk generated by $μ$ first-order dominates the one generated by $ν$ at late times. In the catalytic version, $μ$ dominates $ν$ if there is a third $τ$ such that the convolution $μ\ast τ$ first-order dominates $ν\ast τ$.
Provided that the preorder on $G$ is induced by a suitably large positive cone and that both measures are compactly supported Radon, our main result gives a sufficient condition for asymptotic and catalytic dominance to hold in terms of a family of inequalities closely related to the cumulant-generating functions. While this sufficient condition requires these inequalities to be strict, the non-strict versions of these inequalities are easily seen to be necessary. In this sense, our result gives conditions that are necessary and sufficient in generic cases. This result has been known for $G = \mathbb{R}$, but is new already for $\mathbb{R}^n$ with $n > 1$. It is a direct application of a recently proven theorem of real algebra, namely a Vergleichsstellensatz for preordered semirings.
We finally use our result to derive a formula for the rate at which the probabilities of a random walk decay relative to those of another, now for walks on a preordered topological vector space with compactly supported Radon steps. Taking one of these walks to be deterministic reproduces a version of Cramér's large deviation theorem for infinite dimensions.
Submitted 9 November, 2023; v1 submitted 28 April, 2020;
originally announced April 2020.
-
Abstract Vergleichsstellensätze for preordered semifields and semirings I
Authors:
Tobias Fritz
Abstract:
Real algebra is usually thought of as the study of certain kinds of preorders on fields and rings. Among its core themes are the separation theorems known as Positivstellensätze. However, there is a nascent subfield of real algebra which studies preordered semirings and semifields, which is motivated by applications to probability, graph theory and theoretical computer science, among others. Here, we contribute to this subfield by developing a number of foundational results for it, with two abstract Vergleichsstellensätze being our main theorems.
Our first Vergleichsstellensatz states that every semifield preorder is the intersection of its total extensions. We apply this to derive our second main result, a Vergleichsstellensatz for certain non-Archimedean preordered semirings in which the homomorphisms to the tropical reals play an important role. We show how this result recovers the existing Vergleichsstellensatz of Strassen and (through the latter) the classical Positivstellensatz of Krivine--Kadison--Dubois.
Submitted 8 February, 2023; v1 submitted 30 March, 2020;
originally announced March 2020.
-
Infinite products and zero-one laws in categorical probability
Authors:
Tobias Fritz,
Eigil Fjeldgren Rischel
Abstract:
Markov categories are a recent category-theoretic approach to the foundations of probability and statistics. Here we develop this approach further by treating infinite products and the Kolmogorov extension theorem. This is relevant for all aspects of probability theory in which infinitely many random variables appear at a time. These infinite tensor products $\bigotimes_{i \in J} X_i$ come in two versions: a weaker but more general one for families of objects $(X_i)_{i \in J}$ in semicartesian symmetric monoidal categories, and a stronger but more specific one for families of objects in Markov categories.
As a first application, we state and prove versions of the zero-one laws of Kolmogorov and Hewitt-Savage for Markov categories. This gives general versions of these results which can be instantiated not only in measure-theoretic probability, where they specialize to the standard ones in the setting of standard Borel spaces, but also in other contexts.
Submitted 17 August, 2020; v1 submitted 5 December, 2019;
originally announced December 2019.
-
Monotone homomorphisms on convolution semigroups
Authors:
Tobias Fritz,
Xiaosheng Mu,
Omer Tamuz
Abstract:
We study monotone homomorphisms on the semigroup of probability measures on $\mathbb{R}$, by which we mean maps to the reals that are monotone with respect to the stochastic order and additive under convolution. We show that scalar multiples of the expectation are the unique monotone homomorphisms on the semigroup of measures with finite $p$-th moment, for any $1 \le p < \infty$. We also prove that the entire semigroup of probability measures admits no non-zero monotone homomorphism.
Submitted 3 February, 2021; v1 submitted 3 December, 2019;
originally announced December 2019.
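The abstract above singles out the expectation as a map that is additive under convolution. A minimal numerical sketch of that additivity, assuming finitely supported measures represented as dictionaries from points to exact probabilities (the helper names `convolve` and `expectation` are hypothetical, not from the paper):

```python
from collections import defaultdict
from fractions import Fraction


def convolve(mu, nu):
    """Convolution of two finitely supported probability measures on the reals.

    Each measure is a dict mapping a support point to its probability.
    """
    out = defaultdict(Fraction)
    for x, p in mu.items():
        for y, q in nu.items():
            out[x + y] += p * q
    return dict(out)


def expectation(mu):
    """Mean of a finitely supported probability measure."""
    return sum(x * p for x, p in mu.items())


if __name__ == "__main__":
    mu = {0: Fraction(1, 2), 1: Fraction(1, 2)}
    nu = {1: Fraction(1, 3), 4: Fraction(2, 3)}
    # Additivity under convolution: E[mu * nu] = E[mu] + E[nu].
    print(expectation(convolve(mu, nu)) == expectation(mu) + expectation(nu))
```

This only checks additivity on one example; the monotonicity and uniqueness statements in the abstract are not tested here.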
-
Probability, valuations, hyperspace: Three monads on Top and the support as a morphism
Authors:
Tobias Fritz,
Paolo Perrone,
Sharwin Rezagholi
Abstract:
We consider three monads on Top, the category of topological spaces, which formalize topological aspects of probability and possibility in categorical terms. The first one is the Hoare hyperspace monad H, which assigns to every space its space of closed subsets equipped with the lower Vietoris topology. The second is the monad V of continuous valuations, also known as the extended probabilistic powerdomain. We construct both monads in a unified way in terms of double dualization. This reveals a close analogy between them, and allows us to prove that the operation of taking the support of a continuous valuation is a morphism of monads from V to H. In particular, this implies that every H-algebra (topological complete semilattice) is also a V-algebra. Third, we show that V can be restricted to a submonad of tau-smooth probability measures on Top. By composing these two morphisms of monads, we obtain that taking the support of a tau-smooth probability measure is also a morphism of monads.
Submitted 16 September, 2021; v1 submitted 8 October, 2019;
originally announced October 2019.
-
A synthetic approach to Markov kernels, conditional independence and theorems on sufficient statistics
Authors:
Tobias Fritz
Abstract:
We develop Markov categories as a framework for synthetic probability and statistics, following work of Golubtsov as well as Cho and Jacobs. This means that we treat the following concepts in purely abstract categorical terms: conditioning and disintegration; various versions of conditional independence and its standard properties; conditional products; almost surely; sufficient statistics; versions of theorems on sufficient statistics due to Fisher--Neyman, Basu, and Bahadur.
Besides the conceptual clarity offered by our categorical setup, its main advantage is that it provides a uniform treatment of various types of probability theory, including discrete probability theory, measure-theoretic probability with general measurable spaces, Gaussian probability, stochastic processes of either of these kinds, and many others.
Submitted 31 May, 2020; v1 submitted 19 August, 2019;
originally announced August 2019.
-
The universal property of infinite direct sums in C$^*$-categories and W$^*$-categories
Authors:
Tobias Fritz,
Bas Westerbaan
Abstract:
When formulating universal properties for objects in a dagger category, one usually expects a universal property to characterize the universal object up to unique unitary isomorphism. We observe that this is automatically the case in the important special case of C$^*$-categories, provided that one uses enrichment in Banach spaces. We then formulate such a universal property for infinite direct sums in C$^*$-categories, and prove the equivalence with the existing definition due to Ghez, Lima and Roberts in the case of W$^*$-categories. These infinite direct sums specialize to the usual ones in the category of Hilbert spaces, and more generally in any W$^*$-category of normal representations of a W$^*$-algebra.
Finding a universal property for the more general case of direct integrals remains an open problem.
Submitted 3 September, 2019; v1 submitted 10 July, 2019;
originally announced July 2019.
-
A unified construction of semiring-homomorphic graph invariants
Authors:
Tobias Fritz
Abstract:
It has recently been observed by Zuiddam that finite graphs form a preordered commutative semiring under the graph homomorphism preorder together with join and disjunctive product as addition and multiplication, respectively. This led to a new characterization of the Shannon capacity $Θ$ via Strassen's Positivstellensatz: $Θ(\bar{G}) = \inf_f f(G)$, where $f : \mathsf{Graph} \to \mathbb{R}_+$ ranges over all monotone semiring homomorphisms.
Constructing and classifying graph invariants $\mathsf{Graph} \to \mathbb{R}_+$ which are monotone under graph homomorphisms, additive under join, and multiplicative under disjunctive product is therefore of major interest. We call such invariants semiring-homomorphic. The only known such invariants are all of a fractional nature: the fractional chromatic number, the projective rank, the fractional Haemers bounds, as well as the Lovász number (with the latter two evaluated on the complementary graph). Here, we provide a unified construction of these invariants based on linear-like semiring families of graphs. Along the way, we also investigate the additional algebraic structure on the semiring of graphs corresponding to fractionalization.
Linear-like semiring families of graphs are a new concept of combinatorial geometry different from matroids which may be of independent interest.
Submitted 26 July, 2020; v1 submitted 4 January, 2019;
originally announced January 2019.
-
A generalization of Strassen's Positivstellensatz
Authors:
Tobias Fritz
Abstract:
Strassen's Positivstellensatz is a powerful but little known theorem on preordered commutative semirings satisfying a boundedness condition similar to Archimedeanicity. It characterizes the relaxed preorder induced by all monotone homomorphisms to $\mathbb{R}_+$ in terms of a condition involving large powers. Here, we generalize and strengthen Strassen's result. As a generalization, we replace the boundedness condition by a polynomial growth condition; as a strengthening, we prove two further equivalent characterizations of the homomorphism-induced preorder in our generalized setting.
Submitted 28 March, 2020; v1 submitted 19 October, 2018;
originally announced October 2018.
-
Antisymmetry of the stochastic order on all ordered topological spaces
Authors:
Tobias Fritz
Abstract:
In this short note, we prove that the stochastic order of Radon probability measures on any ordered topological space is antisymmetric. This has been known before in various special cases. We give a simple and elementary proof of the general result.
Submitted 19 November, 2019; v1 submitted 15 October, 2018;
originally announced October 2018.
-
Monads, partial evaluations, and rewriting
Authors:
Tobias Fritz,
Paolo Perrone
Abstract:
Monads can be interpreted as encoding formal expressions, or formal operations in the sense of universal algebra. We give a construction which formalizes the idea of "evaluating an expression partially": for example, "2+3" can be obtained as a partial evaluation of "2+2+1". This construction can be given for any monad, and it is linked to the famous bar construction, of which it gives an operational interpretation: the bar construction induces a simplicial set, and its 1-cells are partial evaluations.
We study the properties of partial evaluations for general monads. We prove that whenever the monad is weakly cartesian, partial evaluations can be composed via the usual Kan filler property of simplicial sets, of which we give an interpretation in terms of substitution of terms.
In terms of rewritings, partial evaluations give an abstract reduction system which is reflexive, confluent, and transitive whenever the monad is weakly cartesian.
For the case of probability monads, partial evaluations correspond to what probabilists call conditional expectation of random variables.
This manuscript is part of a work in progress on a general rewriting interpretation of the bar construction.
Submitted 16 May, 2020; v1 submitted 14 October, 2018;
originally announced October 2018.
-
A Criterion for Kan Extensions of Lax Monoidal Functors
Authors:
Tobias Fritz,
Paolo Perrone
Abstract:
In this mainly expository note, we state a criterion for when a left Kan extension of a lax monoidal functor along a strong monoidal functor can itself be equipped with a lax monoidal structure, in a way that results in a left Kan extension in MonCat. This belongs to the general theory of algebraic Kan extensions, as developed by Melliès-Tabareau, Koudenburg and Weber, and is very close to an instance of a theorem of Koudenburg. We find this special case particularly important due to its connections with the theory of graded monads.
Submitted 27 September, 2018;
originally announced September 2018.
-
Stochastic order on metric spaces and the ordered Kantorovich monad
Authors:
Tobias Fritz,
Paolo Perrone
Abstract:
In earlier work, we had introduced the Kantorovich probability monad on complete metric spaces, extending a construction due to van Breugel. Here we extend the Kantorovich monad further to a certain class of ordered metric spaces, by endowing the spaces of probability measures with the usual stochastic order. It can be considered a metric analogue of the probabilistic powerdomain.
The spaces we consider, which we call L-ordered, are spaces where the order satisfies a mild compatibility condition with the metric itself, rather than merely with the underlying topology. As we show, this is related to the theory of Lawvere metric spaces, in which the partial order structure is induced by the zero distances.
We show that the algebras of the ordered Kantorovich monad are the closed convex subsets of Banach spaces equipped with a closed positive cone, with algebra morphisms given by the short and monotone affine maps. Considering the category of L-ordered metric spaces as a locally posetal 2-category, the lax and oplax algebra morphisms are exactly the concave and convex short maps, respectively.
In the unordered case, we had identified the Wasserstein space as the colimit of the spaces of empirical distributions of finite sequences. We prove that this extends to the ordered setting as well by showing that the stochastic order arises by completing the order between the finite sequences, generalizing a recent result of Lawson. The proof holds on any metric space equipped with a closed partial order.
Submitted 18 February, 2020; v1 submitted 29 August, 2018;
originally announced August 2018.
-
Optimal bounds on the positivity of a matrix from a few moments
Authors:
Gemma de las Cuevas,
Tobias Fritz,
Tim Netzer
Abstract:
In many contexts one encounters Hermitian operators $M$ on a Hilbert space whose dimension is so large that it is impossible to write down all matrix entries in an orthonormal basis. How does one determine whether such $M$ is positive semidefinite? Here we approach this problem by deriving asymptotically optimal bounds to the distance to the positive semidefinite cone in Schatten $p$-norm for all integer $p\in[1,\infty)$, assuming that we know the moments $\mathbf{tr}(M^k)$ up to a certain order $k=1,\ldots, m$. We then provide three methods to compute these bounds and relaxations thereof: the sos polynomial method (a semidefinite program), the Handelman method (a linear program relaxation), and the Chebyshev method (a relaxation not involving any optimization). We investigate the analytical and numerical performance of these methods and present a number of example computations, partly motivated by applications to tensor networks and to the theory of free spectrahedra.
Submitted 16 April, 2020; v1 submitted 28 August, 2018;
originally announced August 2018.
-
Curious properties of free hypergraph C*-algebras
Authors:
Tobias Fritz
Abstract:
A finite hypergraph $H$ consists of a finite set of vertices $V(H)$ and a collection of subsets $E(H) \subseteq 2^{V(H)}$ which we consider as partition of unity relations between projection operators. These partition of unity relations freely generate a universal C*-algebra, which we call the "free hypergraph C*-algebra" $C^*(H)$. General free hypergraph C*-algebras were first studied in the context of quantum contextuality. As special cases, the class of free hypergraph C*-algebras comprises quantum permutation groups, maximal group C*-algebras of graph products of finite cyclic groups, and the C*-algebras associated to quantum graph homomorphism, isomorphism, and colouring.
Here, we conduct the first systematic study of aspects of free hypergraph C*-algebras. We show that they coincide with the class of finite colimits of finite-dimensional commutative C*-algebras, and also with the class of C*-algebras associated to synchronous nonlocal games. We had previously shown that it is undecidable to determine whether $C^*(H)$ is nonzero for given $H$. We now show that it is also undecidable to determine whether a given $C^*(H)$ is residually finite-dimensional, and similarly whether it only has infinite-dimensional representations, and whether it has a tracial state. It follows that for each one of these properties, there is $H$ such that the question whether $C^*(H)$ has this property is independent of the ZFC axioms, assuming that these are consistent. We clarify some of the subtleties associated with such independence results in an appendix.
Submitted 9 July, 2019; v1 submitted 28 August, 2018;
originally announced August 2018.
-
Bimonoidal Structure of Probability Monads
Authors:
Tobias Fritz,
Paolo Perrone
Abstract:
We give a conceptual treatment of the notion of joints, marginals, and independence in the setting of categorical probability. This is achieved by endowing the usual probability monads (like the Giry monad) with a monoidal and an opmonoidal structure, mutually compatible (i.e. a bimonoidal structure). If the underlying monoidal category is cartesian monoidal, a bimonoidal structure is given uniquely by a commutative strength. However, if the underlying monoidal category is not cartesian monoidal, a strength is not enough to guarantee all the desired properties of joints and marginals. A bimonoidal structure is then the correct requirement for the more general case.
We explain the theory and the operational interpretation, with the help of the graphical calculus for monoidal categories. We give a definition of stochastic independence based on the bimonoidal structure, compatible with the intuition and with other approaches in the literature for cartesian monoidal categories. We then show as an example that the Kantorovich monad on the category of complete metric spaces is a bimonoidal monad for a non-cartesian monoidal structure.
Submitted 31 January, 2020; v1 submitted 10 April, 2018;
originally announced April 2018.
-
A Probability Monad as the Colimit of Spaces of Finite Samples
Authors:
Tobias Fritz,
Paolo Perrone
Abstract:
We define and study a probability monad on the category of complete metric spaces and short maps. It assigns to each space the space of Radon probability measures on it with finite first moment, equipped with the Kantorovich-Wasserstein distance. This monad is analogous to the Giry monad on the category of Polish spaces, and it extends a construction due to van Breugel for compact and for 1-bounded complete metric spaces.
We prove that this Kantorovich monad arises from a colimit construction on finite power-like constructions, which formalizes the intuition that probability measures are limits of finite samples. The proof relies on a criterion for when an ordinary left Kan extension of lax monoidal functors is a monoidal Kan extension. The colimit characterization allows the development of integration theory and the treatment of measures on spaces of measures, without measure theory.
We also show that the category of algebras of the Kantorovich monad is equivalent to the category of closed convex subsets of Banach spaces with short affine maps as morphisms.
Submitted 12 March, 2019; v1 submitted 14 December, 2017;
originally announced December 2017.
-
Spectrahedral Containment and Operator Systems with Finite-Dimensional Realization
Authors:
Tobias Fritz,
Tim Netzer,
Andreas Thom
Abstract:
Containment problems for polytopes and spectrahedra appear in various applications, such as linear and semidefinite programming, combinatorics, convexity and stability analysis of differential equations. This paper explores the theoretical background of a method proposed by Ben-Tal and Nemirovski. Their method provides a strengthening of the containment problem that is algorithmically well tractable. To analyze this method, we study abstract operator systems, and investigate when they have a finite-dimensional concrete realization. Our results give some profound insight into their approach. They imply that when testing the inclusion of a fixed polyhedral cone in an arbitrary spectrahedron, the strengthening is tight if and only if the polyhedral cone is a simplex. This is true independent of the representation of the polytope. We also deduce error bounds in the other cases, simplifying and extending recent results by various authors.
Submitted 10 April, 2017; v1 submitted 26 September, 2016;
originally announced September 2016.
-
The Inflation Technique for Causal Inference with Latent Variables
Authors:
Elie Wolfe,
Robert W. Spekkens,
Tobias Fritz
Abstract:
The problem of causal inference is to determine if a given probability distribution on observed variables is compatible with some causal structure. The difficult case is when the causal structure includes latent variables. We here introduce the $\textit{inflation technique}$ for tackling this problem. An inflation of a causal structure is a new causal structure that can contain multiple copies of each of the original variables, but where the ancestry of each copy mirrors that of the original. To every distribution of the observed variables that is compatible with the original causal structure, we assign a family of marginal distributions on certain subsets of the copies that are compatible with the inflated causal structure. It follows that compatibility constraints for the inflation can be translated into compatibility constraints for the original causal structure. Even if the constraints at the level of inflation are weak, such as observable statistical independences implied by disjoint causal ancestry, the translated constraints can be strong. We apply this method to derive new inequalities whose violation by a distribution witnesses that distribution's incompatibility with the causal structure (of which Bell inequalities and Pearl's instrumental inequality are prominent examples). We describe an algorithm for deriving all such inequalities for the original causal structure that follow from ancestral independences in the inflation. For three observed binary variables with pairwise common causes, it yields inequalities that are stronger in at least some aspects than those obtainable by existing methods. We also describe an algorithm that derives a weaker set of inequalities but is more efficient. Finally, we discuss which inflations are such that the inequalities one obtains from them remain valid even for quantum (and post-quantum) generalizations of the notion of a causal model.
Submitted 22 July, 2019; v1 submitted 2 September, 2016;
originally announced September 2016.
-
Quantum logic is undecidable
Authors:
Tobias Fritz
Abstract:
We investigate the first-order theory of closed subspaces of complex Hilbert spaces in the signature $(\lor,\perp,0,1)$, where `$\perp$' is the orthogonality relation. Our main result is that already its quasi-identities are undecidable: there is no algorithm to decide whether an implication between equations and orthogonality relations implies another equation. This is a corollary of a recent result of Slofstra in combinatorial group theory. It follows upon reinterpreting that result in terms of the hypergraph approach to quantum contextuality, for which it constitutes a proof of the inverse sandwich conjecture. It can also be interpreted as stating that a certain quantum satisfiability problem is undecidable.
Submitted 21 June, 2021; v1 submitted 20 July, 2016;
originally announced July 2016.
-
(Almost) C*-algebras as sheaves with self-action
Authors:
Cecilia Flori,
Tobias Fritz
Abstract:
Via Gelfand duality, a unital C*-algebra $A$ induces a functor from compact Hausdorff spaces to sets, $\mathsf{CHaus}\to\mathsf{Set}$. We show how this functor encodes standard functional calculus in $A$ as well as its multivariate generalization. Certain sheaf conditions satisfied by this functor provide a further generalization of functional calculus. Considering such sheaves $\mathsf{CHaus}\to\mathsf{Set}$ abstractly, we prove that the piecewise C*-algebras of van den Berg and Heunen are equivalent to a full subcategory of the category of sheaves, where a simple additional constraint characterizes the objects in the subcategory. It is open whether this additional constraint holds automatically, in which case piecewise C*-algebras would be the same as sheaves $\mathsf{CHaus}\to\mathsf{Set}$.
Intuitively, these structures capture the commutative aspects of C*-algebra theory. In order to find a complete reaxiomatization of unital C*-algebras within this language, we introduce almost C*-algebras as piecewise C*-algebras equipped with a notion of inner automorphisms in terms of a self-action. We provide some evidence for the conjecture that the forgetful functor from unital C*-algebras to almost C*-algebras is fully faithful, and ask whether it is an equivalence of categories. We also develop an analogous notion of almost group, and prove that the forgetful functor from groups to almost groups is not full.
In terms of quantum physics, our work can be seen as an attempt at a reconstruction of quantum theory from physically meaningful axioms, as realized by Hardy and others in a different framework. Our ideas are inspired by and also provide new input for the topos-theoretic approach to quantum theory.
Submitted 2 September, 2017; v1 submitted 5 December, 2015;
originally announced December 2015.
-
Resource convertibility and ordered commutative monoids
Authors:
Tobias Fritz
Abstract:
Resources and their use and consumption form a central part of our life. Many branches of science and engineering are concerned with the question of which given resource objects can be converted into which target resource objects. For example, information theory studies the conversion of a noisy communication channel instance into an exchange of information. Inspired by work in quantum information theory, we develop a general mathematical toolbox for this type of question. The convertibility of resources into other ones and the possibility of combining resources is accurately captured by the mathematics of ordered commutative monoids. As an intuitive example, we consider chemistry, where chemical reaction equations such as \[ \mathrm{2H_2 + O_2} \to \mathrm{2H_2O} \] are concerned both with a convertibility relation "$\to$" and a combination operation "$+$". We study ordered commutative monoids from an algebraic and functional-analytic perspective and derive a wealth of results which should have applications to concrete resource theories, such as a formula for rates of conversion. As a running example showing that ordered commutative monoids are also of purely mathematical interest, we exemplify our results with the ordered commutative monoid of graphs.
While closely related to both Girard's linear logic and to Deutsch's constructor theory, our framework also produces results very reminiscent of the utility theorem of von Neumann and Morgenstern in decision theory and of a theorem of Lieb and Yngvason on thermodynamics.
Concerning pure algebra, our observation is that some pieces of algebra can be developed in a context in which equality is not necessarily symmetric, i.e. in which the equality relation is replaced by an ordering relation. For example, notions like cancellativity or torsion-freeness are still sensible and very natural concepts in our ordered setting.
Submitted 1 July, 2015; v1 submitted 14 April, 2015;
originally announced April 2015.
-
Notes on Triangulated Categories
Authors:
Tobias Fritz
Abstract:
We give an elementary introduction to the theory of triangulated categories covering their axioms, homological algebra in triangulated categories, triangulated subcategories, and Verdier localization. We try to use a minimal set of axioms for triangulated categories and derive all other statements from these, including the existence of biproducts. We conclude with a list of examples.
Submitted 15 July, 2014; v1 submitted 14 July, 2014;
originally announced July 2014.
-
A Bayesian Characterization of Relative Entropy
Authors:
John C. Baez,
Tobias Fritz
Abstract:
We give a new characterization of relative entropy, also known as the Kullback-Leibler divergence. We use a number of interesting categories related to probability theory. In particular, we consider a category FinStat where an object is a finite set equipped with a probability distribution, while a morphism is a measure-preserving function $f: X \to Y$ together with a stochastic right inverse $s: Y \to X$. The function $f$ can be thought of as a measurement process, while $s$ provides a hypothesis about the state of the measured system given the result of a measurement. Given this data we can define the entropy of the probability distribution on $X$ relative to the "prior" given by pushing the probability distribution on $Y$ forwards along $s$. We say that $s$ is "optimal" if these distributions agree. We show that any convex linear, lower semicontinuous functor from FinStat to the additive monoid $[0,\infty]$ which vanishes when $s$ is optimal must be a scalar multiple of this relative entropy. Our proof is independent of all earlier characterizations, but inspired by the work of Petz.
Submitted 11 July, 2014; v1 submitted 13 February, 2014;
originally announced February 2014.
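The construction described in this abstract can be made concrete on a small example. The following is a minimal Python sketch, assuming hypothetical toy data (the sets, the distribution `p`, the map `f`, and the hypotheses `s` and `s_optimal` are illustrative and not taken from the paper). It pushes the distribution on $Y$ forwards along $s$ to obtain the prior on $X$ and then computes the relative entropy of `p` with respect to that prior.

```python
import math

# Hypothetical toy data: X = {0, 1, 2}, Y = {0, 1}.
p = {0: 0.5, 1: 0.25, 2: 0.25}   # probability distribution on X
f = {0: 0, 1: 1, 2: 1}           # measurement f: X -> Y

# Pushforward q = f_*(p) on Y, so that f is measure-preserving by construction.
q = {}
for x, px in p.items():
    q[f[x]] = q.get(f[x], 0.0) + px

# Stochastic right inverse s: Y -> X, stored as s[y][x] = probability of x given y.
# Each s[y] is supported on the fibre of f over y, so applying f after s returns y.
s = {0: {0: 1.0}, 1: {1: 0.75, 2: 0.25}}
# A hypothesis whose induced prior coincides with p ("optimal" in the abstract's sense).
s_optimal = {0: {0: 1.0}, 1: {1: 0.5, 2: 0.5}}


def prior(s, q):
    """Push the distribution q on Y forwards along s to a distribution on X."""
    r = {}
    for y, qy in q.items():
        for x, sxy in s[y].items():
            r[x] = r.get(x, 0.0) + sxy * qy
    return r


def relative_entropy(p, r):
    """Relative entropy of p with respect to r, in nats."""
    total = 0.0
    for x, px in p.items():
        if px == 0:
            continue
        if r.get(x, 0.0) == 0:
            return math.inf
        total += px * math.log(px / r[x])
    return total


print(relative_entropy(p, prior(s, q)))          # positive: s is not optimal
print(relative_entropy(p, prior(s_optimal, q)))  # zero: prior agrees with p
```

For the non-optimal hypothesis the value works out to $\tfrac14 \ln \tfrac43$, and it vanishes for the optimal one, which matches the vanishing condition stated in the abstract.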
-
Compositories and Gleaves
Authors:
Cecilia Flori,
Tobias Fritz
Abstract:
Sheaves are objects of a local nature: a global section is determined by how it looks locally. Hence, a sheaf cannot describe mathematical structures which contain global or nonlocal geometric information. To fill this gap, we introduce the theory of "gleaves", which are presheaves equipped with an additional "gluing operation" of compatible pairs of local sections. This generalizes the conditional product structures of Dawid and Studený, which correspond to gleaves on distributive lattices. Our examples include the gleaf of metric spaces and the gleaf of joint probability distributions. A result of Johnstone shows that a category of gleaves can have a subobject classifier despite not being cartesian closed.
Gleaves over the simplex category $\Delta$, which we call compositories, can be interpreted as a new kind of higher category in which the composition of an $m$-morphism and an $n$-morphism along a common $k$-morphism face results in an $(m+n-k)$-morphism. The distinctive feature of this composition operation is that the original morphisms can be recovered from the composite morphism as initial and final faces. Examples of compositories include nerves of categories and compositories of higher spans.
Submitted 20 October, 2016; v1 submitted 29 August, 2013;
originally announced August 2013.
-
A Combinatorial Approach to Nonlocality and Contextuality
Authors:
Antonio Acín,
Tobias Fritz,
Anthony Leverrier,
Ana Belén Sainz
Abstract:
So far, most of the literature on (quantum) contextuality and the Kochen-Specker theorem seems either to concern particular examples of contextuality, or be considered as quantum logic. Here, we develop a general formalism for contextuality scenarios based on the combinatorics of hypergraphs which significantly refines a similar recent approach by Cabello, Severini and Winter (CSW). In contrast to CSW, we explicitly include the normalization of probabilities, which gives us a much finer control over the various sets of probabilistic models like classical, quantum and generalized probabilistic. In particular, our framework specializes to (quantum) nonlocality in the case of Bell scenarios, which arise very naturally from a certain product of contextuality scenarios due to Foulis and Randall. In the spirit of CSW, we find close relationships to several graph invariants. The recently proposed Local Orthogonality principle turns out to be a special case of a general principle for contextuality scenarios related to the Shannon capacity of graphs. Our results imply that it is strictly dominated by a low level of the Navascués-Pironio-Acín hierarchy of semidefinite programs, which we also apply to contextuality scenarios.
We derive a wealth of results in our framework, many of these relating to quantum and supraquantum contextuality and nonlocality, and state numerous open problems. For example, we show that the set of quantum models on a contextuality scenario can in general not be characterized in terms of a graph invariant.
In terms of graph theory, our main result is this: there exist two graphs $G_1$ and $G_2$ with the properties \begin{align*} \alpha(G_1) &= \Theta(G_1), & \alpha(G_2) &= \vartheta(G_2), \\[6pt] \Theta(G_1\boxtimes G_2) & > \Theta(G_1)\cdot \Theta(G_2),& \Theta(G_1 + G_2) & > \Theta(G_1) + \Theta(G_2). \end{align*}
Submitted 12 January, 2015; v1 submitted 17 December, 2012;
originally announced December 2012.
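The graph invariants named in this abstract, the independence number $\alpha$ and the Shannon capacity $\Theta$, can be illustrated on the classic pentagon $C_5$. This is a standard textbook example and not one of the graphs $G_1$, $G_2$ from the paper. The Python sketch below computes $\alpha(C_5)$ by brute force and verifies an independent set of size 5 in the strong product $C_5 \boxtimes C_5$, which shows $\Theta(C_5) \geq \sqrt{5} > \alpha(C_5)$.

```python
from itertools import combinations

N = 5  # number of vertices of the pentagon C_5


def adjacent_c5(a, b):
    """Two vertices of C_5 are adjacent iff they differ by 1 modulo 5."""
    return (a - b) % N in (1, N - 1)


def adjacent_strong(u, v):
    """Strong product: distinct vertices are adjacent iff every coordinate
    pair is either equal or adjacent in C_5."""
    if u == v:
        return False
    return all(a == b or adjacent_c5(a, b) for a, b in zip(u, v))


def is_independent(vertices, adjacent):
    return all(not adjacent(u, v) for u, v in combinations(vertices, 2))


def independence_number(vertices, adjacent):
    """Brute-force search, only sensible for very small graphs."""
    for k in range(len(vertices), 0, -1):
        if any(is_independent(c, adjacent) for c in combinations(vertices, k)):
            return k
    return 0


alpha_c5 = independence_number(list(range(N)), adjacent_c5)

# Candidate independent set in C_5 x C_5 (strong product): the points (i, 2i mod 5).
code = [(i, (2 * i) % N) for i in range(N)]

print(alpha_c5)                               # independence number of C_5
print(is_independent(code, adjacent_strong))  # whether the 5 points are independent
```

Since the five points are pairwise non-adjacent, $\alpha(C_5 \boxtimes C_5) \geq 5$, which exceeds $\alpha(C_5)^2 = 4$.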
-
Can you compute the operator norm?
Authors:
Tobias Fritz,
Tim Netzer,
Andreas Thom
Abstract:
In this note we address various algorithmic problems that arise in the computation of the operator norm in unitary representations of a group on Hilbert space. We show that the operator norm in the universal unitary representation is computable if the group is residually finite-dimensional or amenable with decidable word problem. Moreover, we relate the computability of the operator norm on the product of non-abelian free groups to Kirchberg's QWEP Conjecture, a fundamental open problem in the theory of operator algebras.
Submitted 3 January, 2013; v1 submitted 4 July, 2012;
originally announced July 2012.
-
On infinite-dimensional state spaces
Authors:
Tobias Fritz
Abstract:
It is well known that the canonical commutation relation $[x,p]=i$ can be realized only on an infinite-dimensional Hilbert space. While any finite set of experimental data can also be explained in terms of a finite-dimensional Hilbert space by approximating the commutation relation, Occam's razor prefers the infinite-dimensional model in which $[x,p]=i$ holds on the nose. Any approach that tries to detect infinite-dimensionality will necessarily have to rely on this kind of reasoning. One drawback of using the canonical commutation relation for this purpose is that its operational meaning is unclear. Here, we identify an operationally well-defined context from which an analogous conclusion can be drawn: if two unitary transformations $U,V$ on a quantum system satisfy the relation $V^{-1}U^2V=U^3$, then finite-dimensionality entails the relation $UV^{-1}UV=V^{-1}UVU$; this implication strongly fails in some infinite-dimensional realizations. This is a result from combinatorial group theory for which we give a new proof. This proof adapts to the consideration of cases where the assumed relation $V^{-1}U^2V=U^3$ holds only up to $\varepsilon$ and then yields a lower bound on the dimension.
△ Less
Submitted 26 April, 2013; v1 submitted 16 February, 2012;
originally announced February 2012.
-
Polyhedral duality in Bell scenarios with two binary observables
Authors:
Tobias Fritz
Abstract:
For the Bell scenario with two parties and two binary observables per party, it is known that the no-signaling polytope is the polyhedral dual (polar) of the Bell polytope. Computational evidence suggests that this duality also holds for three parties. Using ideas of Werner, Wolf, Żukowski and Brukner, we prove this for any number of parties by describing a simple linear bijection mapping (tight) Bell inequalities to (extremal) no-signaling boxes and vice versa. Furthermore, a symmetry-based technique for extending Bell inequalities (resp. no-signaling boxes) with two binary observables from n parties to n+1 parties is described; the Mermin-Klyshko family of Bell inequalities arises in this way, as well as 11 of the 46 classes of tight Bell inequalities for 3 parties. Finally, we ask whether the set of quantum correlations is self-dual with respect to our transformation. We find this not to be the case in general, although it holds for 2 parties on the level of correlations. This self-duality implies Tsirelson's bound for the CHSH inequality.
Submitted 18 June, 2012; v1 submitted 1 February, 2012;
originally announced February 2012.
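The three sets of correlations mentioned in this abstract (Bell polytope, quantum set, no-signaling polytope) can be compared through the CHSH expression. The Python sketch below is an illustration only and does not implement the duality map from the paper. It evaluates CHSH on all local deterministic strategies, on the correlators of the standard optimal quantum strategy (taken here as a known fact rather than derived), and on the PR box. The function name `chsh` and the correlator layout `E[x][y]` are assumptions made for this example.

```python
from itertools import product
import math


def chsh(E):
    """CHSH expression for a 2x2 table of correlators E[x][y] = <A_x B_y>."""
    return E[0][0] + E[0][1] + E[1][0] - E[1][1]


# Local deterministic strategies: each observable takes a fixed value +1 or -1.
local_max = max(
    chsh([[a0 * b0, a0 * b1], [a1 * b0, a1 * b1]])
    for a0, a1, b0, b1 in product((1, -1), repeat=4)
)

# Correlators of the standard optimal quantum strategy on a maximally
# entangled qubit pair: all equal to 1/sqrt(2) except E[1][1], which flips sign.
c = 1 / math.sqrt(2)
quantum_value = chsh([[c, c], [c, -c]])

# PR box, an extremal no-signaling box: perfect (anti)correlation in every setting.
pr_value = chsh([[1, 1], [1, -1]])

print(local_max, quantum_value, pr_value)
```

The three values are the local bound 2, Tsirelson's bound $2\sqrt{2}$, and the algebraic maximum 4.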
-
Entropic Inequalities and Marginal Problems
Authors:
Tobias Fritz,
Rafael Chaves
Abstract:
A marginal problem asks whether a given family of marginal distributions for some set of random variables arises from some joint distribution of these variables. Here we point out that the existence of such a joint distribution imposes non-trivial conditions already on the level of Shannon entropies of the given marginals. These entropic inequalities are necessary (but not sufficient) criteria for the existence of a joint distribution. For every marginal problem, a list of such Shannon-type entropic inequalities can be calculated by Fourier-Motzkin elimination, and we offer a software interface to a Fourier-Motzkin solver for doing so. For the case that the hypergraph of given marginals is a cycle graph, we provide a complete analytic solution to the problem of classifying all relevant entropic inequalities, and use this result to bound the decay of correlations in stochastic processes. Furthermore, we show that Shannon-type inequalities for differential entropies are not relevant for continuous-variable marginal problems; non-Shannon-type inequalities are, both in the discrete and in the continuous case. In contrast to other approaches, our general framework easily adapts to situations where one has additional (conditional) independence requirements on the joint distribution, as in the case of graphical models. We end with a list of open problems.
A complementary article discusses applications to quantum nonlocality and contextuality.
Submitted 26 September, 2012; v1 submitted 20 December, 2011;
originally announced December 2011.
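As a small illustration of how an entropic inequality can witness the non-existence of a joint distribution, consider three variables $A, B, C$ with given pairwise marginals (the 3-cycle). Any joint distribution satisfies $H(AC) + H(B) \leq H(AB) + H(BC)$, which follows from submodularity and monotonicity of Shannon entropy. The Python sketch below checks this one inequality on hypothetical toy marginals; it is not the software interface mentioned in the abstract and does not perform Fourier-Motzkin elimination.

```python
import math


def entropy(dist):
    """Shannon entropy in bits of a distribution given as {outcome: probability}."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)


def second_marginal(dist):
    """Marginal of the second variable of a two-variable distribution."""
    out = {}
    for (_, b), p in dist.items():
        out[b] = out.get(b, 0.0) + p
    return out


def passes_entropic_test(p_ab, p_bc, p_ac):
    """Check H(AC) + H(B) <= H(AB) + H(BC), a necessary condition for a joint."""
    h_b = entropy(second_marginal(p_ab))
    return entropy(p_ac) + h_b <= entropy(p_ab) + entropy(p_bc) + 1e-12


# Hypothetical marginals on binary variables.
correlated = {(0, 0): 0.5, (1, 1): 0.5}                          # perfectly correlated pair
independent = {(a, c): 0.25 for a in (0, 1) for c in (0, 1)}     # independent uniform pair

# A = B and B = C force A = C, so an independent (A, C) marginal is impossible.
print(passes_entropic_test(correlated, correlated, independent))
# All three pairs perfectly correlated: the inequality holds.
print(passes_entropic_test(correlated, correlated, correlated))
```

Passing the test does not guarantee that a joint distribution exists, since the abstract notes that these criteria are necessary but not sufficient.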
-
Velocity Polytopes of Periodic Graphs and a No-Go Theorem for Digital Physics
Authors:
Tobias Fritz
Abstract:
A periodic graph in dimension $d$ is a directed graph with a free action of $\mathbb{Z}^d$ with only finitely many orbits. It can conveniently be represented in terms of an associated finite graph with weights in $\mathbb{Z}^d$, corresponding to a $\mathbb{Z}^d$-bundle with connection. Here we use the weight sums along cycles in this associated graph to construct a certain polytope in $\mathbb{R}^d$, which we regard as a geometrical invariant associated to the periodic graph. It is the unit ball of a norm on $\mathbb{R}^d$ describing the large-scale geometry of the graph. It has a physical interpretation as the set of attainable velocities of a particle on the graph which can hop along one edge per timestep. Since a polytope necessarily has distinguished directions, there is no periodic graph for which this velocity set is isotropic. In the context of classical physics, this can be viewed as a no-go theorem for the emergence of an isotropic space from a discrete structure.
Submitted 17 June, 2013; v1 submitted 9 September, 2011;
originally announced September 2011.
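The cycle-based construction in this abstract can be sketched on a small example. The Python code below enumerates the simple cycles of a hypothetical two-vertex weighted graph and records, for each cycle, its weight sum divided by its length. Dividing by the cycle length is an assumption made here, motivated by the "one edge per timestep" interpretation in the abstract; the abstract itself only says that weight sums along cycles are used. The polytope would then be the convex hull of the resulting points, which this sketch does not compute.

```python
from fractions import Fraction

# Hypothetical finite graph with weights in Z^2: edges are (source, target, weight).
edges = [
    ("A", "B", (0, 0)), ("A", "B", (1, 0)), ("A", "B", (0, 1)),
    ("B", "A", (0, 0)), ("B", "A", (-1, 0)), ("B", "A", (0, -1)),
]


def cycle_velocities(edges):
    """Return the set of (weight sum / length) over all simple directed cycles."""
    out = {}
    for u, v, w in edges:
        out.setdefault(u, []).append((v, w))
    dim = len(edges[0][2])
    velocities = set()

    def walk(start, node, visited, total, length):
        for nxt, w in out.get(node, []):
            new_total = tuple(t + wi for t, wi in zip(total, w))
            if nxt == start:
                # Closed a simple cycle of length + 1 edges.
                velocities.add(tuple(Fraction(t, length + 1) for t in new_total))
            elif nxt not in visited and nxt > start:
                # Only visit vertices larger than start, so each cycle is
                # enumerated from its smallest vertex exactly once.
                walk(start, nxt, visited | {nxt}, new_total, length + 1)

    for start in out:
        walk(start, start, {start}, (0,) * dim, 0)
    return velocities


velocities = cycle_velocities(edges)
for v in sorted(velocities):
    print(tuple(str(c) for c in v))
```

For this example the cycle velocities are the origin together with the six points $\pm(\tfrac12, 0)$, $\pm(0, \tfrac12)$, $\pm(\tfrac12, -\tfrac12)$, whose convex hull is a hexagon and therefore has distinguished directions.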