-
Classification of Non-Degenerate Symmetric Bilinear and Quadratic Forms in the Verlinde Category $\mathrm{Ver}_4^+$
Authors:
Iz Chen,
Arun S. Kannan,
Krishna Pothapragada
Abstract:
Although Deligne's theorem classifies all symmetric tensor categories (STCs) with moderate growth over algebraically closed fields of characteristic zero, the classification does not extend to positive characteristic. At the forefront of the study of STCs is the search for an analog to Deligne's theorem in positive characteristic, and it has become increasingly apparent that the Verlinde categorie…
▽ More
Although Deligne's theorem classifies all symmetric tensor categories (STCs) with moderate growth over algebraically closed fields of characteristic zero, the classification does not extend to positive characteristic. At the forefront of the study of STCs is the search for an analog to Deligne's theorem in positive characteristic, and it has become increasingly apparent that the Verlinde categories are to play a significant role. Moreover, these categories are largely unstudied, but have already shown very interesting phenomena as both a generalization of and a departure from superalgebra and supergeometry. In this paper, we study $\mathrm{Ver}_4^+$, the simplest non-trivial Verlinde category in characteristic $2$. In particular, we classify all isomorphism classes of non-degenerate symmetric bilinear forms and non-degenerate quadratic forms and study the associated Witt semi-ring that arises from the addition and multiplication operations on bilinear forms.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
The Steinberg Tensor Product Theorem for General Linear Group Schemes in the Verlinde Category
Authors:
Arun S. Kannan
Abstract:
The Steinberg tensor product theorem is a fundamental result in the modular representation theory of reductive algebraic groups. It describes any finite-dimensional simple module of highest weight $λ$ over such a group as the tensor product of Frobenius twists of simple modules with highest weights the weights appearing in a $p$-adic decomposition of $λ$, thereby reducing the character problem to…
▽ More
The Steinberg tensor product theorem is a fundamental result in the modular representation theory of reductive algebraic groups. It describes any finite-dimensional simple module of highest weight $λ$ over such a group as the tensor product of Frobenius twists of simple modules with highest weights the weights appearing in a $p$-adic decomposition of $λ$, thereby reducing the character problem to a a finite collection of weights. In recent years this theorem has been extended to various quasi-reductive supergroup schemes. In this paper, we prove the analogous result for the general linear group scheme $GL(X)$ for any object $X$ in the Verlinde category $\mathrm{Ver}_p$.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
From the Albert algebra to Kac's ten-dimensional Jordan superalgebra via tensor categories in characteristic 5
Authors:
Alberto Elduque,
Pavel Etingof,
Arun S. Kannan
Abstract:
Kac's ten-dimensional simple Jordan superalgebra over a field of characteristic 5 is obtained from a process of semisimplification, via tensor categories, from the exceptional simple Jordan algebra (or Albert algebra), together with a suitable order 5 automorphism. This explains McCrimmon's 'bizarre result' asserting that, in characteristic 5, Kac's superalgebra is a sort of 'degree 3 Jordan super…
▽ More
Kac's ten-dimensional simple Jordan superalgebra over a field of characteristic 5 is obtained from a process of semisimplification, via tensor categories, from the exceptional simple Jordan algebra (or Albert algebra), together with a suitable order 5 automorphism. This explains McCrimmon's 'bizarre result' asserting that, in characteristic 5, Kac's superalgebra is a sort of 'degree 3 Jordan superalgebra'. As an outcome, the exceptional simple Lie superalgebra el(5;5), specific of characteristic 5, is obtained from the simple Lie algebra of type $E_8$ and an order 5 automorphism. In the process, precise recipes to obtain superalgebras from algebras in the category of representations of the cyclic group $C_p$, over a field of characteristic $p>2$, are given.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Approximation of group explainers with coalition structure using Monte Carlo sampling on the product space of coalitions and features
Authors:
Konstandinos Kotsiopoulos,
Alexey Miroshnikov,
Khashayar Filom,
Arjun Ravi Kannan
Abstract:
In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor…
▽ More
In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor vector. By viewing these explainers as expectations over appropriate sample spaces, we design a novel Monte Carlo sampling algorithm that estimates them at a reduced complexity that depends linearly on the size of the background dataset. We set up a rigorous framework for the statistical analysis and obtain error bounds for our sampling methods. The advantage of this approach is that it is fast, easily implementable, and model-agnostic. Furthermore, it has similar statistical accuracy as other known estimation techniques that are more complex and model-specific. We provide rigorous proofs of statistical convergence, as well as numerical experiments whose results agree with our theoretical findings.
△ Less
Submitted 18 April, 2024; v1 submitted 17 March, 2023;
originally announced March 2023.
-
Benefits of Multiobjective Learning in Solar Energy Prediction
Authors:
Aswin Kannan
Abstract:
While the space of renewable energy forecasting has received significant attention in the last decade, literature has primarily focused on machine learning models that train on only one objective at a time. A host of classification (and regression) tasks in energy markets lead to highly imbalanced training data. Say, to balance reserves, it is natural for market regulators to have a choice to be m…
▽ More
While the space of renewable energy forecasting has received significant attention in the last decade, literature has primarily focused on machine learning models that train on only one objective at a time. A host of classification (and regression) tasks in energy markets lead to highly imbalanced training data. Say, to balance reserves, it is natural for market regulators to have a choice to be more/less averse to false negatives (can lead to poor operating efficiency and costs) than to false positives (can lead to market shortfall). Besides accuracy, other metrics like algorithmic bias, RMBE (in regression problems), inferencing time, and model sparsity are also very crucial. This paper is amongst the firsts in the field of renewable energy forecasting that attempts to present a Pareto frontier of solutions (tradeoffs), that answers the question on handling multiple objectives by means of using the XGBoost model (Gradient Boosted Trees). Our proposed algorithm relies on using a sequence of weighted (uniform meshes) single objective model training routines. Real world data examples from the Amherst (Massachusetts, United States) solar energy prediction panels with both triobjective (focus on accuracy) and biojective (focus on fairness/bias) classification instances are considered. Numerical experiments appear promising and clear advantages over single objective methods are seen by observing the spread and variety of solutions (model configurations).
△ Less
Submitted 28 January, 2023;
originally announced January 2023.
-
Representation Stability and Finite Orthogonal Groups
Authors:
Zifan Wang,
Arun S. Kannan
Abstract:
In this paper, we prove stability results about orthogonal groups over finite commutative rings where 2 is a unit. Inspired by Putman and Sam (2017), we construct a category $\mathbf{OrI}(R)$ and prove a Noetherianity theorem for the category of $\mathbf{OrI}(R)$-modules. This implies an asymptotic structure theorem for orthogonal groups. In addition, we show general homological stability theorems…
▽ More
In this paper, we prove stability results about orthogonal groups over finite commutative rings where 2 is a unit. Inspired by Putman and Sam (2017), we construct a category $\mathbf{OrI}(R)$ and prove a Noetherianity theorem for the category of $\mathbf{OrI}(R)$-modules. This implies an asymptotic structure theorem for orthogonal groups. In addition, we show general homological stability theorems for orthogonal groups, with both untwisted and twisted coefficients, partially generalizing a result of Charney (1987).
△ Less
Submitted 20 February, 2022;
originally announced February 2022.
-
Stable Centres II: Finite Classical Groups
Authors:
Arun S. Kannan,
Christopher Ryba
Abstract:
Farahat and Higman constructed an algebra $\mathrm{FH}$ interpolating the centres of symmetric group algebras $Z(\mathbb{Z}S_n)$ by proving that the structure constants in these rings are "polynomial in $n$". Inspired by a construction of $\mathrm{FH}$ due to Ivanov and Kerov, we prove for $G_n = GL_n, U_n, Sp_{2n}, O_n$, that the structure constants of $Z(\mathbb{Z}G_n(\mathbb{F}_q))$ are "polyno…
▽ More
Farahat and Higman constructed an algebra $\mathrm{FH}$ interpolating the centres of symmetric group algebras $Z(\mathbb{Z}S_n)$ by proving that the structure constants in these rings are "polynomial in $n$". Inspired by a construction of $\mathrm{FH}$ due to Ivanov and Kerov, we prove for $G_n = GL_n, U_n, Sp_{2n}, O_n$, that the structure constants of $Z(\mathbb{Z}G_n(\mathbb{F}_q))$ are "polynomial in $q^n$", allowing us to construct an equivalent of the Farahat-Higman algebra in each case.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.
-
Model-agnostic bias mitigation methods with regressor distribution control for Wasserstein-based fairness metrics
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Ryan Franks,
Arjun Ravi Kannan
Abstract:
This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to…
▽ More
This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to the bias, we reduce the dimensionality of the problem by mitigating the bias originating from those predictors. The post-processing methodology involves resha** the predictor distributions by balancing the positive and negative bias explanations and allows for the regressor bias to decrease. We design an algorithm that uses Bayesian optimization to construct the bias-performance efficient frontier over the family of post-processed models, from which an optimal model is selected. Our novel methodology performs optimization in low-dimensional spaces and avoids expensive model retraining.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
New Constructions of Exceptional Simple Lie Superalgebras with Integer Cartan Matrix in Characteristics 3 and 5 via Tensor Categories
Authors:
Arun S. Kannan
Abstract:
Using tensor categories, we present new constructions of several of the exceptional simple Lie superalgebras with integer Cartan matrix in characteristic $p = 3$ and $p = 5$ from the complete classification of modular Lie superalgebras with indecomposable Cartan matrix and their simple subquotients over algebraically closed fields by Bouarroudj, Grozman, and Leites in 2009. Specifically, let…
▽ More
Using tensor categories, we present new constructions of several of the exceptional simple Lie superalgebras with integer Cartan matrix in characteristic $p = 3$ and $p = 5$ from the complete classification of modular Lie superalgebras with indecomposable Cartan matrix and their simple subquotients over algebraically closed fields by Bouarroudj, Grozman, and Leites in 2009. Specifically, let $\mathbfα_p$ denote the kernel of the Frobenius endomorphism on the additive group scheme $\mathbb{G}_a$ over an algebraically closed field of characteristic $p$. The Verlinde category $\mathrm{Ver}_p$ is the semisimplification of the representation category $\mathrm{Rep} \ \mathbfα_p$, and $\mathrm{Ver}_p$ contains the category of super vector spaces as a full subcategory. Each exceptional Lie superalgebra we construct is realized as the image of an exceptional Lie algebra equipped with a nilpotent derivation of order at most $p$ under the semisimplification functor from $\mathrm{Rep} \ \mathbfα_p$ to $\mathrm{Ver}_p$.
△ Less
Submitted 16 May, 2022; v1 submitted 12 August, 2021;
originally announced August 2021.
-
Lectures on Symmetric Tensor Categories
Authors:
Pavel Etingof,
Arun S. Kannan
Abstract:
This is an expanded version of the notes by the second author of the lectures on symmetric tensor categories given by the first author at Ohio State University in March 2019 and later at ICRA-2020 in November 2020. We review some aspects of the current state of the theory of symmetric tensor categories and discuss their applications, including ones unavailable in the literature.
This is an expanded version of the notes by the second author of the lectures on symmetric tensor categories given by the first author at Ohio State University in March 2019 and later at ICRA-2020 in November 2020. We review some aspects of the current state of the theory of symmetric tensor categories and discuss their applications, including ones unavailable in the literature.
△ Less
Submitted 10 November, 2021; v1 submitted 8 March, 2021;
originally announced March 2021.
-
Stability theory of game-theoretic group feature explanations for machine learning models
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Khashayar Filom,
Arjun Ravi Kannan
Abstract:
In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds f…
▽ More
In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds for both marginal and conditional explanations. The differences between the two games are then elucidated, such as showing that the marginal explanations can become discontinuous on some naturally-designed domains, while the conditional explanations remain stable. In the second part of our work, group explanation methodologies are devised based on game values with coalition structure, where the features are grouped based on dependencies. We show analytically that grou** features this way has a stabilizing effect on the marginal operator on both group and individual levels, and allows for the unification of marginal and conditional explanations. Our results are verified in a number of numerical experiments where an information-theoretic measure of dependence is used for grou**.
△ Less
Submitted 3 April, 2024; v1 submitted 22 February, 2021;
originally announced February 2021.
-
Wasserstein-based fairness interpretability framework for machine learning models
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Ryan Franks,
Arjun Ravi Kannan
Abstract:
The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the fa…
▽ More
The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the favorability of both the model and predictors with respect to the non-protected class. The quantification is accomplished by the use of transport theory, which gives rise to the decomposition of the model bias and bias explanations to positive and negative contributions. To gain more insight into the role of favorability and allow for additivity of bias explanations, we adapt techniques from cooperative game theory.
△ Less
Submitted 8 March, 2022; v1 submitted 5 November, 2020;
originally announced November 2020.
-
Characters for Projective Modules in the BGG Category $\mathcal{O}$ for the Orthosymplectic Lie Superalgebra $\mathfrak{osp}(3|4)$
Authors:
Arun S. Kannan,
Honglin Zhu
Abstract:
We determine the Verma multiplicities of standard filtrations of projective modules for integral atypical blocks in the BGG category $\mathcal{O}$ for the orthosymplectic Lie superalgebras $\mathfrak{osp}(3|4)$ by way of translation functors. We then explicitly determine the composition factor multiplicities of Verma modules using BGG reciprocity.
We determine the Verma multiplicities of standard filtrations of projective modules for integral atypical blocks in the BGG category $\mathcal{O}$ for the orthosymplectic Lie superalgebras $\mathfrak{osp}(3|4)$ by way of translation functors. We then explicitly determine the composition factor multiplicities of Verma modules using BGG reciprocity.
△ Less
Submitted 20 November, 2020; v1 submitted 11 June, 2020;
originally announced June 2020.
-
Characters for Projective Modules in the BGG Category O for General Linear Lie Superalgebras
Authors:
Arun S. Kannan
Abstract:
We determine the Verma multiplicities and the characters of projective modules for atypical blocks in the BGG Category O for the general linear Lie superalgebras $\frak{gl}(2|2)$ and $\frak{gl}(3|1)$. We then explicitly determine the composition factor multiplcities of Verma modules in the atypicality 2 block of $\frak{gl}(2|2)$.
We determine the Verma multiplicities and the characters of projective modules for atypical blocks in the BGG Category O for the general linear Lie superalgebras $\frak{gl}(2|2)$ and $\frak{gl}(3|1)$. We then explicitly determine the composition factor multiplcities of Verma modules in the atypicality 2 block of $\frak{gl}(2|2)$.
△ Less
Submitted 30 October, 2018;
originally announced October 2018.
-
Distributed Stochastic Optimization under Imperfect Information
Authors:
Aswin Kannan,
Angelia Nedich,
Uday V. Shanbhag
Abstract:
We consider a stochastic convex optimization problem that requires minimizing a sum of misspecified agentspecific expectation-valued convex functions over the intersection of a collection of agent-specific convex sets. This misspecification is manifested in a parametric sense and may be resolved through solving a distinct stochastic convex learning problem. Our interest lies in the development of…
▽ More
We consider a stochastic convex optimization problem that requires minimizing a sum of misspecified agentspecific expectation-valued convex functions over the intersection of a collection of agent-specific convex sets. This misspecification is manifested in a parametric sense and may be resolved through solving a distinct stochastic convex learning problem. Our interest lies in the development of distributed algorithms in which every agent makes decisions based on the knowledge of its objective and feasibility set while learning the decisions of other agents by communicating with its local neighbors over a time-varying connectivity graph. While a significant body of research currently exists in the context of such problems, we believe that the misspecified generalization of this problem is both important and has seen little study, if at all. Accordingly, our focus lies on the simultaneous resolution of both problems through a joint set of schemes that combine three distinct steps: (i) An alignment step in which every agent updates its current belief by averaging over the beliefs of its neighbors; (ii) A projected (stochastic) gradient step in which every agent further updates this averaged estimate; and (iii) A learning step in which agents update their belief of the misspecified parameter by utilizing a stochastic gradient step. Under an assumption of mere convexity on agent objectives and strong convexity of the learning problems, we show that the sequences generated by this collection of update rules converge almost surely to the solution of the correctly specified stochastic convex optimization problem and the stochastic learning problem, respectively.
△ Less
Submitted 20 September, 2015; v1 submitted 13 September, 2015;
originally announced September 2015.
-
Optimal stochastic extragradient schemes for pseudomonotone stochastic variational inequality problems and their variants
Authors:
Aswin Kannan,
Uday V. Shanbhag
Abstract:
We consider the stochastic variational inequality problem in which the map is expectation-valued in a component-wise sense. Much of the available convergence theory and rate statements for stochastic approximation schemes are limited to monotone maps. However, non-monotone stochastic variational inequality problems are not uncommon and are seen to arise from product pricing, fractional optimizatio…
▽ More
We consider the stochastic variational inequality problem in which the map is expectation-valued in a component-wise sense. Much of the available convergence theory and rate statements for stochastic approximation schemes are limited to monotone maps. However, non-monotone stochastic variational inequality problems are not uncommon and are seen to arise from product pricing, fractional optimization problems, and subclasses of economic equilibrium problems. Motivated by the need to address a broader class of maps, we make the following contributions: (i) We present an extragradient-based stochastic approximation scheme and prove that the iterates converge to a solution of the original problem under either pseudomonotonicity requirements or a suitably defined acute angle condition. Such statements are shown to be generalizable to the stochastic mirror-prox framework; (ii) Under strong pseudomonotonicity, we show that the mean-squared error in the solution iterates produced by the extragradient SA scheme converges at the optimal rate of O(1/k) statements that were hitherto unavailable K in this regime. Notably, we optimize the initial steplength by obtaining an ε-infimum of a discontinuous nonconvex function. Similar statements are derived for mirror-prox generalizations and can accommodate monotone SVIs under a weak-sharpness requirement. Finally, both the asymptotics and the empirical rates of the schemes are studied on a set of variational problems where it is seen that the theoretically specified initial steplength leads to significant performance benefits.
△ Less
Submitted 21 November, 2019; v1 submitted 7 October, 2014;
originally announced October 2014.