Search | arXiv e-print repository

Classification of Non-Degenerate Symmetric Bilinear and Quadratic Forms in the Verlinde Category $\mathrm{Ver}_4^+$

Authors: Iz Chen, Arun S. Kannan, Krishna Pothapragada

Abstract: Although Deligne's theorem classifies all symmetric tensor categories (STCs) with moderate growth over algebraically closed fields of characteristic zero, the classification does not extend to positive characteristic. At the forefront of the study of STCs is the search for an analog to Deligne's theorem in positive characteristic, and it has become increasingly apparent that the Verlinde categorie… ▽ More Although Deligne's theorem classifies all symmetric tensor categories (STCs) with moderate growth over algebraically closed fields of characteristic zero, the classification does not extend to positive characteristic. At the forefront of the study of STCs is the search for an analog to Deligne's theorem in positive characteristic, and it has become increasingly apparent that the Verlinde categories are to play a significant role. Moreover, these categories are largely unstudied, but have already shown very interesting phenomena as both a generalization of and a departure from superalgebra and supergeometry. In this paper, we study $\mathrm{Ver}_4^+$, the simplest non-trivial Verlinde category in characteristic $2$. In particular, we classify all isomorphism classes of non-degenerate symmetric bilinear forms and non-degenerate quadratic forms and study the associated Witt semi-ring that arises from the addition and multiplication operations on bilinear forms. △ Less

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2404.02786 [pdf, ps, other]

The Steinberg Tensor Product Theorem for General Linear Group Schemes in the Verlinde Category

Authors: Arun S. Kannan

Abstract: The Steinberg tensor product theorem is a fundamental result in the modular representation theory of reductive algebraic groups. It describes any finite-dimensional simple module of highest weight $λ$ over such a group as the tensor product of Frobenius twists of simple modules with highest weights the weights appearing in a $p$-adic decomposition of $λ$, thereby reducing the character problem to… ▽ More The Steinberg tensor product theorem is a fundamental result in the modular representation theory of reductive algebraic groups. It describes any finite-dimensional simple module of highest weight $λ$ over such a group as the tensor product of Frobenius twists of simple modules with highest weights the weights appearing in a $p$-adic decomposition of $λ$, thereby reducing the character problem to a a finite collection of weights. In recent years this theorem has been extended to various quasi-reductive supergroup schemes. In this paper, we prove the analogous result for the general linear group scheme $GL(X)$ for any object $X$ in the Verlinde category $\mathrm{Ver}_p$. △ Less

Submitted 3 April, 2024; originally announced April 2024.

arXiv:2404.01719 [pdf, ps, other]

From the Albert algebra to Kac's ten-dimensional Jordan superalgebra via tensor categories in characteristic 5

Authors: Alberto Elduque, Pavel Etingof, Arun S. Kannan

Abstract: Kac's ten-dimensional simple Jordan superalgebra over a field of characteristic 5 is obtained from a process of semisimplification, via tensor categories, from the exceptional simple Jordan algebra (or Albert algebra), together with a suitable order 5 automorphism. This explains McCrimmon's 'bizarre result' asserting that, in characteristic 5, Kac's superalgebra is a sort of 'degree 3 Jordan super… ▽ More Kac's ten-dimensional simple Jordan superalgebra over a field of characteristic 5 is obtained from a process of semisimplification, via tensor categories, from the exceptional simple Jordan algebra (or Albert algebra), together with a suitable order 5 automorphism. This explains McCrimmon's 'bizarre result' asserting that, in characteristic 5, Kac's superalgebra is a sort of 'degree 3 Jordan superalgebra'. As an outcome, the exceptional simple Lie superalgebra el(5;5), specific of characteristic 5, is obtained from the simple Lie algebra of type $E_8$ and an order 5 automorphism. In the process, precise recipes to obtain superalgebras from algebras in the category of representations of the cyclic group $C_p$, over a field of characteristic $p>2$, are given. △ Less

Submitted 2 April, 2024; originally announced April 2024.

Comments: 22 pages

MSC Class: Primary 17C40; Secondary 17C70; 17B25; 18M15

arXiv:2303.10216 [pdf, other]

Approximation of group explainers with coalition structure using Monte Carlo sampling on the product space of coalitions and features

Authors: Konstandinos Kotsiopoulos, Alexey Miroshnikov, Khashayar Filom, Arjun Ravi Kannan

Abstract: In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor… ▽ More In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor vector. By viewing these explainers as expectations over appropriate sample spaces, we design a novel Monte Carlo sampling algorithm that estimates them at a reduced complexity that depends linearly on the size of the background dataset. We set up a rigorous framework for the statistical analysis and obtain error bounds for our sampling methods. The advantage of this approach is that it is fast, easily implementable, and model-agnostic. Furthermore, it has similar statistical accuracy as other known estimation techniques that are more complex and model-specific. We provide rigorous proofs of statistical convergence, as well as numerical experiments whose results agree with our theoretical findings. △ Less

Submitted 18 April, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

Comments: 31 pages, 6 figures

arXiv:2301.12282 [pdf, other]

Benefits of Multiobjective Learning in Solar Energy Prediction

Authors: Aswin Kannan

Abstract: While the space of renewable energy forecasting has received significant attention in the last decade, literature has primarily focused on machine learning models that train on only one objective at a time. A host of classification (and regression) tasks in energy markets lead to highly imbalanced training data. Say, to balance reserves, it is natural for market regulators to have a choice to be m… ▽ More While the space of renewable energy forecasting has received significant attention in the last decade, literature has primarily focused on machine learning models that train on only one objective at a time. A host of classification (and regression) tasks in energy markets lead to highly imbalanced training data. Say, to balance reserves, it is natural for market regulators to have a choice to be more/less averse to false negatives (can lead to poor operating efficiency and costs) than to false positives (can lead to market shortfall). Besides accuracy, other metrics like algorithmic bias, RMBE (in regression problems), inferencing time, and model sparsity are also very crucial. This paper is amongst the firsts in the field of renewable energy forecasting that attempts to present a Pareto frontier of solutions (tradeoffs), that answers the question on handling multiple objectives by means of using the XGBoost model (Gradient Boosted Trees). Our proposed algorithm relies on using a sequence of weighted (uniform meshes) single objective model training routines. Real world data examples from the Amherst (Massachusetts, United States) solar energy prediction panels with both triobjective (focus on accuracy) and biojective (focus on fairness/bias) classification instances are considered. Numerical experiments appear promising and clear advantages over single objective methods are seen by observing the spread and variety of solutions (model configurations). △ Less

Submitted 28 January, 2023; originally announced January 2023.

Comments: 8 pages

Journal ref: Proceedings of the AI2SE, AAAI, 2023

arXiv:2202.09910 [pdf, ps, other]

doi 10.1007/s10468-023-10202-4

Representation Stability and Finite Orthogonal Groups

Authors: Zifan Wang, Arun S. Kannan

Abstract: In this paper, we prove stability results about orthogonal groups over finite commutative rings where 2 is a unit. Inspired by Putman and Sam (2017), we construct a category $\mathbf{OrI}(R)$ and prove a Noetherianity theorem for the category of $\mathbf{OrI}(R)$-modules. This implies an asymptotic structure theorem for orthogonal groups. In addition, we show general homological stability theorems… ▽ More In this paper, we prove stability results about orthogonal groups over finite commutative rings where 2 is a unit. Inspired by Putman and Sam (2017), we construct a category $\mathbf{OrI}(R)$ and prove a Noetherianity theorem for the category of $\mathbf{OrI}(R)$-modules. This implies an asymptotic structure theorem for orthogonal groups. In addition, we show general homological stability theorems for orthogonal groups, with both untwisted and twisted coefficients, partially generalizing a result of Charney (1987). △ Less

Submitted 20 February, 2022; originally announced February 2022.

Comments: 21 pages, 0 figures

MSC Class: 16P40; 18A25; 18Gxx; 20J05

arXiv:2112.01467 [pdf, ps, other]

Stable Centres II: Finite Classical Groups

Authors: Arun S. Kannan, Christopher Ryba

Abstract: Farahat and Higman constructed an algebra $\mathrm{FH}$ interpolating the centres of symmetric group algebras $Z(\mathbb{Z}S_n)$ by proving that the structure constants in these rings are "polynomial in $n$". Inspired by a construction of $\mathrm{FH}$ due to Ivanov and Kerov, we prove for $G_n = GL_n, U_n, Sp_{2n}, O_n$, that the structure constants of $Z(\mathbb{Z}G_n(\mathbb{F}_q))$ are "polyno… ▽ More Farahat and Higman constructed an algebra $\mathrm{FH}$ interpolating the centres of symmetric group algebras $Z(\mathbb{Z}S_n)$ by proving that the structure constants in these rings are "polynomial in $n$". Inspired by a construction of $\mathrm{FH}$ due to Ivanov and Kerov, we prove for $G_n = GL_n, U_n, Sp_{2n}, O_n$, that the structure constants of $Z(\mathbb{Z}G_n(\mathbb{F}_q))$ are "polynomial in $q^n$", allowing us to construct an equivalent of the Farahat-Higman algebra in each case. △ Less

Submitted 2 December, 2021; originally announced December 2021.

Comments: 38 pages

arXiv:2111.11259 [pdf, other]

Model-agnostic bias mitigation methods with regressor distribution control for Wasserstein-based fairness metrics

Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Ryan Franks, Arjun Ravi Kannan

Abstract: This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to… ▽ More This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to the bias, we reduce the dimensionality of the problem by mitigating the bias originating from those predictors. The post-processing methodology involves resha** the predictor distributions by balancing the positive and negative bias explanations and allows for the regressor bias to decrease. We design an algorithm that uses Bayesian optimization to construct the bias-performance efficient frontier over the family of post-processed models, from which an optimal model is selected. Our novel methodology performs optimization in low-dimensional spaces and avoids expensive model retraining. △ Less

Submitted 19 November, 2021; originally announced November 2021.

Comments: 29 pages, 32 figures

MSC Class: 49Q22; 91A12; 68T01

arXiv:2108.05847 [pdf, ps, other]

doi 10.1007/s00031-022-09751-7

New Constructions of Exceptional Simple Lie Superalgebras with Integer Cartan Matrix in Characteristics 3 and 5 via Tensor Categories

Authors: Arun S. Kannan

Abstract: Using tensor categories, we present new constructions of several of the exceptional simple Lie superalgebras with integer Cartan matrix in characteristic $p = 3$ and $p = 5$ from the complete classification of modular Lie superalgebras with indecomposable Cartan matrix and their simple subquotients over algebraically closed fields by Bouarroudj, Grozman, and Leites in 2009. Specifically, let… ▽ More Using tensor categories, we present new constructions of several of the exceptional simple Lie superalgebras with integer Cartan matrix in characteristic $p = 3$ and $p = 5$ from the complete classification of modular Lie superalgebras with indecomposable Cartan matrix and their simple subquotients over algebraically closed fields by Bouarroudj, Grozman, and Leites in 2009. Specifically, let $\mathbfα_p$ denote the kernel of the Frobenius endomorphism on the additive group scheme $\mathbb{G}_a$ over an algebraically closed field of characteristic $p$. The Verlinde category $\mathrm{Ver}_p$ is the semisimplification of the representation category $\mathrm{Rep} \ \mathbfα_p$, and $\mathrm{Ver}_p$ contains the category of super vector spaces as a full subcategory. Each exceptional Lie superalgebra we construct is realized as the image of an exceptional Lie algebra equipped with a nilpotent derivation of order at most $p$ under the semisimplification functor from $\mathrm{Rep} \ \mathbfα_p$ to $\mathrm{Ver}_p$. △ Less

Submitted 16 May, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

arXiv:2103.04878 [pdf, ps, other]

doi 10.4171/ecr/19/7

Lectures on Symmetric Tensor Categories

Authors: Pavel Etingof, Arun S. Kannan

Abstract: This is an expanded version of the notes by the second author of the lectures on symmetric tensor categories given by the first author at Ohio State University in March 2019 and later at ICRA-2020 in November 2020. We review some aspects of the current state of the theory of symmetric tensor categories and discuss their applications, including ones unavailable in the literature. This is an expanded version of the notes by the second author of the lectures on symmetric tensor categories given by the first author at Ohio State University in March 2019 and later at ICRA-2020 in November 2020. We review some aspects of the current state of the theory of symmetric tensor categories and discuss their applications, including ones unavailable in the literature. △ Less

Submitted 10 November, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

Comments: 34 pages, latex; v2 discusses results of the new paper [CEO], and derives stronger corollaries in the appendix

arXiv:2102.10878 [pdf, other]

Stability theory of game-theoretic group feature explanations for machine learning models

Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Khashayar Filom, Arjun Ravi Kannan

Abstract: In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds f… ▽ More In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds for both marginal and conditional explanations. The differences between the two games are then elucidated, such as showing that the marginal explanations can become discontinuous on some naturally-designed domains, while the conditional explanations remain stable. In the second part of our work, group explanation methodologies are devised based on game values with coalition structure, where the features are grouped based on dependencies. We show analytically that grou** features this way has a stabilizing effect on the marginal operator on both group and individual levels, and allows for the unification of marginal and conditional explanations. Our results are verified in a number of numerical experiments where an information-theoretic measure of dependence is used for grou**. △ Less

Submitted 3 April, 2024; v1 submitted 22 February, 2021; originally announced February 2021.

Comments: 76 pages, 41 figures. Major revision. The title has been changed

MSC Class: 91A06; 91A12; 91A80; 46N30; 46N99; 68T01

arXiv:2011.03156 [pdf, other]

doi 10.1007/s10994-022-06213-9

Wasserstein-based fairness interpretability framework for machine learning models

Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Ryan Franks, Arjun Ravi Kannan

Abstract: The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the fa… ▽ More The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the favorability of both the model and predictors with respect to the non-protected class. The quantification is accomplished by the use of transport theory, which gives rise to the decomposition of the model bias and bias explanations to positive and negative contributions. To gain more insight into the role of favorability and allow for additivity of bias explanations, we adapt techniques from cooperative game theory. △ Less

Submitted 8 March, 2022; v1 submitted 5 November, 2020; originally announced November 2020.

Comments: 39 pages. (submitted for publication)

MSC Class: 49Q22; 91A12; 68T01; 90C08

Journal ref: Machine Learning Journal (2022), Springer

arXiv:2006.06788 [pdf, ps, other]

doi 10.1016/j.jalgebra.2020.10.030

Characters for Projective Modules in the BGG Category $\mathcal{O}$ for the Orthosymplectic Lie Superalgebra $\mathfrak{osp}(3|4)$

Authors: Arun S. Kannan, Honglin Zhu

Abstract: We determine the Verma multiplicities of standard filtrations of projective modules for integral atypical blocks in the BGG category $\mathcal{O}$ for the orthosymplectic Lie superalgebras $\mathfrak{osp}(3|4)$ by way of translation functors. We then explicitly determine the composition factor multiplicities of Verma modules using BGG reciprocity. We determine the Verma multiplicities of standard filtrations of projective modules for integral atypical blocks in the BGG category $\mathcal{O}$ for the orthosymplectic Lie superalgebras $\mathfrak{osp}(3|4)$ by way of translation functors. We then explicitly determine the composition factor multiplicities of Verma modules using BGG reciprocity. △ Less

Submitted 20 November, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

Comments: arXiv admin note: text overlap with arXiv:1810.13050

arXiv:1810.13050 [pdf, ps, other]

doi 10.1016/j.jalgebra.2019.05.024

Characters for Projective Modules in the BGG Category O for General Linear Lie Superalgebras

Authors: Arun S. Kannan

Abstract: We determine the Verma multiplicities and the characters of projective modules for atypical blocks in the BGG Category O for the general linear Lie superalgebras $\frak{gl}(2|2)$ and $\frak{gl}(3|1)$. We then explicitly determine the composition factor multiplcities of Verma modules in the atypicality 2 block of $\frak{gl}(2|2)$. We determine the Verma multiplicities and the characters of projective modules for atypical blocks in the BGG Category O for the general linear Lie superalgebras $\frak{gl}(2|2)$ and $\frak{gl}(3|1)$. We then explicitly determine the composition factor multiplcities of Verma modules in the atypicality 2 block of $\frak{gl}(2|2)$. △ Less

Submitted 30 October, 2018; originally announced October 2018.

arXiv:1509.03925 [pdf, ps, other]

Distributed Stochastic Optimization under Imperfect Information

Authors: Aswin Kannan, Angelia Nedich, Uday V. Shanbhag

Abstract: We consider a stochastic convex optimization problem that requires minimizing a sum of misspecified agentspecific expectation-valued convex functions over the intersection of a collection of agent-specific convex sets. This misspecification is manifested in a parametric sense and may be resolved through solving a distinct stochastic convex learning problem. Our interest lies in the development of… ▽ More We consider a stochastic convex optimization problem that requires minimizing a sum of misspecified agentspecific expectation-valued convex functions over the intersection of a collection of agent-specific convex sets. This misspecification is manifested in a parametric sense and may be resolved through solving a distinct stochastic convex learning problem. Our interest lies in the development of distributed algorithms in which every agent makes decisions based on the knowledge of its objective and feasibility set while learning the decisions of other agents by communicating with its local neighbors over a time-varying connectivity graph. While a significant body of research currently exists in the context of such problems, we believe that the misspecified generalization of this problem is both important and has seen little study, if at all. Accordingly, our focus lies on the simultaneous resolution of both problems through a joint set of schemes that combine three distinct steps: (i) An alignment step in which every agent updates its current belief by averaging over the beliefs of its neighbors; (ii) A projected (stochastic) gradient step in which every agent further updates this averaged estimate; and (iii) A learning step in which agents update their belief of the misspecified parameter by utilizing a stochastic gradient step. Under an assumption of mere convexity on agent objectives and strong convexity of the learning problems, we show that the sequences generated by this collection of update rules converge almost surely to the solution of the correctly specified stochastic convex optimization problem and the stochastic learning problem, respectively. △ Less

Submitted 20 September, 2015; v1 submitted 13 September, 2015; originally announced September 2015.

arXiv:1410.1628 [pdf, other]

doi 10.1007/s10589-019-00120-x

Optimal stochastic extragradient schemes for pseudomonotone stochastic variational inequality problems and their variants

Authors: Aswin Kannan, Uday V. Shanbhag

Abstract: We consider the stochastic variational inequality problem in which the map is expectation-valued in a component-wise sense. Much of the available convergence theory and rate statements for stochastic approximation schemes are limited to monotone maps. However, non-monotone stochastic variational inequality problems are not uncommon and are seen to arise from product pricing, fractional optimizatio… ▽ More We consider the stochastic variational inequality problem in which the map is expectation-valued in a component-wise sense. Much of the available convergence theory and rate statements for stochastic approximation schemes are limited to monotone maps. However, non-monotone stochastic variational inequality problems are not uncommon and are seen to arise from product pricing, fractional optimization problems, and subclasses of economic equilibrium problems. Motivated by the need to address a broader class of maps, we make the following contributions: (i) We present an extragradient-based stochastic approximation scheme and prove that the iterates converge to a solution of the original problem under either pseudomonotonicity requirements or a suitably defined acute angle condition. Such statements are shown to be generalizable to the stochastic mirror-prox framework; (ii) Under strong pseudomonotonicity, we show that the mean-squared error in the solution iterates produced by the extragradient SA scheme converges at the optimal rate of O(1/k) statements that were hitherto unavailable K in this regime. Notably, we optimize the initial steplength by obtaining an ε-infimum of a discontinuous nonconvex function. Similar statements are derived for mirror-prox generalizations and can accommodate monotone SVIs under a weak-sharpness requirement. Finally, both the asymptotics and the empirical rates of the schemes are studied on a set of variational problems where it is seen that the theoretically specified initial steplength leads to significant performance benefits. △ Less

Submitted 21 November, 2019; v1 submitted 7 October, 2014; originally announced October 2014.

Comments: Computational Optimization and Applications, 2019

Report number: Volume 74, Number 3

Showing 1–16 of 16 results for author: Kannan, A