-
Single Element Error Correction/ in a Euclidean Distance Matrix
Authors:
Abdo Alfakih,
Woosuk L. Jung,
Henry Wolkowicz,
Tina Xu
Abstract:
We consider the \emph{exact} error correction of a noisy Euclidean distance matrix, EDM, where the elements are the squared distances between $n$ points in $R^d$. For our problem we are given two facts: (i) the embedding dimension, $d$, (ii) \emph{exactly one} distance in the data is corrupted by \emph{nonzero noise}. But we do \underline{not} know the magnitude nor position of the noise. Thus the…
▽ More
We consider the \emph{exact} error correction of a noisy Euclidean distance matrix, EDM, where the elements are the squared distances between $n$ points in $R^d$. For our problem we are given two facts: (i) the embedding dimension, $d$, (ii) \emph{exactly one} distance in the data is corrupted by \emph{nonzero noise}. But we do \underline{not} know the magnitude nor position of the noise. Thus there is a combinatorial element to the problem. We present three solution techniques. These use three divide and conquer strategies in combination with three versions of facial reduction that use: exposing vectors, facial vectors, and Gale transforms. This sheds light on the connections between the various forms of facial reduction related to Gale transforms. Our highly successful empirics confirm the success of these approaches as we can solve huge problems of the order of $100,000$ nodes in approximately one minute to machine precision. \\Our algorithm depends on identifying whether a principal submatrix of the \EDM contains the corrupted element. We provide a theorem for doing this that is related to the existing results for identifying \emph{yielding} elements, i.e.,~we provide a characterization for guaranteeing the perturbed EDM remains an EDM with embedding dimension $d$. The characterization is particularly simple in the $d=2$ case. \\In addition, we characterize when the intuitive approach of the nearest EDM problem, solves our problem. In fact, we show that this happens if, and only if, the original distance element is $0$, degenerate, and the perturbation is negative.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Clearing time randomization and transaction fees for auction market design
Authors:
Thibaut Mastrolia,
Tianrui Xu
Abstract:
Flaws of a continuous limit order book mechanism raise the question of whether a continuous trading session and a periodic auction session would bring better efficiency. This paper wants to go further in designing a periodic auction when both a continuous market and a periodic auction market are available to traders. In a periodic auction, we discover that a strategic trader could take advantage o…
▽ More
Flaws of a continuous limit order book mechanism raise the question of whether a continuous trading session and a periodic auction session would bring better efficiency. This paper wants to go further in designing a periodic auction when both a continuous market and a periodic auction market are available to traders. In a periodic auction, we discover that a strategic trader could take advantage of the accumulated information available along the auction duration by arriving at the latest moment before the auction closes, increasing the price impact on the market. Such price impact moves the clearing price away from the efficient price and may disturb the efficiency of a periodic auction market. We thus propose and quantify the effect of two remedies to mitigate these flaws: randomizing the auction's closing time and optimally designing a transaction fees policy. Our results show that these policies encourage a strategic trader to send their orders earlier to enhance the efficiency of the auction market, illustrated by data extracted from Alphabet and Apple stocks.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
A scaling limit of the 2D parabolic Anderson model with exclusion interaction
Authors:
Dirk Erhard,
Martin Hairer,
Tiecheng Xu
Abstract:
We consider the (discrete) parabolic Anderson model $\partial u(t,x)/\partial t=Δu(t,x) +ξ_t(x) u(t,x)$, $t\geq 0$, $x\in \mathbb{Z}^d$. Here, the $ξ$-field is $\mathbb{R}$-valued, acting as a dynamic random environment, and $Δ$ represents the discrete Laplacian. We focus on the case where $ξ$ is given by a rescaled symmetric simple exclusion process which converges to an Ornstein--Uhlenbeck proce…
▽ More
We consider the (discrete) parabolic Anderson model $\partial u(t,x)/\partial t=Δu(t,x) +ξ_t(x) u(t,x)$, $t\geq 0$, $x\in \mathbb{Z}^d$. Here, the $ξ$-field is $\mathbb{R}$-valued, acting as a dynamic random environment, and $Δ$ represents the discrete Laplacian. We focus on the case where $ξ$ is given by a rescaled symmetric simple exclusion process which converges to an Ornstein--Uhlenbeck process. By scaling the Laplacian diffusively and considering the equation on a torus, we demonstrate that in dimension $d=2$, when a suitably renormalized version of the above equation is considered, the sequence of solutions converges in law. This resolves an open problem from~\cite{EH23}, where a similar result was shown in the three-dimensional case. The novel contribution in the present work is the establishment of a gradient bound on the transition probability of a fixed but arbitrary number of labelled exclusion particles.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Anderson Acceleration with Truncated Gram-Schmidt
Authors:
Ziyuan Tang,
Tianshi Xu,
Huan He,
Yousef Saad,
Yuanzhe Xi
Abstract:
Anderson Acceleration (AA) is a popular algorithm designed to enhance the convergence of fixed-point iterations. In this paper, we introduce a variant of AA based on a Truncated Gram-Schmidt process (AATGS) which has a few advantages over the classical AA. In particular, an attractive feature of AATGS is that its iterates obey a three-term recurrence in the situation when it is applied to solving…
▽ More
Anderson Acceleration (AA) is a popular algorithm designed to enhance the convergence of fixed-point iterations. In this paper, we introduce a variant of AA based on a Truncated Gram-Schmidt process (AATGS) which has a few advantages over the classical AA. In particular, an attractive feature of AATGS is that its iterates obey a three-term recurrence in the situation when it is applied to solving symmetric linear problems and this can lead to a considerable reduction of memory and computational costs. We analyze the convergence of AATGS in both full-depth and limited-depth scenarios and establish its equivalence to the classical AA in the linear case. We also report on the effectiveness of AATGS through a set of numerical experiments, ranging from solving nonlinear partial differential equations to tackling nonlinear optimization problems. In particular, the performance of the method is compared with that of the classical AA algorithms.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
Singular Solutions for the Conformal Dirac-Einstein Problem on the Sphere
Authors:
Ali Maalaoui,
Vittorio Martino,
Tian Xu
Abstract:
In this paper we investigate the existence of singular solutions to the conformal Dirac-Einstein system. Because of its conformal invariance, there are many similarities with the classical construction of singular solutions for the Yamabe problem. We construct here a family of singular solutions, on the three-dimensional sphere, having exactly two singularities.
In this paper we investigate the existence of singular solutions to the conformal Dirac-Einstein system. Because of its conformal invariance, there are many similarities with the classical construction of singular solutions for the Yamabe problem. We construct here a family of singular solutions, on the three-dimensional sphere, having exactly two singularities.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
A partial order on antichains of a fixed size
Authors:
R. M. Green,
Tianyuan Xu
Abstract:
We introduce a new partial order on the set of all antichains of a fixed size in a given poset. When applied to minuscule posets, these partial orders give rise to distributive lattices that appear in the branching rules for minuscule representations of simply laced complex simple Lie algebras.
We introduce a new partial order on the set of all antichains of a fixed size in a given poset. When applied to minuscule posets, these partial orders give rise to distributive lattices that appear in the branching rules for minuscule representations of simply laced complex simple Lie algebras.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
Level-Rank Dualities from $Φ$-Harish-Chandra Series and Affine Springer Fibers
Authors:
Minh-Tâm Quang Trinh,
Ting Xue
Abstract:
For any generic finite reductive group $\mathbb{G}$, integer $e > 0$, and $Φ_e$-cuspidal pair $(\mathbb{L}, λ)$, Broué-Malle-Michel conjectured that the endomorphism rings of the Deligne-Lusztig representations attached to $\mathbb{G}, (\mathbb{L}, λ)$ all come from the same generic cyclotomic Hecke algebra. We propose a new conjecture about the Harish-Chandra theory of such pairs, involving two i…
▽ More
For any generic finite reductive group $\mathbb{G}$, integer $e > 0$, and $Φ_e$-cuspidal pair $(\mathbb{L}, λ)$, Broué-Malle-Michel conjectured that the endomorphism rings of the Deligne-Lusztig representations attached to $\mathbb{G}, (\mathbb{L}, λ)$ all come from the same generic cyclotomic Hecke algebra. We propose a new conjecture about the Harish-Chandra theory of such pairs, involving two integers $e$ and $m$: namely, that the intersection of an $Φ_e$-Harish-Chandra series and a $Φ_m$-Harish-Chandra series is parametrized by both a union of $Φ_m$-blocks of the $Φ_e$-Hecke algebra and a union of $Φ_e$-blocks of the $Φ_m$-Hecke algebra, in a way that matches blocks. We also conjecture that when blocks match, there is an equivalence of categories between their highest-weight covers. When $e = 1$, we provide evidence that our bijections are essentially realized by bimodules that Oblomkov-Yun construct from the cohomology of affine Springer fibers. This suggests a strange analogy: Roughly, homogeneous affine Springer fibers are to roots of unity as tensor products of Deligne-Lusztig representations are to prime powers.
We predict the generic Hecke parameters for arbitrary $Φ$-cuspidal pairs of the groups $\mathbb{G}\mathbb{L}_n$ and $\mathbb{G}\mathbb{U}_n$, unifying the known cases. We prove that they would imply our conjectural bijections for these groups and coprime $e, m$. Then we show that the bijections for $\mathbb{G}\mathbb{L}_n$ are related by affine permutations to Uglov's bijections between bases of higher-level Fock spaces. This relates our conjectural equivalences of categories to those conjectured by Chuang-Miyachi, and proved by several authors, under the name of level-rank duality. Finally, for many cases in exceptional types, we verify that the parameters predicted by Broué-Malle are compatible with our conjectures.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
A Deep Reinforcement Learning Approach to Efficient Distributed Optimization
Authors:
Daokuan Zhu,
Tianqi Xu,
Jie Lu
Abstract:
In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a given specific problem. In this paper, we propose a learning-based method to achieve efficient distributed optimization over networked systems. Specifically, a…
▽ More
In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a given specific problem. In this paper, we propose a learning-based method to achieve efficient distributed optimization over networked systems. Specifically, a deep reinforcement learning (DRL) framework is developed for adaptive configuration within a parameterized unifying algorithmic form, which incorporates an abundance of decentralized first-order and second-order optimization algorithms. We exploit the local consensus and objective information to represent the regularities of problem instances and trace the solving progress, which constitute the states observed by a DRL agent. The framework is trained using Proximal Policy Optimization (PPO) on a number of practical problem instances of similar structures yet different problem data. Experiments on various smooth and non-smooth classes of objective functions demonstrate that our proposed learning-based method outperforms several state-of-the-art distributed optimization algorithms in terms of convergence speed and solution accuracy.
△ Less
Submitted 3 January, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Robust Probabilistic Prediction for Stochastic Dynamical Systems
Authors:
Tao Xu,
Jian** He
Abstract:
It is critical and challenging to design robust predictors for stochastic dynamical systems (SDSs) with uncertainty quantification (UQ) in the prediction. Specifically, robustness guarantees the worst-case performance when the predictor's information set of the system is inadequate, and UQ characterizes how confident the predictor is about the predictions. However, it is difficult for traditional…
▽ More
It is critical and challenging to design robust predictors for stochastic dynamical systems (SDSs) with uncertainty quantification (UQ) in the prediction. Specifically, robustness guarantees the worst-case performance when the predictor's information set of the system is inadequate, and UQ characterizes how confident the predictor is about the predictions. However, it is difficult for traditional robust predictors to provide robust UQ because they were designed to robustify the performance of point predictions. In this paper, we investigate how to robustify the probabilistic prediction for SDS, which can inherently provide robust distributional UQ. To characterize the performance of probabilistic predictors, we generalize the concept of likelihood function to likelihood functional, and prove that this metric is a proper scoring rule. Based on this metric, we propose a framework to quantify when the predictor is robust and analyze how the information set affects the robustness. Our framework makes it possible to design robust probabilistic predictors by solving functional optimization problems concerning different information sets. In particular, we design a class of moment-based optimal robust probabilistic predictors and provide a practical Kalman-filter-based algorithm for implementation. Extensive numerical simulations are provided to elaborate on our results.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Idempotents in the group algebra of the infinite dihedral group
Authors:
Ivan Dimitrov,
Charles Paquette,
David Wehlau,
Tianyuan Xu
Abstract:
We prove that over an algebraically closed field $\mathbb{K}$ of characteristic different from $2$, the group algebra $R=\mathbb{K} D_\infty$ of the infinite dihedral group $D_\infty$ has exactly six conjugacy classes of involutions (equivalently, of idempotents). This allows us to recover the fact that $R$ admits exactly four non-isomorphic indecomposable projective modules of the form $eR$ where…
▽ More
We prove that over an algebraically closed field $\mathbb{K}$ of characteristic different from $2$, the group algebra $R=\mathbb{K} D_\infty$ of the infinite dihedral group $D_\infty$ has exactly six conjugacy classes of involutions (equivalently, of idempotents). This allows us to recover the fact that $R$ admits exactly four non-isomorphic indecomposable projective modules of the form $eR$ where $e$ is an idempotent, a result that was first established by Berman and Buzási.
△ Less
Submitted 14 October, 2023;
originally announced October 2023.
-
Transient asymptotics of the modified Camassa-Holm equation
Authors:
Taiyang Xu,
Yiling Yang,
Lun Zhang
Abstract:
We investigate long time asymptotics of the modified Camassa-Holm equation in three transition zones under a nonzero background. The first transition zone lies between the soliton region and the first oscillatory region, the second one lies between the second oscillatory region and the fast decay region, and possibly, the third one, namely, the collisionless shock region, that bridges the first tr…
▽ More
We investigate long time asymptotics of the modified Camassa-Holm equation in three transition zones under a nonzero background. The first transition zone lies between the soliton region and the first oscillatory region, the second one lies between the second oscillatory region and the fast decay region, and possibly, the third one, namely, the collisionless shock region, that bridges the first transition region and the first oscillatory region. Under a low regularity condition on the initial data, we obtain Painlevé-type asymptotic formulas in the first two transition regions, while the transient asymptotics in the third region involves the Jacobi theta function. We establish our results by performing a $\bar{\partial}$ nonlinear steepest descent analysis to the associated Riemann-Hilbert problem.
△ Less
Submitted 14 August, 2023;
originally announced August 2023.
-
Optimal Mixed Strategies to the Zero-sum Linear Differential Game
Authors:
Tao Xu,
Wang Xi,
Jian** He
Abstract:
This paper exploits the weak approximation method to study a zero-sum linear differential game under mixed strategies. The stochastic nature of mixed strategies poses challenges in evaluating the game value and deriving the optimal strategies. To overcome these challenges, we first define the mixed strategy based on time discretization given the control period $δ$. Then, we design a stochastic dif…
▽ More
This paper exploits the weak approximation method to study a zero-sum linear differential game under mixed strategies. The stochastic nature of mixed strategies poses challenges in evaluating the game value and deriving the optimal strategies. To overcome these challenges, we first define the mixed strategy based on time discretization given the control period $δ$. Then, we design a stochastic differential equation (SDE) to approximate the discretized game dynamic with a small approximation error of scale $\mathcal{O}(δ^2)$ in the weak sense. Moreover, we prove that the game payoff is also approximated in the same order of accuracy. Next, we solve the optimal mixed strategies and game values for the linear quadratic differential games. The effect of the control period is explicitly analyzed when the payoff is a terminal cost. Our results provide the first implementable form of the optimal mixed strategies for a zero-sum linear differential game. Finally, we provide numerical examples to illustrate and elaborate on our results.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Non-compactness results for the spinorial Yamabe-type problems with non-smooth geometric data
Authors:
Takeshi Isobe,
Yannick Sire,
Tian Xu
Abstract:
Let $(M,\textit{g},σ)$ be an $m$-dimensional closed spin manifold, with a fixed Riemannian metric $\textit{g}$ and a fixed spin structure $σ$; let $\mathbb{S}(M)$ be the spinor bundle over $M$. The spinorial Yamabe-type problems address the solvability of the following equation \[ D_{\textit{g}}ψ= f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ, \quad ψ:M\to\mathbb{S}(M), \ x\in M \] where $D_{\textit{g}}$ i…
▽ More
Let $(M,\textit{g},σ)$ be an $m$-dimensional closed spin manifold, with a fixed Riemannian metric $\textit{g}$ and a fixed spin structure $σ$; let $\mathbb{S}(M)$ be the spinor bundle over $M$. The spinorial Yamabe-type problems address the solvability of the following equation \[ D_{\textit{g}}ψ= f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ, \quad ψ:M\to\mathbb{S}(M), \ x\in M \] where $D_{\textit{g}}$ is the associated Dirac operator and $f:M\to\mathbb{R}$ is a given function. The study of such nonlinear equation is motivated by its important applications in Spin Geometry: when $m=2$, a solution corresponds to a conformal isometric immersion of the universal covering $\widetilde M$ into $\mathbb{R}^3$ with prescribed mean curvature $f$; meanwhile, for general dimensions and $f\equiv constant\neq0$, a solution provides an upper bound estimate for the Bär-Hijazi-Lott invariant.
The aim of this paper is to establish non-compactness results related to the spinorial Yamabe-type problems. Precisely, concrete analysis is made for two specific models on the manifold $(S^m,\textit{g})$ where the solution set of the spinorial Yamabe-type problem is not compact: $1).$ the geometric potential $f$ is constant (say $f\equiv1$) with the background metric $\textit{g}$ being a $C^k$ perturbation of the canonical round metric $\textit{g}_{S^m}$, which is not conformally flat somewhere on $S^m$; $2).$ $f$ is a perturbation from constant and is of class $C^2$, while the background metric $\textit{g}\equiv\textit{g}_{S^m}$.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
Nonequilibrium Joint Fluctuations for Current and Occupation Time in The Symmetric Exclusion Process
Authors:
Dirk Erhard,
Tertuliano Franco,
Tiecheng Xu
Abstract:
We provide a full description for the joint fluctuations of current and occupation time in the one-dimensional nonequilibrium simple symmetric exclusion process, furnishing explicit formulas for the covariances of the limiting Gaussian process. The main novelties consist of a proof of the tightness of the nonequilibrium current based on new correlation estimates, refined estimates on the discrete…
▽ More
We provide a full description for the joint fluctuations of current and occupation time in the one-dimensional nonequilibrium simple symmetric exclusion process, furnishing explicit formulas for the covariances of the limiting Gaussian process. The main novelties consist of a proof of the tightness of the nonequilibrium current based on new correlation estimates, refined estimates on the discrete gradient of the transition probabilities of the SSEP, and a nonequilibrium Kipnis-Varadhan Lemma based on a Fourier approach.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
An Adaptive Factorized Nyström Preconditioner for Regularized Kernel Matrices
Authors:
Shifan Zhao,
Tianshi Xu,
Hua Huang,
Edmond Chow,
Yuanzhe Xi
Abstract:
The spectrum of a kernel matrix significantly depends on the parameter values of the kernel function used to define the kernel matrix. This makes it challenging to design a preconditioner for a regularized kernel matrix that is robust across different parameter values. This paper proposes the Adaptive Factorized Nyström (AFN) preconditioner. The preconditioner is designed for the case where the ra…
▽ More
The spectrum of a kernel matrix significantly depends on the parameter values of the kernel function used to define the kernel matrix. This makes it challenging to design a preconditioner for a regularized kernel matrix that is robust across different parameter values. This paper proposes the Adaptive Factorized Nyström (AFN) preconditioner. The preconditioner is designed for the case where the rank k of the Nyström approximation is large, i.e., for kernel function parameters that lead to kernel matrices with eigenvalues that decay slowly. AFN deliberately chooses a well-conditioned submatrix to solve with and corrects a Nyström approximation with a factorized sparse approximate matrix inverse. This makes AFN efficient for kernel matrices with large numerical ranks. AFN also adaptively chooses the size of this submatrix to balance accuracy and cost.
△ Less
Submitted 9 April, 2024; v1 submitted 11 April, 2023;
originally announced April 2023.
-
Variational operator learning: A unified paradigm marrying training neural operators and solving partial differential equations
Authors:
Tengfei Xu,
Dachuan Liu,
Peng Hao,
Bo Wang
Abstract:
Neural operators as novel neural architectures for fast approximating solution operators of partial differential equations (PDEs), have shown considerable promise for future scientific computing. However, the mainstream of training neural operators is still data-driven, which needs an expensive ground-truth dataset from various sources (e.g., solving PDEs' samples with the conventional solvers, re…
▽ More
Neural operators as novel neural architectures for fast approximating solution operators of partial differential equations (PDEs), have shown considerable promise for future scientific computing. However, the mainstream of training neural operators is still data-driven, which needs an expensive ground-truth dataset from various sources (e.g., solving PDEs' samples with the conventional solvers, real-world experiments) in addition to training stage costs. From a computational perspective, marrying operator learning and specific domain knowledge to solve PDEs is an essential step in reducing dataset costs and label-free learning. We propose a novel paradigm that provides a unified framework of training neural operators and solving PDEs with the variational form, which we refer to as the variational operator learning (VOL). Ritz and Galerkin approach with finite element discretization are developed for VOL to achieve matrix-free approximation of system functional and residual, then direct minimization and iterative update are proposed as two optimization strategies for VOL. Various types of experiments based on reasonable benchmarks about variable heat source, Darcy flow, and variable stiffness elasticity are conducted to demonstrate the effectiveness of VOL. With a label-free training set and a 5-label-only shift set, VOL learns solution operators with its test errors decreasing in a power law with respect to the amount of unlabeled data. To the best of the authors' knowledge, this is the first study that integrates the perspectives of the weak form and efficient iterative methods for solving sparse linear systems into the end-to-end operator learning task.
△ Less
Submitted 9 November, 2023; v1 submitted 9 April, 2023;
originally announced April 2023.
-
Solutions of Spinorial Yamabe-type Problems on $S^m$: Perturbations and Applications
Authors:
Takeshi Isobe,
Tian Xu
Abstract:
This paper is part of a program to establish the existence theory for the conformally invariant Dirac equation \[ D_{\textit{g}}ψ=f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ\] on a closed spin manifold $(M,\textit{g})$ of dimension $m\geq2$ with a fixed spin structure, where $f:M\to\mathbb{R}$ is a given function. The study on such nonlinear equation is motivated by its important applications in Spin Geom…
▽ More
This paper is part of a program to establish the existence theory for the conformally invariant Dirac equation \[ D_{\textit{g}}ψ=f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ\] on a closed spin manifold $(M,\textit{g})$ of dimension $m\geq2$ with a fixed spin structure, where $f:M\to\mathbb{R}$ is a given function. The study on such nonlinear equation is motivated by its important applications in Spin Geometry: when $m=2$, a solution corresponds to an isometric immersion of the universal covering $\widetilde M$ into $\mathbb{R}^3$ with prescribed mean curvature $f$; meanwhile, for general dimensions and $f\equiv constant$, a solution provides an upper bound estimate for the Bär-Hijazi-Lott invariant.
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
A Two-level GPU-Accelerated Incomplete LU Preconditioner for General Sparse Linear Systems
Authors:
Tianshi Xu,
Ruipeng Li,
Daniel Osei-Kuffuor
Abstract:
This paper presents a parallel preconditioning approach based on incomplete LU (ILU) factorizations in the framework of Domain Decomposition (DD) for general sparse linear systems. We focus on distributed memory parallel architectures, specifically, those that are equipped with graphic processing units (GPUs). In addition to block Jacobi, we present general purpose two-level ILU Schur complement-b…
▽ More
This paper presents a parallel preconditioning approach based on incomplete LU (ILU) factorizations in the framework of Domain Decomposition (DD) for general sparse linear systems. We focus on distributed memory parallel architectures, specifically, those that are equipped with graphic processing units (GPUs). In addition to block Jacobi, we present general purpose two-level ILU Schur complement-based approaches, where different strategies are presented to solve the coarse-level reduced system. These strategies are combined with modified ILU methods in the construction of the coarse-level operator, in order to effectively remove smooth errors. We leverage available GPU-based sparse matrix kernels to accelerate the setup and the solve phases of the proposed ILU preconditioner. We evaluate the efficiency of the proposed methods as a smoother for algebraic multigrid (AMG) and as a preconditioner for Krylov subspace methods, on challenging anisotropic diffusion problems and a collection of general sparse matrices.
△ Less
Submitted 15 March, 2023;
originally announced March 2023.
-
Computing with Categories in Machine Learning
Authors:
Eli Sennesh,
Tom Xu,
Yoshihiro Maruyama
Abstract:
Category theory has been successfully applied in various domains of science, shedding light on universal principles unifying diverse phenomena and thereby enabling knowledge transfer between them. Applications to machine learning have been pursued recently, and yet there is still a gap between abstract mathematical foundations and concrete applications to machine learning tasks. In this paper we i…
▽ More
Category theory has been successfully applied in various domains of science, shedding light on universal principles unifying diverse phenomena and thereby enabling knowledge transfer between them. Applications to machine learning have been pursued recently, and yet there is still a gap between abstract mathematical foundations and concrete applications to machine learning tasks. In this paper we introduce DisCoPyro as a categorical structure learning framework, which combines categorical structures (such as symmetric monoidal categories and operads) with amortized variational inference, and can be applied, e.g., in program learning for variational autoencoders. We provide both mathematical foundations and concrete applications together with comparison of experimental performance with other models (e.g., neuro-symbolic models). We speculate that DisCoPyro could ultimately contribute to the development of artificial general intelligence.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
A Simple Algorithm For Scaling Up Kernel Methods
Authors:
Teng Andrea Xu,
Bryan Kelly,
Semyon Malamud
Abstract:
The recent discovery of the equivalence between infinitely wide neural networks (NNs) in the lazy training regime and Neural Tangent Kernels (NTKs) (Jacot et al., 2018) has revived interest in kernel methods. However, conventional wisdom suggests kernel methods are unsuitable for large samples due to their computational complexity and memory requirements. We introduce a novel random feature regres…
▽ More
The recent discovery of the equivalence between infinitely wide neural networks (NNs) in the lazy training regime and Neural Tangent Kernels (NTKs) (Jacot et al., 2018) has revived interest in kernel methods. However, conventional wisdom suggests kernel methods are unsuitable for large samples due to their computational complexity and memory requirements. We introduce a novel random feature regression algorithm that allows us (when necessary) to scale to virtually infinite numbers of random features. We illustrate the performance of our method on the CIFAR-10 dataset.
△ Less
Submitted 30 January, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
Curvature effect in the spinorial Yamabe problem on product manifolds
Authors:
Thomas Bartsch,
Tian Xu
Abstract:
Let $(M_1,\textit{g}^{(1)})$, $(M_2,\textit{g}^{(2)})$ be closed Riemannian spin manifolds. We study the existence of solutions of the spinorial Yamabe problem on the product $M_1\times M_2$ equipped with a family of metrics $\varepsilon^{-2}\textit{g}^{(1)}\oplus\textit{g}^{(2)}$, $\varepsilon>0$. Via variational methods and blow-up techniques, we prove the existence of solutions which depend onl…
▽ More
Let $(M_1,\textit{g}^{(1)})$, $(M_2,\textit{g}^{(2)})$ be closed Riemannian spin manifolds. We study the existence of solutions of the spinorial Yamabe problem on the product $M_1\times M_2$ equipped with a family of metrics $\varepsilon^{-2}\textit{g}^{(1)}\oplus\textit{g}^{(2)}$, $\varepsilon>0$. Via variational methods and blow-up techniques, we prove the existence of solutions which depend only on the factor $M_1$, and which exhibit a spike layer as $\varepsilon\to0$. Moreover, we locate the asymptotic position of the peak points of the solutions in terms of the curvature tensor on $(M_1,\textit{g}^{(1)})$.
△ Less
Submitted 12 January, 2023;
originally announced January 2023.
-
Constructions of Delaunay-type solutions for the spinorial Yamabe equation on spheres
Authors:
Ali Maalaoui,
Yannick Sire,
Tian Xu
Abstract:
In this paper we construct singular solutions to the critical Dirac equation on spheres. More precisely, first we construct solutions admitting two points singularities that we call Delaunay-type solutions because of their similarities with the Delaunay solutions constructed for the singular Yamabe problem in \cite{MP1 , Schoen1989}. Then we construct another kind of singular solutions admitting a…
▽ More
In this paper we construct singular solutions to the critical Dirac equation on spheres. More precisely, first we construct solutions admitting two points singularities that we call Delaunay-type solutions because of their similarities with the Delaunay solutions constructed for the singular Yamabe problem in \cite{MP1 , Schoen1989}. Then we construct another kind of singular solutions admitting a great circle as a singular set. These solutions are the building blocks for singular solutions on a general Spin manifold.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
Approximation of the non-linear water hammer problem by a Lax-Wendroff finite difference scheme
Authors:
Hugo Carrillo-Lincopi,
Alden Waters,
Teke Xu
Abstract:
We study the water hammer problem in the case of a sudden closing of a valve upstream, and we consider a Lax-Wendroff finite difference scheme in order to obtain a numerical solution of this problem. In order to establish the approximation of this scheme to the original case, we rigorously show some properties such as consistency, stability and weak convergence of the scheme under reasonable condi…
▽ More
We study the water hammer problem in the case of a sudden closing of a valve upstream, and we consider a Lax-Wendroff finite difference scheme in order to obtain a numerical solution of this problem. In order to establish the approximation of this scheme to the original case, we rigorously show some properties such as consistency, stability and weak convergence of the scheme under reasonable conditions. In addition, we present some numerical simulations in order to show some features of the numerical method.
△ Less
Submitted 30 March, 2023; v1 submitted 30 November, 2022;
originally announced December 2022.
-
The Ariki--Koike algebras and Rogers--Ramanujan type partitions
Authors:
Shane Chern,
Zhitai Li,
Dennis Stanton,
Ting Xue,
Ae Ja Yee
Abstract:
In 2000, Ariki and Mathas showed that the simple modules of the Ariki--Koike algebras $\mathcal{H}_{\mathbb{C},q;Q_1,\ldots, Q_m}\big(G(m, 1, n)\big)$ (when the parameters are roots of unity and $q\neq 1$) are labeled by the so-called Kleshchev multipartitions. This together with Ariki's categorification theorem enabled Ariki and Mathas to obtain the generating function for the number of Kleshchev…
▽ More
In 2000, Ariki and Mathas showed that the simple modules of the Ariki--Koike algebras $\mathcal{H}_{\mathbb{C},q;Q_1,\ldots, Q_m}\big(G(m, 1, n)\big)$ (when the parameters are roots of unity and $q\neq 1$) are labeled by the so-called Kleshchev multipartitions. This together with Ariki's categorification theorem enabled Ariki and Mathas to obtain the generating function for the number of Kleshchev multipartitions by making use of the Weyl--Kac character formula. In this paper, we revisit this generating function for the $q=-1$ case. This $q=-1$ case is particularly interesting, for the corresponding Kleshchev multipartitions have a very close connection to generalized Rogers--Ramanujan type partitions when $Q_1=\cdots=Q_a=-1$ and $Q_{a+1}=\cdots =Q_m =1$. Based on this connection, we provide an analytic proof of the result of Ariki and Mathas for $q=Q_1=\cdots Q_a=-1$ and $Q_{a+1}=\cdots =Q_m =1$. Our second objective is to investigate simple modules of the Ariki--Koike algebra in a fixed block. It is known that these simple modules in a fixed block are labeled by the Kleshchev multiparitions with a fixed partition residue statistic. This partition statistic is also studied in the works of Berkovich, Garvan, and Uncu. Employing their results, we provide two bivariate generating function identities when $m=2$.
△ Less
Submitted 16 September, 2022;
originally announced September 2022.
-
RAR-PINN algorithm for the data-driven vector-soliton solutions and parameter discovery of coupled nonlinear equations
Authors:
Shu-Mei Qin,
Min Li,
Tao Xu,
Shao-Qun Dong
Abstract:
This work aims to provide an effective deep learning framework to predict the vector-soliton solutions of the coupled nonlinear equations and their interactions. The method we propose here is a physics-informed neural network (PINN) combining with the residual-based adaptive refinement (RAR-PINN) algorithm. Different from the traditional PINN algorithm which takes points randomly, the RAR-PINN alg…
▽ More
This work aims to provide an effective deep learning framework to predict the vector-soliton solutions of the coupled nonlinear equations and their interactions. The method we propose here is a physics-informed neural network (PINN) combining with the residual-based adaptive refinement (RAR-PINN) algorithm. Different from the traditional PINN algorithm which takes points randomly, the RAR-PINN algorithm uses an adaptive point-fetching approach to improve the training efficiency for the solutions with steep gradients. A series of experiment comparisons between the RAR-PINN and traditional PINN algorithms are implemented to a coupled generalized nonlinear Schrödinger (CGNLS) equation as an example. The results indicate that the RAR-PINN algorithm has faster convergence rate and better approximation ability, especially in modeling the shape-changing vector-soliton interactions in the coupled systems. Finally, the RAR-PINN method is applied to perform the data-driven discovery of the CGNLS equation, which shows the dispersion and nonlinear coefficients can be well approximated.
△ Less
Submitted 29 April, 2022;
originally announced May 2022.
-
Representations of free products of semisimple algebras via quivers
Authors:
Andrew Buchanan,
Ivan Dimitrov,
Olivia Grace,
Charles Paquette,
David Wehlau,
Tianyuan Xu
Abstract:
Let $\mathbb{K}$ denote an algebraically closed field and $A$ a free product of finitely many semisimple associative $\mathbb{K}$-algebras. We associate to $A$ a finite acyclic quiver $Γ$ and show that the category of finite dimensional $A$-modules is equivalent to a full subcategory of the category ${\rm rep}(Γ)$ of finite dimensional representations of $Γ$. Under this equivalence, the simple…
▽ More
Let $\mathbb{K}$ denote an algebraically closed field and $A$ a free product of finitely many semisimple associative $\mathbb{K}$-algebras. We associate to $A$ a finite acyclic quiver $Γ$ and show that the category of finite dimensional $A$-modules is equivalent to a full subcategory of the category ${\rm rep}(Γ)$ of finite dimensional representations of $Γ$. Under this equivalence, the simple $A$-modules correspond exactly to the $θ$-stable representations of $Γ$ for some stability parameter $θ$. This gives us necessary conditions for an $A$-module to be simple, conditions which are also sufficient if the module is in general position. Even though there are indecomposable modules that are not simple, we prove that a module in general position is always semisimple. We also discuss the construction of arbitrary finite dimensional modules using nilpotent representations of quivers. Finally, we apply our results to the case of a free product of finite groups when $\mathbb{K}$ has characteristic zero.
△ Less
Submitted 18 May, 2022;
originally announced May 2022.
-
Probabilistic Predictability of Stochastic Dynamical Systems: Metric, Optimality and Application
Authors:
Tao Xu,
Jian** He,
Yushan Li
Abstract:
To assess the quality of a probabilistic prediction for stochastic dynamical systems (SDSs), scoring rules assign a numerical score based on the predictive distribution and the measured state. In this paper, we propose an $ε$-logarithm score that generalizes the celebrated logarithm score by considering a neighborhood with radius $ε$. To begin with, we prove that the $ε$-logarithm score is proper…
▽ More
To assess the quality of a probabilistic prediction for stochastic dynamical systems (SDSs), scoring rules assign a numerical score based on the predictive distribution and the measured state. In this paper, we propose an $ε$-logarithm score that generalizes the celebrated logarithm score by considering a neighborhood with radius $ε$. To begin with, we prove that the $ε$-logarithm score is proper (the expected score is optimized when the predictive distribution meets the ground truth) based on discrete approximations. Then, we characterize the probabilistic predictability of an SDS by the optimal expected score and approximate it with an error of scale $\mathcal{O}(ε)$. The approximation quantitatively shows how the system predictability is jointly determined by the neighborhood radius, the differential entropies of process noises, and the system dimension. In addition to the expected score, we also analyze the asymptotic behaviors of the score on individual trajectories. Specifically, we prove that the score on a trajectory will converge to the probabilistic predictability when the process noises are independent and identically distributed. Moreover, the convergence speed against the trajectory length $T$ is of scale $\mathcal{O}(T^{-\frac{1}{2}})$ in the sense of probability. Finally, we apply the predictability analysis to design unpredictable SDSs. Numerical examples are given to elaborate the results.
△ Less
Submitted 9 December, 2023; v1 submitted 12 May, 2022;
originally announced May 2022.
-
A Probabilistic Generative Model of Free Categories
Authors:
Eli Sennesh,
Tom Xu,
Yoshihiro Maruyama
Abstract:
Applied category theory has recently developed libraries for computing with morphisms in interesting categories, while machine learning has developed ways of learning programs in interesting languages. Taking the analogy between categories and languages seriously, this paper defines a probabilistic generative model of morphisms in free monoidal categories over domain-specific generating objects an…
▽ More
Applied category theory has recently developed libraries for computing with morphisms in interesting categories, while machine learning has developed ways of learning programs in interesting languages. Taking the analogy between categories and languages seriously, this paper defines a probabilistic generative model of morphisms in free monoidal categories over domain-specific generating objects and morphisms. The paper shows how acyclic directed wiring diagrams can model specifications for morphisms, which the model can use to generate morphisms. Amortized variational inference in the generative model then enables learning of parameters (by maximum likelihood) and inference of latent variables (by Bayesian inversion). A concrete experiment shows that the free category prior achieves competitive reconstruction performance on the Omniglot dataset.
△ Less
Submitted 13 May, 2022; v1 submitted 9 May, 2022;
originally announced May 2022.
-
parGeMSLR: A Parallel Multilevel Schur Complement Low-Rank Preconditioning and Solution Package for General Sparse Matrices
Authors:
Tianshi Xu,
Vassilis Kalantzis,
Ruipeng Li,
Yuanzhe Xi,
Geoffrey Dillon,
Yousef Saad
Abstract:
This paper discusses parGeMSLR, a C++/MPI software library for the solution of sparse systems of linear algebraic equations via preconditioned Krylov subspace methods in distributed-memory computing environments. The preconditioner implemented in parGeMSLR is based on algebraic domain decomposition and partitions the symmetrized adjacency graph recursively into several non-overlap** partitions v…
▽ More
This paper discusses parGeMSLR, a C++/MPI software library for the solution of sparse systems of linear algebraic equations via preconditioned Krylov subspace methods in distributed-memory computing environments. The preconditioner implemented in parGeMSLR is based on algebraic domain decomposition and partitions the symmetrized adjacency graph recursively into several non-overlap** partitions via a p-way vertex separator, where p is an integer multiple of the total number of MPI processes. From a numerical perspective, parGeMSLR builds a Schur complement approximate inverse preconditioner as the sum between the matrix inverse of the interface coupling matrix and a low-rank correction term. To reduce the cost associated with the computation of the approximate inverse matrices, parGeMSLR exploits a multilevel partitioning of the algebraic domain. The parGeMSLR library is implemented on top of the Message Passing Interface and can solve both real and complex linear systems. Furthermore, parGeMSLR can take advantage of hybrid computing environments with in-node access to one or more Graphics Processing Units. Finally, the parallel efficiency (weak and strong scaling) of parGeMSLR is demonstrated on a few model problems arising from discretizations of 3D Partial Differential Equations.
△ Less
Submitted 4 May, 2022;
originally announced May 2022.
-
2-roots for simply laced Weyl groups
Authors:
R. M. Green,
Tianyuan Xu
Abstract:
We introduce and study "2-roots", which are symmetrized tensor products of orthogonal roots of Kac--Moody algebras. We concentrate on the case where $W$ is the Weyl group of a simply laced Y-shaped Dynkin diagram $Y_{a,b,c}$ having $n$ vertices and with three branches of arbitrary finite lengths $a$, $b$ and $c$; special cases of this include types $D_n$, $E_n$ (for arbitrary $n \geq 6$), and affi…
▽ More
We introduce and study "2-roots", which are symmetrized tensor products of orthogonal roots of Kac--Moody algebras. We concentrate on the case where $W$ is the Weyl group of a simply laced Y-shaped Dynkin diagram $Y_{a,b,c}$ having $n$ vertices and with three branches of arbitrary finite lengths $a$, $b$ and $c$; special cases of this include types $D_n$, $E_n$ (for arbitrary $n \geq 6$), and affine $E_6$, $E_7$ and $E_8$. We show that a natural codimension-$1$ submodule $M$ of the symmetric square of the reflection representation of $W$ has a remarkable canonical basis $\mathcal{B}$ that consists of 2-roots. We prove that, with respect to $\mathcal{B}$, every element of $W$ is represented by a column sign-coherent matrix in the sense of cluster algebras. If $W$ is a finite simply laced Weyl group, each $W$-orbit of 2-roots has a highest element, analogous to the highest root, and we calculate these elements explicitly. We prove that if $W$ is not of affine type, the module $M$ is completely reducible in characteristic zero and each of its nontrivial direct summands is spanned by a $W$-orbit of 2-roots.
△ Less
Submitted 8 April, 2023; v1 submitted 20 April, 2022;
originally announced April 2022.
-
On the large-time asymptotics of the defocusing mKdV equation with step-like initial data
Authors:
Taiyang Xu
Abstract:
It is concerned with the large-time asymptotics of the Cauchy problem of the defocusing modified Korteweg-de Vries (mKdV) equation with step-like initial data subject to compact perturbations, that is, \begin{align*}
q_{0}(x)-q_{0c}(x)=0, \ \text{for} \ |x|>N \end{align*} with some positive $N$, where \begin{align*}
q_{0c}(x)=\left\{
\begin{aligned}
&c_{l}, \quad x\leqslant 0,
&c_{r}, \q…
▽ More
It is concerned with the large-time asymptotics of the Cauchy problem of the defocusing modified Korteweg-de Vries (mKdV) equation with step-like initial data subject to compact perturbations, that is, \begin{align*}
q_{0}(x)-q_{0c}(x)=0, \ \text{for} \ |x|>N \end{align*} with some positive $N$, where \begin{align*}
q_{0c}(x)=\left\{
\begin{aligned}
&c_{l}, \quad x\leqslant 0,
&c_{r}, \quad x>0,
\end{aligned}
\right. \end{align*} and $c_l>c_{r}>0$. It follows from the standard direct and inverse scattering theory that an RH characterization for the step-like problem is constructed. By performing the nonlinear steepest descent analysis, we mainly derive the large-time asymptotics in the each of four asymptotic zones in the $(x,t)$-half plane.
△ Less
Submitted 28 June, 2024; v1 submitted 4 April, 2022;
originally announced April 2022.
-
Character sheaves for classical graded Lie algebras
Authors:
Ting Xue
Abstract:
In this note we study character sheaves for graded Lie algebras arising from inner automorphisms of special linear groups and Vinberg's type II classical graded Lie algebras.
In this note we study character sheaves for graded Lie algebras arising from inner automorphisms of special linear groups and Vinberg's type II classical graded Lie algebras.
△ Less
Submitted 11 February, 2022;
originally announced February 2022.
-
On Limit Measures and Their Supports for Stochastic Ordinary Differential Equations
Authors:
Tianyuan Xu,
Lifeng Chen,
Jifa Jiang
Abstract:
This paper studies limit measures of stationary measures of stochastic ordinary differential equations on the Euclidean space and tries to determine which invariant measures of an unperturbed system will survive. Under the assumption for SODEs to admit the Freidlin-Wentzell or Dembo-Zeitouni large deviations principle with weaker compactness condition, we prove that limit measures are concentrated…
▽ More
This paper studies limit measures of stationary measures of stochastic ordinary differential equations on the Euclidean space and tries to determine which invariant measures of an unperturbed system will survive. Under the assumption for SODEs to admit the Freidlin-Wentzell or Dembo-Zeitouni large deviations principle with weaker compactness condition, we prove that limit measures are concentrated away from repellers which are topologically transitive, or equivalent classes, or admit Lebesgue measure zero. We also preclude concentrations of limit measures on acyclic saddle or trap chains. This illustrates that limit measures are concentrated on Liapunov stable compact invariant sets. Applications are made to the Morse-Smale systems, the Axiom A systems including structural stability systems and separated star systems, the gradient or gradient-like systems, those systems possessing the Poincare-Bendixson property with a finite number of limit sets to obtain that limit measures live on Liapunov stable critical elements, Liapunov stable basic sets, Liapunov stable equilibria and Liapunov stable limit sets including equilibria, limit cycles and saddle or trap cycles, respectively. A number of nontrivial examples admitting a unique limit measure are provided, which include monostable, multistable systems and those possessing infinite equivalent classes.
△ Less
Submitted 26 January, 2022;
originally announced January 2022.
-
On the Bär-Hijazi-Lott invariant for the Dirac operator and a spinorial proof of the Yamabe problem
Authors:
Yannick Sire,
Tian Xu
Abstract:
Let $M$ be a closed spin manifold of dimension $m\geq6$ equipped with a Riemannian metric $\ig$ and a spin structure $\sa$. Let $\lm_1^+(\tilde\ig)$ be the smallest positive eigenvalue of the Dirac operator $D_{\tilde\ig}$ on $M$ with respect to a metric $\tilde\ig$ conformal to $\ig$. The Bär-Hijazi-Lott invariant is defined by…
▽ More
Let $M$ be a closed spin manifold of dimension $m\geq6$ equipped with a Riemannian metric $\ig$ and a spin structure $\sa$. Let $\lm_1^+(\tilde\ig)$ be the smallest positive eigenvalue of the Dirac operator $D_{\tilde\ig}$ on $M$ with respect to a metric $\tilde\ig$ conformal to $\ig$. The Bär-Hijazi-Lott invariant is defined by $\lm_{min}^+(M,\ig,\sa)=\inf_{\tilde\ig\in[\ig]}\lm_1^+(\tilde\ig)\Vol(M,\tilde\ig)^\frac{1}{m}$. In this paper, we show that \[ \lm_{min}^+(M,\ig,\sa)<\lm_{min}^+(S^m,\ig_{S^m},\sa_{S^m})=\frac m2\Vol(S^m,\ig_{S^m})^{\frac1m} \] provided that $\ig$ is not locally conformally flat. This estimate is a spinorial analogue to an estimate by T. Aubin, solving the Yamabe problem in this setting.
△ Less
Submitted 7 December, 2021;
originally announced December 2021.
-
Additive functionals of exclusion processes from non-equilibrium
Authors:
Luiz Renato Fontes,
Tiecheng Xu
Abstract:
Consider the weakly asymmetric simple exclusion processes on the one-dimensional torus. We study the non-equilibrium fluctuation of a class of additive functionals, and show that its scaling limit is a Gaussian process. The proof is mainly based on the results obtained and techniques developed by Jara and Menezes [Non-equiliburim fluctuations of interacting particle systems, arXiv:1810.09526].
Consider the weakly asymmetric simple exclusion processes on the one-dimensional torus. We study the non-equilibrium fluctuation of a class of additive functionals, and show that its scaling limit is a Gaussian process. The proof is mainly based on the results obtained and techniques developed by Jara and Menezes [Non-equiliburim fluctuations of interacting particle systems, arXiv:1810.09526].
△ Less
Submitted 1 June, 2023; v1 submitted 16 November, 2021;
originally announced November 2021.
-
Invariant systems and character sheaves for graded Lie algebras
Authors:
Kari Vilonen,
Ting Xue
Abstract:
In this paper we explain how to construct all the character sheaves for type I graded classical Lie algebras which we expect to be cuspidal. A new ingredient is the use of invariant systems of differential equations.
In this paper we explain how to construct all the character sheaves for type I graded classical Lie algebras which we expect to be cuspidal. A new ingredient is the use of invariant systems of differential equations.
△ Less
Submitted 16 November, 2021;
originally announced November 2021.
-
Character sheaves for symmetric pairs: spin groups
Authors:
Ting Xue
Abstract:
We determine character sheaves for symmetric pairs associated to spin groups. In particular, we determine the cupsidal character sheaves and show that they can be obtained via the nearby cycle construction of [GVX] and its generalisation in [VX2].
We determine character sheaves for symmetric pairs associated to spin groups. In particular, we determine the cupsidal character sheaves and show that they can be obtained via the nearby cycle construction of [GVX] and its generalisation in [VX2].
△ Less
Submitted 31 October, 2021;
originally announced November 2021.
-
Character sheaves for symmetric pairs: special linear groups
Authors:
Kari Vilonen,
Ting Xue
Abstract:
We give an explicit description of character sheaves for the symmetric pairs associated to inner involutions of the special linear groups. We make use of the general strategy given in [VX1] and central character consideration. We also determine the cuspidal character sheaves.
We give an explicit description of character sheaves for the symmetric pairs associated to inner involutions of the special linear groups. We make use of the general strategy given in [VX1] and central character consideration. We also determine the cuspidal character sheaves.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process
Authors:
Tianjiao Li,
Ziwei Guan,
Shaofeng Zou,
Tengyu Xu,
Yingbin Liang,
Guanghui Lan
Abstract:
The problem of constrained Markov decision process (CMDP) is investigated, where an agent aims to maximize the expected accumulated discounted reward subject to multiple constraints on its utilities/costs. A new primal-dual approach is proposed with a novel integration of three ingredients: entropy regularized policy optimizer, dual variable regularizer, and Nesterov's accelerated gradient descent…
▽ More
The problem of constrained Markov decision process (CMDP) is investigated, where an agent aims to maximize the expected accumulated discounted reward subject to multiple constraints on its utilities/costs. A new primal-dual approach is proposed with a novel integration of three ingredients: entropy regularized policy optimizer, dual variable regularizer, and Nesterov's accelerated gradient descent dual optimizer, all of which are critical to achieve a faster convergence. The finite-time error bound of the proposed approach is characterized. Despite the challenge of the nonconcave objective subject to nonconcave constraints, the proposed approach is shown to converge to the global optimum with a complexity of $\tilde{\mathcal O}(1/ε)$ in terms of the optimality gap and the constraint violation, which improves the complexity of the existing primal-dual approach by a factor of $\mathcal O(1/ε)$ \citep{ding2020natural,paternain2019constrained}. This is the first demonstration that nonconcave CMDP problems can attain the complexity lower bound of $\mathcal O(1/ε)$ for convex optimization subject to convex constraints. Our primal-dual approach and non-asymptotic analysis are agnostic to the RL optimizer used, and thus are more flexible for practical applications. More generally, our approach also serves as the first algorithm that provably accelerates constrained nonconvex optimization with zero duality gap by exploiting the geometries such as the gradient dominance condition, for which the existing acceleration methods for constrained convex optimization are not applicable.
△ Less
Submitted 19 October, 2021;
originally announced October 2021.
-
Hydrodynamic limit of Exclusion Processes with slow boundaries on hypercubes
Authors:
Tiecheng Xu
Abstract:
We study the hydrodynamic limit of SSEP with slow boundaries on hypercubes in dimension at least two. The hydrodynamic limit equation is shown to be a heat equation with three different types of boundary conditions according to the slowness of the boundary dynamics.The proof is based on Yau's relative entropy method.
We study the hydrodynamic limit of SSEP with slow boundaries on hypercubes in dimension at least two. The hydrodynamic limit equation is shown to be a heat equation with three different types of boundary conditions according to the slowness of the boundary dynamics.The proof is based on Yau's relative entropy method.
△ Less
Submitted 29 September, 2021;
originally announced September 2021.
-
Kazhdan--Lusztig cells of $\mathbf{a}$-value 2 in $\mathbf{a}(2)$-finite Coxeter systems
Authors:
R. M. Green,
Tianyuan Xu
Abstract:
A Coxeter group is said to be \emph{$\mathbf{a}(2)$-finite} if it has finitely many elements of $\mathbf{a}$-value 2 in the sense of Lusztig. In this paper, we give explicit combinatorial descriptions of the left, right, and two-sided Kazhdan--Lusztig cells of $\mathbf{a}$-value 2 in an irreducible $\mathbf{a}(2)$-finite Coxeter group. In particular, we introduce elements we call \emph{stubs} to p…
▽ More
A Coxeter group is said to be \emph{$\mathbf{a}(2)$-finite} if it has finitely many elements of $\mathbf{a}$-value 2 in the sense of Lusztig. In this paper, we give explicit combinatorial descriptions of the left, right, and two-sided Kazhdan--Lusztig cells of $\mathbf{a}$-value 2 in an irreducible $\mathbf{a}(2)$-finite Coxeter group. In particular, we introduce elements we call \emph{stubs} to parameterize the one-sided cells and we characterize the one-sided cells via both star operations and weak Bruhat orders. We also compute the cardinalities of all the one-sided and two-sided cells.
△ Less
Submitted 25 May, 2023; v1 submitted 20 September, 2021;
originally announced September 2021.
-
On the Cauchy problem of defocusing mKdV equation with finite density initial data: long time asymptotics in soliton-less regions
Authors:
Taiyang Xu,
Zechuan Zhang,
Engui Fan
Abstract:
We investigate the long-time asymptotics for the solutions to the Cauchy problem of defocusing modified Kortweg-de Vries (mKdV) equation with finite density initial data. The present paper is the subsequent work of our previous paper [arXiv:2108.03650], which gives the soliton resolution for the defocusing mKdV equation in the central asymptotic sector $\{(x,t): \vert ξ\vert<6\}$ with $ξ:=x/t$. In…
▽ More
We investigate the long-time asymptotics for the solutions to the Cauchy problem of defocusing modified Kortweg-de Vries (mKdV) equation with finite density initial data. The present paper is the subsequent work of our previous paper [arXiv:2108.03650], which gives the soliton resolution for the defocusing mKdV equation in the central asymptotic sector $\{(x,t): \vert ξ\vert<6\}$ with $ξ:=x/t$. In the present paper, via the Riemann-Hilbert (RH) problem associated to the Cauchy problem, the long-time asymptotics in the soliton-less regions $\{(x,t): \vert ξ\vert>6, |ξ|=\mathcal{O}(1)\}$ for the defocusing mKdV equation are further obtained. It is shown that the leading term of the asymptotics are in compatible with the ``background solution'' and the error terms are derived via rigorous analysis.
△ Less
Submitted 23 June, 2023; v1 submitted 13 August, 2021;
originally announced August 2021.
-
Soliton resolution and asymptotic stability of $N$-soliton solutions for the defocusing mKdV equation with finite density type initial data
Authors:
Zechuan Zhang,
Taiyang Xu,
Engui Fan
Abstract:
We consider the Cauchy problem for the defocusing modified Korteweg-de Vries (mKdV) equation with finite density type initial data. With the $\bar{\partial}$ generalization of the nonlinear steepest descent method of Deift and Zhou, we extrapolate the leading order approximation to the solution of mKdV for large time in the solitonic space-time region $|x/t+4|<2$, and we give bounds for the error…
▽ More
We consider the Cauchy problem for the defocusing modified Korteweg-de Vries (mKdV) equation with finite density type initial data. With the $\bar{\partial}$ generalization of the nonlinear steepest descent method of Deift and Zhou, we extrapolate the leading order approximation to the solution of mKdV for large time in the solitonic space-time region $|x/t+4|<2$, and we give bounds for the error which decay as $t\rightarrow\infty$ for a general class of initial data whose difference from the non-vanishing background possesses a fixed number of finite moments. Our results provide a verification of the soliton resolution conjecture and asymptotic stability of $N$-soliton solutions for mKdV equation with finite density type initial data.
△ Less
Submitted 20 August, 2021; v1 submitted 8 August, 2021;
originally announced August 2021.
-
A Unified Off-Policy Evaluation Approach for General Value Function
Authors:
Tengyu Xu,
Zhuoran Yang,
Zhaoran Wang,
Yingbin Liang
Abstract:
General Value Function (GVF) is a powerful tool to represent both the {\em predictive} and {\em retrospective} knowledge in reinforcement learning (RL). In practice, often multiple interrelated GVFs need to be evaluated jointly with pre-collected off-policy samples. In the literature, the gradient temporal difference (GTD) learning method has been adopted to evaluate GVFs in the off-policy setting…
▽ More
General Value Function (GVF) is a powerful tool to represent both the {\em predictive} and {\em retrospective} knowledge in reinforcement learning (RL). In practice, often multiple interrelated GVFs need to be evaluated jointly with pre-collected off-policy samples. In the literature, the gradient temporal difference (GTD) learning method has been adopted to evaluate GVFs in the off-policy setting, but such an approach may suffer from a large estimation error even if the function approximation class is sufficiently expressive. Moreover, none of the previous work have formally established the convergence guarantee to the ground truth GVFs under the function approximation settings. In this paper, we address both issues through the lens of a class of GVFs with causal filtering, which cover a wide range of RL applications such as reward variance, value gradient, cost in anomaly detection, stationary distribution gradient, etc. We propose a new algorithm called GenTD for off-policy GVFs evaluation and show that GenTD learns multiple interrelated multi-dimensional GVFs as efficiently as a single canonical scalar value function. We further show that unlike GTD, the learned GVFs by GenTD are guaranteed to converge to the ground truth GVFs as long as the function approximation power is sufficiently large. To our best knowledge, GenTD is the first off-policy GVF evaluation algorithm that has global optimality guarantee.
△ Less
Submitted 6 July, 2021;
originally announced July 2021.
-
Identifiability Analysis of Linear Ordinary Differential Equation Systems with a Single Trajectory
Authors:
Xing Qiu,
Tao Xu,
Babak Soltanalizadeh,
Hulin Wu
Abstract:
Ordinary differential equations (ODEs) are widely used to model dynamical behavior of systems. It is important to perform identifiability analysis prior to estimating unknown parameters in ODEs (a.k.a. inverse problem), because if a system is unidentifiable, the estimation procedure may fail or produce erroneous and misleading results.
Although several qualitative identifiability measures have b…
▽ More
Ordinary differential equations (ODEs) are widely used to model dynamical behavior of systems. It is important to perform identifiability analysis prior to estimating unknown parameters in ODEs (a.k.a. inverse problem), because if a system is unidentifiable, the estimation procedure may fail or produce erroneous and misleading results.
Although several qualitative identifiability measures have been proposed, much less effort has been given to develo** \emph{quantitative} (continuous) scores that are robust to uncertainties in the data, especially for those cases in which the data are presented as a single trajectory beginning with one initial value.
In this paper, we first derived a closed-form representation of linear ODE systems that are not identifiable based on a single trajectory. This representation helps researchers design practical systems and choose the right prior structural information in practice. Next, we proposed several quantitative scores for identifiability analysis in practice. In simulation studies, the proposed measures outperformed the main competing method significantly, especially when noise was presented in the data. We also discussed the asymptotic properties of practical identifiability for high-dimensional ODE systems and conclude that, without additional prior information, many random ODE systems are practically unidentifiable when the dimension approaches infinity.
△ Less
Submitted 9 March, 2021;
originally announced March 2021.
-
Critical Sharp Front for Doubly Nonlinear Degenerate Diffusion Equations with Time Delay
Authors:
Tianyuan Xu,
Shanming Ji,
Ming Mei,
**gxue Yin
Abstract:
This paper is concerned with the critical sharp traveling wave for doubly nonlinear diffusion equation with time delay, where the doubly nonlinear degenerate diffusion is defined by $\Big(\big|(u^m)_x\big|^{p-2}(u^m)_x\Big)_x$ with $m>0$ and $p>1$. The doubly nonlinear diffusion equation is proved to admit a unique sharp type traveling wave for the degenerate case $m(p-1)>1$, the so-called slow-di…
▽ More
This paper is concerned with the critical sharp traveling wave for doubly nonlinear diffusion equation with time delay, where the doubly nonlinear degenerate diffusion is defined by $\Big(\big|(u^m)_x\big|^{p-2}(u^m)_x\Big)_x$ with $m>0$ and $p>1$. The doubly nonlinear diffusion equation is proved to admit a unique sharp type traveling wave for the degenerate case $m(p-1)>1$, the so-called slow-diffusion case. This sharp traveling wave associated with the minimal wave speed $c^*(m,p,r)$ is monotonically increasing, where the minimal wave speed satisfies $c^*(m,p,r)<c^*(m,p,0)$ for any time delay $r>0$. The sharp front is $C^1$-smooth for $\frac{1}{p-1}<m< \frac{p}{p-1}$, and piecewise smooth for $m\ge \frac{p}{p-1}$. Our results indicate that time delay slows down the minimal traveling wave speed for the doubly nonlinear degenerate diffusion equations. The approach adopted for proof is the phase transform method combining the variational method. The main technical issue for the proof is to overcome the obstacle caused by the doubly nonlinear degenerate diffusion.
△ Less
Submitted 8 March, 2021;
originally announced March 2021.
-
Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry
Authors:
Ziyi Chen,
Yi Zhou,
Tengyu Xu,
Yingbin Liang
Abstract:
The gradient descent-ascent (GDA) algorithm has been widely applied to solve minimax optimization problems. In order to achieve convergent policy parameters for minimax optimization, it is important that GDA generates convergent variable sequences rather than convergent sequences of function values or gradient norms. However, the variable convergence of GDA has been proved only under convexity geo…
▽ More
The gradient descent-ascent (GDA) algorithm has been widely applied to solve minimax optimization problems. In order to achieve convergent policy parameters for minimax optimization, it is important that GDA generates convergent variable sequences rather than convergent sequences of function values or gradient norms. However, the variable convergence of GDA has been proved only under convexity geometries, and there lacks understanding for general nonconvex minimax optimization. This paper fills such a gap by studying the convergence of a more general proximal-GDA for regularized nonconvex-strongly-concave minimax optimization. Specifically, we show that proximal-GDA admits a novel Lyapunov function, which monotonically decreases in the minimax optimization process and drives the variable sequence to a critical point. By leveraging this Lyapunov function and the KŁ geometry that parameterizes the local geometries of general nonconvex functions, we formally establish the variable convergence of proximal-GDA to a critical point $x^*$, i.e., $x_t\to x^*, y_t\to y^*(x^*)$. Furthermore, over the full spectrum of the KŁ-parameterized geometry, we show that proximal-GDA achieves different types of convergence rates ranging from sublinear convergence up to finite-step convergence, depending on the geometry associated with the KŁ parameter. This is the first theoretical result on the variable convergence for nonconvex minimax optimization.
△ Less
Submitted 17 February, 2021; v1 submitted 9 February, 2021;
originally announced February 2021.
-
A note on Hessenberg varieties
Authors:
Kari Vilonen,
Ting Xue
Abstract:
We give a short proof based on Lusztig's generalized Springer correspondence of some results of [BrCh,BaCr,P].
We give a short proof based on Lusztig's generalized Springer correspondence of some results of [BrCh,BaCr,P].
△ Less
Submitted 19 January, 2021;
originally announced January 2021.
-
Subregular $J$-rings of Coxeter systems via quiver path algebras
Authors:
Ivan Dimitrov,
Charles Paquette,
David Wehlau,
Tianyuan Xu
Abstract:
We study the subregular $J$-ring $J_C$ of a Coxeter system $(W,S)$, a subring of Lusztig's $J$-ring. We prove that $J_C$ is isomorphic to a quotient of the path algebra of the double quiver of $(W,S)$ by a suitable ideal that we associate to a family of Chebyshev polynomials. As applications, we use quiver representations to study the category mod-$A_K$ of finite dimensional right modules of the a…
▽ More
We study the subregular $J$-ring $J_C$ of a Coxeter system $(W,S)$, a subring of Lusztig's $J$-ring. We prove that $J_C$ is isomorphic to a quotient of the path algebra of the double quiver of $(W,S)$ by a suitable ideal that we associate to a family of Chebyshev polynomials. As applications, we use quiver representations to study the category mod-$A_K$ of finite dimensional right modules of the algebra $A_K=K\otimes_\Z J_C$ over an algebraically closed field $K$ of characteristic zero. Our results include classifications of Coxeter systems for which mod-$A_K$ is semisimple, has finitely many simple modules up to isomorphism, or has a bound on the dimensions of simple modules. Incidentally, we show that every group algebra of a free product of finite cyclic groups is Morita equivalent to the algebra $A_K$ for a suitable Coxeter system; this allows us to specialize the classifications to the module categories of such group algebras.
△ Less
Submitted 17 January, 2021;
originally announced January 2021.
-
Adam revisited: a weighted past gradients perspective
Authors:
Hui Zhong,
Zaiyi Chen,
Chuan Qin,
Zai Huang,
Vincent W. Zheng,
Tong Xu,
Enhong Chen
Abstract:
Adaptive learning rate methods have been successfully applied in many fields, especially in training deep neural networks. Recent results have shown that adaptive methods with exponential increasing weights on squared past gradients (i.e., ADAM, RMSPROP) may fail to converge to the optimal solution. Though many algorithms, such as AMSGRAD and ADAMNC, have been proposed to fix the non-convergence i…
▽ More
Adaptive learning rate methods have been successfully applied in many fields, especially in training deep neural networks. Recent results have shown that adaptive methods with exponential increasing weights on squared past gradients (i.e., ADAM, RMSPROP) may fail to converge to the optimal solution. Though many algorithms, such as AMSGRAD and ADAMNC, have been proposed to fix the non-convergence issues, achieving a data-dependent regret bound similar to or better than ADAGRAD is still a challenge to these methods. In this paper, we propose a novel adaptive method weighted adaptive algorithm (WADA) to tackle the non-convergence issues. Unlike AMSGRAD and ADAMNC, we consider using a milder growing weighting strategy on squared past gradient, in which weights grow linearly. Based on this idea, we propose weighted adaptive gradient method framework (WAGMF) and implement WADA algorithm on this framework. Moreover, we prove that WADA can achieve a weighted data-dependent regret bound, which could be better than the original regret bound of ADAGRAD when the gradients decrease rapidly. This bound may partially explain the good performance of ADAM in practice. Finally, extensive experiments demonstrate the effectiveness of WADA and its variants in comparison with several variants of ADAM on training convex problems and deep neural networks.
△ Less
Submitted 1 January, 2021;
originally announced January 2021.