Search | arXiv e-print repository

Single Element Error Correction/ in a Euclidean Distance Matrix

Authors: Abdo Alfakih, Woosuk L. Jung, Henry Wolkowicz, Tina Xu

Abstract: We consider the \emph{exact} error correction of a noisy Euclidean distance matrix, EDM, where the elements are the squared distances between $n$ points in $R^d$. For our problem we are given two facts: (i) the embedding dimension, $d$, (ii) \emph{exactly one} distance in the data is corrupted by \emph{nonzero noise}. But we do \underline{not} know the magnitude nor position of the noise. Thus the… ▽ More We consider the \emph{exact} error correction of a noisy Euclidean distance matrix, EDM, where the elements are the squared distances between $n$ points in $R^d$. For our problem we are given two facts: (i) the embedding dimension, $d$, (ii) \emph{exactly one} distance in the data is corrupted by \emph{nonzero noise}. But we do \underline{not} know the magnitude nor position of the noise. Thus there is a combinatorial element to the problem. We present three solution techniques. These use three divide and conquer strategies in combination with three versions of facial reduction that use: exposing vectors, facial vectors, and Gale transforms. This sheds light on the connections between the various forms of facial reduction related to Gale transforms. Our highly successful empirics confirm the success of these approaches as we can solve huge problems of the order of $100,000$ nodes in approximately one minute to machine precision. \\Our algorithm depends on identifying whether a principal submatrix of the \EDM contains the corrupted element. We provide a theorem for doing this that is related to the existing results for identifying \emph{yielding} elements, i.e.,~we provide a characterization for guaranteeing the perturbed EDM remains an EDM with embedding dimension $d$. The characterization is particularly simple in the $d=2$ case. \\In addition, we characterize when the intuitive approach of the nearest EDM problem, solves our problem. In fact, we show that this happens if, and only if, the original distance element is $0$, degenerate, and the perturbation is negative. △ Less

Submitted 22 June, 2024; originally announced June 2024.

MSC Class: 51K05; 90C26; 90C46; 65K10; 15A48; 90C22

arXiv:2405.09764 [pdf, other]

Clearing time randomization and transaction fees for auction market design

Authors: Thibaut Mastrolia, Tianrui Xu

Abstract: Flaws of a continuous limit order book mechanism raise the question of whether a continuous trading session and a periodic auction session would bring better efficiency. This paper wants to go further in designing a periodic auction when both a continuous market and a periodic auction market are available to traders. In a periodic auction, we discover that a strategic trader could take advantage o… ▽ More Flaws of a continuous limit order book mechanism raise the question of whether a continuous trading session and a periodic auction session would bring better efficiency. This paper wants to go further in designing a periodic auction when both a continuous market and a periodic auction market are available to traders. In a periodic auction, we discover that a strategic trader could take advantage of the accumulated information available along the auction duration by arriving at the latest moment before the auction closes, increasing the price impact on the market. Such price impact moves the clearing price away from the efficient price and may disturb the efficiency of a periodic auction market. We thus propose and quantify the effect of two remedies to mitigate these flaws: randomizing the auction's closing time and optimally designing a transaction fees policy. Our results show that these policies encourage a strategic trader to send their orders earlier to enhance the efficiency of the auction market, illustrated by data extracted from Alphabet and Apple stocks. △ Less

Submitted 15 May, 2024; originally announced May 2024.

Comments: 30 pages, 11 figures

arXiv:2403.17669 [pdf, ps, other]

A scaling limit of the 2D parabolic Anderson model with exclusion interaction

Authors: Dirk Erhard, Martin Hairer, Tiecheng Xu

Abstract: We consider the (discrete) parabolic Anderson model $\partial u(t,x)/\partial t=Δu(t,x) +ξ_t(x) u(t,x)$, $t\geq 0$, $x\in \mathbb{Z}^d$. Here, the $ξ$-field is $\mathbb{R}$-valued, acting as a dynamic random environment, and $Δ$ represents the discrete Laplacian. We focus on the case where $ξ$ is given by a rescaled symmetric simple exclusion process which converges to an Ornstein--Uhlenbeck proce… ▽ More We consider the (discrete) parabolic Anderson model $\partial u(t,x)/\partial t=Δu(t,x) +ξ_t(x) u(t,x)$, $t\geq 0$, $x\in \mathbb{Z}^d$. Here, the $ξ$-field is $\mathbb{R}$-valued, acting as a dynamic random environment, and $Δ$ represents the discrete Laplacian. We focus on the case where $ξ$ is given by a rescaled symmetric simple exclusion process which converges to an Ornstein--Uhlenbeck process. By scaling the Laplacian diffusively and considering the equation on a torus, we demonstrate that in dimension $d=2$, when a suitably renormalized version of the above equation is considered, the sequence of solutions converges in law. This resolves an open problem from~\cite{EH23}, where a similar result was shown in the three-dimensional case. The novel contribution in the present work is the establishment of a gradient bound on the transition probability of a fixed but arbitrary number of labelled exclusion particles. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: arXiv admin note: text overlap with arXiv:2103.13479

arXiv:2403.14961 [pdf, ps, other]

Anderson Acceleration with Truncated Gram-Schmidt

Authors: Ziyuan Tang, Tianshi Xu, Huan He, Yousef Saad, Yuanzhe Xi

Abstract: Anderson Acceleration (AA) is a popular algorithm designed to enhance the convergence of fixed-point iterations. In this paper, we introduce a variant of AA based on a Truncated Gram-Schmidt process (AATGS) which has a few advantages over the classical AA. In particular, an attractive feature of AATGS is that its iterates obey a three-term recurrence in the situation when it is applied to solving… ▽ More Anderson Acceleration (AA) is a popular algorithm designed to enhance the convergence of fixed-point iterations. In this paper, we introduce a variant of AA based on a Truncated Gram-Schmidt process (AATGS) which has a few advantages over the classical AA. In particular, an attractive feature of AATGS is that its iterates obey a three-term recurrence in the situation when it is applied to solving symmetric linear problems and this can lead to a considerable reduction of memory and computational costs. We analyze the convergence of AATGS in both full-depth and limited-depth scenarios and establish its equivalence to the classical AA in the linear case. We also report on the effectiveness of AATGS through a set of numerical experiments, ranging from solving nonlinear partial differential equations to tackling nonlinear optimization problems. In particular, the performance of the method is compared with that of the classical AA algorithms. △ Less

Submitted 22 March, 2024; originally announced March 2024.

MSC Class: 65F10; 68W25; 65B99; 65N22

arXiv:2403.13984 [pdf, ps, other]

Singular Solutions for the Conformal Dirac-Einstein Problem on the Sphere

Authors: Ali Maalaoui, Vittorio Martino, Tian Xu

Abstract: In this paper we investigate the existence of singular solutions to the conformal Dirac-Einstein system. Because of its conformal invariance, there are many similarities with the classical construction of singular solutions for the Yamabe problem. We construct here a family of singular solutions, on the three-dimensional sphere, having exactly two singularities. In this paper we investigate the existence of singular solutions to the conformal Dirac-Einstein system. Because of its conformal invariance, there are many similarities with the classical construction of singular solutions for the Yamabe problem. We construct here a family of singular solutions, on the three-dimensional sphere, having exactly two singularities. △ Less

Submitted 20 March, 2024; originally announced March 2024.

Comments: 22 pages

MSC Class: 53C27; 35R01

arXiv:2402.06732 [pdf, ps, other]

A partial order on antichains of a fixed size

Authors: R. M. Green, Tianyuan Xu

Abstract: We introduce a new partial order on the set of all antichains of a fixed size in a given poset. When applied to minuscule posets, these partial orders give rise to distributive lattices that appear in the branching rules for minuscule representations of simply laced complex simple Lie algebras. We introduce a new partial order on the set of all antichains of a fixed size in a given poset. When applied to minuscule posets, these partial orders give rise to distributive lattices that appear in the branching rules for minuscule representations of simply laced complex simple Lie algebras. △ Less

Submitted 9 February, 2024; originally announced February 2024.

Comments: 12 pages, 2 figures

MSC Class: Primary: 06A07; Secondary: 05E10; 06A11

arXiv:2311.17106 [pdf, ps, other]

Level-Rank Dualities from $Φ$-Harish-Chandra Series and Affine Springer Fibers

Authors: Minh-Tâm Quang Trinh, Ting Xue

Abstract: For any generic finite reductive group $\mathbb{G}$, integer $e > 0$, and $Φ_e$-cuspidal pair $(\mathbb{L}, λ)$, Broué-Malle-Michel conjectured that the endomorphism rings of the Deligne-Lusztig representations attached to $\mathbb{G}, (\mathbb{L}, λ)$ all come from the same generic cyclotomic Hecke algebra. We propose a new conjecture about the Harish-Chandra theory of such pairs, involving two i… ▽ More For any generic finite reductive group $\mathbb{G}$, integer $e > 0$, and $Φ_e$-cuspidal pair $(\mathbb{L}, λ)$, Broué-Malle-Michel conjectured that the endomorphism rings of the Deligne-Lusztig representations attached to $\mathbb{G}, (\mathbb{L}, λ)$ all come from the same generic cyclotomic Hecke algebra. We propose a new conjecture about the Harish-Chandra theory of such pairs, involving two integers $e$ and $m$: namely, that the intersection of an $Φ_e$-Harish-Chandra series and a $Φ_m$-Harish-Chandra series is parametrized by both a union of $Φ_m$-blocks of the $Φ_e$-Hecke algebra and a union of $Φ_e$-blocks of the $Φ_m$-Hecke algebra, in a way that matches blocks. We also conjecture that when blocks match, there is an equivalence of categories between their highest-weight covers. When $e = 1$, we provide evidence that our bijections are essentially realized by bimodules that Oblomkov-Yun construct from the cohomology of affine Springer fibers. This suggests a strange analogy: Roughly, homogeneous affine Springer fibers are to roots of unity as tensor products of Deligne-Lusztig representations are to prime powers. We predict the generic Hecke parameters for arbitrary $Φ$-cuspidal pairs of the groups $\mathbb{G}\mathbb{L}_n$ and $\mathbb{G}\mathbb{U}_n$, unifying the known cases. We prove that they would imply our conjectural bijections for these groups and coprime $e, m$. Then we show that the bijections for $\mathbb{G}\mathbb{L}_n$ are related by affine permutations to Uglov's bijections between bases of higher-level Fock spaces. This relates our conjectural equivalences of categories to those conjectured by Chuang-Miyachi, and proved by several authors, under the name of level-rank duality. Finally, for many cases in exceptional types, we verify that the parameters predicted by Broué-Malle are compatible with our conjectures. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: 49 pages

arXiv:2311.08827 [pdf, other]

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

Authors: Daokuan Zhu, Tianqi Xu, Jie Lu

Abstract: In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a given specific problem. In this paper, we propose a learning-based method to achieve efficient distributed optimization over networked systems. Specifically, a… ▽ More In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a given specific problem. In this paper, we propose a learning-based method to achieve efficient distributed optimization over networked systems. Specifically, a deep reinforcement learning (DRL) framework is developed for adaptive configuration within a parameterized unifying algorithmic form, which incorporates an abundance of decentralized first-order and second-order optimization algorithms. We exploit the local consensus and objective information to represent the regularities of problem instances and trace the solving progress, which constitute the states observed by a DRL agent. The framework is trained using Proximal Policy Optimization (PPO) on a number of practical problem instances of similar structures yet different problem data. Experiments on various smooth and non-smooth classes of objective functions demonstrate that our proposed learning-based method outperforms several state-of-the-art distributed optimization algorithms in terms of convergence speed and solution accuracy. △ Less

Submitted 3 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

Comments: 10 pages, 5 figures

arXiv:2311.07108 [pdf, other]

Robust Probabilistic Prediction for Stochastic Dynamical Systems

Authors: Tao Xu, Jian** He

Abstract: It is critical and challenging to design robust predictors for stochastic dynamical systems (SDSs) with uncertainty quantification (UQ) in the prediction. Specifically, robustness guarantees the worst-case performance when the predictor's information set of the system is inadequate, and UQ characterizes how confident the predictor is about the predictions. However, it is difficult for traditional… ▽ More It is critical and challenging to design robust predictors for stochastic dynamical systems (SDSs) with uncertainty quantification (UQ) in the prediction. Specifically, robustness guarantees the worst-case performance when the predictor's information set of the system is inadequate, and UQ characterizes how confident the predictor is about the predictions. However, it is difficult for traditional robust predictors to provide robust UQ because they were designed to robustify the performance of point predictions. In this paper, we investigate how to robustify the probabilistic prediction for SDS, which can inherently provide robust distributional UQ. To characterize the performance of probabilistic predictors, we generalize the concept of likelihood function to likelihood functional, and prove that this metric is a proper scoring rule. Based on this metric, we propose a framework to quantify when the predictor is robust and analyze how the information set affects the robustness. Our framework makes it possible to design robust probabilistic predictors by solving functional optimization problems concerning different information sets. In particular, we design a class of moment-based optimal robust probabilistic predictors and provide a practical Kalman-filter-based algorithm for implementation. Extensive numerical simulations are provided to elaborate on our results. △ Less

Submitted 13 November, 2023; originally announced November 2023.

arXiv:2310.09591 [pdf, ps, other]

Idempotents in the group algebra of the infinite dihedral group

Authors: Ivan Dimitrov, Charles Paquette, David Wehlau, Tianyuan Xu

Abstract: We prove that over an algebraically closed field $\mathbb{K}$ of characteristic different from $2$, the group algebra $R=\mathbb{K} D_\infty$ of the infinite dihedral group $D_\infty$ has exactly six conjugacy classes of involutions (equivalently, of idempotents). This allows us to recover the fact that $R$ admits exactly four non-isomorphic indecomposable projective modules of the form $eR$ where… ▽ More We prove that over an algebraically closed field $\mathbb{K}$ of characteristic different from $2$, the group algebra $R=\mathbb{K} D_\infty$ of the infinite dihedral group $D_\infty$ has exactly six conjugacy classes of involutions (equivalently, of idempotents). This allows us to recover the fact that $R$ admits exactly four non-isomorphic indecomposable projective modules of the form $eR$ where $e$ is an idempotent, a result that was first established by Berman and Buzási. △ Less

Submitted 14 October, 2023; originally announced October 2023.

Comments: 6 pages

MSC Class: Primary: 20C07; Secondary: 16D40; 16S34; 16U40

arXiv:2308.06950 [pdf, ps, other]

Transient asymptotics of the modified Camassa-Holm equation

Authors: Taiyang Xu, Yiling Yang, Lun Zhang

Abstract: We investigate long time asymptotics of the modified Camassa-Holm equation in three transition zones under a nonzero background. The first transition zone lies between the soliton region and the first oscillatory region, the second one lies between the second oscillatory region and the fast decay region, and possibly, the third one, namely, the collisionless shock region, that bridges the first tr… ▽ More We investigate long time asymptotics of the modified Camassa-Holm equation in three transition zones under a nonzero background. The first transition zone lies between the soliton region and the first oscillatory region, the second one lies between the second oscillatory region and the fast decay region, and possibly, the third one, namely, the collisionless shock region, that bridges the first transition region and the first oscillatory region. Under a low regularity condition on the initial data, we obtain Painlevé-type asymptotic formulas in the first two transition regions, while the transient asymptotics in the third region involves the Jacobi theta function. We establish our results by performing a $\bar{\partial}$ nonlinear steepest descent analysis to the associated Riemann-Hilbert problem. △ Less

Submitted 14 August, 2023; originally announced August 2023.

Comments: 58 pages, 16 figures. Comments are welcome

MSC Class: 35Q53; 37K15; 34M50; 35Q15; 35B40; 37K40; 33E17; 34M55

arXiv:2308.01144 [pdf, other]

Optimal Mixed Strategies to the Zero-sum Linear Differential Game

Authors: Tao Xu, Wang Xi, Jian** He

Abstract: This paper exploits the weak approximation method to study a zero-sum linear differential game under mixed strategies. The stochastic nature of mixed strategies poses challenges in evaluating the game value and deriving the optimal strategies. To overcome these challenges, we first define the mixed strategy based on time discretization given the control period $δ$. Then, we design a stochastic dif… ▽ More This paper exploits the weak approximation method to study a zero-sum linear differential game under mixed strategies. The stochastic nature of mixed strategies poses challenges in evaluating the game value and deriving the optimal strategies. To overcome these challenges, we first define the mixed strategy based on time discretization given the control period $δ$. Then, we design a stochastic differential equation (SDE) to approximate the discretized game dynamic with a small approximation error of scale $\mathcal{O}(δ^2)$ in the weak sense. Moreover, we prove that the game payoff is also approximated in the same order of accuracy. Next, we solve the optimal mixed strategies and game values for the linear quadratic differential games. The effect of the control period is explicitly analyzed when the payoff is a terminal cost. Our results provide the first implementable form of the optimal mixed strategies for a zero-sum linear differential game. Finally, we provide numerical examples to illustrate and elaborate on our results. △ Less

Submitted 2 August, 2023; originally announced August 2023.

arXiv:2306.01559 [pdf, ps, other]

Non-compactness results for the spinorial Yamabe-type problems with non-smooth geometric data

Authors: Takeshi Isobe, Yannick Sire, Tian Xu

Abstract: Let $(M,\textit{g},σ)$ be an $m$-dimensional closed spin manifold, with a fixed Riemannian metric $\textit{g}$ and a fixed spin structure $σ$; let $\mathbb{S}(M)$ be the spinor bundle over $M$. The spinorial Yamabe-type problems address the solvability of the following equation \[ D_{\textit{g}}ψ= f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ, \quad ψ:M\to\mathbb{S}(M), \ x\in M \] where $D_{\textit{g}}$ i… ▽ More Let $(M,\textit{g},σ)$ be an $m$-dimensional closed spin manifold, with a fixed Riemannian metric $\textit{g}$ and a fixed spin structure $σ$; let $\mathbb{S}(M)$ be the spinor bundle over $M$. The spinorial Yamabe-type problems address the solvability of the following equation \[ D_{\textit{g}}ψ= f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ, \quad ψ:M\to\mathbb{S}(M), \ x\in M \] where $D_{\textit{g}}$ is the associated Dirac operator and $f:M\to\mathbb{R}$ is a given function. The study of such nonlinear equation is motivated by its important applications in Spin Geometry: when $m=2$, a solution corresponds to a conformal isometric immersion of the universal covering $\widetilde M$ into $\mathbb{R}^3$ with prescribed mean curvature $f$; meanwhile, for general dimensions and $f\equiv constant\neq0$, a solution provides an upper bound estimate for the Bär-Hijazi-Lott invariant. The aim of this paper is to establish non-compactness results related to the spinorial Yamabe-type problems. Precisely, concrete analysis is made for two specific models on the manifold $(S^m,\textit{g})$ where the solution set of the spinorial Yamabe-type problem is not compact: $1).$ the geometric potential $f$ is constant (say $f\equiv1$) with the background metric $\textit{g}$ being a $C^k$ perturbation of the canonical round metric $\textit{g}_{S^m}$, which is not conformally flat somewhere on $S^m$; $2).$ $f$ is a perturbation from constant and is of class $C^2$, while the background metric $\textit{g}\equiv\textit{g}_{S^m}$. △ Less

Submitted 2 June, 2023; originally announced June 2023.

Comments: 45 pages. arXiv admin note: text overlap with arXiv:2304.02807

MSC Class: 53C27; 35R01

arXiv:2304.13790 [pdf, ps, other]

Nonequilibrium Joint Fluctuations for Current and Occupation Time in The Symmetric Exclusion Process

Authors: Dirk Erhard, Tertuliano Franco, Tiecheng Xu

Abstract: We provide a full description for the joint fluctuations of current and occupation time in the one-dimensional nonequilibrium simple symmetric exclusion process, furnishing explicit formulas for the covariances of the limiting Gaussian process. The main novelties consist of a proof of the tightness of the nonequilibrium current based on new correlation estimates, refined estimates on the discrete… ▽ More We provide a full description for the joint fluctuations of current and occupation time in the one-dimensional nonequilibrium simple symmetric exclusion process, furnishing explicit formulas for the covariances of the limiting Gaussian process. The main novelties consist of a proof of the tightness of the nonequilibrium current based on new correlation estimates, refined estimates on the discrete gradient of the transition probabilities of the SSEP, and a nonequilibrium Kipnis-Varadhan Lemma based on a Fourier approach. △ Less

Submitted 26 April, 2023; originally announced April 2023.

Comments: 54 pages

MSC Class: 60F17; 60G15; 60J27

arXiv:2304.05460 [pdf, other]

An Adaptive Factorized Nyström Preconditioner for Regularized Kernel Matrices

Authors: Shifan Zhao, Tianshi Xu, Hua Huang, Edmond Chow, Yuanzhe Xi

Abstract: The spectrum of a kernel matrix significantly depends on the parameter values of the kernel function used to define the kernel matrix. This makes it challenging to design a preconditioner for a regularized kernel matrix that is robust across different parameter values. This paper proposes the Adaptive Factorized Nyström (AFN) preconditioner. The preconditioner is designed for the case where the ra… ▽ More The spectrum of a kernel matrix significantly depends on the parameter values of the kernel function used to define the kernel matrix. This makes it challenging to design a preconditioner for a regularized kernel matrix that is robust across different parameter values. This paper proposes the Adaptive Factorized Nyström (AFN) preconditioner. The preconditioner is designed for the case where the rank k of the Nyström approximation is large, i.e., for kernel function parameters that lead to kernel matrices with eigenvalues that decay slowly. AFN deliberately chooses a well-conditioned submatrix to solve with and corrects a Nyström approximation with a factorized sparse approximate matrix inverse. This makes AFN efficient for kernel matrices with large numerical ranks. AFN also adaptively chooses the size of this submatrix to balance accuracy and cost. △ Less

Submitted 9 April, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

arXiv:2304.04234 [pdf, other]

doi 10.1016/j.jmps.2024.105714

Variational operator learning: A unified paradigm marrying training neural operators and solving partial differential equations

Authors: Tengfei Xu, Dachuan Liu, Peng Hao, Bo Wang

Abstract: Neural operators as novel neural architectures for fast approximating solution operators of partial differential equations (PDEs), have shown considerable promise for future scientific computing. However, the mainstream of training neural operators is still data-driven, which needs an expensive ground-truth dataset from various sources (e.g., solving PDEs' samples with the conventional solvers, re… ▽ More Neural operators as novel neural architectures for fast approximating solution operators of partial differential equations (PDEs), have shown considerable promise for future scientific computing. However, the mainstream of training neural operators is still data-driven, which needs an expensive ground-truth dataset from various sources (e.g., solving PDEs' samples with the conventional solvers, real-world experiments) in addition to training stage costs. From a computational perspective, marrying operator learning and specific domain knowledge to solve PDEs is an essential step in reducing dataset costs and label-free learning. We propose a novel paradigm that provides a unified framework of training neural operators and solving PDEs with the variational form, which we refer to as the variational operator learning (VOL). Ritz and Galerkin approach with finite element discretization are developed for VOL to achieve matrix-free approximation of system functional and residual, then direct minimization and iterative update are proposed as two optimization strategies for VOL. Various types of experiments based on reasonable benchmarks about variable heat source, Darcy flow, and variable stiffness elasticity are conducted to demonstrate the effectiveness of VOL. With a label-free training set and a 5-label-only shift set, VOL learns solution operators with its test errors decreasing in a power law with respect to the amount of unlabeled data. To the best of the authors' knowledge, this is the first study that integrates the perspectives of the weak form and efficient iterative methods for solving sparse linear systems into the end-to-end operator learning task. △ Less

Submitted 9 November, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

Comments: This version mainly improves the quality of the bitmaps in the results compared to the previous version

arXiv:2304.02807 [pdf, ps, other]

Solutions of Spinorial Yamabe-type Problems on $S^m$: Perturbations and Applications

Authors: Takeshi Isobe, Tian Xu

Abstract: This paper is part of a program to establish the existence theory for the conformally invariant Dirac equation \[ D_{\textit{g}}ψ=f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ\] on a closed spin manifold $(M,\textit{g})$ of dimension $m\geq2$ with a fixed spin structure, where $f:M\to\mathbb{R}$ is a given function. The study on such nonlinear equation is motivated by its important applications in Spin Geom… ▽ More This paper is part of a program to establish the existence theory for the conformally invariant Dirac equation \[ D_{\textit{g}}ψ=f(x)|ψ|_{\textit{g}}^{\frac2{m-1}}ψ\] on a closed spin manifold $(M,\textit{g})$ of dimension $m\geq2$ with a fixed spin structure, where $f:M\to\mathbb{R}$ is a given function. The study on such nonlinear equation is motivated by its important applications in Spin Geometry: when $m=2$, a solution corresponds to an isometric immersion of the universal covering $\widetilde M$ into $\mathbb{R}^3$ with prescribed mean curvature $f$; meanwhile, for general dimensions and $f\equiv constant$, a solution provides an upper bound estimate for the Bär-Hijazi-Lott invariant. △ Less

Submitted 5 April, 2023; originally announced April 2023.

MSC Class: 53C27; 35R01

Journal ref: Trans. Amer. Math. Soc., 2023

arXiv:2303.08881 [pdf, other]

A Two-level GPU-Accelerated Incomplete LU Preconditioner for General Sparse Linear Systems

Authors: Tianshi Xu, Ruipeng Li, Daniel Osei-Kuffuor

Abstract: This paper presents a parallel preconditioning approach based on incomplete LU (ILU) factorizations in the framework of Domain Decomposition (DD) for general sparse linear systems. We focus on distributed memory parallel architectures, specifically, those that are equipped with graphic processing units (GPUs). In addition to block Jacobi, we present general purpose two-level ILU Schur complement-b… ▽ More This paper presents a parallel preconditioning approach based on incomplete LU (ILU) factorizations in the framework of Domain Decomposition (DD) for general sparse linear systems. We focus on distributed memory parallel architectures, specifically, those that are equipped with graphic processing units (GPUs). In addition to block Jacobi, we present general purpose two-level ILU Schur complement-based approaches, where different strategies are presented to solve the coarse-level reduced system. These strategies are combined with modified ILU methods in the construction of the coarse-level operator, in order to effectively remove smooth errors. We leverage available GPU-based sparse matrix kernels to accelerate the setup and the solve phases of the proposed ILU preconditioner. We evaluate the efficiency of the proposed methods as a smoother for algebraic multigrid (AMG) and as a preconditioner for Krylov subspace methods, on challenging anisotropic diffusion problems and a collection of general sparse matrices. △ Less

Submitted 15 March, 2023; originally announced March 2023.

Comments: 20 pages, 7 figures

MSC Class: G.1.3

arXiv:2303.04156 [pdf, other]

Computing with Categories in Machine Learning

Authors: Eli Sennesh, Tom Xu, Yoshihiro Maruyama

Abstract: Category theory has been successfully applied in various domains of science, shedding light on universal principles unifying diverse phenomena and thereby enabling knowledge transfer between them. Applications to machine learning have been pursued recently, and yet there is still a gap between abstract mathematical foundations and concrete applications to machine learning tasks. In this paper we i… ▽ More Category theory has been successfully applied in various domains of science, shedding light on universal principles unifying diverse phenomena and thereby enabling knowledge transfer between them. Applications to machine learning have been pursued recently, and yet there is still a gap between abstract mathematical foundations and concrete applications to machine learning tasks. In this paper we introduce DisCoPyro as a categorical structure learning framework, which combines categorical structures (such as symmetric monoidal categories and operads) with amortized variational inference, and can be applied, e.g., in program learning for variational autoencoders. We provide both mathematical foundations and concrete applications together with comparison of experimental performance with other models (e.g., neuro-symbolic models). We speculate that DisCoPyro could ultimately contribute to the development of artificial general intelligence. △ Less

Submitted 7 March, 2023; originally announced March 2023.

Comments: Submitted to AGI 2023

arXiv:2301.11414 [pdf, other]

A Simple Algorithm For Scaling Up Kernel Methods

Authors: Teng Andrea Xu, Bryan Kelly, Semyon Malamud

Abstract: The recent discovery of the equivalence between infinitely wide neural networks (NNs) in the lazy training regime and Neural Tangent Kernels (NTKs) (Jacot et al., 2018) has revived interest in kernel methods. However, conventional wisdom suggests kernel methods are unsuitable for large samples due to their computational complexity and memory requirements. We introduce a novel random feature regres… ▽ More The recent discovery of the equivalence between infinitely wide neural networks (NNs) in the lazy training regime and Neural Tangent Kernels (NTKs) (Jacot et al., 2018) has revived interest in kernel methods. However, conventional wisdom suggests kernel methods are unsuitable for large samples due to their computational complexity and memory requirements. We introduce a novel random feature regression algorithm that allows us (when necessary) to scale to virtually infinite numbers of random features. We illustrate the performance of our method on the CIFAR-10 dataset. △ Less

Submitted 30 January, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

arXiv:2301.04934 [pdf, ps, other]

doi 10.1007/s00526-022-02303-7

Curvature effect in the spinorial Yamabe problem on product manifolds

Authors: Thomas Bartsch, Tian Xu

Abstract: Let $(M_1,\textit{g}^{(1)})$, $(M_2,\textit{g}^{(2)})$ be closed Riemannian spin manifolds. We study the existence of solutions of the spinorial Yamabe problem on the product $M_1\times M_2$ equipped with a family of metrics $\varepsilon^{-2}\textit{g}^{(1)}\oplus\textit{g}^{(2)}$, $\varepsilon>0$. Via variational methods and blow-up techniques, we prove the existence of solutions which depend onl… ▽ More Let $(M_1,\textit{g}^{(1)})$, $(M_2,\textit{g}^{(2)})$ be closed Riemannian spin manifolds. We study the existence of solutions of the spinorial Yamabe problem on the product $M_1\times M_2$ equipped with a family of metrics $\varepsilon^{-2}\textit{g}^{(1)}\oplus\textit{g}^{(2)}$, $\varepsilon>0$. Via variational methods and blow-up techniques, we prove the existence of solutions which depend only on the factor $M_1$, and which exhibit a spike layer as $\varepsilon\to0$. Moreover, we locate the asymptotic position of the peak points of the solutions in terms of the curvature tensor on $(M_1,\textit{g}^{(1)})$. △ Less

Submitted 12 January, 2023; originally announced January 2023.

Comments: 38 Pages. arXiv admin note: text overlap with arXiv:2005.01448

MSC Class: 53C27; 35B40; 35Q40; 35R01; 58E30; 58J60

Journal ref: Calc. Var. Partial Differential Equations 61 (2022), no. 5, Paper No. 194, 35 pp

arXiv:2301.03757 [pdf, other]

Constructions of Delaunay-type solutions for the spinorial Yamabe equation on spheres

Authors: Ali Maalaoui, Yannick Sire, Tian Xu

Abstract: In this paper we construct singular solutions to the critical Dirac equation on spheres. More precisely, first we construct solutions admitting two points singularities that we call Delaunay-type solutions because of their similarities with the Delaunay solutions constructed for the singular Yamabe problem in \cite{MP1 , Schoen1989}. Then we construct another kind of singular solutions admitting a… ▽ More In this paper we construct singular solutions to the critical Dirac equation on spheres. More precisely, first we construct solutions admitting two points singularities that we call Delaunay-type solutions because of their similarities with the Delaunay solutions constructed for the singular Yamabe problem in \cite{MP1 , Schoen1989}. Then we construct another kind of singular solutions admitting a great circle as a singular set. These solutions are the building blocks for singular solutions on a general Spin manifold. △ Less

Submitted 9 January, 2023; originally announced January 2023.

arXiv:2212.00112 [pdf, other]

Approximation of the non-linear water hammer problem by a Lax-Wendroff finite difference scheme

Authors: Hugo Carrillo-Lincopi, Alden Waters, Teke Xu

Abstract: We study the water hammer problem in the case of a sudden closing of a valve upstream, and we consider a Lax-Wendroff finite difference scheme in order to obtain a numerical solution of this problem. In order to establish the approximation of this scheme to the original case, we rigorously show some properties such as consistency, stability and weak convergence of the scheme under reasonable condi… ▽ More We study the water hammer problem in the case of a sudden closing of a valve upstream, and we consider a Lax-Wendroff finite difference scheme in order to obtain a numerical solution of this problem. In order to establish the approximation of this scheme to the original case, we rigorously show some properties such as consistency, stability and weak convergence of the scheme under reasonable conditions. In addition, we present some numerical simulations in order to show some features of the numerical method. △ Less

Submitted 30 March, 2023; v1 submitted 30 November, 2022; originally announced December 2022.

Comments: 30 pages, 4 figures

arXiv:2209.07713 [pdf, ps, other]

The Ariki--Koike algebras and Rogers--Ramanujan type partitions

Authors: Shane Chern, Zhitai Li, Dennis Stanton, Ting Xue, Ae Ja Yee

Abstract: In 2000, Ariki and Mathas showed that the simple modules of the Ariki--Koike algebras $\mathcal{H}_{\mathbb{C},q;Q_1,\ldots, Q_m}\big(G(m, 1, n)\big)$ (when the parameters are roots of unity and $q\neq 1$) are labeled by the so-called Kleshchev multipartitions. This together with Ariki's categorification theorem enabled Ariki and Mathas to obtain the generating function for the number of Kleshchev… ▽ More In 2000, Ariki and Mathas showed that the simple modules of the Ariki--Koike algebras $\mathcal{H}_{\mathbb{C},q;Q_1,\ldots, Q_m}\big(G(m, 1, n)\big)$ (when the parameters are roots of unity and $q\neq 1$) are labeled by the so-called Kleshchev multipartitions. This together with Ariki's categorification theorem enabled Ariki and Mathas to obtain the generating function for the number of Kleshchev multipartitions by making use of the Weyl--Kac character formula. In this paper, we revisit this generating function for the $q=-1$ case. This $q=-1$ case is particularly interesting, for the corresponding Kleshchev multipartitions have a very close connection to generalized Rogers--Ramanujan type partitions when $Q_1=\cdots=Q_a=-1$ and $Q_{a+1}=\cdots =Q_m =1$. Based on this connection, we provide an analytic proof of the result of Ariki and Mathas for $q=Q_1=\cdots Q_a=-1$ and $Q_{a+1}=\cdots =Q_m =1$. Our second objective is to investigate simple modules of the Ariki--Koike algebra in a fixed block. It is known that these simple modules in a fixed block are labeled by the Kleshchev multiparitions with a fixed partition residue statistic. This partition statistic is also studied in the works of Berkovich, Garvan, and Uncu. Employing their results, we provide two bivariate generating function identities when $m=2$. △ Less

Submitted 16 September, 2022; originally announced September 2022.

arXiv:2205.10230 [pdf, ps, other]

doi 10.1016/j.physd.2022.133562

RAR-PINN algorithm for the data-driven vector-soliton solutions and parameter discovery of coupled nonlinear equations

Authors: Shu-Mei Qin, Min Li, Tao Xu, Shao-Qun Dong

Abstract: This work aims to provide an effective deep learning framework to predict the vector-soliton solutions of the coupled nonlinear equations and their interactions. The method we propose here is a physics-informed neural network (PINN) combining with the residual-based adaptive refinement (RAR-PINN) algorithm. Different from the traditional PINN algorithm which takes points randomly, the RAR-PINN alg… ▽ More This work aims to provide an effective deep learning framework to predict the vector-soliton solutions of the coupled nonlinear equations and their interactions. The method we propose here is a physics-informed neural network (PINN) combining with the residual-based adaptive refinement (RAR-PINN) algorithm. Different from the traditional PINN algorithm which takes points randomly, the RAR-PINN algorithm uses an adaptive point-fetching approach to improve the training efficiency for the solutions with steep gradients. A series of experiment comparisons between the RAR-PINN and traditional PINN algorithms are implemented to a coupled generalized nonlinear Schrödinger (CGNLS) equation as an example. The results indicate that the RAR-PINN algorithm has faster convergence rate and better approximation ability, especially in modeling the shape-changing vector-soliton interactions in the coupled systems. Finally, the RAR-PINN method is applied to perform the data-driven discovery of the CGNLS equation, which shows the dispersion and nonlinear coefficients can be well approximated. △ Less

Submitted 29 April, 2022; originally announced May 2022.

arXiv:2205.08917 [pdf, ps, other]

Representations of free products of semisimple algebras via quivers

Authors: Andrew Buchanan, Ivan Dimitrov, Olivia Grace, Charles Paquette, David Wehlau, Tianyuan Xu

Abstract: Let $\mathbb{K}$ denote an algebraically closed field and $A$ a free product of finitely many semisimple associative $\mathbb{K}$-algebras. We associate to $A$ a finite acyclic quiver $Γ$ and show that the category of finite dimensional $A$-modules is equivalent to a full subcategory of the category ${\rm rep}(Γ)$ of finite dimensional representations of $Γ$. Under this equivalence, the simple… ▽ More Let $\mathbb{K}$ denote an algebraically closed field and $A$ a free product of finitely many semisimple associative $\mathbb{K}$-algebras. We associate to $A$ a finite acyclic quiver $Γ$ and show that the category of finite dimensional $A$-modules is equivalent to a full subcategory of the category ${\rm rep}(Γ)$ of finite dimensional representations of $Γ$. Under this equivalence, the simple $A$-modules correspond exactly to the $θ$-stable representations of $Γ$ for some stability parameter $θ$. This gives us necessary conditions for an $A$-module to be simple, conditions which are also sufficient if the module is in general position. Even though there are indecomposable modules that are not simple, we prove that a module in general position is always semisimple. We also discuss the construction of arbitrary finite dimensional modules using nilpotent representations of quivers. Finally, we apply our results to the case of a free product of finite groups when $\mathbb{K}$ has characteristic zero. △ Less

Submitted 18 May, 2022; originally announced May 2022.

Comments: 25 pages

MSC Class: 16G20; 16D60 (Primary) 16S10; 16E60; 16G60; 20E06 (Secondary)

arXiv:2205.06006 [pdf, other]

Probabilistic Predictability of Stochastic Dynamical Systems: Metric, Optimality and Application

Authors: Tao Xu, Jian** He, Yushan Li

Abstract: To assess the quality of a probabilistic prediction for stochastic dynamical systems (SDSs), scoring rules assign a numerical score based on the predictive distribution and the measured state. In this paper, we propose an $ε$-logarithm score that generalizes the celebrated logarithm score by considering a neighborhood with radius $ε$. To begin with, we prove that the $ε$-logarithm score is proper… ▽ More To assess the quality of a probabilistic prediction for stochastic dynamical systems (SDSs), scoring rules assign a numerical score based on the predictive distribution and the measured state. In this paper, we propose an $ε$-logarithm score that generalizes the celebrated logarithm score by considering a neighborhood with radius $ε$. To begin with, we prove that the $ε$-logarithm score is proper (the expected score is optimized when the predictive distribution meets the ground truth) based on discrete approximations. Then, we characterize the probabilistic predictability of an SDS by the optimal expected score and approximate it with an error of scale $\mathcal{O}(ε)$. The approximation quantitatively shows how the system predictability is jointly determined by the neighborhood radius, the differential entropies of process noises, and the system dimension. In addition to the expected score, we also analyze the asymptotic behaviors of the score on individual trajectories. Specifically, we prove that the score on a trajectory will converge to the probabilistic predictability when the process noises are independent and identically distributed. Moreover, the convergence speed against the trajectory length $T$ is of scale $\mathcal{O}(T^{-\frac{1}{2}})$ in the sense of probability. Finally, we apply the predictability analysis to design unpredictable SDSs. Numerical examples are given to elaborate the results. △ Less

Submitted 9 December, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

arXiv:2205.04545 [pdf, other]

A Probabilistic Generative Model of Free Categories

Authors: Eli Sennesh, Tom Xu, Yoshihiro Maruyama

Abstract: Applied category theory has recently developed libraries for computing with morphisms in interesting categories, while machine learning has developed ways of learning programs in interesting languages. Taking the analogy between categories and languages seriously, this paper defines a probabilistic generative model of morphisms in free monoidal categories over domain-specific generating objects an… ▽ More Applied category theory has recently developed libraries for computing with morphisms in interesting categories, while machine learning has developed ways of learning programs in interesting languages. Taking the analogy between categories and languages seriously, this paper defines a probabilistic generative model of morphisms in free monoidal categories over domain-specific generating objects and morphisms. The paper shows how acyclic directed wiring diagrams can model specifications for morphisms, which the model can use to generate morphisms. Amortized variational inference in the generative model then enables learning of parameters (by maximum likelihood) and inference of latent variables (by Bayesian inversion). A concrete experiment shows that the free category prior achieves competitive reconstruction performance on the Omniglot dataset. △ Less

Submitted 13 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

Comments: Submitted to International Conference on Applied Category Theory 2022 (ACT 2022)

MSC Class: 18M35 ACM Class: I.2.6; I.2.4

arXiv:2205.03224 [pdf, other]

parGeMSLR: A Parallel Multilevel Schur Complement Low-Rank Preconditioning and Solution Package for General Sparse Matrices

Authors: Tianshi Xu, Vassilis Kalantzis, Ruipeng Li, Yuanzhe Xi, Geoffrey Dillon, Yousef Saad

Abstract: This paper discusses parGeMSLR, a C++/MPI software library for the solution of sparse systems of linear algebraic equations via preconditioned Krylov subspace methods in distributed-memory computing environments. The preconditioner implemented in parGeMSLR is based on algebraic domain decomposition and partitions the symmetrized adjacency graph recursively into several non-overlap** partitions v… ▽ More This paper discusses parGeMSLR, a C++/MPI software library for the solution of sparse systems of linear algebraic equations via preconditioned Krylov subspace methods in distributed-memory computing environments. The preconditioner implemented in parGeMSLR is based on algebraic domain decomposition and partitions the symmetrized adjacency graph recursively into several non-overlap** partitions via a p-way vertex separator, where p is an integer multiple of the total number of MPI processes. From a numerical perspective, parGeMSLR builds a Schur complement approximate inverse preconditioner as the sum between the matrix inverse of the interface coupling matrix and a low-rank correction term. To reduce the cost associated with the computation of the approximate inverse matrices, parGeMSLR exploits a multilevel partitioning of the algebraic domain. The parGeMSLR library is implemented on top of the Message Passing Interface and can solve both real and complex linear systems. Furthermore, parGeMSLR can take advantage of hybrid computing environments with in-node access to one or more Graphics Processing Units. Finally, the parallel efficiency (weak and strong scaling) of parGeMSLR is demonstrated on a few model problems arising from discretizations of 3D Partial Differential Equations. △ Less

Submitted 4 May, 2022; originally announced May 2022.

Comments: 14 pages, 11 figures

arXiv:2204.09765 [pdf, ps, other]

2-roots for simply laced Weyl groups

Authors: R. M. Green, Tianyuan Xu

Abstract: We introduce and study "2-roots", which are symmetrized tensor products of orthogonal roots of Kac--Moody algebras. We concentrate on the case where $W$ is the Weyl group of a simply laced Y-shaped Dynkin diagram $Y_{a,b,c}$ having $n$ vertices and with three branches of arbitrary finite lengths $a$, $b$ and $c$; special cases of this include types $D_n$, $E_n$ (for arbitrary $n \geq 6$), and affi… ▽ More We introduce and study "2-roots", which are symmetrized tensor products of orthogonal roots of Kac--Moody algebras. We concentrate on the case where $W$ is the Weyl group of a simply laced Y-shaped Dynkin diagram $Y_{a,b,c}$ having $n$ vertices and with three branches of arbitrary finite lengths $a$, $b$ and $c$; special cases of this include types $D_n$, $E_n$ (for arbitrary $n \geq 6$), and affine $E_6$, $E_7$ and $E_8$. We show that a natural codimension-$1$ submodule $M$ of the symmetric square of the reflection representation of $W$ has a remarkable canonical basis $\mathcal{B}$ that consists of 2-roots. We prove that, with respect to $\mathcal{B}$, every element of $W$ is represented by a column sign-coherent matrix in the sense of cluster algebras. If $W$ is a finite simply laced Weyl group, each $W$-orbit of 2-roots has a highest element, analogous to the highest root, and we calculate these elements explicitly. We prove that if $W$ is not of affine type, the module $M$ is completely reducible in characteristic zero and each of its nontrivial direct summands is spanned by a $W$-orbit of 2-roots. △ Less

Submitted 8 April, 2023; v1 submitted 20 April, 2022; originally announced April 2022.

Comments: Final version; to appear in Transformation Groups

MSC Class: 17B22; 20F55

arXiv:2204.01299 [pdf, ps, other]

On the large-time asymptotics of the defocusing mKdV equation with step-like initial data

Authors: Taiyang Xu

Abstract: It is concerned with the large-time asymptotics of the Cauchy problem of the defocusing modified Korteweg-de Vries (mKdV) equation with step-like initial data subject to compact perturbations, that is, \begin{align*} q_{0}(x)-q_{0c}(x)=0, \ \text{for} \ |x|>N \end{align*} with some positive $N$, where \begin{align*} q_{0c}(x)=\left\{ \begin{aligned} &c_{l}, \quad x\leqslant 0, &c_{r}, \q… ▽ More It is concerned with the large-time asymptotics of the Cauchy problem of the defocusing modified Korteweg-de Vries (mKdV) equation with step-like initial data subject to compact perturbations, that is, \begin{align*} q_{0}(x)-q_{0c}(x)=0, \ \text{for} \ |x|>N \end{align*} with some positive $N$, where \begin{align*} q_{0c}(x)=\left\{ \begin{aligned} &c_{l}, \quad x\leqslant 0, &c_{r}, \quad x>0, \end{aligned} \right. \end{align*} and $c_l>c_{r}>0$. It follows from the standard direct and inverse scattering theory that an RH characterization for the step-like problem is constructed. By performing the nonlinear steepest descent analysis, we mainly derive the large-time asymptotics in the each of four asymptotic zones in the $(x,t)$-half plane. △ Less

Submitted 28 June, 2024; v1 submitted 4 April, 2022; originally announced April 2022.

Comments: 38 pages, 15 figures, remove one of the authors

MSC Class: 35Q51; 35Q15; 35C20; 37K15; 37K40

arXiv:2202.05831 [pdf, ps, other]

Character sheaves for classical graded Lie algebras

Authors: Ting Xue

Abstract: In this note we study character sheaves for graded Lie algebras arising from inner automorphisms of special linear groups and Vinberg's type II classical graded Lie algebras. In this note we study character sheaves for graded Lie algebras arising from inner automorphisms of special linear groups and Vinberg's type II classical graded Lie algebras. △ Less

Submitted 11 February, 2022; originally announced February 2022.

Comments: 16 pages

arXiv:2201.11298 [pdf, ps, other]

On Limit Measures and Their Supports for Stochastic Ordinary Differential Equations

Authors: Tianyuan Xu, Lifeng Chen, Jifa Jiang

Abstract: This paper studies limit measures of stationary measures of stochastic ordinary differential equations on the Euclidean space and tries to determine which invariant measures of an unperturbed system will survive. Under the assumption for SODEs to admit the Freidlin-Wentzell or Dembo-Zeitouni large deviations principle with weaker compactness condition, we prove that limit measures are concentrated… ▽ More This paper studies limit measures of stationary measures of stochastic ordinary differential equations on the Euclidean space and tries to determine which invariant measures of an unperturbed system will survive. Under the assumption for SODEs to admit the Freidlin-Wentzell or Dembo-Zeitouni large deviations principle with weaker compactness condition, we prove that limit measures are concentrated away from repellers which are topologically transitive, or equivalent classes, or admit Lebesgue measure zero. We also preclude concentrations of limit measures on acyclic saddle or trap chains. This illustrates that limit measures are concentrated on Liapunov stable compact invariant sets. Applications are made to the Morse-Smale systems, the Axiom A systems including structural stability systems and separated star systems, the gradient or gradient-like systems, those systems possessing the Poincare-Bendixson property with a finite number of limit sets to obtain that limit measures live on Liapunov stable critical elements, Liapunov stable basic sets, Liapunov stable equilibria and Liapunov stable limit sets including equilibria, limit cycles and saddle or trap cycles, respectively. A number of nontrivial examples admitting a unique limit measure are provided, which include monostable, multistable systems and those possessing infinite equivalent classes. △ Less

Submitted 26 January, 2022; originally announced January 2022.

Comments: 37 pages, 12 figures

MSC Class: 60H10; 37B35; 60B10; 60F10; 37A50; 37C70

arXiv:2112.03640 [pdf, ps, other]

On the Bär-Hijazi-Lott invariant for the Dirac operator and a spinorial proof of the Yamabe problem

Authors: Yannick Sire, Tian Xu

Abstract: Let $M$ be a closed spin manifold of dimension $m\geq6$ equipped with a Riemannian metric $\ig$ and a spin structure $\sa$. Let $\lm_1^+(\tilde\ig)$ be the smallest positive eigenvalue of the Dirac operator $D_{\tilde\ig}$ on $M$ with respect to a metric $\tilde\ig$ conformal to $\ig$. The Bär-Hijazi-Lott invariant is defined by… ▽ More Let $M$ be a closed spin manifold of dimension $m\geq6$ equipped with a Riemannian metric $\ig$ and a spin structure $\sa$. Let $\lm_1^+(\tilde\ig)$ be the smallest positive eigenvalue of the Dirac operator $D_{\tilde\ig}$ on $M$ with respect to a metric $\tilde\ig$ conformal to $\ig$. The Bär-Hijazi-Lott invariant is defined by $\lm_{min}^+(M,\ig,\sa)=\inf_{\tilde\ig\in[\ig]}\lm_1^+(\tilde\ig)\Vol(M,\tilde\ig)^\frac{1}{m}$. In this paper, we show that \[ \lm_{min}^+(M,\ig,\sa)<\lm_{min}^+(S^m,\ig_{S^m},\sa_{S^m})=\frac m2\Vol(S^m,\ig_{S^m})^{\frac1m} \] provided that $\ig$ is not locally conformally flat. This estimate is a spinorial analogue to an estimate by T. Aubin, solving the Yamabe problem in this setting. △ Less

Submitted 7 December, 2021; originally announced December 2021.

arXiv:2111.08804 [pdf, ps, other]

Additive functionals of exclusion processes from non-equilibrium

Authors: Luiz Renato Fontes, Tiecheng Xu

Abstract: Consider the weakly asymmetric simple exclusion processes on the one-dimensional torus. We study the non-equilibrium fluctuation of a class of additive functionals, and show that its scaling limit is a Gaussian process. The proof is mainly based on the results obtained and techniques developed by Jara and Menezes [Non-equiliburim fluctuations of interacting particle systems, arXiv:1810.09526]. Consider the weakly asymmetric simple exclusion processes on the one-dimensional torus. We study the non-equilibrium fluctuation of a class of additive functionals, and show that its scaling limit is a Gaussian process. The proof is mainly based on the results obtained and techniques developed by Jara and Menezes [Non-equiliburim fluctuations of interacting particle systems, arXiv:1810.09526]. △ Less

Submitted 1 June, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

Comments: The main result of this paper is extended from occupation time to additive functionals. The title is changed. More details are added

arXiv:2111.08403 [pdf, ps, other]

Invariant systems and character sheaves for graded Lie algebras

Authors: Kari Vilonen, Ting Xue

Abstract: In this paper we explain how to construct all the character sheaves for type I graded classical Lie algebras which we expect to be cuspidal. A new ingredient is the use of invariant systems of differential equations. In this paper we explain how to construct all the character sheaves for type I graded classical Lie algebras which we expect to be cuspidal. A new ingredient is the use of invariant systems of differential equations. △ Less

Submitted 16 November, 2021; originally announced November 2021.

arXiv:2111.00403 [pdf, ps, other]

Character sheaves for symmetric pairs: spin groups

Authors: Ting Xue

Abstract: We determine character sheaves for symmetric pairs associated to spin groups. In particular, we determine the cupsidal character sheaves and show that they can be obtained via the nearby cycle construction of [GVX] and its generalisation in [VX2]. We determine character sheaves for symmetric pairs associated to spin groups. In particular, we determine the cupsidal character sheaves and show that they can be obtained via the nearby cycle construction of [GVX] and its generalisation in [VX2]. △ Less

Submitted 31 October, 2021; originally announced November 2021.

arXiv:2110.13451 [pdf, ps, other]

Character sheaves for symmetric pairs: special linear groups

Authors: Kari Vilonen, Ting Xue

Abstract: We give an explicit description of character sheaves for the symmetric pairs associated to inner involutions of the special linear groups. We make use of the general strategy given in [VX1] and central character consideration. We also determine the cuspidal character sheaves. We give an explicit description of character sheaves for the symmetric pairs associated to inner involutions of the special linear groups. We make use of the general strategy given in [VX1] and central character consideration. We also determine the cuspidal character sheaves. △ Less

Submitted 26 October, 2021; originally announced October 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:1806.02506

arXiv:2110.10351 [pdf, other]

Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process

Authors: Tianjiao Li, Ziwei Guan, Shaofeng Zou, Tengyu Xu, Yingbin Liang, Guanghui Lan

Abstract: The problem of constrained Markov decision process (CMDP) is investigated, where an agent aims to maximize the expected accumulated discounted reward subject to multiple constraints on its utilities/costs. A new primal-dual approach is proposed with a novel integration of three ingredients: entropy regularized policy optimizer, dual variable regularizer, and Nesterov's accelerated gradient descent… ▽ More The problem of constrained Markov decision process (CMDP) is investigated, where an agent aims to maximize the expected accumulated discounted reward subject to multiple constraints on its utilities/costs. A new primal-dual approach is proposed with a novel integration of three ingredients: entropy regularized policy optimizer, dual variable regularizer, and Nesterov's accelerated gradient descent dual optimizer, all of which are critical to achieve a faster convergence. The finite-time error bound of the proposed approach is characterized. Despite the challenge of the nonconcave objective subject to nonconcave constraints, the proposed approach is shown to converge to the global optimum with a complexity of $\tilde{\mathcal O}(1/ε)$ in terms of the optimality gap and the constraint violation, which improves the complexity of the existing primal-dual approach by a factor of $\mathcal O(1/ε)$ \citep{ding2020natural,paternain2019constrained}. This is the first demonstration that nonconcave CMDP problems can attain the complexity lower bound of $\mathcal O(1/ε)$ for convex optimization subject to convex constraints. Our primal-dual approach and non-asymptotic analysis are agnostic to the RL optimizer used, and thus are more flexible for practical applications. More generally, our approach also serves as the first algorithm that provably accelerates constrained nonconvex optimization with zero duality gap by exploiting the geometries such as the gradient dominance condition, for which the existing acceleration methods for constrained convex optimization are not applicable. △ Less

Submitted 19 October, 2021; originally announced October 2021.

Comments: The paper was initially submitted for publication in January 2021

arXiv:2109.14491 [pdf, ps, other]

Hydrodynamic limit of Exclusion Processes with slow boundaries on hypercubes

Authors: Tiecheng Xu

Abstract: We study the hydrodynamic limit of SSEP with slow boundaries on hypercubes in dimension at least two. The hydrodynamic limit equation is shown to be a heat equation with three different types of boundary conditions according to the slowness of the boundary dynamics.The proof is based on Yau's relative entropy method. We study the hydrodynamic limit of SSEP with slow boundaries on hypercubes in dimension at least two. The hydrodynamic limit equation is shown to be a heat equation with three different types of boundary conditions according to the slowness of the boundary dynamics.The proof is based on Yau's relative entropy method. △ Less

Submitted 29 September, 2021; originally announced September 2021.

Comments: 18 pages

arXiv:2109.09803 [pdf, ps, other]

Kazhdan--Lusztig cells of $\mathbf{a}$-value 2 in $\mathbf{a}(2)$-finite Coxeter systems

Authors: R. M. Green, Tianyuan Xu

Abstract: A Coxeter group is said to be \emph{$\mathbf{a}(2)$-finite} if it has finitely many elements of $\mathbf{a}$-value 2 in the sense of Lusztig. In this paper, we give explicit combinatorial descriptions of the left, right, and two-sided Kazhdan--Lusztig cells of $\mathbf{a}$-value 2 in an irreducible $\mathbf{a}(2)$-finite Coxeter group. In particular, we introduce elements we call \emph{stubs} to p… ▽ More A Coxeter group is said to be \emph{$\mathbf{a}(2)$-finite} if it has finitely many elements of $\mathbf{a}$-value 2 in the sense of Lusztig. In this paper, we give explicit combinatorial descriptions of the left, right, and two-sided Kazhdan--Lusztig cells of $\mathbf{a}$-value 2 in an irreducible $\mathbf{a}(2)$-finite Coxeter group. In particular, we introduce elements we call \emph{stubs} to parameterize the one-sided cells and we characterize the one-sided cells via both star operations and weak Bruhat orders. We also compute the cardinalities of all the one-sided and two-sided cells. △ Less

Submitted 25 May, 2023; v1 submitted 20 September, 2021; originally announced September 2021.

Comments: Final version; to appear in Algebraic Combinatorics

MSC Class: Primary: 20F55; Secondary: 20C08

arXiv:2108.06284 [pdf, other]

doi 10.1016/j.jde.2023.06.038

On the Cauchy problem of defocusing mKdV equation with finite density initial data: long time asymptotics in soliton-less regions

Authors: Taiyang Xu, Zechuan Zhang, Engui Fan

Abstract: We investigate the long-time asymptotics for the solutions to the Cauchy problem of defocusing modified Kortweg-de Vries (mKdV) equation with finite density initial data. The present paper is the subsequent work of our previous paper [arXiv:2108.03650], which gives the soliton resolution for the defocusing mKdV equation in the central asymptotic sector $\{(x,t): \vert ξ\vert<6\}$ with $ξ:=x/t$. In… ▽ More We investigate the long-time asymptotics for the solutions to the Cauchy problem of defocusing modified Kortweg-de Vries (mKdV) equation with finite density initial data. The present paper is the subsequent work of our previous paper [arXiv:2108.03650], which gives the soliton resolution for the defocusing mKdV equation in the central asymptotic sector $\{(x,t): \vert ξ\vert<6\}$ with $ξ:=x/t$. In the present paper, via the Riemann-Hilbert (RH) problem associated to the Cauchy problem, the long-time asymptotics in the soliton-less regions $\{(x,t): \vert ξ\vert>6, |ξ|=\mathcal{O}(1)\}$ for the defocusing mKdV equation are further obtained. It is shown that the leading term of the asymptotics are in compatible with the ``background solution'' and the error terms are derived via rigorous analysis. △ Less

Submitted 23 June, 2023; v1 submitted 13 August, 2021; originally announced August 2021.

Comments: 51 pages

MSC Class: 35Q51; 35Q15; 35C20; 37K15; 37K40

Journal ref: J. Differential Equations. 372 (2023), 55-122

arXiv:2108.03650 [pdf, other]

Soliton resolution and asymptotic stability of $N$-soliton solutions for the defocusing mKdV equation with finite density type initial data

Authors: Zechuan Zhang, Taiyang Xu, Engui Fan

Abstract: We consider the Cauchy problem for the defocusing modified Korteweg-de Vries (mKdV) equation with finite density type initial data. With the $\bar{\partial}$ generalization of the nonlinear steepest descent method of Deift and Zhou, we extrapolate the leading order approximation to the solution of mKdV for large time in the solitonic space-time region $|x/t+4|<2$, and we give bounds for the error… ▽ More We consider the Cauchy problem for the defocusing modified Korteweg-de Vries (mKdV) equation with finite density type initial data. With the $\bar{\partial}$ generalization of the nonlinear steepest descent method of Deift and Zhou, we extrapolate the leading order approximation to the solution of mKdV for large time in the solitonic space-time region $|x/t+4|<2$, and we give bounds for the error which decay as $t\rightarrow\infty$ for a general class of initial data whose difference from the non-vanishing background possesses a fixed number of finite moments. Our results provide a verification of the soliton resolution conjecture and asymptotic stability of $N$-soliton solutions for mKdV equation with finite density type initial data. △ Less

Submitted 20 August, 2021; v1 submitted 8 August, 2021; originally announced August 2021.

Comments: 48 pages. arXiv admin note: substantial text overlap with arXiv:1410.6887 by other authors

arXiv:2107.02711 [pdf, ps, other]

A Unified Off-Policy Evaluation Approach for General Value Function

Authors: Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

Abstract: General Value Function (GVF) is a powerful tool to represent both the {\em predictive} and {\em retrospective} knowledge in reinforcement learning (RL). In practice, often multiple interrelated GVFs need to be evaluated jointly with pre-collected off-policy samples. In the literature, the gradient temporal difference (GTD) learning method has been adopted to evaluate GVFs in the off-policy setting… ▽ More General Value Function (GVF) is a powerful tool to represent both the {\em predictive} and {\em retrospective} knowledge in reinforcement learning (RL). In practice, often multiple interrelated GVFs need to be evaluated jointly with pre-collected off-policy samples. In the literature, the gradient temporal difference (GTD) learning method has been adopted to evaluate GVFs in the off-policy setting, but such an approach may suffer from a large estimation error even if the function approximation class is sufficiently expressive. Moreover, none of the previous work have formally established the convergence guarantee to the ground truth GVFs under the function approximation settings. In this paper, we address both issues through the lens of a class of GVFs with causal filtering, which cover a wide range of RL applications such as reward variance, value gradient, cost in anomaly detection, stationary distribution gradient, etc. We propose a new algorithm called GenTD for off-policy GVFs evaluation and show that GenTD learns multiple interrelated multi-dimensional GVFs as efficiently as a single canonical scalar value function. We further show that unlike GTD, the learned GVFs by GenTD are guaranteed to converge to the ground truth GVFs as long as the function approximation power is sufficiently large. To our best knowledge, GenTD is the first off-policy GVF evaluation algorithm that has global optimality guarantee. △ Less

Submitted 6 July, 2021; originally announced July 2021.

Comments: submitted for publication

arXiv:2103.05660 [pdf, other]

Identifiability Analysis of Linear Ordinary Differential Equation Systems with a Single Trajectory

Authors: Xing Qiu, Tao Xu, Babak Soltanalizadeh, Hulin Wu

Abstract: Ordinary differential equations (ODEs) are widely used to model dynamical behavior of systems. It is important to perform identifiability analysis prior to estimating unknown parameters in ODEs (a.k.a. inverse problem), because if a system is unidentifiable, the estimation procedure may fail or produce erroneous and misleading results. Although several qualitative identifiability measures have b… ▽ More Ordinary differential equations (ODEs) are widely used to model dynamical behavior of systems. It is important to perform identifiability analysis prior to estimating unknown parameters in ODEs (a.k.a. inverse problem), because if a system is unidentifiable, the estimation procedure may fail or produce erroneous and misleading results. Although several qualitative identifiability measures have been proposed, much less effort has been given to develo** \emph{quantitative} (continuous) scores that are robust to uncertainties in the data, especially for those cases in which the data are presented as a single trajectory beginning with one initial value. In this paper, we first derived a closed-form representation of linear ODE systems that are not identifiable based on a single trajectory. This representation helps researchers design practical systems and choose the right prior structural information in practice. Next, we proposed several quantitative scores for identifiability analysis in practice. In simulation studies, the proposed measures outperformed the main competing method significantly, especially when noise was presented in the data. We also discussed the asymptotic properties of practical identifiability for high-dimensional ODE systems and conclude that, without additional prior information, many random ODE systems are practically unidentifiable when the dimension approaches infinity. △ Less

Submitted 9 March, 2021; originally announced March 2021.

Comments: 40 pages, 6 figures, one Supplementary Text

MSC Class: 34A30; 34A55; 34F05; 93B30; 93C05

arXiv:2103.04966 [pdf, ps, other]

doi 10.1088/1361-6544/ac72e8

Critical Sharp Front for Doubly Nonlinear Degenerate Diffusion Equations with Time Delay

Authors: Tianyuan Xu, Shanming Ji, Ming Mei, **gxue Yin

Abstract: This paper is concerned with the critical sharp traveling wave for doubly nonlinear diffusion equation with time delay, where the doubly nonlinear degenerate diffusion is defined by $\Big(\big|(u^m)_x\big|^{p-2}(u^m)_x\Big)_x$ with $m>0$ and $p>1$. The doubly nonlinear diffusion equation is proved to admit a unique sharp type traveling wave for the degenerate case $m(p-1)>1$, the so-called slow-di… ▽ More This paper is concerned with the critical sharp traveling wave for doubly nonlinear diffusion equation with time delay, where the doubly nonlinear degenerate diffusion is defined by $\Big(\big|(u^m)_x\big|^{p-2}(u^m)_x\Big)_x$ with $m>0$ and $p>1$. The doubly nonlinear diffusion equation is proved to admit a unique sharp type traveling wave for the degenerate case $m(p-1)>1$, the so-called slow-diffusion case. This sharp traveling wave associated with the minimal wave speed $c^*(m,p,r)$ is monotonically increasing, where the minimal wave speed satisfies $c^*(m,p,r)<c^*(m,p,0)$ for any time delay $r>0$. The sharp front is $C^1$-smooth for $\frac{1}{p-1}<m< \frac{p}{p-1}$, and piecewise smooth for $m\ge \frac{p}{p-1}$. Our results indicate that time delay slows down the minimal traveling wave speed for the doubly nonlinear degenerate diffusion equations. The approach adopted for proof is the phase transform method combining the variational method. The main technical issue for the proof is to overcome the obstacle caused by the doubly nonlinear degenerate diffusion. △ Less

Submitted 8 March, 2021; originally announced March 2021.

Comments: arXiv admin note: text overlap with arXiv:1909.11751

arXiv:2102.04653 [pdf, ps, other]

Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry

Authors: Ziyi Chen, Yi Zhou, Tengyu Xu, Yingbin Liang

Abstract: The gradient descent-ascent (GDA) algorithm has been widely applied to solve minimax optimization problems. In order to achieve convergent policy parameters for minimax optimization, it is important that GDA generates convergent variable sequences rather than convergent sequences of function values or gradient norms. However, the variable convergence of GDA has been proved only under convexity geo… ▽ More The gradient descent-ascent (GDA) algorithm has been widely applied to solve minimax optimization problems. In order to achieve convergent policy parameters for minimax optimization, it is important that GDA generates convergent variable sequences rather than convergent sequences of function values or gradient norms. However, the variable convergence of GDA has been proved only under convexity geometries, and there lacks understanding for general nonconvex minimax optimization. This paper fills such a gap by studying the convergence of a more general proximal-GDA for regularized nonconvex-strongly-concave minimax optimization. Specifically, we show that proximal-GDA admits a novel Lyapunov function, which monotonically decreases in the minimax optimization process and drives the variable sequence to a critical point. By leveraging this Lyapunov function and the KŁ geometry that parameterizes the local geometries of general nonconvex functions, we formally establish the variable convergence of proximal-GDA to a critical point $x^*$, i.e., $x_t\to x^*, y_t\to y^*(x^*)$. Furthermore, over the full spectrum of the KŁ-parameterized geometry, we show that proximal-GDA achieves different types of convergence rates ranging from sublinear convergence up to finite-step convergence, depending on the geometry associated with the KŁ parameter. This is the first theoretical result on the variable convergence for nonconvex minimax optimization. △ Less

Submitted 17 February, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

Comments: To appear in ICLR 2021

arXiv:2101.08652 [pdf, ps, other]

A note on Hessenberg varieties

Authors: Kari Vilonen, Ting Xue

Abstract: We give a short proof based on Lusztig's generalized Springer correspondence of some results of [BrCh,BaCr,P]. We give a short proof based on Lusztig's generalized Springer correspondence of some results of [BrCh,BaCr,P]. △ Less

Submitted 19 January, 2021; originally announced January 2021.

arXiv:2101.06851 [pdf, ps, other]

Subregular $J$-rings of Coxeter systems via quiver path algebras

Authors: Ivan Dimitrov, Charles Paquette, David Wehlau, Tianyuan Xu

Abstract: We study the subregular $J$-ring $J_C$ of a Coxeter system $(W,S)$, a subring of Lusztig's $J$-ring. We prove that $J_C$ is isomorphic to a quotient of the path algebra of the double quiver of $(W,S)$ by a suitable ideal that we associate to a family of Chebyshev polynomials. As applications, we use quiver representations to study the category mod-$A_K$ of finite dimensional right modules of the a… ▽ More We study the subregular $J$-ring $J_C$ of a Coxeter system $(W,S)$, a subring of Lusztig's $J$-ring. We prove that $J_C$ is isomorphic to a quotient of the path algebra of the double quiver of $(W,S)$ by a suitable ideal that we associate to a family of Chebyshev polynomials. As applications, we use quiver representations to study the category mod-$A_K$ of finite dimensional right modules of the algebra $A_K=K\otimes_\Z J_C$ over an algebraically closed field $K$ of characteristic zero. Our results include classifications of Coxeter systems for which mod-$A_K$ is semisimple, has finitely many simple modules up to isomorphism, or has a bound on the dimensions of simple modules. Incidentally, we show that every group algebra of a free product of finite cyclic groups is Morita equivalent to the algebra $A_K$ for a suitable Coxeter system; this allows us to specialize the classifications to the module categories of such group algebras. △ Less

Submitted 17 January, 2021; originally announced January 2021.

Comments: 49 pages, 7 figures

MSC Class: Primary: 20C08; 16G20; Secondary: 16D60; 20C07; 20E06

arXiv:2101.00238 [pdf, other]

doi 10.1007/s11704-019-8457-x

Adam revisited: a weighted past gradients perspective

Authors: Hui Zhong, Zaiyi Chen, Chuan Qin, Zai Huang, Vincent W. Zheng, Tong Xu, Enhong Chen

Abstract: Adaptive learning rate methods have been successfully applied in many fields, especially in training deep neural networks. Recent results have shown that adaptive methods with exponential increasing weights on squared past gradients (i.e., ADAM, RMSPROP) may fail to converge to the optimal solution. Though many algorithms, such as AMSGRAD and ADAMNC, have been proposed to fix the non-convergence i… ▽ More Adaptive learning rate methods have been successfully applied in many fields, especially in training deep neural networks. Recent results have shown that adaptive methods with exponential increasing weights on squared past gradients (i.e., ADAM, RMSPROP) may fail to converge to the optimal solution. Though many algorithms, such as AMSGRAD and ADAMNC, have been proposed to fix the non-convergence issues, achieving a data-dependent regret bound similar to or better than ADAGRAD is still a challenge to these methods. In this paper, we propose a novel adaptive method weighted adaptive algorithm (WADA) to tackle the non-convergence issues. Unlike AMSGRAD and ADAMNC, we consider using a milder growing weighting strategy on squared past gradient, in which weights grow linearly. Based on this idea, we propose weighted adaptive gradient method framework (WAGMF) and implement WADA algorithm on this framework. Moreover, we prove that WADA can achieve a weighted data-dependent regret bound, which could be better than the original regret bound of ADAGRAD when the gradients decrease rapidly. This bound may partially explain the good performance of ADAM in practice. Finally, extensive experiments demonstrate the effectiveness of WADA and its variants in comparison with several variants of ADAM on training convex problems and deep neural networks. △ Less

Submitted 1 January, 2021; originally announced January 2021.

Comments: Zhong, Hui, et al. "Adam revisited: a weighted past gradients perspective." Frontiers of Computer Science 14.5 (2020): 1-16

Journal ref: Front. Comput. Sci. 14, 145309 (2020)

Showing 1–50 of 99 results for author: Xu, T