-
Separable integer partition classes and partitions with congruence conditions
Authors:
Thomas Y. He,
C. S. Huang,
H. X. Li,
X. Zhang
Abstract:
In this article, we first investigate the partitions whose parts are congruent to $a$ or $b$ modulo $k$ with the aid of separable integer partition classes with modulus $k$ introduced by Andrews. Then, we introduce the $(k,r)$-overpartitions in which only parts equivalent to $r$ modulo $k$ may be overlined and we will show that the number of $(k,k)$-overpartitions of $n$ equals the number of parti…
▽ More
In this article, we first investigate the partitions whose parts are congruent to $a$ or $b$ modulo $k$ with the aid of separable integer partition classes with modulus $k$ introduced by Andrews. Then, we introduce the $(k,r)$-overpartitions in which only parts equivalent to $r$ modulo $k$ may be overlined and we will show that the number of $(k,k)$-overpartitions of $n$ equals the number of partitions of $n$ such that the $k$-th occurrence of a part may be overlined. Finally, we extend separable integer partition classes with modulus $k$ to overpartitions and then give the generating function for $(k,r)$-modulo overpartitions, which are the $(k,r)$-overpartitions satisfying certain congruence conditions.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Total Positivity of Quasi-Riordan Arrays
Authors:
Tian-Xiao He,
Roksana Słowik
Abstract:
In this paper the total positivity of quasi-Riordan arrays is investigated with use of the sequence characterization of quasi-Riordan arrays. Due to the correlation between quasi-Riordan arrays and Riordan arrays, this study is an in-depth discussion of the total positivity of Riordan arrays.
In this paper the total positivity of quasi-Riordan arrays is investigated with use of the sequence characterization of quasi-Riordan arrays. Due to the correlation between quasi-Riordan arrays and Riordan arrays, this study is an in-depth discussion of the total positivity of Riordan arrays.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Total Positivity of Almost-Riordan Arrays
Authors:
Tian-Xiao He,
Roksana Słowik
Abstract:
In this paper we study the total positivity of almost-Riordan arrays $(d(t)|\, g(t), f(t))$ and establish its necessary conditions and sufficient conditions, particularly, for some well used formal power series $d(t)$. We present a semidirect product of an almost-array and use it to transfer a total positivity problem for an almost-Riordan array to the total positivity problem for a quasi-Riordan…
▽ More
In this paper we study the total positivity of almost-Riordan arrays $(d(t)|\, g(t), f(t))$ and establish its necessary conditions and sufficient conditions, particularly, for some well used formal power series $d(t)$. We present a semidirect product of an almost-array and use it to transfer a total positivity problem for an almost-Riordan array to the total positivity problem for a quasi-Riordan array. We find the sequence characterization of total positivity of the almost-Riordan arrays. The production matrix $J$ of an almost-Riordan array $(d|\, g,f)$ is presented so that $J$ is totally positive implies the total positivity of both the almost-Riordan array $(d|\, g,f)$ and the Riordan array $(g,f)$. We also present a counterexample to illustrate that this sufficient condition is not necessary. If the production matrix $J$ is tridiagonal, then the expressions of its principal minors are given. By using expressions, we find a sufficient and necessary condition of the total positivity of almost-Riordan arrays with tridiagonal production matrices. A numerous examples are given to demonstrate our results.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Grokking Modular Polynomials
Authors:
Darshil Doshi,
Tianyu He,
Aritra Das,
Andrey Gromov
Abstract:
Neural networks readily learn a subset of the modular arithmetic tasks, while failing to generalize on the rest. This limitation remains unmoved by the choice of architecture and training strategies. On the other hand, an analytical solution for the weights of Multi-layer Perceptron (MLP) networks that generalize on the modular addition task is known in the literature. In this work, we (i) extend…
▽ More
Neural networks readily learn a subset of the modular arithmetic tasks, while failing to generalize on the rest. This limitation remains unmoved by the choice of architecture and training strategies. On the other hand, an analytical solution for the weights of Multi-layer Perceptron (MLP) networks that generalize on the modular addition task is known in the literature. In this work, we (i) extend the class of analytical solutions to include modular multiplication as well as modular addition with many terms. Additionally, we show that real networks trained on these datasets learn similar solutions upon generalization (grokking). (ii) We combine these "expert" solutions to construct networks that generalize on arbitrary modular polynomials. (iii) We hypothesize a classification of modular polynomials into learnable and non-learnable via neural networks training; and provide experimental evidence supporting our claims.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
A bijection related to Bressoud's conjecture
Authors:
Y. H. Chen,
Thomas Y. He
Abstract:
Bressoud introduced the partition function $B(α_1,\ldots,α_λ;η,k,r;n)$, which counts the number of partitions with certain difference conditions. Bressoud posed a conjecture on the generating function for the partition function $B(α_1,\ldots,α_λ;η,k,r;n)$ in multi-summation form. In this article, we introduce a bijection related to Bressoud's conjecture. As an application, we give a new companion…
▽ More
Bressoud introduced the partition function $B(α_1,\ldots,α_λ;η,k,r;n)$, which counts the number of partitions with certain difference conditions. Bressoud posed a conjecture on the generating function for the partition function $B(α_1,\ldots,α_λ;η,k,r;n)$ in multi-summation form. In this article, we introduce a bijection related to Bressoud's conjecture. As an application, we give a new companion to the Göllnitz-Gordon identities.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
The Legendre Transform of Convex Lattice Sets
Authors:
Tingting He,
Lin Si
Abstract:
The goal of this paper is to study convex lattice sets by the discrete Legendre transform. The definition of the polar of convex lattice sets in $\mathbb{Z}^n$ is provided. It is worth mentioning that the polar of convex lattice sets have the self-dual property similar to that of convex bodies. Some properties of convex lattice sets are established, for instance, the inclusion relation, the union…
▽ More
The goal of this paper is to study convex lattice sets by the discrete Legendre transform. The definition of the polar of convex lattice sets in $\mathbb{Z}^n$ is provided. It is worth mentioning that the polar of convex lattice sets have the self-dual property similar to that of convex bodies. Some properties of convex lattice sets are established, for instance, the inclusion relation, the union and intersection on the polar of convex lattice sets. In addition, we discuss the relationship between the cross-polytope and the discrete Mahler product. It states that a convex lattice set is the cross-polytope if and only if its discrete Mahler product is the smallest.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
On embeddability of Coxeter groups into the Riordan group
Authors:
Tian-Xiao He,
Nikolai A. Krylov
Abstract:
We prove that a Coxeter group containing an element of finite order, which is generated by two non-commuting involutions, can not be embedded into the Riordan group.
We prove that a Coxeter group containing an element of finite order, which is generated by two non-commuting involutions, can not be embedded into the Riordan group.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Purity for Perfectoidness
Authors:
Tongmu He
Abstract:
There has been a long-standing question about whether being perfectoid for an algebra is local in the analytic topology. We provide affirmative answers for the algebras (e.g., over $\overline{\mathbb{Z}_p}$) whose spectra are inverse limits of semi-stable affine schemes. In fact, we established a valuative criterion for such an algebra being perfectoid, saying that it suffices to check the perfect…
▽ More
There has been a long-standing question about whether being perfectoid for an algebra is local in the analytic topology. We provide affirmative answers for the algebras (e.g., over $\overline{\mathbb{Z}_p}$) whose spectra are inverse limits of semi-stable affine schemes. In fact, we established a valuative criterion for such an algebra being perfectoid, saying that it suffices to check the perfectoidness of the stalks of the associated Riemann-Zariski space. Combining with Gabber-Ramero's computation of differentials of valuation rings, we obtain a differential criterion for perfectoidness. We also establish a purity result for perfectoidness when the limit preserves generic points of the special fibres.
As an application to limits of smooth $p$-adic varieties (on the generic point), assuming either the poly-stable modification conjecture or working only with curves, we prove that stalk-wise perfectoidness implies vanishing of the higher completed étale cohomology groups of the smooth varieties, which is inspired by Scholze's vanishing for Shimura varieties. Moreover, we give an explicit description of the completed étale cohomology group in the top degree in terms of the colimit of Zariski cohomology groups of the structural sheaves.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
An integer programming approach for quick-commerce assortment planning
Authors:
Ya**g Chen,
Taotao He,
Ying Rong,
Yunlong Wang
Abstract:
In this paper, we explore the challenge of assortment planning in the context of quick-commerce, a rapidly-growing business model that aims to deliver time-sensitive products. In order to achieve quick delivery to satisfy the immediate demands of online customers in close proximity, personalized online assortments need to be included in brick-and-mortar store offerings. With the presence of this p…
▽ More
In this paper, we explore the challenge of assortment planning in the context of quick-commerce, a rapidly-growing business model that aims to deliver time-sensitive products. In order to achieve quick delivery to satisfy the immediate demands of online customers in close proximity, personalized online assortments need to be included in brick-and-mortar store offerings. With the presence of this physical linkage requirement and distinct multinomial logit (MNL) choice models for online consumer segments, the firm seeks to maximize overall revenue by selecting an optimal assortment of products for local stores and by tailoring a personalized assortment for each online consumer segment. We refer to this problem as quick-commerce assortment planning (QAP). We employ an integer programming approach to solve this NP-hard problem to global optimality. Specifically, we propose convexification techniques to handle its combinatorial and nonconvex nature. We capture the consumer choice of each online segment using a convex hull representation. By exploiting the geometry behind Luce's choice axiom, we provide a compact polyhedral characterization of the convex hull under various operational constraints that are not totally-unimodular. Furthermore, we conduct a polyhedral study on the relation between assortment decisions for products to offer and choice probabilities of products under the MNL model.Our methodology, coupled with a modified choice probability ordered separation algorithm, yields formulations that provide a significant computational advantage over existing methods. Through comprehensive numerical studies, we emphasize the significance of aligning offline and online assortment decisions and underscore the perils associated with inaccurately specifying customer behavior models.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Angular Momentum Memory Effect
Authors:
Xinliang An,
Taoran He,
Dawei Shen
Abstract:
Utilizing recent mathematical advances in proving stability of Minkowski spacetime with minimal decay rates and nonlinear stability of Kerr black holes with small angular momentum, we investigate the detailed asymptotic behaviors of gravitational waves generated in these spacetimes. Here we report and propose a new angular momentum memory effect along future null infinity. This accompanies Christo…
▽ More
Utilizing recent mathematical advances in proving stability of Minkowski spacetime with minimal decay rates and nonlinear stability of Kerr black holes with small angular momentum, we investigate the detailed asymptotic behaviors of gravitational waves generated in these spacetimes. Here we report and propose a new angular momentum memory effect along future null infinity. This accompanies Christodoulou's nonlinear displacement memory effect and the spin memory effect. The connections and differences to these effects are also addressed.
△ Less
Submitted 10 April, 2024; v1 submitted 17 March, 2024;
originally announced March 2024.
-
Energy-efficient Decentralized Learning via Graph Sparsification
Authors:
Xusheng Zhang,
Cho-Chun Chiu,
Ting He
Abstract:
This work aims at improving the energy efficiency of decentralized learning by optimizing the mixing matrix, which controls the communication demands during the learning process. Through rigorous analysis based on a state-of-the-art decentralized learning algorithm, the problem is formulated as a bi-level optimization, with the lower level solved by graph sparsification. A solution with guaranteed…
▽ More
This work aims at improving the energy efficiency of decentralized learning by optimizing the mixing matrix, which controls the communication demands during the learning process. Through rigorous analysis based on a state-of-the-art decentralized learning algorithm, the problem is formulated as a bi-level optimization, with the lower level solved by graph sparsification. A solution with guaranteed performance is proposed for the special case of fully-connected base topology and a greedy heuristic is proposed for the general case. Simulations based on real topology and dataset show that the proposed solution can lower the energy consumption at the busiest node by 54%-76% while maintaining the quality of the trained model.
△ Less
Submitted 22 May, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
The $m$th-order Eulerian Numbers
Authors:
Tian-Xiao He
Abstract:
We define the $m$th-order Eulerian numbers with a combinatorial interpretation. The recurrence relation of the $m$th-order Eulerian numbers, the row generating function and the row sums of the $m$th-order Eulerian triangle are presented. We also define the $m$th-order Eulerian fraction and its alternative form. Some properties of the $m$th-order Eulerian fractions are represented by using differen…
▽ More
We define the $m$th-order Eulerian numbers with a combinatorial interpretation. The recurrence relation of the $m$th-order Eulerian numbers, the row generating function and the row sums of the $m$th-order Eulerian triangle are presented. We also define the $m$th-order Eulerian fraction and its alternative form. Some properties of the $m$th-order Eulerian fractions are represented by using differentiation and integration. An inversion relationship between second-order Eulerian numbers and Stirling numbers of the second kind is given. Finally, we give the exact expression of the values of the $m$th-order Eulerian numbers.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Dynamics of Apparent Horizon and a Null Comparison Principle
Authors:
Xinliang An,
Taoran He
Abstract:
This paper investigates the global dynamics of the apparent horizon. We present an approach to establish its existence and its long-term behaviors. Our apparent horizon is constructed by solving the marginally outer trapped surface (MOTS) along each incoming null hypersurface. Based on the nonlinear hyperbolic estimates established in [24] by Klainerman-Szeftel under polarized axial symmetry, we p…
▽ More
This paper investigates the global dynamics of the apparent horizon. We present an approach to establish its existence and its long-term behaviors. Our apparent horizon is constructed by solving the marginally outer trapped surface (MOTS) along each incoming null hypersurface. Based on the nonlinear hyperbolic estimates established in [24] by Klainerman-Szeftel under polarized axial symmetry, we prove that the corresponding apparent horizon is smooth, asymptotically null and converging to the event horizon eventually. To further address the local achronality of the apparent horizon, a new concept, called the null comparison principle, is introduced in this paper. For three typical scenarios of gravitational collapse, our null comparison principle is tested and verified, which guarantees that the apparent horizon must be piecewise spacelike or piecewise null. In addition, we also validate and provide new proofs for several physical laws along the apparent horizon.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Beyond the Holographic Entropy Cone via Cycle Flows
Authors:
Temple He,
Sergio Hernández-Cuenca,
Cynthia Keeler
Abstract:
Motivated by bit threads, we introduce a new prescription for computing entropy vectors outside the holographic entropy cone. By utilizing cycle flows on directed graphs, we show that the maximum cycle flow associated to any subset of vertices, which corresponds to a subsystem, manifestly obeys purification symmetry. Furthermore, we prove that the maximum cycle flow obeys both subadditivity and st…
▽ More
Motivated by bit threads, we introduce a new prescription for computing entropy vectors outside the holographic entropy cone. By utilizing cycle flows on directed graphs, we show that the maximum cycle flow associated to any subset of vertices, which corresponds to a subsystem, manifestly obeys purification symmetry. Furthermore, we prove that the maximum cycle flow obeys both subadditivity and strong subadditivity, thereby establishing it as a viable candidate for the entropy associated to the subsystem. Finally, we demonstrate how our model generalizes the entropy vectors obtainable via conventional flows in undirected graphs and hypergraphs.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
An overpartition analogue of Bressoud conjecture for even moduli
Authors:
Y. H. Chen,
T. T. Gu,
Thomas Y. He,
F. Tang,
J. J. Wei
Abstract:
In 1980, Bressoud conjectured a combinatorial identity $A_j=B_j$ for $j=0$ or $1$. In this paper, we introduce a new partition function $\overline{B}_0$ which can be viewed as an overpartition analogue of the partition function $B_0$. An overpartition is a partition such that the last occurrence of a part can be overlined. We build a bijection to get a relationship between $\overline{B}_0$ and…
▽ More
In 1980, Bressoud conjectured a combinatorial identity $A_j=B_j$ for $j=0$ or $1$. In this paper, we introduce a new partition function $\overline{B}_0$ which can be viewed as an overpartition analogue of the partition function $B_0$. An overpartition is a partition such that the last occurrence of a part can be overlined. We build a bijection to get a relationship between $\overline{B}_0$ and $B_1$, based on which an overpartition analogue of Bressoud's conjecture for $j=0$ is obtained.
△ Less
Submitted 1 December, 2023;
originally announced December 2023.
-
Convexification Techniques for Fractional Programs
Authors:
Taotao He,
Siyue Liu,
Mohit Tawarmalani
Abstract:
This paper develops a correspondence relating convex hulls of fractional functions with those of polynomial functions over the same domain. Using this result, we develop a number of new reformulations and relaxations for fractional programming problems. First, we relate 0-1 problems involving a ratio of affine functions with the boolean quadric polytope, and use inequalities for the latter to deve…
▽ More
This paper develops a correspondence relating convex hulls of fractional functions with those of polynomial functions over the same domain. Using this result, we develop a number of new reformulations and relaxations for fractional programming problems. First, we relate 0-1 problems involving a ratio of affine functions with the boolean quadric polytope, and use inequalities for the latter to develop tighter formulations for the former. Second, we derive a new formulation to optimize a ratio of quadratic functions over a polytope using copositive programming. Third, we show that univariate fractional functions can be convexified using moment hulls. Fourth, we develop a new hierarchy of relaxations that converges finitely to the simultaneous convex hull of a collection of ratios of affine functions of 0-1 variables. Finally, we demonstrate theoretically and computationally that our techniques close a significant gap relative to state-of-the-art relaxations, require much less computational effort, and can solve larger problem instances.
△ Less
Submitted 15 June, 2024; v1 submitted 12 October, 2023;
originally announced October 2023.
-
MIP Relaxations in Factorable Programming
Authors:
Taotao He,
Mohit Tawarmalani
Abstract:
In this paper, we develop new discrete relaxations for nonlinear expressions in factorable programming. We utilize specialized convexification results as well as composite relaxations to develop mixed-integer programming (MIP) relaxations. Our relaxations rely on ideal formulations of convex hulls of outer-functions over a combinatorial structure that captures local inner-function structure. The r…
▽ More
In this paper, we develop new discrete relaxations for nonlinear expressions in factorable programming. We utilize specialized convexification results as well as composite relaxations to develop mixed-integer programming (MIP) relaxations. Our relaxations rely on ideal formulations of convex hulls of outer-functions over a combinatorial structure that captures local inner-function structure. The resulting relaxations often require fewer variables and are tighter than currently prevalent ones. Finally, we provide computational evidence to demonstrate that our relaxations close approximately 60-70% of the gap relative to McCormick relaxations and significantly improves the relaxations used in a state-of-the-art solver on various instances involving polynomial functions.
△ Less
Submitted 15 June, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Nonlinear asymptotic stability and transition threshold for 2D Taylor-Couette flows in Sobolev spaces
Authors:
Xinliang An,
Taoran He,
Te Li
Abstract:
In this paper, we investigate the stability of the 2-dimensional (2D) Taylor-Couette (TC) flow for the incompressible Navier-Stokes equations. The explicit form of velocity for 2D TC flow is given by $u=(Ar+\frac{B}{r})(-\sin θ, \cos θ)^T$ with $(r, θ)\in [1, R]\times \mathbb{S}^1$ being an annulus and $A, B$ being constants. Here, $A, B$ encode the rotational effect and $R$ is the ratio of the ou…
▽ More
In this paper, we investigate the stability of the 2-dimensional (2D) Taylor-Couette (TC) flow for the incompressible Navier-Stokes equations. The explicit form of velocity for 2D TC flow is given by $u=(Ar+\frac{B}{r})(-\sin θ, \cos θ)^T$ with $(r, θ)\in [1, R]\times \mathbb{S}^1$ being an annulus and $A, B$ being constants. Here, $A, B$ encode the rotational effect and $R$ is the ratio of the outer and inner radii of the annular region. Our focus is the long-term behavior of solutions around the steady 2D TC flow. While the laminar solution is known to be a global attractor for 2D channel flows and plane flows, it is unclear whether this is still true for rotating flows with curved geometries. In this article, we prove that the 2D Taylor-Couette flow is asymptotically stable, even at high Reynolds number ($Re\sim ν^{-1}$), with a sharp exponential decay rate of $\exp(-ν^{\frac13}|B|^{\frac23}R^{-2}t)$ as long as the initial perturbation is less than or equal to $ν^\frac12 |B|^{\frac12}R^{-2}$ in Sobolev space. The powers of $ν$ and $B$ in this decay estimate are optimal. It is derived using the method of resolvent estimates and is commonly recognized as the enhanced dissipative effect. Compared to the Couette flow, the enhanced dissipation of the rotating Taylor-Couette flow not only depends on the Reynolds number but also reflects the rotational aspect via the rotational coefficient $B$. The larger the $|B|$, the faster the long-time dissipation takes effect. We also conduct space-time estimates describing inviscid-dam** mechanism in our proof. To obtain these inviscid-dam** estimates, we find and construct a new set of explicit orthonormal basis of the weighted eigenfunctions for the Laplace operators corresponding to the circular flows. These provide new insights into the mathematical understanding of the 2D Taylor-Couette flows.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
Some Separable integer partition classes
Authors:
Y. H. Chen,
Thomas Y. He,
F. Tang,
J. J. Wei
Abstract:
Recently, Andrews introduced separable integer partition classes and analyzed some well-known theorems. In this paper, we investigate partitions with parts separated by parity introduced by Andrews with the aid of separable integer partition classes with modulus $2$. We also extend separable integer partition classes with modulus $1$ to overpartitions, called separable overpartition classes. We st…
▽ More
Recently, Andrews introduced separable integer partition classes and analyzed some well-known theorems. In this paper, we investigate partitions with parts separated by parity introduced by Andrews with the aid of separable integer partition classes with modulus $2$. We also extend separable integer partition classes with modulus $1$ to overpartitions, called separable overpartition classes. We study overpartitions and the overpartition analogue of Rogers-Ramanujan identities, which are separable overpartition classes.
△ Less
Submitted 27 October, 2023; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Divisibility of the Sums of the Power of Consecutive Integers
Authors:
Tian-Xiao He,
Peter J. -S. Shiue
Abstract:
We study the divisibility of the sums of the odd power of consecutive integers, $S(m,k)=1^{mk}+2^{mk}+\cdots+k^{mk}$ and $1^k+2^k+\cdots+n^k$ for odd integers $m$ and $k$, by using the Girard-Waring identity. Faulhaber's approach for the divisibilities is discussed. Some expressions of power sums in terms of Stirling numbers of the second kind are represented.
We study the divisibility of the sums of the odd power of consecutive integers, $S(m,k)=1^{mk}+2^{mk}+\cdots+k^{mk}$ and $1^k+2^k+\cdots+n^k$ for odd integers $m$ and $k$, by using the Girard-Waring identity. Faulhaber's approach for the divisibilities is discussed. Some expressions of power sums in terms of Stirling numbers of the second kind are represented.
△ Less
Submitted 15 April, 2023;
originally announced April 2023.
-
Almost Coherence of Higher Direct Images
Authors:
Tongmu He
Abstract:
For a flat proper morphism of finite presentation between schemes with almost coherent structural sheaves (in the sense of Faltings), we prove that the higher direct images of quasi-coherent and almost coherent modules are quasi-coherent and almost coherent. Our proof uses Noetherian approximation, inspired by Kiehl's proof of the pseudo-coherence of higher direct images. Our result allows us to e…
▽ More
For a flat proper morphism of finite presentation between schemes with almost coherent structural sheaves (in the sense of Faltings), we prove that the higher direct images of quasi-coherent and almost coherent modules are quasi-coherent and almost coherent. Our proof uses Noetherian approximation, inspired by Kiehl's proof of the pseudo-coherence of higher direct images. Our result allows us to extend Abbes-Gros' proof of Faltings' main $p$-adic comparison theorem in the relative case for projective log-smooth morphisms of schemes to proper ones, and thus also their construction of the relative Hodge-Tate spectral sequence.
△ Less
Submitted 4 December, 2022;
originally announced December 2022.
-
Sen Operators and Lie Algebras arising from Galois Representations over $p$-adic Varieties
Authors:
Tongmu He
Abstract:
Any finite-dimensional $p$-adic representation of the absolute Galois group of a $p$-adic local field with imperfect residue field is characterized by its arithmetic and geometric Sen operators defined by Sen and Brinon. We generalize their construction to the fundamental group of a $p$-adic affine variety with a semi-stable chart, and prove that the module of Sen operators is canonically defined,…
▽ More
Any finite-dimensional $p$-adic representation of the absolute Galois group of a $p$-adic local field with imperfect residue field is characterized by its arithmetic and geometric Sen operators defined by Sen and Brinon. We generalize their construction to the fundamental group of a $p$-adic affine variety with a semi-stable chart, and prove that the module of Sen operators is canonically defined, independently of the choice of the chart. Our construction relies on a descent theorem in the $p$-adic Simpson correspondence developed by Tsuji. When the representation comes from a $\mathbb{Q}_p$-representation of a $p$-adic analytic group quotient of the fundamental group, we describe its Lie algebra action in terms of the Sen operators, which is a generalization of a result of Sen and Ohkubo. These Sen operators can be extended continuously to certain infinite-dimensional representations. As an application, we prove that the geometric Sen operators annihilate locally analytic vectors, generalizing a result of Pan.
△ Less
Submitted 15 August, 2022;
originally announced August 2022.
-
Quantitative blow-up estimates for spacelike singularities in gravitational-collapse cosmological spacetimes
Authors:
Xinliang An,
Haoyang Chen,
Taoran He
Abstract:
Under spherical symmetry, with double-null coordinates $(u,v)$, we study the gravitational collapse of the Einstein--scalar field system with a positive cosmological constant. The spacetime singularities arise when area radius $r$ vanishes and they are spacelike. We derive new quantitative estimates, obtain polynomial blow-up rates $O(1/r^N)$ for various quantities, and extend the results in [5] b…
▽ More
Under spherical symmetry, with double-null coordinates $(u,v)$, we study the gravitational collapse of the Einstein--scalar field system with a positive cosmological constant. The spacetime singularities arise when area radius $r$ vanishes and they are spacelike. We derive new quantitative estimates, obtain polynomial blow-up rates $O(1/r^N)$ for various quantities, and extend the results in [5] by the first author and Zhang and the arguments in [3] by the first author and Gajic to the cosmological settings. In particular, we sharpen the estimates of $r\partial_u r$ and $r\partial_v r$ in [5] and prove that the spacelike singularities where $r(u,v)=0$ are $C^{1,1/3}$ in $(u,v)$ coordinates. As an application, these estimates also give quantitative blow-up upper bounds of fluid velocity and density for the hard-phase model of the Einstein-Euler system under irrotational assumption. Near the timelike infinity, we also generalize the theorems in [3] by linking the precise blow-up rates of the Kretschmann scalar to the exponential Price's law along the event horizon. In cosmological settings, this further reveals the mass-inflation phenomena along the spacelike singularities for the first time.
△ Less
Submitted 28 June, 2022;
originally announced June 2022.
-
The Vertical Recursive Relation of Riordan Arrays and Their Matrix Representation
Authors:
Tian-Xiao He
Abstract:
A vertical recursive relation approach to Riordan arrays is induced, while the horizontal recursive relation is represented by $A$- and $Z$-sequences. This vertical recursive approach gives a way to represent the entries of a Riordan array $(g,f)$ in terms of a recursive linear combinations of the coefficients of $g$. A matrix representation of the vertical recursive relation is also given. The se…
▽ More
A vertical recursive relation approach to Riordan arrays is induced, while the horizontal recursive relation is represented by $A$- and $Z$-sequences. This vertical recursive approach gives a way to represent the entries of a Riordan array $(g,f)$ in terms of a recursive linear combinations of the coefficients of $g$. A matrix representation of the vertical recursive relation is also given. The set of all those matrices forms a group, called the quasi-Riordan group. The extensions of the horizontal recursive relation and the vertical recursive relation in terms of $c$- and $C$- Riordan arrays are defined with illustrations by using the rook triangle and the Laguerre triangle. Those extensions represent a way to study nonlinear recursive relations of the entries of some triangular matrices from linear recursive relations of the entries of Riordan arrays. In addition, the matrix representation of the vertical recursive relation of Riordan arrays provides transforms between lower order and high order finite Riordan arrays, where the $m$th order Riordan array is defined by $(g,f)_m=(d_{n,k})_{m\geq n,k\geq 0}$. Furthermore, the vertical relation approach to Riordan arrays provides a unified approach to construct identities.
△ Less
Submitted 4 December, 2022; v1 submitted 26 June, 2022;
originally announced June 2022.
-
Extracting structure from functional expressions for continuous and discrete relaxations of MINLP
Authors:
Taotao He,
Mohit Tawarmalani
Abstract:
In this paper, we develop new continuous and discrete relaxations for nonlinear expressions in an MINLP. In contrast to factorable programming, our techniques utilize the inner-function structure by encapsulating it in a polyhedral set, using a technique first proposed in [12]. We tighten the relaxations derived in [33,13] and obtain new relaxations for functions that could not be treated using pr…
▽ More
In this paper, we develop new continuous and discrete relaxations for nonlinear expressions in an MINLP. In contrast to factorable programming, our techniques utilize the inner-function structure by encapsulating it in a polyhedral set, using a technique first proposed in [12]. We tighten the relaxations derived in [33,13] and obtain new relaxations for functions that could not be treated using prior techniques. We develop new discretization-based mixed-integer programming relaxations that yield tighter relaxations than similar relaxations in the literature. These relaxations utilize the simplotope that captures inner-function structure to generalize the incremental formulation of [8] to multivariate functions. In particular, when the outer-function is supermodular, our formulations require exponentially fewer continuous variables than any previously known formulation.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
An Approach for Fast Cascading Failure Simulation in Dynamic Models of Power Systems
Authors:
Sina Gharebaghi,
Nilanjan Ray Chaudhuri,
Ting He,
Thomas La Porta
Abstract:
The ground truth for cascading failure in power system can only be obtained through a detailed dynamic model involving nonlinear differential and algebraic equations whose solution process is computationally expensive. This has prohibited adoption of such models for cascading failure simulation. To solve this, we propose a fast cascading failure simulation approach based on implicit Backward Euler…
▽ More
The ground truth for cascading failure in power system can only be obtained through a detailed dynamic model involving nonlinear differential and algebraic equations whose solution process is computationally expensive. This has prohibited adoption of such models for cascading failure simulation. To solve this, we propose a fast cascading failure simulation approach based on implicit Backward Euler method (BEM) with stiff decay property. Unfortunately, BEM suffers from hyperstability issue in case of oscillatory instability and converges to the unstable equilibrium. We propose a predictor-corrector approach to fully address the hyperstability issue in BEM. The predictor identifies oscillatory instability based on eigendecomposition of the system matrix at the post-disturbance unstable equilibrium obtained as a byproduct of BEM. The corrector uses right eigenvectors to identify the group of machines participating in the unstable mode. This helps in applying appropriate protection schemes as in ground truth. We use Trapezoidal method (TM)-based simulation as the benchmark to validate the results of the proposed approach on the IEEE 118-bus network, 2,383-bus Polish grid, and IEEE 68-bus system. The proposed approach is able to track the cascade path and replicate the end results of TM-based simulation with very high accuracy while reducing the average simulation time by approximately 10-35 fold. The proposed approach was also compared with the partitioned method, which led to similar conclusions.
△ Less
Submitted 29 April, 2022;
originally announced May 2022.
-
New companions to the generations of the Göllnitz-Gordon identities
Authors:
Thomas Y. He,
Alice X. H. Zhao
Abstract:
The Göllnitz-Gordon identities were found by Göllnitz and Gordon independently. In 1967, Andrews obtained a combinatorial generalization of the Göllnitz-Gordon identities, called the Andrews-Göllnitz-Gordon theorem. In 1980, Bressoud extended the Andrews-Göllnitz-Gordon theorem to even moduli, called the Bressoud-Göllnitz-Gordon theorem. Furthermore, Bressoud gave the generating functions for the…
▽ More
The Göllnitz-Gordon identities were found by Göllnitz and Gordon independently. In 1967, Andrews obtained a combinatorial generalization of the Göllnitz-Gordon identities, called the Andrews-Göllnitz-Gordon theorem. In 1980, Bressoud extended the Andrews-Göllnitz-Gordon theorem to even moduli, called the Bressoud-Göllnitz-Gordon theorem. Furthermore, Bressoud gave the generating functions for the generalizations of the Göllnitz-Gordon identities. In this article, we will give new companions to the generalizations of the Göllnitz-Gordon identities.
△ Less
Submitted 7 March, 2022;
originally announced March 2022.
-
Enhanced dissipation and nonlinear asymptotic stability of the Taylor-Couette flow for the 2D Navier-Stokes equations
Authors:
Xinliang An,
Taoran He,
Te Li
Abstract:
In this paper, we study the nonlinear stability of a steady circular flow created between two rotating concentric cylinders. The dynamics of the viscous fluid are described by 2D Navier-Stokes equations. We adopt scaling variables. For the rescaled equations, we prove that the steady flow (Taylor-Couette flow) is asymptotically stable up to a large perturbation of initial data. Back to the origina…
▽ More
In this paper, we study the nonlinear stability of a steady circular flow created between two rotating concentric cylinders. The dynamics of the viscous fluid are described by 2D Navier-Stokes equations. We adopt scaling variables. For the rescaled equations, we prove that the steady flow (Taylor-Couette flow) is asymptotically stable up to a large perturbation of initial data. Back to the original 2D Navier-Stokes equations, this implies an improved transition threshold for the Taylor-Couette flow. The improvement is due to enhanced dissipation and new observations and constructions of weighted $L^2$ norms, which capture a hidden structure between the viscosity constant $ν$ and (different) rotating speeds and locations of two coaxial cylinders. In particular, we allow the location of the outer cylinder to tend to infinity, which renders the initial fluid kinetic energy not uniformly bounded. Due to enhanced-dissipation effect, we also establish a sharp resolvent estimate, desired space-time bounds and optimal decaying estimates, which lead to the proof of nonlinear asymptotic stability of 2D Taylor-Couette flow.
△ Less
Submitted 31 December, 2021;
originally announced December 2021.
-
On the Solutions of Three Variable Frobenius Related Problems Using Order Reduction Approach
Authors:
Tian-Xiao He,
Peter J. -S. Shiue,
Rama Venkat
Abstract:
This paper presents a new approach to determine the number of solutions of three variable Frobenius related problems and to find their solutions by using order reducing methods. Here, the order of a Frobenius related problem means the number of variables appearing in the problem. We present two types of order reduction methods that can be applied to the problem of finding all nonnegative solutions…
▽ More
This paper presents a new approach to determine the number of solutions of three variable Frobenius related problems and to find their solutions by using order reducing methods. Here, the order of a Frobenius related problem means the number of variables appearing in the problem. We present two types of order reduction methods that can be applied to the problem of finding all nonnegative solutions of three variable Frobenius related problems. The first method is used to reduce the equation of order three from a three variable Frobenius related problem to be a system of equations with two fixed variables. The second method reduces the equation of order three into three equations of order two, for which an algorithm is designed with an interesting open problem on solutions left as a conjecture.
△ Less
Submitted 21 June, 2021;
originally announced June 2021.
-
Centralizers of the Riordan Group
Authors:
Tian-Xiao He,
Yuanziyi Zhang
Abstract:
In this paper, we discuss centralizers in the Riordan group. We will see that Faà di Bruno's formula is an application of the Fundamental Theorem of Riordan arrays. Then the composition group of formal power series in ${\cal F}_1$ is studied to construct the centralizers of Bell type and Lagrange type Riordan arrays. Our tools are the $A$-sequences of Riordan arrays and Faà di Bruno's formula. Som…
▽ More
In this paper, we discuss centralizers in the Riordan group. We will see that Faà di Bruno's formula is an application of the Fundamental Theorem of Riordan arrays. Then the composition group of formal power series in ${\cal F}_1$ is studied to construct the centralizers of Bell type and Lagrange type Riordan arrays. Our tools are the $A$-sequences of Riordan arrays and Faà di Bruno's formula. Some combinatorial explanation and discussion about related algebraic topics are also given.
△ Less
Submitted 15 May, 2021;
originally announced May 2021.
-
Cohomological Descent for Faltings' $p$-adic Hodge Theory and Applications
Authors:
Tongmu He
Abstract:
Faltings' approach in $p$-adic Hodge theory can be schematically divided into two main steps: firstly, a local reduction of the computation of the $p$-adic étale cohomology of a smooth variety over a $p$-adic local field to a Galois cohomology computation and then, the establishment of a link between the latter and differential forms. These relations are organized through Faltings ringed topos who…
▽ More
Faltings' approach in $p$-adic Hodge theory can be schematically divided into two main steps: firstly, a local reduction of the computation of the $p$-adic étale cohomology of a smooth variety over a $p$-adic local field to a Galois cohomology computation and then, the establishment of a link between the latter and differential forms. These relations are organized through Faltings ringed topos whose definition relies on the choice of an integral model of the variety, and whose good properties depend on the (logarithmic) smoothness of this model. Scholze's generalization for rigid analytic varieties has the advantage of depending only on the variety (i.e. the generic fibre). Inspired by Deligne's approach to classical Hodge theory for singular varieties, we establish a cohomological descent result for the structural sheaf of Faltings topos, which makes it possible to extend Faltings' approach to any integral model, i.e. without any smoothness assumption. An essential ingredient of our proof is a descent result of perfectoid algebras in the arc-topology due to Bhatt and Scholze. As an application of our cohomological descent, using a variant of de Jong's alteration theorem for morphisms of schemes due to Gabber-Illusie-Temkin, we generalize Faltings' main $p$-adic comparison theorem to any proper and finitely presented morphism of coherent schemes over an absolute integral closure of $\mathbb{Z}_p$ (without any assumption of smoothness) for torsion étale sheaves (not necessarily finite locally constant). As a second application, we prove a local version of the relative Hodge-Tate filtration as a consequence of the global version constructed by Abbes-Gros.
△ Less
Submitted 19 January, 2022; v1 submitted 26 April, 2021;
originally announced April 2021.
-
One-pth Riordan Arrays in the Construction of Identities
Authors:
Tian-Xiao He
Abstract:
For an integer $p\geq 2$ we construct vertical and horizontal one-pth Riordan arrays from a Riordan array. When $p=2$, one-pth Riordan arrays reduced to well known half Riordan arrays. The generating functions of the $A$-sequences of vertical and horizontal one-pth Riordan arrays are found. The vertical and horizontal one-pth Riordan arrays provide an approach to construct many identities. They ca…
▽ More
For an integer $p\geq 2$ we construct vertical and horizontal one-pth Riordan arrays from a Riordan array. When $p=2$, one-pth Riordan arrays reduced to well known half Riordan arrays. The generating functions of the $A$-sequences of vertical and horizontal one-pth Riordan arrays are found. The vertical and horizontal one-pth Riordan arrays provide an approach to construct many identities. They can also be used to verify some well known identities readily.
△ Less
Submitted 17 January, 2021; v1 submitted 30 October, 2020;
originally announced November 2020.
-
Power Grid State Estimation under General Cyber-Physical Attacks
Authors:
Yudi Huang,
Ting He,
Nilanjan Ray Chaudhuri,
Thomas La Porta
Abstract:
Effective defense against cyber-physical attacks in power grid requires the capability of accurate damage assessment within the attacked area. While some solutions have been proposed to recover the phase angles and the link status (i.e., breaker status) within the attacked area, existing solutions made the limiting assumption that the grid stays connected after the attack. To fill this gap, we stu…
▽ More
Effective defense against cyber-physical attacks in power grid requires the capability of accurate damage assessment within the attacked area. While some solutions have been proposed to recover the phase angles and the link status (i.e., breaker status) within the attacked area, existing solutions made the limiting assumption that the grid stays connected after the attack. To fill this gap, we study the problem of recovering the phase angles and the link status under a general cyber-physical attack that may partition the grid into islands. To this end, we (i) show that the existing solutions and recovery conditions still hold if the post-attack power injections in the attacked area are known, and (ii) propose a linear programming-based algorithm that can perfectly recover the link status under certain conditions even if the post-attack power injections are unknown. Our numerical evaluations based on the Polish power grid demonstrate that the proposed algorithm is highly accurate in localizing failed links once the phase angles are known.
△ Less
Submitted 4 February, 2021; v1 submitted 4 September, 2020;
originally announced September 2020.
-
Faltings extension and Hodge-Tate filtration for abelian varieties over $p$-adic local fields with imperfect residue fields
Authors:
Tongmu He
Abstract:
Let $K$ be a complete discrete valuation field of characteristic $0$ with not necessarily perfect residue field of characteristic $p>0$. We define a Faltings extension of $\mathcal{O}_K$ over $\mathbb{Z}_p$, and we construct a Hodge-Tate filtration for abelian varieties over $K$ by generalizing Fontaine's construction in 1981, where he treated the perfect residue field case.
Let $K$ be a complete discrete valuation field of characteristic $0$ with not necessarily perfect residue field of characteristic $p>0$. We define a Faltings extension of $\mathcal{O}_K$ over $\mathbb{Z}_p$, and we construct a Hodge-Tate filtration for abelian varieties over $K$ by generalizing Fontaine's construction in 1981, where he treated the perfect residue field case.
△ Less
Submitted 24 June, 2020; v1 submitted 21 March, 2020;
originally announced March 2020.
-
Superbalance of Holographic Entropy Inequalities
Authors:
Temple He,
Veronika E. Hubeny,
Mukund Rangamani
Abstract:
The domain of allowed von Neumann entropies of a holographic field theory carves out a polyhedral cone -- the holographic entropy cone -- in entropy space. Such polyhedral cones are characterized by their extreme rays. For an arbitrary number of parties, it is known that the so-called perfect tensors are extreme rays. In this work, we constrain the form of the remaining extreme rays by showing tha…
▽ More
The domain of allowed von Neumann entropies of a holographic field theory carves out a polyhedral cone -- the holographic entropy cone -- in entropy space. Such polyhedral cones are characterized by their extreme rays. For an arbitrary number of parties, it is known that the so-called perfect tensors are extreme rays. In this work, we constrain the form of the remaining extreme rays by showing that they correspond to geometries with vanishing mutual information between any two parties, ensuring the absence of Bell pair type entanglement between them. This is tantamount to proving that besides subadditivity, all non-redundant holographic entropy inequalities are superbalanced, i.e. not only do UV divergences cancel in the inequality itself (assuming smooth entangling surfaces), but also in the purification thereof.
△ Less
Submitted 7 July, 2020; v1 submitted 11 February, 2020;
originally announced February 2020.
-
Overpartitions and Bressoud's conjecture, II
Authors:
Thomas Y. He,
Kathy Q. Ji,
Alice X. H. Zhao
Abstract:
The main objective of this paper is to present an answer to Bressoud's conjecture for the case $j=0$, resulting in a complete solution to the conjecture. The case for $j=1$ has been recently resolved by Kim. Using the connection established in our previous paper between the ordinary partition function $B_0$ and the overpartition function $\overline{B}_1$, we found that the proof of Bressoud's conj…
▽ More
The main objective of this paper is to present an answer to Bressoud's conjecture for the case $j=0$, resulting in a complete solution to the conjecture. The case for $j=1$ has been recently resolved by Kim. Using the connection established in our previous paper between the ordinary partition function $B_0$ and the overpartition function $\overline{B}_1$, we found that the proof of Bressoud's conjecture for the case $j=0$ is equivalent to establishing an overpartition analogue of the conjecture for $j=1$. By generalizing Kim's method, we obtain the desired overpartition analogue of Bressoud's conjecture for $j=1$, which eventually enables us to confirm Bressoud's conjecture for the case $j=0$.
△ Less
Submitted 21 February, 2024; v1 submitted 1 January, 2020;
originally announced January 2020.
-
Overpartitions and Bressoud's conjecture, I
Authors:
Thomas Y. He,
Kathy Q. Ji,
Alice X. H. Zhao
Abstract:
In 1980, Bressoud conjectured a combinatorial identity $A_j=B_j$ for $j=0$ or $1$, where the function $A_j$ counts the number of partitions with certain congruence conditions and the function $B_j$ counts the number of partitions with certain difference conditions. Bressoud's conjecture specializes to a wide variety of well-known theorems in the theory of partitions. Special cases of his conjectur…
▽ More
In 1980, Bressoud conjectured a combinatorial identity $A_j=B_j$ for $j=0$ or $1$, where the function $A_j$ counts the number of partitions with certain congruence conditions and the function $B_j$ counts the number of partitions with certain difference conditions. Bressoud's conjecture specializes to a wide variety of well-known theorems in the theory of partitions. Special cases of his conjecture have been subsequently proved by Bressoud, Andrews, Kim and Yee. Recently, Kim resolved Bressoud's conjecture for the case $j=1$. In this paper, we introduce a new partition function $\bar{B}_j$ which can be viewed as an overpartition analogue of the partition function $B_j$ introduced by Bressoud. By means of Gordon markings, we build bijections to obtain a relationship between $\bar{B}_1$ and $B_0$ and a relationship between $\bar{B}_0$ and $B_1$. Based on these former relationships, we further give overpartition analogues of many classical partition theorems including Euler's partition theorem, the Rogers-Ramanujan-Gordon identities, the Bressoud-Rogers-Ramanujan identities, the Andrews-Göllnitz-Gordon identities and the Bressoud-Göllnitz-Gordon identities.
△ Less
Submitted 8 May, 2022; v1 submitted 17 October, 2019;
originally announced October 2019.
-
$A$-sequences, $Z$-sequence, and $B$-sequences of Riordan Matrices
Authors:
Tian-Xiao He
Abstract:
We defined two type $B$-sequences of Riordan arrays and present the $A$-sequence characterization and $Z$-sequence characterization of the Riordan matrices with two type $B$-sequences. The subgroups characterized by $A$-sequences and $Z$-sequences are studied. The application of the sequence characterization to the RNA type matrices is discussed. Finally, we investigate the $A$-, $Z$-, and $B$-seq…
▽ More
We defined two type $B$-sequences of Riordan arrays and present the $A$-sequence characterization and $Z$-sequence characterization of the Riordan matrices with two type $B$-sequences. The subgroups characterized by $A$-sequences and $Z$-sequences are studied. The application of the sequence characterization to the RNA type matrices is discussed. Finally, we investigate the $A$-, $Z$-, and $B$-sequences of the Pascal like Riordan matrices.
△ Less
Submitted 5 September, 2019;
originally announced September 2019.
-
Service Placement with Provable Guarantees in Heterogeneous Edge Computing Systems
Authors:
Stephen Pasteris,
Shiqiang Wang,
Mark Herbster,
Ting He
Abstract:
Mobile edge computing (MEC) is a promising technique for providing low-latency access to services at the network edge. The services are hosted at various types of edge nodes with both computation and communication capabilities. Due to the heterogeneity of edge node characteristics and user locations, the performance of MEC varies depending on where the service is hosted. In this paper, we consider…
▽ More
Mobile edge computing (MEC) is a promising technique for providing low-latency access to services at the network edge. The services are hosted at various types of edge nodes with both computation and communication capabilities. Due to the heterogeneity of edge node characteristics and user locations, the performance of MEC varies depending on where the service is hosted. In this paper, we consider such a heterogeneous MEC system, and focus on the problem of placing multiple services in the system to maximize the total reward. We show that the problem is NP-hard via reduction from the set cover problem, and propose a deterministic approximation algorithm to solve the problem, which has an approximation ratio that is not worse than $\left(1-e^{-1}\right)/4$. The proposed algorithm is based on two sub-routines that are suitable for small and arbitrarily sized services, respectively. The algorithm is designed using a novel way of partitioning each edge node into multiple slots, where each slot contains one service. The approximation guarantee is obtained via a specialization of the method of conditional expectations, which uses a randomized procedure as an intermediate step. In addition to theoretical guarantees, simulation results also show that the proposed algorithm outperforms other state-of-the-art approaches.
△ Less
Submitted 17 June, 2019;
originally announced June 2019.
-
Why gradient clip** accelerates training: A theoretical justification for adaptivity
Authors:
**gzhao Zhang,
Tianxing He,
Suvrit Sra,
Ali Jadbabaie
Abstract:
We provide a theoretical explanation for the effectiveness of gradient clip** in training deep neural networks. The key ingredient is a new smoothness condition derived from practical neural network training examples. We observe that gradient smoothness, a concept central to the analysis of first-order optimization algorithms that is often assumed to be a constant, demonstrates significant varia…
▽ More
We provide a theoretical explanation for the effectiveness of gradient clip** in training deep neural networks. The key ingredient is a new smoothness condition derived from practical neural network training examples. We observe that gradient smoothness, a concept central to the analysis of first-order optimization algorithms that is often assumed to be a constant, demonstrates significant variability along the training trajectory of deep neural networks. Further, this smoothness positively correlates with the gradient norm, and contrary to standard assumptions in the literature, it can grow with the norm of the gradient. These empirical observations limit the applicability of existing theoretical analyses of algorithms that rely on a fixed bound on smoothness. These observations motivate us to introduce a novel relaxation of gradient smoothness that is weaker than the commonly used Lipschitz smoothness assumption. Under the new condition, we prove that two popular methods, namely, \emph{gradient clip**} and \emph{normalized gradient}, converge arbitrarily faster than gradient descent with fixed stepsize. We further explain why such adaptively scaled gradient methods can accelerate empirical convergence and verify our results empirically in popular neural network training settings.
△ Less
Submitted 10 February, 2020; v1 submitted 28 May, 2019;
originally announced May 2019.
-
Bit Threads and Holographic Monogamy
Authors:
Shawn X. Cui,
Patrick Hayden,
Temple He,
Matthew Headrick,
Bogdan Stoica,
Michael Walter
Abstract:
Bit threads provide an alternative description of holographic entanglement, replacing the Ryu-Takayanagi minimal surface with bulk curves connecting pairs of boundary points. We use bit threads to prove the monogamy of mutual information (MMI) property of holographic entanglement entropies. This is accomplished using the concept of a so-called multicommodity flow, adapted from the network setting,…
▽ More
Bit threads provide an alternative description of holographic entanglement, replacing the Ryu-Takayanagi minimal surface with bulk curves connecting pairs of boundary points. We use bit threads to prove the monogamy of mutual information (MMI) property of holographic entanglement entropies. This is accomplished using the concept of a so-called multicommodity flow, adapted from the network setting, and tools from the theory of convex optimization. Based on the bit thread picture, we conjecture a general ansatz for a holographic state, involving only bipartite and perfect-tensor type entanglement, for any decomposition of the boundary into four regions. We also give new proofs of analogous theorems on networks.
△ Less
Submitted 28 June, 2019; v1 submitted 15 August, 2018;
originally announced August 2018.
-
Adaptive Federated Learning in Resource Constrained Edge Computing Systems
Authors:
Shiqiang Wang,
Tiffany Tuor,
Theodoros Salonidis,
Kin K. Leung,
Christian Makaya,
Ting He,
Kevin Chan
Abstract:
Emerging technologies and applications including Internet of Things (IoT), social networking, and crowd-sourcing generate large amounts of data at the network edge. Machine learning models are often built from the collected data, to enable the detection, classification, and prediction of future events. Due to bandwidth, storage, and privacy concerns, it is often impractical to send all the data to…
▽ More
Emerging technologies and applications including Internet of Things (IoT), social networking, and crowd-sourcing generate large amounts of data at the network edge. Machine learning models are often built from the collected data, to enable the detection, classification, and prediction of future events. Due to bandwidth, storage, and privacy concerns, it is often impractical to send all the data to a centralized location. In this paper, we consider the problem of learning model parameters from data distributed across multiple edge nodes, without sending raw data to a centralized place. Our focus is on a generic class of machine learning models that are trained using gradient-descent based approaches. We analyze the convergence bound of distributed gradient descent from a theoretical point of view, based on which we propose a control algorithm that determines the best trade-off between local update and global parameter aggregation to minimize the loss function under a given resource budget. The performance of the proposed algorithm is evaluated via extensive experiments with real datasets, both on a networked prototype system and in a larger-scale simulated environment. The experimentation results show that our proposed approach performs near to the optimum with various machine learning models and different data distributions.
△ Less
Submitted 16 February, 2019; v1 submitted 14 April, 2018;
originally announced April 2018.
-
Topological Conjugacy of Non-hyperbolic Linear Flows
Authors:
Tongmu He
Abstract:
The topological equivalence classification for linear flows on $\mathbb{R}^n$ had been completely solved by Kuiper and independently Ladis in 1973. However, Ladis' proof was published in a Russian journal which isn't easily available, Kuiper's proof is more topological and a little bit subtle. Aiming at topological conjugacy classification, mainly based on the ideas of Kuiper, we introduce other t…
▽ More
The topological equivalence classification for linear flows on $\mathbb{R}^n$ had been completely solved by Kuiper and independently Ladis in 1973. However, Ladis' proof was published in a Russian journal which isn't easily available, Kuiper's proof is more topological and a little bit subtle. Aiming at topological conjugacy classification, mainly based on the ideas of Kuiper, we introduce other techniques and try to present an elementary and self-contained proof just using linear algebra and elementary topology.
△ Less
Submitted 13 March, 2017;
originally announced March 2017.
-
The Bressoud-Göllnitz-Gordon Theorem for Overpartitions of even moduli
Authors:
Thomas Y. He,
Allison Y. F. Wang,
Alice X. H. Zhao
Abstract:
We give an overpartition analogue of Bressoud's combinatorial generalization of the Göllnitz-Gordon theorem for even moduli in general case. Let $\widetilde{O}_{k,i}(n)$ be the number of overpartitions of $n$ whose parts satisfy certain difference condition and $\widetilde{P}_{k,i}(n)$ be the number of overpartitions of $n$ whose non-overlined parts satisfy certain congruence condition. We show th…
▽ More
We give an overpartition analogue of Bressoud's combinatorial generalization of the Göllnitz-Gordon theorem for even moduli in general case. Let $\widetilde{O}_{k,i}(n)$ be the number of overpartitions of $n$ whose parts satisfy certain difference condition and $\widetilde{P}_{k,i}(n)$ be the number of overpartitions of $n$ whose non-overlined parts satisfy certain congruence condition. We show that $\widetilde{O}_{k,i}(n)=\widetilde{P}_{k,i}(n)$ for $1\leq i<k$.
△ Less
Submitted 27 February, 2017;
originally announced February 2017.
-
An overpartition analogue of the Andrews-Göllnitz-Gordon theorem
Authors:
Thomas Y. He,
Kathy Q. Ji,
Allison Y. F. Wang,
Alice X. H. Zhao
Abstract:
In 1967, Andrews found a combinatorial generalization of the Göllnitz-Gordon theorem, which can be called the Andrews-Göllnitz-Gordon theorem. In 1980, Bressoud derived a multisum Rogers-Ramanujan-type identity, which can be considered as the generating function counterpart of the Andrews-Göllnitz-Gordon theorem. Lovejoy gave an overpartition analogue of the Andrews-Göllnitz-Gordon theorem for…
▽ More
In 1967, Andrews found a combinatorial generalization of the Göllnitz-Gordon theorem, which can be called the Andrews-Göllnitz-Gordon theorem. In 1980, Bressoud derived a multisum Rogers-Ramanujan-type identity, which can be considered as the generating function counterpart of the Andrews-Göllnitz-Gordon theorem. Lovejoy gave an overpartition analogue of the Andrews-Göllnitz-Gordon theorem for $i=k$. In this paper, we give an overpartition analogue of this theorem in the general case. By using Bailey's lemma and a change of base formula due to Bressoud, Ismail and Stanton, we obtain an overpartition analogue of Bressoud's identity. We then give a combinatorial interpretation of this identity by introducing the Göllnitz-Gordon marking of an overpartition, which yields an overpartition analogue of the Andrews-Göllnitz-Gordon theorem.
△ Less
Submitted 16 March, 2018; v1 submitted 15 December, 2016;
originally announced December 2016.
-
A Fast Proximal Gradient Algorithm for Decentralized Composite Optimization over Directed Networks
Authors:
**shan Zeng,
Tao He,
Mingwen Wang
Abstract:
This paper proposes a fast decentralized algorithm for solving a consensus optimization problem defined in a directed networked multi-agent system, where the local objective functions have the smooth+nonsmooth composite form, and are possibly nonconvex. Examples of such problems include decentralized compressed sensing and constrained quadratic programming problems, as well as many decentralized r…
▽ More
This paper proposes a fast decentralized algorithm for solving a consensus optimization problem defined in a directed networked multi-agent system, where the local objective functions have the smooth+nonsmooth composite form, and are possibly nonconvex. Examples of such problems include decentralized compressed sensing and constrained quadratic programming problems, as well as many decentralized regularization problems. We extend the existing algorithms PG-EXTRA and ExtraPush to a new algorithm PG-ExtraPush for composite consensus optimization over a directed network. This algorithm takes advantage of the proximity operator like in PG-EXTRA to deal with the nonsmooth term, and employs the push-sum protocol like in ExtraPush to tackle the bias introduced by the directed network. With a proper step size, we show that PG-ExtraPush converges to an optimal solution at a linear rate under some regular assumptions. We conduct a series of numerical experiments to show the effectiveness of the proposed algorithm. Specifically, with a proper step size, PG-ExtraPush performs linear rates in most of cases, even in some nonconvex cases, and is significantly faster than Subgradient-Push, even if the latter uses a hand-optimized step size. The established theoretical results are also verified by the numerical results.
△ Less
Submitted 26 March, 2017; v1 submitted 13 September, 2016;
originally announced September 2016.
-
A Note on the Daubechies Approach in the Construction of Spline Type Orthogonal Scaling Functions
Authors:
Tian-Xiao He,
Tung Nguyen
Abstract:
We use Lorentz polynomials to present the solutions explicitly of equations (6.1.7) of [I. Daubechies, Ten lectures on wavelets, CBMS-NSF Regional Conference Series in Applied Mathematics, 61. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 1992] and (4.9) of [I. Daubechies, Orthonormal bases of compactly supported wavelets. Comm. Pure Appl. Math. 41 (1988), no. 7, 909--99…
▽ More
We use Lorentz polynomials to present the solutions explicitly of equations (6.1.7) of [I. Daubechies, Ten lectures on wavelets, CBMS-NSF Regional Conference Series in Applied Mathematics, 61. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 1992] and (4.9) of [I. Daubechies, Orthonormal bases of compactly supported wavelets. Comm. Pure Appl. Math. 41 (1988), no. 7, 909--996] sot that we give an efficient way to prove Daubechies' results on the existence of spline type orthogonal scaling functions and to evaluate Daubechies scaling functions.
△ Less
Submitted 10 July, 2015;
originally announced July 2015.
-
Duals of Bernoulli Numbers and Polynomials and Euler Number and Polynomials
Authors:
Tian-Xiao He,
**ze Zheng
Abstract:
A sequence inverse relationship can be defined by a pair of infinite inverse matrices. If the pair of matrices are the same, they define a dual relationship. Here presented is a unified approach to construct dual relationships via pseudo-involution of Riordan arrays. Then we give four dual relationships for Bernoulli numbers and Euler numbers, from which the corresponding dual sequences of Bernoul…
▽ More
A sequence inverse relationship can be defined by a pair of infinite inverse matrices. If the pair of matrices are the same, they define a dual relationship. Here presented is a unified approach to construct dual relationships via pseudo-involution of Riordan arrays. Then we give four dual relationships for Bernoulli numbers and Euler numbers, from which the corresponding dual sequences of Bernoulli polynomials and Euler polynomials are constructed. Some applications in the construction of identities of Bernoulli numbers and polynomials and Euler numbers and polynomials are discussed based on the dual relationships.
△ Less
Submitted 10 July, 2015;
originally announced July 2015.
-
Dynamic Service Migration in Mobile Edge Computing Based on Markov Decision Process
Authors:
Shiqiang Wang,
Rahul Urgaonkar,
Murtaza Zafer,
Ting He,
Kevin Chan,
Kin K. Leung
Abstract:
In mobile edge computing, local edge servers can host cloud-based services, which reduces network overhead and latency but requires service migrations as users move to new locations. It is challenging to make migration decisions optimally because of the uncertainty in such a dynamic cloud environment. In this paper, we formulate the service migration problem as a Markov Decision Process (MDP). Our…
▽ More
In mobile edge computing, local edge servers can host cloud-based services, which reduces network overhead and latency but requires service migrations as users move to new locations. It is challenging to make migration decisions optimally because of the uncertainty in such a dynamic cloud environment. In this paper, we formulate the service migration problem as a Markov Decision Process (MDP). Our formulation captures general cost models and provides a mathematical framework to design optimal service migration policies. In order to overcome the complexity associated with computing the optimal policy, we approximate the underlying state space by the distance between the user and service locations. We show that the resulting MDP is exact for uniform one-dimensional user mobility while it provides a close approximation for uniform two-dimensional mobility with a constant additive error. We also propose a new algorithm and a numerical technique for computing the optimal solution which is significantly faster than traditional methods based on standard value or policy iteration. We illustrate the application of our solution in practical scenarios where many theoretical assumptions are relaxed. Our evaluations based on real-world mobility traces of San Francisco taxis show superior performance of the proposed solution compared to baseline solutions.
△ Less
Submitted 8 May, 2019; v1 submitted 17 June, 2015;
originally announced June 2015.
-
Mobility-Induced Service Migration in Mobile Micro-Clouds
Authors:
Shiqiang Wang,
Rahul Urgaonkar,
Ting He,
Murtaza Zafer,
Kevin Chan,
Kin K. Leung
Abstract:
Mobile micro-cloud is an emerging technology in distributed computing, which is aimed at providing seamless computing/data access to the edge of the network when a centralized service may suffer from poor connectivity and long latency. Different from the traditional cloud, a mobile micro-cloud is smaller and deployed closer to users, typically attached to a cellular basestation or wireless network…
▽ More
Mobile micro-cloud is an emerging technology in distributed computing, which is aimed at providing seamless computing/data access to the edge of the network when a centralized service may suffer from poor connectivity and long latency. Different from the traditional cloud, a mobile micro-cloud is smaller and deployed closer to users, typically attached to a cellular basestation or wireless network access point. Due to the relatively small coverage area of each basestation or access point, when a user moves across areas covered by different basestations or access points which are attached to different micro-clouds, issues of service performance and service migration become important. In this paper, we consider such migration issues. We model the general problem as a Markov decision process (MDP), and show that, in the special case where the mobile user follows a one-dimensional asymmetric random walk mobility model, the optimal policy for service migration is a threshold policy. We obtain the analytical solution for the cost resulting from arbitrary thresholds, and then propose an algorithm for finding the optimal thresholds. The proposed algorithm is more efficient than standard mechanisms for solving MDPs.
△ Less
Submitted 17 March, 2015;
originally announced March 2015.