-
Semiclassical asymptotics for Bergman projections with Gevrey weights
Authors:
Haoren Xiong,
Hang Xu
Abstract:
We extend the direct approach to the semiclassical asymptotics for Bergman projections, developed by Deleporte--Hitrik--Sjöstrand for real analytic exponential weights and Hitrik--Stone for smooth exponential weights, to the case of Gevrey weights. We prove that the amplitude of the asymptotic Bergman projection forms a Gevrey symbol whose asymptotic coefficients obey a certain Gevrey-type growth rate, and it is constructed by an asymptotic inversion of an explicit Fourier integral operator up to a Gevrey-type small remainder.
Submitted 4 April, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
An Efficient Algorithm for Vertex Enumeration of Arrangement
Authors:
Zelin Dong,
Fenglei Fan,
Huan Xiong,
Tieyong Zeng
Abstract:
This paper presents a state-of-the-art algorithm for the vertex enumeration problem of arrangements, which is based on the proposed new pivot rule, called the Zero rule. The Zero rule possesses several desirable properties: i) It gets rid of the objective function; ii) Its terminal satisfies uniqueness; iii) We establish the if-and-only-if condition between the Zero rule and its valid reverse, which is not enjoyed by earlier rules; iv) Applying the Zero rule recursively definitely terminates in $d$ steps, where $d$ is the dimension of input variables. As a result, given an arbitrary arrangement with $v$ vertices of $n$ hyperplanes in $\mathbb{R}^d$, the algorithm's complexity is at most $\mathcal{O}(n^2d^2v)$ and can be as low as $\mathcal{O}(nd^4v)$ if it is a simple arrangement, while Moss' algorithm takes $\mathcal{O}(nd^2v^2)$, and Avis and Fukuda's algorithm goes into a loop or skips vertices because the if-and-only-if condition between the rule they chose and its valid reverse is not fulfilled. Systematic and comprehensive experiments confirm that the Zero rule not only does not fail but also is the most efficient.
Submitted 29 January, 2024;
originally announced January 2024.
-
Multiplicity of Positive Solutions of Nonlinear Elliptic Equation with Gradient Term
Authors:
Fei Fang,
Zhong Tan,
Huiru Xiong
Abstract:
In this paper, we consider the following nonlinear elliptic equation with gradient term: \[ \left\{ \begin{gathered}
- Δu - \frac{1}{2}(x \cdot \nabla u) + (λa(x)+b(x))u = βu^q +u^{2^*-1}, \hfill \\
0<u \in {H_K^{1}(\mathbb{R}^N)}, \hfill \\ \end{gathered} \right . \] where $λ, β\in (0,\infty), q \in (1,2^*-1), 2^* = 2N/(N-2), N\geq3, a(x), b(x): \mathbb{R}^N \to \mathbb{R}$ are continuous functions, and $a(x)$ is nonnegative on $\mathbb{R}^N$. When $λ$ is large enough, we prove the existence and multiplicity of positive solutions to the equation.
Submitted 5 December, 2023;
originally announced December 2023.
-
Proof of a Conjecture of Nath and Sellers on Simultaneous Core Partitions
Authors:
Yetong Sha,
Huan Xiong
Abstract:
In 2016, Nath and Sellers proposed a conjecture regarding the precise largest size of ${(s,ms-1,ms+1)}$-core partitions. In this paper, we prove their conjecture. One of the key techniques in our proof is to introduce and study the properties of generalized-$β$-sets, which extend the concept of $β$-sets for core partitions. Our results can be interpreted as a generalization of the well-known result of Yang, Zhong, and Zhou concerning the largest size of $(s,s+1,s+2)$-core partitions.
Submitted 5 March, 2024; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Boundedness of metaplectic Toeplitz operators and Weyl symbols
Authors:
Haoren Xiong
Abstract:
We study Toeplitz operators on the Bargmann space, whose Toeplitz symbols are exponentials of complex inhomogeneous quadratic polynomials. Extending a result by Coburn--Hitrik--Sjöstrand, we show that the boundedness of such Toeplitz operators implies the boundedness of the corresponding Weyl symbols, thus completing the proof of the Berger--Coburn conjecture in this case. We also show that a Toeplitz operator is compact precisely when its Weyl symbol vanishes at infinity in this case.
Submitted 29 May, 2023; v1 submitted 6 May, 2023;
originally announced May 2023.
-
Weighted Sobolev Space and Hyperbolic Laplacian Equations I
Authors:
Fei Fang,
Zhong Tan,
Huiru Xiong
Abstract:
In this paper, the following problem in the hyperbolic space $\mathbb{B}^N$ will be considered \begin{equation*} -Δ_{\mathbb{B}^N} u=f(x,u), \mathrm{in} \ \mathbb{B}^N.\eqno{(1)} \end{equation*} where, $Δ_{\mathbb{B}^N}$ denotes the Laplace Beltrami operator on $\mathbb{B}^N$. And this problem can be converted into the following Euclidean problem \begin{equation*} \begin{cases} -\operatorname{div}(K(x) \nabla u)=4 K(x)^{\frac{N}{N-2}}f(x,u), &\mathrm{in} \ \mathbb{B}^N, \\ u(0)=0, &\mathrm{on}\ \partial\mathbb{B}^N, \end{cases}\eqno{(2)} \end{equation*} where, $K(x):=1/\left(1-|x|^2\right)^{N-2}.$ Then, the existence of solution of problem (1) can be obtained by studying the existence of solution of problem (2). We will equip problem (2) with a weighted Sobolev space and prove the compact embedding theorem and the concentration compactness principle for the weighted Sobolev space. And we will prove that the maximum principle holds for the operator $-\operatorname{div}(K(x) \nabla u)$.
When $f(x,u)=|u|^{2^*-2} u+λu^{q-2}u$, $λ>0$, $1<q<2^{\ast}$, using the variational method, the compact embedding theorem, the concentration compactness principle and the maximum principle, the existence of nonradial solutions of problem (2) will be obtained.
Submitted 1 December, 2022; v1 submitted 26 November, 2022;
originally announced November 2022.
-
Zeroth-Order Negative Curvature Finding: Escaping Saddle Points without Gradients
Authors:
Hualin Zhang,
Huan Xiong,
Bin Gu
Abstract:
We consider escaping saddle points of nonconvex problems where only the function evaluations can be accessed. Although a variety of works have been proposed, the majority of them require either second or first-order information, and only a few of them have exploited zeroth-order methods, particularly the technique of negative curvature finding with zeroth-order methods which has been proven to be the most efficient method for escaping saddle points. To fill this gap, in this paper, we propose two zeroth-order negative curvature finding frameworks that can replace Hessian-vector product computations without increasing the iteration complexity. We apply the proposed frameworks to ZO-GD, ZO-SGD, ZO-SCSG, ZO-SPIDER and prove that these ZO algorithms can converge to $(ε,δ)$-approximate second-order stationary points with lower query complexity compared with prior zeroth-order works for finding local minima.
Submitted 4 October, 2022;
originally announced October 2022.
-
Combinatorics of Integer Partitions With Prescribed Perimeter
Authors:
Zhicong Lin,
Huan Xiong,
Sherry H. F. Yan
Abstract:
We prove that the number of even parts and the number of times that parts are repeated have the same distribution over integer partitions with a fixed perimeter. This refines Straub's analog of Euler's Odd-Distinct partition theorem. We generalize the two concerned statistics to those of the part-difference less than $d$ and the parts not congruent to $1$ modulo $d+1$ and prove a distribution inequality, that has a similar flavor as Alder's ex-conjecture, over partitions with a prescribed perimeter. Both of our results are proved analytically and combinatorially.
Submitted 6 April, 2022;
originally announced April 2022.
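The equidistribution statement in the abstract above can be checked by brute force for small perimeters. The sketch below is only an illustration and rests on two assumptions that the abstract does not spell out: the perimeter of a partition is taken to be its largest part plus its number of parts minus one, and "the number of times that parts are repeated" is read as the number of adjacent equal pairs of parts. All function names are hypothetical.

```python
from collections import Counter
from itertools import product


def partitions_with_perimeter(n):
    """Yield partitions (weakly decreasing tuples) whose
    largest part + number of parts - 1 equals n (assumed definition)."""
    # Encode the boundary of the Young diagram as a word that starts with
    # an east step, ends with a north step, and has n - 1 free steps between.
    for middle in product("EN", repeat=n - 1):
        word = ("E",) + middle + ("N",)
        parts, width = [], 0
        for step in word:
            if step == "E":
                width += 1
            else:
                parts.append(width)  # each north step closes one part
        yield tuple(reversed(parts))


def even_parts(partition):
    """Number of even parts."""
    return sum(1 for part in partition if part % 2 == 0)


def repeated_parts(partition):
    """Number of adjacent equal pairs (assumed reading of 'repeated')."""
    return sum(1 for a, b in zip(partition, partition[1:]) if a == b)


def distributions(n):
    """Return the distributions of both statistics for perimeter n."""
    parts = list(partitions_with_perimeter(n))
    return Counter(map(even_parts, parts)), Counter(map(repeated_parts, parts))


if __name__ == "__main__":
    for n in range(1, 9):
        evens, repeats = distributions(n)
        print(n, evens == repeats)
```

Under these assumptions there are $2^{n-1}$ partitions with perimeter $n$, so the enumeration stays cheap only for small $n$.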
-
Complex Higgs Oscillators
Authors:
Haoren Xiong
Abstract:
In this note we discuss the complex version of the Higgs oscillator on the hyperbolic space. The eigenvalues and resonances of the complex Higgs oscillator are computed in different examples in the hyperbolic setting. We also propose open problems like whether the complex absorbing potential (CAP) method works for asymptotically hyperbolic manifolds and finding hyperbolic analogues of the complex harmonic oscillator.
Submitted 20 September, 2021;
originally announced September 2021.
-
Generic simplicity of resonances in obstacle scattering
Authors:
Haoren Xiong
Abstract:
We show that all resonances in Dirichlet obstacle scattering (in $\mathbb{C}$ in odd dimensions and in the logarithmic cover of $\mathbb{C}\setminus\{0\}$ in even dimensions) are generically simple in the class of obstacles with $C^k$ (and $C^\infty$) boundaries, $k \geq 2$.
Submitted 9 September, 2022; v1 submitted 16 May, 2021;
originally announced May 2021.
-
Resonances as viscosity limits for black box perturbations
Authors:
Haoren Xiong
Abstract:
We show that the complex absorbing potential (CAP) method for computing scattering resonances applies to an abstractly defined class of black box perturbations of the Laplacian in $\mathbb{R}^n$ which can be analytically extended from $\mathbb{R}^n$ to a conic neighborhood in $\mathbb{C}^n$ near infinity. The black box setting allows a unifying treatment of diverse problems ranging from obstacle scattering to scattering on finite volume surfaces.
Submitted 19 February, 2021;
originally announced February 2021.
-
Finite-Time Analysis for Double Q-learning
Authors:
Huaqing Xiong,
Lin Zhao,
Yingbin Liang,
Wei Zhang
Abstract:
Although Q-learning is one of the most successful algorithms for finding the best action-value function (and thus the optimal policy) in reinforcement learning, its implementation often suffers from large overestimation of Q-function values incurred by random sampling. The double Q-learning algorithm proposed in~\citet{hasselt2010double} overcomes such an overestimation issue by randomly switching the update between two Q-estimators, and has thus gained significant popularity in practice. However, the theoretical understanding of double Q-learning is rather limited. So far only the asymptotic convergence has been established, which does not characterize how fast the algorithm converges. In this paper, we provide the first non-asymptotic (i.e., finite-time) analysis for double Q-learning. We show that both synchronous and asynchronous double Q-learning are guaranteed to converge to an $ε$-accurate neighborhood of the global optimum by taking $\tildeΩ\left(\left( \frac{1}{(1-γ)^6ε^2}\right)^{\frac{1}ω} +\left(\frac{1}{1-γ}\right)^{\frac{1}{1-ω}}\right)$ iterations, where $ω\in(0,1)$ is the decay parameter of the learning rate, and $γ$ is the discount factor. Our analysis develops novel techniques to derive finite-time bounds on the difference between two inter-connected stochastic processes, which is new to the literature of stochastic approximation.
Submitted 12 October, 2020; v1 submitted 29 September, 2020;
originally announced September 2020.
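The mechanism described in the abstract above, switching the update between two Q-estimators, can be sketched for the tabular case as follows. This is a minimal illustration of the standard double Q-learning update, not the paper's analysis or code; the function name, the list-of-lists table layout, and the explicit `update_a` flag (which in practice would be drawn at random, for example with `random.random() < 0.5`) are assumptions made for the example.

```python
def double_q_update(q_a, q_b, s, a, r, s_next, alpha, gamma, update_a):
    """Apply one tabular double Q-learning update in place.

    q_a, q_b : lists of per-state action-value lists (hypothetical layout).
    update_a : True to update q_a, False to update q_b.
    """
    learner, evaluator = (q_a, q_b) if update_a else (q_b, q_a)
    # Select the greedy next action using the table being updated ...
    best = max(range(len(learner[s_next])), key=lambda b: learner[s_next][b])
    # ... but evaluate that action with the other table, which is what
    # reduces the overestimation of plain Q-learning.
    target = r + gamma * evaluator[s_next][best]
    learner[s][a] += alpha * (target - learner[s][a])


if __name__ == "__main__":
    q_a = [[0.0, 0.0], [1.0, 2.0]]
    q_b = [[0.0, 0.0], [5.0, 3.0]]
    double_q_update(q_a, q_b, s=0, a=1, r=1.0, s_next=1,
                    alpha=0.5, gamma=0.9, update_a=True)
    print(q_a[0][1])
```

Passing the coin flip in as an argument keeps the update deterministic, which makes it easy to test; a training loop would draw it independently at every step.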
-
Momentum Q-learning with Finite-Sample Convergence Guarantee
Authors:
Bowen Weng,
Huaqing Xiong,
Lin Zhao,
Yingbin Liang,
Wei Zhang
Abstract:
Existing studies indicate that momentum ideas in conventional optimization can be used to improve the performance of Q-learning algorithms. However, the finite-sample analysis for momentum-based Q-learning algorithms is only available for the tabular case without function approximations. This paper analyzes a class of momentum-based Q-learning algorithms with finite-sample guarantee. Specifically, we propose the MomentumQ algorithm, which integrates the Nesterov's and Polyak's momentum schemes, and generalizes the existing momentum-based Q-learning algorithms. For the infinite state-action space case, we establish the convergence guarantee for MomentumQ with linear function approximations and Markovian sampling. In particular, we characterize the finite-sample convergence rate which is provably faster than the vanilla Q-learning. This is the first finite-sample analysis for momentum-based Q-learning algorithms with function approximations. For the tabular case under synchronous sampling, we also obtain a finite-sample convergence rate that is slightly better than the SpeedyQ \citep{azar2011speedy} when choosing a special family of step sizes. Finally, we demonstrate through various experiments that the proposed MomentumQ outperforms other momentum-based Q-learning algorithms.
Submitted 30 July, 2020;
originally announced July 2020.
-
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Authors:
Bowen Weng,
Huaqing Xiong,
Yingbin Liang,
Wei Zhang
Abstract:
Existing convergence analyses of Q-learning mostly focus on the vanilla stochastic gradient descent (SGD) type of updates. Although Adaptive Moment Estimation (Adam) has been commonly used for practical Q-learning algorithms, there has not been any convergence guarantee provided for Q-learning with this type of update. In this paper, we first characterize the convergence rate for Q-AMSGrad, which is the Q-learning algorithm with AMSGrad update (a commonly adopted alternative of Adam for theoretical analysis). To further improve the performance, we propose to incorporate the momentum restart scheme into Q-AMSGrad, resulting in the so-called Q-AMSGradR algorithm. The convergence rate of Q-AMSGradR is also established. Our experiments on a linear quadratic regulator problem show that the two proposed Q-learning algorithms outperform the vanilla Q-learning with SGD updates. The two algorithms also exhibit significantly better performance than the DQN learning method over a batch of Atari 2600 games.
Submitted 14 July, 2020;
originally announced July 2020.
-
Resonances as Viscosity Limits for Exponentially Decaying Potentials
Authors:
Haoren Xiong
Abstract:
We show that the complex absorbing potential (CAP) method for computing scattering resonances applies to the case of exponentially decaying potentials. That means that the eigenvalues of $-Δ+ V - iεx^2$, $|V(x)|\leq C e^{-2γ|x|}$ converge, as $ ε\to 0+ $, to the poles of the meromorphic continuation of $ ( -Δ+ V -λ^2 )^{-1} $ uniformly on compact subsets of $\textrm{Re}\,λ>0$, $\textrm{Im}\,λ>-γ$, $\argλ> -π/8$.
Submitted 3 February, 2021; v1 submitted 3 May, 2020;
originally announced May 2020.
-
Resonances as Viscosity Limits for Exterior Dilation Analytic Potentials
Authors:
Haoren Xiong
Abstract:
For exterior dilation analytic potential, $V$, we use the method of complex scaling to show that the resonances of $ - Δ+ V $, in a conic neighbourhood of the real axis, are limits of eigenvalues of $ - Δ+ V - i εx^2 $ as $ ε\to 0+ $, if $V$ can be analytically extended from $\mathbb{R}^n$ to a truncated cone in $\mathbb{C}^n$.
Submitted 27 February, 2020;
originally announced February 2020.
-
Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling
Authors:
Huaqing Xiong,
Tengyu Xu,
Yingbin Liang,
Wei Zhang
Abstract:
Despite the wide applications of Adam in reinforcement learning (RL), the theoretical convergence of Adam-type RL algorithms has not been established. This paper provides the first such convergence analysis for two fundamental RL algorithms of policy gradient (PG) and temporal difference (TD) learning that incorporate AMSGrad updates (a standard alternative of Adam in theoretical analysis), referred to as PG-AMSGrad and TD-AMSGrad, respectively. Moreover, our analysis focuses on Markovian sampling for both algorithms. We show that under general nonlinear function approximation, PG-AMSGrad with a constant stepsize converges to a neighborhood of a stationary point at the rate of $\mathcal{O}(1/T)$ (where $T$ denotes the number of iterations), and with a diminishing stepsize converges exactly to a stationary point at the rate of $\mathcal{O}(\log^2 T/\sqrt{T})$. Furthermore, under linear function approximation, TD-AMSGrad with a constant stepsize converges to a neighborhood of the global optimum at the rate of $\mathcal{O}(1/T)$, and with a diminishing stepsize converges exactly to the global optimum at the rate of $\mathcal{O}(\log T/\sqrt{T})$. Our study develops new techniques for analyzing the Adam-type RL algorithms under Markovian sampling.
Submitted 17 August, 2020; v1 submitted 14 February, 2020;
originally announced February 2020.
-
Fast Large-Scale Discrete Optimization Based on Principal Coordinate Descent
Authors:
Huan Xiong,
Mengyang Yu,
Li Liu,
Fan Zhu,
Fumin Shen,
Ling Shao
Abstract:
Binary optimization, a representative subclass of discrete optimization, plays an important role in mathematical optimization and has various applications in computer vision and machine learning. Usually, binary optimization problems are NP-hard and difficult to solve due to the binary constraints, especially when the number of variables is very large. Existing methods often suffer from high computational costs or large accumulated quantization errors, or are only designed for specific tasks. In this paper, we propose a fast algorithm to find effective approximate solutions for general binary optimization problems. The proposed algorithm iteratively solves minimization problems related to the linear surrogates of loss functions, which leads to the updating of some binary variables most impacting the value of loss functions in each step. Our method supports a wide class of empirical objective functions with/without restrictions on the numbers of $1$s and $-1$s in the binary variables. Furthermore, the theoretical convergence of our algorithm is proven, and the explicit convergence rates are derived, for objective functions with Lipschitz continuous gradients, which are commonly adopted in practice. Extensive experiments on several binary optimization tasks and large-scale datasets demonstrate the superiority of the proposed algorithm over several state-of-the-art methods in terms of both effectiveness and efficiency.
Submitted 15 May, 2021; v1 submitted 16 September, 2019;
originally announced September 2019.
-
Accelerated Target Updates for Q-learning
Authors:
Bowen Weng,
Huaqing Xiong,
Wei Zhang
Abstract:
This paper studies accelerations in Q-learning algorithms. We propose an accelerated target update scheme by incorporating the historical iterates of Q functions. The idea is conceptually inspired by the momentum-based accelerated methods in the optimization theory. Conditions under which the proposed accelerated algorithms converge are established. The algorithms are validated using commonly adopted testing problems in reinforcement learning, including the FrozenLake grid world game, two discrete-time LQR problems from the Deepmind Control Suite, and the Atari 2600 games. Simulation results show that the proposed accelerated algorithms can improve the convergence performance compared with the vanilla Q-learning algorithm.
Submitted 11 May, 2019; v1 submitted 7 May, 2019;
originally announced May 2019.
-
Analytical Convergence Regions of Accelerated Gradient Descent in Nonconvex Optimization under Regularity Condition
Authors:
Huaqing Xiong,
Yuejie Chi,
Bin Hu,
Wei Zhang
Abstract:
There is a growing interest in using robust control theory to analyze and design optimization and machine learning algorithms. This paper studies a class of nonconvex optimization problems whose cost functions satisfy the so-called Regularity Condition (RC). Empirical studies show that accelerated gradient descent (AGD) algorithms (e.g. Nesterov's acceleration and Heavy-ball) with proper initializations often work well in practice. However, the convergence of such AGD algorithms is largely unknown in the literature. The main contribution of this paper is the analytical characterization of the convergence regions of AGD under RC via robust control tools. Since such optimization problems arise frequently in many applications such as phase retrieval, training of neural networks and matrix sensing, our result shows promise of robust control theory in these areas.
Submitted 9 December, 2019; v1 submitted 7 October, 2018;
originally announced October 2018.
-
Monotonicity properties for ranks of overpartitions
Authors:
Huan Xiong,
Wenston J. T. Zang
Abstract:
The rank of partitions plays an important role in the combinatorial interpretations of several of Ramanujan's famous congruence formulas. In 2005 and 2008, the $D$-rank and $M_2$-rank of an overpartition were introduced by Lovejoy, respectively. Let $\overline{N}(m,n)$ and $\overline{N2}(m,n)$ denote the number of overpartitions of $n$ with $D$-rank $m$ and $M_2$-rank $m$, respectively. In 2014, Chan and Mao proposed a conjecture on monotonicity properties of $\overline{N}(m,n)$ and $\overline{N2}(m,n)$. In this paper, we prove the Chan-Mao monotonicity conjecture. To be specific, we show that for any integer $m$ and nonnegative integer $n$, $\overline{N2}(m,n)\leq \overline{N2}(m,n+1)$; and for $(m,n)\neq (0,4)$ with $n\neq\, |m| +2$, we have $\overline{N}(m,n)\leq \overline{N}(m,n+1)$. Furthermore, when $m$ increases, we prove that $\overline{N}(m,n)\geq \overline{N}(m+2,n)$ and $\overline{N2}(m,n)\geq \overline{N2}(m+2,n)$ for any $m,n\geq 0$, which is an analogue of Chan and Mao's result for partitions.
Submitted 4 March, 2019; v1 submitted 13 August, 2018;
originally announced August 2018.
-
Risk-Averse Classification
Authors:
Constantine Vitt,
Darinka Dentcheva,
Hui Xiong
Abstract:
We develop a new approach to solving classification problems, which is based on the theory of coherent measures of risk and risk sharing ideas. The proposed approach aims at designing a risk-averse classifier. The new approach allows for associating a distinct risk functional with each class. The risk may be measured by different (non-linear in probability) measures.
We analyze the structure of the new classifier design problem and establish its theoretical relation to known risk-neutral design problems. In particular, we show that the risk-sharing classification problem is equivalent to an implicitly defined optimization problem with unequal, implicitly defined but unknown, weights for each data point. We implement our methodology in a binary classification scenario on several different data sets and carry out numerical comparison with classifiers which are obtained using the Huber loss function and other loss functions known in the literature. We formulate specific risk-averse support vector machines in order to demonstrate the viability of our method.
Submitted 20 July, 2018; v1 submitted 30 April, 2018;
originally announced May 2018.
-
On the polynomiality and asymptotics of moments of sizes for random $(n, dn\pm 1)$-core partitions with distinct parts
Authors:
Huan Xiong,
Wenston J. T. Zang
Abstract:
Amdeberhan's conjectures on the enumeration, the average size, and the largest size of $(n,n+1)$-core partitions with distinct parts have motivated much research on this topic. Recently, Straub and Nath-Sellers obtained formulas for the numbers of $(n, dn-1)$ and $(n, dn+1)$-core partitions with distinct parts, respectively. Let $X_{s,t}$ be the size of a uniform random $(s,t)$-core partition with distinct parts when $s$ and $t$ are coprime to each other. Some explicit formulas for the $k$-th moments $\mathbb{E} [X_{n,n+1}^k]$ and $\mathbb{E} [X_{2n+1,2n+3}^k]$ were given by Zaleski and Zeilberger when $k$ is small. Zaleski also studied the expectation and higher moments of $X_{n,dn-1}$ and conjectured some polynomiality properties concerning them in arXiv:1702.05634.
Motivated by the above works, we derive several polynomiality results and asymptotic formulas for the $k$-th moments of $X_{n,dn+1}$ and $X_{n,dn-1}$ in this paper, by studying the beta sets of core partitions. In particular, we show that these $k$-th moments are asymptotically some polynomials of n with degrees at most $2k$, when $d$ is given and $n$ tends to infinity. Moreover, when $d=1$, we derive that the $k$-th moment $\mathbb{E} [X_{n,n+1}^k]$ of $X_{n,n+1}$ is asymptotically equal to $\left(n^2/10\right)^k$ when $n$ tends to infinity. The explicit formulas for the expectations $\mathbb{E} [X_{n,dn+1}]$ and $\mathbb{E} [X_{n,dn-1}]$ are also given. The $(n,dn-1)$-core case in our results proves several conjectures of Zaleski on the polynomiality of the expectation and higher moments of $X_{n,dn-1}$.
Submitted 2 March, 2019; v1 submitted 24 April, 2018;
originally announced April 2018.
-
Polynomiality of certain average weights for oscillating tableaux
Authors:
Guo-Niu Han,
Huan Xiong
Abstract:
We prove that a family of average weights for oscillating tableaux are polynomials in two variables, namely, the length of the oscillating tableau and the size of the ending partition, which generalizes a result of Hopkins and Zhang. Several explicit and asymptotic formulas for the average weights are also derived.
Submitted 2 September, 2017;
originally announced September 2017.
-
On the largest sizes of certain simultaneous core partitions with distinct parts
Authors:
Huan Xiong
Abstract:
Motivated by Amdeberhan's conjecture on $(t,t+1)$-core partitions with distinct parts, various results on the numbers, the largest sizes and the average sizes of simultaneous core partitions with distinct parts have recently been obtained by many mathematicians. In this paper, we derive the largest sizes of $(t,mt\pm 1)$-core partitions with distinct parts, which verifies a generalization of Amdeberhan's conjecture. We also prove that the number of such partitions of the largest size is at most $2$.
Submitted 2 September, 2017;
originally announced September 2017.
-
Skew doubled shifted plane partitions: calculus and asymptotics
Authors:
Guo-Niu Han,
Huan Xiong
Abstract:
Plane partitions have been widely studied in Mathematics since MacMahon. See, for example, the works by Andrews, Macdonald, Stanley, Sagan and Krattenthaler. The Schur process approach, introduced by Okounkov and Reshetikhin, and further developed by Borodin, Corwin, Corteel, Savelief and Vuletić, has proved to be a powerful tool in the study of various kinds of plane partitions. The exact enumerations of ordinary plane partitions, shifted plane partitions and cylindric partitions can be derived from two summation formulas for Schur processes, namely, the open summation formula and the cylindric summation formula.
In this paper, we establish a new summation formula for Schur processes, called the complete summation formula. As an application, we obtain the generating function and the asymptotic formula for the number of doubled shifted plane partitions, which can be viewed as plane partitions `shifted at the two sides'. We prove that the order of the asymptotic formula depends only on the diagonal width of the doubled shifted plane partition, not on the profile (the skew zone) itself. By using the same methods, the generating function and the asymptotic formula for the number of symmetric cylindric partitions are also derived.
Submitted 6 June, 2019; v1 submitted 18 July, 2017;
originally announced July 2017.
-
Some useful theorems for asymptotic formulas and their applications to skew plane partitions and cylindric partitions
Authors:
Guo-Niu Han,
Huan Xiong
Abstract:
Inspired by the works of Dewar, Murty and Kotěšovec, we establish some useful theorems for asymptotic formulas. As an application, we obtain asymptotic formulas for the numbers of skew plane partitions and cylindric partitions. We prove that the order of the asymptotic formula for the skew plane partitions of fixed width depends only on the width of the region, not on the profile (the skew zone) itself, while this is not true for cylindric partitions.
Submitted 16 July, 2017;
originally announced July 2017.
-
Polynomiality of some hook-content summations for doubled distinct and self-conjugate partitions
Authors:
Guo-Niu Han,
Huan Xiong
Abstract:
In 2009, the first author proved the Nekrasov-Okounkov formula on hook lengths for integer partitions by using an identity of Macdonald in the framework of type $\widetilde A$ affine root systems, and conjectured that some summations over the set of all partitions of size $n$ are always polynomials in $n$. This conjecture was generalized and proved by Stanley. Recently, Pétréolle derived two Nekrasov-Okounkov type formulas for $\widetilde C$ and $\widetilde C\,\check{}$ which involve doubled distinct and self-conjugate partitions. Inspired by all those previous works, we establish the polynomiality of some hook-content summations for doubled distinct and self-conjugate partitions.
Submitted 17 January, 2016;
originally announced January 2016.
-
New hook-content formulas for strict partitions
Authors:
Guo-Niu Han,
Huan Xiong
Abstract:
We introduce the difference operator for functions defined on strict partitions and prove a polynomiality property for a summation involving the hook length and content statistics. As an application, several new hook-content formulas for strict partitions are derived.
Submitted 13 December, 2016; v1 submitted 9 November, 2015;
originally announced November 2015.
-
Difference operators for partitions under the Littlewood decomposition
Authors:
Paul-Olivier Dehaye,
Guo-Niu Han,
Huan Xiong
Abstract:
The concept of $t$-difference operator for functions of partitions is introduced to prove a generalization of Stanley's theorem on polynomiality of Plancherel averages of symmetric functions related to contents and hook lengths. Our extension uses a generalization of the notion of Plancherel measure, based on walks in the Young lattice with steps given by the addition of $t$-hooks. It is well-known that the hook lengths of multiples of $t$ can be characterized by the Littlewood decomposition. Our study gives some further information on the contents and hook lengths of other congruence classes modulo $t$.
Submitted 19 March, 2017; v1 submitted 9 November, 2015;
originally announced November 2015.
-
Core partitions with distinct parts
Authors:
Huan Xiong
Abstract:
Simultaneous core partitions have attracted much attention since Anderson's work on the number of $(t_1,t_2)$-core partitions. In this paper we focus on simultaneous core partitions with distinct parts. The generating function of $t$-core partitions with distinct parts is obtained. We also prove results on the number, the largest size and the average size of $(t, t + 1)$-core partitions. This gives a complete answer to a conjecture of Amdeberhan, which was recently proved in part, and independently, by Straub, Nath and Sellers, and Zaleski.
Submitted 19 March, 2017; v1 submitted 31 August, 2015;
originally announced August 2015.
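As a small illustration of the counting statement in the abstract above, the following Python sketch counts $(t, t+1)$-core partitions with distinct parts by brute force. It is not the method of the paper: it simply lists partitions with distinct parts and rejects any that has a hook length divisible by $t$ or $t+1$. All function names are hypothetical, and the search cap relies on the Olsson-Stanton bound $(s^2-1)(t^2-1)/24$ for the largest size of an $(s,t)$-core partition, an outside fact assumed here.

```python
def distinct_partitions(n, max_part=None):
    """Yield partitions of n into distinct parts, as decreasing tuples."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, max_part), 0, -1):
        # The next part must be strictly smaller than `first`.
        for rest in distinct_partitions(n - first, first - 1):
            yield (first,) + rest


def hook_lengths(lam):
    """Return all hook lengths of the partition lam (a decreasing tuple)."""
    if not lam:
        return []
    # conj[j] = number of parts larger than j = length of column j.
    conj = [sum(1 for p in lam if p > j) for j in range(lam[0])]
    return [
        (lam[i] - j - 1) + (conj[j] - i - 1) + 1  # arm + leg + 1
        for i in range(len(lam))
        for j in range(lam[i])
    ]


def is_core(lam, moduli):
    """True if no hook length of lam is divisible by any modulus."""
    return all(h % m != 0 for h in hook_lengths(lam) for m in moduli)


def count_distinct_cores(t):
    """Count (t, t+1)-core partitions with distinct parts by brute force."""
    # Assumed search cap: largest size of an (s, t)-core, with s = t + 1.
    cap = (t * t - 1) * ((t + 1) ** 2 - 1) // 24
    return sum(
        1
        for n in range(cap + 1)
        for lam in distinct_partitions(n)
        if is_core(lam, (t, t + 1))
    )


if __name__ == "__main__":
    print([count_distinct_cores(t) for t in range(1, 6)])
```

For $t = 1, \dots, 5$ the sketch gives the counts 1, 2, 3, 5, 8, which are consecutive Fibonacci numbers, as Amdeberhan's conjecture predicts. The enumeration grows quickly with $t$, so it is only meant as a sanity check on small cases.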
-
Finite Groups Whose Character Graphs Associated with Codegrees Have No Triangles
Authors:
Huan Xiong
Abstract:
Motivated by Problem $164$ proposed by Y. Berkovich and E. Zhmud' in their book "Characters of finite groups, Part $1$", we give a characterization of finite groups whose irreducible character codegrees are prime powers. This is based on a new kind of character graph of finite groups associated with codegrees. Such graphs have close and obvious connections with character codegree graphs. For example, they have the same number of connected components.
By analogy with earlier work on finite groups whose character graphs (associated with degrees) have no triangles, in the latter part of this paper we classify the finite groups whose character graphs associated with codegrees have no triangles.
Submitted 19 March, 2017; v1 submitted 28 August, 2015;
originally announced August 2015.
-
Difference operators for partitions and some applications
Authors:
Guo-Niu Han,
Huan Xiong
Abstract:
Motivated by the Nekrasov-Okounkov formula on hook lengths, the first author conjectured that the Plancherel average of the $2k$-th power sum of hook lengths of partitions with size $n$ is always a polynomial in $n$ for any $k\in \mathbb{N}$. This conjecture was generalized and proved by Stanley (Ramanujan J., 23(1--3): 91--105, 2010). In this paper, inspired by the work of Stanley and Olshanski on the differential poset of Young lattice, we study the properties of two kinds of difference operators $D$ and $D^-$ defined on functions of partitions. Even though the calculations for higher orders of $D$ are extremely complex, we prove that several well-known families of functions of partitions are annihilated by a power of the difference operator $D$. As an application, our results lead to several generalizations of classic results on partitions, including the marked hook formula, Stanley Theorem, Okada-Panova hook length formula, and Fujii-Kanno-Moriyama-Okada content formula. We emphasize that the Okada constants $K_r$ arise directly from the computation for a single partition $λ$, without the summation ranging over all partitions of size~$n$.
Submitted 18 January, 2018; v1 submitted 4 August, 2015;
originally announced August 2015.
-
Partitions with the same hook multiset
Authors:
Huan Xiong
Abstract:
It is well-known that two conjugate partitions have the same hook multiset. But two different partitions with the same hook multiset may not be conjugate to each other. In $1977$, Herman and Chung proposed the following question: What are the necessary and sufficient conditions for partitions to be determined by their hook multisets up to conjugation? In this paper, we will answer this question by giving a criterion to determine whether two different partitions with the same hook multiset are conjugate to each other.
Submitted 7 January, 2015;
originally announced January 2015.
-
On the largest size of $(t,t+1,..., t+p)$-core partitions
Authors:
Huan Xiong
Abstract:
In this paper we prove that Amdeberhan's conjecture on the largest size of $(t, t+1, t+2)$-core partitions is true. We also show that the number of $(t, t + 1, t + 2)$-core partitions with the largest size is $1$ or $2$ based on the parity of $t$. More generally, the largest size of $(t,t+1,..., t+p)$-core partitions and the number of such partitions with the largest size are determined.
Submitted 7 January, 2015; v1 submitted 8 October, 2014;
originally announced October 2014.
-
The number of simultaneous core partitions
Authors:
Huan Xiong
Abstract:
Amdeberhan conjectured that the number of $(t,t+1, t+2)$-core partitions is $\sum_{0\leq k\leq [\frac{t}{2}]}\frac{1}{k+1}\binom{t}{2k}\binom{2k}{k}$. In this paper, we obtain the generating function of the numbers $f_t$ of $(t, t + 1, ..., t + p)$-core partitions. In particular, this verifies that Amdeberhan's conjecture is true. We also prove that the number of $(t_1,t_2,..., t_m)$-core partitions is finite if and only if $\gcd(t_1,t_2,..., t_m)=1$, which extends Anderson's result on the finiteness of the number of $(t_1,t_2)$-core partitions for coprime positive integers $t_1$ and $t_2$ and thus rediscovers a result of Keith and Nath with a different proof.
Submitted 12 October, 2014; v1 submitted 24 September, 2014;
originally announced September 2014.
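The closed formula quoted in the abstract above can be checked numerically on small cases. The Python sketch below evaluates $\sum_{0\leq k\leq [t/2]}\frac{1}{k+1}\binom{t}{2k}\binom{2k}{k}$ and compares it with a brute-force count of $(t,t+1,t+2)$-core partitions, obtained by rejecting any partition that has a hook length divisible by $t$, $t+1$ or $t+2$. This is only an illustration, not the generating-function argument of the paper; the function names and the `max_size` search cap are assumptions made for this example, and the cap must be chosen at least as large as the largest such core.

```python
from math import comb


def amdeberhan_count(t):
    """Evaluate sum_{k=0}^{floor(t/2)} C(t, 2k) * C(2k, k) / (k + 1)."""
    # C(2k, k) / (k + 1) is the k-th Catalan number, so // is exact.
    return sum(
        comb(t, 2 * k) * (comb(2 * k, k) // (k + 1))
        for k in range(t // 2 + 1)
    )


def partitions(n, max_part=None):
    """Yield partitions of n as weakly decreasing tuples."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, max_part), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest


def hook_lengths(lam):
    """Return all hook lengths of the partition lam."""
    if not lam:
        return []
    # conj[j] = number of parts larger than j = length of column j.
    conj = [sum(1 for p in lam if p > j) for j in range(lam[0])]
    return [
        (lam[i] - j - 1) + (conj[j] - i - 1) + 1  # arm + leg + 1
        for i in range(len(lam))
        for j in range(lam[i])
    ]


def brute_force_count(t, max_size):
    """Count (t, t+1, t+2)-cores among partitions of size <= max_size."""
    moduli = (t, t + 1, t + 2)
    return sum(
        1
        for n in range(max_size + 1)
        for lam in partitions(n)
        if all(h % m != 0 for h in hook_lengths(lam) for m in moduli)
    )


if __name__ == "__main__":
    for t in (3, 4):
        print(t, amdeberhan_count(t), brute_force_count(t, 12))
```

With a cap of 12, both counts agree for $t = 3$ (four partitions) and $t = 4$ (nine partitions). Larger $t$ needs a larger cap, and the brute-force search becomes slow, which is one reason a closed formula is useful.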
-
Delay-Aware Cross-Layer Design for Network Utility Maximization in Multi-hop Networks
Authors:
Haozhi Xiong,
Ruogu Li,
Atilla Eryilmaz,
Eylem Ekici
Abstract:
We investigate the problem of designing delay-aware joint flow control, routing, and scheduling algorithms in general multi-hop networks for maximizing network utilization. Since the end-to-end delay performance has a complex dependence on the high-order statistics of cross-layer algorithms, earlier optimization-based design methodologies that optimize the long-term network utilization are not immediately well-suited for delay-aware design. This motivates us in this work to develop a novel design framework and alternative methods that take advantage of several unexploited design choices in the routing and the scheduling strategy spaces. In particular, we reveal and exploit a crucial characteristic of back pressure-type controllers that enables us to develop a novel link rate allocation strategy that not only optimizes long-term network utilization, but also yields loop-free multi-path routes between each source-destination pair. Moreover, we propose a regulated scheduling strategy, based on a token-based service discipline, for shaping the per-hop delay distribution to obtain highly desirable end-to-end delay performance. We establish that our joint flow control, routing, and scheduling algorithm achieves loop-free routes and optimal network utilization. Our extensive numerical studies support our theoretical results, and further show that our joint design leads to substantial end-to-end delay performance improvements in multi-hop networks compared to earlier solutions.
Submitted 7 December, 2010;
originally announced December 2010.