Search | arXiv e-print repository

Transversal Hamilton paths and cycles

Authors: Yangyang Cheng, Wanting Sun, Guanghui Wang, Lan Wei

Abstract: Given a collection $\mathcal{G} =\{G_1,G_2,\dots,G_m\}$ of graphs on the common vertex set $V$ of size $n$, an $m$-edge graph $H$ on the same vertex set $V$ is transversal in $\mathcal{G}$ if there exists a bijection $\varphi :E(H)\rightarrow [m]$ such that $e \in E(G_{\varphi(e)})$ for all $e\in E(H)$. Denote $δ(\mathcal{G}):=\operatorname*{min}\left\{δ(G_i): i\in [m]\right\}$. In this paper, we… ▽ More Given a collection $\mathcal{G} =\{G_1,G_2,\dots,G_m\}$ of graphs on the common vertex set $V$ of size $n$, an $m$-edge graph $H$ on the same vertex set $V$ is transversal in $\mathcal{G}$ if there exists a bijection $\varphi :E(H)\rightarrow [m]$ such that $e \in E(G_{\varphi(e)})$ for all $e\in E(H)$. Denote $δ(\mathcal{G}):=\operatorname*{min}\left\{δ(G_i): i\in [m]\right\}$. In this paper, we first establish a minimum degree condition for the existence of transversal Hamilton paths in $\mathcal{G}$: if $n=m+1$ and $δ(\mathcal{G})\geq \frac{n-1}{2}$, then $\mathcal{G}$ contains a transversal Hamilton path. This solves a problem proposed by [Li, Li and Li, J. Graph Theory, 2023]. As a continuation of the transversal version of Dirac's theorem [Joos and Kim, Bull. Lond. Math. Soc., 2020] and the stability result for transversal Hamilton cycles [Cheng and Staden, arXiv:2403.09913v1], our second result characterizes all graph collections with minimum degree at least $\frac{n}{2}-1$ and without transversal Hamilton cycles. We obtain an analogous result for transversal Hamilton paths. The proof is a combination of the stability result for transversal Hamilton paths or cycles, transversal blow-up lemma, along with some structural analysis. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: 33 pages, 10 figures

MSC Class: 05C35

arXiv:2406.04060 [pdf, ps, other]

Solution to a conjecture on resistance distances of block tower graphs

Authors: Wensheng Sun, Yujun Yang, Wuxian Chen, Shou-Jun Xu

Abstract: Let $G$ be a connected graph. The resistance distance between two vertices $u$ and $v$ of $G$, denoted by $R_{G}[u,v]$, is defined as the net effective resistance between them in the electric network constructed from $G$ by replacing each edge with a unit resistor. The resistance diameter of $G$, denoted by $D_{r}(G)$, is defined as the maximum resistance distance among all pairs of vertices of… ▽ More Let $G$ be a connected graph. The resistance distance between two vertices $u$ and $v$ of $G$, denoted by $R_{G}[u,v]$, is defined as the net effective resistance between them in the electric network constructed from $G$ by replacing each edge with a unit resistor. The resistance diameter of $G$, denoted by $D_{r}(G)$, is defined as the maximum resistance distance among all pairs of vertices of $G$. Let $P_n=a_1a_2\ldots a_n$ be the $n$-vertex path graph and $C_{4}=b_{1}b_2b_3b_4b_{1}$ be the 4-cycle. Then the $n$-th block tower graph $G_n$ is defined as the the Cartesian product of $P_n$ and $C_4$, that is, $G_n=P_{n}\square C_4$. Clearly, the vertex set of $G_n$ is $\{(a_i,b_j)|i=1,\ldots,n;j=1,\ldots,4\}$. In [Discrete Appl. Math. 320 (2022) 387--407], Evans and Francis proposed the following conjecture on resistance distances of $G_n$ and $G_{n+1}$: \begin{equation*} \lim_{n \rightarrow \infty}\left(R_{G_{n+1}}[(a_{1},b_1),(a_{n+1},b_3)]-R_{G_{n}}[(a_{1},b_1),(a_{n},b_3)]\right)=\frac{1}{4}. \end{equation*} In this paper, combining algebraic methods and electrical network approaches, we confirm and further generalize this conjecture. In addition, we determine all the resistance diametrical pairs in $G_n$, which enables us to give an equivalent explanation of the conjecture. △ Less

Submitted 19 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

Comments: 19 pages,8 figures

arXiv:2406.02813 [pdf, ps, other]

$L^p$-norms for the homogeneous non-cutoff Boltzmann equation with soft potentials

Authors: Matt Spragge, Weiran Sun

Abstract: We establish a priori estimates showing the propagation and generation of $L^p$-norms for solutions to the non-cutoff spatially homogeneous Boltzmann equation with soft potentials. The singularity of the collision kernel is key to generate regularization and inhomogeneity in the energy estimates of the $L^p$-norms. Our result extends \cite{Alo19} from the hard potential cases to the soft ones. We establish a priori estimates showing the propagation and generation of $L^p$-norms for solutions to the non-cutoff spatially homogeneous Boltzmann equation with soft potentials. The singularity of the collision kernel is key to generate regularization and inhomogeneity in the energy estimates of the $L^p$-norms. Our result extends \cite{Alo19} from the hard potential cases to the soft ones. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2404.09448 [pdf, ps, other]

On maximum residual block Kaczmarz method for solving large consistent linear systems

Authors: Wen-Ning Sun, Mei Qin

Abstract: For solving large consistent linear systems by iteration methods, inspired by the maximum residual Kaczmarz method and the randomized block Kaczmarz method, we propose the maximum residual block Kaczmarz method, which is designed to preferentially eliminate the largest block in the residual vector $r_{k}$ at each iteration. At the same time, in order to further improve the convergence rate, we con… ▽ More For solving large consistent linear systems by iteration methods, inspired by the maximum residual Kaczmarz method and the randomized block Kaczmarz method, we propose the maximum residual block Kaczmarz method, which is designed to preferentially eliminate the largest block in the residual vector $r_{k}$ at each iteration. At the same time, in order to further improve the convergence rate, we construct the maximum residual average block Kaczmarz method to avoid the calculation of pseudo-inverse in block iteration, which completes the iteration by projecting the iteration vector $x_{k}$ to each row of the constrained subset of $A$ and applying different extrapolation step sizes to average them. We prove the convergence of these two methods and give the upper bounds on their convergence rates, respectively. Numerical experiments validate our theory and show that our proposed methods are superior to some other block Kaczmarz methods. △ Less

Submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.09287 [pdf, ps, other]

On the Condensation and fluctuations in reversible coagulation-fragmentation models

Authors: Wen Sun

Abstract: We study the condensation phenomenon for the invariant measures of the mean-field model of reversible coagulation-fragmentation processes conditioned to a supercritical density of particles. It is shown that when the parameters of the associated balance equation satisfy a subexponential tail condition, there is one single giant particle that corresponds to the missing mass in the macroscopic limit… ▽ More We study the condensation phenomenon for the invariant measures of the mean-field model of reversible coagulation-fragmentation processes conditioned to a supercritical density of particles. It is shown that when the parameters of the associated balance equation satisfy a subexponential tail condition, there is one single giant particle that corresponds to the missing mass in the macroscopic limit. We also show that in this case, the rest of the particles are asymptotically \emph{i.i.d.} according to the normalized equilibrium state of the limit hydrodynamic differential equation. Conditions for the normal fluctuations and the $α$-stable fluctuations around the condensed mass are given. We obtain the large deviation principle for the empirical measure of the masses of the particles at equilibrium as well. △ Less

Submitted 14 April, 2024; originally announced April 2024.

arXiv:2404.07876 [pdf, other]

Joint transitivity for linear iterates

Authors: Sebastián Donoso, Andreas Koutsogiannis, Wenbo Sun

Abstract: We establish sufficient and necessary conditions for the joint transitivity of linear iterates in a minimal topological dynamical system with commuting transformations. This result provides the first topological analogue of the classical Berend and Bergelson joint ergodicity criterion in measure-preserving systems. We establish sufficient and necessary conditions for the joint transitivity of linear iterates in a minimal topological dynamical system with commuting transformations. This result provides the first topological analogue of the classical Berend and Bergelson joint ergodicity criterion in measure-preserving systems. △ Less

Submitted 11 April, 2024; originally announced April 2024.

Comments: Comments welcome!

MSC Class: Primary: 37B05; Secondary: 37B02; 37B20

arXiv:2404.06642 [pdf, ps, other]

Modulus representation of the Riemann $ξ$ function

Authors: Wei Sun

Abstract: We use the Jacobi theta function to give a representation of the modulus of the Riemann $ξ$ function. We use the Jacobi theta function to give a representation of the modulus of the Riemann $ξ$ function. △ Less

Submitted 9 April, 2024; originally announced April 2024.

MSC Class: 11M26

arXiv:2403.02290 [pdf, other]

Koopman-Assisted Reinforcement Learning

Authors: Preston Rozwood, Edward Mehrez, Ludger Paehler, Wen Sun, Steven L. Brunton

Abstract: The Bellman equation and its continuous form, the Hamilton-Jacobi-Bellman (HJB) equation, are ubiquitous in reinforcement learning (RL) and control theory. However, these equations quickly become intractable for systems with high-dimensional states and nonlinearity. This paper explores the connection between the data-driven Koopman operator and Markov Decision Processes (MDPs), resulting in the de… ▽ More The Bellman equation and its continuous form, the Hamilton-Jacobi-Bellman (HJB) equation, are ubiquitous in reinforcement learning (RL) and control theory. However, these equations quickly become intractable for systems with high-dimensional states and nonlinearity. This paper explores the connection between the data-driven Koopman operator and Markov Decision Processes (MDPs), resulting in the development of two new RL algorithms to address these limitations. We leverage Koopman operator techniques to lift a nonlinear system into new coordinates where the dynamics become approximately linear, and where HJB-based methods are more tractable. In particular, the Koopman operator is able to capture the expectation of the time evolution of the value function of a given system via linear dynamics in the lifted coordinates. By parameterizing the Koopman operator with the control actions, we construct a ``Koopman tensor'' that facilitates the estimation of the optimal value function. Then, a transformation of Bellman's framework in terms of the Koopman tensor enables us to reformulate two max-entropy RL algorithms: soft value iteration and soft actor-critic (SAC). This highly flexible framework can be used for deterministic or stochastic systems as well as for discrete or continuous-time dynamics. Finally, we show that these Koopman Assisted Reinforcement Learning (KARL) algorithms attain state-of-the-art (SOTA) performance with respect to traditional neural network-based SAC and linear quadratic regulator (LQR) baselines on four controlled dynamical systems: a linear state-space system, the Lorenz system, fluid flow past a cylinder, and a double-well potential with non-isotropic stochastic forcing. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: 35 pages, 12 figures

arXiv:2403.01820 [pdf, other]

Macroscopic auxiliary asymptotic preserving neural networks for the linear radiative transfer equations

Authors: Hongyan Li, Song Jiang, Wenjun Sun, Liwei Xu, Guanyu Zhou

Abstract: We develop a Macroscopic Auxiliary Asymptotic-Preserving Neural Network (MA-APNN) method to solve the time-dependent linear radiative transfer equations (LRTEs), which have a multi-scale nature and high dimensionality. To achieve this, we utilize the Physics-Informed Neural Networks (PINNs) framework and design a new adaptive exponentially weighted Asymptotic-Preserving (AP) loss function, which i… ▽ More We develop a Macroscopic Auxiliary Asymptotic-Preserving Neural Network (MA-APNN) method to solve the time-dependent linear radiative transfer equations (LRTEs), which have a multi-scale nature and high dimensionality. To achieve this, we utilize the Physics-Informed Neural Networks (PINNs) framework and design a new adaptive exponentially weighted Asymptotic-Preserving (AP) loss function, which incorporates the macroscopic auxiliary equation that is derived from the original transfer equation directly and explicitly contains the information of the diffusion limit equation. Thus, as the scale parameter tends to zero, the loss function gradually transitions from the transport state to the diffusion limit state. In addition, the initial data, boundary conditions, and conservation laws serve as the regularization terms for the loss. We present several numerical examples to demonstrate the effectiveness of MA-APNNs. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: 24 pages, 29 figures

arXiv:2401.04834 [pdf, ps, other]

Reconstruction of the Do** Profile in Vlasov-Poisson

Authors: Ru-Yu Lai, Qin Li, Weiran Sun

Abstract: We study the inverse problem of recovering the do** profile in the stationary Vlasov-Poisson equation, given the knowledge of the incoming and outgoing measurements at the boundary of the domain. This problem arises from identifying impurities in the semiconductor manufacturing. Our result states that, under suitable assumptions, the do** profile can be uniquely determined through an asymptoti… ▽ More We study the inverse problem of recovering the do** profile in the stationary Vlasov-Poisson equation, given the knowledge of the incoming and outgoing measurements at the boundary of the domain. This problem arises from identifying impurities in the semiconductor manufacturing. Our result states that, under suitable assumptions, the do** profile can be uniquely determined through an asymptotic formula of the electric field that it generates. △ Less

Submitted 9 January, 2024; originally announced January 2024.

MSC Class: 35Q83; 35Q81; 35Q49

arXiv:2401.02856 [pdf, ps, other]

Nonuniform Sobolev Spaces

Authors: Ting Chen, Loukas Grafakos, Wenchang Sun

Abstract: We study nonuniform Sobolev spaces, i.e., spaces of functions whose partial derivatives lie in possibly different Lebesgue spaces. Although standard proofs do not apply, we show that nonuniform Sobolev spaces share similar properties as the classical ones. These spaces arise naturally in the study of certain PDEs. For instance, we illustrate that nonuniform fractional Sobolev spaces are useful in… ▽ More We study nonuniform Sobolev spaces, i.e., spaces of functions whose partial derivatives lie in possibly different Lebesgue spaces. Although standard proofs do not apply, we show that nonuniform Sobolev spaces share similar properties as the classical ones. These spaces arise naturally in the study of certain PDEs. For instance, we illustrate that nonuniform fractional Sobolev spaces are useful in the study of local estimates for solutions of heat equations and the convergence of Schrödinger operators. In this work we extend recent advances on local energy estimates for solutions of heat equations and the convergence of Schrödinger operators to nonuniform fractional Sobolev spaces. △ Less

Submitted 23 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

Comments: 42 pages

arXiv:2312.12760 [pdf, ps, other]

An equivalent inequality for the Riemann hypothesis

Authors: Wei Sun

Abstract: We present a purely analytical inequality which is equivalent to the Riemann hypothesis (RH). The proof of the equivalence is based on a representation of the modulus of the Riemann $ξ$ function. As the first step to analyze the inequality, we consider polynomial approximations. We also show that the RH is equivalent to the statement that some wave functions constructed using the Brownian motion n… ▽ More We present a purely analytical inequality which is equivalent to the Riemann hypothesis (RH). The proof of the equivalence is based on a representation of the modulus of the Riemann $ξ$ function. As the first step to analyze the inequality, we consider polynomial approximations. We also show that the RH is equivalent to the statement that some wave functions constructed using the Brownian motion never evolve into perfectly distinguishable states. △ Less

Submitted 31 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

MSC Class: 11M26; 26D05; 26D15; 81Q10

arXiv:2312.06651 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields I: equidistribution for nilsequences

Authors: Wenbo Sun

Abstract: This paper is the first part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove a quantitative equidistribution theorem for polynomial sequences in a nilmanifold, where the average i… ▽ More This paper is the first part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove a quantitative equidistribution theorem for polynomial sequences in a nilmanifold, where the average is taken along spheres instead of cubes. To be more precise, let $Ω\subseteq\mathbb{F}_{p}^{d}$ be a sphere. We showed that if a polynomial sequence $(g(n)Γ)_{n\inΩ}$ which is $p$-periodic along $Ω$ is not equidistributed on a nilmanifold $G/Γ$, then there exists a nontrivial horizontal character $η$ of $G/Γ$ such that $η\circ g \mod \mathbb{Z}$ vanishes on $Ω$. This result will serve as a fundamental tool in later parts of the series to proof the spherical Gowers inverse theorem and the geometric Ramsey conjecture. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 127 pages, comments are welcome

MSC Class: 11T99; 37A99

arXiv:2312.06650 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields II: additive combinatorics for shifted ideals

Authors: Wenbo Sun

Abstract: This paper is the second part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we study additive combinatorial properties for shifted ideals, i.e. the structure of sets of the form… ▽ More This paper is the second part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we study additive combinatorial properties for shifted ideals, i.e. the structure of sets of the form $E\pm E$, where $E$ is a collection of shifted ideals of the polynomial ring $\mathbb{F}_{p}[x_{1},\dots,x_{d}]$ and we identify two ideals if their difference contains the zero polynomial. We show that under appropriate definitions, the set $E\pm E$ enjoys properties similar to the conventional setting where $E$ is a subset of an abelian group. In particular, among other results, we prove the Balog-Gowers-Szemerédi theorem, the Rusza's quasi triangle inequality and a weak form of the Plünnecke-Rusza theorem in the setting of shifted ideals. We also show that for a special class of maps $ξ$ from $\mathbb{F}_{p}^{d}$ to the collection of all shifted ideals of $\mathbb{F}_{p}[x_{1},\dots,x_{d}]$, if the set $ξ(\mathbb{F}_{p}^{d})+ξ(\mathbb{F}_{p}^{d})$ has large additive energy, then $ξ$ is an almost linear Freiman homomorphism. This result is the crucial additive combinatorial input we need to prove the spherical Gowers inverse theorem in later parts of the series. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 80 pages, comments are welcome

MSC Class: 05C99; 05D99

arXiv:2312.06649 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields IV: an application to the Geometric Ramsey Conjecture

Authors: Wenbo Sun

Abstract: This paper is the fourth and the last part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the Geometric Ramsey Conjecture in the finite field setting. In this paper, we proof a conjecture of Graham on the Remsey properties for spherical configurations in the fini… ▽ More This paper is the fourth and the last part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the Geometric Ramsey Conjecture in the finite field setting. In this paper, we proof a conjecture of Graham on the Remsey properties for spherical configurations in the finite field setting. To be more precise, we show that for any spherical configuration $X$ of $\mathbb{F}_{p}^{d}$ of complexity at most $C$ with $d$ being sufficiently large with respect to $C$ and $\vert X\vert$, and for some prime $p$ being sufficiently large with respect to $C$, $\vert X\vert$ and $ε>0$, any set $E\subseteq \mathbb{F}_{p}^{d}$ with $\vert E\vert>εp^{d}$ contains at least $\gg_{C,ε,\vert X\vert}p^{(k+1)d-(k+1)k/2}$ congruent copies of $X$, where $k$ is the dimension of $\text{span}_{\mathbb{F}_{p}}(X-X)$. The novelty of our approach is that we avoid the use of harmonic analysis, and replace it by the theory of spherical higher order Fourier analysis developed in previous parts of the series. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 61 pages, comments are welcome. arXiv admin note: text overlap with arXiv:2312.06636

MSC Class: 05D10; 37A99

arXiv:2312.06636 [pdf, ps, other]

Spherical higher order Fourier analysis over finite fields III: a spherical Gowers inverse theorem

Authors: Wenbo Sun

Abstract: This paper is the third part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove an inverse theorem over finite field for spherical Gowers norms, i.e. a local Gowers norm supported on… ▽ More This paper is the third part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting. In this paper, we prove an inverse theorem over finite field for spherical Gowers norms, i.e. a local Gowers norm supported on a sphere. We show that if the $(s+1)$-th spherical Gowers norm of a 1-bounded function $f\colon\mathbb{F}_{p}^{d}\to \mathbb{C}$ is at least $ε$ and if $d$ is sufficiently large depending only on $s$, then $f$ correlates on the sphere with a $p$-periodic $s$-step nilsequence, where the bounds for the complexity and correlation depend only on $d$ and $ε$. This result will be used in later parts of the series to prove the geometric Ramsey conjecture in the finite field setting. △ Less

Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 104 pages, comments are welcome

MSC Class: 11T99; 37A99

arXiv:2312.06120 [pdf, ps, other]

The boundary case for the supercritical deformed Hermitian-Yang-Mills equation

Authors: Wei Sun

Abstract: In this paper, we shall study the weak solution to the supercritical deformed Hermitian-Yang-Mills equation in the boundary case. In this paper, we shall study the weak solution to the supercritical deformed Hermitian-Yang-Mills equation in the boundary case. △ Less

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2310.06348 [pdf, ps, other]

A conditional compound Poisson process approach to the sparse Erdős-Rényi random graphs: moderate deviations

Authors: Wen Sun

Abstract: We construct a compound Poisson process conditioned on its random summation that represents the sizes of the connected components in the sparse Erdős-Rényi random graph $G(n,c/n)$. This new representation depicts a connection between the phase transition in the sparse random graph and the condensation transition in the zero-range model. Under this framework, we can derive moderate deviation princi… ▽ More We construct a compound Poisson process conditioned on its random summation that represents the sizes of the connected components in the sparse Erdős-Rényi random graph $G(n,c/n)$. This new representation depicts a connection between the phase transition in the sparse random graph and the condensation transition in the zero-range model. Under this framework, we can derive moderate deviation principles for the maximun component, total number of connected components and empirical measure of the sizes in the non-critical regimes. Large deviation results are discussed. △ Less

Submitted 20 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: 34 pages

arXiv:2310.04527 [pdf, ps, other]

A note on prime index of a certain subgroup in $\mathbb{F}_p^*$

Authors: Wei-Liang Sun

Abstract: Under the generalized Riemann hypothesis, we illustrate that the ratio of the set of primes $p$ such that $\langle -1, 2 \rangle$ has an odd prime index in $\mathbb{F}_p^*$ to the set of primes $p$ such that the subgroup has index greater than $2$ nears $46 \%$. Under the generalized Riemann hypothesis, we illustrate that the ratio of the set of primes $p$ such that $\langle -1, 2 \rangle$ has an odd prime index in $\mathbb{F}_p^*$ to the set of primes $p$ such that the subgroup has index greater than $2$ nears $46 \%$. △ Less

Submitted 11 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: Corrected "thousand" to "hundred" on the line 7 in the first paragraph

arXiv:2310.04220 [pdf, other]

Spatial second-order positive and asymptotic preserving filtered $P_N$ schemes for nonlinear radiative transfer equations

Authors: Xiao**g Xu, Song Jiang, Wenjun Sun

Abstract: A spatial second-order scheme for the nonlinear radiative transfer equations is introduced in this paper. The discretization scheme is based on the filtered spherical harmonics ($FP_N$) method for the angular variable and the unified gas kinetic scheme (UGKS) framework for the spatial and temporal variables respectively. In order to keep the scheme positive and second-order accuracy, firstly, we u… ▽ More A spatial second-order scheme for the nonlinear radiative transfer equations is introduced in this paper. The discretization scheme is based on the filtered spherical harmonics ($FP_N$) method for the angular variable and the unified gas kinetic scheme (UGKS) framework for the spatial and temporal variables respectively. In order to keep the scheme positive and second-order accuracy, firstly, we use the implicit Monte Carlo linearization method [6] in the construction of the UGKS numerical boundary fluxes. Then, by carefully analyzing the constructed second-order fluxes involved in the macro-micro decomposition, which is induced by the $FP_N$ angular discretization, we establish the sufficient conditions that guarantee the positivity of the radiative energy density and material temperature. Finally, we employ linear scaling limiters for the angular variable in the $P_N$ reconstruction and for the spatial variable in the piecewise linear slopes reconstruction respectively, which are shown to be realizable and reasonable to enforce the sufficient conditions holding. Thus, the desired scheme, called the $PPFP_N$-based UGKS, is obtained. Furthermore, in the regime $ε\ll 1$ and the regime $ε=O(1)$, a simplified spatial second-order scheme, called the $PPFP_N$-based SUGKS, is presented, which possesses all the properties of the non-simplified one. Inheriting the merit of UGKS, the proposed schemes are asymptotic preserving. By employing the $FP_N$ method for the angular variable, the proposed schemes are almost free of ray effects. To our best knowledge, this is the first time that spatial second-order, positive, asymptotic preserving and almost free of ray effects schemes are constructed for the nonlinear radiative transfer equations without operator splitting. Various numerical experiments are included to validate the properties of the proposed schemes. △ Less

Submitted 6 October, 2023; originally announced October 2023.

arXiv:2310.03471 [pdf, other]

On the measure concentration of infinitely divisible distributions

Authors: **g Zhang, Ze-Chun Hu, Wei Sun

Abstract: Let ${\cal I}$ be the set of all infinitely divisible random variables\ with finite second moments, ${\cal I}_0=\{X\in{\cal I}:{\rm Var}(X)>0\}$, $P_{\cal I}=\inf_{X\in{\cal I}}P\{|X-E[X]|\le \sqrt{{\rm Var}(X)}\}$ and $P_{{\cal I}_0}=\inf_{X\in{\cal I}_0} P\{|X-E[X]|< \sqrt{{\rm Var}(X)}\}$. Firstly, we prove that $P_{\cal I}\ge P_{{\cal I}_0}>0$. Secondly, we find the exact values of… ▽ More Let ${\cal I}$ be the set of all infinitely divisible random variables\ with finite second moments, ${\cal I}_0=\{X\in{\cal I}:{\rm Var}(X)>0\}$, $P_{\cal I}=\inf_{X\in{\cal I}}P\{|X-E[X]|\le \sqrt{{\rm Var}(X)}\}$ and $P_{{\cal I}_0}=\inf_{X\in{\cal I}_0} P\{|X-E[X]|< \sqrt{{\rm Var}(X)}\}$. Firstly, we prove that $P_{\cal I}\ge P_{{\cal I}_0}>0$. Secondly, we find the exact values of $\inf_{X\in{\cal J}}P\{|X-E[X]|\le \sqrt{{\rm Var}(X)}\}$ and $\inf_{X\in\cal J} P\{|X-E[X]|< \sqrt{{\rm Var}(X)}\}$ for the cases that $\cal J$ is the set of all geometric random variables, symmetric geometric random variables, Poisson random variables and symmetric Poisson random variables, respectively. As a consequence, we obtain that $P_{\cal I}\le e^{-1}\sum_{k=0}^{\infty}\frac{1}{2^{2k}(k!)^2}\approx 0.46576$ and $P_{{\cal I}_0}\le e^{-1}\approx 0.36788$. △ Less

Submitted 17 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

MSC Class: 60E07; 60E15; 62G32

arXiv:2309.12597 [pdf, other]

On Axial Symmetry in Convex Bodies

Authors: Ritesh Goenka, Kenneth Moore, Wen Rui Sun, Ethan Patrick White

Abstract: For a two-dimensional convex body, the Kovner-Besicovitch measure of symmetry is defined as the volume ratio of the largest centrally symmetric body contained inside the body to the original body. A classical result states that the Kovner-Besicovitch measure is at least $2/3$ for every convex body and equals $2/3$ for triangles. Lassak showed that an alternative measure of symmetry, i.e., symmetry… ▽ More For a two-dimensional convex body, the Kovner-Besicovitch measure of symmetry is defined as the volume ratio of the largest centrally symmetric body contained inside the body to the original body. A classical result states that the Kovner-Besicovitch measure is at least $2/3$ for every convex body and equals $2/3$ for triangles. Lassak showed that an alternative measure of symmetry, i.e., symmetry about a line (axiality) has a value of at least $2/3$ for every convex body. However, the smallest known value of the axiality of a convex body is around $0.81584$, achieved by a convex quadrilateral. We show that every plane convex body has axiality at least $\frac{2}{41}(10 + 3 \sqrt{2}) \approx 0.69476$, thereby establishing a separation with the central symmetry measure. Moreover, we find a family of convex quadrilaterals with axiality approaching $\frac{1}{3}(\sqrt{2}+1) \approx 0.80474$. We also establish improved bounds for a ``folding" measure of axial symmetry for plane convex bodies. Finally, we establish improved bounds for a generalization of axiality to high-dimensional convex bodies. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: 26 pages, 14 figures

MSC Class: 52A10; 52A38 (Primary) 52A20; 52A41 (Secondary)

arXiv:2309.10986 [pdf, other]

Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model

Authors: Shanyi Zhou, Ning Yan, Zhijun Li, Mo Geng, Xulong Zhang, Hongbiao Si, Lihua Tang, Wenyuan Sun, Longda Zhang, Yi Cao

Abstract: Based on principal-agent theory and optimal contract theory, companies use the method of increasing executives' shareholding to stimulate collaborative innovation. However, from the aspect of agency costs between management and shareholders (i.e. the first type) and between major shareholders and minority shareholders (i.e. the second type), the interests of management, shareholders and creditors… ▽ More Based on principal-agent theory and optimal contract theory, companies use the method of increasing executives' shareholding to stimulate collaborative innovation. However, from the aspect of agency costs between management and shareholders (i.e. the first type) and between major shareholders and minority shareholders (i.e. the second type), the interests of management, shareholders and creditors will be unbalanced with the change of the marginal utility of executive equity incentives.In order to establish the correlation between the proportion of shares held by executives and investments in corporate innovation, we have chosen a range of publicly listed companies within China's A-share market as the focus of our study. Employing a multi-variable linear regression model, we aim to analyze this relationship thoroughly.The following models were developed: (1) the impact model of executive shareholding on corporate innovation investment; (2) the impact model of executive shareholding on two types of agency costs; (3)The model is employed to examine the mediating influence of the two categories of agency costs. Following both correlation and regression analyses, the findings confirm a meaningful and positive correlation between executives' shareholding and the augmentation of corporate innovation investments. Additionally, the results indicate that executive shareholding contributes to the reduction of the first type of agency cost, thereby fostering corporate innovation investment. However, simultaneously, it leads to an escalation in the second type of agency cost, thus impeding corporate innovation investment. △ Less

Submitted 19 September, 2023; originally announced September 2023.

Comments: Accepted by the 7th APWeb-WAIM International Joint Conference on Web and Big Data. (APWeb 2023)

arXiv:2309.07249 [pdf, other]

Averages of completely multiplicative functions over the Gaussian integers -- a dynamical approach

Authors: Sebastián Donoso, Anh N. Le, Joel Moreira, Wenbo Sun

Abstract: We prove a pointwise convergence result for additive ergodic averages associated with certain multiplicative actions of the Gaussian integers. We derive several applications in dynamics and number theory, including: (i) Wirsing's theorem for Gaussian integers: if $f\colon \mathbb{G} \to \mathbb{R}$ is a bounded completely multiplicative function, then the following limit exists:… ▽ More We prove a pointwise convergence result for additive ergodic averages associated with certain multiplicative actions of the Gaussian integers. We derive several applications in dynamics and number theory, including: (i) Wirsing's theorem for Gaussian integers: if $f\colon \mathbb{G} \to \mathbb{R}$ is a bounded completely multiplicative function, then the following limit exists: $$\lim_{N \to \infty} \frac{1}{N^2} \sum_{1 \leq m, n \leq N} f(m + {\rm i} n).$$ (ii) An answer to a special case of a question of Frantzikinakis and Host: for any completely multiplicative real-valued function $f: \mathbb{N} \to \mathbb{R}$, the following limit exists: $$\lim_{N \to \infty} \frac{1}{N^2} \sum_{1 \leq m, n \leq N} f(m^2 + n^2).$$ (iii) A variant of a theorem of Bergelson and Richter on ergodic averages along the $Ω$ function: if $(X,T)$ is a uniquely ergodic system with unique invariant measure $μ$, then for any $x\in X$ and $f\in C(X)$, $$\lim_{N\to\infty}\frac{1}{N^2}\sum_{1 \leq m, n \leq N} f(T^{Ω(m^2 + n^2)}x)=\int_Xf \ dμ.$$ △ Less

Submitted 6 March, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

Comments: 32 pages. Suggestions and comments of the referee have been incorporated

MSC Class: Primary: 37A44 and 11N99

arXiv:2307.04998 [pdf, other]

Selective Sampling and Imitation Learning via Online Regression

Authors: Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

Abstract: We consider the problem of Imitation Learning (IL) by actively querying noisy expert for feedback. While imitation learning has been empirically successful, much of prior work assumes access to noiseless expert feedback which is not practical in many applications. In fact, when one only has access to noisy expert feedback, algorithms that rely on purely offline data (non-interactive IL) can be sho… ▽ More We consider the problem of Imitation Learning (IL) by actively querying noisy expert for feedback. While imitation learning has been empirically successful, much of prior work assumes access to noiseless expert feedback which is not practical in many applications. In fact, when one only has access to noisy expert feedback, algorithms that rely on purely offline data (non-interactive IL) can be shown to need a prohibitively large number of samples to be successful. In contrast, in this work, we provide an interactive algorithm for IL that uses selective sampling to actively query the noisy expert for feedback. Our contributions are twofold: First, we provide a new selective sampling algorithm that works with general function classes and multiple actions, and obtains the best-known bounds for the regret and the number of queries. Next, we extend this analysis to the problem of IL with noisy expert feedback and provide a new IL algorithm that makes limited queries. Our algorithm for selective sampling leverages function approximation, and relies on an online regression oracle w.r.t.~the given model class to predict actions, and to decide whether to query the expert for its label. On the theoretical side, the regret bound of our algorithm is upper bounded by the regret of the online regression oracle, while the query complexity additionally depends on the eluder dimension of the model class. We complement this with a lower bound that demonstrates that our results are tight. We extend our selective sampling algorithm for IL with general function approximation and provide bounds on both the regret and the number of queries made to the noisy expert. A key novelty here is that our regret and query complexity bounds only depend on the number of times the optimal policy (and not the noisy expert, or the learner) go to states that have a small margin. △ Less

Submitted 10 July, 2023; originally announced July 2023.

arXiv:2306.17495 [pdf, ps, other]

Stability of a one-dimensional full viscous quantum hydrodynamic system

Authors: Xiaoying Han, Yuming Qin, Wenlong Sun

Abstract: A full viscous quantum hydrodynamic system for particle density, current density, energy density and electrostatic potential coupled with a Poisson equation in one dimensional bounded intervals is studied. First, the existence and uniqueness of a steady-state solution to the quantum hydrodynamic system is established. Then, utilizing the fact that the third order perturbation term has an appropria… ▽ More A full viscous quantum hydrodynamic system for particle density, current density, energy density and electrostatic potential coupled with a Poisson equation in one dimensional bounded intervals is studied. First, the existence and uniqueness of a steady-state solution to the quantum hydrodynamic system is established. Then, utilizing the fact that the third order perturbation term has an appropriate sign, the local-in-time existence of the solution is investigated by introducing a fourth order viscous regularization and using the entropy dissipation method. In the end, the exponential stability of the steady-state solution is shown by constructing a uniform a-priori estimate. △ Less

Submitted 30 June, 2023; originally announced June 2023.

MSC Class: 76E09

arXiv:2306.10550 [pdf, ps, other]

The boundary case of the $J$-flow

Authors: Wei Sun

Abstract: In this paper, we shall study the boundary case for the $J$-flow under certain geometric assumptions. In this paper, we shall study the boundary case for the $J$-flow under certain geometric assumptions. △ Less

Submitted 18 June, 2023; originally announced June 2023.

arXiv:2306.10549 [pdf, ps, other]

Interior gradient estimates for fully nonlinear elliptic equations on Riemannian manifolds

Authors: Wei Sun

Abstract: We study a class of fully nonlinear elliptic equations on Riemannian manifolds. We derive interior gradient estimates. We study a class of fully nonlinear elliptic equations on Riemannian manifolds. We derive interior gradient estimates. △ Less

Submitted 18 June, 2023; originally announced June 2023.

arXiv:2306.09577 [pdf, ps, other]

The extended Bogomolny equations on $R^2 \times R^+$ with real symmetry breaking

Authors: Weifeng Sun

Abstract: In this paper, we construct solutions to the extended Bogomolny equations on $X = R^2 \times R^+$ with certain boundary conditions and asymptotic conditions. Let $y$ be the coordinate of $R^+$. Roughly, both the boundary condition and the asymptotic condition say that a solution approaches to a certain model solution when $y \rightarrow 0$ and $y \rightarrow \infty$ resepctively. The boundary cond… ▽ More In this paper, we construct solutions to the extended Bogomolny equations on $X = R^2 \times R^+$ with certain boundary conditions and asymptotic conditions. Let $y$ be the coordinate of $R^+$. Roughly, both the boundary condition and the asymptotic condition say that a solution approaches to a certain model solution when $y \rightarrow 0$ and $y \rightarrow \infty$ resepctively. The boundary condition ($y \rightarrow 0$) is called generalized Nahm pole boundary condition and the asymptotic condition ($y \rightarrow \infty$) is called real symmetry breaking condition. The solutions should be thought as an analog of the instanton solutions that Taubes and Dimakis have created (using different methods), while their solutions satisfy a different asymptotic condition. △ Less

Submitted 15 June, 2023; originally announced June 2023.

arXiv:2306.05248 [pdf, other]

Optimal $L^2$ error analysis of a loosely coupled finite element scheme for thin-structure interactions

Authors: Buyang Li, Weiwei Sun, Yupei Xie, Wenshan Yu

Abstract: Finite element methods and kinematically coupled schemes that decouple the fluid velocity and structure displacement have been extensively studied for incompressible fluid-structure interaction (FSI) over the past decade. While these methods are known to be stable and easy to implement, optimal error analysis has remained challenging. Previous work has primarily relied on the classical elliptic pr… ▽ More Finite element methods and kinematically coupled schemes that decouple the fluid velocity and structure displacement have been extensively studied for incompressible fluid-structure interaction (FSI) over the past decade. While these methods are known to be stable and easy to implement, optimal error analysis has remained challenging. Previous work has primarily relied on the classical elliptic projection technique, which is only suitable for parabolic problems and does not lead to optimal convergence of numerical solutions to the FSI problems in the standard $L^2$ norm. In this article, we propose a new stable fully discrete kinematically coupled scheme for incompressible FSI thin-structure model and establish a new approach for the numerical analysis of FSI problems in terms of a newly introduced coupled non-stationary Ritz projection, which allows us to prove the optimal-order convergence of the proposed method in the $L^2$ norm. The methodology presented in this article is also applicable to numerous other FSI models and serves as a fundamental tool for advancing research in this field. △ Less

Submitted 12 December, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

arXiv:2305.18505 [pdf, ps, other]

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Authors: Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee

Abstract: Preference-based Reinforcement Learning (PbRL) is a paradigm in which an RL agent learns to optimize a task using pair-wise preference-based feedback over trajectories, rather than explicit reward signals. While PbRL has demonstrated practical success in fine-tuning language models, existing theoretical work focuses on regret minimization and fails to capture most of the practical frameworks. In t… ▽ More Preference-based Reinforcement Learning (PbRL) is a paradigm in which an RL agent learns to optimize a task using pair-wise preference-based feedback over trajectories, rather than explicit reward signals. While PbRL has demonstrated practical success in fine-tuning language models, existing theoretical work focuses on regret minimization and fails to capture most of the practical frameworks. In this study, we fill in such a gap between theoretical PbRL and practical algorithms by proposing a theoretical reward-agnostic PbRL framework where exploratory trajectories that enable accurate learning of hidden reward functions are acquired before collecting any human feedback. Theoretical analysis demonstrates that our algorithm requires less human feedback for learning the optimal policy under preference-based models with linear parameterization and unknown transitions, compared to the existing theoretical literature. Specifically, our framework can incorporate linear and low-rank MDPs with efficient sample complexity. Additionally, we investigate reward-agnostic RL with action-based comparison feedback and introduce an efficient querying algorithm tailored to this scenario. △ Less

Submitted 17 April, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

Comments: ICLR 2024 Spotlight

arXiv:2305.17964 [pdf, ps, other]

Viscosity solutions to uniformly elliptic complex equations

Authors: Wei Sun

Abstract: In this paper, we shall extend the definition of $\mathcal{C}$-subsolution condition and adapt the argument of Guo-Phong-Tong[18] to replace Alexandroff-Bakelman-Pucci estimate in complex cases. As an application, we shall define and study the viscosity solutions to uniformly elliptic complex equations and prove the Hölder regularity, following the argument for real equations. Our results show tha… ▽ More In this paper, we shall extend the definition of $\mathcal{C}$-subsolution condition and adapt the argument of Guo-Phong-Tong[18] to replace Alexandroff-Bakelman-Pucci estimate in complex cases. As an application, we shall define and study the viscosity solutions to uniformly elliptic complex equations and prove the Hölder regularity, following the argument for real equations. Our results show that the new method can improve the dependence in regularity and a priori estimates for complex elliptic equations. △ Less

Submitted 29 May, 2023; originally announced May 2023.

arXiv:2305.15703 [pdf, ps, other]

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

Authors: Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun

Abstract: While distributional reinforcement learning (DistRL) has been empirically effective, the question of when and why it is better than vanilla, non-distributional RL has remained unanswered. This paper explains the benefits of DistRL through the lens of small-loss bounds, which are instance-dependent bounds that scale with optimal achievable cost. Particularly, our bounds converge much faster than th… ▽ More While distributional reinforcement learning (DistRL) has been empirically effective, the question of when and why it is better than vanilla, non-distributional RL has remained unanswered. This paper explains the benefits of DistRL through the lens of small-loss bounds, which are instance-dependent bounds that scale with optimal achievable cost. Particularly, our bounds converge much faster than those from non-distributional approaches if the optimal cost is small. As warmup, we propose a distributional contextual bandit (DistCB) algorithm, which we show enjoys small-loss regret bounds and empirically outperforms the state-of-the-art on three real-world tasks. In online RL, we propose a DistRL algorithm that constructs confidence sets using maximum likelihood estimation. We prove that our algorithm enjoys novel small-loss PAC bounds in low-rank MDPs. As part of our analysis, we introduce the $\ell_1$ distributional eluder dimension which may be of independent interest. Then, in offline RL, we show that pessimistic DistRL enjoys small-loss PAC bounds that are novel to the offline setting and are more robust to bad single-policy coverage. △ Less

Submitted 22 September, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

Comments: Accepted at NeurIPS 2023

arXiv:2305.14816 [pdf, ps, other]

Provable Offline Preference-Based Reinforcement Learning

Authors: Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun

Abstract: In this paper, we investigate the problem of offline Preference-based Reinforcement Learning (PbRL) with human feedback where feedback is available in the form of preference between trajectory pairs rather than explicit rewards. Our proposed algorithm consists of two main steps: (1) estimate the implicit reward using Maximum Likelihood Estimation (MLE) with general function approximation from offl… ▽ More In this paper, we investigate the problem of offline Preference-based Reinforcement Learning (PbRL) with human feedback where feedback is available in the form of preference between trajectory pairs rather than explicit rewards. Our proposed algorithm consists of two main steps: (1) estimate the implicit reward using Maximum Likelihood Estimation (MLE) with general function approximation from offline data and (2) solve a distributionally robust planning problem over a confidence set around the MLE. We consider the general reward setting where the reward can be defined over the whole trajectory and provide a novel guarantee that allows us to learn any target policy with a polynomial number of samples, as long as the target policy is covered by the offline data. This guarantee is the first of its kind with general function approximation. To measure the coverage of the target policy, we introduce a new single-policy concentrability coefficient, which can be upper bounded by the per-trajectory concentrability coefficient. We also establish lower bounds that highlight the necessity of such concentrability and the difference from standard RL, where state-action-wise rewards are directly observed. We further extend and analyze our algorithm when the feedback is given over action pairs. △ Less

Submitted 29 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: The first two authors contribute equally

arXiv:2305.13615 [pdf, ps, other]

Variation comparison between the $F$-distribution and the normal distribution

Authors: ** Sun, Ze-Chun Hu, Wei Sun

Abstract: Let $X_{d_1,d_2}$ be an $F$-random variable with numerator and denominator degrees of freedom $d_1$ and $d_2$, respectively. We investigate the inequality: $P\{|X_{d_1,d_2}-E[X_{d_1,d_2}]|\le \sqrt{{\rm Var}(X_{d_1,d_2})}\}\ge P\{|W-E[W]|\le \sqrt{{\rm Var}(W)}\}$, where $W$ is a standard normal random variable or a $χ^2(d_1)$ random variable. We prove that this inequality holds for… ▽ More Let $X_{d_1,d_2}$ be an $F$-random variable with numerator and denominator degrees of freedom $d_1$ and $d_2$, respectively. We investigate the inequality: $P\{|X_{d_1,d_2}-E[X_{d_1,d_2}]|\le \sqrt{{\rm Var}(X_{d_1,d_2})}\}\ge P\{|W-E[W]|\le \sqrt{{\rm Var}(W)}\}$, where $W$ is a standard normal random variable or a $χ^2(d_1)$ random variable. We prove that this inequality holds for $d_1\in\{1,2,3,4\}$ and $5\le d_2\in\mathbb{N}$. △ Less

Submitted 22 May, 2023; originally announced May 2023.

MSC Class: 60E15; 62G32; 90C15

arXiv:2305.06132 [pdf, ps, other]

The weak solutions to complex Hessian equations

Authors: Wei Sun

Abstract: In this paper, we shall study existence of weak solutions to complex Hessian equations. With appropriate assumptions, it is possible to obtain weak solutions in pluripotential sense. In this paper, we shall study existence of weak solutions to complex Hessian equations. With appropriate assumptions, it is possible to obtain weak solutions in pluripotential sense. △ Less

Submitted 10 May, 2023; originally announced May 2023.

arXiv:2305.02576 [pdf, ps, other]

The boundary case for complex Monge-Ampère type equations

Authors: Wei Sun

Abstract: In this paper, we shall study the boundary case for complex Monge-Ampère type equations under certain geometric assumptions. In this paper, we shall study the boundary case for complex Monge-Ampère type equations under certain geometric assumptions. △ Less

Submitted 4 May, 2023; originally announced May 2023.

arXiv:2305.00494 [pdf, ps, other]

Interior gradient estimates for prescribed curvature equations in hyperbolic space

Authors: Zhenan Sui, Wei Sun

Abstract: In this paper, we study the interior gradient estimates for admissible solutions to prescribed curvature equations in hyperbolic space. In this paper, we study the interior gradient estimates for admissible solutions to prescribed curvature equations in hyperbolic space. △ Less

Submitted 30 April, 2023; originally announced May 2023.

arXiv:2304.11459 [pdf, ps, other]

Variation comparison between infinitely divisible distributions and the normal distribution

Authors: ** Sun, Ze-Chun Hu, Wei Sun

Abstract: Let $X$ be a random variable with finite second moment. We investigate the inequality: $P\{|X-E[X]|\le \sqrt{{\rm Var}(X)}\}\ge P\{|Z|\le 1\}$, where $Z$ is a standard normal random variable. We prove that this inequality holds for many familiar infinitely divisible continuous distributions including the Laplace, Gumbel, Logistic, Pareto, infinitely divisible Weibull, log-normal, student's $t$ and… ▽ More Let $X$ be a random variable with finite second moment. We investigate the inequality: $P\{|X-E[X]|\le \sqrt{{\rm Var}(X)}\}\ge P\{|Z|\le 1\}$, where $Z$ is a standard normal random variable. We prove that this inequality holds for many familiar infinitely divisible continuous distributions including the Laplace, Gumbel, Logistic, Pareto, infinitely divisible Weibull, log-normal, student's $t$ and inverse Gaussian distributions. Numerical results are given to show that the inequality with continuity correction also holds for some infinitely divisible discrete distributions. △ Less

Submitted 10 May, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

MSC Class: 60E15; 62G32; 90C15

arXiv:2303.17487 [pdf, ps, other]

The extreme values of two probability functions for the Gamma distribution

Authors: ** Sun, Ze-Chun Hu, Wei Sun

Abstract: Motivated by Chvátal's conjecture and Tomaszewaki's conjecture, we investigate the extreme value problem of two probability functions for the Gamma distribution. Let $α,β$ be arbitrary positive real numbers and $X_{α,β}$ be a Gamma random variable with shape parameter $α$ and scale parameter $β$. We study the extreme values of functions $P\{X_{α,β}\le E[X_{α,β}]\}$ and… ▽ More Motivated by Chvátal's conjecture and Tomaszewaki's conjecture, we investigate the extreme value problem of two probability functions for the Gamma distribution. Let $α,β$ be arbitrary positive real numbers and $X_{α,β}$ be a Gamma random variable with shape parameter $α$ and scale parameter $β$. We study the extreme values of functions $P\{X_{α,β}\le E[X_{α,β}]\}$ and $P\{|X_{α,β}-E[X_{α,β}]|\le \sqrt{{\rm Var}(X_{α,β})}\}$. Among other things, we show that $ \inf_{α,β}P\{X_{α,β}\le E[X_{α,β}]\}=\frac{1}{2}$ and $\inf_{α,β}P\{|X_{α,β}-E[X_{α,β}]|\le \sqrt{{\rm Var}(X_{α,β})}\}=P\{|Z|\le 1\}\approx 0.6826$, where $Z$ is a standard normal random variable. △ Less

Submitted 30 March, 2023; originally announced March 2023.

MSC Class: 60E15; 62G32; 90C15

arXiv:2303.13345 [pdf, ps, other]

A new subspace minimization conjugate gradient method for unconstrained minimization

Authors: Zexian Liu, Yan Ni, Hongwei Liu, Wumei Sun

Abstract: Subspace minimization conjugate gradient (SMCG) methods have become a class of quite efficient iterative methods for unconstrained optimization and have attracted extensive attention recently. Usually, the search directions of SMCG methods are generated by minimizing approximate models with the approximation matrix $ B_k $ of the objective function at the current iterate over the subspace spanned… ▽ More Subspace minimization conjugate gradient (SMCG) methods have become a class of quite efficient iterative methods for unconstrained optimization and have attracted extensive attention recently. Usually, the search directions of SMCG methods are generated by minimizing approximate models with the approximation matrix $ B_k $ of the objective function at the current iterate over the subspace spanned by the current gradient $ g_k $ and the latest search direction. The $ g_k^TB_kg_k $ must be estimated properly in the calculation of the search directions, which is crucial to the theoretical properties and the numerical performance of SMCG methods. It is a great challenge to estimate it properly. The projection technique has been used successfully to generate conjugate gradient directions such as Dai-Kou conjugate gradient direction. Motivated by the above two observations, in the paper we present a new subspace minimization conjugate gradient methods by using a projection technique based on the memoryless quasi-Newton method. More specially, we project the search direction of the memoryless quasi-Newton method into the subspace spanned by the current gradient and the latest search direction and drive a new search direction, which is proved to be descent. Remarkably, the proposed method without any line search enjoys the finite termination property for two dimensional convex quadratic functions, which is helpful for designing algorithm. An adaptive scaling factor in the search direction is given based on the above finite termination property. The proposed method does not need to determine the parameter $ ρ_k $ and can be regarded as an extension of Dai-Kou conjugate gradient method. The global convergence of the proposed method is analyzed. Numerical comparisons indicate the proposed method is very promising. △ Less

Submitted 23 March, 2023; originally announced March 2023.

arXiv:2303.07983 [pdf, other]

Parameter estimation of stochastic SIR model driven by small Lévy noise with time-dependent periodic transmission

Authors: Terry Easlick, Wei Sun

Abstract: We investigate the parameter estimation and prediction of two forms of the stochastic SIR model driven by small Lévy noise with time-dependent periodic transmission. We present consistency and rate of convergence results for the least-squares estimators. We include simulation studies using the method of projected gradient descent. We investigate the parameter estimation and prediction of two forms of the stochastic SIR model driven by small Lévy noise with time-dependent periodic transmission. We present consistency and rate of convergence results for the least-squares estimators. We include simulation studies using the method of projected gradient descent. △ Less

Submitted 23 April, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

MSC Class: 62M20; 92D30; 62F12

arXiv:2302.12278 [pdf, ps, other]

Total joint ergodicity for totally ergodic systems

Authors: Andreas Koutsogiannis, Wenbo Sun

Abstract: Examining multiple ergodic averages whose iterates are integer parts of real valued polynomials for totally ergodic systems, we provide various characterizations of total joint ergodicity, meaning that an average converges to the "expected" limit along every arithmetic progression. In particular, we obtain a complete characterization when the number of iterates is at most two, and disprove a conje… ▽ More Examining multiple ergodic averages whose iterates are integer parts of real valued polynomials for totally ergodic systems, we provide various characterizations of total joint ergodicity, meaning that an average converges to the "expected" limit along every arithmetic progression. In particular, we obtain a complete characterization when the number of iterates is at most two, and disprove a conjecture of the first author. We also improve a result of Frantzikinakis on joint ergodicity of Hardy field functions of at most polynomial growth for totally ergodic systems, which extends a conjecture of Bergelson-Moreira-Richter. Our method is to first use the methodology of Frantzikinakis, which allows one to reduce the systems to rotations on abelian groups without using deep tools from ergodic theory, then develop formulas for integrals of exponential functions over subtori, and finally, compute exponential sums for integer parts of real polynomials. △ Less

Submitted 23 February, 2023; originally announced February 2023.

Comments: 30 pages

MSC Class: Primary: 37A30; Secondary: 28D05; 11L03; 11L15

arXiv:2302.03854 [pdf, other]

Cospectral graphs obtained by edge deletion

Authors: Chris Godsil, Wanting Sun, Xiaohong Zhang

Abstract: Let $M\circ N$ denote the Schur product of two matrices $M$ and $N$. A graph $X$ with adjacency matrix $A$ is walk regular if $A^k\circ I$ is a constant times $I$ for each $k\ge0$, and $X$ is 1-walk-regular if it is walk regular and $A^k\circ A$ is a constant times $A$ for each $k\ge0$. Assume $X$ is 1-walk regular. Here we show that by deleting an edge in $X$, or deleting edges of a graph inside… ▽ More Let $M\circ N$ denote the Schur product of two matrices $M$ and $N$. A graph $X$ with adjacency matrix $A$ is walk regular if $A^k\circ I$ is a constant times $I$ for each $k\ge0$, and $X$ is 1-walk-regular if it is walk regular and $A^k\circ A$ is a constant times $A$ for each $k\ge0$. Assume $X$ is 1-walk regular. Here we show that by deleting an edge in $X$, or deleting edges of a graph inside a clique of $X$, we obtain families of graphs that are not necessarily isomorphic, but are cospectral with respect to four types of matrices: the adjacency matrix, Laplacian matrix, unsigned Laplacian matrix, and normalized Laplacian matrix. Furthermore, we show that removing edges of Laplacian cospectral graphs in cliques of a 1-walk regular graph results in Laplacian cospectral graphs; removing edges of unsigned Laplacian cospectral graphs whose complements are also cospectral with respect to the unsigned Laplacian in cliques of a 1-walk regular graph results in unsigned Laplacian cospectral graphs. △ Less

Submitted 7 February, 2023; originally announced February 2023.

MSC Class: 05C50

arXiv:2302.03201 [pdf, ps, other]

Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

Authors: Kaiwen Wang, Nathan Kallus, Wen Sun

Abstract: In this paper, we study risk-sensitive Reinforcement Learning (RL), focusing on the objective of Conditional Value at Risk (CVaR) with risk tolerance $τ$. Starting with multi-arm bandits (MABs), we show the minimax CVaR regret rate is $Ω(\sqrt{τ^{-1}AK})$, where $A$ is the number of actions and $K$ is the number of episodes, and that it is achieved by an Upper Confidence Bound algorithm with a nov… ▽ More In this paper, we study risk-sensitive Reinforcement Learning (RL), focusing on the objective of Conditional Value at Risk (CVaR) with risk tolerance $τ$. Starting with multi-arm bandits (MABs), we show the minimax CVaR regret rate is $Ω(\sqrt{τ^{-1}AK})$, where $A$ is the number of actions and $K$ is the number of episodes, and that it is achieved by an Upper Confidence Bound algorithm with a novel Bernstein bonus. For online RL in tabular Markov Decision Processes (MDPs), we show a minimax regret lower bound of $Ω(\sqrt{τ^{-1}SAK})$ (with normalized cumulative rewards), where $S$ is the number of states, and we propose a novel bonus-driven Value Iteration procedure. We show that our algorithm achieves the optimal regret of $\widetilde O(\sqrt{τ^{-1}SAK})$ under a continuity assumption and in general attains a near-optimal regret of $\widetilde O(τ^{-1}\sqrt{SAK})$, which is minimax-optimal for constant $τ$. This improves on the best available bounds. By discretizing rewards appropriately, our algorithms are computationally efficient. △ Less

Submitted 24 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

Comments: Accepted at ICML 2023

arXiv:2302.01487 [pdf, ps, other]

Conflict-Avoiding Codes of Prime Lengths and Cyclotomic Numbers

Authors: Liang-Chung Hsia, Hua-Chieh Li, Wei-Liang Sun

Abstract: The problem to construct optimal conflict-avoiding codes of even lengths and the Hamming weight $3$ is completely settled. On the contrary, it is still open for odd lengths. It turns out that the prime lengths are the fundamental cases needed to be constructed. In the article, we study conflict-avoiding codes of prime lengths and give a connection with the so-called cyclotomic numbers. By having s… ▽ More The problem to construct optimal conflict-avoiding codes of even lengths and the Hamming weight $3$ is completely settled. On the contrary, it is still open for odd lengths. It turns out that the prime lengths are the fundamental cases needed to be constructed. In the article, we study conflict-avoiding codes of prime lengths and give a connection with the so-called cyclotomic numbers. By having some nonzero cyclotomic numbers, a well-known algorithm for constructing optimal conflict-avoiding codes will work for certain prime lengths. As a consequence, we are able to answer the size of optimal conflict-avoiding code for a new class of prime lengths. △ Less

Submitted 2 February, 2023; originally announced February 2023.

arXiv:2302.00920 [pdf, ps, other]

doi 10.1016/j.ffa.2023.102298

Certain Diagonal Equations and Conflict-Avoiding Codes of Prime Lengths

Authors: Liang-Chung Hsia, Hua-Chieh Li, Wei-Liang Sun

Abstract: We study the construction of optimal conflict-avoiding codes (CAC) from a number theoretical point of view. The determination of the size of optimal CAC of prime length $p$ and weight 3 is formulated in terms of the solvability of certain twisted Fermat equations of the form $g^2 X^{\ell} + g Y^{\ell} + 1 = 0$ over the finite field $\mathbb{F}_{p}$ for some primitive root $g$ modulo $p.$ We treat… ▽ More We study the construction of optimal conflict-avoiding codes (CAC) from a number theoretical point of view. The determination of the size of optimal CAC of prime length $p$ and weight 3 is formulated in terms of the solvability of certain twisted Fermat equations of the form $g^2 X^{\ell} + g Y^{\ell} + 1 = 0$ over the finite field $\mathbb{F}_{p}$ for some primitive root $g$ modulo $p.$ We treat the problem of solving the twisted Fermat equations in a more general situation by allowing the base field to be any finite extension field $\mathbb{F}_q$ of $\mathbb{F}_{p}.$ We show that for $q$ greater than a lower bound of the order of magnitude $O(\ell^2)$ there exists a generator $g$ of $\mathbb{F}_{q}^{\times}$ such that the equation in question is solvable over $\mathbb{F}_{q}.$ Using our results we are able to contribute new results to the construction of optimal CAC of prime lengths and weight $3.$ △ Less

Submitted 2 February, 2023; originally announced February 2023.

Journal ref: Finite Fields and Their Applications, 29 (2023) 102298

arXiv:2301.06911 [pdf, ps, other]

Joint ergodicity for functions of polynomial growth

Authors: Sebastián Donoso, Andreas Koutsogiannis, Wenbo Sun

Abstract: We provide necessary and sufficient conditions for joint ergodicity results for systems of commuting measure preserving transformations for an iterated Hardy field function of polynomial growth. Our method builds on and improves recent techniques due to Frantzikinakis and Tsinas, who dealt with multiple ergodic averages along Hardy field functions; it also enhances an approach introduced by the au… ▽ More We provide necessary and sufficient conditions for joint ergodicity results for systems of commuting measure preserving transformations for an iterated Hardy field function of polynomial growth. Our method builds on and improves recent techniques due to Frantzikinakis and Tsinas, who dealt with multiple ergodic averages along Hardy field functions; it also enhances an approach introduced by the authors and Ferré Moragues to study polynomial iterates. The more general expression, in which the iterate is a linear combination of a Hardy field function of polynomial growth and a tempered function, is studied as well. △ Less

Submitted 2 March, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

Comments: 36 pages; Theorem 6.1 now covers the case where the iterate is a linear combination of a Hardy field and a tempered function of different polynomial growth rates, extending in particular Theorem 1.1

MSC Class: Primary: 37A05; Secondary: 37A30; 28A99; 60F99

arXiv:2301.02863 [pdf, ps, other]

A Regularized Limited Memory Subspace Minimization Conjugate Gradient Method for Unconstrained Optimization

Authors: Wumei Sun, Hongwei Liu, Zexian Liu

Abstract: In this paper, based on the limited memory techniques and subspace minimization conjugate gradient (SMCG) methods, a regularized limited memory subspace minimization conjugate gradient method is proposed, which contains two types of iterations. In SMCG iteration, we obtain the search direction by minimizing the approximate quadratic model or approximate regularization model. In RQN iteration, comb… ▽ More In this paper, based on the limited memory techniques and subspace minimization conjugate gradient (SMCG) methods, a regularized limited memory subspace minimization conjugate gradient method is proposed, which contains two types of iterations. In SMCG iteration, we obtain the search direction by minimizing the approximate quadratic model or approximate regularization model. In RQN iteration, combined with regularization technique and BFGS method, a modified regularized quasi-Newton method is used in the subspace to improve the orthogonality. Moreover, some simple acceleration criteria and an improved tactic for selecting the initial stepsize to enhance the efficiency of the algorithm are designed. Additionally, an generalized nonmonotone line search is utilized and the global convergence of our proposed algorithm is established under mild conditions. Finally, numerical results show that, the proposed algorithm has a significant improvement over ASMCG_PR and is superior to the particularly well-known limited memory conjugate gradient software packages CG_DESCENT (6.8) and CGOPT(2.0) for the CUTEr library. △ Less

Submitted 7 January, 2023; originally announced January 2023.

arXiv:2301.01102 [pdf]

Fourier series (based) multiscale method for computational analysis in science and engineering: VII. Fourier series multiscale solution for elastic bending of beams on Pasternak foundations

Authors: Weiming Sun, Zimao Zhang

Abstract: Fourier series multiscale method, a concise and efficient analytical approach for multiscale computation, will be developed out of this series of papers. In the seventh paper, the usual structural analysis of beams on an elastic foundation is extended to a thorough multiscale analysis for a fourth order linear differential equation for transverse deflection of the beam, where general boundary cond… ▽ More Fourier series multiscale method, a concise and efficient analytical approach for multiscale computation, will be developed out of this series of papers. In the seventh paper, the usual structural analysis of beams on an elastic foundation is extended to a thorough multiscale analysis for a fourth order linear differential equation for transverse deflection of the beam, where general boundary conditions and a wide spectrum of model parameters are prescribed. For this purpose, the solution function is expressed as a linear combination of the boundary function and the internal function, to ensure the series expression obtained uniformly convergent and termwise differentiable up to fourth order. Meanwhile, the internal function corresponds to the particular solution, and the boundary function corresponds to the general solution which satisfies the homogeneous form of the governing differential equation. Since the general solution has appropriately interpreted the meaning of the differential equation, the spatial characteristics of the solution of the equation are expected to be better captured. With the boundary function and the internal function selected specifically as combination of the linearly independent homogeneous solutions of the differential equation, and one-dimensional half-range Fourier sine series over the solution interval, the Fourier series multiscale solution of the bending problem of a beam on the Pasternak foundation is derived. And then the convergence characteristics of the Fourier series multiscale solution are investigated with numerical examples, and the multiscale characteristics of the bending problem of a beam on the Pasternak foundation are demonstrated for a wide spectrum of model parameters. △ Less

Submitted 3 January, 2023; originally announced January 2023.

Comments: 32 pages, 13 figures, 4 tables

MSC Class: 35C10; 35G15

Showing 1–50 of 256 results for author: Sun, W