-
A Bayesian Neural ODE for a Lettuce Greenhouse
Authors:
Sjoerd Boersma,
Xiaodong Cheng
Abstract:
Greenhouse production systems play a crucial role in modern agriculture, enabling year-round cultivation of crops by providing a controlled environment. However, effectively quantifying uncertainty in modeling greenhouse systems remains a challenging task. In this paper, we apply a novel approach based on sparse Bayesian deep learning for the system identification of lettuce greenhouse models. The…
▽ More
Greenhouse production systems play a crucial role in modern agriculture, enabling year-round cultivation of crops by providing a controlled environment. However, effectively quantifying uncertainty in modeling greenhouse systems remains a challenging task. In this paper, we apply a novel approach based on sparse Bayesian deep learning for the system identification of lettuce greenhouse models. The method leverages the power of deep neural networks while incorporating Bayesian inference to quantify the uncertainty in the weights of a Neural ODE. The simulation results show that the generated model can capture the intrinsic nonlinear behavior of the greenhouse system with probabilistic estimates of environmental variables and lettuce growth within the greenhouse.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Preferential Multi-Objective Bayesian Optimization
Authors:
Raul Astudillo,
Kejun Li,
Maegan Tucker,
Chu Xin Cheng,
Aaron D. Ames,
Yisong Yue
Abstract:
Preferential Bayesian optimization (PBO) is a framework for optimizing a decision-maker's latent preferences over available design choices. While preferences often involve multiple conflicting objectives, existing work in PBO assumes that preferences can be encoded by a single objective function. For example, in robotic assistive devices, technicians often attempt to maximize user comfort while si…
▽ More
Preferential Bayesian optimization (PBO) is a framework for optimizing a decision-maker's latent preferences over available design choices. While preferences often involve multiple conflicting objectives, existing work in PBO assumes that preferences can be encoded by a single objective function. For example, in robotic assistive devices, technicians often attempt to maximize user comfort while simultaneously minimizing mechanical energy consumption for longer battery life. Similarly, in autonomous driving policy design, decision-makers wish to understand the trade-offs between multiple safety and performance attributes before committing to a policy. To address this gap, we propose the first framework for PBO with multiple objectives. Within this framework, we present dueling scalarized Thompson sampling (DSTS), a multi-objective generalization of the popular dueling Thompson algorithm, which may be of interest beyond the PBO setting. We evaluate DSTS across four synthetic test functions and two simulated exoskeleton personalization and driving policy design tasks, showing that it outperforms several benchmarks. Finally, we prove that DSTS is asymptotically consistent. As a direct consequence, this result provides, to our knowledge, the first convergence guarantee for dueling Thompson sampling in the PBO setting.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
On semi-implicit schemes for the incompressible Euler equations via the vanishing viscosity limit
Authors:
Xinyu Cheng,
Zhaonan Luo,
Sheng Wang
Abstract:
A new type of systematic approach to study the incompressible Euler equations numerically via the vanishing viscosity limit is proposed in this work. We show the new strategy is unconditionally stable that the $L^2$-energy dissipates and $H^s$-norm is uniformly bounded in time without any restriction on the time step. Moreover, first-order convergence of the proposed method is established includin…
▽ More
A new type of systematic approach to study the incompressible Euler equations numerically via the vanishing viscosity limit is proposed in this work. We show the new strategy is unconditionally stable that the $L^2$-energy dissipates and $H^s$-norm is uniformly bounded in time without any restriction on the time step. Moreover, first-order convergence of the proposed method is established including both low regularity and high regularity error estimates. The proposed method is extended to full discretization with a newly developed iterative Fourier spectral scheme. Another main contributions of this work is to propose a new integration by parts technique to lower the regularity requirement from $H^4$ to $H^3$ in order to perform the $L^2$-error estimate. To our best knowledge, this is one of the very first work to study incompressible Euler equations by designing stable numerical schemes via the inviscid limit with rigorous analysis. Furthermore, we will present both low and high regularity errors from numerical experiments and demonstrate the dynamics in several benchmark examples.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Smart Navigation System for Parking Assignment at Large Events: Incorporating Heterogeneous Driver Characteristics
Authors:
Xi Cheng,
Gaofeng Su,
Siyuan Feng,
Ke Liu,
Chen Zhu,
Hui Lin,
Jilin Song,
Jianan Chen
Abstract:
Parking challenges escalate significantly during large events such as concerts or sports games, yet few studies address dynamic parking lot assignments for such occasions. This paper introduces a smart navigation system designed to optimize parking assignments swiftly during large events, utilizing a mixed search algorithm that accounts for the heterogeneous characteristics of drivers. We conducte…
▽ More
Parking challenges escalate significantly during large events such as concerts or sports games, yet few studies address dynamic parking lot assignments for such occasions. This paper introduces a smart navigation system designed to optimize parking assignments swiftly during large events, utilizing a mixed search algorithm that accounts for the heterogeneous characteristics of drivers. We conducted simulations in the Berkeley city area during the "Big Game" to validate our system and demonstrate the benefits of our innovative parking assignment approach.
△ Less
Submitted 14 May, 2024;
originally announced June 2024.
-
Bollobás-Erdős-Tuza conjecture for graphs with no induced $K_{s,t}$
Authors:
Xinbu Cheng,
Zixiang Xu
Abstract:
A widely open conjecture proposed by Bollobás, Erdős, and Tuza in the early 1990s states that for any $n$-vertex graph $G$, if the independence number $α(G) = Ω(n)$, then there is a subset $T \subseteq V(G)$ with $|T| = o(n)$ such that $T$ intersects all maximum independent sets of $G$. In this paper, we prove that this conjecture holds for graphs that do not contain an induced $K_{s,t}$ for fixed…
▽ More
A widely open conjecture proposed by Bollobás, Erdős, and Tuza in the early 1990s states that for any $n$-vertex graph $G$, if the independence number $α(G) = Ω(n)$, then there is a subset $T \subseteq V(G)$ with $|T| = o(n)$ such that $T$ intersects all maximum independent sets of $G$. In this paper, we prove that this conjecture holds for graphs that do not contain an induced $K_{s,t}$ for fixed $t \ge s$. Our proof leverages the probabilistic method at an appropriate juncture.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Riemannian Bilevel Optimization
Authors:
Sanchayan Dutta,
Xiang Cheng,
Suvrit Sra
Abstract:
We develop new algorithms for Riemannian bilevel optimization. We focus in particular on batch and stochastic gradient-based methods, with the explicit goal of avoiding second-order information such as Riemannian hyper-gradients. We propose and analyze $\mathrm{RF^2SA}$, a method that leverages first-order gradient information to navigate the complex geometry of Riemannian manifolds efficiently. N…
▽ More
We develop new algorithms for Riemannian bilevel optimization. We focus in particular on batch and stochastic gradient-based methods, with the explicit goal of avoiding second-order information such as Riemannian hyper-gradients. We propose and analyze $\mathrm{RF^2SA}$, a method that leverages first-order gradient information to navigate the complex geometry of Riemannian manifolds efficiently. Notably, $\mathrm{RF^2SA}$ is a single-loop algorithm, and thus easier to implement and use. Under various setups, including stochastic optimization, we provide explicit convergence rates for reaching $ε$-stationary points. We also address the challenge of optimizing over Riemannian manifolds with constraints by adjusting the multiplier in the Lagrangian, ensuring convergence to the desired solution without requiring access to second-order derivatives.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Remark on the scattering theory of the nonlinear Schrödinger equation on the cylinders
Authors:
Xing Cheng,
Jiqiang Zheng
Abstract:
In this article, we consider the nonlinear Schrödinger equation on the cylinder $\mathbb{R}^d\times \mathbb{T}$. In the long range case, we show there is no linear scattering state of the nonlinear Schrödinger equation on $\mathbb{R}^d \times \mathbb{T}$. In the short range case, we show the decay and scattering of solutions of the nonlinear Schrödinger equation on…
▽ More
In this article, we consider the nonlinear Schrödinger equation on the cylinder $\mathbb{R}^d\times \mathbb{T}$. In the long range case, we show there is no linear scattering state of the nonlinear Schrödinger equation on $\mathbb{R}^d \times \mathbb{T}$. In the short range case, we show the decay and scattering of solutions of the nonlinear Schrödinger equation on $\mathbb{R}^d \times \mathbb{T}$ for small data.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Flight Path Optimization with Optimal Control Method
Authors:
Gaofeng Su,
Xi Cheng,
Siyuan Feng,
Ke Liu,
Jilin Song,
Jianan Chen,
Chen Zhu,
Hui Lin
Abstract:
This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control(CFTOC). The first step is to d…
▽ More
This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control(CFTOC). The first step is to define the dynamic model of the aircraft in accordance with the controllable inputs and wind disturbances. Then we will identify a precise objective in terms of optimization and implement an optimization program to solve it under the circumstances of simulated real flight situation. Finally, the optimization result is validated and discussed by different scenarios.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Stability of Navier-Stokes equations with a free surface
Authors:
Xing Cheng,
Yunrui Zheng
Abstract:
We consider the viscous incompressible fluids in a three-dimensional horizontally periodic domain bounded below by a fixed smooth boundary and above by a free moving surface. The fluid dynamics are governed by the Navier-Stokes equations with the effect of gravity and surface tension on the free surface. We develop a global well-posedness theory by a nonlinear energy method in low regular Sobolev…
▽ More
We consider the viscous incompressible fluids in a three-dimensional horizontally periodic domain bounded below by a fixed smooth boundary and above by a free moving surface. The fluid dynamics are governed by the Navier-Stokes equations with the effect of gravity and surface tension on the free surface. We develop a global well-posedness theory by a nonlinear energy method in low regular Sobolev spaces with several techniques, including: the horizontal energy-dissipation estimates, a new tripled bootstrap argument inspired by Guo and Tice [Arch. Ration. Mech. Anal.(2018)]. Moreover, the solution decays asymptotically to the equilibrium in an exponential rate.
△ Less
Submitted 28 April, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Sublinear hitting sets for some geometric graphs
Authors:
Xinbu Cheng,
Zixiang Xu
Abstract:
For an $n$-vertex graph $G$, let $h(G)$ denote the smallest size of a subset of $V(G)$ such that it intersects every maximum independent set of $G$. A conjecture posed by Bollobás, Erdős and Tuza in early 90s remains widely open, asserting that for any $n$-vertex graph $G$, if the independence number $α(G) =Ω(n) $, then $h(G) = o(n)$. In this paper, we establish the validity of this conjecture for…
▽ More
For an $n$-vertex graph $G$, let $h(G)$ denote the smallest size of a subset of $V(G)$ such that it intersects every maximum independent set of $G$. A conjecture posed by Bollobás, Erdős and Tuza in early 90s remains widely open, asserting that for any $n$-vertex graph $G$, if the independence number $α(G) =Ω(n) $, then $h(G) = o(n)$. In this paper, we establish the validity of this conjecture for various classes of graphs, including disk graphs, even-hole-free graphs, circle graphs, and those hereditary graphs having sublinear balanced separators. We also determine the exact values of smallest possible hitting sets in comparability graphs, incomparability graphs and the graphs with VC-dimension one.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Threshold solutions of the energy-critical complex Ginzburg-Landau equation
Authors:
Xing Cheng,
Yunrui Zheng
Abstract:
In this article, we consider energy-critical complex Ginzburg-Landau equation in three and four dimensions. We give the dynamics when the energy of the initial data is equal to the energy of the stationary solution.
In this article, we consider energy-critical complex Ginzburg-Landau equation in three and four dimensions. We give the dynamics when the energy of the initial data is equal to the energy of the stationary solution.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Efficient Sampling on Riemannian Manifolds via Langevin MCMC
Authors:
Xiang Cheng,
**gzhao Zhang,
Suvrit Sra
Abstract:
We study the task of efficiently sampling from a Gibbs distribution $d π^* = e^{-h} d {vol}_g$ over a Riemannian manifold $M$ via (geometric) Langevin MCMC; this algorithm involves computing exponential maps in random Gaussian directions and is efficiently implementable in practice. The key to our analysis of Langevin MCMC is a bound on the discretization error of the geometric Euler-Murayama sche…
▽ More
We study the task of efficiently sampling from a Gibbs distribution $d π^* = e^{-h} d {vol}_g$ over a Riemannian manifold $M$ via (geometric) Langevin MCMC; this algorithm involves computing exponential maps in random Gaussian directions and is efficiently implementable in practice. The key to our analysis of Langevin MCMC is a bound on the discretization error of the geometric Euler-Murayama scheme, assuming $\nabla h$ is Lipschitz and $M$ has bounded sectional curvature. Our error bound matches the error of Euclidean Euler-Murayama in terms of its stepsize dependence. Combined with a contraction guarantee for the geometric Langevin Diffusion under Kendall-Cranston coupling, we prove that the Langevin MCMC iterates lie within $ε$-Wasserstein distance of $π^*$ after $\tilde{O}(ε^{-2})$ steps, which matches the iteration complexity for Euclidean Langevin MCMC. Our results apply in general settings where $h$ can be nonconvex and $M$ can have negative Ricci curvature. Under additional assumptions that the Riemannian curvature tensor has bounded derivatives, and that $π^*$ satisfies a $CD(\cdot,\infty)$ condition, we analyze the stochastic gradient version of Langevin MCMC, and bound its iteration complexity by $\tilde{O}(ε^{-2})$ as well.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Integral formulas for slice Cauchy-Riemann operator and applications
Authors:
Chao Ding,
Xiaoqian Cheng
Abstract:
The theory of slice regular functions has been developed rapidly in the past few years, and most properties are given in slices at the early stage. In 2013, Colombo et al. introduced a non-constant coefficients differential operator to describe slice regular functions globally, and this brought the study of slice regular functions in a global sense. In this article, we introduce a slice Cauchy-Rie…
▽ More
The theory of slice regular functions has been developed rapidly in the past few years, and most properties are given in slices at the early stage. In 2013, Colombo et al. introduced a non-constant coefficients differential operator to describe slice regular functions globally, and this brought the study of slice regular functions in a global sense. In this article, we introduce a slice Cauchy-Riemann operator, which is motivated by the non-constant coefficients differential operator mentioned above. Then, A Borel-Pompeiu formula for this slice Cauchy-Riemann operator is discovered, which leads to a Cauchy integral formula for slice regular functions. A Plemelj integral formula for the slice Cauchy-Riemann operator is introduced, which gives rise to results on slice regular extension at the end.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
The multiplicative ergodic theorem for McKean-Vlasov SDEs
Authors:
Xian** Cheng,
Zhenxin Liu,
Lixin Zhang
Abstract:
In this paper, we establish the multiplicative ergodic theorem for McKean-Vlasov stochastic differential equations, in which the Lyapunov exponent is defined using the upper limit. The reasonability of this definition is illustrated through an example; i.e., even when the coefficients are regular enough and their first-order derivatives are bounded, the upper limit cannot be replaced by a limit, a…
▽ More
In this paper, we establish the multiplicative ergodic theorem for McKean-Vlasov stochastic differential equations, in which the Lyapunov exponent is defined using the upper limit. The reasonability of this definition is illustrated through an example; i.e., even when the coefficients are regular enough and their first-order derivatives are bounded, the upper limit cannot be replaced by a limit, as the limit may not exist. Furthermore, the example reveals how the dependence on distribution significantly influences the dynamics of the system and evidently distinguishes McKean-Vlasov stochastic differential equations from classical stochastic differential equations.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Unconditionally stable exponential integrator schemes for the 2D Cahn-Hilliard equation
Authors:
Xinyu Cheng
Abstract:
Phase field models are gradient flows with their energy naturally dissipating in time. In order to preserve this property, many numerical schemes have been well-studied. In this paper we consider a well-known method, namely the exponential integrator method (EI). In the literature a few works studied several EI schemes for various phase field models and proved the energy dissipation by either requ…
▽ More
Phase field models are gradient flows with their energy naturally dissipating in time. In order to preserve this property, many numerical schemes have been well-studied. In this paper we consider a well-known method, namely the exponential integrator method (EI). In the literature a few works studied several EI schemes for various phase field models and proved the energy dissipation by either requiring a strong Lipschitz condition on the nonlinear source term or certain $L^\infty$ bounds on the numerical solutions (maximum principle). However for phase field models such as the (non-local) Cahn-Hilliard equation, the maximum principle no longer exists. As a result, solving such models via EI schemes remains open for a long time. In this paper we aim to give a systematic approach on applying EI-type schemes to such models by solving the Cahn-Hilliard equation with a first order EI scheme and showing the energy dissipation. In fact second order EI schemes can be handled similarly and we leave the discussion in a subsequent paper. To our best knowledge, this is the first work to handle phase field models without assuming any strong Lipschitz condition or $L^\infty$ boundedness. Furthermore, we will analyze the $L^2$ error and present some numerical simulations to demonstrate the dynamics.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Harnack inequality and the relevant theorems on Finsler metric measure manifolds
Authors:
Xinyue Cheng,
Yalu Feng
Abstract:
In this paper, we carry out in-depth research centering around the Harnack inequality for positive solutions to nonlinear heat equation on Finsler metric measure manifolds with weighted Ricci curvature ${\rm Ric}_{\infty}$ bounded below. Aim on this topic, we first give a volume comparison theorem of Bishop-Gromov type. Then we prove a weighted Poincaré inequality by using Whitney-type coverings t…
▽ More
In this paper, we carry out in-depth research centering around the Harnack inequality for positive solutions to nonlinear heat equation on Finsler metric measure manifolds with weighted Ricci curvature ${\rm Ric}_{\infty}$ bounded below. Aim on this topic, we first give a volume comparison theorem of Bishop-Gromov type. Then we prove a weighted Poincaré inequality by using Whitney-type coverings technique and give a local uniform Sobolev inequality. Further, we obtain two mean value inequalities for positive subsolutions and supersolutions of a class of parabolic differential equations. From the mean value inequality, we also derive a new local gradient estimate for positive solutions to heat equation. Finally, as the application of the mean value inequalities and weighted Poincaré inequality, we get the desired Harnack inequality for positive solutions to heat equation.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Global well-posedness of a two dimensional wave-Klein-Gordon system with small non-compactly supported data
Authors:
Xinyu Cheng
Abstract:
In this paper we are interested in the coupled wave and Klein-Gordon equations in $\mathbb{R}^+\times\mathbb{R}^2$. We want to establish the global well-posedness of such system by showing the uniform boundedness of the energy for the global solution without any compactness assumptions on the initial data. In addition, we also demonstrate the pointwise asymptotic behavior of the solution pair. In…
▽ More
In this paper we are interested in the coupled wave and Klein-Gordon equations in $\mathbb{R}^+\times\mathbb{R}^2$. We want to establish the global well-posedness of such system by showing the uniform boundedness of the energy for the global solution without any compactness assumptions on the initial data. In addition, we also demonstrate the pointwise asymptotic behavior of the solution pair. In order to achieve that we apply a modified Alinhac's ghost weight method together with a newly developed normal-form framework to remedy the lack of the space-time scaling vector field. Finally we show the global solutions scatter linearly strongly for the Klein-Gordon field $φ$ and weakly for the wave field $n$ as $t\to+\infty$. To our best knowledge such scattering phenomenon is novel in the literature.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
Comparative Analysis of Linear Regression, Gaussian Elimination, and LU Decomposition for CT Real Estate Purchase Decisions
Authors:
Xilin Cheng
Abstract:
This paper presents a comprehensive evaluation of three distinct computational algorithms applied to the decision-making process of real estate purchases. Specifically, we analyze the efficacy of Linear Regression from Scikit-learn library, Gaussian Elimination with partial pivoting, and LU Decomposition in predicting the advisability of buying a house in the State of Connecticut based on a set of…
▽ More
This paper presents a comprehensive evaluation of three distinct computational algorithms applied to the decision-making process of real estate purchases. Specifically, we analyze the efficacy of Linear Regression from Scikit-learn library, Gaussian Elimination with partial pivoting, and LU Decomposition in predicting the advisability of buying a house in the State of Connecticut based on a set of financial and market-related parameters. The algorithms' performances were compared using a dataset encompassing town-specific details, yearly data, interest rates, and median sale ratios. Our results demonstrate significant differences in predictive accuracy, with Linear Regression and LU Decomposition providing the most reliable recommendations and Gaussian Elimination showing limitations in stability and performance. The study's findings emphasize the importance of algorithm selection in predictive analytic and offer insights into the practical applications of computational methods in real estate investment strategies. By evaluating model efficacy through metrics such as R-squared scores and Mean Squared Error, we provide a nuanced understanding of each method's strengths and weaknesses, contributing valuable knowledge to the fields of real estate analysis and predictive modeling.
△ Less
Submitted 22 November, 2023;
originally announced November 2023.
-
The limit theory of the energy-critical complex Ginzburg-Landau equation
Authors:
Xing. Cheng,
Chang-yu Guo,
Yunrui. Zheng
Abstract:
We study the limit behavior of the solutions to energy-critical complex Ginzburg-Landau equation. We give a rigorous theory of the zero-dispersion limit from energy-critical complex Ginzburg-Landau equation to energy-critical nonlinear heat equation for dimensions 3 and 4 in both the defocusing and focusing cases by energy method. Furthermore, we also show the invisicid limit of energy-critical co…
▽ More
We study the limit behavior of the solutions to energy-critical complex Ginzburg-Landau equation. We give a rigorous theory of the zero-dispersion limit from energy-critical complex Ginzburg-Landau equation to energy-critical nonlinear heat equation for dimensions 3 and 4 in both the defocusing and focusing cases by energy method. Furthermore, we also show the invisicid limit of energy-critical complex Ginzburg-Landau equation to energy-critical nonlinear Schrödinger equation for dimension 4 in the focusing case.
△ Less
Submitted 22 April, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Hodge Classes in the Cohomology of Local Systems
Authors:
Xiaojiang Cheng
Abstract:
We study the Hodge conjecture for certain families of varieties over arithmetic quotients of balls and Siegel domain of degree two. As a byproduct, we derive formulas for Hodge numbers in terms of automorphic forms.
We study the Hodge conjecture for certain families of varieties over arithmetic quotients of balls and Siegel domain of degree two. As a byproduct, we derive formulas for Hodge numbers in terms of automorphic forms.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
Convergence of flow-based generative models via proximal gradient descent in Wasserstein space
Authors:
Xiuyuan Cheng,
Jianfeng Lu,
Yixin Tan,
Yao Xie
Abstract:
Flow-based generative models enjoy certain advantages in computing the data generation and the likelihood, and have recently shown competitive empirical performance. Compared to the accumulating theoretical studies on related score-based diffusion models, analysis of flow-based models, which are deterministic in both forward (data-to-noise) and reverse (noise-to-data) directions, remain sparse. In…
▽ More
Flow-based generative models enjoy certain advantages in computing the data generation and the likelihood, and have recently shown competitive empirical performance. Compared to the accumulating theoretical studies on related score-based diffusion models, analysis of flow-based models, which are deterministic in both forward (data-to-noise) and reverse (noise-to-data) directions, remain sparse. In this paper, we provide a theoretical guarantee of generating data distribution by a progressive flow model, the so-called JKO flow model, which implements the Jordan-Kinderleherer-Otto (JKO) scheme in a normalizing flow network. Leveraging the exponential convergence of the proximal gradient descent (GD) in Wasserstein space, we prove the Kullback-Leibler (KL) guarantee of data generation by a JKO flow model to be $O(\varepsilon^2)$ when using $N \lesssim \log (1/\varepsilon)$ many JKO steps ($N$ Residual Blocks in the flow) where $\varepsilon $ is the error in the per-step first-order condition. The assumption on data density is merely a finite second moment, and the theory extends to data distributions without density and when there are inversion errors in the reverse process where we obtain KL-$W_2$ mixed error guarantees. The non-asymptotic convergence rate of the JKO-type $W_2$-proximal GD is proved for a general class of convex objective functionals that includes the KL divergence as a special case, which can be of independent interest. The analysis framework can extend to other first-order Wasserstein optimization schemes applied to flow-based generative models.
△ Less
Submitted 16 May, 2024; v1 submitted 26 October, 2023;
originally announced October 2023.
-
Time integration schemes based on neural networks for solving partial differential equations on coarse grids
Authors:
Xinxin Yan,
Zhideng Zhou,
Xiaohan Cheng,
Xiaolei Yang
Abstract:
The accuracy of solving partial differential equations (PDEs) on coarse grids is greatly affected by the choice of discretization schemes. In this work, we propose to learn time integration schemes based on neural networks which satisfy three distinct sets of mathematical constraints, i.e., unconstrained, semi-constrained with the root condition, and fully-constrained with both root and consistenc…
▽ More
The accuracy of solving partial differential equations (PDEs) on coarse grids is greatly affected by the choice of discretization schemes. In this work, we propose to learn time integration schemes based on neural networks which satisfy three distinct sets of mathematical constraints, i.e., unconstrained, semi-constrained with the root condition, and fully-constrained with both root and consistency conditions. We focus on the learning of 3-step linear multistep methods, which we subsequently applied to solve three model PDEs, i.e., the one-dimensional heat equation, the one-dimensional wave equation, and the one-dimensional Burgers' equation. The results show that the prediction error of the learned fully-constrained scheme is close to that of the Runge-Kutta method and Adams-Bashforth method. Compared to the traditional methods, the learned unconstrained and semi-constrained schemes significantly reduce the prediction error on coarse grids. On a grid that is 4 times coarser than the reference grid, the mean square error shows a reduction of up to an order of magnitude for some of the heat equation cases, and a substantial improvement in phase prediction for the wave equation. On a 32 times coarser grid, the mean square error for the Burgers' equation can be reduced by up to 35% to 40%.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Linear attention is (maybe) all you need (to understand transformer optimization)
Authors:
Kwangjun Ahn,
Xiang Cheng,
Minhak Song,
Chulhee Yun,
Ali Jadbabaie,
Suvrit Sra
Abstract:
Transformer training is notoriously difficult, requiring a careful design of optimizers and use of various heuristics. We make progress towards understanding the subtleties of training Transformers by carefully studying a simple yet canonical linearized shallow Transformer model. Specifically, we train linear Transformers to solve regression tasks, inspired by J.~von Oswald et al.~(ICML 2023), and…
▽ More
Transformer training is notoriously difficult, requiring a careful design of optimizers and use of various heuristics. We make progress towards understanding the subtleties of training Transformers by carefully studying a simple yet canonical linearized shallow Transformer model. Specifically, we train linear Transformers to solve regression tasks, inspired by J.~von Oswald et al.~(ICML 2023), and K.~Ahn et al.~(NeurIPS 2023). Most importantly, we observe that our proposed linearized models can reproduce several prominent aspects of Transformer training dynamics. Consequently, the results obtained in this paper suggest that a simple linearized Transformer model could actually be a valuable, realistic abstraction for understanding Transformer optimization.
△ Less
Submitted 13 March, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
Global weak solution of 3-D focusing energy-critical nonlinear Schrödinger equation
Authors:
Xing Cheng,
Chang-Yu Guo,
Yunrui Zheng
Abstract:
In this article, we prove the existence of global weak solutions to the three-dimensional focusing energy-critical nonlinear Schrödinger (NLS) equation in the non-radial case. Furthermore, we prove the weak-strong uniqueness for some class of initial data. The main ingredient of our new approach is to use solutions of an energy-critical Ginzburg-Landau equation as approximations for the correspond…
▽ More
In this article, we prove the existence of global weak solutions to the three-dimensional focusing energy-critical nonlinear Schrödinger (NLS) equation in the non-radial case. Furthermore, we prove the weak-strong uniqueness for some class of initial data. The main ingredient of our new approach is to use solutions of an energy-critical Ginzburg-Landau equation as approximations for the corresponding nonlinear Schördinger equation.
In our proofs, we first show the dichotomy of global well-posedness versus finite time blow-up of energy-critical Ginzburg-Landau equation in $\dot{H}^1( \mathbb{R}^d)$ for $d = 3,4 $ when the energy is less than the energy of the stationary solution $W$. We follow the strategy of C. E. Kenig and F. Merle [25,26], using a concentration-compactness/rigidity argument to reduce the global well-posedness to the exclusion of a critical element. The critical element is ruled out by dissipation of the Ginzburg-Landau equation, including local smoothness, backwards uniqueness and unique continuation. The existence of global weak solution of the three dimensional focusing energy-critical nonlinear Schrödinger equation in the non-radial case then follows from the global well-posedness of the energy-critical Ginzburg-Landau equation via a limitation argument. We also adapt the arguments of M. Struwe [37,38] to prove the weak-strong uniqueness when the $\dot{H}^1$-norm of the initial data is bounded by a constant depending on the stationary solution $W$.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Some inequalities and gradient estimates for harmonic functions on Finsler measure spaces
Authors:
Xinyue Cheng,
Yalu Feng
Abstract:
In this paper, we study functional and geometric inequalities on complete Finsler measure spaces under the condition that the weighted Ricci curvature ${\rm Ric}_\infty$ has a lower bound. We first obtain some local uniform Poincaré inequalities and Sobolev inequalities. Further, we give a mean value inequality for nonnegative subsolutions of elliptic equations. Finally, we obtain local and global…
▽ More
In this paper, we study functional and geometric inequalities on complete Finsler measure spaces under the condition that the weighted Ricci curvature ${\rm Ric}_\infty$ has a lower bound. We first obtain some local uniform Poincaré inequalities and Sobolev inequalities. Further, we give a mean value inequality for nonnegative subsolutions of elliptic equations. Finally, we obtain local and global Harnack inequalities, and then, establish a global gradient estimate for positive harmonic functions on forward complete non-compact Finsler measure spaces. Besides, as a by-product of the mean value inequality, we prove a Liouville type theorem.
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
Fast Conditional Mixing of MCMC Algorithms for Non-log-concave Distributions
Authors:
Xiang Cheng,
Bohan Wang,
**gzhao Zhang,
Yusong Zhu
Abstract:
MCMC algorithms offer empirically efficient tools for sampling from a target distribution $π(x) \propto \exp(-V(x))$. However, on the theory side, MCMC algorithms suffer from slow mixing rate when $π(x)$ is non-log-concave. Our work examines this gap and shows that when Poincaré-style inequality holds on a subset $\mathcal{X}$ of the state space, the conditional distribution of MCMC iterates over…
▽ More
MCMC algorithms offer empirically efficient tools for sampling from a target distribution $π(x) \propto \exp(-V(x))$. However, on the theory side, MCMC algorithms suffer from slow mixing rate when $π(x)$ is non-log-concave. Our work examines this gap and shows that when Poincaré-style inequality holds on a subset $\mathcal{X}$ of the state space, the conditional distribution of MCMC iterates over $\mathcal{X}$ mixes fast to the true conditional distribution. This fast mixing guarantee can hold in cases when global mixing is provably slow. We formalize the statement and quantify the conditional mixing rate. We further show that conditional mixing can have interesting implications for sampling from mixtures of Gaussians, parameter estimation for Gaussian mixture models and Gibbs-sampling with well-connected local minima.
△ Less
Submitted 14 January, 2024; v1 submitted 18 June, 2023;
originally announced June 2023.
-
On the Stability of Symmetric Periodic Orbits of a Comb-Drive Finger Actuator Model
Authors:
Xuhua Cheng,
Baoting Liu
Abstract:
In this paper, we study the stability of symmetric periodic solutions of the comb-drive finger actuator model. First, on the basis of the relationship between the potential and the period as a function of the energy, we derive the properties of the period of the solution of the corresponding autonomous system (the parameter $δ$ of input voltage $V_δ(t)$ is equal to zero) in the prescribed energy r…
▽ More
In this paper, we study the stability of symmetric periodic solutions of the comb-drive finger actuator model. First, on the basis of the relationship between the potential and the period as a function of the energy, we derive the properties of the period of the solution of the corresponding autonomous system (the parameter $δ$ of input voltage $V_δ(t)$ is equal to zero) in the prescribed energy range. Then, using these properties and the stability criteria of symmetric periodic solutions of the time-periodic Newtonian equation, we analytically prove the linear stability/instability of the symmetric $(m,p)$-periodic solutions which emanated from nonconstant periodic solutions of the corresponding autonomous system when the parameter $δ$ is small.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
The locally homeomorphic property of McKean-Vlasov SDEs under the global Lipschitz condition
Authors:
Xian** Cheng,
Zhenxin Liu
Abstract:
In this paper, we establish the locally diffeomorphic property of the solution to McKean-Vlasov stochastic differential equations defined on the Euclidean space. Our approach is built upon the insightful ideas put forth by Kunita. We observe that although the coefficients satisfy the global Lipschitz condition and some suitable regularity condition, the solution in general does not satisfy the glo…
▽ More
In this paper, we establish the locally diffeomorphic property of the solution to McKean-Vlasov stochastic differential equations defined on the Euclidean space. Our approach is built upon the insightful ideas put forth by Kunita. We observe that although the coefficients satisfy the global Lipschitz condition and some suitable regularity condition, the solution in general does not satisfy the globally homeomorphic property at any time except the initial time, which sets McKean-Vlasov stochastic differential equations apart significantly from classical stochastic differential equations. Finally, we provide an example to complement our results.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Neural Differential Recurrent Neural Network with Adaptive Time Steps
Authors:
Yixuan Tan,
Liyan Xie,
Xiuyuan Cheng
Abstract:
The neural Ordinary Differential Equation (ODE) model has shown success in learning complex continuous-time processes from observations on discrete time stamps. In this work, we consider the modeling and forecasting of time series data that are non-stationary and may have sharp changes like spikes. We propose an RNN-based model, called RNN-ODE-Adap, that uses a neural ODE to represent the time dev…
▽ More
The neural Ordinary Differential Equation (ODE) model has shown success in learning complex continuous-time processes from observations on discrete time stamps. In this work, we consider the modeling and forecasting of time series data that are non-stationary and may have sharp changes like spikes. We propose an RNN-based model, called RNN-ODE-Adap, that uses a neural ODE to represent the time development of the hidden states, and we adaptively select time steps based on the steepness of changes of the data over time so as to train the model more efficiently for the "spike-like" time series. Theoretically, RNN-ODE-Adap yields provably a consistent estimation of the intensity function for the Hawkes-type time series data. We also provide an approximation analysis of the RNN-ODE model showing the benefit of adaptive steps. The proposed model is demonstrated to achieve higher prediction accuracy with reduced computational cost on simulated dynamic system data and point process data and on a real electrocardiography dataset.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
Euclidean Gallai-Ramsey for various configurations
Authors:
Xinbu Cheng,
Zixiang Xu
Abstract:
The Euclidean Gallai-Ramsey problem, which investigates the existence of monochromatic or rainbow configurations in a colored $n$-dimensional Euclidean space $\mathbb{E}^{n}$, was introduced and studied recently. We further explore this problem for various configurations including triangles, squares, lines, and the structures with specific properties, such as rectangular and spherical configuratio…
▽ More
The Euclidean Gallai-Ramsey problem, which investigates the existence of monochromatic or rainbow configurations in a colored $n$-dimensional Euclidean space $\mathbb{E}^{n}$, was introduced and studied recently. We further explore this problem for various configurations including triangles, squares, lines, and the structures with specific properties, such as rectangular and spherical configurations. Several of our new results provide refinements to the results presented in a recent work by Mao, Ozeki and Wang. One intriguing phenomenon evident on the Gallai-Ramsey results proven in this paper is that the dimensions of spaces are often independent of the number of colors. Our proofs primarily adopt a geometric perspective.
△ Less
Submitted 29 May, 2023;
originally announced May 2023.
-
Expanding solutions near unstable Lane-Emden stars
Authors:
Ming Cheng,
Xing Cheng,
Zhiwu Lin
Abstract:
In this paper, we consider the compressible Euler-Poisson equations for polytropes $P(ρ)=Kρ^γ$ and the white dwarf star. Firstly, we develop two variational problem for $γ=\frac{4}{3}$ and $γ\in \left(\frac{6}{5},\frac{4}{3} \right)$ respectively. The first variational problem for $γ=\frac{4}{3}$ is related to the best constant of a Hardy-Littlewood type inequality. The best constant obtained is s…
▽ More
In this paper, we consider the compressible Euler-Poisson equations for polytropes $P(ρ)=Kρ^γ$ and the white dwarf star. Firstly, we develop two variational problem for $γ=\frac{4}{3}$ and $γ\in \left(\frac{6}{5},\frac{4}{3} \right)$ respectively. The first variational problem for $γ=\frac{4}{3}$ is related to the best constant of a Hardy-Littlewood type inequality. The best constant obtained is sharp and it yields a threshold of the mass to the gaseous star which is the Chandrasekhar limit mass. For $γ\in \left(\frac{6}{5},\frac{4}{3} \right)$, we construct a type of cross constrained variational problem attained by the Lane-Emden function. Then, we show that the spherically symmetric finite energy weak solution globally exists if the mass is less than the Chandrasekhar limit mass for $γ=\frac{4}{3}$ or the initial data belongs to an invariant set constructed by the cross-constrained variational argument for $γ\in \left(\frac{6}{5},\frac{4}{3} \right)$. Furthermore, we conditionally obtain that the support of the gaseous star expands as time tends to infinity with a virial argument. We also consider the white dwarf star and prove that if the mass is less than the Chandrasekhar limit mass, the white dwarf star cannot collapse to a point.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
Time-Domain Moment Matching for Second-Order Systems
Authors:
Xiaodong Cheng,
Tudor C. Ionescu,
Monica Pătraşcu
Abstract:
This paper studies a structure-preserving model reduction problem for large-scale second-order dynamical systems via the framework of time-domain moment matching. The moments of a second-order system are interpreted as the solutions of second-order Sylvester equations, which leads to families of parameterized second-order reduced models that match the moments of an original second-order system at…
▽ More
This paper studies a structure-preserving model reduction problem for large-scale second-order dynamical systems via the framework of time-domain moment matching. The moments of a second-order system are interpreted as the solutions of second-order Sylvester equations, which leads to families of parameterized second-order reduced models that match the moments of an original second-order system at selected interpolation points. Based on this, a two-sided moment matching problem is addressed, providing a unique second-order reduced system that match two distinct sets interpolation points. Furthermore, we also construct the reduced second-order systems that matches the moments of both zero and first order derivative of the original second-order system. Finally, the Loewner framework is extended to the second-order systems, where two parameterized families of models are presented that retain the second-order structure and interpolate sets of tangential data.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Symmetry groups, fundamental solutions and conservation laws for conformable time fractional partial differential system with variable coefficients
Authors:
Xiaoyu Cheng,
Lizhen Wang
Abstract:
In this paper, the relationships between Lie symmetry groups and fundamental solutions for a class of conformable time fractional partial differential equations (PDEs) with variable coefficients are investigated. Specifically, the group-invariant solutions to the considered equations are constructed applying symmetry group method and the corresponding fundamental solutions for these systems are es…
▽ More
In this paper, the relationships between Lie symmetry groups and fundamental solutions for a class of conformable time fractional partial differential equations (PDEs) with variable coefficients are investigated. Specifically, the group-invariant solutions to the considered equations are constructed applying symmetry group method and the corresponding fundamental solutions for these systems are established with the help of the above obtained group-invariant solutions and inverting Laplace transformation. In addition, the connections between fundamental solutions for two conformable time fractional systems are given by equivalence transformation. Furthermore, the conservation laws of these fractional systems are provided using new Noether theorem and obtained Lie algebras.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
Port-Hamiltonian formulations of the incompressible Euler equations with a free surface
Authors:
Xiaoyu Cheng,
J. J. W. Van der Vegt,
Yan Xu,
H. J. Zwart
Abstract:
In this paper, we present port-Hamiltonian formulations of the incompressible Euler equations with a free surface governed by surface tension and gravity forces, modelling e.g. capillary and gravity waves and the evolution of droplets in air. Three sets of variables are considered, namely $(v,Σ)$, $(η,φ_{\partial},Σ)$ and $(ω,φ_{\partial},Σ)$, with $v$ the velocity, $η$ the solenoidal velocity,…
▽ More
In this paper, we present port-Hamiltonian formulations of the incompressible Euler equations with a free surface governed by surface tension and gravity forces, modelling e.g. capillary and gravity waves and the evolution of droplets in air. Three sets of variables are considered, namely $(v,Σ)$, $(η,φ_{\partial},Σ)$ and $(ω,φ_{\partial},Σ)$, with $v$ the velocity, $η$ the solenoidal velocity, $φ_{\partial}$ a potential, $ω$ the vorticity, and $Σ$ the free surface, resulting in the incompressible Euler equations in primitive variables and the vorticity equation. First, the Hamiltonian formulation for the incompressible Euler equations in a domain with a free surface combined with a fixed boundary surface with a homogeneous boundary condition will be derived in the proper Sobolev spaces of differential forms. Next, these results will be extended to port-Hamiltonian formulations allowing inhomogeneous boundary conditions and a non-zero energy flow through the boundaries. Our main results are the construction and proof of Dirac structures in suitable Sobolev spaces of differential forms for each variable set, which provides the core of any port-Hamiltonian formulation. Finally, it is proven that the state dependent Dirac structures are related to Poisson brackets that are linear, skew-symmetric and satisfy the Jacobi identity.
△ Less
Submitted 29 April, 2023;
originally announced May 2023.
-
A necessary and sufficient condition for lower bounds on crossing numbers of generalized periodic graphs in an arbitrary surface
Authors:
Xiwu Yang,
Xiaodong Cheng,
Yuansheng Yang
Abstract:
Let $H$, $T$ and $C_n$ be a graph, a tree and a cycle of order $n$, respectively. Let $H^{(i)}$ be the complete join of $H$ and an empty graph on $i$ vertices. Then the Cartesian product $H\Box T$ of $H$ and $T$ can be obtained by applying zip product on $H^{(i)}$ and the graph produced by zip product repeatedly. Let $\textrm{cr}_Σ(H)$ denote the crossing number of $H$ in an arbitrary surface $Σ$.…
▽ More
Let $H$, $T$ and $C_n$ be a graph, a tree and a cycle of order $n$, respectively. Let $H^{(i)}$ be the complete join of $H$ and an empty graph on $i$ vertices. Then the Cartesian product $H\Box T$ of $H$ and $T$ can be obtained by applying zip product on $H^{(i)}$ and the graph produced by zip product repeatedly. Let $\textrm{cr}_Σ(H)$ denote the crossing number of $H$ in an arbitrary surface $Σ$. If $H$ satisfies certain connectivity condition, then $\textrm{cr}_Σ(H\Box T)$ is not less than the sum of the crossing numbers of its ``subgraphs". In this paper, we introduced a new concept of generalized periodic graphs, which contains $H\Box C_n$. For a generalized periodic graph $G$ and a function $f(t)$, where $t$ is the number of subgraphs in a decomposition of $G$, we gave a necessary and sufficient condition for $\textrm{cr}_Σ(G)\geq f(t)$. As an application, we confirmed a conjecture of Lin et al. on the crossing number of the generalized Petersen graph $P(4h+2,2h)$ in the plane. Based on the condition, algorithms are constructed to compute lower bounds on the crossing number of generalized periodic graphs in $Σ$. In special cases, it is possible to determine lower bounds on an infinite family of generalized periodic graphs, by determining a lower bound on the crossing number of a finite generalized periodic graph.
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
An Optimal Projection Framework for Structure-Preserving Model Reduction of Linear Systems
Authors:
Xiaodong Cheng
Abstract:
This paper presents a structure-preserving model reduction framework for linear systems, in which the $\mathcal{H}_2$ optimization is incorporated with the Petrov-Galerkin projection to preserve structural features of interest, including dissipativity, passivity, and bounded realness. The model reduction problem is formulated in a nonconvex optimization setting on a noncompact Stiefel manifold, ai…
▽ More
This paper presents a structure-preserving model reduction framework for linear systems, in which the $\mathcal{H}_2$ optimization is incorporated with the Petrov-Galerkin projection to preserve structural features of interest, including dissipativity, passivity, and bounded realness. The model reduction problem is formulated in a nonconvex optimization setting on a noncompact Stiefel manifold, aiming to minimize the $\mathcal{H}_2$ norm of the approximation error between the full-order and reduced-order models. The explicit expression for the gradient of the objective function is derived, and two gradient descent procedures are applied to seek for a (local) minimum, followed by a theoretical analysis on the convergence properties of the algorithms. Finally, the performance of the proposed method is demonstrated by two numerical examples which consider stability-preserving and passivity-preserving model reduction problems, respectively.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Exact values and improved bounds on $k$-neighborly families of boxes
Authors:
Xinbu Cheng,
Meiqin Wang,
Zixiang Xu,
Chi Hoi Yip
Abstract:
A finite family $\mathcal{F}$ of $d$-dimensional convex polytopes is called $k$-neighborly if $d-k\le\textup{dim}(C\cap C')\le d-1$ for any two distinct members $C,C'\in\mathcal{F}$. In 1997, Alon initiated the study of the general function $n(k,d)$, which is defined to be the maximum size of $k$-neighborly families of standard boxes in $\mathbb{R}^{d}$. Based on a weighted count of vectors in…
▽ More
A finite family $\mathcal{F}$ of $d$-dimensional convex polytopes is called $k$-neighborly if $d-k\le\textup{dim}(C\cap C')\le d-1$ for any two distinct members $C,C'\in\mathcal{F}$. In 1997, Alon initiated the study of the general function $n(k,d)$, which is defined to be the maximum size of $k$-neighborly families of standard boxes in $\mathbb{R}^{d}$. Based on a weighted count of vectors in $\{0,1\}^{d}$, we improve a recent upper bound on $n(k,d)$ by Alon, Grytczuk, Kisielewicz, and Przesławski for any positive integers $d$ and $k$ with $d\ge k+2$. In particular, when $d$ is sufficiently large and $k\ge 0.123d$, our upper bound on $n(k,d)$ improves the bound $\sum_{i=1}^{k}2^{i-1}\binom{d}{i}+1$ shown by Huang and Sudakov exponentially.
Furthermore, we determine that $n(2,4)=9$, $n(3,5)=18$, $n(3,6)=27$, $n(4,6)=37$, $n(5,7)=74$, and $n(6,8)=150$. The stability result of Kleitman's isodiametric inequality plays an important role in the proofs.
△ Less
Submitted 5 January, 2024; v1 submitted 16 January, 2023;
originally announced January 2023.
-
Linking number of monotonic cycles in random book embeddings of complete graphs
Authors:
Yasmin Aguillon,
Eric Burkholder,
Xingyu Cheng,
Spencer Eddins,
Emma Harrell,
Kenji Kozai,
Elijah Leake,
Pedro Morales
Abstract:
A book embedding of a complete graph is a spatial embedding whose planar projection has the vertices located along a circle, consecutive vertices are connected by arcs of the circle, and the projections of the remaining "interior" edges in the graph are straight line segments between the points on the circle representing the appropriate vertices. A random embedding of a complete graph can be gener…
▽ More
A book embedding of a complete graph is a spatial embedding whose planar projection has the vertices located along a circle, consecutive vertices are connected by arcs of the circle, and the projections of the remaining "interior" edges in the graph are straight line segments between the points on the circle representing the appropriate vertices. A random embedding of a complete graph can be generated by randomly assigning relative heights to these interior edges. We study a family of two-component links that arise as the realizations of pairs of disjoint cycles in these random embeddings of graphs. In particular, we show that the distribution of linking numbers can be described in terms of Eulerian numbers. Consequently, the mean of the squared linking number over all random embeddings is $\frac{i}{6}$, where $i$ is the number of interior edges in the cycles. We also show that the mean of the squared linking number over all pairs of $n$-cycles in $K_{2n}$ grows linearly in $n$.
△ Less
Submitted 5 January, 2023;
originally announced January 2023.
-
Compatible Powers of Hamilton Cycles in Dense Graphs
Authors:
Xiaohan Cheng,
Jie Hu,
Donglei Yang
Abstract:
Motivated by the concept of transition system investigated by Kotzig in 1968, Krivelevich, Lee and Sudakov proposed a more general notion of incompatibility system to formulate the robustness of Hamiltonicity of Dirac graphs. Given a graph $G=(V,E)$, an {\em incompatibility system} $\mathcal{F}$ over $G$ is a family $\mathcal{F}=\{F_v\}_{v\in V}$ such that for every $v\in V$, $F_v$ is a family of…
▽ More
Motivated by the concept of transition system investigated by Kotzig in 1968, Krivelevich, Lee and Sudakov proposed a more general notion of incompatibility system to formulate the robustness of Hamiltonicity of Dirac graphs. Given a graph $G=(V,E)$, an {\em incompatibility system} $\mathcal{F}$ over $G$ is a family $\mathcal{F}=\{F_v\}_{v\in V}$ such that for every $v\in V$, $F_v$ is a family of edge pairs in $\{\{e,e'\}: e\ne e'\in E, e\cap e'=\{v\}\}$. An incompatibility system $\mathcal{F}$ is \emph{$Δ$-bounded} if for every vertex $v$ and every edge $e$ incident with $v$, there are at most $Δ$ pairs in $F_v$ containing $e$. A subgraph $H$ of $G$ is \emph{compatible} (with respect to $\mathcal{F}$) if every pair of adjacent edges $e,e'$ of $H$ satisfies $\{e,e'\} \notin F_v$, where $v=e\cap e'$. Krivelevich, Lee and Sudakov proved that there is an universal constant $μ>0$ such that for every $μn$-bounded incompatibility system $\mathcal{F}$ over a Dirac graph, there exists a compatible Hamilton cycle, which resolves a conjecture of Häggkvist from 1988. We study high powers of Hamilton cycles in this context and show that for every $γ>0$ and $k\in\mathbb{N}$, there exists a constant $μ>0$ such that for sufficiently large $n\in\mathbb{N}$ and every $μn$-bounded incompatibility system over an $n$-vertex graph $G$ with $δ(G)\ge(\frac{k}{k+1}+γ)n$, there exists a compatible $k$-th power of a Hamilton cycle in $G$. Moreover, we give a construction which has minimum degree $\frac{k}{k+1}n+Ω(n)$ and contains no compatible $k$-th power of a Hamilton cycle.
△ Less
Submitted 18 February, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
On the global well-posedness and scattering of the 3D Klein-Gordon-Zakharov system
Authors:
Xinyu Cheng,
Jiao Xu
Abstract:
In this paper we are interested in the global well-posedness of the 3D Klein-Gordon-Zakharov equations with small initial data. We show the uniform boundedness of the energy for the global solution without any compactness assumptions on the initial data. The main novelty of our proof is to apply a modified Alinhac's ghost weight method together with a newly developed normal-form type estimate to r…
▽ More
In this paper we are interested in the global well-posedness of the 3D Klein-Gordon-Zakharov equations with small initial data. We show the uniform boundedness of the energy for the global solution without any compactness assumptions on the initial data. The main novelty of our proof is to apply a modified Alinhac's ghost weight method together with a newly developed normal-form type estimate to remedy the lack of the space-time scaling vector field; moreover, we give a clear description of the smallness conditions on the initial data.
△ Less
Submitted 9 April, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Localization for general Helmholtz
Authors:
Xinyu Cheng,
Dong Li,
Wen Yang
Abstract:
In \cite{gmw2022}, Guan, Murugan and Wei established the equivalence of the classical Helmholtz equation with a ``fractional Helmholtz" equation in which the Laplacian operator is replaced by the nonlocal fractional Laplacian operator. More general equivalence results are obtained for symbols which are complete Bernstein and satisfy additional regularity conditions. In this work we introduce a nov…
▽ More
In \cite{gmw2022}, Guan, Murugan and Wei established the equivalence of the classical Helmholtz equation with a ``fractional Helmholtz" equation in which the Laplacian operator is replaced by the nonlocal fractional Laplacian operator. More general equivalence results are obtained for symbols which are complete Bernstein and satisfy additional regularity conditions. In this work we introduce a novel and general set-up for this Helmholtz equivalence problem. We show that under very mild and easy-to-check conditions on the Fourier multiplier, the general Helmholtz equation can be effectively reduced to a localization statement on the support of the symbol.
△ Less
Submitted 19 June, 2024; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Robust Inference of Manifold Density and Geometry by Doubly Stochastic Scaling
Authors:
Boris Landa,
Xiuyuan Cheng
Abstract:
The Gaussian kernel and its traditional normalizations (e.g., row-stochastic) are popular approaches for assessing similarities between data points. Yet, they can be inaccurate under high-dimensional noise, especially if the noise magnitude varies considerably across the data, e.g., under heteroskedasticity or outliers. In this work, we investigate a more robust alternative -- the doubly stochasti…
▽ More
The Gaussian kernel and its traditional normalizations (e.g., row-stochastic) are popular approaches for assessing similarities between data points. Yet, they can be inaccurate under high-dimensional noise, especially if the noise magnitude varies considerably across the data, e.g., under heteroskedasticity or outliers. In this work, we investigate a more robust alternative -- the doubly stochastic normalization of the Gaussian kernel. We consider a setting where points are sampled from an unknown density on a low-dimensional manifold embedded in high-dimensional space and corrupted by possibly strong, non-identically distributed, sub-Gaussian noise. We establish that the doubly stochastic affinity matrix and its scaling factors concentrate around certain population forms, and provide corresponding finite-sample probabilistic error bounds. We then utilize these results to develop several tools for robust inference under general high-dimensional noise. First, we derive a robust density estimator that reliably infers the underlying sampling density and can substantially outperform the standard kernel density estimator under heteroskedasticity and outliers. Second, we obtain estimators for the pointwise noise magnitudes, the pointwise signal magnitudes, and the pairwise Euclidean distances between clean data points. Lastly, we derive robust graph Laplacian normalizations that accurately approximate various manifold Laplacians, including the Laplace Beltrami operator, improving over traditional normalizations in noisy settings. We exemplify our results in simulations and on real single-cell RNA-sequencing data. For the latter, we show that in contrast to traditional methods, our approach is robust to variability in technical noise levels across cell types.
△ Less
Submitted 10 July, 2023; v1 submitted 16 September, 2022;
originally announced September 2022.
-
Girth of the algebraic bipartite graph $D(k,q)$
Authors:
Ming Xu,
Xiaoyan Cheng,
Yuansheng Tang
Abstract:
For integer $k\geq2$ and prime power $q$, the algebraic bipartite graph $D(k,q)$ proposed by Lazebnik and Ustimenko (1995) is meaningful not only in extremal graph theory but also in coding theory and cryptography. This graph is $q$-regular, edge-transitive and of girth at least $k+4$.
Its exact girth $g=g(D(k,q))$ was conjectured in 1995 to be $k+5$ for odd $k$ and $q\geq4$.
This conjecture w…
▽ More
For integer $k\geq2$ and prime power $q$, the algebraic bipartite graph $D(k,q)$ proposed by Lazebnik and Ustimenko (1995) is meaningful not only in extremal graph theory but also in coding theory and cryptography. This graph is $q$-regular, edge-transitive and of girth at least $k+4$.
Its exact girth $g=g(D(k,q))$ was conjectured in 1995 to be $k+5$ for odd $k$ and $q\geq4$.
This conjecture was shown to be valid in 2016 when $\frac{k+5}{2}|_p(q-1)$, where $p$ is the characteristic of $\mathbb{F}_q$ and $m|_pn$ means that $m$ divides $p^r n$ for some nonnegative integer $r$. In this paper, for $t\geq 1$ we prove that (a) $g(D(4t+2,q))=g(D(4t+1,q))$; (b) $g(D(4t+3,q))=4t+8$ if $g(D(2t,q))=2t+4$; (c) $g(D(8t,q))=8t+4$ if $g(D(4t-2,q))=4t+2$; (d) $g(D(2^{s+2}(2t-1)-5,q))=2^{s+2}(2t-1)$ if $p\geq 3$, $(2t-1)|_p(q-1)$ and $2^s\|(q-1)$.
A simple upper bound for the girth of $D(k,q)$ is proposed in the end of this paper.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Some rigidity properties for $λ$-self-expanders
Authors:
Saul Ancari,
Xu Cheng
Abstract:
$λ$-self-expanders $Σ$ in $\mathbb{R}^{n+1}$ are the solutions of the isoperimetric problem with respect to the same weighted area form as in the study of the self-expanders. In this paper, we mainly extend the results on self-expanders which we obtained in \cite{ancari2020volum} to $λ$-self-expanders. We prove some results that characterize the hyperplanes, spheres and cylinders as $λ$-self-expan…
▽ More
$λ$-self-expanders $Σ$ in $\mathbb{R}^{n+1}$ are the solutions of the isoperimetric problem with respect to the same weighted area form as in the study of the self-expanders. In this paper, we mainly extend the results on self-expanders which we obtained in \cite{ancari2020volum} to $λ$-self-expanders. We prove some results that characterize the hyperplanes, spheres and cylinders as $λ$-self-expanders. We also discuss the area growths and the finiteness of the weighted areas under the control of the growth of the mean curvature.
△ Less
Submitted 20 August, 2022;
originally announced August 2022.
-
Small perturbations may change the sign of Lyapunov exponents for linear SDEs
Authors:
Xian** Cheng,
Zhenxin Liu,
Lixin Zhang
Abstract:
In this paper, we study the existence of $n$-dimensional linear stochastic differential equations (SDEs) such that the sign of Lyapunov exponents is changed under an exponentially decaying perturbation. First, we show that the equation with all positive Lyapunov exponents will have $n-1$ linearly independent solutions with negative Lyapunov exponents under the perturbation. Meanwhile, we prove tha…
▽ More
In this paper, we study the existence of $n$-dimensional linear stochastic differential equations (SDEs) such that the sign of Lyapunov exponents is changed under an exponentially decaying perturbation. First, we show that the equation with all positive Lyapunov exponents will have $n-1$ linearly independent solutions with negative Lyapunov exponents under the perturbation. Meanwhile, we prove that the equation with all negative Lyapunov exponents will also have solutions with positive Lyapunov exponents under another similar perturbation. Finally, we also show that other three kinds of perturbations which appear at different positions of the equation will change the sign of Lyapunov exponents.
△ Less
Submitted 3 January, 2023; v1 submitted 12 August, 2022;
originally announced August 2022.
-
On the girth cycles of the bipartite graph $D(k,q)$
Authors:
Ming Xu,
Xiaoyan Cheng,
Yuansheng Tang
Abstract:
For integer $k\geq2$ and prime power $q$, the algebraic bipartite graph $D(k,q)$ proposed by Lazebnik and Ustimenko (1995) is meaningful not only in extremal graph theory but also in coding theory and cryptography. This graph is $q$-regular, edge-transitive and of girth at least $k+4$. For its exact girth $g=g(D(k,q))$,
Füredi et al. (1995) conjectured $g=k+5$ for odd $k$ and $q\geq4$.
This co…
▽ More
For integer $k\geq2$ and prime power $q$, the algebraic bipartite graph $D(k,q)$ proposed by Lazebnik and Ustimenko (1995) is meaningful not only in extremal graph theory but also in coding theory and cryptography. This graph is $q$-regular, edge-transitive and of girth at least $k+4$. For its exact girth $g=g(D(k,q))$,
Füredi et al. (1995) conjectured $g=k+5$ for odd $k$ and $q\geq4$.
This conjecture was shown to be valid in 2016 when $(k+5)/2$ is the product of an arbitrary factor of $q-1$ and an arbitrary power of the characteristic of $\mathbb{F}_q$.
In this paper, we determine all the girth cycles of $D(k,q)$ for $3\leq k\leq 5$, $q>3$, and those for $3\leq k\leq8$, $q=3$.
△ Less
Submitted 26 July, 2022;
originally announced July 2022.
-
Local rainbow colorings for various graphs
Authors:
Xinbu Cheng,
Zixiang Xu
Abstract:
Motivated by a problem in theoretical computer science suggested by Wigderson, Alon and Ben-Eliezer studied the following extremal problem systematically one decade ago. Given a graph $H$, let $C(n,H)$ be the minimum number $k$ such that the following holds. There are $n$ colorings of $E(K_{n})$ with $k$ colors, each associated with one of the vertices of $K_{n}$, such that for every copy $T$ of…
▽ More
Motivated by a problem in theoretical computer science suggested by Wigderson, Alon and Ben-Eliezer studied the following extremal problem systematically one decade ago. Given a graph $H$, let $C(n,H)$ be the minimum number $k$ such that the following holds. There are $n$ colorings of $E(K_{n})$ with $k$ colors, each associated with one of the vertices of $K_{n}$, such that for every copy $T$ of $H$ in $K_{n}$, at least one of the colorings that are associated with $V(T)$ assigns distinct colors to all the edges of $E(T)$. In this paper, we obtain several new results in this problem including: \begin{itemize}
\item For paths of short length, we show that $C(n,P_{4})=Ω(n^{1/5})$ and $C(n,P_{t})=Ω(n^{1/3})$ with $t\in\{5,6\}$, which significantly improve the previously known lower bounds $(\log{n})^{Ω(1)}$.
\item We make progress on the problem of Alon and Ben-Eliezer about complete graphs, more precisely, we show that $C(n,K_{r})=Ω(n^{2/3})$ when $r\geqslant 8$. This provides the first instance of graph for which the lower bound goes beyond the natural barrier $Ω(n^{1/2})$. Moreover, we prove that $C(n,K_{s,t})=Ω(n^{2/3})$ for $t\geqslant s\geqslant 7$.
\item When $H$ is a star with at least $4$ leaves, a matching of size at least $4$, or a path of length at least $7$, we give the new lower bound for $C(n,H)$. We also show that for any graph $H$ with at least $6$ edges, $C(n,H)$ is polynomial in $n$. All of these improve the corresponding results obtained by Alon and Ben-Eliezer.
△ Less
Submitted 3 February, 2023; v1 submitted 15 July, 2022;
originally announced July 2022.
-
Neural Stein critics with staged $L^2$-regularization
Authors:
Matthew Repasky,
Xiuyuan Cheng,
Yao Xie
Abstract:
Learning to differentiate model distributions from observed data is a fundamental problem in statistics and machine learning, and high-dimensional data remains a challenging setting for such problems. Metrics that quantify the disparity in probability distributions, such as the Stein discrepancy, play an important role in high-dimensional statistical testing. In this paper, we investigate the role…
▽ More
Learning to differentiate model distributions from observed data is a fundamental problem in statistics and machine learning, and high-dimensional data remains a challenging setting for such problems. Metrics that quantify the disparity in probability distributions, such as the Stein discrepancy, play an important role in high-dimensional statistical testing. In this paper, we investigate the role of $L^2$ regularization in training a neural network Stein critic so as to distinguish between data sampled from an unknown probability distribution and a nominal model distribution. Making a connection to the Neural Tangent Kernel (NTK) theory, we develop a novel staging procedure for the weight of regularization over training time, which leverages the advantages of highly-regularized training at early times. Theoretically, we prove the approximation of the training dynamic by the kernel optimization, namely the ``lazy training'', when the $L^2$ regularization weight is large, and training on $n$ samples converge at a rate of ${O}(n^{-1/2})$ up to a log factor. The result guarantees learning the optimal critic assuming sufficient alignment with the leading eigen-modes of the zero-time NTK. The benefit of the staged $L^2$ regularization is demonstrated on simulated high dimensional data and an application to evaluating generative models of image data.
△ Less
Submitted 1 May, 2023; v1 submitted 7 July, 2022;
originally announced July 2022.
-
Bi-stochastically normalized graph Laplacian: convergence to manifold Laplacian and robustness to outlier noise
Authors:
Xiuyuan Cheng,
Boris Landa
Abstract:
Bi-stochastic normalization provides an alternative normalization of graph Laplacians in graph-based data analysis and can be computed efficiently by Sinkhorn-Knopp (SK) iterations. This paper proves the convergence of bi-stochastically normalized graph Laplacian to manifold (weighted-)Laplacian with rates, when $n$ data points are i.i.d. sampled from a general $d$-dimensional manifold embedded in…
▽ More
Bi-stochastic normalization provides an alternative normalization of graph Laplacians in graph-based data analysis and can be computed efficiently by Sinkhorn-Knopp (SK) iterations. This paper proves the convergence of bi-stochastically normalized graph Laplacian to manifold (weighted-)Laplacian with rates, when $n$ data points are i.i.d. sampled from a general $d$-dimensional manifold embedded in a possibly high-dimensional space. Under certain joint limit of $n \to \infty$ and kernel bandwidth $ε\to 0$, the point-wise convergence rate of the graph Laplacian operator (under 2-norm) is proved to be $ O( n^{-1/(d/2+3)})$ at finite large $n$ up to log factors, achieved at the scaling of $ε\sim n^{-1/(d/2+3)} $. When the manifold data are corrupted by outlier noise, we theoretically prove the graph Laplacian point-wise consistency which matches the rate for clean manifold data plus an additional term proportional to the boundedness of the inner-products of the noise vectors among themselves and with data vectors. Motivated by our analysis, which suggests that not exact bi-stochastic normalization but an approximate one will achieve the same consistency rate, we propose an approximate and constrained matrix scaling problem that can be solved by SK iterations with early termination. Numerical experiments support our theoretical results and show the robustness of bi-stochastically normalized graph Laplacian to high-dimensional outlier noise.
△ Less
Submitted 26 January, 2023; v1 submitted 22 June, 2022;
originally announced June 2022.
-
The characterizations on a class of weakly weighted Einstein-Finsler metrics
Authors:
Xinyue Cheng,
Hong Cheng,
Pengsheng Wu
Abstract:
In this paper, we study the weakly weighted Einstein-Finsler metrics. First, we show that weakly weighted Einstein-Kropina metrics must be of isotropic S-curvature with respect to the Busemann-Hausdorff volume form under a certain condition about the weight constants. Then we characterize weakly weighted Einstein-Kropina metrics completely via their navigation expressions or via $α$ and $β$ respec…
▽ More
In this paper, we study the weakly weighted Einstein-Finsler metrics. First, we show that weakly weighted Einstein-Kropina metrics must be of isotropic S-curvature with respect to the Busemann-Hausdorff volume form under a certain condition about the weight constants. Then we characterize weakly weighted Einstein-Kropina metrics completely via their navigation expressions or via $α$ and $β$ respectively.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.