-
Minimum Reduced-Order Models via Causal Inference
Authors:
Nan Chen,
Honghu Liu
Abstract:
Enhancing the sparsity of data-driven reduced-order models (ROMs) has gained increasing attention in recent years. In this work, we analyze an efficient approach to identifying skillful ROMs with a sparse structure using an information-theoretic indicator called causation entropy. The causation entropy quantifies in a statistical way the additional contribution of each term to the underlying dynam…
▽ More
Enhancing the sparsity of data-driven reduced-order models (ROMs) has gained increasing attention in recent years. In this work, we analyze an efficient approach to identifying skillful ROMs with a sparse structure using an information-theoretic indicator called causation entropy. The causation entropy quantifies in a statistical way the additional contribution of each term to the underlying dynamics beyond the information already captured by all the other terms in the ansatz. By doing so, the causation entropy assesses the importance of each term to the dynamics before a parameter estimation procedure is performed. Thus, the approach can be utilized to eliminate terms with little dynamic impact, leading to a parsimonious structure that retains the essential physics. To circumvent the difficulty of estimating high-dimensional probability density functions (PDFs) involved in the causation entropy computation, we leverage Gaussian approximations for such PDFs, which are demonstrated to be sufficient even in the presence of highly non-Gaussian dynamics. The effectiveness of the approach is illustrated by the Kuramoto-Sivashinsky equation by building sparse causation-based ROMs for various purposes, such as recovering long-term statistics and inferring unobserved dynamics via data assimilation with partial observations.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
ScoreFusion: fusing score-based generative models via Kullback-Leibler barycenters
Authors:
Hao Liu,
Junze,
Ye,
Jose Blanchet,
Nian Si
Abstract:
We study the problem of fusing pre-trained (auxiliary) generative models to enhance the training of a target generative model. We propose using KL-divergence weighted barycenters as an optimal fusion mechanism, in which the barycenter weights are optimally trained to minimize a suitable loss for the target population. While computing the optimal KL-barycenter weights can be challenging, we demonst…
▽ More
We study the problem of fusing pre-trained (auxiliary) generative models to enhance the training of a target generative model. We propose using KL-divergence weighted barycenters as an optimal fusion mechanism, in which the barycenter weights are optimally trained to minimize a suitable loss for the target population. While computing the optimal KL-barycenter weights can be challenging, we demonstrate that this process can be efficiently executed using diffusion score training when the auxiliary generative models are also trained based on diffusion score methods. Moreover, we show that our fusion method has a dimension-free sample complexity in total variation distance provided that the auxiliary models are well fitted for their own task and the auxiliary tasks combined capture the target well. The main takeaway of our method is that if the auxiliary models are well-trained and can borrow features from each other that are present in the target, our fusion method significantly improves the training of generative models. We provide a concise computational implementation of the fusion algorithm, and validate its efficiency in the low-data regime with numerical experiments involving mixtures models and image datasets.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Stochastic First-Order Methods with Non-smooth and Non-Euclidean Proximal Terms for Nonconvex High-Dimensional Stochastic Optimization
Authors:
Yue Xie,
Jiawen Bi,
Hongcheng Liu
Abstract:
When the nonconvex problem is complicated by stochasticity, the sample complexity of stochastic first-order methods may depend linearly on the problem dimension, which is undesirable for large-scale problems. In this work, we propose dimension-insensitive stochastic first-order methods (DISFOMs) to address nonconvex optimization with expected-valued objective function. Our algorithms allow for non…
▽ More
When the nonconvex problem is complicated by stochasticity, the sample complexity of stochastic first-order methods may depend linearly on the problem dimension, which is undesirable for large-scale problems. In this work, we propose dimension-insensitive stochastic first-order methods (DISFOMs) to address nonconvex optimization with expected-valued objective function. Our algorithms allow for non-Euclidean and non-smooth distance functions as the proximal terms. Under mild assumptions, we show that DISFOM using minibatches to estimate the gradient enjoys sample complexity of $ \mathcal{O} ( (\log d) / ε^4 ) $ to obtain an $ε$-stationary point. Furthermore, we prove that DISFOM employing variance reduction can sharpen this bound to $\mathcal{O} ( (\log d)^{2/3}/ε^{10/3} )$, which perhaps leads to the best-known sample complexity result in terms of $d$. We provide two choices of the non-smooth distance functions, both of which allow for closed-form solutions to the proximal step. Numerical experiments are conducted to illustrate the dimension insensitive property of the proposed frameworks.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Authors:
Tim Tsz-Kit Lau,
Weijian Li,
Chenwei Xu,
Han Liu,
Mladen Kolar
Abstract:
Modern deep neural networks often require distributed training with many workers due to their large size. As worker numbers increase, communication overheads become the main bottleneck in data-parallel minibatch stochastic gradient methods with per-iteration gradient synchronization. Local gradient methods like Local SGD reduce communication by only syncing after several local steps. Despite under…
▽ More
Modern deep neural networks often require distributed training with many workers due to their large size. As worker numbers increase, communication overheads become the main bottleneck in data-parallel minibatch stochastic gradient methods with per-iteration gradient synchronization. Local gradient methods like Local SGD reduce communication by only syncing after several local steps. Despite understanding their convergence in i.i.d. and heterogeneous settings and knowing the importance of batch sizes for efficiency and generalization, optimal local batch sizes are difficult to determine. We introduce adaptive batch size strategies for local gradient methods that increase batch sizes adaptively to reduce minibatch gradient variance. We provide convergence guarantees under homogeneous data conditions and support our claims with image classification experiments, demonstrating the effectiveness of our strategies in training and generalization.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
The Onsager principle and physics preserving numerical schemes
Authors:
Huangxin Chen,
Hailiang Liu,
Xianmin Xu
Abstract:
We present a natural framework for constructing energy-stable time discretization schemes. By leveraging the Onsager principle, we demonstrate its efficacy in formulating partial differential equation models for diverse gradient flow systems. Furthermore, this principle provides a robust basis for develo** numerical schemes that uphold crucial physical properties. Within this framework, several…
▽ More
We present a natural framework for constructing energy-stable time discretization schemes. By leveraging the Onsager principle, we demonstrate its efficacy in formulating partial differential equation models for diverse gradient flow systems. Furthermore, this principle provides a robust basis for develo** numerical schemes that uphold crucial physical properties. Within this framework, several widely used schemes emerge naturally, showing its versatility and applicability.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
On the splitting of surfaces in motivic stable homotopy category
Authors:
Haoyang Liu
Abstract:
Let $k$ be a field and $X$ be a smooth projective surface over $k$ with a rational point, we discuss the condition of splitting off the top cell for the motivic stable homotopy type of $X$. We also study some outlying examples, such as K3 surfaces.
Let $k$ be a field and $X$ be a smooth projective surface over $k$ with a rational point, we discuss the condition of splitting off the top cell for the motivic stable homotopy type of $X$. We also study some outlying examples, such as K3 surfaces.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Stabler Neo-Hookean Simulation: Absolute Eigenvalue Filtering for Projected Newton
Authors:
Honglin Chen,
Hsueh-Ti Derek Liu,
David I. W. Levin,
Changxi Zheng,
Alec Jacobson
Abstract:
Volume-preserving hyperelastic materials are widely used to model near-incompressible materials such as rubber and soft tissues. However, the numerical simulation of volume-preserving hyperelastic materials is notoriously challenging within this regime due to the non-convexity of the energy function. In this work, we identify the pitfalls of the popular eigenvalue clam** strategy for projecting…
▽ More
Volume-preserving hyperelastic materials are widely used to model near-incompressible materials such as rubber and soft tissues. However, the numerical simulation of volume-preserving hyperelastic materials is notoriously challenging within this regime due to the non-convexity of the energy function. In this work, we identify the pitfalls of the popular eigenvalue clam** strategy for projecting Hessian matrices to positive semi-definiteness during Newton's method. We introduce a novel eigenvalue filtering strategy for projected Newton's method to stabilize the optimization of Neo-Hookean energy and other volume-preserving variants under high Poisson's ratio (near 0.5) and large initial volume change. Our method only requires a single line of code change in the existing projected Newton framework, while achieving significant improvement in both stability and convergence speed. We demonstrate the effectiveness and efficiency of our eigenvalue projection scheme on a variety of challenging examples and over different deformations on a large dataset.
△ Less
Submitted 21 June, 2024; v1 submitted 9 June, 2024;
originally announced June 2024.
-
Splitting of abelian varieties in motivic stable homotopy category
Authors:
Haoyang Liu
Abstract:
In this paper, we discuss the motivic stable homotopy type of abelian varieties. For an abelian variety over a field $k$ with a rational point, it always splits off a top-dimensional cell in motivic stable homotopy category $\text{SH}(k)$. Let $k = \mathbb{R}$, there is a concrete splitting which is determined by the motive of X and the real points $X(\mathbb{R})$ in…
▽ More
In this paper, we discuss the motivic stable homotopy type of abelian varieties. For an abelian variety over a field $k$ with a rational point, it always splits off a top-dimensional cell in motivic stable homotopy category $\text{SH}(k)$. Let $k = \mathbb{R}$, there is a concrete splitting which is determined by the motive of X and the real points $X(\mathbb{R})$ in $\text{SH}(\mathbb{R})_\mathbb{Q}$. We will also discuss this splitting from a viewpoint of the Chow-Witt correspondences.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Rainbow cycles through specified vertices
Authors:
Henry Liu
Abstract:
An edge-coloured cycle is rainbow if the edges have distinct colours. Let $G$ be a graph such that any $k$ vertices lie in a cycle of $G$. The $k$-rainbow cycle index of $G$, denoted by $crx_k(G)$, is the minimum number of colours required to colour the edges of $G$ such that, for every set $S$ of $k$ vertices in $G$, there exists a rainbow cycle in $G$ containing $S$. In this paper, we will first…
▽ More
An edge-coloured cycle is rainbow if the edges have distinct colours. Let $G$ be a graph such that any $k$ vertices lie in a cycle of $G$. The $k$-rainbow cycle index of $G$, denoted by $crx_k(G)$, is the minimum number of colours required to colour the edges of $G$ such that, for every set $S$ of $k$ vertices in $G$, there exists a rainbow cycle in $G$ containing $S$. In this paper, we will first prove some results about the parameter $crx_k(G)$ for general graphs $G$. One of the results is a classification of all graphs $G$ such that $crx_k(G)=e(G)$, for $k=1,2$. We will also determine $crx_k(G)$ for some specific graphs $G$, including wheels, complete graphs, complete bipartite and multipartite graphs, and discrete cubes.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Determining state space anomalies in mean field games
Authors:
Hongyu Liu,
Catharine W. K. Lo
Abstract:
In this paper, we are concerned with the inverse problem of determining anomalies in the state space associated with the stationary mean field game (MFG) system. We establish novel unique identifiability results for the intrinsic structure of these anomalies in mean field games systems, including their topological structure and parameter configurations, in several general scenarios of practical in…
▽ More
In this paper, we are concerned with the inverse problem of determining anomalies in the state space associated with the stationary mean field game (MFG) system. We establish novel unique identifiability results for the intrinsic structure of these anomalies in mean field games systems, including their topological structure and parameter configurations, in several general scenarios of practical interest, including traffic flow, market economics and epidemics. To the best of our knowledge, this is the first work that considers anomalies in the state space for the nonlinear coupled MFG system.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Decoding a mean field game by the Cauchy data around its unknown stationary states
Authors:
Hongyu Liu,
Catharine W. K. Lo,
Shen Zhang
Abstract:
In recent years, mean field games (MFGs) have garnered considerable attention and emerged as a dynamic and actively researched field across various domains, including economics, social sciences, finance, and transportation. The inverse design and decoding of MFGs offer valuable means to extract information from observed data and gain insights into the intricate underlying dynamics and strategies o…
▽ More
In recent years, mean field games (MFGs) have garnered considerable attention and emerged as a dynamic and actively researched field across various domains, including economics, social sciences, finance, and transportation. The inverse design and decoding of MFGs offer valuable means to extract information from observed data and gain insights into the intricate underlying dynamics and strategies of these complex physical systems. This paper presents a novel approach to the study of inverse problems in MFGs by analyzing the Cauchy data around their unknown stationary states. This study distinguishes itself from existing inverse problem investigations in three key significant aspects: Firstly, we consider MFG problems in a highly general form. Secondly, we address the technical challenge of the probability measure constraint by utilizing Cauchy data in our inverse problem study. Thirdly, we enhance existing high order linearization methods by introducing a novel approach that involves conducting linearization around non-trivial stationary states of the MFG system, which are not a-priori known. These contributions provide new insights and offer promising avenues for studying inverse problems for MFGs. By unraveling the hidden structure of MFGs, researchers and practitioners can make informed decisions, optimize system performance, and address real-world challenges more effectively.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Inference under covariate-adaptive randomization with many strata
Authors:
Jiahui Xin,
Hanzhong Liu,
Wei Ma
Abstract:
Covariate-adaptive randomization is widely employed to balance baseline covariates in interventional studies such as clinical trials and experiments in development economics. Recent years have witnessed substantial progress in inference under covariate-adaptive randomization with a fixed number of strata. However, concerns have been raised about the impact of a large number of strata on its design…
▽ More
Covariate-adaptive randomization is widely employed to balance baseline covariates in interventional studies such as clinical trials and experiments in development economics. Recent years have witnessed substantial progress in inference under covariate-adaptive randomization with a fixed number of strata. However, concerns have been raised about the impact of a large number of strata on its design and analysis, which is a common scenario in practice, such as in multicenter randomized clinical trials. In this paper, we propose a general framework for inference under covariate-adaptive randomization, which extends the seminal works of Bugni et al. (2018, 2019) by allowing for a diverging number of strata. Furthermore, we introduce a novel weighted regression adjustment that ensures efficiency improvement. On top of establishing the asymptotic theory, practical algorithms for handling situations involving an extremely large number of strata are also developed. Moreover, by linking design balance and inference robustness, we highlight the advantages of stratified block randomization, which enforces better covariate balance within strata compared to simple randomization. This paper offers a comprehensive landscape of inference under covariate-adaptive randomization, spanning from fixed to diverging to extremely large numbers of strata.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Restarted Primal-Dual Hybrid Conjugate Gradient Method for Large-Scale Quadratic Programming
Authors:
Yicheng Huang,
Wanyu Zhang,
Hongpei Li,
Weihan Xue,
Dongdong Ge,
Huikang Liu,
Yinyu Ye
Abstract:
Convex quadratic programming (QP) is an essential class of optimization problems with broad applications across various fields. Traditional QP solvers, typically based on simplex or barrier methods, face significant scalability challenges. In response to these limitations, recent research has shifted towards matrix-free first-order methods to enhance scalability in QP. Among these, the restarted a…
▽ More
Convex quadratic programming (QP) is an essential class of optimization problems with broad applications across various fields. Traditional QP solvers, typically based on simplex or barrier methods, face significant scalability challenges. In response to these limitations, recent research has shifted towards matrix-free first-order methods to enhance scalability in QP. Among these, the restarted accelerated primal-dual hybrid gradient (rAPDHG) method, proposed by H.Lu(2023), has gained notable attention due to its linear convergence rate to an optimal solution and its straightforward implementation on Graphics Processing Units (GPUs). Building on this framework, this paper introduces a restarted primal-dual hybrid conjugate gradient (PDHCG) method, which incorporates conjugate gradient (CG) techniques to address the primal subproblems inexactly. We demonstrate that PDHCG maintains a linear convergence rate with an improved convergence constant and is also straightforward to implement on GPUs. Extensive numerical experiments affirm that, compared to rAPDHG, our method could significantly reduce the number of iterations required to achieve the desired accuracy and offer a substantial performance improvement in large-scale problems. These findings highlight the significant potential of our proposed PDHCG method to boost both the efficiency and scalability of solving complex QP challenges.
△ Less
Submitted 25 May, 2024;
originally announced May 2024.
-
Iterative Thresholding Methods for Longest Minimal Length Partitions
Authors:
Shilong Hu,
Hao Liu,
Dong Wang
Abstract:
In this paper, we introduce two iterative methods for longest minimal length partition problem, which asks whether the disc (ball) is the set maximizing the total perimeter of the shortest partition that divides the total region into sub-regions with given volume proportions, under a volume constraint. The objective functional is approximated by a short-time heat flow using indicator functions of…
▽ More
In this paper, we introduce two iterative methods for longest minimal length partition problem, which asks whether the disc (ball) is the set maximizing the total perimeter of the shortest partition that divides the total region into sub-regions with given volume proportions, under a volume constraint. The objective functional is approximated by a short-time heat flow using indicator functions of regions and Gaussian convolution. The problem is then represented as a constrained max-min optimization problem. Auction dynamics is used to find the shortest partition in a fixed region, and threshold dynamics is used to update the region. Numerical experiments in two-dimensional and three-dimensional cases are shown with different numbers of partitions, unequal volume proportions, and different initial shapes. The results of both methods are consistent with the conjecture that the disc in two dimensions and the ball in three dimensions are the solution of the longest minimal length partition problem.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
A conservative relaxation Crank-Nicolson finite element method for the Schrödinger-Poisson equation
Authors:
Huini Liu,
Nianyu Yi,
Peimeng Yin
Abstract:
In this paper, we propose a novel mass and energy conservative relaxation Crank-Nicolson finite element method for the Schrödinger-Poisson equation. Utilizing only a single auxiliary variable, we simultaneously reformulate the distinct nonlinear terms present in both the Schrödinger equation and the Poisson equation into their equivalent expressions, constructing an equivalent system to the origin…
▽ More
In this paper, we propose a novel mass and energy conservative relaxation Crank-Nicolson finite element method for the Schrödinger-Poisson equation. Utilizing only a single auxiliary variable, we simultaneously reformulate the distinct nonlinear terms present in both the Schrödinger equation and the Poisson equation into their equivalent expressions, constructing an equivalent system to the original Schrödinger-Poisson equation. Our proposed scheme, derived from this new system, operates linearly and bypasses the need to solve the nonlinear coupled equation, thus eliminating the requirement for iterative techniques. We in turn rigorously derive error estimates for the proposed scheme, demonstrating second-order accuracy in time and $(k+1)$th order accuracy in space when employing polynomials of degree up to $k$. Numerical experiments validate the accuracy and effectiveness of our method and emphasize its conservation properties over long-time simulations.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Invariance of elliptic genus under wall-crossing
Authors:
Henry Liu
Abstract:
Wall-crossing formulas for various flavors of elliptic genus can be obtained using master spaces. We give a topological criterion which implies that such wall-crossing formulas are trivial. Applications are given for: GIT quotients, following Thaddeus; moduli of sheaves, following Mochizuki; Donaldson-Thomas and Vafa-Witten theory, following Joyce and Tanaka-Thomas respectively.
Wall-crossing formulas for various flavors of elliptic genus can be obtained using master spaces. We give a topological criterion which implies that such wall-crossing formulas are trivial. Applications are given for: GIT quotients, following Thaddeus; moduli of sheaves, following Mochizuki; Donaldson-Thomas and Vafa-Witten theory, following Joyce and Tanaka-Thomas respectively.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
A new class of Carleson measures and integral operators on Bergman spaces
Authors:
Hicham Arroussi,
Huijie Liu,
Cezhong Tong,
Zicong Yang
Abstract:
Let $n$ be a positive integer and $\mathbf{g}=(g_0,g_1,\cdots,g_{n-1})$, with $g_k\in H(\mathbb{D})$ for $k=0,1,\cdots,n-1$. Let $I_{\mathbf{g}}^{(n)}$ be the generalized Volterra-type operators on $H(\mathbb{C})$, which is represented as $$ I_{\mathbf{g}}^{(n)}f=I^n\left(fg_0+f'g_1+\cdots+f^{(n-1)}g_{n-1}\right), $$ where $I$ denotes the integration operator $$(If)(z)=\int_0^zf(w)dw,$$ and $I^n$…
▽ More
Let $n$ be a positive integer and $\mathbf{g}=(g_0,g_1,\cdots,g_{n-1})$, with $g_k\in H(\mathbb{D})$ for $k=0,1,\cdots,n-1$. Let $I_{\mathbf{g}}^{(n)}$ be the generalized Volterra-type operators on $H(\mathbb{C})$, which is represented as $$ I_{\mathbf{g}}^{(n)}f=I^n\left(fg_0+f'g_1+\cdots+f^{(n-1)}g_{n-1}\right), $$ where $I$ denotes the integration operator $$(If)(z)=\int_0^zf(w)dw,$$ and $I^n$ is the $n$th iteration of $I$. This operator is a generalization of the operator that was introduced by Chalmoukis in \cite{Cn}. In this paper, we study the boundedness and compactness of the operator $I_{\mathbf{g}}^{(n)}$ acting on Bergman spaces to another. As a consequence of these characterizations, we obtain conditions for certain linear differential equations to have solutions in Bergman spaces. Moreover, we study the boundedness, compactness and Hilbert-Schmidtness of the following sums of generalized weighted composition operators: Let $\mathbf{u}=(u_0,u_1,\cdots,u_n)$ with $u_k\in H(\mathbb{D})$ for $0\leq k\leq n$ and $\varphi$ be an analytic self-map of $\mathbb{D}.$ The sums of generalized weighted composition operators is defined by $$L_{\mathbf{u},\varphi}^{(n)}=\sum_{k=0}^nW_{u_k,\varphi}^{(k)},$$ where $$W_{u_k,\varphi}^{(k)}f=u_k\cdot f^{(k)}\circ\varphi.$$ Our approach involves the study of new class of Sobolev-Carleson measures for classical Bergman spaces on unit disk which appears in the first main Theorems \ref{Theorem1.1} and \ref{Theorem1.2}.
△ Less
Submitted 11 June, 2024; v1 submitted 19 May, 2024;
originally announced May 2024.
-
Subgraphs of random graphs in hereditary families
Authors:
Alexander Clifton,
Hong Liu,
Letícia Mattos,
Michael Zheng
Abstract:
For a graph $G$ and a hereditary property $\mathcal{P}$, let $\text{ex}(G,\mathcal{P})$ denote the maximum number of edges of a subgraph of $G$ that belongs to $\mathcal{P}$. We prove that for every non-trivial hereditary property $\mathcal{P}$ such that $L \notin \mathcal{P}$ for some bipartite graph $L$ and for every fixed $p \in (0,1)$ we have \[\text{ex}(G(n,p),\mathcal{P}) \le n^{2-\varepsilo…
▽ More
For a graph $G$ and a hereditary property $\mathcal{P}$, let $\text{ex}(G,\mathcal{P})$ denote the maximum number of edges of a subgraph of $G$ that belongs to $\mathcal{P}$. We prove that for every non-trivial hereditary property $\mathcal{P}$ such that $L \notin \mathcal{P}$ for some bipartite graph $L$ and for every fixed $p \in (0,1)$ we have \[\text{ex}(G(n,p),\mathcal{P}) \le n^{2-\varepsilon}\] with high probability, for some constant $\varepsilon = \varepsilon(\mathcal{P})>0$. This answers a question of Alon, Krivelevich and Samotij.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
New lower bound on ball packing density in high-dimensional hyperbolic spaces
Authors:
Irene Gil Fernández,
Jaehoon Kim,
Hong Liu,
Oleg Pikhurko
Abstract:
We present a new lower bound on the Bowen-Radin maximal density of radius-R ball packings in the m-dimensional hyperbolic space, improving on the basic covering bound by factor Ω(m(R+\ln m)) as m tends to infinity. This is done by applying the recent theorem of Campos, Jenssen, Michelen and Sahasrabudhe on independent sets in graphs with sparse neighbourhoods.
We present a new lower bound on the Bowen-Radin maximal density of radius-R ball packings in the m-dimensional hyperbolic space, improving on the basic covering bound by factor Ω(m(R+\ln m)) as m tends to infinity. This is done by applying the recent theorem of Campos, Jenssen, Michelen and Sahasrabudhe on independent sets in graphs with sparse neighbourhoods.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Some Computational Results on Koszul-Vinberg Cochain Complexes
Authors:
Hanwen Liu,
Jun Zhang
Abstract:
An affine connection is said to be flat if its curvature tensor vanishes identically. Koszul-Vinberg (KV for abbreviation) cohomology has been invoked to study the deformation theory of flat and torsion-free affine connections on tangent bundle. In this Note, we compute explicitly the differentials of various specific KV cochains, and study their relation to classical objects in information geomet…
▽ More
An affine connection is said to be flat if its curvature tensor vanishes identically. Koszul-Vinberg (KV for abbreviation) cohomology has been invoked to study the deformation theory of flat and torsion-free affine connections on tangent bundle. In this Note, we compute explicitly the differentials of various specific KV cochains, and study their relation to classical objects in information geometry, including deformations associated with projective and dual-projective transformations of a flat and torsion-free affine connection. As an application, we also give a simple yet non-trivial example of a KV algebra of which second cohomology group does not vanish.
△ Less
Submitted 28 April, 2024;
originally announced April 2024.
-
A characterization of compactness via bilinear $T1$ theorem
Authors:
Mingming Cao,
Honghai Liu,
Zengyan Si,
Kôzô Yabuta
Abstract:
We establish a bilinear $T1$ theorem to characterize the weighted compactness of bilinear Calderón--Zygmund operators. Let $T$ be a bilinear operator associated with a standard bilinear Calderón--Zygmund kernel. We demonstrate that $T$ can be extended to a compact bilinear operator from $L^{p_1}(w_1^{p_1}) \times L^{p_2}(w_2^{p_2})$ to $L^p(w^p)$ for all exponents…
▽ More
We establish a bilinear $T1$ theorem to characterize the weighted compactness of bilinear Calderón--Zygmund operators. Let $T$ be a bilinear operator associated with a standard bilinear Calderón--Zygmund kernel. We demonstrate that $T$ can be extended to a compact bilinear operator from $L^{p_1}(w_1^{p_1}) \times L^{p_2}(w_2^{p_2})$ to $L^p(w^p)$ for all exponents $\frac{1}{p} = \frac{1}{p_1} + \frac{1}{p_2}$ with $1<p_1, p_2< \infty$ and for all weights $(w_1, w_2) \in A_{(p_1, p_2)}$ if and only if the following conditions hold: (i) $T$ is associated with a compact bilinear Calderón--Zygmund kernel, (ii) $T$ satisfies the weak compactness property, and (iii) $T(1,1), T^{*1}(1,1), T^{*2}(1,1) \in \mathrm{CMO}(\mathbb{R}^n)$.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
FCNCP: A Coupled Nonnegative CANDECOMP/PARAFAC Decomposition Based on Federated Learning
Authors:
Yukai Cai,
Hang Liu,
Xiulin Wang,
Hong** Li,
Ziyi Wang,
Chuanshuai Yang,
Fengyu Cong
Abstract:
In the field of brain science, data sharing across servers is becoming increasingly challenging due to issues such as industry competition, privacy security, and administrative procedure policies and regulations. Therefore, there is an urgent need to develop new methods for data analysis and processing that enable scientific collaboration without data sharing. In view of this, this study proposes…
▽ More
In the field of brain science, data sharing across servers is becoming increasingly challenging due to issues such as industry competition, privacy security, and administrative procedure policies and regulations. Therefore, there is an urgent need to develop new methods for data analysis and processing that enable scientific collaboration without data sharing. In view of this, this study proposes to study and develop a series of efficient non-negative coupled tensor decomposition algorithm frameworks based on federated learning called FCNCP for the EEG data arranged on different servers. It combining the good discriminative performance of tensor decomposition in high-dimensional data representation and decomposition, the advantages of coupled tensor decomposition in cross-sample tensor data analysis, and the features of federated learning for joint modelling in distributed servers. The algorithm utilises federation learning to establish coupling constraints for data distributed across different servers. In the experiments, firstly, simulation experiments are carried out using simulated data, and stable and consistent decomposition results are obtained, which verify the effectiveness of the proposed algorithms in this study. Then the FCNCP algorithm was utilised to decompose the fifth-order event-related potential (ERP) tensor data collected by applying proprioceptive stimuli on the left and right hands. It was found that contralateral stimulation induced more symmetrical components in the activation areas of the left and right hemispheres. The conclusions drawn are consistent with the interpretations of related studies in cognitive neuroscience, demonstrating that the method can efficiently process higher-order EEG data and that some key hidden information can be preserved.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
On inverse problems in multi-population aggregation models
Authors:
Yuhan Li,
Hongyu Liu,
Catharine W. K. Lo
Abstract:
This paper focuses on inverse problems arising in studying multi-population aggregations. The goal is to reconstruct the diffusion coefficient, advection coefficient, and interaction kernels of the aggregation system, which characterize the dynamics of different populations. In the theoretical analysis of the physical setup, it is crucial to ensure non-negativity of solutions. To address this, we…
▽ More
This paper focuses on inverse problems arising in studying multi-population aggregations. The goal is to reconstruct the diffusion coefficient, advection coefficient, and interaction kernels of the aggregation system, which characterize the dynamics of different populations. In the theoretical analysis of the physical setup, it is crucial to ensure non-negativity of solutions. To address this, we employ the high-order variation method and introduce modifications to the systems. Additionally, we propose a novel approach called transformative asymptotic technique that enables the recovery of the diffusion coefficient preceding the Laplace operator, presenting a pioneering method for this type of problems. Through these techniques, we offer comprehensive insights into the unique identifiability aspect of inverse problems associated with multi-population aggregation models.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Simultaneously Cloaking Electric and Hydrodynamic Fields via Electro-osmosis
Authors:
Hongyu Liu,
Zhi-Qiang Miao,
Guang-Hui Zheng
Abstract:
In this paper, we develop a general mathematical framework for the electro-osmosis problem to design simultaneous microscale electric and hydrodynamic cloaking in a Hele-Shaw configuration. A novel approach to achieving simultaneously cloaking both the electric and flow fields through a combination of scattering-cancellation technology and an electro-osmosis effect is proposed. In the design, the…
▽ More
In this paper, we develop a general mathematical framework for the electro-osmosis problem to design simultaneous microscale electric and hydrodynamic cloaking in a Hele-Shaw configuration. A novel approach to achieving simultaneously cloaking both the electric and flow fields through a combination of scattering-cancellation technology and an electro-osmosis effect is proposed. In the design, the electric field is manipulated with scattering-cancellation technology while the pressure with electro-osmosis effect. As proof of this concept, the perfect electric and hydrodynamic cloaking conditions are derived for the cloaks with the cross-sectional shape being annulus or confocal ellipses using the layer potential techniques. Furthermore, we also propose an optimization scheme for the design of approximate cloaks within general geometries and prove the well-posedness of the optimization problem. In particular, the conditions that can ensure the simultaneous occurrence of approximate cloaks for general geometries are also established. Our theoretical findings are validated by a variety of numerical results and guide efficiently designing electric-related multiphysics cloaking.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
An MILP-Based Solution Scheme for Factored and Robust Factored Markov Decision Processes
Authors:
Huikang Liu,
Wolfram Wiesemann,
Man-Chung Yue
Abstract:
Factored Markov decision processes (MDPs) are a prominent paradigm within the artificial intelligence community for modeling and solving large-scale MDPs whose rewards and dynamics decompose into smaller, loosely interacting components. Through the use of dynamic Bayesian networks and context-specific independence, factored MDPs can achieve an exponential reduction in the state space of an MDP and…
▽ More
Factored Markov decision processes (MDPs) are a prominent paradigm within the artificial intelligence community for modeling and solving large-scale MDPs whose rewards and dynamics decompose into smaller, loosely interacting components. Through the use of dynamic Bayesian networks and context-specific independence, factored MDPs can achieve an exponential reduction in the state space of an MDP and thus scale to problem sizes that are beyond the reach of classical MDP algorithms. However, factored MDPs are typically solved using custom-designed algorithms that can require meticulous implementations and considerable fine-tuning. In this paper, we propose a mathematical programming approach to solving factored MDPs. In contrast to existing solution schemes, our approach leverages off-the-shelf solvers, which allows for a streamlined implementation and maintenance; it effectively capitalizes on the factored structure present in both state and action spaces; and it readily extends to the largely unexplored class of robust factored MDPs, whose transition kernels are only known to reside in a pre-specified ambiguity set. Our numerical experiments demonstrate the potential of our approach.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Optimal bounds on the polynomial Schur's theorem
Authors:
Jaehoon Kim,
Hong Liu,
Péter Pál Pach
Abstract:
Liu, Pach and Sándor recently characterized all polynomials $p(z)$ such that the equation $x+y=p(z)$ is $2$-Ramsey, that is, any $2$-coloring of $\mathbb{N}$ contains infinitely many monochromatic solutions for $x+y=p(z)$. In this paper, we find asymptotically tight bounds for the following two quantitative questions.
$\bullet$ For $n\in \mathbb{N}$, what is the longest interval $[n,f(n)]$ of na…
▽ More
Liu, Pach and Sándor recently characterized all polynomials $p(z)$ such that the equation $x+y=p(z)$ is $2$-Ramsey, that is, any $2$-coloring of $\mathbb{N}$ contains infinitely many monochromatic solutions for $x+y=p(z)$. In this paper, we find asymptotically tight bounds for the following two quantitative questions.
$\bullet$ For $n\in \mathbb{N}$, what is the longest interval $[n,f(n)]$ of natural numbers which admits a $2$-coloring with no monochromatic solutions of $x+y=p(z)$?
$\bullet$ For $n\in \mathbb{N}$ and a $2$-coloring of the first $n$ integers $[n]$, what is the smallest possible number $g(n)$ of monochromatic solutions of $x+y=p(z)$?
Our theorems determine $f(n)$ up to a multiplicative constant $2+o(1)$, and determine the asymptotics for $g(n)$.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Piercing independent sets in graphs without large induced matching
Authors:
Jiangdong Ai,
Hong Liu,
Zixiang Xu,
Qiang Zhou
Abstract:
Given a graph $G$, denote by $h(G)$ the smallest size of a subset of $V(G)$ which intersects every maximum independent set of $G$. We prove that any graph $G$ without induced matching of size $t$ satisfies $h(G)\le ω(G)^{3t-3+o(1)}$. This resolves a conjecture of Hajebi, Li and Spirkl (Hitting all maximum stable sets in $P_{5}$-free graphs, JCTB 2024).
Given a graph $G$, denote by $h(G)$ the smallest size of a subset of $V(G)$ which intersects every maximum independent set of $G$. We prove that any graph $G$ without induced matching of size $t$ satisfies $h(G)\le ω(G)^{3t-3+o(1)}$. This resolves a conjecture of Hajebi, Li and Spirkl (Hitting all maximum stable sets in $P_{5}$-free graphs, JCTB 2024).
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Beyond chromatic threshold via the $(p,q)$-theorem, and a sharp blow-up phenomenon
Authors:
Hong Liu,
Chong Shangguan,
Jozef Skokan,
Zixiang Xu
Abstract:
We establish a novel connection between the well-known chromatic threshold problem in extremal combinatorics and the celebrated $(p,q)$-theorem in discrete geometry. In particular, for a graph $G$ with bounded clique number and a natural density condition, we prove a $(p,q)$-theorem for an abstract convexity space associated with $G$. Our result strengthens those of Thomassen and Nikiforov on the…
▽ More
We establish a novel connection between the well-known chromatic threshold problem in extremal combinatorics and the celebrated $(p,q)$-theorem in discrete geometry. In particular, for a graph $G$ with bounded clique number and a natural density condition, we prove a $(p,q)$-theorem for an abstract convexity space associated with $G$. Our result strengthens those of Thomassen and Nikiforov on the chromatic threshold of cliques. Our $(p,q)$-theorem can also be viewed as a $χ$-boundedness result for (what we call) ultra maximal $K_r$-free graphs.
We further show that the graphs under study are blow-ups of constant size graphs, improving a result of Oberkampf and Schacht on homomorphism threshold of cliques. Our result unravels the cause underpinning such a blow-up phenomenon, differentiating the chromatic and homomorphism threshold problems for cliques. It implies that for the homomorphism threshold problem, rather than the minimum degree condition usually considered in the literature, the decisive factor is a clique density condition on co-neighborhoods of vertices. More precisely, we show that if an $n$-vertex $K_{r}$-free graph $G$ satisfies that the common neighborhood of every pair of non-adjacent vertices induces a subgraph with $K_{r-2}$-density at least $\varepsilon>0$, then $G$ must be a blow-up of some $K_r$-free graph $F$ on at most $2^{O(\frac{r}{\varepsilon}\log\frac{1}{\varepsilon})}$ vertices. Furthermore, this single exponential bound is optimal. We construct examples with no $K_r$-free homomorphic image of size smaller than $2^{Ω_r(\frac{1}{\varepsilon})}$.
△ Less
Submitted 27 May, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Tensor Neural Network Based Machine Learning Method for Elliptic Multiscale Problems
Authors:
Zhongshuo Lin,
Haochen Liu,
Hehu Xie
Abstract:
In this paper, we introduce a type of tensor neural network based machine learning method to solve elliptic multiscale problems. Based on the special structure, we can do the direct and highly accurate high dimensional integrations for the tensor neural network functions without Monte Carlo process. Here, with the help of homogenization techniques, the multiscale problem is first transformed to th…
▽ More
In this paper, we introduce a type of tensor neural network based machine learning method to solve elliptic multiscale problems. Based on the special structure, we can do the direct and highly accurate high dimensional integrations for the tensor neural network functions without Monte Carlo process. Here, with the help of homogenization techniques, the multiscale problem is first transformed to the high dimensional limit problem with reasonable accuracy. Then, based on the tensor neural network, we design a type of machine learning method to solve the derived high dimensional limit problem. The proposed method in this paper brings a new way to design numerical methods for computing more general multiscale problems with high accuracy. Several numerical examples are also provided to validate the accuracy of the proposed numerical methods.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
Positivity-preserving and energy-dissipating discontinuous Galerkin methods for nonlinear nonlocal Fokker-Planck equations
Authors:
José A. Carrillo,
Hailiang Liu,
Hui Yu
Abstract:
This paper is concerned with structure-preserving numerical approximations for a class of nonlinear nonlocal Fokker-Planck equations, which admit a gradient flow structure and find application in diverse contexts. The solutions, representing density distributions, must be non-negative and satisfy a specific energy dissipation law. We design an arbitrary high-order discontinuous Galerkin (DG) metho…
▽ More
This paper is concerned with structure-preserving numerical approximations for a class of nonlinear nonlocal Fokker-Planck equations, which admit a gradient flow structure and find application in diverse contexts. The solutions, representing density distributions, must be non-negative and satisfy a specific energy dissipation law. We design an arbitrary high-order discontinuous Galerkin (DG) method tailored for these model problems. Both semi-discrete and fully discrete schemes are shown to admit the energy dissipation law for non-negative numerical solutions. To ensure the preservation of positivity in cell averages at all time steps, we introduce a local flux correction applied to the DDG diffusive flux. Subsequently, a hybrid algorithm is presented, utilizing a positivity-preserving limiter, to generate positive and energy-dissipating solutions. Numerical examples are provided to showcase the high resolution of the numerical solutions and the verified properties of the DG schemes.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
Generalized Ramsey--Turán density for cliques
Authors:
Jun Gao,
Suyun Jiang,
Hong Liu,
Maya Sankar
Abstract:
We study the generalized Ramsey--Turán function $\mathrm{RT}(n,K_s,K_t,o(n))$, which is the maximum possible number of copies of $K_s$ in an $n$-vertex $K_t$-free graph with independence number $o(n)$. The case when $s=2$ was settled by Erd{ő}s, S{ó}s, Bollob{á}s, Hajnal, and Szemerédi in the 1980s. We combinatorially resolve the general case for all $s\ge 3$, showing that the (asymptotic) extrema…
▽ More
We study the generalized Ramsey--Turán function $\mathrm{RT}(n,K_s,K_t,o(n))$, which is the maximum possible number of copies of $K_s$ in an $n$-vertex $K_t$-free graph with independence number $o(n)$. The case when $s=2$ was settled by Erd{ő}s, S{ó}s, Bollob{á}s, Hajnal, and Szemerédi in the 1980s. We combinatorially resolve the general case for all $s\ge 3$, showing that the (asymptotic) extremal graphs for this problem have simple (bounded) structures. In particular, it implies that the extremal structures follow a periodic pattern when $t$ is much larger than $s$. Our results disprove a conjecture of Balogh, Liu, and Sharifzadeh and show that a relaxed version does hold.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
On the positivity of Fourier transform of the stretched Gaußian function
Authors:
Hanwen Liu
Abstract:
The stretched Gaußian function $f(\mathbf{x})=\exp \left(-\|\mathbf{x}\|^s\right)$, as a real function defined on $\mathbb{R}^d$, has found numerous applications in mathematics and physics. For instance, to describe results from spectroscopy or inelastic scattering, the Fourier transform of the stretched Gaußian function is needed. For $s \in(0,2]$, we prove that the Fourier transform of…
▽ More
The stretched Gaußian function $f(\mathbf{x})=\exp \left(-\|\mathbf{x}\|^s\right)$, as a real function defined on $\mathbb{R}^d$, has found numerous applications in mathematics and physics. For instance, to describe results from spectroscopy or inelastic scattering, the Fourier transform of the stretched Gaußian function is needed. For $s \in(0,2]$, we prove that the Fourier transform of $f(\mathbf{x})=\exp \left(-\|\mathbf{x}\|^s\right)$ is everywhere positive on $\mathbb{R}^d$.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Optimal estimate of electromagnetic field concentration between two nearly-touching inclusions in the quasi-static regime
Authors:
Youjun Deng,
Hongyu Liu,
Liyan Zhu
Abstract:
We investigate the electromagnetic field concentration between two nearly-touching inclusions that possess high-contrast electric permittivities in the quasi-static regime. By using layer potential techniques and asymptotic analysis in the low-frequency regime, we derive low-frequency expansions that provide integral representations for the solutions of the Maxwell equations. For the leading-order…
▽ More
We investigate the electromagnetic field concentration between two nearly-touching inclusions that possess high-contrast electric permittivities in the quasi-static regime. By using layer potential techniques and asymptotic analysis in the low-frequency regime, we derive low-frequency expansions that provide integral representations for the solutions of the Maxwell equations. For the leading-order term $\bE_0$ of the asymptotic expansion of the electric field, we prove that it has the blow up order of $ε^{-1} |\ln ε|^{-1}$ within the radial geometry, where $ε$ signifies the asymptotic distance between the inclusions. By delicate analysis of the integral operators involved, we further prove the boundedness of the first-order term $\bE_1$. We also conduct extensive numerical experiments which not only corroborate the theoretical findings but also provide more discoveries on the field concentration in the general geometric setup. Our study provides the first treatment in the literature on field concentration between nearly-touching material inclusions for the full Maxwell system.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
A Low-Rank ADMM Splitting Approach for Semidefinite Programming
Authors:
Qiushi Han,
Chenxi Li,
Zhenwei Lin,
Caihua Chen,
Qi Deng,
Dongdong Ge,
Huikang Liu,
Yinyu Ye
Abstract:
We introduce a new first-order method for solving general semidefinite programming problems, based on the alternating direction method of multipliers (ADMM) and a matrix-splitting technique. Our algorithm has an advantage over the Burer-Monteiro approach as it only involves much easier quadratically regularized subproblems in each iteration. For a linear objective, the subproblems are well-conditi…
▽ More
We introduce a new first-order method for solving general semidefinite programming problems, based on the alternating direction method of multipliers (ADMM) and a matrix-splitting technique. Our algorithm has an advantage over the Burer-Monteiro approach as it only involves much easier quadratically regularized subproblems in each iteration. For a linear objective, the subproblems are well-conditioned quadratic programs that can be efficiently solved by the standard conjugate gradient method. We show that the ADMM algorithm achieves sublinear or linear convergence rates to the KKT solutions under different conditions. Building on this theoretical development, we present LoRADS, a new solver for linear SDP based on the Low-Rank ADMM Splitting approach. LoRADS incorporates several strategies that significantly increase its efficiency. Firstly, it initiates with a warm-start phase that uses the Burer-Monteiro approach. Moreover, motivated by the SDP low-rank theory [So et al. 2008], LoRADS chooses an initial rank of logarithmic order and then employs a dynamic approach to increase the rank. Numerical experiments indicate that LoRADS exhibits promising performance on various SDP problems. A noteworthy achievement of LoRADS is its successful solving of a matrix completion problem with $15,694,167$ constraints and a matrix variable of size $40,000 \times 40,000$ in $351$ seconds.
△ Less
Submitted 25 March, 2024; v1 submitted 14 March, 2024;
originally announced March 2024.
-
Aα-spectral radius and path-factor covered graphs
Authors:
Sizhong Zhou,
Hongxia Liu,
Qiuxiang Bian
Abstract:
Let $α\in[0,1)$, and let $G$ be a connected graph of order $n$ with $n\geq f(α)$, where $f(α)=14$ for $α\in[0,\frac{1}{2}]$, $f(α)=17$ for $α\in(\frac{1}{2},\frac{2}{3}]$, $f(α)=20$ for $α\in(\frac{2}{3},\frac{3}{4}]$ and $f(α)=\frac{5}{1-α}+1$ for $α\in(\frac{3}{4},1)$. A path factor is a spanning subgraph $F$ of $G$ such that every component of $F$ is a path with at least two vertices. Let…
▽ More
Let $α\in[0,1)$, and let $G$ be a connected graph of order $n$ with $n\geq f(α)$, where $f(α)=14$ for $α\in[0,\frac{1}{2}]$, $f(α)=17$ for $α\in(\frac{1}{2},\frac{2}{3}]$, $f(α)=20$ for $α\in(\frac{2}{3},\frac{3}{4}]$ and $f(α)=\frac{5}{1-α}+1$ for $α\in(\frac{3}{4},1)$. A path factor is a spanning subgraph $F$ of $G$ such that every component of $F$ is a path with at least two vertices. Let $k\geq2$ be an integer. A $P_{\geq k}$-factor means a path-factor with each component being a path of order at least $k$. A graph $G$ is called a $P_{\geq k}$-factor covered graph if $G$ has a $P_{\geq k}$-factor containing $e$ for any $e\in E(G)$. Let $A_α(G)=αD(G)+(1-α)A(G)$, where $D(G)$ denotes the diagonal matrix of vertex degrees of $G$ and $A(G)$ denotes the adjacency matrix of $G$. The largest eigenvalue of $A_α(G)$ is called the $A_α$-spectral radius of $G$, which is denoted by $ρ_α(G)$. In this paper, it is proved that $G$ is a $P_{\geq2}$-factor covered graph if $ρ_α(G)>η(n)$, where $η(n)$ is the largest root of $x^{3}-((α+1)n+α-4)x^{2}+(αn^{2}+(α^{2}-2α-1)n-2α+1)x-α^{2}n^{2}+(5α^{2}-3α+2)n-10α^{2}+15α-8=0$. Furthermore, we provide a graph to show that the bound on $A_α$-spectral radius is optimal.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Hilbert Space-Valued LQ Mean Field Games: An Infinite-Dimensional Analysis
Authors:
Hanchao Liu,
Dena Firoozi
Abstract:
This paper presents a comprehensive study of Hilbert space-valued linear-quadratic (LQ) mean field games (MFGs), generalizing the classic LQ mean field game theory to scenarios involving $N$ agent whose dynamics are governed by infinite-dimensional stochastic equations. In this framework, both the state and control processes of each agent take values in separable Hilbert spaces. Moreoever, all age…
▽ More
This paper presents a comprehensive study of Hilbert space-valued linear-quadratic (LQ) mean field games (MFGs), generalizing the classic LQ mean field game theory to scenarios involving $N$ agent whose dynamics are governed by infinite-dimensional stochastic equations. In this framework, both the state and control processes of each agent take values in separable Hilbert spaces. Moreoever, all agents are coupled through the average state of the population which appears in their linear dynamics and quadratic cost functional. Specifically, the dynamics of each agent incorporates an infinite-dimensional noise, namely a $Q$-Wiener process, and an unbounded operator. The diffusion coefficient of each agent also involves the state, control, and the average state processes. We first study the well-posedness of a system of $N$ general coupled infinite-dimensional stochastic evolution equations, which forms the foundation of MFGs in Hilbert spaces. Subsequently, we address the limiting Hilbert space-valued MFG as the number of agents approaches infinity and develop an infinite-dimensional variant of the Nash Certainty Equivalence principle. We characterize a unique Nash equilibrium for the limiting model and demonstrate that the associated best-response strategies constitute an $ε$-Nash equilibrium for the original $N$-player game in Hilbert spaces.
△ Less
Submitted 20 June, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Toughness and Aα-spectral radius in graphs
Authors:
Sizhong Zhou,
Yuli Zhang,
Tao Zhang,
Hongxia Liu
Abstract:
Let $α\in[0,1)$, and let $G$ be a connected graph of order $n$ with $n\geq f(α)$, where $f(α)=6$ for $α\in[0,\frac{2}{3}]$ and $f(α)=\frac{4}{1-α}$ for $α\in(\frac{2}{3},1)$. A graph $G$ is said to be $t$-tough if $|S|\geq tc(G-S)$ for each subset $S$ of $V(G)$ with $c(G-S)\geq2$, where $c(G-S)$ is the number of connected components in $G-S$. The $A_α$-spectral radius of $G$ is denoted by…
▽ More
Let $α\in[0,1)$, and let $G$ be a connected graph of order $n$ with $n\geq f(α)$, where $f(α)=6$ for $α\in[0,\frac{2}{3}]$ and $f(α)=\frac{4}{1-α}$ for $α\in(\frac{2}{3},1)$. A graph $G$ is said to be $t$-tough if $|S|\geq tc(G-S)$ for each subset $S$ of $V(G)$ with $c(G-S)\geq2$, where $c(G-S)$ is the number of connected components in $G-S$. The $A_α$-spectral radius of $G$ is denoted by $ρ_α(G)$. In this paper, it is verified that $G$ is a 1-tough graph unless $G=K_1\vee(K_{n-2}\cup K_1)$ if $ρ_α(G)\geqρ_α(K_1\vee(K_{n-2}\cup K_1))$, where $ρ_α(K_1\vee(K_{n-2}\cup K_1))$ equals the largest root of $x^{3}-((α+1)n+α-3)x^{2}+(αn^{2}+(α^{2}-α-1)n-2α+1)x-α^{2}n^{2}+(3α^{2}-α+1)n-4α^{2}+5α-3=0$. Further, we present an $A_α$-spectral radius condition for a graph to be a $t$-tough graph.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Graph with any rational density and no rich subsets of linear size
Authors:
Seonghyuk Im,
Suyun Jiang,
Hong Liu,
Tuan Tran
Abstract:
A well-known application of the dependent random choice asserts that any $n$-vertex graph $G$ with positive edge density contains a `rich' vertex subset $U$ of size $n^{1-o(1)}$ such that every pair of vertices in $U$ has at least $n^{1-o(1)}$ common neighbors. In 2003, using a beautiful construction on hypercube, Kostochka and Sudakov showed that this is tight: one cannot remove the $o(1)$ terms…
▽ More
A well-known application of the dependent random choice asserts that any $n$-vertex graph $G$ with positive edge density contains a `rich' vertex subset $U$ of size $n^{1-o(1)}$ such that every pair of vertices in $U$ has at least $n^{1-o(1)}$ common neighbors. In 2003, using a beautiful construction on hypercube, Kostochka and Sudakov showed that this is tight: one cannot remove the $o(1)$ terms even if the edge density of $G$ is $1/2$. In this paper, we generalize their result from pairs to tuples. To be precise, we show that given every pair of positive integers $p<q$, there is an $n$-vertex graph $G$ for all sufficiently large $n$ with edge density $p/q$ such that any vertex subset $U$ of size $Ω(n)$ contains $q$ vertices, any $p+1$ of which have $o(n)$ common neighbors. The edge density $p/q$ is best possible. Our construction uses isoperimetry and concentration of measure on high dimensional complex spheres.
△ Less
Submitted 22 February, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Inverse boundary problem for a mean field game system with probability density constraint
Authors:
Hongyu Liu,
Shen Zhang
Abstract:
By following the study in [24], we consider an inverse boundary problem for the mean field game system where a probability density constraint is enforced on the game agents. That is, we consider the case that reflective boundary conditions are enforced and hence the population distribution of the game agents should be treated as a probability measure which preserves both positivity and the total p…
▽ More
By following the study in [24], we consider an inverse boundary problem for the mean field game system where a probability density constraint is enforced on the game agents. That is, we consider the case that reflective boundary conditions are enforced and hence the population distribution of the game agents should be treated as a probability measure which preserves both positivity and the total population. This poses significant challenges for the corresponding inverse problems in constructing suitable ``probing modes" which should fulfill such a probability density constraint. We develop an effective scheme in tackling such a case which is new to the literature.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
PDEformer: Towards a Foundation Model for One-Dimensional Partial Differential Equations
Authors:
Zhanhong Ye,
Xiang Huang,
Leheng Chen,
Hongsheng Liu,
Zidong Wang,
Bin Dong
Abstract:
This paper introduces PDEformer, a neural solver for partial differential equations (PDEs) capable of simultaneously addressing various types of PDEs. We propose to represent the PDE in the form of a computational graph, facilitating the seamless integration of both symbolic and numerical information inherent in a PDE. A graph Transformer and an implicit neural representation (INR) are employed to…
▽ More
This paper introduces PDEformer, a neural solver for partial differential equations (PDEs) capable of simultaneously addressing various types of PDEs. We propose to represent the PDE in the form of a computational graph, facilitating the seamless integration of both symbolic and numerical information inherent in a PDE. A graph Transformer and an implicit neural representation (INR) are employed to generate mesh-free predicted solutions. Following pretraining on data exhibiting a certain level of diversity, our model achieves zero-shot accuracies on benchmark datasets that is comparable to those of specifically trained expert models. Additionally, PDEformer demonstrates promising results in the inverse problem of PDE coefficient recovery.
△ Less
Submitted 30 April, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.
-
Criteria for nilpotency of fusion systems
Authors:
Jie Jian,
Jun Liao,
Heguo Liu
Abstract:
Let $p$ be an odd prime and let $\mathcal{F}$ be a fusion system over a finite $p$-group $P$. A fusion system $\mathcal{F}$ is said to be nilpotent if $\mathcal{F}=\mathcal{F}_{P}(P)$. In this paper we provide new criteria for saturated fusion systems $\mathcal{F}$ to be nilpotent, which can be viewed as extension of the $p$-nilpotency theorem of Glauberman and Thompson for fusion systems attribut…
▽ More
Let $p$ be an odd prime and let $\mathcal{F}$ be a fusion system over a finite $p$-group $P$. A fusion system $\mathcal{F}$ is said to be nilpotent if $\mathcal{F}=\mathcal{F}_{P}(P)$. In this paper we provide new criteria for saturated fusion systems $\mathcal{F}$ to be nilpotent, which can be viewed as extension of the $p$-nilpotency theorem of Glauberman and Thompson for fusion systems attributed to Kessar and Linckelmann.
△ Less
Submitted 18 February, 2024;
originally announced February 2024.
-
AdAdaGrad: Adaptive Batch Size Schemes for Adaptive Gradient Methods
Authors:
Tim Tsz-Kit Lau,
Han Liu,
Mladen Kolar
Abstract:
The choice of batch sizes in minibatch stochastic gradient optimizers is critical in large-scale model training for both optimization and generalization performance. Although large-batch training is arguably the dominant training paradigm for large-scale deep learning due to hardware advances, the generalization performance of the model deteriorates compared to small-batch training, leading to the…
▽ More
The choice of batch sizes in minibatch stochastic gradient optimizers is critical in large-scale model training for both optimization and generalization performance. Although large-batch training is arguably the dominant training paradigm for large-scale deep learning due to hardware advances, the generalization performance of the model deteriorates compared to small-batch training, leading to the so-called "generalization gap" phenomenon. To mitigate this, we investigate adaptive batch size strategies derived from adaptive sampling methods, originally developed only for stochastic gradient descent. Given the significant interplay between learning rates and batch sizes, and considering the prevalence of adaptive gradient methods in deep learning, we emphasize the need for adaptive batch size strategies in these contexts. We introduce AdAdaGrad and its scalar variant AdAdaGradNorm, which progressively increase batch sizes during training, while model updates are performed using AdaGrad and AdaGradNorm. We prove that AdAdaGradNorm converges with high probability at a rate of $\mathscr{O}(1/K)$ to find a first-order stationary point of smooth nonconvex functions within $K$ iterations. AdAdaGrad also demonstrates similar convergence properties when integrated with a novel coordinate-wise variant of our adaptive batch size strategies. We corroborate our theoretical claims by performing image classification experiments, highlighting the merits of the proposed schemes in terms of both training efficiency and model generalization. Our work unveils the potential of adaptive batch size strategies for adaptive gradient optimizers in large-scale model training.
△ Less
Submitted 28 May, 2024; v1 submitted 17 February, 2024;
originally announced February 2024.
-
Optimal Savings and Value of Population in A Stochastic Environment: Transient Behavior
Authors:
Hao Liu,
Suresh P. Sethi,
Tak Kwong Wong,
Sheung Chi Phillip Yam
Abstract:
We extend the work on optimal investment and consumption of a population considered in [2] to a general stochastic setting over a finite time horizon. We incorporate the Cobb-Douglas production function in the capital dynamics while the consumption utility function and the drift rate in the population dynamics can be general, in contrast with [2, 30, 31]. The dynamic programming formulation yields…
▽ More
We extend the work on optimal investment and consumption of a population considered in [2] to a general stochastic setting over a finite time horizon. We incorporate the Cobb-Douglas production function in the capital dynamics while the consumption utility function and the drift rate in the population dynamics can be general, in contrast with [2, 30, 31]. The dynamic programming formulation yields an unconventional nonlinear Hamilton-Jacobi-Bellman (HJB) equation, in which the Cobb-Douglas production function as the coefficient of the gradient of the value function induces the mismatching of power rates between capital and population. Moreover, the equation has a very singular term, essentially a very negative power of the partial derivative of the value function with respect to the capital, coming from the optimization of control, and their resolution turns out to be a complex problem not amenable to classical analysis. To show that this singular term, which has not been studied in any physical systems yet, does not actually blow up, we establish new pointwise generalized power laws for the partial derivative of the value function. Our contribution lies in providing a theoretical treatment that combines both the probabilistic approach and theory of partial differential equations to derive the pointwise upper and lower bounds as well as energy estimates in weighted Sobolev spaces. By then, we accomplish showing the well-posedness of classical solutions to a non-canonical parabolic equation arising from a long-lasting problem in macroeconomics.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Monochromatic $k$-connection of graphs
Authors:
Qingqiong Cai,
Shinya Fujita,
Henry Liu,
Boram Park
Abstract:
An edge-coloured path is monochromatic if all of its edges have the same colour. For a $k$-connected graph $G$, the monochromatic $k$-connection number of $G$, denoted by $mc_k(G)$, is the maximum number of colours in an edge-colouring of $G$ such that, any two vertices are connected by $k$ internally vertex-disjoint monochromatic paths. In this paper, we shall study the parameter $mc_k(G)$. We ob…
▽ More
An edge-coloured path is monochromatic if all of its edges have the same colour. For a $k$-connected graph $G$, the monochromatic $k$-connection number of $G$, denoted by $mc_k(G)$, is the maximum number of colours in an edge-colouring of $G$ such that, any two vertices are connected by $k$ internally vertex-disjoint monochromatic paths. In this paper, we shall study the parameter $mc_k(G)$. We obtain bounds for $mc_k(G)$, for general graphs $G$. We also compute $mc_k(G)$ exactly when $k$ is small, and $G$ is a graph on $n$ vertices, with a spanning $k$-connected subgraph having the minimum possible number of edges, namely $\lceil\frac{kn}{2}\rceil$. We prove a similar result when $G$ is a bipartite graph.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Effective Reduced Models from Delay Differential Equations: Bifurcations, Tip** Solution Paths, and ENSO variability
Authors:
Mickaël D. Chekroun,
Honghu Liu
Abstract:
Conceptual delay models have played a key role in the understanding of El Niño-Southern Oscillation (ENSO) variability. Based on such delay models, we propose a novel scenario for the fabric of ENSO variability resulting from the subtle interplay between stochastic disturbances and nonlinear invariant sets emerging from bifurcations of the unperturbed dynamics.
To identify these invariant sets w…
▽ More
Conceptual delay models have played a key role in the understanding of El Niño-Southern Oscillation (ENSO) variability. Based on such delay models, we propose a novel scenario for the fabric of ENSO variability resulting from the subtle interplay between stochastic disturbances and nonlinear invariant sets emerging from bifurcations of the unperturbed dynamics.
To identify these invariant sets we adopt an approach combining Galerkin-Koornwinder (GK) approximations of delay differential equations and center-unstable manifold reduction techniques. In that respect, GK approximation formulas are reviewed and synthesized, as well as analytic approximation formulas of center-unstable manifolds. The reduced systems derived thereof enable us to conduct a thorough analysis of the bifurcations arising in a standard delay model of ENSO. We identify thereby a saddle-node bifurcation of periodic orbits co-existing with a subcritical Hopf bifurcation, and a homoclinic bifurcation for this model. We show furthermore that the computation of unstable periodic orbits (UPOs) unfolding through these bifurcations is considerably simplified from the reduced systems.
These dynamical insights enable us in turn to design a stochastic model whose solutions -- as the delay parameter drifts slowly through its critical values -- produce a wealth of temporal patterns resembling ENSO events and exhibiting also decadal variability. Our analysis dissects the origin of this variability and shows how it is tied to certain transition paths between invariant sets of the unperturbed dynamics (for ENSO's interannual variability) or simply due to the presence of UPOs close to the homoclinic orbit (for decadal variability). In short, this study points out the role of solution paths evolving through tip** "points" beyond equilibria, as possible mechanisms organizing the variability of certain climate phenomena.
△ Less
Submitted 2 January, 2024;
originally announced February 2024.
-
Unconditionally energy stable IEQ-FEMs for the Cahn-Hilliard equation and Allen-Cahn equation
Authors:
Yaoyao Chen,
Hailiang Liu,
Nianyu Yi,
Peimeng Yin
Abstract:
In this paper, we present several unconditionally energy-stable invariant energy quadratization (IEQ) finite element methods (FEMs) with linear, first- and second-order accuracy for solving both the Cahn-Hilliard equation and the Allen-Cahn equation. For time discretization, we compare three distinct IEQ-FEM schemes that position the intermediate function introduced by the IEQ approach in differen…
▽ More
In this paper, we present several unconditionally energy-stable invariant energy quadratization (IEQ) finite element methods (FEMs) with linear, first- and second-order accuracy for solving both the Cahn-Hilliard equation and the Allen-Cahn equation. For time discretization, we compare three distinct IEQ-FEM schemes that position the intermediate function introduced by the IEQ approach in different function spaces: finite element space, continuous function space, or a combination of these spaces. Rigorous proofs establishing the existence and uniqueness of the numerical solution, along with analyses of energy dissipation for both equations and mass conservation for the Cahn-Hilliard equation, are provided. The proposed schemes' accuracy, efficiency, and solution properties are demonstrated through numerical experiments.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
Inf-Sup neural networks for high-dimensional elliptic PDE problems
Authors:
Xiaokai Huo,
Hailiang Liu
Abstract:
Solving high dimensional partial differential equations (PDEs) has historically posed a considerable challenge when utilizing conventional numerical methods, such as those involving domain meshes. Recent advancements in the field have seen the emergence of neural PDE solvers, leveraging deep networks to effectively tackle high dimensional PDE problems. This study introduces Inf-SupNet, a model-bas…
▽ More
Solving high dimensional partial differential equations (PDEs) has historically posed a considerable challenge when utilizing conventional numerical methods, such as those involving domain meshes. Recent advancements in the field have seen the emergence of neural PDE solvers, leveraging deep networks to effectively tackle high dimensional PDE problems. This study introduces Inf-SupNet, a model-based unsupervised learning approach designed to acquire solutions for a specific category of elliptic PDEs. The fundamental concept behind Inf-SupNet involves incorporating the inf-sup formulation of the underlying PDE into the loss function. The analysis reveals that the global solution error can be bounded by the sum of three distinct errors: the numerical integration error, the duality gap of the loss function (training error), and the neural network approximation error for functions within Sobolev spaces. To validate the efficacy of the proposed method, numerical experiments conducted in high dimensions demonstrate its stability and accuracy across various boundary conditions, as well as for both semi-linear and nonlinear PDEs.
△ Less
Submitted 2 February, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
Sharp variance estimator and causal bootstrap in stratified randomized experiments
Authors:
Haoyang Yu,
Ke Zhu,
Hanzhong Liu
Abstract:
The design-based finite-population asymptotic theory provides a normal approximation for the sampling distribution of the average treatment effect estimator in stratified randomized experiments. The asymptotic variance could be estimated by a Neyman-type conservative variance estimator. However, the variance estimator can be overly conservative, and the asymptotic theory may fail in small samples.…
▽ More
The design-based finite-population asymptotic theory provides a normal approximation for the sampling distribution of the average treatment effect estimator in stratified randomized experiments. The asymptotic variance could be estimated by a Neyman-type conservative variance estimator. However, the variance estimator can be overly conservative, and the asymptotic theory may fail in small samples. To solve these issues, we propose a sharp variance estimator for the weighted difference-in-means in stratified randomized experiments. Furthermore, we propose two causal bootstrap procedures to more accurately approximate the sampling distribution of the weighted difference-in-means estimator. The first causal bootstrap procedure is based on rank-preserving imputation and we prove its second-order refinement over normal approximation. The second causal bootstrap procedure is based on constant-treatment-effect imputation and is applicable in paired experiments. We prove its validity even when the assumption of constant treatment effect is violated for the true potential outcomes. Our analysis is randomization-based or design-based by conditioning on the potential outcomes, with treatment assignment being the sole source of randomness. Numerical studies and two real data applications demonstrate advantages of our proposed methods in finite samples.
△ Less
Submitted 26 June, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
Extremal density for subdivisions with length or sparsity constraints
Authors:
Jaehoon Kim,
Hong Liu,
Yantao Tang,
Guanghui Wang,
Donglei Yang,
Fan Yang
Abstract:
Given a graph $H$, a balanced subdivision of $H$ is obtained by replacing all edges of $H$ with internally disjoint paths of the same length. In this paper, we prove that for any graph $H$, a linear-in-$e(H)$ bound on average degree guarantees a balanced $H$-subdivision. This strengthens an old result of Bollobás and Thomason, and resolves a question of Gil-Fernández, Hyde, Liu, Pikhurko and Wu.…
▽ More
Given a graph $H$, a balanced subdivision of $H$ is obtained by replacing all edges of $H$ with internally disjoint paths of the same length. In this paper, we prove that for any graph $H$, a linear-in-$e(H)$ bound on average degree guarantees a balanced $H$-subdivision. This strengthens an old result of Bollobás and Thomason, and resolves a question of Gil-Fernández, Hyde, Liu, Pikhurko and Wu.
We observe that this linear bound on average degree is best possible whenever $H$ is logarithmically dense. We further show that this logarithmic density is the critical threshold: for many graphs $H$ below this density, its subdivisions are forcible by a sublinear bound in $e(H)$ on average degree. We provide such examples by proving that the subdivisions of any almost bipartite graph $H$ with sublogarithmic density are forcible by a sublinear-in-$e(H)$ bound on average degree, provided that $H$ satisfies some additional separability condition.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
Three closed characteristics on non-degenerate star-shaped hypersurfaces in $\mathbf{R}^{6}$
Authors:
Huagui Duan,
Hui Liu,
Yiming Long,
Zihao Qi,
Wei Wang
Abstract:
In this paper, we prove that for every non-degenerate $C^3$ compact star-shaped hypersurface $Σ$ in $\mathbf{R}^{6}$ which carries no prime closed characteristic of Maslov-type index $0$ or no prime closed characteristic of Maslov-type index $-1$, there exist at least three prime closed characteristics on $Σ$.
In this paper, we prove that for every non-degenerate $C^3$ compact star-shaped hypersurface $Σ$ in $\mathbf{R}^{6}$ which carries no prime closed characteristic of Maslov-type index $0$ or no prime closed characteristic of Maslov-type index $-1$, there exist at least three prime closed characteristics on $Σ$.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.