-
Fully discrete energy-dissipative and conservative discrete gradient particle methods for a class of continuity equations
Authors:
**gwei Hu,
Samuel Q. Van Fleet,
Andy T. S. Wan
Abstract:
Structure-preserving particle methods have recently been proposed for a class of nonlinear continuity equations, including aggregation-diffusion equation in [J. Carrillo, K. Craig, F. Patacchini, Calc. Var., 58 (2019), pp. 53] and the Landau equation in [J. Carrillo, J. Hu., L. Wang, J. Wu, J. Comput. Phys. X, 7 (2020), pp. 100066]. One common feature to these equations is that they both admit som…
▽ More
Structure-preserving particle methods have recently been proposed for a class of nonlinear continuity equations, including aggregation-diffusion equation in [J. Carrillo, K. Craig, F. Patacchini, Calc. Var., 58 (2019), pp. 53] and the Landau equation in [J. Carrillo, J. Hu., L. Wang, J. Wu, J. Comput. Phys. X, 7 (2020), pp. 100066]. One common feature to these equations is that they both admit some variational formulation, which upon proper regularization, leads to particle approximations dissipating the energy and conserving some quantities simultaneously at the semi-discrete level. In this paper, we formulate continuity equations with a density dependent bilinear form associated with the variational derivative of the energy functional and prove that appropriate particle methods satisfy a compatibility condition with its regularized energy. This enables us to utilize discrete gradient time integrators and show that the energy can be dissipated and the mass conserved simultaneously at the fully discrete level. In the case of the Landau equation, we prove that our approach also conserves the momentum and kinetic energy at the fully discrete level. Several numerical examples are presented to demonstrate the dissipative and conservative properties of our proposed method.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Model structure arising from one hereditary cotorsion pair on extriangulated categories
Authors:
Jiangsheng Hu,
Dongdong Zhang,
Panyue Zhou
Abstract:
Let $\mathcal{C}$ be a weakly idempotent complete extriangulated category. In contrast with the Hovey correspondence of admissible model structures on weakly idempotent complete exact categories from two complete cotorsion pairs, we give a construction of model structures on $\mathcal{C}$ from only one complete cotorsion pair. Our main result not only generalizes the work by Beligiannis-Reiten and…
▽ More
Let $\mathcal{C}$ be a weakly idempotent complete extriangulated category. In contrast with the Hovey correspondence of admissible model structures on weakly idempotent complete exact categories from two complete cotorsion pairs, we give a construction of model structures on $\mathcal{C}$ from only one complete cotorsion pair. Our main result not only generalizes the work by Beligiannis-Reiten and Cui-Lu-Zhang, but also provides methods to construct model structures from silting objects of $\mathcal{C}$ and co-$t$-structures in triangulated categories.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Asymptotic properties of a multicolored random reinforced urn model with an application to multi-armed bandits
Authors:
Li Yang,
Jiang Hu,
Jianghao Li,
Zhidong Bai
Abstract:
The random self-reinforcement mechanism, characterized by the principle of ``the rich get richer'', has demonstrated significant utility across various domains. One prominent model embodying this mechanism is the random reinforcement urn model. This paper investigates a multicolored, multiple-drawing variant of the random reinforced urn model. We establish the limiting behavior of the normalized u…
▽ More
The random self-reinforcement mechanism, characterized by the principle of ``the rich get richer'', has demonstrated significant utility across various domains. One prominent model embodying this mechanism is the random reinforcement urn model. This paper investigates a multicolored, multiple-drawing variant of the random reinforced urn model. We establish the limiting behavior of the normalized urn composition and demonstrate strong convergence upon scaling the counts of each color. Additionally, we derive strong convergence estimators for the reinforcement means, i.e., for the expectations of the replacement matrix's diagonal elements, and prove their joint asymptotic normality. It is noteworthy that the estimators of the largest reinforcement mean are asymptotically independent of the estimators of the other smaller reinforcement means. Additionally, if a reinforcement mean is not the largest, the estimators of these smaller reinforcement means will also demonstrate asymptotic independence among themselves. Furthermore, we explore the parallels between the reinforced mechanisms in random reinforced urn models and multi-armed bandits, addressing hypothesis testing for expected payoffs in the latter context.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
An Inexact Bregman Proximal Difference-of-Convex Algorithm with Two Types of Relative Stop** Criteria
Authors:
Lei Yang,
**g**g Hu,
Kim-Chuan Toh
Abstract:
In this paper, we consider a class of difference-of-convex (DC) optimization problems, where the global Lipschitz gradient continuity assumption on the smooth part of the objective function is not required. Such problems are prevalent in many contemporary applications such as compressed sensing, statistical regression, and machine learning, and can be solved by a general Bregman proximal DC algori…
▽ More
In this paper, we consider a class of difference-of-convex (DC) optimization problems, where the global Lipschitz gradient continuity assumption on the smooth part of the objective function is not required. Such problems are prevalent in many contemporary applications such as compressed sensing, statistical regression, and machine learning, and can be solved by a general Bregman proximal DC algorithm (BPDCA). However, the existing BPDCA is developed based on the stringent requirement that the involved subproblems must be solved exactly, which is often impractical and limits the applicability of the BPDCA. To facilitate the practical implementations and wider applications of the BPDCA, we develop an inexact Bregman proximal difference-of-convex algorithm (iBPDCA) by incorporating two types of relative-type stop** criteria for solving the subproblems. The proposed inexact framework has considerable flexibility to encompass many existing exact and inexact methods, and can accommodate different types of errors that may occur when solving the subproblem. This enables the potential application of our inexact framework across different DC decompositions to facilitate the design of a more efficient DCA scheme in practice. The global subsequential convergence and the global sequential convergence of our iBPDCA are established under suitable conditions including the Kurdyka-Łojasiewicz property. Some numerical experiments on the $\ell_{1-2}$ regularized least squares problem and the constrained $\ell_{1-2}$ sparse optimization problem are conducted to show the superior performance of our iBPDCA in comparison to existing algorithms. These results also empirically verify the necessity of develo** different types of stop** criteria to facilitate the efficient computation of the subproblem in each iteration of our iBPDCA.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Distributed Bilevel Optimization with Communication Compression
Authors:
Yutong He,
Jie Hu,
Xinmeng Huang,
Songtao Lu,
Bin Wang,
Kun Yuan
Abstract:
Stochastic bilevel optimization tackles challenges involving nested optimization structures. Its fast-growing scale nowadays necessitates efficient distributed algorithms. In conventional distributed bilevel methods, each worker must transmit full-dimensional stochastic gradients to the server every iteration, leading to significant communication overhead and thus hindering efficiency and scalabil…
▽ More
Stochastic bilevel optimization tackles challenges involving nested optimization structures. Its fast-growing scale nowadays necessitates efficient distributed algorithms. In conventional distributed bilevel methods, each worker must transmit full-dimensional stochastic gradients to the server every iteration, leading to significant communication overhead and thus hindering efficiency and scalability. To resolve this issue, we introduce the first family of distributed bilevel algorithms with communication compression. The primary challenge in algorithmic development is mitigating bias in hypergradient estimation caused by the nested structure. We first propose C-SOBA, a simple yet effective approach with unbiased compression and provable linear speedup convergence. However, it relies on strong assumptions on bounded gradients. To address this limitation, we explore the use of moving average, error feedback, and multi-step compression in bilevel optimization, resulting in a series of advanced algorithms with relaxed assumptions and improved convergence properties. Numerical experiments show that our compressed bilevel algorithms can achieve $10\times$ reduction in communication overhead without severe performance degradation.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
On Galkin's Lower Bound Conjecture
Authors:
Jianxun Hu,
Huazhong Ke,
Changzheng Li,
Zhitong Su
Abstract:
We estimate an upper bound of the spectral radius of a linear operator on the quantum cohomology of the toric Fano manifolds $\mathbb{P}_{\mathbb{P}^{n}}(\mathcal{O}\oplus\mathcal{O}(3))$. This provides a negative answer to Galkin's lower bound conjecture.
We estimate an upper bound of the spectral radius of a linear operator on the quantum cohomology of the toric Fano manifolds $\mathbb{P}_{\mathbb{P}^{n}}(\mathcal{O}\oplus\mathcal{O}(3))$. This provides a negative answer to Galkin's lower bound conjecture.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Counter-examples to Gamma conjecture I
Authors:
Sergey Galkin,
Jianxun Hu,
Hiroshi Iritani,
Huazhong Ke,
Changzheng Li,
Zhitong Su
Abstract:
We investigate Gamma conjecture I and its underlying Conjecture $\mathcal{O}$ for the $\mathbb{P}^1$-bundles $X_n=\mathbb{P}_{\mathbb{P}^{n}}(\mathcal{O}\oplus\mathcal{O}(n))$ with $n\ge 3$. We show that Conjecture $\mathcal{O}$ does not hold if $n$ is odd, and that Gamma conjecture I does not hold if $n$ is even. Led by this example, we propose modifications for Gamma conjecture I, discuss Gamma…
▽ More
We investigate Gamma conjecture I and its underlying Conjecture $\mathcal{O}$ for the $\mathbb{P}^1$-bundles $X_n=\mathbb{P}_{\mathbb{P}^{n}}(\mathcal{O}\oplus\mathcal{O}(n))$ with $n\ge 3$. We show that Conjecture $\mathcal{O}$ does not hold if $n$ is odd, and that Gamma conjecture I does not hold if $n$ is even. Led by this example, we propose modifications for Gamma conjecture I, discuss Gamma conjecture I over the Kahler moduli space, and identify the corresponding principal asymptotic class.
△ Less
Submitted 5 June, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Transport based particle methods for the Fokker-Planck-Landau equation
Authors:
Vasily Ilin,
**gwei Hu,
Zhenfu Wang
Abstract:
We propose a particle method for numerically solving the Landau equation, inspired by the score-based transport modeling (SBTM) method for the Fokker-Planck equation. This method can preserve some important physical properties of the Landau equation, such as the conservation of mass, momentum, and energy, and decay of estimated entropy. We prove that matching the gradient of the logarithm of the a…
▽ More
We propose a particle method for numerically solving the Landau equation, inspired by the score-based transport modeling (SBTM) method for the Fokker-Planck equation. This method can preserve some important physical properties of the Landau equation, such as the conservation of mass, momentum, and energy, and decay of estimated entropy. We prove that matching the gradient of the logarithm of the approximate solution is enough to recover the true solution to the Landau equation with Maxwellian molecules. Several numerical experiments in low and moderately high dimensions are performed, with particular emphasis on comparing the proposed method with the traditional particle or blob method.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
A bridge connecting convex analysis and complex analysis and $L^2$-estimate of $d$ and $\bar\partial$
Authors:
Fusheng Deng,
**** Hu,
Weiwen Jiang,
Xiangsen Qin
Abstract:
We propose a way to connect complex analysis and convex analysis. As applications, we derive some results about $L^2$-estimate for $d$-equation and prove some curvature positivity related to convex analysis from well known $L^2$-estimate for $\bar\partial$-equation or the results we prove in complex analysis.
We propose a way to connect complex analysis and convex analysis. As applications, we derive some results about $L^2$-estimate for $d$-equation and prove some curvature positivity related to convex analysis from well known $L^2$-estimate for $\bar\partial$-equation or the results we prove in complex analysis.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
The Asymptotic Properties of the Extreme Eigenvectors of High-dimensional Generalized Spiked Covariance Model
Authors:
Zhangni Pu,
Xiaozhuo Zhang,
Jiang Hu,
Zhidong Bai
Abstract:
In this paper, we investigate the asymptotic behaviors of the extreme eigenvectors in a general spiked covariance matrix, where the dimension and sample size increase proportionally. We eliminate the restrictive assumption of the block diagonal structure in the population covariance matrix. Moreover, there is no requirement for the spiked eigenvalues and the 4th moment to be bounded. Specifically,…
▽ More
In this paper, we investigate the asymptotic behaviors of the extreme eigenvectors in a general spiked covariance matrix, where the dimension and sample size increase proportionally. We eliminate the restrictive assumption of the block diagonal structure in the population covariance matrix. Moreover, there is no requirement for the spiked eigenvalues and the 4th moment to be bounded. Specifically, we apply random matrix theory to derive the convergence and limiting distributions of certain projections of the extreme eigenvectors in a large sample covariance matrix within a generalized spiked population model. Furthermore, our techniques are robust and effective, even when spiked eigenvalues differ significantly in magnitude from nonspiked ones. Finally, we propose a powerful statistic for hypothesis testing for the eigenspaces of covariance matrices.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Provable Preconditioned Plug-and-Play Approach for Compressed Sensing MRI Reconstruction
Authors:
Tao Hong,
Xiaojian Xu,
Jason Hu,
Jeffrey A. Fessler
Abstract:
Model-based methods play a key role in the reconstruction of compressed sensing (CS) MRI. Finding an effective prior to describe the statistical distribution of the image family of interest is crucial for model-based methods. Plug-and-play (PnP) is a general framework that uses denoising algorithms as the prior or regularizer. Recent work showed that PnP methods with denoisers based on pretrained…
▽ More
Model-based methods play a key role in the reconstruction of compressed sensing (CS) MRI. Finding an effective prior to describe the statistical distribution of the image family of interest is crucial for model-based methods. Plug-and-play (PnP) is a general framework that uses denoising algorithms as the prior or regularizer. Recent work showed that PnP methods with denoisers based on pretrained convolutional neural networks outperform other classical regularizers in CS MRI reconstruction. However, the numerical solvers for PnP can be slow for CS MRI reconstruction. This paper proposes a preconditioned PnP (P^2nP) method to accelerate the convergence speed. Moreover, we provide proofs of the fixed-point convergence of the P^2nP iterates. Numerical experiments on CS MRI reconstruction with non-Cartesian sampling trajectories illustrate the effectiveness and efficiency of the P^2nP approach.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Mirror Construction for Nakajima Quiver Varieties
Authors:
Jiawei Hu,
Siu-Cheong Lau,
Ju Tan
Abstract:
In this paper, we construct the ADHM quiver representations and the corresponding sheaves as the mirror objects of formal deformations of the framed immersed Lagrangian sphere decorated with flat bundles. More generally, framed double quivers of Nakajima are constructed as localized mirrors of framed Lagrangian immersions in dimension two. This produces a localized mirror functor to the dg categor…
▽ More
In this paper, we construct the ADHM quiver representations and the corresponding sheaves as the mirror objects of formal deformations of the framed immersed Lagrangian sphere decorated with flat bundles. More generally, framed double quivers of Nakajima are constructed as localized mirrors of framed Lagrangian immersions in dimension two. This produces a localized mirror functor to the dg category of modules over the framed preprojective algebra.
For affine ADE quivers in specific multiplicities, the corresponding (unframed) Lagrangian immersions are homological tori, whose moduli of stable deformations are asymptotically locally Euclidean (ALE) spaces. We show that framed stable Lagrangian branes are transformed into monadic complexes of framed torsion-free sheaves over the ALE spaces.
A main ingredient is the notion of framed Lagrangian immersions. Moreover, it is important to note that the deformation space of a Lagrangian immersion with more than one component is stacky. Using the formalism of quiver algebroid stacks, we find isomorphisms between the moduli of stable Lagrangian immersions and that of special Lagrangian fibers of an SYZ fibration in the affine $A_n$ cases.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Cost-effective company response policy for product co-creation in company-sponsored online community
Authors:
Jiamin Hu,
Lu-Xing Yang,
Xiaofan Yang,
Kaifan Huang,
Gang Li,
Yong Xiang
Abstract:
Product co-creation based on company-sponsored online community has come to be a paradigm of develo** new products collaboratively with customers. In such a product co-creation campaign, the sponsoring company needs to interact intensively with active community members about the design scheme of the product. We call the collection of the rates of the company's response to active community member…
▽ More
Product co-creation based on company-sponsored online community has come to be a paradigm of develo** new products collaboratively with customers. In such a product co-creation campaign, the sponsoring company needs to interact intensively with active community members about the design scheme of the product. We call the collection of the rates of the company's response to active community members at all time in the co-creation campaign as a company response policy (CRP). This paper addresses the problem of finding a cost-effective CRP (the CRP problem). First, we introduce a novel community state evolutionary model and, thereby, establish an optimal control model for the CRP problem (the CRP model). Second, based on the optimality system for the CRP model, we present an iterative algorithm for solving the CRP model (the CRP algorithm). Thirdly, through extensive numerical experiments, we conclude that the CRP algorithm converges and the resulting CRP exhibits excellent cost benefit. Consequently, we recommend the resulting CRP to companies that embrace product co-creation. Next, we discuss how to implement the resulting CRP. Finally, we investigate the effect of some factors on the cost benefit of the resulting CRP. To our knowledge, this work is the first attempt to study value co-creation through optimal control theoretic approach.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
Correspondence Research of the Most Probable Transition Paths between a Stochastic Interacting Particle System and its Mean Field Limit System
Authors:
Jianyu Chen,
Jianyu Hu,
Zibo Wang,
Ting Gao **qiao Duan
Abstract:
This paper derived the indirect approximation theorem of the most probable transition pathway of a stochastic interacting particle system in the mean field sense. This paper studied the problem of indirect approximation of the most probable transition pathway of an interacting particle system (i.e., a high-dimensional stochastic dynamic system) and its mean field limit equation (McKean-Vlasov stoc…
▽ More
This paper derived the indirect approximation theorem of the most probable transition pathway of a stochastic interacting particle system in the mean field sense. This paper studied the problem of indirect approximation of the most probable transition pathway of an interacting particle system (i.e., a high-dimensional stochastic dynamic system) and its mean field limit equation (McKean-Vlasov stochastic differential equation). This study is based on the Onsager-Machlup action functional, reformulated the problem as an optimal control problem. With the stochastic Pontryagin's Maximum Principle, this paper completed the derivation. This paper proved the existence and uniqueness theorem of the solution to the mean field optimal control problem of McKean-Vlasov stochastic differential equations, and also established a system of equations satisfying the control parameters $θ^{*}$ and $θ^{N}$ respectively. There are few studies on the most probable transition pathways of stochastic interacting particle systems, it is still a great challenge to solve the most probable transition pathways directly or to approximate it with the mean field limit system. Therefore, this paper first gave the proof of correspondence between the core equation of Pontryagin's Maximum Principle, that is, Hamiltonian extreme condition equation. That is to say, this correspondence indirectly explain the correspondence between the most probable transition pathways of stochastic interacting particle systems and the mean field systems.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
ABP estimate and comparison principle for cone degenerate quasilinear elliptic equations
Authors:
Hua Chen,
Jiangtao Hu,
Xiaochun Liu,
Yawei Wei,
Mengnan Zhang
Abstract:
In this paper, we study the cone degenerate quasilinear elliptic equations. We provide the existence of the viscosity solutions by proving Alexandrov-Bakelman-Pucci and Hölder estimates. Further more, we give the comparison principle by an equivalent transformation.
In this paper, we study the cone degenerate quasilinear elliptic equations. We provide the existence of the viscosity solutions by proving Alexandrov-Bakelman-Pucci and Hölder estimates. Further more, we give the comparison principle by an equivalent transformation.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Oracle complexities of augmented Lagrangian methods for nonsmooth manifold optimization
Authors:
Kangkang Deng,
Jiang Hu,
Jiayuan Wu,
Zaiwen Wen
Abstract:
In this paper, we present two novel manifold inexact augmented Lagrangian methods, \textbf{ManIAL} for deterministic settings and \textbf{StoManIAL} for stochastic settings, solving nonsmooth manifold optimization problems. By using the Riemannian gradient method as a subroutine, we establish an $\mathcal{O}(ε^{-3})$ oracle complexity result of \textbf{ManIAL}, matching the best-known complexity r…
▽ More
In this paper, we present two novel manifold inexact augmented Lagrangian methods, \textbf{ManIAL} for deterministic settings and \textbf{StoManIAL} for stochastic settings, solving nonsmooth manifold optimization problems. By using the Riemannian gradient method as a subroutine, we establish an $\mathcal{O}(ε^{-3})$ oracle complexity result of \textbf{ManIAL}, matching the best-known complexity result. Our algorithm relies on the careful selection of penalty parameters and the precise control of termination criteria for subproblems. Moreover, for cases where the smooth term follows an expectation form, our proposed \textbf{StoManIAL} utilizes a Riemannian recursive momentum method as a subroutine, and achieves an oracle complexity of $\tilde{\mathcal{O}}(ε^{-3.5})$, which surpasses the best-known $\mathcal{O}(ε^{-4})$ result. Numerical experiments conducted on sparse principal component analysis and sparse canonical correlation analysis demonstrate that our proposed methods outperform an existing method with the previously best-known complexity result. To the best of our knowledge, these are the first complexity results of the augmented Lagrangian methods for solving nonsmooth manifold optimization problems.
△ Less
Submitted 29 April, 2024; v1 submitted 7 April, 2024;
originally announced April 2024.
-
Modular representations of the Yangian $Y_2$
Authors:
Hao Chang,
**xin Hu,
Lewis Topley
Abstract:
Let $Y_2$ be the Yangian associated to the general linear Lie algebra $\mathfrak{gl}_2$, defined over an algebraically closed field $\mathbbm{k}$ of characteristic $p > 0$. In this paper, we study the representation theory of the restricted Yangian $Y^{[p]}_2$. This leads to a description of the representations of $\mathfrak{gl}_{2n}$, whose $p$-character is nilpotent with Jordan type given by a t…
▽ More
Let $Y_2$ be the Yangian associated to the general linear Lie algebra $\mathfrak{gl}_2$, defined over an algebraically closed field $\mathbbm{k}$ of characteristic $p > 0$. In this paper, we study the representation theory of the restricted Yangian $Y^{[p]}_2$. This leads to a description of the representations of $\mathfrak{gl}_{2n}$, whose $p$-character is nilpotent with Jordan type given by a two-row partition $(n, n)$.
△ Less
Submitted 8 May, 2024; v1 submitted 27 March, 2024;
originally announced March 2024.
-
Mixed Variational Formulation of Coupled Plates
Authors:
Jun Hu,
Zhen Liu,
Rui Ma,
Ruishu Wang
Abstract:
This paper proposes a mixed variational formulation for the problem of two coupled plates with a rigid {junction}. The proposed mixed {formulation} introduces {the union of} stresses and moments as {an auxiliary variable}, which {are} commonly of great interest in practical applications. The primary challenge lies in determining a suitable {space involving} both boundary and junction conditions of…
▽ More
This paper proposes a mixed variational formulation for the problem of two coupled plates with a rigid {junction}. The proposed mixed {formulation} introduces {the union of} stresses and moments as {an auxiliary variable}, which {are} commonly of great interest in practical applications. The primary challenge lies in determining a suitable {space involving} both boundary and junction conditions of the auxiliary variable. The {theory} of densely defined operators in Hilbert spaces is employed to define {a nonstandard Sobolev space} without the use of trace operators. The well-posedness is established for the mixed formulation. Based on these conditions, this paper provides a framework {of} conforming {mixed} finite element methods. Numerical experiments are given to validate the theoretical results.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
A Structure-Preserving Kernel Method for Learning Hamiltonian Systems
Authors:
Jianyu Hu,
Juan-Pablo Ortega,
Daiying Yin
Abstract:
A structure-preserving kernel ridge regression method is presented that allows the recovery of potentially high-dimensional and nonlinear Hamiltonian functions out of datasets made of noisy observations of Hamiltonian vector fields. The method proposes a closed-form solution that yields excellent numerical performances that surpass other techniques proposed in the literature in this setup. From th…
▽ More
A structure-preserving kernel ridge regression method is presented that allows the recovery of potentially high-dimensional and nonlinear Hamiltonian functions out of datasets made of noisy observations of Hamiltonian vector fields. The method proposes a closed-form solution that yields excellent numerical performances that surpass other techniques proposed in the literature in this setup. From the methodological point of view, the paper extends kernel regression methods to problems in which loss functions involving linear functions of gradients are required and, in particular, a differential reproducing property and a Representer Theorem are proved in this context. The relation between the structure-preserving kernel estimator and the Gaussian posterior mean estimator is analyzed. A full error analysis is conducted that provides convergence rates using fixed and adaptive regularization parameters. The good performance of the proposed estimator is illustrated with various numerical experiments.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Quantum Circuits for partial differential equations via Schrödingerisation
Authors:
Junpeng Hu,
Shi **,
Nana Liu,
Lei Zhang
Abstract:
Quantum computing has emerged as a promising avenue for achieving significant speedup, particularly in large-scale PDE simulations, compared to classical computing. One of the main quantum approaches involves utilizing Hamiltonian simulation, which is directly applicable only to Schrödinger-type equations. To address this limitation, Schrödingerisation techniques have been developed, employing the…
▽ More
Quantum computing has emerged as a promising avenue for achieving significant speedup, particularly in large-scale PDE simulations, compared to classical computing. One of the main quantum approaches involves utilizing Hamiltonian simulation, which is directly applicable only to Schrödinger-type equations. To address this limitation, Schrödingerisation techniques have been developed, employing the warped transformation to convert general linear PDEs into Schrödinger-type equations. However, despite the development of Schrödingerisation techniques, the explicit implementation of the corresponding quantum circuit for solving general PDEs remains to be designed. In this paper, we present detailed implementation of a quantum algorithm for general PDEs using Schrödingerisation techniques. We provide examples of the heat equation, and the advection equation approximated by the upwind scheme, to demonstrate the effectiveness of our approach. Complexity analysis is also carried out to demonstrate the quantum advantages of these algorithms in high dimensions over their classical counterparts.
△ Less
Submitted 12 May, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Managing Distributional Ambiguity in Stochastic Optimization through a Statistical Upper Bound Framework
Authors:
Shixin Liu,
Jian Hu
Abstract:
Stochastic optimization is often hampered by distributional ambiguity, where critical probability distributions are poorly characterized or unknown. Addressing this challenge, we introduce a new framework that targets the minimization of a statistical upper bound for the expected value of uncertain objectives, facilitating more statistically robust decision-making. Central to our approach is the A…
▽ More
Stochastic optimization is often hampered by distributional ambiguity, where critical probability distributions are poorly characterized or unknown. Addressing this challenge, we introduce a new framework that targets the minimization of a statistical upper bound for the expected value of uncertain objectives, facilitating more statistically robust decision-making. Central to our approach is the Average Percentile Upper Bound (APUB), a novel construct that simultaneously delivers a statistically rigorous upper bound for the population mean and a meaningful risk metric for the sample mean. The integration of APUB into stochastic optimization not only fortifies the process against distributional ambiguity but also reinforces key data-driven decision-making attributes, such as reliability, consistency, and comprehensibility. Notably, APUB-enriched optimization problems feature tractability, with particular advantages in two-stage stochastic optimization with random recourse. Empirical demonstrations on two-stage product mix and multi-product newsvendor benchmark problems reveal the benefit of the APUB optimization framework, in comparison with conventional techniques such as sample average approximation and distributionally robust optimization.
△ Less
Submitted 18 April, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Hypothesis testing for homogenous of nodes in $β$-models
Authors:
Kang Fu,
Jianwei Hu,
Meng Sun
Abstract:
The $β$-model has been extensively utilized to model degree heterogeneity in networks, wherein each node is assigned a unique parameter. In this article, we consider the hypothesis testing problem that two nodes $i$ and $j$ of a $β$-model have the same node parameter. We prove that the null distribution of the proposed statistic converges in distribution to the standard normal distribution. Furthe…
▽ More
The $β$-model has been extensively utilized to model degree heterogeneity in networks, wherein each node is assigned a unique parameter. In this article, we consider the hypothesis testing problem that two nodes $i$ and $j$ of a $β$-model have the same node parameter. We prove that the null distribution of the proposed statistic converges in distribution to the standard normal distribution. Further, we investigate the homogeneous test for $β$-model by combining individual $p$-values to aggregate small effects of multiple tests. Both simulation studies and real-world data examples indicate that the proposed method works well.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
Revised BDS Test
Authors:
Wenya Luo,
Zhidong Bai,
Jiang Hu,
Chen Wang
Abstract:
In this paper, we focus on the BDS test, which is a nonparametric test of independence. Specifically, the null hypothesis $H_{0}$ of it is that $\{u_{t}\}$ is i.i.d. (independent and identically distributed), where $\{u_{t}\}$ is a random sequence. The BDS test is widely used in economics and finance, but it has a weakness that cannot be ignored: over-rejecting $H_{0}$ even if the length $T$ of…
▽ More
In this paper, we focus on the BDS test, which is a nonparametric test of independence. Specifically, the null hypothesis $H_{0}$ of it is that $\{u_{t}\}$ is i.i.d. (independent and identically distributed), where $\{u_{t}\}$ is a random sequence. The BDS test is widely used in economics and finance, but it has a weakness that cannot be ignored: over-rejecting $H_{0}$ even if the length $T$ of $\{u_{t}\}$ is as large as $(100,2000)$. To improve the over-rejection problem of BDS test, considering that the correlation integral is the foundation of BDS test, we not only accurately describe the expectation of the correlation integral under $H_{0}$, but also calculate all terms of the asymptotic variance of the correlation integral whose order is $O(T^{-1})$ and $O(T^{-2})$, which is essential to improve the finite sample performance of BDS test. Based on this, we propose a revised BDS (RBDS) test and prove its asymptotic normality under $H_{0}$. The RBDS test not only inherits all the advantages of the BDS test, but also effectively corrects the over-rejection problem of the BDS test, which can be fully confirmed by the simulation results we presented. Moreover, based on the simulation results, we find that similar to BDS test, RBDS test would also be affected by the parameter estimations of the ARCH-type model, resulting in size distortion, but this phenomenon can be alleviated by the logarithmic transformation preprocessing of the estimate residuals of the model. Besides, through some actual datasets that have been demonstrated to fit well with ARCH-type models, we also compared the performance of BDS test and RBDS test in evaluating the goodness-of-fit of the model in empirical problem, and the results reflect that, under the same condition, the performance of the RBDS test is more encouraging.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
A High-order Nyström-based Scheme Explicitly Enforcing Surface Density Continuity for the Electric Field Integral Equation
Authors:
** Hu,
Constantine Sideris
Abstract:
This paper introduces an efficient approach for solving the Electric Field Integral Equation (EFIE) with high-order accuracy by explicitly enforcing the continuity of the impressed current densities across boundaries of the surface patch discretization. The integral operator involved is discretized via a Nyström-collocation approach based on Chebyshev polynomial expansion within each patch and a c…
▽ More
This paper introduces an efficient approach for solving the Electric Field Integral Equation (EFIE) with high-order accuracy by explicitly enforcing the continuity of the impressed current densities across boundaries of the surface patch discretization. The integral operator involved is discretized via a Nyström-collocation approach based on Chebyshev polynomial expansion within each patch and a closed quadrature rule is utilized such that the discretization points inside one patch coincide with those inside another patch on the shared boundary of those two patches. The continuity enforcement is achieved by constructing a map** from those coninciding points to a vector containing unique discretization points used in the GMRES iterative solver. The proposed approach is applied to the scattering of several different geometries including a sphere, a cube, a NURBS model imported from CAD software, and a dipole structure and results are compared with the Magnetic Field Integral Equation (MFIE) and the EFIE without enforcing continuity to illustrate the effectiveness of the approach.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Offline Learning of Decision Functions in Multiplayer Games with Expectation Constraints
Authors:
Yuanhanqing Huang,
Jianghai Hu
Abstract:
We explore a class of stochastic multiplayer games where each player in the game aims to optimize its objective under uncertainty and adheres to some expectation constraints. The study employs an offline learning paradigm, leveraging a pre-existing dataset containing auxiliary features. While prior research in deterministic and stochastic multiplayer games primarily explored vector-valued decision…
▽ More
We explore a class of stochastic multiplayer games where each player in the game aims to optimize its objective under uncertainty and adheres to some expectation constraints. The study employs an offline learning paradigm, leveraging a pre-existing dataset containing auxiliary features. While prior research in deterministic and stochastic multiplayer games primarily explored vector-valued decisions, this work departs by considering function-valued decisions that incorporate auxiliary features as input. We leverage the law of large deviations and degree theory to establish the almost sure convergence of the offline learning solution to the true solution as the number of data samples increases. Finally, we demonstrate the validity of our method via a multi-account portfolio optimization problem.
△ Less
Submitted 24 February, 2024;
originally announced February 2024.
-
Non-adaptive Bellman-Ford: Yen's improvement is optimal
Authors:
Jialu Hu,
László Kozma
Abstract:
The Bellman-Ford algorithm for single-source shortest paths repeatedly updates tentative distances in an operation called relaxing an edge. In several important applications a non-adaptive (oblivious) implementation is preferred, which means fixing the entire sequence of relaxations upfront, independent of the edge-weights. In a dense graph on $n$ vertices, the algorithm in its standard form perfo…
▽ More
The Bellman-Ford algorithm for single-source shortest paths repeatedly updates tentative distances in an operation called relaxing an edge. In several important applications a non-adaptive (oblivious) implementation is preferred, which means fixing the entire sequence of relaxations upfront, independent of the edge-weights. In a dense graph on $n$ vertices, the algorithm in its standard form performs $(1 + o(1))n^3$ relaxations. An improvement by Yen from 1970 reduces the number of relaxations by a factor of two. We show that no further constant-factor improvements are possible, and every non-adaptive deterministic algorithm based on relaxations must perform $(\frac{1}{2} - o(1))n^3$ steps. This improves an earlier lower bound of Eppstein of $(\frac{1}{6} - o(1))n^3$. Given that a non-adaptive randomized variant of Bellman-Ford with at most $(\frac{1}{3} + o(1))n^3$ relaxations (with high probability) is known, our result implies a strict separation between deterministic and randomized strategies, answering an open question of Eppstein.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
A hybrid iterative method based on MIONet for PDEs: Theory and numerical examples
Authors:
Jun Hu,
Pengzhan **
Abstract:
We propose a hybrid iterative method based on MIONet for PDEs, which combines the traditional numerical iterative solver and the recent powerful machine learning method of neural operator, and further systematically analyze its theoretical properties, including the convergence condition, the spectral behavior, as well as the convergence rate, in terms of the errors of the discretization and the mo…
▽ More
We propose a hybrid iterative method based on MIONet for PDEs, which combines the traditional numerical iterative solver and the recent powerful machine learning method of neural operator, and further systematically analyze its theoretical properties, including the convergence condition, the spectral behavior, as well as the convergence rate, in terms of the errors of the discretization and the model inference. We show the theoretical results for the frequently-used smoothers, i.e. Richardson (damped Jacobi) and Gauss-Seidel. We give an upper bound of the convergence rate of the hybrid method w.r.t. the model correction period, which indicates a minimum point to make the hybrid iteration converge fastest. Several numerical examples including the hybrid Richardson (Gauss-Seidel) iteration for the 1-d (2-d) Poisson equation are presented to verify our theoretical results, and also reflect an excellent acceleration effect. As a meshless acceleration method, it is provided with enormous potentials for practice applications.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
LEMDA: A Lagrangian-Eulerian Multiscale Data Assimilation Framework
Authors:
Quanling Deng,
Nan Chen,
Samuel N. Stechmann,
Jiuhua Hu
Abstract:
Lagrangian trajectories are widely used as observations for recovering the underlying flow field via Lagrangian data assimilation (DA). However, the strong nonlinearity in the observational process and the high dimensionality of the problems often cause challenges in applying standard Lagrangian DA. In this paper, a Lagrangian-Eulerian multiscale DA (LEMDA) framework is developed. It starts with e…
▽ More
Lagrangian trajectories are widely used as observations for recovering the underlying flow field via Lagrangian data assimilation (DA). However, the strong nonlinearity in the observational process and the high dimensionality of the problems often cause challenges in applying standard Lagrangian DA. In this paper, a Lagrangian-Eulerian multiscale DA (LEMDA) framework is developed. It starts with exploiting the Boltzmann kinetic description of the particle dynamics to derive a set of continuum equations, which characterize the statistical quantities of particle motions at fixed grids and serve as Eulerian observations. Despite the nonlinearity in the continuum equations and the processes of Lagrangian observations, the time evolutions of the posterior distribution from LEMDA can be written down using closed analytic formulae. This offers an exact and efficient way of carrying out DA, which avoids using ensemble approximations and the associated tunings. The analytically solvable properties also facilitate the derivation of an effective reduced-order Lagrangian DA scheme that further enhances computational efficiency. The Lagrangian DA within the framework has advantages when a moderate number of particles is used, while the Eulerian DA can effectively save computational costs when the number of particle observations becomes large. The Eulerian DA is also valuable when particles collide, such as using sea ice floe trajectories as observations. LEMDA naturally applies to multiscale turbulent flow fields, where the Eulerian DA recovers the large-scale structures, and the Lagrangian DA efficiently resolves the small-scale features in each grid cell via parallel computing. Numerical experiments demonstrate the skilful results of LEMDA and its two components.
△ Less
Submitted 4 February, 2024; v1 submitted 31 January, 2024;
originally announced January 2024.
-
A direct finite element method for elliptic interface problems
Authors:
Jun Hu,
Limin Ma
Abstract:
In this paper, a direct finite element method is proposed for solving interface problems on simple unfitted meshes. The fact that the two interface conditions form a $H^{\frac12}(Γ)\times H^{-\frac12}(Γ)$ pair leads to a simple and direct weak formulation with an integral term for the mutual interaction over the interface, and the well-posedness of this weak formulation is proved. Based on this fo…
▽ More
In this paper, a direct finite element method is proposed for solving interface problems on simple unfitted meshes. The fact that the two interface conditions form a $H^{\frac12}(Γ)\times H^{-\frac12}(Γ)$ pair leads to a simple and direct weak formulation with an integral term for the mutual interaction over the interface, and the well-posedness of this weak formulation is proved. Based on this formulation, a direct finite element method is proposed to solve the problem on two adjacent subdomains separated by the interface by conforming finite element and conforming mixed finite element, respectively. The well-posedness and an optimal a priori analysis are proved for this direct finite element method under some reasonable assumptions. A simple lowest order direct finite element method by using the linear element method and the lowest order Raviart-Thomas element method is proposed and analyzed to admit the optimal a priori error estimate by verifying the aforementioned assumptions. Numerical tests are also conducted to verify the theoretical results and the effectiveness of the direct finite element method.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Multi-Robot Relative Pose Estimation in SE(2) with Observability Analysis: A Comparison of Extended Kalman Filtering and Robust Pose Graph Optimization
Authors:
Kihoon Shin,
Hyunjae Sim,
Seungwon Nam,
Yonghee Kim,
Jae Hu,
Kwang-Ki K. Kim
Abstract:
In this study, we address multi-robot localization issues, with a specific focus on cooperative localization and observability analysis of relative pose estimation. Cooperative localization involves enhancing each robot's information through a communication network and message passing. If odometry data from a target robot can be transmitted to the ego robot, observability of their relative pose es…
▽ More
In this study, we address multi-robot localization issues, with a specific focus on cooperative localization and observability analysis of relative pose estimation. Cooperative localization involves enhancing each robot's information through a communication network and message passing. If odometry data from a target robot can be transmitted to the ego robot, observability of their relative pose estimation can be achieved through range-only or bearing-only measurements, provided both robots have non-zero linear velocities. In cases where odometry data from a target robot are not directly transmitted but estimated by the ego robot, both range and bearing measurements are necessary to ensure observability of relative pose estimation. For ROS/Gazebo simulations, we explore four sensing and communication structures. We compare extended Kalman filtering (EKF) and pose graph optimization (PGO) estimation using different robust loss functions (filtering and smoothing with varying batch sizes of sliding windows) in terms of estimation accuracy. In hardware experiments, two Turtlebot3 equipped with UWB modules are used for real-world inter-robot relative pose estimation, applying both EKF and PGO and comparing their performance.
△ Less
Submitted 4 February, 2024; v1 submitted 27 January, 2024;
originally announced January 2024.
-
Z-estimation system: a modular approach to asymptotic analysis
Authors:
Jie Kate Hu
Abstract:
Asymptotic analysis for related inference problems often involves similar steps and proofs. These intermediate results could be shared across problems if each of them is made self-contained and easily identified. However, asymptotic analysis using Taylor expansions is limited for result borrowing because it is a step-to-step procedural approach. This article introduces EEsy, a modular system for e…
▽ More
Asymptotic analysis for related inference problems often involves similar steps and proofs. These intermediate results could be shared across problems if each of them is made self-contained and easily identified. However, asymptotic analysis using Taylor expansions is limited for result borrowing because it is a step-to-step procedural approach. This article introduces EEsy, a modular system for estimating finite and infinitely dimensional parameters in related inference problems. It is based on the infinite-dimensional Z-estimation theorem, Donsker and Glivenko-Cantelli preservation theorems, and weight calibration techniques. This article identifies the systematic nature of these tools and consolidates them into one system containing several modules, which can be built, shared, and extended in a modular manner. This change to the structure of method development allows related methods to be developed in parallel and complex problems to be solved collaboratively, expediting the development of new analytical methods. This article considers four related inference problems -- estimating parameters with random sampling, two-phase sampling, auxiliary information incorporation, and model misspecification. We illustrate this modular approach by systematically develo** 9 parameter estimators and 18 variance estimators for the four related inference problems regarding semi-parametric additive hazards models. Simulation studies show the obtained asymptotic results for these 27 estimators are valid. In the end, I describe how this system can simplify the use of empirical process theory, a powerful but challenging tool to be adopted by the broad community of methods developers. I discuss challenges and the extension of this system to other inference problems.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Accelerating Distributed Stochastic Optimization via Self-Repellent Random Walks
Authors:
Jie Hu,
Vishwaraj Doshi,
Do Young Eun
Abstract:
We study a family of distributed stochastic optimization algorithms where gradients are sampled by a token traversing a network of agents in random-walk fashion. Typically, these random-walks are chosen to be Markov chains that asymptotically sample from a desired target distribution, and play a critical role in the convergence of the optimization iterates. In this paper, we take a novel approach…
▽ More
We study a family of distributed stochastic optimization algorithms where gradients are sampled by a token traversing a network of agents in random-walk fashion. Typically, these random-walks are chosen to be Markov chains that asymptotically sample from a desired target distribution, and play a critical role in the convergence of the optimization iterates. In this paper, we take a novel approach by replacing the standard linear Markovian token by one which follows a nonlinear Markov chain - namely the Self-Repellent Radom Walk (SRRW). Defined for any given 'base' Markov chain, the SRRW, parameterized by a positive scalar α, is less likely to transition to states that were highly visited in the past, thus the name. In the context of MCMC sampling on a graph, a recent breakthrough in Doshi et al. (2023) shows that the SRRW achieves O(1/α) decrease in the asymptotic variance for sampling. We propose the use of a 'generalized' version of the SRRW to drive token algorithms for distributed stochastic optimization in the form of stochastic approximation, termed SA-SRRW. We prove that the optimization iterate errors of the resulting SA-SRRW converge to zero almost surely and prove a central limit theorem, deriving the explicit form of the resulting asymptotic covariance matrix corresponding to iterate errors. This asymptotic covariance is always smaller than that of an algorithm driven by the base Markov chain and decreases at rate O(1/α^2) - the performance benefit of using SRRW thereby amplified in the stochastic optimization context. Empirical results support our theoretical findings.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Central Limit Theorem for Two-Timescale Stochastic Approximation with Markovian Noise: Theory and Applications
Authors:
Jie Hu,
Vishwaraj Doshi,
Do Young Eun
Abstract:
Two-timescale stochastic approximation (TTSA) is among the most general frameworks for iterative stochastic algorithms. This includes well-known stochastic optimization methods such as SGD variants and those designed for bilevel or minimax problems, as well as reinforcement learning like the family of gradient-based temporal difference (GTD) algorithms. In this paper, we conduct an in-depth asympt…
▽ More
Two-timescale stochastic approximation (TTSA) is among the most general frameworks for iterative stochastic algorithms. This includes well-known stochastic optimization methods such as SGD variants and those designed for bilevel or minimax problems, as well as reinforcement learning like the family of gradient-based temporal difference (GTD) algorithms. In this paper, we conduct an in-depth asymptotic analysis of TTSA under controlled Markovian noise via central limit theorem (CLT), uncovering the coupled dynamics of TTSA influenced by the underlying Markov chain, which has not been addressed by previous CLT results of TTSA only with Martingale difference noise. Building upon our CLT, we expand its application horizon of efficient sampling strategies from vanilla SGD to a wider TTSA context in distributed learning, thus broadening the scope of Hu et al. (2022). In addition, we leverage our CLT result to deduce the statistical properties of GTD algorithms with nonlinear function approximation using Markovian samples and show their identical asymptotic performance, a perspective not evident from current finite-time bounds.
△ Less
Submitted 13 February, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
The Gaussian log-Minkowski problem
Authors:
**rong Hu
Abstract:
The Gaussian log-Minkowski problem for symmetric convex bodies is solved by a variational argument, without the method of Lagrange multipliers, under the condition that the given measure is not concentrated on any great hypersphere.
The Gaussian log-Minkowski problem for symmetric convex bodies is solved by a variational argument, without the method of Lagrange multipliers, under the condition that the given measure is not concentrated on any great hypersphere.
△ Less
Submitted 16 March, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
On the maximal and minimal degree components of the cocenter of the cyclotomic KLR algebras
Authors:
Jun Hu,
Lei Shi
Abstract:
Let $\mathscr{R}_α^Λ$ be the cyclotomic KLR algebra associated to a symmetrizable Kac-Moody Lie algebra $\mathfrak{g}$ and polynomials $\{Q_{ij}(u,v)\}_{i,j\in I}$. Shan, Varagnolo and Vasserot show that, when the ground field $K$ has characteristic $0$, the degree $d$ component of the cocenter $Tr(\mathscr{R}_α^Λ)$ is nonzero only if $0\leq d\leq d_{Λ,α}$. In this paper we show that this holds tr…
▽ More
Let $\mathscr{R}_α^Λ$ be the cyclotomic KLR algebra associated to a symmetrizable Kac-Moody Lie algebra $\mathfrak{g}$ and polynomials $\{Q_{ij}(u,v)\}_{i,j\in I}$. Shan, Varagnolo and Vasserot show that, when the ground field $K$ has characteristic $0$, the degree $d$ component of the cocenter $Tr(\mathscr{R}_α^Λ)$ is nonzero only if $0\leq d\leq d_{Λ,α}$. In this paper we show that this holds true for arbitrary ground field $K$, arbitrary $\mathfrak{g}$ and arbitrary polynomials $\{Q_{ij}(u,v)\}_{i,j\in I}$. We generalize our earlier results on the $K$-linear generators of $Tr(\mathscr{R}_α^Λ), Tr(\mathscr{R}_α^Λ)_0, Tr(\mathscr{R}_α^Λ)_{d_{Λ,α}}$ to arbitrary ground field $K$. Moreover, we show that the dimension of the degree $0$ component $Tr(\mathscr{R}_α^Λ)_0$ is always equal to $\dim V(Λ)_{Λ-α}$, where $V(Λ)$ is the integrable highest weight $U(\mathfrak{g})$-module with highest weight $Λ$, and we obtain a basis for $Tr(\mathscr{R}_α^Λ)_0$.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
The Collisional Particle-In-Cell Method for the Vlasov-Maxwell-Landau Equations
Authors:
Rafael Bailo,
José A. Carrillo,
**gwei Hu
Abstract:
We introduce an extension of the particle-in-cell (PIC) method that captures the Landau collisional effects in the Vlasov-Maxwell-Landau equations. The method arises from a regularisation of the variational formulation of the Landau equation, leading to a discretisation of the collision operator that conserves mass, charge, momentum, and energy, while increasing the (regularised) entropy. The coll…
▽ More
We introduce an extension of the particle-in-cell (PIC) method that captures the Landau collisional effects in the Vlasov-Maxwell-Landau equations. The method arises from a regularisation of the variational formulation of the Landau equation, leading to a discretisation of the collision operator that conserves mass, charge, momentum, and energy, while increasing the (regularised) entropy. The collisional effects appear as a fully deterministic effective force, thus the method does not require any transport-collision splitting. The scheme can be used in arbitrary dimension, and for a general interaction, including the Coulomb case. We validate the scheme on scenarios such as the Landau dam**, the two-stream instability, and the Weibel instability, demonstrating its effectiveness in the numerical simulation of plasma.
△ Less
Submitted 31 March, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
The randomized Milstein scheme for stochastic Volterra integral equations with weakly singular kernels
Authors:
Zhaohang Wang,
Zhuoqi Liu,
Shuaibin Gao,
Junhao Hu
Abstract:
This paper focuses on the randomized Milstein scheme for approximating solutions to stochastic Volterra integral equations with weakly singular kernels, where the drift coefficients are non-differentiable. An essential component of the error analysis involves the utilization of randomized quadrature rules for stochastic integrals to avoid the Taylor expansion in drift coefficient functions. Finall…
▽ More
This paper focuses on the randomized Milstein scheme for approximating solutions to stochastic Volterra integral equations with weakly singular kernels, where the drift coefficients are non-differentiable. An essential component of the error analysis involves the utilization of randomized quadrature rules for stochastic integrals to avoid the Taylor expansion in drift coefficient functions. Finally, we implement the simulation of multiple singular stochastic integral in the numerical experiment by applying the Riemann-Stieltjes integral.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
An Augmented Lagrangian Primal-Dual Semismooth Newton Method for Multi-Block Composite Optimization
Authors:
Zhanwang Deng,
Kangkang Deng,
Jiang Hu,
Zaiwen Wen
Abstract:
In this paper, we develop a novel primal-dual semismooth Newton method for solving linearly constrained multi-block convex composite optimization problems. First, a differentiable augmented Lagrangian (AL) function is constructed by utilizing the Moreau envelopes of the nonsmooth functions. It enables us to derive an equivalent saddle point problem and establish the strong AL duality under the Sla…
▽ More
In this paper, we develop a novel primal-dual semismooth Newton method for solving linearly constrained multi-block convex composite optimization problems. First, a differentiable augmented Lagrangian (AL) function is constructed by utilizing the Moreau envelopes of the nonsmooth functions. It enables us to derive an equivalent saddle point problem and establish the strong AL duality under the Slater's condition. Consequently, a semismooth system of nonlinear equations is formulated to characterize the optimality of the original problem instead of the inclusion-form KKT conditions. We then develop a semismooth Newton method, called ALPDSN, which uses purely second-order steps and a nonmonotone line search based globalization strategy. Through a connection to the inexact first-order steps when the regularization parameter is sufficiently large, the global convergence of ALPDSN is established. Under the regularity conditions, partial smoothness, the local error bound, and the strict complementarity, we show that both the primal and the dual iteration sequences possess a superlinear convergence rate and provide concrete examples where these regularity conditions are met. Numerical results on the image restoration with two regularization terms and the corrected tensor nuclear norm problem are presented to demonstrate the high efficiency and robustness of our ALPDSN.
△ Less
Submitted 15 May, 2024; v1 submitted 2 December, 2023;
originally announced December 2023.
-
Pinching estimates of hypersurfaces by a generalized Gauss curvature flow
Authors:
**rong Hu,
** Zhang
Abstract:
A variant of the Gauss curvature flow for closed and convex hypersurfaces is considered. We reveal that if the initial hypersurface is pinched enough, then this property is preserved. Furthermore, based on some structure assumptions on the speed function of the shrinking flow, we show that the flow converges to a sphere. This may generalize the result of B. Chow\cite{CW85} to the possible non-homo…
▽ More
A variant of the Gauss curvature flow for closed and convex hypersurfaces is considered. We reveal that if the initial hypersurface is pinched enough, then this property is preserved. Furthermore, based on some structure assumptions on the speed function of the shrinking flow, we show that the flow converges to a sphere. This may generalize the result of B. Chow\cite{CW85} to the possible non-homogeneous curvature flows.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
Decentralized Douglas-Rachford splitting methods for smooth optimization over compact submanifolds
Authors:
Kangkang Deng,
Jiang Hu,
Hongxia Wang
Abstract:
We study decentralized smooth optimization problems over compact submanifolds. Recasting it as a composite optimization problem, we propose a decentralized Douglas-Rachford splitting algorithm, DDRS. When the proximal operator of the local loss function does not have a closed-form solution, an inexact version of DDRS, iDDRS, is also presented. Both algorithms rely on an ingenious integration of th…
▽ More
We study decentralized smooth optimization problems over compact submanifolds. Recasting it as a composite optimization problem, we propose a decentralized Douglas-Rachford splitting algorithm, DDRS. When the proximal operator of the local loss function does not have a closed-form solution, an inexact version of DDRS, iDDRS, is also presented. Both algorithms rely on an ingenious integration of the nonconvex Douglas-Rachford splitting algorithm with gradient tracking and manifold optimization. We show that our DDRS and iDDRS achieve the best-known convergence rate of $\mathcal{O}(1/K)$. The main challenge in the proof is how to handle the nonconvexity of the manifold constraint. To address this issue, we utilize the concept of proximal smoothness for compact submanifolds. This ensures that the projection onto the submanifold exhibits convexity-like properties, which allows us to control the consensus error across agents. Numerical experiments on the principal component analysis are conducted to demonstrate the effectiveness of our decentralized DRS compared with the state-of-the-art ones.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Machine Learning-Enhanced Aircraft Landing Scheduling under Uncertainties
Authors:
Yutian Pang,
Peng Zhao,
Jueming Hu,
Yongming Liu
Abstract:
This paper addresses aircraft delays, emphasizing their impact on safety and financial losses. To mitigate these issues, an innovative machine learning (ML)-enhanced landing scheduling methodology is proposed, aiming to improve automation and safety. Analyzing flight arrival delay scenarios reveals strong multimodal distributions and clusters in arrival flight time durations. A multi-stage conditi…
▽ More
This paper addresses aircraft delays, emphasizing their impact on safety and financial losses. To mitigate these issues, an innovative machine learning (ML)-enhanced landing scheduling methodology is proposed, aiming to improve automation and safety. Analyzing flight arrival delay scenarios reveals strong multimodal distributions and clusters in arrival flight time durations. A multi-stage conditional ML predictor enhances separation time prediction based on flight events. ML predictions are then integrated as safety constraints in a time-constrained traveling salesman problem formulation, solved using mixed-integer linear programming (MILP). Historical flight recordings and model predictions address uncertainties between successive flights, ensuring reliability. The proposed method is validated using real-world data from the Atlanta Air Route Traffic Control Center (ARTCC ZTL). Case studies demonstrate an average 17.2% reduction in total landing time compared to the First-Come-First-Served (FCFS) rule. Unlike FCFS, the proposed methodology considers uncertainties, instilling confidence in scheduling. The study concludes with remarks and outlines future research directions.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Robust and conservative dynamical low-rank methods for the Vlasov equation via a novel macro-micro decomposition
Authors:
Jack Coughlin,
**gwei Hu,
Uri Shumlak
Abstract:
Dynamical low-rank (DLR) approximation has gained interest in recent years as a viable solution to the curse of dimensionality in the numerical solution of kinetic equations including the Boltzmann and Vlasov equations. These methods include the projector-splitting and Basis-update & Galerkin (BUG) DLR integrators, and have shown promise at greatly improving the computational efficiency of kinetic…
▽ More
Dynamical low-rank (DLR) approximation has gained interest in recent years as a viable solution to the curse of dimensionality in the numerical solution of kinetic equations including the Boltzmann and Vlasov equations. These methods include the projector-splitting and Basis-update & Galerkin (BUG) DLR integrators, and have shown promise at greatly improving the computational efficiency of kinetic solutions. However, this often comes at the cost of conservation of charge, current and energy. In this work we show how a novel macro-micro decomposition may be used to separate the distribution function into two components, one of which carries the conserved quantities, and the other of which is orthogonal to them. We apply DLR approximation to the latter, and thereby achieve a clean and extensible approach to a conservative DLR scheme which retains the computational advantages of the base scheme. Moreover, our approach requires no change to the mechanics of the DLR approximation, so it is compatible with both the BUG family of integrators and the projector-splitting integrator which we use here. We describe a first-order integrator which can exactly conserve charge and either current or energy, as well as an integrator which exactly conserves charge and energy and exhibits second-order accuracy on our test problems. To highlight the flexibility of the proposed macro-micro decomposition, we implement a pair of velocity space discretizations, and verify the claimed accuracy and conservation properties on a suite of plasma benchmark problems.
△ Less
Submitted 10 April, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Nash-Moser iteration approach to the logarithmic gradient estimates and Liouville Properties of quasilinear elliptic equations on manifolds
Authors:
Jie He,
**gchen Hu,
Youde Wang
Abstract:
In this paper, we provide a new routine to employ the Nash-Moser iteration technique to analyze the local and global properties of positive solutions to the equation $$Δ_pv + a|\nabla v|^qv^r =0$$ on a complete Riemannian manifold with Ricci curvature bounded from below, where $p>1$, $q$, $r$ and $a$ are some real constants. Assuming certain conditions on $a,\, p,\, q$ and $r$, we can derive unive…
▽ More
In this paper, we provide a new routine to employ the Nash-Moser iteration technique to analyze the local and global properties of positive solutions to the equation $$Δ_pv + a|\nabla v|^qv^r =0$$ on a complete Riemannian manifold with Ricci curvature bounded from below, where $p>1$, $q$, $r$ and $a$ are some real constants. Assuming certain conditions on $a,\, p,\, q$ and $r$, we can derive universal and succinct Cheng-Yau type logarithmic gradient estimates for such solutions. In particular, we give the obvious expressions of constants in the logarithmic gradient estimate for entire solutions to the above equation (see \thmref{t10}). The gradient estimates enable us to obtain some Liouville-type theorems, Harnack inequalities and some local estimates near singularities for positive solutions. Some of our results are new even in the case the domain is an Euclidean space and $p=2$.
△ Less
Submitted 26 March, 2024; v1 submitted 5 November, 2023;
originally announced November 2023.
-
A robust version of the multipartite Hajnal--Szemerédi theorem
Authors:
Jie Han,
Jie Hu,
Donglei Yang
Abstract:
In this note we show the following strengthening of a multipartite version of the Hajnal--Szemerédi theorem. For an integer $r \ge 3$ and $γ> 0$, there exists a constant $C$ such that if $p\ge Cn^{-2/r}(\log n)^{1/{r \choose 2}}$ and $G$ is a balanced $r$-partite graph with each vertex class of size $n$ and $δ^\ast(G)\ge (1-1/r+γ)n$, then with high probability the random subgraph $G(p)$ of $G$ con…
▽ More
In this note we show the following strengthening of a multipartite version of the Hajnal--Szemerédi theorem. For an integer $r \ge 3$ and $γ> 0$, there exists a constant $C$ such that if $p\ge Cn^{-2/r}(\log n)^{1/{r \choose 2}}$ and $G$ is a balanced $r$-partite graph with each vertex class of size $n$ and $δ^\ast(G)\ge (1-1/r+γ)n$, then with high probability the random subgraph $G(p)$ of $G$ contains a $K_r$-factor. We also use it to derive corresponding transversal versions.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
A particle method for the multispecies Landau equation
Authors:
José A. Carrillo,
**gwei Hu,
Samuel Q. Van Fleet
Abstract:
The multispecies Landau collision operator describes the two-particle, small scattering angle or grazing collisions in a plasma made up of different species of particles such as electrons and ions. Recently, a structure preserving deterministic particle method arXiv:1910.03080 has been developed for the single species spatially homogeneous Landau equation. This method relies on a regularization of…
▽ More
The multispecies Landau collision operator describes the two-particle, small scattering angle or grazing collisions in a plasma made up of different species of particles such as electrons and ions. Recently, a structure preserving deterministic particle method arXiv:1910.03080 has been developed for the single species spatially homogeneous Landau equation. This method relies on a regularization of the Landau collision operator so that an approximate solution, which is a linear combination of Dirac delta distributions, is well-defined. Based on a weak form of the regularized Landau equation, the time dependent locations of the Dirac delta functions satisfy a system of ordinary differential equations. In this work, we extend this particle method to the multispecies case, and examine its conservation of mass, momentum, and energy, and decay of entropy properties. We show that the equilibrium distribution of the regularized multispecies Landau equation is a Maxwellian distribution, and state a critical condition on the regularization parameters that guarantees a species independent equilibrium temperature. A convergence study comparing an exact multispecies BKW solution to the particle solution shows approximately 2nd order accuracy. Important physical properties such as conservation, decay of entropy, and equilibrium distribution of the particle method are demonstrated with several numerical examples.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Distributed estimation of spiked eigenvalues in spiked population models
Authors:
Lu Yan,
Jiang Hu
Abstract:
The proliferation of science and technology has led to the prevalence of voluminous data sets that are distributed across multiple machines. It is an established fact that conventional statistical methodologies may be unfeasible in the analysis of such massive data sets due to prohibitively long computing durations, memory constraints, communication overheads, and confidentiality considerations. I…
▽ More
The proliferation of science and technology has led to the prevalence of voluminous data sets that are distributed across multiple machines. It is an established fact that conventional statistical methodologies may be unfeasible in the analysis of such massive data sets due to prohibitively long computing durations, memory constraints, communication overheads, and confidentiality considerations. In this paper, we propose distributed estimators of the spiked eigenvalues in spiked population models. The consistency and asymptotic normality of the distributed estimators are derived, and the statistical error analysis of the distributed estimators is provided as well. Compared to the estimation from the full sample, the proposed distributed estimation shares the same order of convergence. Simulation study and real data analysis indicate that the proposed distributed estimation and testing procedures have excellent properties in terms of estimation accuracy and stability as well as transmission efficiency.
△ Less
Submitted 22 October, 2023;
originally announced October 2023.
-
The torsion log-Minkowski problem
Authors:
**rong Hu
Abstract:
In this paper, we deal with the torsion log-Minkowski problem without symmetry assumptions via an approximation argument.
In this paper, we deal with the torsion log-Minkowski problem without symmetry assumptions via an approximation argument.
△ Less
Submitted 10 October, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Invariants of real vector bundles
Authors:
Jiahao Hu
Abstract:
For a compact smooth manifold with corners (or finite CW-complex) $X$, we can prescribe a finite set of spin or spin$^h$ manifolds (possibly with boundary) map** into it so that every real vector bundle over $X$ is determined, up to stable equivalence, by the Dirac indices of the real vector bundle when pulled-back onto those prescribed spin or spin$^h$ manifolds.
For a compact smooth manifold with corners (or finite CW-complex) $X$, we can prescribe a finite set of spin or spin$^h$ manifolds (possibly with boundary) map** into it so that every real vector bundle over $X$ is determined, up to stable equivalence, by the Dirac indices of the real vector bundle when pulled-back onto those prescribed spin or spin$^h$ manifolds.
△ Less
Submitted 8 December, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Stochastic equations with low regularity drifts
Authors:
**long Wei,
Junhao Hu,
Chenggui Yuan
Abstract:
By using the Itô-Tanaka trick, we prove the unique strong solvability as well as the gradient estimates for stochastic differential equations with irregular drifts in low regularity Lebesgue-Hölder space $L^q(0,T;{\mathcal C}_b^α({\mathbb R}^d))$ with $α\in(0,1)$ and $q\in (2/(1+α),2$). As applications, we show the unique weak and strong solvability for stochastic transport equations driven by the…
▽ More
By using the Itô-Tanaka trick, we prove the unique strong solvability as well as the gradient estimates for stochastic differential equations with irregular drifts in low regularity Lebesgue-Hölder space $L^q(0,T;{\mathcal C}_b^α({\mathbb R}^d))$ with $α\in(0,1)$ and $q\in (2/(1+α),2$). As applications, we show the unique weak and strong solvability for stochastic transport equations driven by the low regularity drift with $q\in (4/(2+α),2$) as well as the local Lipschitz estimate for stochastic strong solutions.
△ Less
Submitted 28 October, 2023; v1 submitted 30 September, 2023;
originally announced October 2023.
-
Numerical characterization of the hard Lefschetz classes of dimension two
Authors:
Jiajun Hu,
Jian Xiao
Abstract:
We study the numerical characterization of two dimensional hard Lefschetz classes given by the complete intersections of nef classes. In Shenfeld and van Handel's breakthrough work on the characterization of the extremals of the Alexandrov-Fenchel inequality for convex polytopes, they proposed an open question on the algebraic analogue of the characterization. By taking further inspiration from ou…
▽ More
We study the numerical characterization of two dimensional hard Lefschetz classes given by the complete intersections of nef classes. In Shenfeld and van Handel's breakthrough work on the characterization of the extremals of the Alexandrov-Fenchel inequality for convex polytopes, they proposed an open question on the algebraic analogue of the characterization. By taking further inspiration from our previous work with Shang on hard Lefschetz theorems for free line bundles, we formulate and refine the conjectural picture more precisely and settle the open question when the collection of nef classes is given by a rearrangement of supercriticality, which in particular includes the big nef collection as a special case. The main results enable us to refine some previous results and study the extremals of Hodge index inequality, and more importantly provide the first series of examples of hard Lefschetz classes of dimension two both in algebraic geometry and analytic geometry, in which one can allow nontrivial augmented base locus and thus drop the semi-ampleness or semi-positivity assumption. As a key ingredient of the numerical characterization, we establish a local Hodge index inequality for Lorentzian polynomials, which is the algebraic analogue of the local Alexandrov-Fenchel inequality obtained by Shenfeld-van Handel for convex polytopes. This result holds in broad contexts, e.g., it holds on a smooth projective variety, on a compact Kähler manifold and on a Lorentzian fan, which contains the Bergman fan of a matroid or polymatroid as a typical example.
△ Less
Submitted 10 September, 2023;
originally announced September 2023.