-
Green Multigrid Network
Authors:
Ye Lin,
Young Ju Lee,
Jiwei Jia
Abstract:
GreenLearning networks (GL) directly learn Green's function in physical space, making them an interpretable model for capturing unknown solution operators of partial differential equations (PDEs). For many PDEs, the corresponding Green's function exhibits asymptotic smoothness. In this paper, we propose a framework named Green Multigrid networks (GreenMGNet), an operator learning algorithm designe…
▽ More
GreenLearning networks (GL) directly learn Green's function in physical space, making them an interpretable model for capturing unknown solution operators of partial differential equations (PDEs). For many PDEs, the corresponding Green's function exhibits asymptotic smoothness. In this paper, we propose a framework named Green Multigrid networks (GreenMGNet), an operator learning algorithm designed for a class of asymptotically smooth Green's functions.
Compared with the pioneering GL, the new framework presents itself with better accuracy and efficiency, thereby achieving a significant improvement. GreenMGNet is composed of two technical novelties. First, Green's function is modeled as a piecewise function to take into account its singular behavior in some parts of the hyperplane. Such piecewise function is then approximated by a neural network with augmented output(AugNN) so that it can capture singularity accurately. Second, the asymptotic smoothness property of Green's function is used to leverage the Multi-Level Multi-Integration (MLMI) algorithm for both the training and inference stages. Several test cases of operator learning are presented to demonstrate the accuracy and effectiveness of the proposed method. On average, GreenMGNet achieves $3.8\%$ to $39.15\%$ accuracy improvement. To match the accuracy level of GL, GreenMGNet requires only about $10\%$ of the full grid data, resulting in a $55.9\%$ and $92.5\%$ reduction in training time and GPU memory cost for one-dimensional test problems, and a $37.7\%$ and $62.5\%$ reduction for two-dimensional test problems.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Solution of some tiling open problems of Propp, Lai, and some related results
Authors:
Seok Hyun Byun,
Mihai Ciucu,
Yi-Lin Lee
Abstract:
In this paper, we present a new version of the second author's factorization theorem for perfect matchings of symmetric graphs. We then use our result to solve four open problems of Propp on the enumeration of trimer tilings on the hexagonal lattice.
As another application, we obtain a semi-factorization result for the number of lozenge tilings of a large class of hexagonal regions with holes (o…
▽ More
In this paper, we present a new version of the second author's factorization theorem for perfect matchings of symmetric graphs. We then use our result to solve four open problems of Propp on the enumeration of trimer tilings on the hexagonal lattice.
As another application, we obtain a semi-factorization result for the number of lozenge tilings of a large class of hexagonal regions with holes (obtained by starting with an arbitrary symmetric hexagon with holes, and translating all the holes one unit lattice segment in the same direction). This in turn leads to the solution of two open problems posed by Lai, to an extension of a result due to Fulmek and Krattenthaler, and to exact enumeration formulas for some new families of hexagonal regions with holes.
Our result also allows us to find new, simpler proofs (and in one case, a new, simpler form) of some formulas due to Krattenthaler for the number of perfect matchings of Aztec rectangles with unit holes along a lattice diagonal.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
A Nonoverlap** Domain Decomposition Method for Extreme Learning Machines: Elliptic Problems
Authors:
Chang-Ock Lee,
Youngkyu Lee,
Byungeun Ryoo
Abstract:
Extreme learning machine (ELM) is a methodology for solving partial differential equations (PDEs) using a single hidden layer feed-forward neural network. It presets the weight/bias coefficients in the hidden layer with random values, which remain fixed throughout the computation, and uses a linear least squares method for training the parameters of the output layer of the neural network. It is kn…
▽ More
Extreme learning machine (ELM) is a methodology for solving partial differential equations (PDEs) using a single hidden layer feed-forward neural network. It presets the weight/bias coefficients in the hidden layer with random values, which remain fixed throughout the computation, and uses a linear least squares method for training the parameters of the output layer of the neural network. It is known to be much faster than Physics informed neural networks. However, classical ELM is still computationally expensive when a high level of representation is desired in the solution as this requires solving a large least squares system. In this paper, we propose a nonoverlap** domain decomposition method (DDM) for ELMs that not only reduces the training time of ELMs, but is also suitable for parallel computation. In numerical analysis, DDMs have been widely studied to reduce the time to obtain finite element solutions for elliptic PDEs through parallel computation. Among these approaches, nonoverlap** DDMs are attracting the most attention. Motivated by these methods, we introduce local neural networks, which are valid only at corresponding subdomains, and an auxiliary variable at the interface. We construct a system on the variable and the parameters of local neural networks. A Schur complement system on the interface can be derived by eliminating the parameters of the output layer. The auxiliary variable is then directly obtained by solving the reduced system after which the parameters for each local neural network are solved in parallel. A method for initializing the hidden layer parameters suitable for high approximation quality in large systems is also proposed. Numerical results that verify the acceleration performance of the proposed method with respect to the number of subdomains are presented.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Hida family of theta lift from U(1) to definite U(2)
Authors:
Yu-Sheng Lee
Abstract:
Let K/F be a CM extension satisfying the ordinary assumption for an odd prime p. In this article, we construct Hida families that interpolate theta lifts of algebraic Hecke characters to a definite unitary group U(2) defined from skew-Hermitian spaces over K, and show that the Hida family is primitive when the central L-value of the branch character of the family satisfies certain non-vanishing mo…
▽ More
Let K/F be a CM extension satisfying the ordinary assumption for an odd prime p. In this article, we construct Hida families that interpolate theta lifts of algebraic Hecke characters to a definite unitary group U(2) defined from skew-Hermitian spaces over K, and show that the Hida family is primitive when the central L-value of the branch character of the family satisfies certain non-vanishing modulo p conditions.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Two-level overlap** additive Schwarz preconditioner for training scientific machine learning applications
Authors:
Youngkyu Lee,
Alena Kopaničáková,
George Em Karniadakis
Abstract:
We introduce a novel two-level overlap** additive Schwarz preconditioner for accelerating the training of scientific machine learning applications. The design of the proposed preconditioner is motivated by the nonlinear two-level overlap** additive Schwarz preconditioner. The neural network parameters are decomposed into groups (subdomains) with overlap** regions. In addition, the network's…
▽ More
We introduce a novel two-level overlap** additive Schwarz preconditioner for accelerating the training of scientific machine learning applications. The design of the proposed preconditioner is motivated by the nonlinear two-level overlap** additive Schwarz preconditioner. The neural network parameters are decomposed into groups (subdomains) with overlap** regions. In addition, the network's feed-forward structure is indirectly imposed through a novel subdomain-wise synchronization strategy and a coarse-level training step. Through a series of numerical experiments, which consider physics-informed neural networks and operator learning approaches, we demonstrate that the proposed two-level preconditioner significantly speeds up the convergence of the standard (LBFGS) optimizer while also yielding more accurate machine learning models. Moreover, the devised preconditioner is designed to take advantage of model-parallel computations, which can further reduce the training time.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Hamilton-Jacobi Based Policy-Iteration via Deep Operator Learning
Authors:
Jae Yong Lee,
Yeoneung Kim
Abstract:
The framework of deep operator network (DeepONet) has been widely exploited thanks to its capability of solving high dimensional partial differential equations. In this paper, we incorporate DeepONet with a recently developed policy iteration scheme to numerically solve optimal control problems and the corresponding Hamilton--Jacobi--Bellman (HJB) equations. A notable feature of our approach is th…
▽ More
The framework of deep operator network (DeepONet) has been widely exploited thanks to its capability of solving high dimensional partial differential equations. In this paper, we incorporate DeepONet with a recently developed policy iteration scheme to numerically solve optimal control problems and the corresponding Hamilton--Jacobi--Bellman (HJB) equations. A notable feature of our approach is that once the neural network is trained, the solution to the optimal control problem and HJB equations with different terminal functions can be inferred quickly thanks to the unique feature of operator learning. Furthermore, a quantitative analysis of the accuracy of the algorithm is carried out via comparison principles of viscosity solutions. The effectiveness of the method is verified with various examples, including 10-dimensional linear quadratic regulator problems (LQRs).
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
An Unconstrained Formulation of Some Constrained Partial Differential Equations and its Application to Finite Neuron Methods
Authors:
Jiwei Jia,
Young Ju Lee,
Ruitong Shan
Abstract:
In this paper, we present a new framework how a PDE with constraints can be formulated into a sequence of PDEs with no constraints, whose solutions are convergent to the solution of the PDE with constraints. This framework is then used to build a novel finite neuron method to solve the 2nd order elliptic equations with the Dirichlet boundary condition. Our algorithm is the first algorithm, proven…
▽ More
In this paper, we present a new framework how a PDE with constraints can be formulated into a sequence of PDEs with no constraints, whose solutions are convergent to the solution of the PDE with constraints. This framework is then used to build a novel finite neuron method to solve the 2nd order elliptic equations with the Dirichlet boundary condition. Our algorithm is the first algorithm, proven to lead to shallow neural network solutions with an optimal H1 norm error. We show that a widely used penalized PDE, which imposes the Dirichlet boundary condition weakly can be interpreted as the first element of the sequence of PDEs within our framework. Furthermore, numerically, we show that it may not lead to the solution with the optimal H1 norm error bound in general. On the other hand, we theoretically demonstrate that the second and later elements of a sequence of PDEs can lead to an adequate solution with the optimal H1 norm error bound. A number of sample tests are performed to confirm the effectiveness of the proposed algorithm and the relevant theory.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Positivity and Maximum Principle Preserving Discontinuous Galerkin Finite Element Schemes for a Coupled Flow and Transport
Authors:
Shihua Gong,
Young-Ju Lee,
Yukun Li,
Yue Yu
Abstract:
We introduce a new concept of the locally conservative flux and investigate its relationship with the compatible discretization pioneered by Dawson, Sun and Wheeler [11]. We then demonstrate how the new concept of the locally conservative flux can play a crucial role in obtaining the L2 norm stability of the discontinuous Galerkin finite element scheme for the transport in the coupled system with…
▽ More
We introduce a new concept of the locally conservative flux and investigate its relationship with the compatible discretization pioneered by Dawson, Sun and Wheeler [11]. We then demonstrate how the new concept of the locally conservative flux can play a crucial role in obtaining the L2 norm stability of the discontinuous Galerkin finite element scheme for the transport in the coupled system with flow. In particular, the lowest order discontinuous Galerkin finite element for the transport is shown to inherit the positivity and maximum principle when the locally conservative flux is used, which has been elusive for many years in literature. The theoretical results established in this paper are based on the equivalence between Lesaint-Raviart discontinuous Galerkin scheme and Brezzi-Marini-Suli discontinuous Galerkin scheme for the linear hyperbolic system as well as the relationship between the Lesaint-Raviart discontinuous Galerkin scheme and the characteristic method along the streamline. Sample numerical experiments have also been performed to justify our theoretical findings
△ Less
Submitted 25 May, 2024;
originally announced May 2024.
-
Carleson measures for weighted Bergman--Zygmund spaces
Authors:
Hong Rae Cho,
Hyungwoon Koo,
Young Joo Lee,
Atte Pennanen,
Jouni Rättyä,
Fanglei Wu
Abstract:
For $0<p<\infty$, $Ψ:[0,\infty)\to(0,\infty)$ and a finite positive Borel measure $μ$ on the unit disc $\mathbb{D}$, the Lebesgue--Zygmund space $L^p_{μ,Ψ}$ consists of all measurable functions $f$ such that $\lVert f \rVert_{L_{μ, Ψ}^{p}}^p =\int_{\mathbb{D}}|f|^pΨ(|f|)\,dμ< \infty$. For an integrable radial function $ω$ on $\mathbb{D}$, the corresponding weighted Bergman-Zygmund space…
▽ More
For $0<p<\infty$, $Ψ:[0,\infty)\to(0,\infty)$ and a finite positive Borel measure $μ$ on the unit disc $\mathbb{D}$, the Lebesgue--Zygmund space $L^p_{μ,Ψ}$ consists of all measurable functions $f$ such that $\lVert f \rVert_{L_{μ, Ψ}^{p}}^p =\int_{\mathbb{D}}|f|^pΨ(|f|)\,dμ< \infty$. For an integrable radial function $ω$ on $\mathbb{D}$, the corresponding weighted Bergman-Zygmund space $A_{ω, Ψ}^{p}$ is the set of all analytic functions in $L_{μ, Ψ}^{p}$ with $dμ=ω\,dA$.
The purpose of the paper is to characterize bounded (and compact) embeddings $A_{ω,Ψ}^{p}\subset L_{μ, Φ}^{q}$, when $0<p\le q<\infty$, the functions $Ψ$ and $Φ$ are essential monotonic, and $Ψ,Φ,ω$ satisfy certain doubling properties. The tools developed on the way to the main results are applied to characterize bounded and compact integral operators acting from $A^p_{ω,Ψ}$ to $A^q_{ν,Φ}$, provided $ν$ admits the same doubling property as $ω$.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Hawkes Models And Their Applications
Authors:
Patrick J. Laub,
Young Lee,
Philip K. Pollett,
Thomas Taimre
Abstract:
The Hawkes process is a model for counting the number of arrivals to a system which exhibits the self-exciting property - that one arrival creates a heightened chance of further arrivals in the near future. The model, and its generalizations, have been applied in a plethora of disparate domains, though two particularly developed applications are in seismology and in finance. As the original model…
▽ More
The Hawkes process is a model for counting the number of arrivals to a system which exhibits the self-exciting property - that one arrival creates a heightened chance of further arrivals in the near future. The model, and its generalizations, have been applied in a plethora of disparate domains, though two particularly developed applications are in seismology and in finance. As the original model is elegantly simple, generalizations have been proposed which: track marks for each arrival, are multivariate, have a spatial component, are driven by renewal processes, treat time as discrete, and so on. This paper creates a cohesive review of the traditional Hawkes model and the modern generalizations, providing details on their construction, simulation algorithms, and giving key references to the appropriate literature for a detailed treatment.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More
Authors:
Fanchen Bu,
Hyeonsoo Jo,
Soo Yong Lee,
Sungsoo Ahn,
Kijung Shin
Abstract:
Combinatorial optimization (CO) is naturally discrete, making machine learning based on differentiable optimization inapplicable. Karalias & Loukas (2020) adapted the probabilistic method to incorporate CO into differentiable optimization. Their work ignited the research on unsupervised learning for CO, composed of two main components: probabilistic objectives and derandomization. However, each co…
▽ More
Combinatorial optimization (CO) is naturally discrete, making machine learning based on differentiable optimization inapplicable. Karalias & Loukas (2020) adapted the probabilistic method to incorporate CO into differentiable optimization. Their work ignited the research on unsupervised learning for CO, composed of two main components: probabilistic objectives and derandomization. However, each component confronts unique challenges. First, deriving objectives under various conditions (e.g., cardinality constraints and minimum) is nontrivial. Second, the derandomization process is underexplored, and the existing derandomization methods are either random sampling or naive rounding. In this work, we aim to tackle prevalent (i.e., commonly involved) conditions in unsupervised CO. First, we concretize the targets for objective construction and derandomization with theoretical justification. Then, for various conditions commonly involved in different CO problems, we derive nontrivial objectives and derandomization to meet the targets. Finally, we apply the derivations to various CO problems. Via extensive experiments on synthetic and real-world graphs, we validate the correctness of our derivations and show our empirical superiority w.r.t. both optimization quality and speed.
△ Less
Submitted 23 May, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
The zeta-determinant of the Dirichlet-to-Neumann operator of the Steklov Problem on forms
Authors:
Klaus Kirsten,
Yoonweon Lee
Abstract:
On a compact Riemannian manifold $M$ with boundary $Y$, we express the log of the zeta-determinant of the Dirichlet-to-Neumann operator acting on $q$-forms on $Y$ as the difference of the log of the zeta-determinant of the Laplacian on $q$-forms on $M$ with absolute boundary conditions and that of the Laplacian with Dirichlet boundary conditions with some additional terms which are expressed by cu…
▽ More
On a compact Riemannian manifold $M$ with boundary $Y$, we express the log of the zeta-determinant of the Dirichlet-to-Neumann operator acting on $q$-forms on $Y$ as the difference of the log of the zeta-determinant of the Laplacian on $q$-forms on $M$ with absolute boundary conditions and that of the Laplacian with Dirichlet boundary conditions with some additional terms which are expressed by curvature tensors. When the dimension of $M$ is $2$ or $3$, we compute these terms explicitly. We also discuss the value of the zeta function at zero associated to the Dirichlet-to-Neumann operator by using a conformal rescaling method. As an application, we recover the result of the conformal invariance obtained in C. Guillarmou and L. Guillopé, The determinant of the Dirichlet-to-Neumann map for surfaces with boundary, Int. Math. Res. Not. IMRN 2007, no. 22, Art. ID rnm099, when the dimension of $M$ is $2$.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Off-diagonally symmetric domino tilings of the Aztec diamond of odd order
Authors:
Yi-Lin Lee
Abstract:
We study the enumeration of off-diagonally symmetric domino tilings of odd-order Aztec diamonds in two directions: (1) with one boundary defect, and (2) with maximally-many zeroes on the diagonal. In the first direction, we prove a symmetry property which states that the numbers of off-diagonally symmetric domino tilings of the Aztec diamond of order $2n-1$ are equal when the boundary defect is at…
▽ More
We study the enumeration of off-diagonally symmetric domino tilings of odd-order Aztec diamonds in two directions: (1) with one boundary defect, and (2) with maximally-many zeroes on the diagonal. In the first direction, we prove a symmetry property which states that the numbers of off-diagonally symmetric domino tilings of the Aztec diamond of order $2n-1$ are equal when the boundary defect is at the $k$th position and the $(2n-k)$th position on the boundary, respectively. This symmetry property proves a special case of a recent conjecture by Behrend, Fischer, and Koutschan.
In the second direction, a Pfaffian formula is obtained for the number of "nearly" off-diagonally symmetric domino tilings of odd-order Aztec diamonds, where the entries of the Pfaffian satisfy a simple recurrence relation. The numbers of domino tilings mentioned in the above two directions do not seem to have a simple product formula, but we show that these numbers satisfy simple matrix equations in which the entries of the matrix are given by Delannoy numbers. The proof of these results involves the method of non-intersecting lattice paths and a modification of Stembridge's Pfaffian formula for families of non-intersecting lattice paths. Finally, we propose conjectures concerning the log-concavity and asymptotic behavior of the number of off-diagonally symmetric domino tilings of odd-order Aztec diamonds.
△ Less
Submitted 21 April, 2024; v1 submitted 13 April, 2024;
originally announced April 2024.
-
An explicit formula for the orbital integrals on the spherical Hecke algebra of $\mathrm{GL}_3$
Authors:
Sungmun Cho,
Yuchan Lee
Abstract:
We provide the explicit formula for orbital integrals associated with elliptic regular semisimple elements in $\mathrm{GL}_n(F) \cap \mathrm{M}_n(\mathfrak{o})$ and associated with arbitrary elements of the spherical Hecke algebra of $\mathrm{GL}_n(F)$ when $n=2, 3$, using results of [CKL]. Here $F$ is a non-Archimedean local field of any characteristic with $\mathfrak{o}$ its ring of integers.
We provide the explicit formula for orbital integrals associated with elliptic regular semisimple elements in $\mathrm{GL}_n(F) \cap \mathrm{M}_n(\mathfrak{o})$ and associated with arbitrary elements of the spherical Hecke algebra of $\mathrm{GL}_n(F)$ when $n=2, 3$, using results of [CKL]. Here $F$ is a non-Archimedean local field of any characteristic with $\mathfrak{o}$ its ring of integers.
△ Less
Submitted 8 April, 2024; v1 submitted 6 April, 2024;
originally announced April 2024.
-
Sampling error mitigation through spectrum smoothing in ensemble data assimilation
Authors:
Bosu Choi,
Yoonsang Lee
Abstract:
In data assimilation, an ensemble provides a nonintrusive way to evolve a probability density described by a nonlinear prediction model. Although a large ensemble size is required for statistical accuracy, the ensemble size is typically limited to a small number due to the computational cost of running the prediction model, which leads to a sampling error. Several methods, such as localization, ex…
▽ More
In data assimilation, an ensemble provides a nonintrusive way to evolve a probability density described by a nonlinear prediction model. Although a large ensemble size is required for statistical accuracy, the ensemble size is typically limited to a small number due to the computational cost of running the prediction model, which leads to a sampling error. Several methods, such as localization, exist to mitigate the sampling error, often requiring problem-dependent fine-tuning and design. This work introduces another sampling error mitigation method using a smoothness constraint in the Fourier space. In particular, this work smoothes out the spectrum of the system to increase the stability and accuracy even under a small ensemble size. The efficacy of the new idea is validated through a suite of stringent test problems, including Lorenz 96 and Kuramoto-Sivashinsky turbulence models.
△ Less
Submitted 29 March, 2024;
originally announced April 2024.
-
Improving the Bit Complexity of Communication for Distributed Convex Optimization
Authors:
Mehrdad Ghadiri,
Yin Tat Lee,
Swati Padmanabhan,
William Swartworth,
David Woodruff,
Guanghao Ye
Abstract:
We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank app…
▽ More
We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank approximation; for a number of these fundamental problems our bounds are nearly optimal, as proven by our lower bounds.
Among our techniques, we use the notion of block leverage scores, which have been relatively unexplored in this context, as well as drop** all but the ``middle" bits in Richardson-style algorithms. We also introduce a new communication problem for accurately approximating inner products and establish a lower bound using the spherical Radon transform. Our lower bound can be used to show the first separation of linear programming and linear systems in the distributed model when the number of constraints is polynomial, addressing an open question in prior work.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Stochastic approach for elliptic problems in perforated domains
Authors:
Jihun Han,
Yoonsang Lee
Abstract:
A wide range of applications in science and engineering involve a PDE model in a domain with perforations, such as perforated metals or air filters. Solving such perforated domain problems suffers from computational challenges related to resolving the scale imposed by the geometries of perforations. We propose a neural network-based mesh-free approach for perforated domain problems. The method is…
▽ More
A wide range of applications in science and engineering involve a PDE model in a domain with perforations, such as perforated metals or air filters. Solving such perforated domain problems suffers from computational challenges related to resolving the scale imposed by the geometries of perforations. We propose a neural network-based mesh-free approach for perforated domain problems. The method is robust and efficient in capturing various configuration scales, including the averaged macroscopic behavior of the solution that involves a multiscale nature induced by small perforations. The new approach incorporates the derivative-free loss method that uses a stochastic representation or the Feynman-Kac formulation. In particular, we implement the Neumann boundary condition for the derivative-free loss method to handle the interface between the domain and perforations. A suite of stringent numerical tests is provided to support the proposed method's efficacy in handling various perforation scales.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Effective Results in The Metric Theory of Quantitative Diophantine Approximation
Authors:
Ying Wai Lee,
Andrew Scoones
Abstract:
Many results related to quantitative problems in the metric theory of Diophantine approximation are asymptotic, such as the number of rational solutions to certain inequalities grows with the same rate almost everywhere modulo an asymptotic error term. The error term incorporates an implicit constant that varies from one point to another. This means that applications of these results does not give…
▽ More
Many results related to quantitative problems in the metric theory of Diophantine approximation are asymptotic, such as the number of rational solutions to certain inequalities grows with the same rate almost everywhere modulo an asymptotic error term. The error term incorporates an implicit constant that varies from one point to another. This means that applications of these results does not give concrete bounds when applied to, say a finite sum, or when applied to counting the number of solutions up to a finite point for a given inequality. This paper addresses this problem and makes the tools and their results effective, by making the implicit constant explicit outside of an exceptional subset of Lebesgue measure at most $δ>0$, an arbitrarily small constant chosen in advance. We deduce from this the fully effective results for Schmidt's Theorem, quantitative Koukoulopoulos-Maynard Theorem and quantitative results on $M_{0}$-sets; we also provide effective results regarding statistics of normal numbers and strong law of large numbers.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Structure-Preserving Operator Learning: Modeling the Collision Operator of Kinetic Equations
Authors:
Jae Yong Lee,
Steffen Schotthöfer,
Tianbai Xiao,
Sebastian Krumscheid,
Martin Frank
Abstract:
This work explores the application of deep operator learning principles to a problem in statistical physics. Specifically, we consider the linear kinetic equation, consisting of a differential advection operator and an integral collision operator, which is a powerful yet expensive mathematical model for interacting particle systems with ample applications, e.g., in radiation transport. We investig…
▽ More
This work explores the application of deep operator learning principles to a problem in statistical physics. Specifically, we consider the linear kinetic equation, consisting of a differential advection operator and an integral collision operator, which is a powerful yet expensive mathematical model for interacting particle systems with ample applications, e.g., in radiation transport. We investigate the capabilities of the Deep Operator network (DeepONet) approach to modelling the high dimensional collision operator of the linear kinetic equation. This integral operator has crucial analytical structures that a surrogate model, e.g., a DeepONet, needs to preserve to enable meaningful physical simulation. We propose several DeepONet modifications to encapsulate essential structural properties of this integral operator in a DeepONet model. To be precise, we adapt the architecture of the trunk-net so the DeepONet has the same collision invariants as the theoretical kinetic collision operator, thus preserving conserved quantities, e.g., mass, of the modeled many-particle system. Further, we propose an entropy-inspired data-sampling method tailored to train the modified DeepONet surrogates without requiring an excessive expensive simulation-based data generation.
△ Less
Submitted 26 February, 2024;
originally announced February 2024.
-
The number of automorphic representations of $\mathrm{GL}_2$ with exceptional eigenvalues
Authors:
Dohoon Choi,
Min Lee,
Youngmin Lee,
Subong Lim
Abstract:
We obtain an upper bound for the dimension of the cuspidal automorphic forms for $\mathrm{GL}_2$ over a number field, whose archimedean local representations are not tempered. More precisely, we prove the following result.
Let $F$ be a number field and $\mathbb{A}_{F}$ be the ring of adeles of $F$. Let $\mathcal{O}_{F}$ be the ring of integers of $F$. Let $\mathfrak{X}_{F,\mathrm{ex}}$ be the se…
▽ More
We obtain an upper bound for the dimension of the cuspidal automorphic forms for $\mathrm{GL}_2$ over a number field, whose archimedean local representations are not tempered. More precisely, we prove the following result.
Let $F$ be a number field and $\mathbb{A}_{F}$ be the ring of adeles of $F$. Let $\mathcal{O}_{F}$ be the ring of integers of $F$. Let $\mathfrak{X}_{F,\mathrm{ex}}$ be the set of irreducible cuspidal automorphic representations $π$ of $\mathrm{GL}_2(\mathbb{A}_{F})$ with the trivial central character such that for each archimedean place $v$ of $F$, the local representation of $π$ at $v$ is an unramified principal series and is not tempered. For an ideal $J$ of $\mathcal{O}_{F}$, let $\mathrm{K}_{0}(J)$ be the subgroup of $\mathrm{GL}_2(\mathbb{A}_{F})$ corresponding to $Γ_0(J) \subset \mathrm{SL}_2(\mathcal{O}_F)$. Let $r_1$ be the number of real embeddings of $F$ and $r_2$ be the number of conjugate pairs of complex embeddings of $F$. Using the Arthur-Selberg trace formula, we have \begin{equation*}
\sum_{π\in \mathfrak{X}_{F,\mathrm{ex}}} \dim π^{\mathrm{K}_0(J)}
\ll_{F} \frac{[\mathrm{SL}_2(\mathcal{O}_{F}) : Γ_0(J)]}{(\log (N_{F/\mathbb{Q}}(J)))^{2r_1+3r_2}} \quad \text{ as } \quad |N_{F/\mathbb{Q}}(J)|\to \infty.
\end{equation*} From this result, we obtain the result on an upper bound for the number of Hecke-Maass cusp forms of weight $0$ on $Γ_0(N)$ which do not satisfy the Selberg eigenvalue conjecture.
△ Less
Submitted 18 February, 2024;
originally announced February 2024.
-
The finitude of tamely ramified pro-$p$ extensions of number fields with cyclic $p$-class groups
Authors:
Yoon** Lee,
Donghyeok Lim
Abstract:
Let $p$ be an odd prime and $F$ be a number field whose $p$-class group is cyclic. Let $F_{\{\mathfrak{q}\}}$ be the maximal pro-$p$ extension of $F$ which is unramified outside a single non-$p$-adic prime ideal $\mathfrak{q}$ of $F$. In this work, we study the finitude of the Galois group $G_{\{\mathfrak{q}\}}(F)$ of $F_{\{\mathfrak{q}\}}$ over $F$. We prove that $G_{\{\mathfrak{q}\}}(F)$ is fini…
▽ More
Let $p$ be an odd prime and $F$ be a number field whose $p$-class group is cyclic. Let $F_{\{\mathfrak{q}\}}$ be the maximal pro-$p$ extension of $F$ which is unramified outside a single non-$p$-adic prime ideal $\mathfrak{q}$ of $F$. In this work, we study the finitude of the Galois group $G_{\{\mathfrak{q}\}}(F)$ of $F_{\{\mathfrak{q}\}}$ over $F$. We prove that $G_{\{\mathfrak{q}\}}(F)$ is finite for the majority of $\mathfrak{q}$'s such that the generator rank of $G_{\{\mathfrak{q}\}}(F)$ is two, provided that for $p = 3$, $F$ is not a complex quartic field containing the primitive third roots of unity.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Learning time-dependent PDE via graph neural networks and deep operator network for robust accuracy on irregular grids
Authors:
Sung Woong Cho,
Jae Yong Lee,
Hyung Ju Hwang
Abstract:
Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions…
▽ More
Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions as inputs and outputs, enabling real-time predictions as surrogate models for solution operators. There has also been significant progress in the research on surrogate models based on graph neural networks (GNNs), specifically targeting the dynamics in time-dependent PDEs. In this paper, we propose GraphDeepONet, an autoregressive model based on GNNs, to effectively adapt DeepONet, which is well-known for successful operator learning. GraphDeepONet exhibits robust accuracy in predicting solutions compared to existing GNN-based PDE solver models. It maintains consistent performance even on irregular grids, leveraging the advantages inherited from DeepONet and enabling predictions on arbitrary grids. Additionally, unlike traditional DeepONet and its variants, GraphDeepONet enables time extrapolation for time-dependent PDE solutions. We also provide theoretical analysis of the universal approximation capability of GraphDeepONet in approximating continuous operators across arbitrary time intervals.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Minimal grid diagrams of the prime knots with crossing number 13 and arc index 13
Authors:
Hwa Jeong Lee,
Yoonsang Lee,
Chanmin Lee,
Yeseo Park,
Hun Kim,
Gyo Taek **
Abstract:
We give a list of minimal grid diagrams of the 13 crossing prime nonalternating knots which have arc index 13. There are 9,988 prime knots with crossing number 13. Among them 4,878 are alternating and have arc index 15. Among the other nonalternating knots, 49, 399, 1,412 and 3,250 have arc index 10, 11, 12, and 13, respectively. We used the Dowker-Thistlethwaite code of the 3,250 knots provided b…
▽ More
We give a list of minimal grid diagrams of the 13 crossing prime nonalternating knots which have arc index 13. There are 9,988 prime knots with crossing number 13. Among them 4,878 are alternating and have arc index 15. Among the other nonalternating knots, 49, 399, 1,412 and 3,250 have arc index 10, 11, 12, and 13, respectively. We used the Dowker-Thistlethwaite code of the 3,250 knots provided by the program Knotscape to generate spanning trees of the corresponding knot diagrams to obtain minimal arc presentations in the form of grid diagrams.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
The global Cauchy problem for the Euler-Riesz equations
Authors:
Young-Pil Choi,
**wook Jung,
Yoonjung Lee
Abstract:
We completely resolve the global Cauchy problem for the multi-dimensional Euler-Riesz equations, where the interaction forcing is given by $\nabla (-Δ)^{-σ/2}ρ$ for some $σ\in (0,2)$. We construct the global-in-time unique solution to the Euler-Riesz system in a $H^s$ Sobolev space under a smallness assumption on the initial density and a dispersive spectral condition on the initial velocity. More…
▽ More
We completely resolve the global Cauchy problem for the multi-dimensional Euler-Riesz equations, where the interaction forcing is given by $\nabla (-Δ)^{-σ/2}ρ$ for some $σ\in (0,2)$. We construct the global-in-time unique solution to the Euler-Riesz system in a $H^s$ Sobolev space under a smallness assumption on the initial density and a dispersive spectral condition on the initial velocity. Moreover, we investigate the algebraic time decay of convergences for the constructed solutions. Our results cover the both attractive and repulsive cases as well as the whole regime $σ\in (0,2)$.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Sub-Optimal Fast Fourier Series Approximation for Initial Trajectory Design
Authors:
Caleb Gunsaulus,
Carl De Vries,
William Brown,
Youngro Lee,
Madhusudan Vijayakumar,
Ossama Abdelkhalik
Abstract:
The Finite Fourier Series (FFS) Shape-Based (SB) trajectory approximation method has been used to rapidly generate initial trajectories that satisfy the dynamics, trajectory boundary conditions, and limitation on maximum thrust acceleration. The FFS SB approach solves a nonlinear programming problem (NLP) in searching for feasible trajectories. This paper extends the development of the FFS SB appr…
▽ More
The Finite Fourier Series (FFS) Shape-Based (SB) trajectory approximation method has been used to rapidly generate initial trajectories that satisfy the dynamics, trajectory boundary conditions, and limitation on maximum thrust acceleration. The FFS SB approach solves a nonlinear programming problem (NLP) in searching for feasible trajectories. This paper extends the development of the FFS SB approach to generate sub optimal solutions. Specifically, the objective function of the NLP problem is modified to include also a measure for the time of flight. Numerical results presented in this paper show several solutions that differ from those of the original FFS SB ones. The sub-optimal trajectories generated using a time of flight minimization are shown to be physically feasible trajectories and potential candidates for direct solvers.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Scattering for the dispersion managed nonlinear Schrödinger equation
Authors:
Mi-Ran Choi,
Kiyeon Lee,
Young-Ran Lee
Abstract:
We consider the dispersion managed nonlinear Schrdinger equations with quintic and cubic nonlinearities in one and two dimensions, respectively. We prove the global well-posedness and scattering in $L_x^2$ for small initial data employing the $U^p$ and $V^p$ spaces.
We consider the dispersion managed nonlinear Schrdinger equations with quintic and cubic nonlinearities in one and two dimensions, respectively. We prove the global well-posedness and scattering in $L_x^2$ for small initial data employing the $U^p$ and $V^p$ spaces.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations
Authors:
Johann Rudi,
Youngjun Lee,
Aidan H. Chadha,
Mohamed Wahib,
Klaus Weide,
Jared P. O'Neal,
Anshu Dubey
Abstract:
CG-Kit is a new code generation toolkit that we propose as a solution for portability and maintainability for scientific computing applications. The development of CG-Kit is rooted in the urgent need created by the shifting landscape of high-performance computing platforms and the algorithmic complexities of a particular large-scale multiphysics application: Flash-X. This combination leads to uniq…
▽ More
CG-Kit is a new code generation toolkit that we propose as a solution for portability and maintainability for scientific computing applications. The development of CG-Kit is rooted in the urgent need created by the shifting landscape of high-performance computing platforms and the algorithmic complexities of a particular large-scale multiphysics application: Flash-X. This combination leads to unique challenges including handling an existing large code base in Fortran and/or C/C++, subdivision of code into a great variety of units supporting a wide range of physics and numerical methods, different parallelization techniques for distributed- and shared-memory systems and accelerator devices, and heterogeneity of computing platforms requiring coexisting variants of parallel algorithms. The challenges demand that developers determine custom abstractions and granularity for code generation. CG-Kit tackles this with standalone tools that can be combined into highly specific and, we argue, highly effective portability and maintainability tool chains. Here we present the design of our new tools: parametrized source trees, control flow graphs, and recipes. The tools are implemented in Python. Although the tools are agnostic to the programming language of the source code, we focus on C/C++ and Fortran. Code generation experiments demonstrate the generation of variants of parallel algorithms: first, multithreaded variants of the basic AXPY operation (scalar-vector addition and vector-vector multiplication) to introduce the application of CG-Kit tool chains; and second, variants of parallel algorithms within a hydrodynamics solver, called Spark, from Flash-X that operates on block-structured adaptive meshes. In summary, code generated by CG-Kit achieves a reduction by over 60% of the original C/C++/Fortran source code.
△ Less
Submitted 6 January, 2024;
originally announced January 2024.
-
HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork
Authors:
Jae Yong Lee,
Sung Woong Cho,
Hyung Ju Hwang
Abstract:
Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear map**s between function spaces. However, the DeepONet requires many parameters a…
▽ More
Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear map**s between function spaces. However, the DeepONet requires many parameters and has a high computational cost when learning operators, particularly those with complex (discontinuous or non-smooth) target functions. This study proposes HyperDeepONet, which uses the expressive power of the hypernetwork to enable the learning of a complex operator with a smaller set of parameters. The DeepONet and its variant models can be thought of as a method of injecting the input function information into the target function. From this perspective, these models can be viewed as a particular case of HyperDeepONet. We analyze the complexity of DeepONet and conclude that HyperDeepONet needs relatively lower complexity to obtain the desired accuracy for operator learning. HyperDeepONet successfully learned various operators with fewer computational resources compared to other benchmarks.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
On a Kostant section for the unitary group
Authors:
Yuchan Lee
Abstract:
For the unitary group defined over the ring of integers in a non Archimedean local field, we give a correction for a Kostant section provided in G.Laumon and B.C. Ngô's paper; Le lemme fondamental pour les groupes unitaires.
For the unitary group defined over the ring of integers in a non Archimedean local field, we give a correction for a Kostant section provided in G.Laumon and B.C. Ngô's paper; Le lemme fondamental pour les groupes unitaires.
△ Less
Submitted 17 December, 2023; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Global existence versus finite time blowup dichotomy for the dispersion managed NLS
Authors:
Mi-Ran Choi,
Younghun Hong,
Young-Ran Lee
Abstract:
We consider the Gabitov-Turitsyn equation or the dispersion managed nonlinear Schrödinger equation of a power-type nonlinearity
\[
i\partial_t u+ d_\text{av} \partial_x^2u+\int_0^1 e^{-ir\partial_x^2}\big(|e^{ir\partial_x^2}u|^{p-1}e^{ir\partial_x^2}u\big)dr=0
\] and prove the global existence versus finite time blowup dichotomy for the mass-supercritical cases, that is, $p>9$.
We consider the Gabitov-Turitsyn equation or the dispersion managed nonlinear Schrödinger equation of a power-type nonlinearity
\[
i\partial_t u+ d_\text{av} \partial_x^2u+\int_0^1 e^{-ir\partial_x^2}\big(|e^{ir\partial_x^2}u|^{p-1}e^{ir\partial_x^2}u\big)dr=0
\] and prove the global existence versus finite time blowup dichotomy for the mass-supercritical cases, that is, $p>9$.
△ Less
Submitted 25 June, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Autoregressive Renaissance in Neural PDE Solvers
Authors:
Yolanne Yi Ran Lee
Abstract:
Recent developments in the field of neural partial differential equation (PDE) solvers have placed a strong emphasis on neural operators. However, the paper "Message Passing Neural PDE Solver" by Brandstetter et al. published in ICLR 2022 revisits autoregressive models and designs a message passing graph neural network that is comparable with or outperforms both the state-of-the-art Fourier Neural…
▽ More
Recent developments in the field of neural partial differential equation (PDE) solvers have placed a strong emphasis on neural operators. However, the paper "Message Passing Neural PDE Solver" by Brandstetter et al. published in ICLR 2022 revisits autoregressive models and designs a message passing graph neural network that is comparable with or outperforms both the state-of-the-art Fourier Neural Operator and traditional classical PDE solvers in its generalization capabilities and performance. This blog post delves into the key contributions of this work, exploring the strategies used to address the common problem of instability in autoregressive models and the design choices of the message passing graph neural network architecture.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Balanced Group Convolution: An Improved Group Convolution Based on Approximability Estimates
Authors:
Youngkyu Lee,
Jongho Park,
Chang-Ock Lee
Abstract:
The performance of neural networks has been significantly improved by increasing the number of channels in convolutional layers. However, this increase in performance comes with a higher computational cost, resulting in numerous studies focused on reducing it. One promising approach to address this issue is group convolution, which effectively reduces the computational cost by grou** channels. H…
▽ More
The performance of neural networks has been significantly improved by increasing the number of channels in convolutional layers. However, this increase in performance comes with a higher computational cost, resulting in numerous studies focused on reducing it. One promising approach to address this issue is group convolution, which effectively reduces the computational cost by grou** channels. However, to the best of our knowledge, there has been no theoretical analysis on how well the group convolution approximates the standard convolution. In this paper, we mathematically analyze the approximation of the group convolution to the standard convolution with respect to the number of groups. Furthermore, we propose a novel variant of the group convolution called balanced group convolution, which shows a higher approximation with a small additional computational cost. We provide experimental results that validate our theoretical findings and demonstrate the superior performance of the balanced group convolution over other variants of group convolution.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
On the Statistical Foundations of H-likelihood for Unobserved Random Variables
Authors:
Hangbin Lee,
Youngjo Lee
Abstract:
The maximum likelihood estimation is widely used for statistical inferences. This paper aims to reformulate Lee and Nelder's (1996) h-likelihood, so that the maximum h-likelihood estimator resembles the maximum likelihood estimator of the classical likelihood. We establish the statistical foundations of the new h-likelihood. This extends classical likelihood theories to embrace broader class of st…
▽ More
The maximum likelihood estimation is widely used for statistical inferences. This paper aims to reformulate Lee and Nelder's (1996) h-likelihood, so that the maximum h-likelihood estimator resembles the maximum likelihood estimator of the classical likelihood. We establish the statistical foundations of the new h-likelihood. This extends classical likelihood theories to embrace broader class of statistical models with random parameters. Maximization of the h-likelihood yields asymptotically optimal estimators for both fixed and random parameters achieving the generalized Cramér-Rao lower bound, while providing computationally efficient fitting algorithms. Furthermore, we explore asymptotic theory when the consistency of either fixed parameter estimation or random parameter prediction is violated. We also study how to obtain maximum h-likelihood estimators when the h-likelihood is not explicitly available.
△ Less
Submitted 5 December, 2023; v1 submitted 15 October, 2023;
originally announced October 2023.
-
Finite size corrections for real eigenvalues of the elliptic Ginibre matrices
Authors:
Sung-Soo Byun,
Yong-Woo Lee
Abstract:
We consider the elliptic Ginibre matrices in the orthogonal symmetry class that interpolates between the real Ginibre ensemble and the Gaussian orthogonal ensemble. We obtain the finite size corrections of the real eigenvalue densities in both the global and edge scaling regimes, as well as in both the strong and weak non-Hermiticity regimes. Our results extend and provide the rate of convergence…
▽ More
We consider the elliptic Ginibre matrices in the orthogonal symmetry class that interpolates between the real Ginibre ensemble and the Gaussian orthogonal ensemble. We obtain the finite size corrections of the real eigenvalue densities in both the global and edge scaling regimes, as well as in both the strong and weak non-Hermiticity regimes. Our results extend and provide the rate of convergence to the previous recent findings in the aforementioned limits. In particular, in the Hermitian limit, our results recover the finite size corrections of the Gaussian orthogonal ensemble established by Forrester, Frankel and Garoni.
△ Less
Submitted 15 October, 2023;
originally announced October 2023.
-
An analysis of the derivative-free loss method for solving PDEs
Authors:
Jihun Han,
Yoonsang Lee
Abstract:
This study analyzes the derivative-free loss method to solve a certain class of elliptic PDEs using neural networks. The derivative-free loss method uses the Feynman-Kac formulation, incorporating stochastic walkers and their corresponding average values. We investigate the effect of the time interval related to the Feynman-Kac formulation and the walker size in the context of computational effici…
▽ More
This study analyzes the derivative-free loss method to solve a certain class of elliptic PDEs using neural networks. The derivative-free loss method uses the Feynman-Kac formulation, incorporating stochastic walkers and their corresponding average values. We investigate the effect of the time interval related to the Feynman-Kac formulation and the walker size in the context of computational efficiency, trainability, and sampling errors. Our analysis shows that the training loss bias is proportional to the time interval and the spatial gradient of the neural network while inversely proportional to the walker size. We also show that the time interval must be sufficiently long to train the network. These analytic results tell that we can choose the walker size as small as possible based on the optimal lower bound of the time interval. We also provide numerical tests supporting our analysis.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Torsion Vanishing for Some Shimura Varieties
Authors:
Linus Hamann,
Si Ying Lee
Abstract:
We generalize the torsion vanishing results of Caraiani-Scholze and Koshikawa. Our results apply to the cohomology of general Shimura varieties $(\mathbf{G},X)$ of PEL type $A$ or $C$, localized at a suitable maximal ideal $\mathfrak{m}$ in the spherical Hecke algebra at primes $p$ such that $\mathbf{G}_{\mathbb{Q}_{p}}$ is a group for which we know the Fargues-Scholze local Langlands corresponden…
▽ More
We generalize the torsion vanishing results of Caraiani-Scholze and Koshikawa. Our results apply to the cohomology of general Shimura varieties $(\mathbf{G},X)$ of PEL type $A$ or $C$, localized at a suitable maximal ideal $\mathfrak{m}$ in the spherical Hecke algebra at primes $p$ such that $\mathbf{G}_{\mathbb{Q}_{p}}$ is a group for which we know the Fargues-Scholze local Langlands correspondence is the semi-simplification of a suitably nice local Langlands correspondence. This is accomplished by combining Koshikawa's technique, the theory of geometric Eisenstein series over the Fargues-Fontaine curve, the work of Santos describing the structure of the fibers of the minimally and toroidally compactified Hodge-Tate period morphism for general PEL type Shimura varieties of type $A$ or $C$, and ideas developed by Zhang on comparing Hecke correspondences on the moduli stack of $G$-bundles with the cohomology of Shimura varieties. In the process, we also establish a description of the generic part of the cohomology that bears resemblance to the work of Xiao-Zhu. Moreover, we also construct a filtration on the compactly supported cohomology that differs from Manotovan's filtration in the case that the Shimura variety is non-compact, allowing us to circumvent some of the circumlocutions taken by Cariani-Scholze. Our method showcases a very general strategy for proving such torsion vanishing results, and should bear even more fruit once the inputs are generalized. Motivated by this, we formulate an even more general torsion vanishing conjecture.
△ Less
Submitted 2 December, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
A Structurally Informed Data Assimilation Approach for Nonlinear Partial Differential Equations
Authors:
Tongtong Li,
Anne Gelb,
Yoonsang Lee
Abstract:
Ensemble transform Kalman filtering (ETKF) data assimilation is often used to combine available observations with numerical simulations to obtain statistically accurate and reliable state representations in dynamical systems. However, it is well known that the commonly used Gaussian distribution assumption introduces biases for state variables that admit discontinuous profiles, which are prevalent…
▽ More
Ensemble transform Kalman filtering (ETKF) data assimilation is often used to combine available observations with numerical simulations to obtain statistically accurate and reliable state representations in dynamical systems. However, it is well known that the commonly used Gaussian distribution assumption introduces biases for state variables that admit discontinuous profiles, which are prevalent in nonlinear partial differential equations. This investigation designs a new structurally informed non-Gaussian prior that exploits statistical information from the simulated state variables. In particular, we construct a new weighting matrix based on the second moment of the gradient information of the state variable to replace the prior covariance matrix used for model/data compromise in the ETKF data assimilation framework. We further adapt our weighting matrix to include information in discontinuity regions via a clustering technique. Our numerical experiments demonstrate that this new approach yields more accurate estimates than those obtained using ETKF on shallow water equations, even when ETKF is enhanced with inflation and localization techniques.
△ Less
Submitted 5 March, 2024; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Damped Euler system with attractive Riesz interaction forces
Authors:
Young-Pil Choi,
**wook Jung,
Yoonjung Lee
Abstract:
We consider the barotropic Euler equations with pairwise attractive Riesz interactions and linear velocity dam** in the periodic domain. We establish the global-in-time well-posedness theory for the system near an equilibrium state. We also analyze the large-time behavior of solutions showing the exponential rate of convergence toward the equilibrium state as time goes to infinity.
We consider the barotropic Euler equations with pairwise attractive Riesz interactions and linear velocity dam** in the periodic domain. We establish the global-in-time well-posedness theory for the system near an equilibrium state. We also analyze the large-time behavior of solutions showing the exponential rate of convergence toward the equilibrium state as time goes to infinity.
△ Less
Submitted 31 August, 2023;
originally announced September 2023.
-
Local times of anisotropic Gaussian random fields and stochastic heat equation
Authors:
Cheuk Yin Lee,
Yimin Xiao
Abstract:
We study the local times of a large class of Gaussian random fields satisfying strong local nondeterminism with respect to an anisotropic metric. We establish moment estimates and Hölder conditions for the local times of the Gaussian random fields. Our key estimates rely on geometric properties of Voronoi partitions with respect to an anisotropic metric and the use of Besicovitch's covering theore…
▽ More
We study the local times of a large class of Gaussian random fields satisfying strong local nondeterminism with respect to an anisotropic metric. We establish moment estimates and Hölder conditions for the local times of the Gaussian random fields. Our key estimates rely on geometric properties of Voronoi partitions with respect to an anisotropic metric and the use of Besicovitch's covering theorem. As a consequence, we deduce sample path properties of the Gaussian random fields that are related to Chung's law of the iterated logarithm and modulus of non-differentiability. Moreover, we apply our results to systems of stochastic heat equations with additive Gaussian noise and determine the exact Hausdorff measure function with respect to the parabolic metric for the level sets of the solutions.
△ Less
Submitted 30 October, 2023; v1 submitted 25 August, 2023;
originally announced August 2023.
-
Derived $p$-adic heights and the leading coefficient of the Bertolini--Darmon--Prasanna $p$-adic $L$-function
Authors:
Francesc Castella,
Chi-Yun Hsu,
Debanjana Kundu,
Yu-Shen Lee,
Zheng Liu
Abstract:
Let $E/\mathbf{Q}$ be an elliptic curve and let $p$ be an odd prime of good reduction for $E$. Let $K$ be an imaginary quadratic field satisfying the classical Heegner hypothesis and in which $p$ splits. In a previous work, Agboola--Castella formulated an analogue of the Birch--Swinnerton-Dyer conjecture for the $p$-adic $L$-function $L_{\mathfrak{p}}^{\rm BDP}$ of Bertolini--Darmon--Prasanna atta…
▽ More
Let $E/\mathbf{Q}$ be an elliptic curve and let $p$ be an odd prime of good reduction for $E$. Let $K$ be an imaginary quadratic field satisfying the classical Heegner hypothesis and in which $p$ splits. In a previous work, Agboola--Castella formulated an analogue of the Birch--Swinnerton-Dyer conjecture for the $p$-adic $L$-function $L_{\mathfrak{p}}^{\rm BDP}$ of Bertolini--Darmon--Prasanna attached to $E/K$, assuming the prime $p$ to be ordinary for $E$. The goal of this paper is two-fold:
(1) We formulate a $p$-adic BSD conjecture for $L_{\mathfrak{p}}^{\rm BDP}$ for all odd primes $p$ of good reduction.
(2) For an algebraic analogue $F_{\overline{\mathfrak{p}}}^{\rm BDP}$ of $L_{\mathfrak{p}}^{\rm BDP}$, we show that the ``leading coefficient'' part of our conjecture holds, and that the ``order of vanishing'' part follows from the expected ``maximal non-degeneracy'' of an anticyclotomic $p$-adic height.
In particular, when the Iwasawa--Greenberg Main Conjecture $(F_{\overline{\mathfrak{p}}}^{\rm BDP})=(L_{\mathfrak{p}}^{\rm BDP})$ is known, our results determine the leading coefficient of $L_{\mathfrak{p}}^{\rm BDP}$ at $T=0$ up to a $p$-adic unit. Moreover, by adapting the approach of Burungale--Castella--Kim in the $p$-ordinary case, we prove the main conjecture for supersingular primes $p$ under mild hypotheses.
△ Less
Submitted 21 August, 2023;
originally announced August 2023.
-
Finite Element Operator Network for Solving Parametric PDEs
Authors:
Jae Yong Lee,
Seungchan Ko,
Youngjoon Hong
Abstract:
Partial differential equations (PDEs) underlie our understanding and prediction of natural phenomena across numerous fields, including physics, engineering, and finance. However, solving parametric PDEs is a complex task that necessitates efficient numerical methods. In this paper, we propose a novel approach for solving parametric PDEs using a Finite Element Operator Network (FEONet). Our propose…
▽ More
Partial differential equations (PDEs) underlie our understanding and prediction of natural phenomena across numerous fields, including physics, engineering, and finance. However, solving parametric PDEs is a complex task that necessitates efficient numerical methods. In this paper, we propose a novel approach for solving parametric PDEs using a Finite Element Operator Network (FEONet). Our proposed method leverages the power of deep learning in conjunction with traditional numerical methods, specifically the finite element method, to solve parametric PDEs in the absence of any paired input-output training data. We performed various experiments on several benchmark problems and confirmed that our approach has demonstrated excellent performance across various settings and environments, proving its versatility in terms of accuracy, generalization, and computational flexibility. Our FEONet framework shows potential for application in various fields where PDEs play a crucial role in modeling complex domains with diverse boundary conditions and singular behavior. Furthermore, we provide theoretical convergence analysis to support our approach, utilizing finite element approximation in numerical analysis.
△ Less
Submitted 19 December, 2023; v1 submitted 8 August, 2023;
originally announced August 2023.
-
Weighted Hessian estimates in Orlicz spaces for nondivergence elliptic operators with certain potentials
Authors:
Mikyoung Lee,
Yoonjung Lee
Abstract:
We prove interior weighted Hessian estimates in Orlicz spaces for nondivergence type elliptic equations with a lower order term which involves a nonnegative potential satisfying a reverse Hölder type condition.
We prove interior weighted Hessian estimates in Orlicz spaces for nondivergence type elliptic equations with a lower order term which involves a nonnegative potential satisfying a reverse Hölder type condition.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
Weighted inhomogeneous regularization for inverse problems with indirect and incomplete measurement data
Authors:
Bosu Choi,
Jihun Han,
Yoonsang Lee
Abstract:
Regularization promotes well-posedness in solving an inverse problem with incomplete measurement data. The regularization term is typically designed based on a priori characterization of the unknown signal, such as sparsity or smoothness. The standard inhomogeneous regularization incorporates a spatially changing exponent $p$ of the standard $\ell_p$ norm-based regularization to recover a signal w…
▽ More
Regularization promotes well-posedness in solving an inverse problem with incomplete measurement data. The regularization term is typically designed based on a priori characterization of the unknown signal, such as sparsity or smoothness. The standard inhomogeneous regularization incorporates a spatially changing exponent $p$ of the standard $\ell_p$ norm-based regularization to recover a signal whose characteristic varies spatially. This study proposes a weighted inhomogeneous regularization that extends the standard inhomogeneous regularization through new exponent design and weighting using spatially varying weights. The new exponent design avoids misclassification when different characteristics stay close to each other. The weights handle another issue when the region of one characteristic is too small to be recovered effectively by the $\ell_p$ norm-based regularization even after identified correctly. A suite of numerical tests shows the efficacy of the proposed weighted inhomogeneous regularization, including synthetic image experiments and real sea ice recovery from its incomplete wave measurements.
△ Less
Submitted 10 January, 2024; v1 submitted 19 July, 2023;
originally announced July 2023.
-
The BFK type gluing formula of zeta-determinants for the Robin Boundary condition
Authors:
Klaus Kirsten,
Yoonweon Lee
Abstract:
In this paper we discuss the BFK type gluing formula for zeta-determinants of Laplacians with respect to the Robin boundary condition on a compact Riemannian manifold. As a special case, we discuss the gluing formula with respect to the Neumann boundary condition. We also compute the difference of two zeta-determinants with respect to the Robin and Dirichlet boundary conditions. We use this result…
▽ More
In this paper we discuss the BFK type gluing formula for zeta-determinants of Laplacians with respect to the Robin boundary condition on a compact Riemannian manifold. As a special case, we discuss the gluing formula with respect to the Neumann boundary condition. We also compute the difference of two zeta-determinants with respect to the Robin and Dirichlet boundary conditions. We use this result to compute the zeta-determinant of a Laplacian on a cylinder when the Robin boundary condition is imposed, which extends a result in [25]. We also discuss the gluing formula more precisely when the product structure is given near a cutting hypersurface.
△ Less
Submitted 3 July, 2023; v1 submitted 30 June, 2023;
originally announced June 2023.
-
Distribution-free inference with hierarchical data
Authors:
Yonghoon Lee,
Rina Foygel Barber,
Rebecca Willett
Abstract:
This paper studies distribution-free inference in settings where the data set has a hierarchical structure -- for example, groups of observations, or repeated measurements. In such settings, standard notions of exchangeability may not hold. To address this challenge, a hierarchical form of exchangeability is derived, facilitating extensions of distribution-free methods, including conformal predict…
▽ More
This paper studies distribution-free inference in settings where the data set has a hierarchical structure -- for example, groups of observations, or repeated measurements. In such settings, standard notions of exchangeability may not hold. To address this challenge, a hierarchical form of exchangeability is derived, facilitating extensions of distribution-free methods, including conformal prediction and jackknife+. While the standard theoretical guarantee obtained by the conformal prediction framework is a marginal predictive coverage guarantee, in the special case of independent repeated measurements, it is possible to achieve a stronger form of coverage -- the "second-moment coverage" property -- to provide better control of conditional miscoverage rates, and distribution-free prediction sets that achieve this property are constructed. Simulations illustrate that this guarantee indeed leads to uniformly small conditional miscoverage rates. Empirically, this stronger guarantee comes at the cost of a larger width of the prediction set in scenarios where the fitted model is poorly calibrated, but this cost is very mild in cases where the fitted model is accurate.
△ Less
Submitted 2 March, 2024; v1 submitted 10 June, 2023;
originally announced June 2023.
-
Damped nonlinear Schrödinger equation with Stark effect
Authors:
Yi Hu,
Yongki Lee,
Shijun Zheng
Abstract:
We study the $L^2$-critical damped NLS with a Stark potential. We prove that the threshold for global existence and finite time blowup of this equation is given by $\|Q\|_2$, where $Q$ is the unique positive radial solution of $ΔQ + |Q|^{4/d} Q = Q$ in $H^1(\mathbb{R}^d)$. Moreover, in any small neighborhood of $Q$, there exists an initial data $u_0$ above the ground state such that the solution f…
▽ More
We study the $L^2$-critical damped NLS with a Stark potential. We prove that the threshold for global existence and finite time blowup of this equation is given by $\|Q\|_2$, where $Q$ is the unique positive radial solution of $ΔQ + |Q|^{4/d} Q = Q$ in $H^1(\mathbb{R}^d)$. Moreover, in any small neighborhood of $Q$, there exists an initial data $u_0$ above the ground state such that the solution flow admits the log-log blowup speed. This verifies the structural stability for the ``$\log$-$\log$ law'' associated to the NLS mechanism under the perturbation by a dam** term and a Stark potential. The proof of our main theorem is based on the Avron-Herbst formula and the analogous result for the unperturbed damped NLS.
△ Less
Submitted 9 June, 2023;
originally announced June 2023.
-
Polarity of points for systems of nonlinear stochastic heat equations in the critical dimension
Authors:
Cheuk Yin Lee,
Yimin Xiao
Abstract:
Let $u(t, x) = (u_1(t, x), \dots, u_d(t, x))$ be the solution to the systems of nonlinear stochastic heat equations \[ \begin{split} \frac{\partial}{\partial t} u(t, x) &= \frac{\partial^2}{\partial x^2} u(t, x) + σ(u(t, x)) \dot{W}(t, x),\\ u(0, x) &= u_0(x), \end{split} \] where $t \ge 0$, $x \in \mathbb{R}$, $\dot{W}(t, x) = (\dot{W}_1(t, x), \dots, \dot{W}_d(t, x))$ is a vector of $d$ independ…
▽ More
Let $u(t, x) = (u_1(t, x), \dots, u_d(t, x))$ be the solution to the systems of nonlinear stochastic heat equations \[ \begin{split} \frac{\partial}{\partial t} u(t, x) &= \frac{\partial^2}{\partial x^2} u(t, x) + σ(u(t, x)) \dot{W}(t, x),\\ u(0, x) &= u_0(x), \end{split} \] where $t \ge 0$, $x \in \mathbb{R}$, $\dot{W}(t, x) = (\dot{W}_1(t, x), \dots, \dot{W}_d(t, x))$ is a vector of $d$ independent space-time white noises, and $σ: \mathbb{R}^d \to \mathbb{R}^{d\times d}$ is a matrix-valued function. We say that a subset $S$ of $\mathbb{R}^d$ is polar for $\{u(t, x), t \ge 0, x \in \mathbb{R}\}$ if \[ \mathbb{P}\{u(t,x) \in S \text{ for some } t>0 \text{ and } x\in\mathbb{R} \}=0. \] The main result of this paper shows that, in the critical dimension $d=6$, all points in $\mathbb{R}^d$ are polar for $\{u(t, x), t \ge 0, x \in \mathbb{R}\}$. This solves an open problem of Dalang, Khoshnevisan and Nualart (2009, 2013) and Dalang, Mueller and Xiao (2021). We also provide a sufficient condition for a subset $S$ of $\mathbb{R}^d$ to be polar.
△ Less
Submitted 20 August, 2023; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Higher Genus Quantum $K$--theory
Authors:
You-Cheng Chou,
Leo Herr,
Y. -P. Lee
Abstract:
We prove genus $g$ invariants in quantum $K$-theory are determined by genus zero invariants of a smooth stack in the spirit of K.~Costello's result in Gromov--Witten theory.
We prove genus $g$ invariants in quantum $K$-theory are determined by genus zero invariants of a smooth stack in the spirit of K.~Costello's result in Gromov--Witten theory.
△ Less
Submitted 19 May, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Quantum K-invariants and Gopakumar-Vafa invariants II. Calabi-Yau threefolds at genus zero
Authors:
You-Cheng Chou,
Y. -P. Lee
Abstract:
This is the second part of our ongoing project on the relations between Gopakumar-Vafa BPS invariants (GV) and quantum K-theory (QK) on the Calabi--Yau threefolds (CY3). We show that on CY3 a genus zero quantum K-invariant can be written as a linear combination of a finite number of Gopakumar--Vafa invariants with coefficients from an explicit ``multiple cover formula''. Conversely, GV can be dete…
▽ More
This is the second part of our ongoing project on the relations between Gopakumar-Vafa BPS invariants (GV) and quantum K-theory (QK) on the Calabi--Yau threefolds (CY3). We show that on CY3 a genus zero quantum K-invariant can be written as a linear combination of a finite number of Gopakumar--Vafa invariants with coefficients from an explicit ``multiple cover formula''. Conversely, GV can be determined by QK in a similar manner. The technical heart is a proof of a remarkable conjecture by Hans Jockers and Peter Mayr.
This result is consistent with the ``virtual Clemens conjecture'' for the Calabi--Yau threefolds. A heuristic derivation of the relation between QK and GV via the virtual Clemens conjecture and the multiple cover formula is also given.
△ Less
Submitted 24 July, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
Finite flocking time of the nonlinear Cucker--Smale model with Rayleigh friction type using the discrete $p$-Laplacian
Authors:
Jong-Ho Kim,
Young Ju Lee,
Jea-Hyun Park
Abstract:
The study of collective behavior in multi-agent systems has attracted the attention of many researchers due to its wide range of applications. Among them, the Cucker-Smale model was developed to study the phenomenon of flocking, and various types of extended models have been actively proposed and studied in recent decades.
In this study, we address open questions of the Cucker--Smale model with…
▽ More
The study of collective behavior in multi-agent systems has attracted the attention of many researchers due to its wide range of applications. Among them, the Cucker-Smale model was developed to study the phenomenon of flocking, and various types of extended models have been actively proposed and studied in recent decades.
In this study, we address open questions of the Cucker--Smale model with norm-type Rayleigh friction: {\bf (i)} The positivity of the communication weight, {\bf (ii)} The convergence of the norm of the velocities of agents, {\bf (iii)} The direction of the velocities of agents. For problems (i) and (ii), we present the nonlinear Cucker--Smale model with norm-type Rayleigh friction, where the nonlinear Cucker--Smale model is generalized to a nonlinear model by applying a discrete $p$-Laplacian operator. For this model, we present conditions that guarantee that the norm for velocities of agents converges to 0 or a positive value, and we also show that the regular communication weight satisfies the conditions given in this study. In particular, we present a condition for the initial configuration to obtain that the norm of agent velocities converges to only some positive value.
By contrast, problem (iii) is not solved by the norm-type nonlinear model. Thus, we propose a nonlinear Cucker--Smale model with a vector-type Rayleigh friction for problem (iii). In parallel to the first model, we show that the direction of the agents' velocities can be controlled by parameters in the nonlinear Cucker--Smale model with the vector-type Rayleigh friction.
△ Less
Submitted 29 August, 2023; v1 submitted 12 May, 2023;
originally announced May 2023.