Search | arXiv e-print repository

Green Multigrid Network

Authors: Ye Lin, Young Ju Lee, Jiwei Jia

Abstract: GreenLearning networks (GL) directly learn Green's function in physical space, making them an interpretable model for capturing unknown solution operators of partial differential equations (PDEs). For many PDEs, the corresponding Green's function exhibits asymptotic smoothness. In this paper, we propose a framework named Green Multigrid networks (GreenMGNet), an operator learning algorithm designe… ▽ More GreenLearning networks (GL) directly learn Green's function in physical space, making them an interpretable model for capturing unknown solution operators of partial differential equations (PDEs). For many PDEs, the corresponding Green's function exhibits asymptotic smoothness. In this paper, we propose a framework named Green Multigrid networks (GreenMGNet), an operator learning algorithm designed for a class of asymptotically smooth Green's functions. Compared with the pioneering GL, the new framework presents itself with better accuracy and efficiency, thereby achieving a significant improvement. GreenMGNet is composed of two technical novelties. First, Green's function is modeled as a piecewise function to take into account its singular behavior in some parts of the hyperplane. Such piecewise function is then approximated by a neural network with augmented output(AugNN) so that it can capture singularity accurately. Second, the asymptotic smoothness property of Green's function is used to leverage the Multi-Level Multi-Integration (MLMI) algorithm for both the training and inference stages. Several test cases of operator learning are presented to demonstrate the accuracy and effectiveness of the proposed method. On average, GreenMGNet achieves $3.8\%$ to $39.15\%$ accuracy improvement. To match the accuracy level of GL, GreenMGNet requires only about $10\%$ of the full grid data, resulting in a $55.9\%$ and $92.5\%$ reduction in training time and GPU memory cost for one-dimensional test problems, and a $37.7\%$ and $62.5\%$ reduction for two-dimensional test problems. △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2406.18419 [pdf, ps, other]

Solution of some tiling open problems of Propp, Lai, and some related results

Authors: Seok Hyun Byun, Mihai Ciucu, Yi-Lin Lee

Abstract: In this paper, we present a new version of the second author's factorization theorem for perfect matchings of symmetric graphs. We then use our result to solve four open problems of Propp on the enumeration of trimer tilings on the hexagonal lattice. As another application, we obtain a semi-factorization result for the number of lozenge tilings of a large class of hexagonal regions with holes (o… ▽ More In this paper, we present a new version of the second author's factorization theorem for perfect matchings of symmetric graphs. We then use our result to solve four open problems of Propp on the enumeration of trimer tilings on the hexagonal lattice. As another application, we obtain a semi-factorization result for the number of lozenge tilings of a large class of hexagonal regions with holes (obtained by starting with an arbitrary symmetric hexagon with holes, and translating all the holes one unit lattice segment in the same direction). This in turn leads to the solution of two open problems posed by Lai, to an extension of a result due to Fulmek and Krattenthaler, and to exact enumeration formulas for some new families of hexagonal regions with holes. Our result also allows us to find new, simpler proofs (and in one case, a new, simpler form) of some formulas due to Krattenthaler for the number of perfect matchings of Aztec rectangles with unit holes along a lattice diagonal. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: 44 pages, 43 figures

MSC Class: 05A15; 05A19

arXiv:2406.15959 [pdf, other]

A Nonoverlap** Domain Decomposition Method for Extreme Learning Machines: Elliptic Problems

Authors: Chang-Ock Lee, Youngkyu Lee, Byungeun Ryoo

Abstract: Extreme learning machine (ELM) is a methodology for solving partial differential equations (PDEs) using a single hidden layer feed-forward neural network. It presets the weight/bias coefficients in the hidden layer with random values, which remain fixed throughout the computation, and uses a linear least squares method for training the parameters of the output layer of the neural network. It is kn… ▽ More Extreme learning machine (ELM) is a methodology for solving partial differential equations (PDEs) using a single hidden layer feed-forward neural network. It presets the weight/bias coefficients in the hidden layer with random values, which remain fixed throughout the computation, and uses a linear least squares method for training the parameters of the output layer of the neural network. It is known to be much faster than Physics informed neural networks. However, classical ELM is still computationally expensive when a high level of representation is desired in the solution as this requires solving a large least squares system. In this paper, we propose a nonoverlap** domain decomposition method (DDM) for ELMs that not only reduces the training time of ELMs, but is also suitable for parallel computation. In numerical analysis, DDMs have been widely studied to reduce the time to obtain finite element solutions for elliptic PDEs through parallel computation. Among these approaches, nonoverlap** DDMs are attracting the most attention. Motivated by these methods, we introduce local neural networks, which are valid only at corresponding subdomains, and an auxiliary variable at the interface. We construct a system on the variable and the parameters of local neural networks. A Schur complement system on the interface can be derived by eliminating the parameters of the output layer. The auxiliary variable is then directly obtained by solving the reduced system after which the parameters for each local neural network are solved in parallel. A method for initializing the hidden layer parameters suitable for high approximation quality in large systems is also proposed. Numerical results that verify the acceleration performance of the proposed method with respect to the number of subdomains are presented. △ Less

Submitted 22 June, 2024; originally announced June 2024.

Comments: 18 pages, 4 figures, 7 tables

MSC Class: 65N55 (Primary); 35J25; 68T07 (Secondary)

arXiv:2406.12351 [pdf, ps, other]

Hida family of theta lift from U(1) to definite U(2)

Authors: Yu-Sheng Lee

Abstract: Let K/F be a CM extension satisfying the ordinary assumption for an odd prime p. In this article, we construct Hida families that interpolate theta lifts of algebraic Hecke characters to a definite unitary group U(2) defined from skew-Hermitian spaces over K, and show that the Hida family is primitive when the central L-value of the branch character of the family satisfies certain non-vanishing mo… ▽ More Let K/F be a CM extension satisfying the ordinary assumption for an odd prime p. In this article, we construct Hida families that interpolate theta lifts of algebraic Hecke characters to a definite unitary group U(2) defined from skew-Hermitian spaces over K, and show that the Hida family is primitive when the central L-value of the branch character of the family satisfies certain non-vanishing modulo p conditions. △ Less

Submitted 18 June, 2024; originally announced June 2024.

MSC Class: 11F27(Primary) 11F67(Secondary)

arXiv:2406.10997 [pdf, other]

Two-level overlap** additive Schwarz preconditioner for training scientific machine learning applications

Authors: Youngkyu Lee, Alena Kopaničáková, George Em Karniadakis

Abstract: We introduce a novel two-level overlap** additive Schwarz preconditioner for accelerating the training of scientific machine learning applications. The design of the proposed preconditioner is motivated by the nonlinear two-level overlap** additive Schwarz preconditioner. The neural network parameters are decomposed into groups (subdomains) with overlap** regions. In addition, the network's… ▽ More We introduce a novel two-level overlap** additive Schwarz preconditioner for accelerating the training of scientific machine learning applications. The design of the proposed preconditioner is motivated by the nonlinear two-level overlap** additive Schwarz preconditioner. The neural network parameters are decomposed into groups (subdomains) with overlap** regions. In addition, the network's feed-forward structure is indirectly imposed through a novel subdomain-wise synchronization strategy and a coarse-level training step. Through a series of numerical experiments, which consider physics-informed neural networks and operator learning approaches, we demonstrate that the proposed two-level preconditioner significantly speeds up the convergence of the standard (LBFGS) optimizer while also yielding more accurate machine learning models. Moreover, the devised preconditioner is designed to take advantage of model-parallel computations, which can further reduce the training time. △ Less

Submitted 16 June, 2024; originally announced June 2024.

Comments: 24 pages, 9 figures

MSC Class: 90C30; 90C26; 90C06; 65M55; 68T07

arXiv:2406.10920 [pdf, other]

Hamilton-Jacobi Based Policy-Iteration via Deep Operator Learning

Authors: Jae Yong Lee, Yeoneung Kim

Abstract: The framework of deep operator network (DeepONet) has been widely exploited thanks to its capability of solving high dimensional partial differential equations. In this paper, we incorporate DeepONet with a recently developed policy iteration scheme to numerically solve optimal control problems and the corresponding Hamilton--Jacobi--Bellman (HJB) equations. A notable feature of our approach is th… ▽ More The framework of deep operator network (DeepONet) has been widely exploited thanks to its capability of solving high dimensional partial differential equations. In this paper, we incorporate DeepONet with a recently developed policy iteration scheme to numerically solve optimal control problems and the corresponding Hamilton--Jacobi--Bellman (HJB) equations. A notable feature of our approach is that once the neural network is trained, the solution to the optimal control problem and HJB equations with different terminal functions can be inferred quickly thanks to the unique feature of operator learning. Furthermore, a quantitative analysis of the accuracy of the algorithm is carried out via comparison principles of viscosity solutions. The effectiveness of the method is verified with various examples, including 10-dimensional linear quadratic regulator problems (LQRs). △ Less

Submitted 16 June, 2024; originally announced June 2024.

Comments: 24 pages, 5 figures

MSC Class: 68T20; 68U07; 35F21; 49L12; 49L25

arXiv:2405.16894 [pdf, ps, other]

An Unconstrained Formulation of Some Constrained Partial Differential Equations and its Application to Finite Neuron Methods

Authors: Jiwei Jia, Young Ju Lee, Ruitong Shan

Abstract: In this paper, we present a new framework how a PDE with constraints can be formulated into a sequence of PDEs with no constraints, whose solutions are convergent to the solution of the PDE with constraints. This framework is then used to build a novel finite neuron method to solve the 2nd order elliptic equations with the Dirichlet boundary condition. Our algorithm is the first algorithm, proven… ▽ More In this paper, we present a new framework how a PDE with constraints can be formulated into a sequence of PDEs with no constraints, whose solutions are convergent to the solution of the PDE with constraints. This framework is then used to build a novel finite neuron method to solve the 2nd order elliptic equations with the Dirichlet boundary condition. Our algorithm is the first algorithm, proven to lead to shallow neural network solutions with an optimal H1 norm error. We show that a widely used penalized PDE, which imposes the Dirichlet boundary condition weakly can be interpreted as the first element of the sequence of PDEs within our framework. Furthermore, numerically, we show that it may not lead to the solution with the optimal H1 norm error bound in general. On the other hand, we theoretically demonstrate that the second and later elements of a sequence of PDEs can lead to an adequate solution with the optimal H1 norm error bound. A number of sample tests are performed to confirm the effectiveness of the proposed algorithm and the relevant theory. △ Less

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.16117 [pdf, other]

Positivity and Maximum Principle Preserving Discontinuous Galerkin Finite Element Schemes for a Coupled Flow and Transport

Authors: Shihua Gong, Young-Ju Lee, Yukun Li, Yue Yu

Abstract: We introduce a new concept of the locally conservative flux and investigate its relationship with the compatible discretization pioneered by Dawson, Sun and Wheeler [11]. We then demonstrate how the new concept of the locally conservative flux can play a crucial role in obtaining the L2 norm stability of the discontinuous Galerkin finite element scheme for the transport in the coupled system with… ▽ More We introduce a new concept of the locally conservative flux and investigate its relationship with the compatible discretization pioneered by Dawson, Sun and Wheeler [11]. We then demonstrate how the new concept of the locally conservative flux can play a crucial role in obtaining the L2 norm stability of the discontinuous Galerkin finite element scheme for the transport in the coupled system with flow. In particular, the lowest order discontinuous Galerkin finite element for the transport is shown to inherit the positivity and maximum principle when the locally conservative flux is used, which has been elusive for many years in literature. The theoretical results established in this paper are based on the equivalence between Lesaint-Raviart discontinuous Galerkin scheme and Brezzi-Marini-Suli discontinuous Galerkin scheme for the linear hyperbolic system as well as the relationship between the Lesaint-Raviart discontinuous Galerkin scheme and the characteristic method along the streamline. Sample numerical experiments have also been performed to justify our theoretical findings △ Less

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.13455 [pdf, ps, other]

Carleson measures for weighted Bergman--Zygmund spaces

Authors: Hong Rae Cho, Hyungwoon Koo, Young Joo Lee, Atte Pennanen, Jouni Rättyä, Fanglei Wu

Abstract: For $0<p<\infty$, $Ψ:[0,\infty)\to(0,\infty)$ and a finite positive Borel measure $μ$ on the unit disc $\mathbb{D}$, the Lebesgue--Zygmund space $L^p_{μ,Ψ}$ consists of all measurable functions $f$ such that $\lVert f \rVert_{L_{μ, Ψ}^{p}}^p =\int_{\mathbb{D}}|f|^pΨ(|f|)\,dμ< \infty$. For an integrable radial function $ω$ on $\mathbb{D}$, the corresponding weighted Bergman-Zygmund space… ▽ More For $0<p<\infty$, $Ψ:[0,\infty)\to(0,\infty)$ and a finite positive Borel measure $μ$ on the unit disc $\mathbb{D}$, the Lebesgue--Zygmund space $L^p_{μ,Ψ}$ consists of all measurable functions $f$ such that $\lVert f \rVert_{L_{μ, Ψ}^{p}}^p =\int_{\mathbb{D}}|f|^pΨ(|f|)\,dμ< \infty$. For an integrable radial function $ω$ on $\mathbb{D}$, the corresponding weighted Bergman-Zygmund space $A_{ω, Ψ}^{p}$ is the set of all analytic functions in $L_{μ, Ψ}^{p}$ with $dμ=ω\,dA$. The purpose of the paper is to characterize bounded (and compact) embeddings $A_{ω,Ψ}^{p}\subset L_{μ, Φ}^{q}$, when $0<p\le q<\infty$, the functions $Ψ$ and $Φ$ are essential monotonic, and $Ψ,Φ,ω$ satisfy certain doubling properties. The tools developed on the way to the main results are applied to characterize bounded and compact integral operators acting from $A^p_{ω,Ψ}$ to $A^q_{ν,Φ}$, provided $ν$ admits the same doubling property as $ω$. △ Less

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.10527 [pdf, other]

Hawkes Models And Their Applications

Authors: Patrick J. Laub, Young Lee, Philip K. Pollett, Thomas Taimre

Abstract: The Hawkes process is a model for counting the number of arrivals to a system which exhibits the self-exciting property - that one arrival creates a heightened chance of further arrivals in the near future. The model, and its generalizations, have been applied in a plethora of disparate domains, though two particularly developed applications are in seismology and in finance. As the original model… ▽ More The Hawkes process is a model for counting the number of arrivals to a system which exhibits the self-exciting property - that one arrival creates a heightened chance of further arrivals in the near future. The model, and its generalizations, have been applied in a plethora of disparate domains, though two particularly developed applications are in seismology and in finance. As the original model is elegantly simple, generalizations have been proposed which: track marks for each arrival, are multivariate, have a spatial component, are driven by renewal processes, treat time as discrete, and so on. This paper creates a cohesive review of the traditional Hawkes model and the modern generalizations, providing details on their construction, simulation algorithms, and giving key references to the appropriate literature for a detailed treatment. △ Less

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2405.08424 [pdf, other]

Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More

Authors: Fanchen Bu, Hyeonsoo Jo, Soo Yong Lee, Sungsoo Ahn, Kijung Shin

Abstract: Combinatorial optimization (CO) is naturally discrete, making machine learning based on differentiable optimization inapplicable. Karalias & Loukas (2020) adapted the probabilistic method to incorporate CO into differentiable optimization. Their work ignited the research on unsupervised learning for CO, composed of two main components: probabilistic objectives and derandomization. However, each co… ▽ More Combinatorial optimization (CO) is naturally discrete, making machine learning based on differentiable optimization inapplicable. Karalias & Loukas (2020) adapted the probabilistic method to incorporate CO into differentiable optimization. Their work ignited the research on unsupervised learning for CO, composed of two main components: probabilistic objectives and derandomization. However, each component confronts unique challenges. First, deriving objectives under various conditions (e.g., cardinality constraints and minimum) is nontrivial. Second, the derandomization process is underexplored, and the existing derandomization methods are either random sampling or naive rounding. In this work, we aim to tackle prevalent (i.e., commonly involved) conditions in unsupervised CO. First, we concretize the targets for objective construction and derandomization with theoretical justification. Then, for various conditions commonly involved in different CO problems, we derive nontrivial objectives and derandomization to meet the targets. Finally, we apply the derivations to various CO problems. Via extensive experiments on synthetic and real-world graphs, we validate the correctness of our derivations and show our empirical superiority w.r.t. both optimization quality and speed. △ Less

Submitted 23 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

Comments: ICML 2024

arXiv:2404.14562 [pdf, ps, other]

The zeta-determinant of the Dirichlet-to-Neumann operator of the Steklov Problem on forms

Authors: Klaus Kirsten, Yoonweon Lee

Abstract: On a compact Riemannian manifold $M$ with boundary $Y$, we express the log of the zeta-determinant of the Dirichlet-to-Neumann operator acting on $q$-forms on $Y$ as the difference of the log of the zeta-determinant of the Laplacian on $q$-forms on $M$ with absolute boundary conditions and that of the Laplacian with Dirichlet boundary conditions with some additional terms which are expressed by cu… ▽ More On a compact Riemannian manifold $M$ with boundary $Y$, we express the log of the zeta-determinant of the Dirichlet-to-Neumann operator acting on $q$-forms on $Y$ as the difference of the log of the zeta-determinant of the Laplacian on $q$-forms on $M$ with absolute boundary conditions and that of the Laplacian with Dirichlet boundary conditions with some additional terms which are expressed by curvature tensors. When the dimension of $M$ is $2$ or $3$, we compute these terms explicitly. We also discuss the value of the zeta function at zero associated to the Dirichlet-to-Neumann operator by using a conformal rescaling method. As an application, we recover the result of the conformal invariance obtained in C. Guillarmou and L. Guillopé, The determinant of the Dirichlet-to-Neumann map for surfaces with boundary, Int. Math. Res. Not. IMRN 2007, no. 22, Art. ID rnm099, when the dimension of $M$ is $2$. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: 28 pages

MSC Class: 58J20; 14F40

arXiv:2404.09057 [pdf, ps, other]

Off-diagonally symmetric domino tilings of the Aztec diamond of odd order

Authors: Yi-Lin Lee

Abstract: We study the enumeration of off-diagonally symmetric domino tilings of odd-order Aztec diamonds in two directions: (1) with one boundary defect, and (2) with maximally-many zeroes on the diagonal. In the first direction, we prove a symmetry property which states that the numbers of off-diagonally symmetric domino tilings of the Aztec diamond of order $2n-1$ are equal when the boundary defect is at… ▽ More We study the enumeration of off-diagonally symmetric domino tilings of odd-order Aztec diamonds in two directions: (1) with one boundary defect, and (2) with maximally-many zeroes on the diagonal. In the first direction, we prove a symmetry property which states that the numbers of off-diagonally symmetric domino tilings of the Aztec diamond of order $2n-1$ are equal when the boundary defect is at the $k$th position and the $(2n-k)$th position on the boundary, respectively. This symmetry property proves a special case of a recent conjecture by Behrend, Fischer, and Koutschan. In the second direction, a Pfaffian formula is obtained for the number of "nearly" off-diagonally symmetric domino tilings of odd-order Aztec diamonds, where the entries of the Pfaffian satisfy a simple recurrence relation. The numbers of domino tilings mentioned in the above two directions do not seem to have a simple product formula, but we show that these numbers satisfy simple matrix equations in which the entries of the matrix are given by Delannoy numbers. The proof of these results involves the method of non-intersecting lattice paths and a modification of Stembridge's Pfaffian formula for families of non-intersecting lattice paths. Finally, we propose conjectures concerning the log-concavity and asymptotic behavior of the number of off-diagonally symmetric domino tilings of odd-order Aztec diamonds. △ Less

Submitted 21 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

Comments: 24 pages, 6 figures. Comments are very welcome

MSC Class: 05A15; 05B20; 05B45

arXiv:2404.04666 [pdf, ps, other]

An explicit formula for the orbital integrals on the spherical Hecke algebra of $\mathrm{GL}_3$

Authors: Sungmun Cho, Yuchan Lee

Abstract: We provide the explicit formula for orbital integrals associated with elliptic regular semisimple elements in $\mathrm{GL}_n(F) \cap \mathrm{M}_n(\mathfrak{o})$ and associated with arbitrary elements of the spherical Hecke algebra of $\mathrm{GL}_n(F)$ when $n=2, 3$, using results of [CKL]. Here $F$ is a non-Archimedean local field of any characteristic with $\mathfrak{o}$ its ring of integers. We provide the explicit formula for orbital integrals associated with elliptic regular semisimple elements in $\mathrm{GL}_n(F) \cap \mathrm{M}_n(\mathfrak{o})$ and associated with arbitrary elements of the spherical Hecke algebra of $\mathrm{GL}_n(F)$ when $n=2, 3$, using results of [CKL]. Here $F$ is a non-Archimedean local field of any characteristic with $\mathfrak{o}$ its ring of integers. △ Less

Submitted 8 April, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

Comments: 17 pages

MSC Class: 11F72; 11S80

arXiv:2404.00154 [pdf, other]

Sampling error mitigation through spectrum smoothing in ensemble data assimilation

Authors: Bosu Choi, Yoonsang Lee

Abstract: In data assimilation, an ensemble provides a nonintrusive way to evolve a probability density described by a nonlinear prediction model. Although a large ensemble size is required for statistical accuracy, the ensemble size is typically limited to a small number due to the computational cost of running the prediction model, which leads to a sampling error. Several methods, such as localization, ex… ▽ More In data assimilation, an ensemble provides a nonintrusive way to evolve a probability density described by a nonlinear prediction model. Although a large ensemble size is required for statistical accuracy, the ensemble size is typically limited to a small number due to the computational cost of running the prediction model, which leads to a sampling error. Several methods, such as localization, exist to mitigate the sampling error, often requiring problem-dependent fine-tuning and design. This work introduces another sampling error mitigation method using a smoothness constraint in the Fourier space. In particular, this work smoothes out the spectrum of the system to increase the stability and accuracy even under a small ensemble size. The efficacy of the new idea is validated through a suite of stringent test problems, including Lorenz 96 and Kuramoto-Sivashinsky turbulence models. △ Less

Submitted 29 March, 2024; originally announced April 2024.

arXiv:2403.19146 [pdf, ps, other]

Improving the Bit Complexity of Communication for Distributed Convex Optimization

Authors: Mehrdad Ghadiri, Yin Tat Lee, Swati Padmanabhan, William Swartworth, David Woodruff, Guanghao Ye

Abstract: We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank app… ▽ More We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank approximation; for a number of these fundamental problems our bounds are nearly optimal, as proven by our lower bounds. Among our techniques, we use the notion of block leverage scores, which have been relatively unexplored in this context, as well as drop** all but the ``middle" bits in Richardson-style algorithms. We also introduce a new communication problem for accurately approximating inner products and establish a lower bound using the spherical Radon transform. Our lower bound can be used to show the first separation of linear programming and linear systems in the distributed model when the number of constraints is polynomial, addressing an open question in prior work. △ Less

Submitted 28 March, 2024; originally announced March 2024.

Comments: To appear in STOC '24. Abstract shortened to meet the arXiv limits. Comments welcome!

arXiv:2403.11385 [pdf, other]

Stochastic approach for elliptic problems in perforated domains

Authors: Jihun Han, Yoonsang Lee

Abstract: A wide range of applications in science and engineering involve a PDE model in a domain with perforations, such as perforated metals or air filters. Solving such perforated domain problems suffers from computational challenges related to resolving the scale imposed by the geometries of perforations. We propose a neural network-based mesh-free approach for perforated domain problems. The method is… ▽ More A wide range of applications in science and engineering involve a PDE model in a domain with perforations, such as perforated metals or air filters. Solving such perforated domain problems suffers from computational challenges related to resolving the scale imposed by the geometries of perforations. We propose a neural network-based mesh-free approach for perforated domain problems. The method is robust and efficient in capturing various configuration scales, including the averaged macroscopic behavior of the solution that involves a multiscale nature induced by small perforations. The new approach incorporates the derivative-free loss method that uses a stochastic representation or the Feynman-Kac formulation. In particular, we implement the Neumann boundary condition for the derivative-free loss method to handle the interface between the domain and perforations. A suite of stringent numerical tests is provided to support the proposed method's efficacy in handling various perforation scales. △ Less

Submitted 17 March, 2024; originally announced March 2024.

Comments: 18 pages, 6 figures

MSC Class: 65N99; 65C05; 68T07

arXiv:2402.19032 [pdf, ps, other]

Effective Results in The Metric Theory of Quantitative Diophantine Approximation

Authors: Ying Wai Lee, Andrew Scoones

Abstract: Many results related to quantitative problems in the metric theory of Diophantine approximation are asymptotic, such as the number of rational solutions to certain inequalities grows with the same rate almost everywhere modulo an asymptotic error term. The error term incorporates an implicit constant that varies from one point to another. This means that applications of these results does not give… ▽ More Many results related to quantitative problems in the metric theory of Diophantine approximation are asymptotic, such as the number of rational solutions to certain inequalities grows with the same rate almost everywhere modulo an asymptotic error term. The error term incorporates an implicit constant that varies from one point to another. This means that applications of these results does not give concrete bounds when applied to, say a finite sum, or when applied to counting the number of solutions up to a finite point for a given inequality. This paper addresses this problem and makes the tools and their results effective, by making the implicit constant explicit outside of an exceptional subset of Lebesgue measure at most $δ>0$, an arbitrarily small constant chosen in advance. We deduce from this the fully effective results for Schmidt's Theorem, quantitative Koukoulopoulos-Maynard Theorem and quantitative results on $M_{0}$-sets; we also provide effective results regarding statistics of normal numbers and strong law of large numbers. △ Less

Submitted 29 February, 2024; originally announced February 2024.

Comments: 48 pages, 0 figures

arXiv:2402.16613 [pdf, other]

Structure-Preserving Operator Learning: Modeling the Collision Operator of Kinetic Equations

Authors: Jae Yong Lee, Steffen Schotthöfer, Tianbai Xiao, Sebastian Krumscheid, Martin Frank

Abstract: This work explores the application of deep operator learning principles to a problem in statistical physics. Specifically, we consider the linear kinetic equation, consisting of a differential advection operator and an integral collision operator, which is a powerful yet expensive mathematical model for interacting particle systems with ample applications, e.g., in radiation transport. We investig… ▽ More This work explores the application of deep operator learning principles to a problem in statistical physics. Specifically, we consider the linear kinetic equation, consisting of a differential advection operator and an integral collision operator, which is a powerful yet expensive mathematical model for interacting particle systems with ample applications, e.g., in radiation transport. We investigate the capabilities of the Deep Operator network (DeepONet) approach to modelling the high dimensional collision operator of the linear kinetic equation. This integral operator has crucial analytical structures that a surrogate model, e.g., a DeepONet, needs to preserve to enable meaningful physical simulation. We propose several DeepONet modifications to encapsulate essential structural properties of this integral operator in a DeepONet model. To be precise, we adapt the architecture of the trunk-net so the DeepONet has the same collision invariants as the theoretical kinetic collision operator, thus preserving conserved quantities, e.g., mass, of the modeled many-particle system. Further, we propose an entropy-inspired data-sampling method tailored to train the modified DeepONet surrogates without requiring an excessive expensive simulation-based data generation. △ Less

Submitted 26 February, 2024; originally announced February 2024.

Comments: 12 pages, 8 figures

arXiv:2402.11761 [pdf, ps, other]

The number of automorphic representations of $\mathrm{GL}_2$ with exceptional eigenvalues

Authors: Dohoon Choi, Min Lee, Youngmin Lee, Subong Lim

Abstract: We obtain an upper bound for the dimension of the cuspidal automorphic forms for $\mathrm{GL}_2$ over a number field, whose archimedean local representations are not tempered. More precisely, we prove the following result. Let $F$ be a number field and $\mathbb{A}_{F}$ be the ring of adeles of $F$. Let $\mathcal{O}_{F}$ be the ring of integers of $F$. Let $\mathfrak{X}_{F,\mathrm{ex}}$ be the se… ▽ More We obtain an upper bound for the dimension of the cuspidal automorphic forms for $\mathrm{GL}_2$ over a number field, whose archimedean local representations are not tempered. More precisely, we prove the following result. Let $F$ be a number field and $\mathbb{A}_{F}$ be the ring of adeles of $F$. Let $\mathcal{O}_{F}$ be the ring of integers of $F$. Let $\mathfrak{X}_{F,\mathrm{ex}}$ be the set of irreducible cuspidal automorphic representations $π$ of $\mathrm{GL}_2(\mathbb{A}_{F})$ with the trivial central character such that for each archimedean place $v$ of $F$, the local representation of $π$ at $v$ is an unramified principal series and is not tempered. For an ideal $J$ of $\mathcal{O}_{F}$, let $\mathrm{K}_{0}(J)$ be the subgroup of $\mathrm{GL}_2(\mathbb{A}_{F})$ corresponding to $Γ_0(J) \subset \mathrm{SL}_2(\mathcal{O}_F)$. Let $r_1$ be the number of real embeddings of $F$ and $r_2$ be the number of conjugate pairs of complex embeddings of $F$. Using the Arthur-Selberg trace formula, we have \begin{equation*} \sum_{π\in \mathfrak{X}_{F,\mathrm{ex}}} \dim π^{\mathrm{K}_0(J)} \ll_{F} \frac{[\mathrm{SL}_2(\mathcal{O}_{F}) : Γ_0(J)]}{(\log (N_{F/\mathbb{Q}}(J)))^{2r_1+3r_2}} \quad \text{ as } \quad |N_{F/\mathbb{Q}}(J)|\to \infty. \end{equation*} From this result, we obtain the result on an upper bound for the number of Hecke-Maass cusp forms of weight $0$ on $Γ_0(N)$ which do not satisfy the Selberg eigenvalue conjecture. △ Less

Submitted 18 February, 2024; originally announced February 2024.

MSC Class: 11F72 (Primary); 11F12 (Secondary)

arXiv:2402.08512 [pdf, ps, other]

The finitude of tamely ramified pro-$p$ extensions of number fields with cyclic $p$-class groups

Authors: Yoon** Lee, Donghyeok Lim

Abstract: Let $p$ be an odd prime and $F$ be a number field whose $p$-class group is cyclic. Let $F_{\{\mathfrak{q}\}}$ be the maximal pro-$p$ extension of $F$ which is unramified outside a single non-$p$-adic prime ideal $\mathfrak{q}$ of $F$. In this work, we study the finitude of the Galois group $G_{\{\mathfrak{q}\}}(F)$ of $F_{\{\mathfrak{q}\}}$ over $F$. We prove that $G_{\{\mathfrak{q}\}}(F)$ is fini… ▽ More Let $p$ be an odd prime and $F$ be a number field whose $p$-class group is cyclic. Let $F_{\{\mathfrak{q}\}}$ be the maximal pro-$p$ extension of $F$ which is unramified outside a single non-$p$-adic prime ideal $\mathfrak{q}$ of $F$. In this work, we study the finitude of the Galois group $G_{\{\mathfrak{q}\}}(F)$ of $F_{\{\mathfrak{q}\}}$ over $F$. We prove that $G_{\{\mathfrak{q}\}}(F)$ is finite for the majority of $\mathfrak{q}$'s such that the generator rank of $G_{\{\mathfrak{q}\}}(F)$ is two, provided that for $p = 3$, $F$ is not a complex quartic field containing the primitive third roots of unity. △ Less

Submitted 13 February, 2024; originally announced February 2024.

Comments: to appear in Journal of Number Theory

MSC Class: 11R32; 11R37

arXiv:2402.08187 [pdf, other]

Learning time-dependent PDE via graph neural networks and deep operator network for robust accuracy on irregular grids

Authors: Sung Woong Cho, Jae Yong Lee, Hyung Ju Hwang

Abstract: Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions… ▽ More Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions as inputs and outputs, enabling real-time predictions as surrogate models for solution operators. There has also been significant progress in the research on surrogate models based on graph neural networks (GNNs), specifically targeting the dynamics in time-dependent PDEs. In this paper, we propose GraphDeepONet, an autoregressive model based on GNNs, to effectively adapt DeepONet, which is well-known for successful operator learning. GraphDeepONet exhibits robust accuracy in predicting solutions compared to existing GNN-based PDE solver models. It maintains consistent performance even on irregular grids, leveraging the advantages inherited from DeepONet and enabling predictions on arbitrary grids. Additionally, unlike traditional DeepONet and its variants, GraphDeepONet enables time extrapolation for time-dependent PDE solutions. We also provide theoretical analysis of the universal approximation capability of GraphDeepONet in approximating continuous operators across arbitrary time intervals. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: 25 pages, 11 figures

MSC Class: 65D17; 68U07

arXiv:2402.02717 [pdf, other]

Minimal grid diagrams of the prime knots with crossing number 13 and arc index 13

Authors: Hwa Jeong Lee, Yoonsang Lee, Chanmin Lee, Yeseo Park, Hun Kim, Gyo Taek **

Abstract: We give a list of minimal grid diagrams of the 13 crossing prime nonalternating knots which have arc index 13. There are 9,988 prime knots with crossing number 13. Among them 4,878 are alternating and have arc index 15. Among the other nonalternating knots, 49, 399, 1,412 and 3,250 have arc index 10, 11, 12, and 13, respectively. We used the Dowker-Thistlethwaite code of the 3,250 knots provided b… ▽ More We give a list of minimal grid diagrams of the 13 crossing prime nonalternating knots which have arc index 13. There are 9,988 prime knots with crossing number 13. Among them 4,878 are alternating and have arc index 15. Among the other nonalternating knots, 49, 399, 1,412 and 3,250 have arc index 10, 11, 12, and 13, respectively. We used the Dowker-Thistlethwaite code of the 3,250 knots provided by the program Knotscape to generate spanning trees of the corresponding knot diagrams to obtain minimal arc presentations in the form of grid diagrams. △ Less

Submitted 4 February, 2024; originally announced February 2024.

Comments: 57 pages, 5 figures, 1 table and 3250 grid diagrams

MSC Class: 57K10

arXiv:2402.00674 [pdf, ps, other]

The global Cauchy problem for the Euler-Riesz equations

Authors: Young-Pil Choi, **wook Jung, Yoonjung Lee

Abstract: We completely resolve the global Cauchy problem for the multi-dimensional Euler-Riesz equations, where the interaction forcing is given by $\nabla (-Δ)^{-σ/2}ρ$ for some $σ\in (0,2)$. We construct the global-in-time unique solution to the Euler-Riesz system in a $H^s$ Sobolev space under a smallness assumption on the initial density and a dispersive spectral condition on the initial velocity. More… ▽ More We completely resolve the global Cauchy problem for the multi-dimensional Euler-Riesz equations, where the interaction forcing is given by $\nabla (-Δ)^{-σ/2}ρ$ for some $σ\in (0,2)$. We construct the global-in-time unique solution to the Euler-Riesz system in a $H^s$ Sobolev space under a smallness assumption on the initial density and a dispersive spectral condition on the initial velocity. Moreover, we investigate the algebraic time decay of convergences for the constructed solutions. Our results cover the both attractive and repulsive cases as well as the whole regime $σ\in (0,2)$. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: 40 pages

arXiv:2401.16963 [pdf, ps, other]

Sub-Optimal Fast Fourier Series Approximation for Initial Trajectory Design

Authors: Caleb Gunsaulus, Carl De Vries, William Brown, Youngro Lee, Madhusudan Vijayakumar, Ossama Abdelkhalik

Abstract: The Finite Fourier Series (FFS) Shape-Based (SB) trajectory approximation method has been used to rapidly generate initial trajectories that satisfy the dynamics, trajectory boundary conditions, and limitation on maximum thrust acceleration. The FFS SB approach solves a nonlinear programming problem (NLP) in searching for feasible trajectories. This paper extends the development of the FFS SB appr… ▽ More The Finite Fourier Series (FFS) Shape-Based (SB) trajectory approximation method has been used to rapidly generate initial trajectories that satisfy the dynamics, trajectory boundary conditions, and limitation on maximum thrust acceleration. The FFS SB approach solves a nonlinear programming problem (NLP) in searching for feasible trajectories. This paper extends the development of the FFS SB approach to generate sub optimal solutions. Specifically, the objective function of the NLP problem is modified to include also a measure for the time of flight. Numerical results presented in this paper show several solutions that differ from those of the original FFS SB ones. The sub-optimal trajectories generated using a time of flight minimization are shown to be physically feasible trajectories and potential candidates for direct solvers. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 2021 AAS/AIAA Astrodynamics Specialist Conference, Big Sky, Virtual, August 9-11, 2021

arXiv:2401.16717 [pdf, ps, other]

Scattering for the dispersion managed nonlinear Schrödinger equation

Authors: Mi-Ran Choi, Kiyeon Lee, Young-Ran Lee

Abstract: We consider the dispersion managed nonlinear Schrdinger equations with quintic and cubic nonlinearities in one and two dimensions, respectively. We prove the global well-posedness and scattering in $L_x^2$ for small initial data employing the $U^p$ and $V^p$ spaces. We consider the dispersion managed nonlinear Schrdinger equations with quintic and cubic nonlinearities in one and two dimensions, respectively. We prove the global well-posedness and scattering in $L_x^2$ for small initial data employing the $U^p$ and $V^p$ spaces. △ Less

Submitted 29 January, 2024; originally announced January 2024.

Comments: 14 pages

MSC Class: 35Q55; 37K60; 35Q60

arXiv:2401.03378 [pdf, other]

CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations

Authors: Johann Rudi, Youngjun Lee, Aidan H. Chadha, Mohamed Wahib, Klaus Weide, Jared P. O'Neal, Anshu Dubey

Abstract: CG-Kit is a new code generation toolkit that we propose as a solution for portability and maintainability for scientific computing applications. The development of CG-Kit is rooted in the urgent need created by the shifting landscape of high-performance computing platforms and the algorithmic complexities of a particular large-scale multiphysics application: Flash-X. This combination leads to uniq… ▽ More CG-Kit is a new code generation toolkit that we propose as a solution for portability and maintainability for scientific computing applications. The development of CG-Kit is rooted in the urgent need created by the shifting landscape of high-performance computing platforms and the algorithmic complexities of a particular large-scale multiphysics application: Flash-X. This combination leads to unique challenges including handling an existing large code base in Fortran and/or C/C++, subdivision of code into a great variety of units supporting a wide range of physics and numerical methods, different parallelization techniques for distributed- and shared-memory systems and accelerator devices, and heterogeneity of computing platforms requiring coexisting variants of parallel algorithms. The challenges demand that developers determine custom abstractions and granularity for code generation. CG-Kit tackles this with standalone tools that can be combined into highly specific and, we argue, highly effective portability and maintainability tool chains. Here we present the design of our new tools: parametrized source trees, control flow graphs, and recipes. The tools are implemented in Python. Although the tools are agnostic to the programming language of the source code, we focus on C/C++ and Fortran. Code generation experiments demonstrate the generation of variants of parallel algorithms: first, multithreaded variants of the basic AXPY operation (scalar-vector addition and vector-vector multiplication) to introduce the application of CG-Kit tool chains; and second, variants of parallel algorithms within a hydrodynamics solver, called Spark, from Flash-X that operates on block-structured adaptive meshes. In summary, code generated by CG-Kit achieves a reduction by over 60% of the original C/C++/Fortran source code. △ Less

Submitted 6 January, 2024; originally announced January 2024.

Comments: submitted

arXiv:2312.15949 [pdf, other]

HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork

Authors: Jae Yong Lee, Sung Woong Cho, Hyung Ju Hwang

Abstract: Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear map**s between function spaces. However, the DeepONet requires many parameters a… ▽ More Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear map**s between function spaces. However, the DeepONet requires many parameters and has a high computational cost when learning operators, particularly those with complex (discontinuous or non-smooth) target functions. This study proposes HyperDeepONet, which uses the expressive power of the hypernetwork to enable the learning of a complex operator with a smaller set of parameters. The DeepONet and its variant models can be thought of as a method of injecting the input function information into the target function. From this perspective, these models can be viewed as a particular case of HyperDeepONet. We analyze the complexity of DeepONet and conclude that HyperDeepONet needs relatively lower complexity to obtain the desired accuracy for operator learning. HyperDeepONet successfully learned various operators with fewer computational resources compared to other benchmarks. △ Less

Submitted 26 December, 2023; originally announced December 2023.

Comments: 26 pages, 13 figures. Published as a conference paper at Eleventh International Conference on Learning Representations (ICLR 2023)

MSC Class: 65D17; 68U07

arXiv:2312.08612 [pdf, ps, other]

On a Kostant section for the unitary group

Authors: Yuchan Lee

Abstract: For the unitary group defined over the ring of integers in a non Archimedean local field, we give a correction for a Kostant section provided in G.Laumon and B.C. Ngô's paper; Le lemme fondamental pour les groupes unitaires. For the unitary group defined over the ring of integers in a non Archimedean local field, we give a correction for a Kostant section provided in G.Laumon and B.C. Ngô's paper; Le lemme fondamental pour les groupes unitaires. △ Less

Submitted 17 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

arXiv:2311.02905 [pdf, other]

Global existence versus finite time blowup dichotomy for the dispersion managed NLS

Authors: Mi-Ran Choi, Younghun Hong, Young-Ran Lee

Abstract: We consider the Gabitov-Turitsyn equation or the dispersion managed nonlinear Schrödinger equation of a power-type nonlinearity \[ i\partial_t u+ d_\text{av} \partial_x^2u+\int_0^1 e^{-ir\partial_x^2}\big(|e^{ir\partial_x^2}u|^{p-1}e^{ir\partial_x^2}u\big)dr=0 \] and prove the global existence versus finite time blowup dichotomy for the mass-supercritical cases, that is, $p>9$. We consider the Gabitov-Turitsyn equation or the dispersion managed nonlinear Schrödinger equation of a power-type nonlinearity \[ i\partial_t u+ d_\text{av} \partial_x^2u+\int_0^1 e^{-ir\partial_x^2}\big(|e^{ir\partial_x^2}u|^{p-1}e^{ir\partial_x^2}u\big)dr=0 \] and prove the global existence versus finite time blowup dichotomy for the mass-supercritical cases, that is, $p>9$. △ Less

Submitted 25 June, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

Comments: 23pages. 1 figure

arXiv:2310.19763 [pdf, other]

Autoregressive Renaissance in Neural PDE Solvers

Authors: Yolanne Yi Ran Lee

Abstract: Recent developments in the field of neural partial differential equation (PDE) solvers have placed a strong emphasis on neural operators. However, the paper "Message Passing Neural PDE Solver" by Brandstetter et al. published in ICLR 2022 revisits autoregressive models and designs a message passing graph neural network that is comparable with or outperforms both the state-of-the-art Fourier Neural… ▽ More Recent developments in the field of neural partial differential equation (PDE) solvers have placed a strong emphasis on neural operators. However, the paper "Message Passing Neural PDE Solver" by Brandstetter et al. published in ICLR 2022 revisits autoregressive models and designs a message passing graph neural network that is comparable with or outperforms both the state-of-the-art Fourier Neural Operator and traditional classical PDE solvers in its generalization capabilities and performance. This blog post delves into the key contributions of this work, exploring the strategies used to address the common problem of instability in autoregressive models and the design choices of the message passing graph neural network architecture. △ Less

Submitted 30 October, 2023; originally announced October 2023.

Comments: Presented as a workshop poster at ICLR 2023

arXiv:2310.12461 [pdf, other]

Balanced Group Convolution: An Improved Group Convolution Based on Approximability Estimates

Authors: Youngkyu Lee, Jongho Park, Chang-Ock Lee

Abstract: The performance of neural networks has been significantly improved by increasing the number of channels in convolutional layers. However, this increase in performance comes with a higher computational cost, resulting in numerous studies focused on reducing it. One promising approach to address this issue is group convolution, which effectively reduces the computational cost by grou** channels. H… ▽ More The performance of neural networks has been significantly improved by increasing the number of channels in convolutional layers. However, this increase in performance comes with a higher computational cost, resulting in numerous studies focused on reducing it. One promising approach to address this issue is group convolution, which effectively reduces the computational cost by grou** channels. However, to the best of our knowledge, there has been no theoretical analysis on how well the group convolution approximates the standard convolution. In this paper, we mathematically analyze the approximation of the group convolution to the standard convolution with respect to the number of groups. Furthermore, we propose a novel variant of the group convolution called balanced group convolution, which shows a higher approximation with a small additional computational cost. We provide experimental results that validate our theoretical findings and demonstrate the superior performance of the balanced group convolution over other variants of group convolution. △ Less

Submitted 19 October, 2023; originally announced October 2023.

Comments: 26pages, 2 figures

MSC Class: 68W01; 68W40

arXiv:2310.09955 [pdf, other]

On the Statistical Foundations of H-likelihood for Unobserved Random Variables

Authors: Hangbin Lee, Youngjo Lee

Abstract: The maximum likelihood estimation is widely used for statistical inferences. This paper aims to reformulate Lee and Nelder's (1996) h-likelihood, so that the maximum h-likelihood estimator resembles the maximum likelihood estimator of the classical likelihood. We establish the statistical foundations of the new h-likelihood. This extends classical likelihood theories to embrace broader class of st… ▽ More The maximum likelihood estimation is widely used for statistical inferences. This paper aims to reformulate Lee and Nelder's (1996) h-likelihood, so that the maximum h-likelihood estimator resembles the maximum likelihood estimator of the classical likelihood. We establish the statistical foundations of the new h-likelihood. This extends classical likelihood theories to embrace broader class of statistical models with random parameters. Maximization of the h-likelihood yields asymptotically optimal estimators for both fixed and random parameters achieving the generalized Cramér-Rao lower bound, while providing computationally efficient fitting algorithms. Furthermore, we explore asymptotic theory when the consistency of either fixed parameter estimation or random parameter prediction is violated. We also study how to obtain maximum h-likelihood estimators when the h-likelihood is not explicitly available. △ Less

Submitted 5 December, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

arXiv:2310.09823 [pdf, other]

Finite size corrections for real eigenvalues of the elliptic Ginibre matrices

Authors: Sung-Soo Byun, Yong-Woo Lee

Abstract: We consider the elliptic Ginibre matrices in the orthogonal symmetry class that interpolates between the real Ginibre ensemble and the Gaussian orthogonal ensemble. We obtain the finite size corrections of the real eigenvalue densities in both the global and edge scaling regimes, as well as in both the strong and weak non-Hermiticity regimes. Our results extend and provide the rate of convergence… ▽ More We consider the elliptic Ginibre matrices in the orthogonal symmetry class that interpolates between the real Ginibre ensemble and the Gaussian orthogonal ensemble. We obtain the finite size corrections of the real eigenvalue densities in both the global and edge scaling regimes, as well as in both the strong and weak non-Hermiticity regimes. Our results extend and provide the rate of convergence to the previous recent findings in the aforementioned limits. In particular, in the Hermitian limit, our results recover the finite size corrections of the Gaussian orthogonal ensemble established by Forrester, Frankel and Garoni. △ Less

Submitted 15 October, 2023; originally announced October 2023.

Comments: 27 pages, 3 figures

arXiv:2309.16829 [pdf, other]

An analysis of the derivative-free loss method for solving PDEs

Authors: Jihun Han, Yoonsang Lee

Abstract: This study analyzes the derivative-free loss method to solve a certain class of elliptic PDEs using neural networks. The derivative-free loss method uses the Feynman-Kac formulation, incorporating stochastic walkers and their corresponding average values. We investigate the effect of the time interval related to the Feynman-Kac formulation and the walker size in the context of computational effici… ▽ More This study analyzes the derivative-free loss method to solve a certain class of elliptic PDEs using neural networks. The derivative-free loss method uses the Feynman-Kac formulation, incorporating stochastic walkers and their corresponding average values. We investigate the effect of the time interval related to the Feynman-Kac formulation and the walker size in the context of computational efficiency, trainability, and sampling errors. Our analysis shows that the training loss bias is proportional to the time interval and the spatial gradient of the neural network while inversely proportional to the walker size. We also show that the time interval must be sufficiently long to train the network. These analytic results tell that we can choose the walker size as small as possible based on the optimal lower bound of the time interval. We also provide numerical tests supporting our analysis. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: 18 pages, 6 figures

MSC Class: 65N15; 65N75; 65C05; 60G46

arXiv:2309.08705 [pdf, ps, other]

Torsion Vanishing for Some Shimura Varieties

Authors: Linus Hamann, Si Ying Lee

Abstract: We generalize the torsion vanishing results of Caraiani-Scholze and Koshikawa. Our results apply to the cohomology of general Shimura varieties $(\mathbf{G},X)$ of PEL type $A$ or $C$, localized at a suitable maximal ideal $\mathfrak{m}$ in the spherical Hecke algebra at primes $p$ such that $\mathbf{G}_{\mathbb{Q}_{p}}$ is a group for which we know the Fargues-Scholze local Langlands corresponden… ▽ More We generalize the torsion vanishing results of Caraiani-Scholze and Koshikawa. Our results apply to the cohomology of general Shimura varieties $(\mathbf{G},X)$ of PEL type $A$ or $C$, localized at a suitable maximal ideal $\mathfrak{m}$ in the spherical Hecke algebra at primes $p$ such that $\mathbf{G}_{\mathbb{Q}_{p}}$ is a group for which we know the Fargues-Scholze local Langlands correspondence is the semi-simplification of a suitably nice local Langlands correspondence. This is accomplished by combining Koshikawa's technique, the theory of geometric Eisenstein series over the Fargues-Fontaine curve, the work of Santos describing the structure of the fibers of the minimally and toroidally compactified Hodge-Tate period morphism for general PEL type Shimura varieties of type $A$ or $C$, and ideas developed by Zhang on comparing Hecke correspondences on the moduli stack of $G$-bundles with the cohomology of Shimura varieties. In the process, we also establish a description of the generic part of the cohomology that bears resemblance to the work of Xiao-Zhu. Moreover, we also construct a filtration on the compactly supported cohomology that differs from Manotovan's filtration in the case that the Shimura variety is non-compact, allowing us to circumvent some of the circumlocutions taken by Cariani-Scholze. Our method showcases a very general strategy for proving such torsion vanishing results, and should bear even more fruit once the inputs are generalized. Motivated by this, we formulate an even more general torsion vanishing conjecture. △ Less

Submitted 2 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: v2: fixed minor typos and sign error

arXiv:2309.02585 [pdf, other]

A Structurally Informed Data Assimilation Approach for Nonlinear Partial Differential Equations

Authors: Tongtong Li, Anne Gelb, Yoonsang Lee

Abstract: Ensemble transform Kalman filtering (ETKF) data assimilation is often used to combine available observations with numerical simulations to obtain statistically accurate and reliable state representations in dynamical systems. However, it is well known that the commonly used Gaussian distribution assumption introduces biases for state variables that admit discontinuous profiles, which are prevalent… ▽ More Ensemble transform Kalman filtering (ETKF) data assimilation is often used to combine available observations with numerical simulations to obtain statistically accurate and reliable state representations in dynamical systems. However, it is well known that the commonly used Gaussian distribution assumption introduces biases for state variables that admit discontinuous profiles, which are prevalent in nonlinear partial differential equations. This investigation designs a new structurally informed non-Gaussian prior that exploits statistical information from the simulated state variables. In particular, we construct a new weighting matrix based on the second moment of the gradient information of the state variable to replace the prior covariance matrix used for model/data compromise in the ETKF data assimilation framework. We further adapt our weighting matrix to include information in discontinuity regions via a clustering technique. Our numerical experiments demonstrate that this new approach yields more accurate estimates than those obtained using ETKF on shallow water equations, even when ETKF is enhanced with inflation and localization techniques. △ Less

Submitted 5 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

arXiv:2309.00210 [pdf, ps, other]

Damped Euler system with attractive Riesz interaction forces

Authors: Young-Pil Choi, **wook Jung, Yoonjung Lee

Abstract: We consider the barotropic Euler equations with pairwise attractive Riesz interactions and linear velocity dam** in the periodic domain. We establish the global-in-time well-posedness theory for the system near an equilibrium state. We also analyze the large-time behavior of solutions showing the exponential rate of convergence toward the equilibrium state as time goes to infinity. We consider the barotropic Euler equations with pairwise attractive Riesz interactions and linear velocity dam** in the periodic domain. We establish the global-in-time well-posedness theory for the system near an equilibrium state. We also analyze the large-time behavior of solutions showing the exponential rate of convergence toward the equilibrium state as time goes to infinity. △ Less

Submitted 31 August, 2023; originally announced September 2023.

Comments: 24 pages

arXiv:2308.13732 [pdf, ps, other]

Local times of anisotropic Gaussian random fields and stochastic heat equation

Authors: Cheuk Yin Lee, Yimin Xiao

Abstract: We study the local times of a large class of Gaussian random fields satisfying strong local nondeterminism with respect to an anisotropic metric. We establish moment estimates and Hölder conditions for the local times of the Gaussian random fields. Our key estimates rely on geometric properties of Voronoi partitions with respect to an anisotropic metric and the use of Besicovitch's covering theore… ▽ More We study the local times of a large class of Gaussian random fields satisfying strong local nondeterminism with respect to an anisotropic metric. We establish moment estimates and Hölder conditions for the local times of the Gaussian random fields. Our key estimates rely on geometric properties of Voronoi partitions with respect to an anisotropic metric and the use of Besicovitch's covering theorem. As a consequence, we deduce sample path properties of the Gaussian random fields that are related to Chung's law of the iterated logarithm and modulus of non-differentiability. Moreover, we apply our results to systems of stochastic heat equations with additive Gaussian noise and determine the exact Hausdorff measure function with respect to the parabolic metric for the level sets of the solutions. △ Less

Submitted 30 October, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.10474 [pdf, ps, other]

Derived $p$-adic heights and the leading coefficient of the Bertolini--Darmon--Prasanna $p$-adic $L$-function

Authors: Francesc Castella, Chi-Yun Hsu, Debanjana Kundu, Yu-Shen Lee, Zheng Liu

Abstract: Let $E/\mathbf{Q}$ be an elliptic curve and let $p$ be an odd prime of good reduction for $E$. Let $K$ be an imaginary quadratic field satisfying the classical Heegner hypothesis and in which $p$ splits. In a previous work, Agboola--Castella formulated an analogue of the Birch--Swinnerton-Dyer conjecture for the $p$-adic $L$-function $L_{\mathfrak{p}}^{\rm BDP}$ of Bertolini--Darmon--Prasanna atta… ▽ More Let $E/\mathbf{Q}$ be an elliptic curve and let $p$ be an odd prime of good reduction for $E$. Let $K$ be an imaginary quadratic field satisfying the classical Heegner hypothesis and in which $p$ splits. In a previous work, Agboola--Castella formulated an analogue of the Birch--Swinnerton-Dyer conjecture for the $p$-adic $L$-function $L_{\mathfrak{p}}^{\rm BDP}$ of Bertolini--Darmon--Prasanna attached to $E/K$, assuming the prime $p$ to be ordinary for $E$. The goal of this paper is two-fold: (1) We formulate a $p$-adic BSD conjecture for $L_{\mathfrak{p}}^{\rm BDP}$ for all odd primes $p$ of good reduction. (2) For an algebraic analogue $F_{\overline{\mathfrak{p}}}^{\rm BDP}$ of $L_{\mathfrak{p}}^{\rm BDP}$, we show that the ``leading coefficient'' part of our conjecture holds, and that the ``order of vanishing'' part follows from the expected ``maximal non-degeneracy'' of an anticyclotomic $p$-adic height. In particular, when the Iwasawa--Greenberg Main Conjecture $(F_{\overline{\mathfrak{p}}}^{\rm BDP})=(L_{\mathfrak{p}}^{\rm BDP})$ is known, our results determine the leading coefficient of $L_{\mathfrak{p}}^{\rm BDP}$ at $T=0$ up to a $p$-adic unit. Moreover, by adapting the approach of Burungale--Castella--Kim in the $p$-ordinary case, we prove the main conjecture for supersingular primes $p$ under mild hypotheses. △ Less

Submitted 21 August, 2023; originally announced August 2023.

Comments: 34 pages

arXiv:2308.04690 [pdf, other]

Finite Element Operator Network for Solving Parametric PDEs

Authors: Jae Yong Lee, Seungchan Ko, Youngjoon Hong

Abstract: Partial differential equations (PDEs) underlie our understanding and prediction of natural phenomena across numerous fields, including physics, engineering, and finance. However, solving parametric PDEs is a complex task that necessitates efficient numerical methods. In this paper, we propose a novel approach for solving parametric PDEs using a Finite Element Operator Network (FEONet). Our propose… ▽ More Partial differential equations (PDEs) underlie our understanding and prediction of natural phenomena across numerous fields, including physics, engineering, and finance. However, solving parametric PDEs is a complex task that necessitates efficient numerical methods. In this paper, we propose a novel approach for solving parametric PDEs using a Finite Element Operator Network (FEONet). Our proposed method leverages the power of deep learning in conjunction with traditional numerical methods, specifically the finite element method, to solve parametric PDEs in the absence of any paired input-output training data. We performed various experiments on several benchmark problems and confirmed that our approach has demonstrated excellent performance across various settings and environments, proving its versatility in terms of accuracy, generalization, and computational flexibility. Our FEONet framework shows potential for application in various fields where PDEs play a crucial role in modeling complex domains with diverse boundary conditions and singular behavior. Furthermore, we provide theoretical convergence analysis to support our approach, utilizing finite element approximation in numerical analysis. △ Less

Submitted 19 December, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

Comments: 23 pages, 11 figures

MSC Class: 65M60; 65N30; 68T20; 68U07 ACM Class: G.1.8

arXiv:2308.03285 [pdf, ps, other]

Weighted Hessian estimates in Orlicz spaces for nondivergence elliptic operators with certain potentials

Authors: Mikyoung Lee, Yoonjung Lee

Abstract: We prove interior weighted Hessian estimates in Orlicz spaces for nondivergence type elliptic equations with a lower order term which involves a nonnegative potential satisfying a reverse Hölder type condition. We prove interior weighted Hessian estimates in Orlicz spaces for nondivergence type elliptic equations with a lower order term which involves a nonnegative potential satisfying a reverse Hölder type condition. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 18 pages

MSC Class: 35J10 (Primary) 35B65; 46E30 (Secondary)

arXiv:2307.10448 [pdf, other]

Weighted inhomogeneous regularization for inverse problems with indirect and incomplete measurement data

Authors: Bosu Choi, Jihun Han, Yoonsang Lee

Abstract: Regularization promotes well-posedness in solving an inverse problem with incomplete measurement data. The regularization term is typically designed based on a priori characterization of the unknown signal, such as sparsity or smoothness. The standard inhomogeneous regularization incorporates a spatially changing exponent $p$ of the standard $\ell_p$ norm-based regularization to recover a signal w… ▽ More Regularization promotes well-posedness in solving an inverse problem with incomplete measurement data. The regularization term is typically designed based on a priori characterization of the unknown signal, such as sparsity or smoothness. The standard inhomogeneous regularization incorporates a spatially changing exponent $p$ of the standard $\ell_p$ norm-based regularization to recover a signal whose characteristic varies spatially. This study proposes a weighted inhomogeneous regularization that extends the standard inhomogeneous regularization through new exponent design and weighting using spatially varying weights. The new exponent design avoids misclassification when different characteristics stay close to each other. The weights handle another issue when the region of one characteristic is too small to be recovered effectively by the $\ell_p$ norm-based regularization even after identified correctly. A suite of numerical tests shows the efficacy of the proposed weighted inhomogeneous regularization, including synthetic image experiments and real sea ice recovery from its incomplete wave measurements. △ Less

Submitted 10 January, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

MSC Class: 65D18; 65F22; 65K10

arXiv:2306.17572 [pdf, ps, other]

The BFK type gluing formula of zeta-determinants for the Robin Boundary condition

Authors: Klaus Kirsten, Yoonweon Lee

Abstract: In this paper we discuss the BFK type gluing formula for zeta-determinants of Laplacians with respect to the Robin boundary condition on a compact Riemannian manifold. As a special case, we discuss the gluing formula with respect to the Neumann boundary condition. We also compute the difference of two zeta-determinants with respect to the Robin and Dirichlet boundary conditions. We use this result… ▽ More In this paper we discuss the BFK type gluing formula for zeta-determinants of Laplacians with respect to the Robin boundary condition on a compact Riemannian manifold. As a special case, we discuss the gluing formula with respect to the Neumann boundary condition. We also compute the difference of two zeta-determinants with respect to the Robin and Dirichlet boundary conditions. We use this result to compute the zeta-determinant of a Laplacian on a cylinder when the Robin boundary condition is imposed, which extends a result in [25]. We also discuss the gluing formula more precisely when the product structure is given near a cutting hypersurface. △ Less

Submitted 3 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

Comments: 34 pages

MSC Class: 58J20

arXiv:2306.06342 [pdf, other]

Distribution-free inference with hierarchical data

Authors: Yonghoon Lee, Rina Foygel Barber, Rebecca Willett

Abstract: This paper studies distribution-free inference in settings where the data set has a hierarchical structure -- for example, groups of observations, or repeated measurements. In such settings, standard notions of exchangeability may not hold. To address this challenge, a hierarchical form of exchangeability is derived, facilitating extensions of distribution-free methods, including conformal predict… ▽ More This paper studies distribution-free inference in settings where the data set has a hierarchical structure -- for example, groups of observations, or repeated measurements. In such settings, standard notions of exchangeability may not hold. To address this challenge, a hierarchical form of exchangeability is derived, facilitating extensions of distribution-free methods, including conformal prediction and jackknife+. While the standard theoretical guarantee obtained by the conformal prediction framework is a marginal predictive coverage guarantee, in the special case of independent repeated measurements, it is possible to achieve a stronger form of coverage -- the "second-moment coverage" property -- to provide better control of conditional miscoverage rates, and distribution-free prediction sets that achieve this property are constructed. Simulations illustrate that this guarantee indeed leads to uniformly small conditional miscoverage rates. Empirically, this stronger guarantee comes at the cost of a larger width of the prediction set in scenarios where the fitted model is poorly calibrated, but this cost is very mild in cases where the fitted model is accurate. △ Less

Submitted 2 March, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

arXiv:2306.05931 [pdf, ps, other]

Damped nonlinear Schrödinger equation with Stark effect

Authors: Yi Hu, Yongki Lee, Shijun Zheng

Abstract: We study the $L^2$-critical damped NLS with a Stark potential. We prove that the threshold for global existence and finite time blowup of this equation is given by $\|Q\|_2$, where $Q$ is the unique positive radial solution of $ΔQ + |Q|^{4/d} Q = Q$ in $H^1(\mathbb{R}^d)$. Moreover, in any small neighborhood of $Q$, there exists an initial data $u_0$ above the ground state such that the solution f… ▽ More We study the $L^2$-critical damped NLS with a Stark potential. We prove that the threshold for global existence and finite time blowup of this equation is given by $\|Q\|_2$, where $Q$ is the unique positive radial solution of $ΔQ + |Q|^{4/d} Q = Q$ in $H^1(\mathbb{R}^d)$. Moreover, in any small neighborhood of $Q$, there exists an initial data $u_0$ above the ground state such that the solution flow admits the log-log blowup speed. This verifies the structural stability for the ``$\log$-$\log$ law'' associated to the NLS mechanism under the perturbation by a dam** term and a Stark potential. The proof of our main theorem is based on the Avron-Herbst formula and the analogous result for the unperturbed damped NLS. △ Less

Submitted 9 June, 2023; originally announced June 2023.

Comments: 13 pages

MSC Class: 35B40; 35Q55

arXiv:2305.18853

Polarity of points for systems of nonlinear stochastic heat equations in the critical dimension

Authors: Cheuk Yin Lee, Yimin Xiao

Abstract: Let $u(t, x) = (u_1(t, x), \dots, u_d(t, x))$ be the solution to the systems of nonlinear stochastic heat equations \[ \begin{split} \frac{\partial}{\partial t} u(t, x) &= \frac{\partial^2}{\partial x^2} u(t, x) + σ(u(t, x)) \dot{W}(t, x),\\ u(0, x) &= u_0(x), \end{split} \] where $t \ge 0$, $x \in \mathbb{R}$, $\dot{W}(t, x) = (\dot{W}_1(t, x), \dots, \dot{W}_d(t, x))$ is a vector of $d$ independ… ▽ More Let $u(t, x) = (u_1(t, x), \dots, u_d(t, x))$ be the solution to the systems of nonlinear stochastic heat equations \[ \begin{split} \frac{\partial}{\partial t} u(t, x) &= \frac{\partial^2}{\partial x^2} u(t, x) + σ(u(t, x)) \dot{W}(t, x),\\ u(0, x) &= u_0(x), \end{split} \] where $t \ge 0$, $x \in \mathbb{R}$, $\dot{W}(t, x) = (\dot{W}_1(t, x), \dots, \dot{W}_d(t, x))$ is a vector of $d$ independent space-time white noises, and $σ: \mathbb{R}^d \to \mathbb{R}^{d\times d}$ is a matrix-valued function. We say that a subset $S$ of $\mathbb{R}^d$ is polar for $\{u(t, x), t \ge 0, x \in \mathbb{R}\}$ if \[ \mathbb{P}\{u(t,x) \in S \text{ for some } t>0 \text{ and } x\in\mathbb{R} \}=0. \] The main result of this paper shows that, in the critical dimension $d=6$, all points in $\mathbb{R}^d$ are polar for $\{u(t, x), t \ge 0, x \in \mathbb{R}\}$. This solves an open problem of Dalang, Khoshnevisan and Nualart (2009, 2013) and Dalang, Mueller and Xiao (2021). We also provide a sufficient condition for a subset $S$ of $\mathbb{R}^d$ to be polar. △ Less

Submitted 20 August, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

Comments: There is a crucial error in the paper because the formula (4.2) is not true in general. As a result, the decomposition (4.3) is not valid, which leads to a gap in the proof

arXiv:2305.10137 [pdf, ps, other]

Higher Genus Quantum $K$--theory

Authors: You-Cheng Chou, Leo Herr, Y. -P. Lee

Abstract: We prove genus $g$ invariants in quantum $K$-theory are determined by genus zero invariants of a smooth stack in the spirit of K.~Costello's result in Gromov--Witten theory. We prove genus $g$ invariants in quantum $K$-theory are determined by genus zero invariants of a smooth stack in the spirit of K.~Costello's result in Gromov--Witten theory. △ Less

Submitted 19 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

Comments: 36 pages, comments welcome!

arXiv:2305.08480 [pdf, ps, other]

Quantum K-invariants and Gopakumar-Vafa invariants II. Calabi-Yau threefolds at genus zero

Authors: You-Cheng Chou, Y. -P. Lee

Abstract: This is the second part of our ongoing project on the relations between Gopakumar-Vafa BPS invariants (GV) and quantum K-theory (QK) on the Calabi--Yau threefolds (CY3). We show that on CY3 a genus zero quantum K-invariant can be written as a linear combination of a finite number of Gopakumar--Vafa invariants with coefficients from an explicit ``multiple cover formula''. Conversely, GV can be dete… ▽ More This is the second part of our ongoing project on the relations between Gopakumar-Vafa BPS invariants (GV) and quantum K-theory (QK) on the Calabi--Yau threefolds (CY3). We show that on CY3 a genus zero quantum K-invariant can be written as a linear combination of a finite number of Gopakumar--Vafa invariants with coefficients from an explicit ``multiple cover formula''. Conversely, GV can be determined by QK in a similar manner. The technical heart is a proof of a remarkable conjecture by Hans Jockers and Peter Mayr. This result is consistent with the ``virtual Clemens conjecture'' for the Calabi--Yau threefolds. A heuristic derivation of the relation between QK and GV via the virtual Clemens conjecture and the multiple cover formula is also given. △ Less

Submitted 24 July, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

Comments: Comments are welcome

arXiv:2305.07765 [pdf, other]

Finite flocking time of the nonlinear Cucker--Smale model with Rayleigh friction type using the discrete $p$-Laplacian

Authors: Jong-Ho Kim, Young Ju Lee, Jea-Hyun Park

Abstract: The study of collective behavior in multi-agent systems has attracted the attention of many researchers due to its wide range of applications. Among them, the Cucker-Smale model was developed to study the phenomenon of flocking, and various types of extended models have been actively proposed and studied in recent decades. In this study, we address open questions of the Cucker--Smale model with… ▽ More The study of collective behavior in multi-agent systems has attracted the attention of many researchers due to its wide range of applications. Among them, the Cucker-Smale model was developed to study the phenomenon of flocking, and various types of extended models have been actively proposed and studied in recent decades. In this study, we address open questions of the Cucker--Smale model with norm-type Rayleigh friction: {\bf (i)} The positivity of the communication weight, {\bf (ii)} The convergence of the norm of the velocities of agents, {\bf (iii)} The direction of the velocities of agents. For problems (i) and (ii), we present the nonlinear Cucker--Smale model with norm-type Rayleigh friction, where the nonlinear Cucker--Smale model is generalized to a nonlinear model by applying a discrete $p$-Laplacian operator. For this model, we present conditions that guarantee that the norm for velocities of agents converges to 0 or a positive value, and we also show that the regular communication weight satisfies the conditions given in this study. In particular, we present a condition for the initial configuration to obtain that the norm of agent velocities converges to only some positive value. By contrast, problem (iii) is not solved by the norm-type nonlinear model. Thus, we propose a nonlinear Cucker--Smale model with a vector-type Rayleigh friction for problem (iii). In parallel to the first model, we show that the direction of the agents' velocities can be controlled by parameters in the nonlinear Cucker--Smale model with the vector-type Rayleigh friction. △ Less

Submitted 29 August, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

Comments: 26 pages

Showing 1–50 of 442 results for author: Lee, Y