-
Robust dividend policy: Equivalence of Epstein-Zin and Maenhout preferences
Authors:
Kexin Chen,
Kyunghyun Park,
Hoi Ying Wong
Abstract:
In a continuous-time economy, this study formulates the Epstein-Zin (EZ) preference for the discounted dividend (or cash payouts) of stockholders as an EZ singular control utility. We show that such a problem is well-defined and equivalent to the robust dividend policy set by the firm's executive in the sense of Maenhout's ambiguity-averse preference. While the firm's executive announces the expec…
▽ More
In a continuous-time economy, this study formulates the Epstein-Zin (EZ) preference for the discounted dividend (or cash payouts) of stockholders as an EZ singular control utility. We show that such a problem is well-defined and equivalent to the robust dividend policy set by the firm's executive in the sense of Maenhout's ambiguity-averse preference. While the firm's executive announces the expected future earnings in financial reports, they also signal the firm's confidence in the expected earnings through dividend or cash payouts. The robust dividend policy can then be characterized by a Hamilton-Jacobi-Bellman (HJB) variational inequality (VI). By constructing a novel shooting method for the HJB-VI, we theoretically prove that the robust dividend policy is a threshold strategy on the firm's surplus process. Therefore, dividend-caring investors can choose firms that match their preferences by examining stock's dividend policies and financial statements, whereas executives can make use of dividend to signal their confidence, in the form of ambiguity aversion, on realizing the earnings implied by their financial statements.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Zeros of the Brownian Sheet
Authors:
Keming Chen,
Guillaume Woessner
Abstract:
In this work we firstly answer to a question raised by Khoshnevisan in \cite[Open Problem 4]{khoshnevisan2007slices} by proving that almost surely there is no projection of big enough rank changing the Hausdorff dimension of the zeros of the Brownian sheet. Secondly, we prove that almost surely for every projection whose rank isn't matching the aforementioned condition, the projection of the zero…
▽ More
In this work we firstly answer to a question raised by Khoshnevisan in \cite[Open Problem 4]{khoshnevisan2007slices} by proving that almost surely there is no projection of big enough rank changing the Hausdorff dimension of the zeros of the Brownian sheet. Secondly, we prove that almost surely for every projection whose rank isn't matching the aforementioned condition, the projection of the zero set is the entirety of the projective space. Key words: Brownian sheet, zeros set, Hausdorff dimension, orthogonal projection.
△ Less
Submitted 31 May, 2024; v1 submitted 30 May, 2024;
originally announced May 2024.
-
Multi-qubit Lattice Surgery Scheduling
Authors:
Allyson Silva,
Xiangyi Zhang,
Zak Webb,
Mia Kramer,
Chan Woo Yang,
Xiao Liu,
Jessica Lemieux,
Ka-Wai Chen,
Artur Scherer,
Pooya Ronagh
Abstract:
Fault-tolerant quantum computation using two-dimensional topological quantum error correcting codes can benefit from multi-qubit long-range operations. By using simple commutation rules, a quantum circuit can be transpiled into a sequence of solely non-Clifford multi-qubit gates. Prior work on fault-tolerant compilation avoids optimal scheduling of such gates since they reduce the parallelizabilit…
▽ More
Fault-tolerant quantum computation using two-dimensional topological quantum error correcting codes can benefit from multi-qubit long-range operations. By using simple commutation rules, a quantum circuit can be transpiled into a sequence of solely non-Clifford multi-qubit gates. Prior work on fault-tolerant compilation avoids optimal scheduling of such gates since they reduce the parallelizability of the circuit. We observe that the reduced parallelization potential is outweighed by the significant reduction in the number of gates. We therefore devise a method for scheduling multi-qubit lattice surgery using an earliest-available-first policy, solving the associated forest packing problem using a representation of the multi-qubit gates as Steiner trees. Our extensive testing on random and application-inspired circuits demonstrates the method's scalability and performance. We show that the transpilation significantly reduces the circuit length on the set of circuits tested, and that the resulting circuit of multi-qubit gates has a further reduction in the expected circuit execution time compared to serial execution.
△ Less
Submitted 10 June, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Discrete Lehmann representation of three-point functions
Authors:
Dominik Kiese,
Hugo U. R. Strand,
Kun Chen,
Nils Wentzell,
Olivier Parcollet,
Jason Kaye
Abstract:
We present a generalization of the discrete Lehmann representation (DLR) to three-point correlation and vertex functions in imaginary time and Matsubara frequency. The representation takes the form of a linear combination of judiciously chosen exponentials in imaginary time, and products of simple poles in Matsubara frequency, which are universal for a given temperature and energy cutoff. We prese…
▽ More
We present a generalization of the discrete Lehmann representation (DLR) to three-point correlation and vertex functions in imaginary time and Matsubara frequency. The representation takes the form of a linear combination of judiciously chosen exponentials in imaginary time, and products of simple poles in Matsubara frequency, which are universal for a given temperature and energy cutoff. We present a systematic algorithm to generate compact sampling grids, from which the coefficients of such an expansion can be obtained by solving a linear system. We show that the explicit form of the representation can be used to evaluate diagrammatic expressions involving infinite Matsubara sums, such as polarization functions or self-energies, with controllable, high-order accuracy. This collection of techniques establishes a framework through which methods involving three-point objects can be implemented robustly, with a substantially reduced computational cost and memory footprint.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
On the Hodge Structures of Global Smoothings of Normal Crossing Varieties
Authors:
Kuan-Wen Chen
Abstract:
Let $f:X \rightarrow Δ$ be a one-parameter semistable degeneration of $m$-dimensional compact complex manifolds. Assume that each component of the central fiber $X_0$ is Kähler. Then, we provide a criterion for a general fiber to satisfy the $\partial\overline{\partial}$-lemma and a formula to compute the Hodge index on the middle cohomology of the general fiber in terms of the topological conditi…
▽ More
Let $f:X \rightarrow Δ$ be a one-parameter semistable degeneration of $m$-dimensional compact complex manifolds. Assume that each component of the central fiber $X_0$ is Kähler. Then, we provide a criterion for a general fiber to satisfy the $\partial\overline{\partial}$-lemma and a formula to compute the Hodge index on the middle cohomology of the general fiber in terms of the topological conditions/invariants on the central fiber.
We apply our theorem to several examples, including the global smoothing of $m$-fold ODPs, Hashimoto-Sano's non-Kähler Calabi-Yau threefolds, and Sano's non-Kähler Calabi-Yau $m$-folds.
To deal with the last example, we also prove a Lefschetz-type theorem for the cohomology of the fiber product of two Lefschetz fibrations over $\mathbb{P}^1$ with disjoint critical locus.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Optimal Transport for Mixtures of Radial Functions
Authors:
Keyu Chen,
Yunxin Zhang
Abstract:
Recently, a relaxed formulation of optimal transport for Gaussian mixtures has been proposed, which is based on the explicit formulation between Gaussians. The Gaussian distributions can be viewed as special elliptical contoured distributions generated from exponential function. In literature, there are few research about optimal transport between elliptical contoured distributions generated from…
▽ More
Recently, a relaxed formulation of optimal transport for Gaussian mixtures has been proposed, which is based on the explicit formulation between Gaussians. The Gaussian distributions can be viewed as special elliptical contoured distributions generated from exponential function. In literature, there are few research about optimal transport between elliptical contoured distributions generated from different functions. In this paper, we first study optimal transport between radial contoured distributions generated from different functions and show theirWasserstein barycenter is still radial. Then we introduce a relaxed Wasserstein-type distance for mixtures with radial contoured components. We also consider the corresponding barycenter problem and connect it with a multimarginal problem.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Least Squares Inference for Data with Network Dependency
Authors:
**g Lei,
Kehui Chen,
Haeun Moon
Abstract:
We address the inference problem concerning regression coefficients in a classical linear regression model using least squares estimates. The analysis is conducted under circumstances where network dependency exists across units in the sample. Neglecting the dependency among observations may lead to biased estimation of the asymptotic variance and often inflates the Type I error in coefficient inf…
▽ More
We address the inference problem concerning regression coefficients in a classical linear regression model using least squares estimates. The analysis is conducted under circumstances where network dependency exists across units in the sample. Neglecting the dependency among observations may lead to biased estimation of the asymptotic variance and often inflates the Type I error in coefficient inference. In this paper, we first establish a central limit theorem for the ordinary least squares estimate, with a verifiable dependence condition alongside corresponding neighborhood growth conditions. Subsequently, we propose a consistent estimator for the asymptotic variance of the estimated coefficients, which employs a data-driven method to balance the bias-variance trade-off. We find that the optimal tuning depends on the linear hypothesis under consideration and must be chosen adaptively. The presented theory and methods are illustrated and supported by numerical experiments and a data example.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Mallows Product Measure
Authors:
Alexey Bufetov,
Kailun Chen
Abstract:
Q-exchangeable ergodic distributions on the infinite symmetric group were classified by Gnedin-Olshanski (2012). In this paper, we study a specific linear combination of the ergodic measures and call it the Mallows product measure. From a particle system perspective, the Mallows product measure is a reversible stationary blocking measure of the infinite-species ASEP and it is a natural multi-speci…
▽ More
Q-exchangeable ergodic distributions on the infinite symmetric group were classified by Gnedin-Olshanski (2012). In this paper, we study a specific linear combination of the ergodic measures and call it the Mallows product measure. From a particle system perspective, the Mallows product measure is a reversible stationary blocking measure of the infinite-species ASEP and it is a natural multi-species extension of the Bernoulli product blocking measures of the one-species ASEP. Moreover, the Mallows product measure can be viewed as the universal product blocking measure of interacting particle systems coming from random walks on Hecke algebras.
For the random infinite permutation distributed according to the Mallows product measure we have computed the joint distribution of its neighboring displacements, as well as several other observables. The key feature of the obtained formulas is their remarkably simple product structure. We project these formulas to ASEP with finitely many species, which in particular recovers a recent result of Adams-Balazs-Jay, and also to ASEP(q,M).
Our main tools are results of Gnedin-Olshanski about ergodic Mallows measures and shift-invariance symmetries of the stochastic colored six vertex model discovered by Borodin-Gorin-Wheeler and Galashin.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Learning to Optimize: Accelerating Optimal Power Flow via Data-driven Constraint Screening
Authors:
Shourya Bose,
Kejun Chen,
Yu Zhang
Abstract:
This paper introduces a novel data-driven constraint screening approach aimed at accelerating the solution of convexified optimal power flow (OPF) by eliminating constraints that are non-binding at the optimum. Our constraint screening process leverages an input-convex neural network, trained to predict optimal dual variables based on problem parameters. The results demonstrate that, subject to ce…
▽ More
This paper introduces a novel data-driven constraint screening approach aimed at accelerating the solution of convexified optimal power flow (OPF) by eliminating constraints that are non-binding at the optimum. Our constraint screening process leverages an input-convex neural network, trained to predict optimal dual variables based on problem parameters. The results demonstrate that, subject to certain mild conditions on the OPF model, our proposed method guarantees an identical solution to the full OPF but significantly reduces computational time. Extensive simulations conducted on the OPF based on the convexified DistFlow model demonstrate that our method outperforms other constraint screening techniques.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Online Learning Quantum States with the Logarithmic Loss via VB-FTRL
Authors:
Wei-Fu Tseng,
Kai-Chun Chen,
Zi-Hong Xiao,
Yen-Huan Li
Abstract:
Online learning quantum states with the logarithmic loss (LL-OLQS) is a quantum generalization of online portfolio selection, a classic open problem in the field of online learning for over three decades. The problem also emerges in designing randomized optimization algorithms for maximum-likelihood quantum state tomography. Recently, Jezequel et al. (arXiv:2209.13932) proposed the VB-FTRL algorit…
▽ More
Online learning quantum states with the logarithmic loss (LL-OLQS) is a quantum generalization of online portfolio selection, a classic open problem in the field of online learning for over three decades. The problem also emerges in designing randomized optimization algorithms for maximum-likelihood quantum state tomography. Recently, Jezequel et al. (arXiv:2209.13932) proposed the VB-FTRL algorithm, the first nearly regret-optimal algorithm for OPS with moderate computational complexity. In this note, we generalize VB-FTRL for LL-OLQS. Let $d$ denote the dimension and $T$ the number of rounds. The generalized algorithm achieves a regret rate of $O ( d^2 \log ( d + T ) )$ for LL-OLQS. Each iteration of the algorithm consists of solving a semidefinite program that can be implemented in polynomial time by, e.g., cutting-plane methods. For comparison, the best-known regret rate for LL-OLQS is currently $O ( d^2 \log T )$, achieved by the exponential weight method. However, there is no explicit implementation available for the exponential weight method for LL-OLQS. To facilitate the generalization, we introduce the notion of VB-convexity. VB-convexity is a sufficient condition for the logarithmic barrier associated with any function to be convex and is of independent interest.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Pseudo-differential integral autoencoder network for inverse PDE operators
Authors:
Ke Chen,
Jasen Lai,
Chunmei Wang
Abstract:
Partial differential equations (PDEs) play a foundational role in modeling physical phenomena. This study addresses the challenging task of determining variable coefficients within PDEs from measurement data. We introduce a novel neural network, "pseudo-differential IAEnet" (pd-IAEnet), which draws inspiration from pseudo-differential operators. pd-IAEnet achieves significantly enhanced computatio…
▽ More
Partial differential equations (PDEs) play a foundational role in modeling physical phenomena. This study addresses the challenging task of determining variable coefficients within PDEs from measurement data. We introduce a novel neural network, "pseudo-differential IAEnet" (pd-IAEnet), which draws inspiration from pseudo-differential operators. pd-IAEnet achieves significantly enhanced computational speed and accuracy with fewer parameters compared to conventional models. Extensive benchmark evaluations are conducted across a range of inverse problems, including Electrical Impedance Tomography (EIT), optical tomography, and seismic imaging, consistently demonstrating pd-IAEnet's superior accuracy. Notably, pd-IAEnet exhibits robustness in the presence of measurement noise, a critical characteristic for real-world applications. An exceptional feature is its discretization invariance, enabling effective training on data from diverse discretization schemes while maintaining accuracy on different meshes. In summary, pd-IAEnet offers a potent and efficient solution for addressing inverse PDE problems, contributing to improved computational efficiency, robustness, and adaptability to a wide array of data sources.
△ Less
Submitted 15 October, 2023;
originally announced October 2023.
-
Let data talk: data-regularized operator learning theory for inverse problems
Authors:
Ke Chen,
Chunmei Wang,
Haizhao Yang
Abstract:
Regularization plays a pivotal role in integrating prior information into inverse problems. While many deep learning methods have been proposed to solve inverse problems, determining where to apply regularization remains a crucial consideration. Typical methods regularize neural networks via architecture, wherein neural network functions parametrize the parameter of interest or the regularization…
▽ More
Regularization plays a pivotal role in integrating prior information into inverse problems. While many deep learning methods have been proposed to solve inverse problems, determining where to apply regularization remains a crucial consideration. Typical methods regularize neural networks via architecture, wherein neural network functions parametrize the parameter of interest or the regularization term. We introduce a novel approach, denoted as the "data-regularized operator learning" (DaROL) method, designed to address PDE inverse problems. The DaROL method trains a neural network on data, regularized through common techniques such as Tikhonov variational methods and Bayesian inference. The DaROL method offers flexibility across different frameworks, faster inverse problem-solving, and a simpler structure that separates regularization and neural network training. We demonstrate that training a neural network on the regularized data is equivalent to supervised learning for a regularized inverse map. Furthermore, we provide sufficient conditions for the smoothness of such a regularized inverse map and estimate the learning error in terms of neural network size and the number of training samples.
△ Less
Submitted 20 March, 2024; v1 submitted 15 October, 2023;
originally announced October 2023.
-
A short report on preconditioned Anderson acceleration method
Authors:
Kewang Chen,
Ye Ji,
Matthias Möller,
Cornelis Vuik
Abstract:
In this report, we present a versatile and efficient preconditioned Anderson acceleration (PAA) method for fixed-point iterations. The proposed framework offers flexibility in balancing convergence rates (linear, super-linear, or quadratic) and computational costs related to the Jacobian matrix. Our approach recovers various fixed-point iteration techniques, including Picard, Newton, and quasi-New…
▽ More
In this report, we present a versatile and efficient preconditioned Anderson acceleration (PAA) method for fixed-point iterations. The proposed framework offers flexibility in balancing convergence rates (linear, super-linear, or quadratic) and computational costs related to the Jacobian matrix. Our approach recovers various fixed-point iteration techniques, including Picard, Newton, and quasi-Newton iterations. The PAA method can be interpreted as employing Anderson acceleration (AA) as its own preconditioner or as an accelerator for quasi-Newton methods when their convergence is insufficient. Adaptable to a wide range of problems with differing degrees of nonlinearity and complexity, the method achieves improved convergence rates and robustness by incorporating suitable preconditioners. We test multiple preconditioning strategies on various problems and investigate a delayed update strategy for preconditioners to further reduce the computational costs.
△ Less
Submitted 6 October, 2023;
originally announced October 2023.
-
Application Potential of a Hybrid Ground Source Heat Pump Array for the UC Berkeley Campus Business and Law Node Energy System: A Preliminary Study
Authors:
Kecheng Chen,
Kenichi Soga,
Patrick Dobson,
Peter Nico
Abstract:
The current plan divides the UC Berkeley (UCB) campus energy system into five nodes, where the Business and Law node was studied because of an open field site for borehole installation. The Pacific Northwest National Laboratory's Commercial Prototype Building Models were used to estimate heating and cooling load requirements for UCB campus building types by considering model characteristics (for e…
▽ More
The current plan divides the UC Berkeley (UCB) campus energy system into five nodes, where the Business and Law node was studied because of an open field site for borehole installation. The Pacific Northwest National Laboratory's Commercial Prototype Building Models were used to estimate heating and cooling load requirements for UCB campus building types by considering model characteristics (for example, high base load from hospitals, high DHW in hotels) corresponding to the ASHRAE Standard 90.1-2013. Unscaled load profiles were created from the EnergyPlus building energy simulation and scaled with monitored peak load and annual energy use to generate the target node's hourly heating and cooling load profiles. An optimization problem was solved to design a hybrid GSHP system, where the objective function is the lifetime total cost of the system, and the optimization variables are the portion of heating and cooling loads covered by the GSHP system. Modelica models for air source and ground source heat pump systems were built for detailed case studies based on optimization results. In the Modelica model, the demand side is connected to the radiators in the building to transfer heat, and the source side is connected to GSHP, ASHP, or other heating and cooling facilities. The results demonstrate that an appropriate hybrid GSHP system can help reduce both borehole numbers and electricity consumption for the UCB campus site.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
Super-Resolution Surface Reconstruction from Few Low-Resolution Slices
Authors:
Yiyao Zhang,
Ke Chen,
Shang-Hua Yang
Abstract:
In many imaging applications where segmented features (e.g. blood vessels) are further used for other numerical simulations (e.g. finite element analysis), the obtained surfaces do not have fine resolutions suitable for the task. Increasing the resolution of such surfaces becomes crucial. This paper proposes a new variational model for solving this problem, based on an Euler-Elastica-based regular…
▽ More
In many imaging applications where segmented features (e.g. blood vessels) are further used for other numerical simulations (e.g. finite element analysis), the obtained surfaces do not have fine resolutions suitable for the task. Increasing the resolution of such surfaces becomes crucial. This paper proposes a new variational model for solving this problem, based on an Euler-Elastica-based regulariser. Further, we propose and implement two numerical algorithms for solving the model, a projected gradient descent method and the alternating direction method of multipliers. Numerical experiments using real-life examples (including two from outputs of another variational model) have been illustrated for effectiveness. The advantages of the new model are shown through quantitative comparisons by the standard deviation of Gaussian curvatures and mean curvatures from the viewpoint of discrete geometry.
△ Less
Submitted 12 September, 2023; v1 submitted 10 September, 2023;
originally announced September 2023.
-
Thermal analysis of dual-phase-lag model in a two-dimensional plate subjected to a heat source moving along elliptical trajectories
Authors:
Kaiyuan Chen,
Zhicheng Hu
Abstract:
In this paper, we focus on the study of heat transfer behavior for the dual-phase-lag heat conduction model, which describes the evolution of temperature in a two-dimensional rectangular plate caused by the activity of a point heat source moving along elliptical trajectories. At first, Green's function approach is applied to derive the analytical solution of temperature for the given model. Based…
▽ More
In this paper, we focus on the study of heat transfer behavior for the dual-phase-lag heat conduction model, which describes the evolution of temperature in a two-dimensional rectangular plate caused by the activity of a point heat source moving along elliptical trajectories. At first, Green's function approach is applied to derive the analytical solution of temperature for the given model. Based on the series representation of this analytical solution, the thermal responses for the underlying heat transfer problem, including the relations between the moving heat source and the concomitant temperature peak, the influences of the pair of phase lags and the angular velocity of heat source on temperature, are then investigated, analyzed and discussed in detail for three different movement trajectories. Compared with the results revealed for the common situation that the heat source moves in a straight line with a constant speed, the present results show quite distinctive thermal behaviors for all cases, which subsequently can help us to better understand the internal mechanism of the dual-phase-lag heat transfer subjected to a moving heat source with curved trajectory.
△ Less
Submitted 19 August, 2023;
originally announced August 2023.
-
Nonlinear conjugate gradient method for vector optimization on Riemannian manifolds with retraction and vector transport
Authors:
Kangming Chen,
Ellen H. Fukuda,
Hiroyuki Sato
Abstract:
In this paper, we propose nonlinear conjugate gradient methods for vector optimization on Riemannian manifolds. The concepts of Wolfe and Zoutendjik conditions are extended for Riemannian manifolds. Specifically, we establish the existence of intervals of step sizes that satisfy the Wolfe conditions. The convergence analysis covers the vector extensions of the Fletcher--Reeves, conjugate descent,…
▽ More
In this paper, we propose nonlinear conjugate gradient methods for vector optimization on Riemannian manifolds. The concepts of Wolfe and Zoutendjik conditions are extended for Riemannian manifolds. Specifically, we establish the existence of intervals of step sizes that satisfy the Wolfe conditions. The convergence analysis covers the vector extensions of the Fletcher--Reeves, conjugate descent, and Dai--Yuan parameters. Under some assumptions, we prove that the sequence obtained by the algorithm can converge to a Pareto stationary point. Moreover, we also discuss several other choices of the parameter. Numerical experiments illustrating the practical behavior of the methods are presented.
△ Less
Submitted 28 July, 2023;
originally announced July 2023.
-
Fast and high-order approximation of parabolic equations using hierarchical direct solvers and implicit Runge-Kutta methods
Authors:
Ke Chen,
Daniel Appelö,
Tracy Babb,
Per-Gunnar Martinsson
Abstract:
An additive Runge-Kutta method is used for the time step**, which integrates the linear stiff terms by an explicit singly diagonally implicit Runge-Kutta (ESDIRK) method and the nonlinear terms by an explicit Runge-Kutta (ERK) method. In each time step, the implicit solve is performed by the recently developed Hierarchical Poincaré-Steklov (HPS) method. This is a fast direct solver for elliptic…
▽ More
An additive Runge-Kutta method is used for the time step**, which integrates the linear stiff terms by an explicit singly diagonally implicit Runge-Kutta (ESDIRK) method and the nonlinear terms by an explicit Runge-Kutta (ERK) method. In each time step, the implicit solve is performed by the recently developed Hierarchical Poincaré-Steklov (HPS) method. This is a fast direct solver for elliptic equations that decomposes the space domain into a hierarchical tree of subdomains and builds spectral collocation solvers locally on the subdomains. These ideas are naturally combined in the presented method since the singly diagonal coefficient in ESDIRK and a fixed time-step ensures that the coefficient matrix in the implicit solve of HPS remains the same for all time stages. This means that the precomputed inverse can be efficiently reused, leading to a scheme with complexity (in two dimensions) $\mathcal{O}(N^{1.5})$ for the precomputation where the solution operator to the elliptic problems is built, and then $\mathcal{O}(N \log N)$ for the solve in each time step. The stability of the method is proved for first order in time and any order in space, and numerical evidence substantiates a claim of stability for a much broader class of time discretization methods. Numerical experiments supporting the accuracy of efficiency of the method in one and two dimensions are presented.
△ Less
Submitted 7 May, 2024; v1 submitted 4 June, 2023;
originally announced June 2023.
-
Generalized Implicit Follow-The-Regularized-Leader
Authors:
Keyi Chen,
Francesco Orabona
Abstract:
We propose a new class of online learning algorithms, generalized implicit Follow-The-Regularized-Leader (FTRL), that expands the scope of FTRL framework. Generalized implicit FTRL can recover known algorithms, as FTRL with linearized losses and implicit FTRL, and it allows the design of new update rules, as extensions of aProx and Mirror-Prox to FTRL. Our theory is constructive in the sense that…
▽ More
We propose a new class of online learning algorithms, generalized implicit Follow-The-Regularized-Leader (FTRL), that expands the scope of FTRL framework. Generalized implicit FTRL can recover known algorithms, as FTRL with linearized losses and implicit FTRL, and it allows the design of new update rules, as extensions of aProx and Mirror-Prox to FTRL. Our theory is constructive in the sense that it provides a simple unifying framework to design updates that directly improve the worst-case upper bound on the regret. The key idea is substituting the linearization of the losses with a Fenchel-Young inequality. We show the flexibility of the framework by proving that some known algorithms, like the Mirror-Prox updates, are instantiations of the generalized implicit FTRL. Finally, the new framework allows us to recover the temporal variation bound of implicit OMD, with the same computational complexity.
△ Less
Submitted 31 May, 2023;
originally announced June 2023.
-
On LinDistFlow Model Congestion Pricing: Bounding the Changes in Power Tariffs
Authors:
Shourya Bose,
Kejun Chen,
Yu Zhang
Abstract:
The optimal power flow (OPF) problem is an important mathematical program that aims at obtaining the best operating point of an electric power grid. The optimization problem typically minimizes the total generation cost subject to certain physical constraints of the system. The so-called linearized distribution flow (LinDistFlow) model leverages a set of linear equations to approximate the nonline…
▽ More
The optimal power flow (OPF) problem is an important mathematical program that aims at obtaining the best operating point of an electric power grid. The optimization problem typically minimizes the total generation cost subject to certain physical constraints of the system. The so-called linearized distribution flow (LinDistFlow) model leverages a set of linear equations to approximate the nonlinear AC power flows. In this paper, we consider an OPF problem based on the LinDistFlow model for a single-phase radial power network. We derive closed-form solutions to the marginal values of both real and reactive power demands. We also derive upper bounds on the congestion price (a.k.a. `shadow price'), which denotes the change in marginal demand prices when the apparent power flow limits of certain lines are binding at optimum. Various cases of our result are discussed while simulations are carried out on a $141$-bus radial power network.
△ Less
Submitted 30 April, 2023;
originally announced May 2023.
-
Some Symmetry and Duality Theorems on Multiple Zeta(-star) Values
Authors:
Kwang-Wu Chen,
Minking Eie,
Yao Lin Ong
Abstract:
In this paper, we provide a symmetric formula and a duality formula relating multiple zeta values and zeta-star values. Leveraging Zagier's formula for computing $ζ^\star(\{2\}^p,3,\{2\}^q)$, we employ our theorems to establish a formula for computing $ζ^\star(\{2\}^p,1,\{2\}^q)$ for any positive integers $p$ and $q$, along with other formulas of interest.
In this paper, we provide a symmetric formula and a duality formula relating multiple zeta values and zeta-star values. Leveraging Zagier's formula for computing $ζ^\star(\{2\}^p,3,\{2\}^q)$, we employ our theorems to establish a formula for computing $ζ^\star(\{2\}^p,1,\{2\}^q)$ for any positive integers $p$ and $q$, along with other formulas of interest.
△ Less
Submitted 18 April, 2023;
originally announced April 2023.
-
Stability and chaos of the duopoly model of Kopel: A study based on symbolic computations
Authors:
Xiaoliang Li,
Kongyan Chen,
Wei Niu,
Bo Huang
Abstract:
Since Kopel's duopoly model was proposed about three decades ago, there are almost no analytical results on the equilibria and their stability in the asymmetric case. The first objective of our study is to fill this gap. This paper analyzes the asymmetric duopoly model of Kopel analytically by using several tools based on symbolic computations. We discuss the possibility of the existence of multip…
▽ More
Since Kopel's duopoly model was proposed about three decades ago, there are almost no analytical results on the equilibria and their stability in the asymmetric case. The first objective of our study is to fill this gap. This paper analyzes the asymmetric duopoly model of Kopel analytically by using several tools based on symbolic computations. We discuss the possibility of the existence of multiple positive equilibria and establish necessary and sufficient conditions for a given number of positive equilibria to exist. The possible positions of the equilibria in Kopel's model are also explored. Furthermore, in the asymmetric model of Kopel, if the duopolists adopt the best response reactions or homogeneous adaptive expectations, we establish rigorous conditions for the local stability of equilibria for the first time. The occurrence of chaos in Kopel's model seems to be supported by observations through numerical simulations, which, however, is challenging to prove rigorously. The second objective is to prove the existence of snapback repellers in Kopel's map, which implies the existence of chaos in the sense of Li-Yorke according to Marotto's theorem.
△ Less
Submitted 28 May, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
Computing one-bit compressive sensing via zero-norm regularized DC loss model and its surrogate
Authors:
Kai Chen,
Ling Liang,
Shaohua Pan
Abstract:
One-bit compressed sensing is very popular in signal processing and communications due to its low storage costs and low hardware complexity, but it is a challenging task to recover the signal by using the one-bit information. In this paper, we propose a zero-norm regularized smooth difference of convexity (DC) loss model and derive a family of equivalent nonconvex surrogates covering the MCP and S…
▽ More
One-bit compressed sensing is very popular in signal processing and communications due to its low storage costs and low hardware complexity, but it is a challenging task to recover the signal by using the one-bit information. In this paper, we propose a zero-norm regularized smooth difference of convexity (DC) loss model and derive a family of equivalent nonconvex surrogates covering the MCP and SCAD surrogates as special cases. Compared to the existing models, the new model and its SCAD surrogate have better robustness. To compute their $τ$-stationary points, we develop a proximal gradient algorithm with extrapolation and establish the convergence of the whole iterate sequence. Also, the convergence is proved to have a linear rate under a mild condition by studying the KL property of exponent 0 of the models. Numerical comparisons with several state-of-art methods show that in terms of the quality of solution, the proposed model and its SCAD surrogate are remarkably superior to the $\ell_p$-norm regularized models, and are comparable even superior to those sparsity constrained models with the true sparsity and the sign flip ratio as inputs.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Toward finiteness of central configurations for the planar six-body problem by symbolic computations
Authors:
Ke-Ming Chang,
Kuo-Chang Chen
Abstract:
In this paper we develop symbolic computation algorithms to investigate finiteness of central configurations for the planar $n$-body problem. Our approach is based on Albouy-Kaloshin's work on finiteness of central configurations for the 5-body problems. In their paper, bicolored graphs called $zw$-diagrams were introduced for possible scenarios when the finiteness conjecture fails, and proving fi…
▽ More
In this paper we develop symbolic computation algorithms to investigate finiteness of central configurations for the planar $n$-body problem. Our approach is based on Albouy-Kaloshin's work on finiteness of central configurations for the 5-body problems. In their paper, bicolored graphs called $zw$-diagrams were introduced for possible scenarios when the finiteness conjecture fails, and proving finiteness amounts to exclusions of central configurations associated to these diagrams. Following their method, the amount of computations becomes enormous when there are more than five bodies. Here we introduce matrix algebra for determination of both diagrams and asymptotic orders, devise several criteria to reduce computational complexity, and verify finiteness mostly through automated deductions. For the planar six-body problem, our first algorithm effectively narrows the proof for finiteness down to 117 $zw$-diagrams, the second algorithm eliminates 31 of them, the last algorithm eliminates 62 other diagrams except for masses in some co-dimension 2 variety in the mass space, and leaving 24 cases unsolved.
△ Less
Submitted 5 March, 2023;
originally announced March 2023.
-
Algorithmic Randomness and Probabilistic Laws
Authors:
Jeffrey A. Barrett,
Eddy Keming Chen
Abstract:
We consider two ways one might use algorithmic randomness to characterize a probabilistic law. The first is a generative chance* law. Such laws involve a nonstandard notion of chance. The second is a probabilistic* constraining law. Such laws impose relative frequency and randomness constraints that every physically possible world must satisfy. While each notion has virtues, we argue that the latt…
▽ More
We consider two ways one might use algorithmic randomness to characterize a probabilistic law. The first is a generative chance* law. Such laws involve a nonstandard notion of chance. The second is a probabilistic* constraining law. Such laws impose relative frequency and randomness constraints that every physically possible world must satisfy. While each notion has virtues, we argue that the latter has advantages over the former. It supports a unified governing account of non-Humean laws and provides independently motivated solutions to issues in the Humean best-system account. On both notions, we have a much tighter connection between probabilistic laws and their corresponding sets of possible worlds. Certain histories permitted by traditional probabilistic laws are ruled out as physically impossible. As a result, such laws avoid one variety of empirical underdetermination, but the approach reveals other varieties of underdetermination that are typically overlooked.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
Bohr-type inequalities for unimodular bounded analytic functions
Authors:
Kaixin Chen,
Ming-Sheng Liu,
Saminathan Ponnusamy
Abstract:
In this paper, we establish several new versions of Bohr-type inequalities for bounded analytic functions in the unit disk by allowing $\varphi=\{\varphi_n(r)\}^{\infty}_{n=0}$ in place of the $\{r^n\}^{\infty}_{n=0}$ in the power series representations of the functions involved with the Bohr sum and thereby introducing a single parameter, which generalize several related results of earlier author…
▽ More
In this paper, we establish several new versions of Bohr-type inequalities for bounded analytic functions in the unit disk by allowing $\varphi=\{\varphi_n(r)\}^{\infty}_{n=0}$ in place of the $\{r^n\}^{\infty}_{n=0}$ in the power series representations of the functions involved with the Bohr sum and thereby introducing a single parameter, which generalize several related results of earlier authors.
△ Less
Submitted 15 February, 2023;
originally announced February 2023.
-
Behavioural predictors of math anxiety
Authors:
M. Y. K. Chen,
A. Jamaludin,
A. L. Tan
Abstract:
Math anxiety is a highly prevalent problem in education that has consistently shown to lead to poor math performance. This study sought to investigate whether certain behaviours are predictive of math anxiety among students. This study involved elementary school students who were low-progressing in math, and is part of an educational intervention program. Ten classifications types of behavioural i…
▽ More
Math anxiety is a highly prevalent problem in education that has consistently shown to lead to poor math performance. This study sought to investigate whether certain behaviours are predictive of math anxiety among students. This study involved elementary school students who were low-progressing in math, and is part of an educational intervention program. Ten classifications types of behavioural indicators were identified, such as counting out loud. A multiple linear regression was conducted, identifying three behavioural observations that were positively and significantly associated with their math anxiety. Implications and limitations are discussed.
△ Less
Submitted 29 January, 2023;
originally announced January 2023.
-
Equilibria and their stability in an asymmetric duopoly model of Kopel
Authors:
Xiaoliang Li,
Kongyan Chen
Abstract:
In this paper, we investigate the equilibria and their stability in an asymmetric duopoly model of Kopel by using several tools based on symbolic computations. We explore the possible positions of the equilibria in Kopel's model. We discuss the possibility of the existence of multiple positive equilibria and establish a necessary and sufficient condition for a given number of equilibria to exist.…
▽ More
In this paper, we investigate the equilibria and their stability in an asymmetric duopoly model of Kopel by using several tools based on symbolic computations. We explore the possible positions of the equilibria in Kopel's model. We discuss the possibility of the existence of multiple positive equilibria and establish a necessary and sufficient condition for a given number of equilibria to exist. Furthermore, if the two duopolists adopt the best response reactions or homogeneous adaptive expectations, we establish rigorous conditions for the existence of distinct numbers of positive equilibria for the first time.
△ Less
Submitted 29 January, 2023;
originally announced January 2023.
-
A Generalization of Bell Polynomials and Multinomial Expansions via Permutations on Partitions, by Perturbation expansions of Functional Determinants
Authors:
Kui-Yo Chen,
Zhong-Tang Wu
Abstract:
We give an exact coefficients formula of any infinite product of power series with constant term equal to $1$, by using structures from partitions of integers and permutation groups. This is an universal theorem for various of Binomial-type theorems in many sense. In particular, we give the new formulas as the double counting of Bell polynomial, Binomial Theorem and Multinomial Theorem.
We give an exact coefficients formula of any infinite product of power series with constant term equal to $1$, by using structures from partitions of integers and permutation groups. This is an universal theorem for various of Binomial-type theorems in many sense. In particular, we give the new formulas as the double counting of Bell polynomial, Binomial Theorem and Multinomial Theorem.
△ Less
Submitted 9 July, 2023; v1 submitted 17 January, 2023;
originally announced January 2023.
-
Transfer Learning with Large-Scale Quantile Regression
Authors:
Jun **,
Jun Yan,
Robert H. Aseltine,
Kun Chen
Abstract:
Quantile regression is increasingly encountered in modern big data applications due to its robustness and flexibility. We consider the scenario of learning the conditional quantiles of a specific target population when the available data may go beyond the target and be supplemented from other sources that possibly share similarities with the target. A crucial question is how to properly distinguis…
▽ More
Quantile regression is increasingly encountered in modern big data applications due to its robustness and flexibility. We consider the scenario of learning the conditional quantiles of a specific target population when the available data may go beyond the target and be supplemented from other sources that possibly share similarities with the target. A crucial question is how to properly distinguish and utilize useful information from other sources to improve the quantile estimation and inference at the target. We develop transfer learning methods for high-dimensional quantile regression by detecting informative sources whose models are similar to the target and utilizing them to improve the target model. We show that under reasonable conditions, the detection of the informative sources based on sample splitting is consistent. Compared to the naive estimator with only the target data, the transfer learning estimator achieves a much lower error rate as a function of the sample sizes, the signal-to-noise ratios, and the similarity measures among the target and the source models. Extensive simulation studies demonstrate the superiority of our proposed approach. We apply our methods to tackle the problem of detecting hard-landing risk for flight safety and show the benefits and insights gained from transfer learning of three different types of airplanes: Boeing 737, Airbus A320, and Airbus A380.
△ Less
Submitted 25 February, 2024; v1 submitted 13 December, 2022;
originally announced December 2022.
-
Duality in optimal consumption--investment problems with alternative data
Authors:
Kexin Chen,
Hoi Ying Wong
Abstract:
This study investigates an optimal consumption--investment problem in which the unobserved stock trend is modulated by a hidden Markov chain that represents different economic regimes. In the classical approach, the hidden state is estimated from historical asset prices, but recent advancements in technology enable investors to consider alternative data in their decision-making. These include soci…
▽ More
This study investigates an optimal consumption--investment problem in which the unobserved stock trend is modulated by a hidden Markov chain that represents different economic regimes. In the classical approach, the hidden state is estimated from historical asset prices, but recent advancements in technology enable investors to consider alternative data in their decision-making. These include social media commentary, expert opinions, COVID-19 pandemic data, and GPS data, which originate outside of the standard sources of market data but are considered useful for predicting stock trends. We develop a novel duality theory for this problem and consider a jump-diffusion process for the alternative data series. This theory helps investors in identifying ``useful'' alternative data for dynamic decision-making by offering conditions to the filter equation that permit the use of a control approach based on the dynamic programming principle. We demonstrate an application for proving a unique smooth solution for a constant relative risk-averse agent once the distributions of the signals generated from alternative data satisfy a bounded likelihood ratio condition. In doing so, we obtain an explicit consumption--investment strategy that takes advantage of different types of alternative data that have not been addressed in the literature.
△ Less
Submitted 18 July, 2023; v1 submitted 15 October, 2022;
originally announced October 2022.
-
Asymptotic Statistical Analysis of $f$-divergence GAN
Authors:
Xinwei Shen,
Kani Chen,
Tong Zhang
Abstract:
Generative Adversarial Networks (GANs) have achieved great success in data generation. However, its statistical properties are not fully understood. In this paper, we consider the statistical behavior of the general $f$-divergence formulation of GAN, which includes the Kullback--Leibler divergence that is closely related to the maximum likelihood principle. We show that for parametric generative m…
▽ More
Generative Adversarial Networks (GANs) have achieved great success in data generation. However, its statistical properties are not fully understood. In this paper, we consider the statistical behavior of the general $f$-divergence formulation of GAN, which includes the Kullback--Leibler divergence that is closely related to the maximum likelihood principle. We show that for parametric generative models that are correctly specified, all $f$-divergence GANs with the same discriminator classes are asymptotically equivalent under suitable regularity conditions. Moreover, with an appropriately chosen local discriminator, they become equivalent to the maximum likelihood estimate asymptotically. For generative models that are misspecified, GANs with different $f$-divergences {converge to different estimators}, and thus cannot be directly compared. However, it is shown that for some commonly used $f$-divergences, the original $f$-GAN is not optimal in that one can achieve a smaller asymptotic variance when the discriminator training in the original $f$-GAN formulation is replaced by logistic regression. The resulting estimation method is referred to as Adversarial Gradient Estimation (AGE). Empirical studies are provided to support the theory and to demonstrate the advantage of AGE over the original $f$-GANs under model misspecification.
△ Less
Submitted 14 September, 2022;
originally announced September 2022.
-
Global well-posedness of the 1d compressible Navier-Stokes system with rough data
Authors:
Ke Chen,
Ly Kim Ha,
Ruilin Hu,
Quoc-Hung Nguyen
Abstract:
In this paper, we study the global well-posedness problem for the 1d compressible Navier-Stokers system (cNSE) in gas dynamics with rough initial data. Frist, Liu- Yu (2022) established the global well-posedness theory for the 1d isentropic cNSE with initial velocity data in BV space. Then, it was extended to the 1d cNSE for the polytropic ideal gas with initial velocity and temperature data in BV…
▽ More
In this paper, we study the global well-posedness problem for the 1d compressible Navier-Stokers system (cNSE) in gas dynamics with rough initial data. Frist, Liu- Yu (2022) established the global well-posedness theory for the 1d isentropic cNSE with initial velocity data in BV space. Then, it was extended to the 1d cNSE for the polytropic ideal gas with initial velocity and temperature data in BV space by Wang-Yu-Zhang (2022). We improve the global well-posedness result of Liu-Yu with initial velocity data in $W^{2γ,1}$ space; and of Wang-Yu-Zhang with initial velocity data in $ L^2\cap W^{2γ,1}$ space and initial data of temperature in $\dot W^{-\frac{2}{3},\frac{6}{5}}\cap \dot W^{2γ-1,1}$ for any $γ>0$ \textit{arbitrary small}. Our essential ideas are based on establishing various "end-point" smoothing estimates for the 1d parabolic equation.
△ Less
Submitted 8 September, 2022;
originally announced September 2022.
-
A stochastic agent-based model to evaluate COVID-19 transmission influenced by human mobility
Authors:
Kejie Chen,
Yanqing Li,
Rongxin Zhou,
Xiaomo Jiang
Abstract:
The COVID-19 pandemic has created an urgent need for mathematical models that can project epidemic trends and evaluate the effectiveness of mitigation strategies. To forecast the transmission of COVID-19, a major challenge is the accurate assessment of the multi-scale human mobility and how they impact the infection through close contacts. By combining the stochastic agent-based modeling strategy…
▽ More
The COVID-19 pandemic has created an urgent need for mathematical models that can project epidemic trends and evaluate the effectiveness of mitigation strategies. To forecast the transmission of COVID-19, a major challenge is the accurate assessment of the multi-scale human mobility and how they impact the infection through close contacts. By combining the stochastic agent-based modeling strategy and hierarchical structures of spatial containers corresponding to the notion of places in geography, this study proposes a novel model, Mob-Cov, to study the impact of human traveling behaviour and individual health conditions on the disease outbreak and the probability of zero COVID in the population. Specifically, individuals perform power-law type of local movements within a container and global transport between different-level containers. Frequent short movements inside a small-level container (e.g. a road or a county) and a large population size influence the local crowdedness of people, which accelerates the infection and regional transmission. Travels between large-level containers (e.g. cities and nations) facilitate global spread and outbreak. Moreover, dynamic infection and recovery in the population are able to drive the bifurcation of the system to a "zero-COVID" state or a "live with COVID" state, depending on the mobility patterns, population number and health conditions. Reducing total population and local people accumulation as well as restricting global travels help achieve zero-COVID. In summary, the Mob-Cov model considers more realistic human mobility in a wide range of spatial scales, and has been designed with equal emphasis on performance, low simulation cost, accuracy, ease of use and flexibility. It is a useful tool for researchers and politicians to investigate the pandemic dynamics and plan actions against the disease.
△ Less
Submitted 17 November, 2022; v1 submitted 6 September, 2022;
originally announced September 2022.
-
Local well-posedness of the $1d$ compressible Navier-Stokes system with rough data
Authors:
Ke Chen,
Ruilin Hu,
Quoc-Hung Nguyen
Abstract:
This paper presents a new approach to the local well-posedness of the $1d$ compressible Navier-Stokes systems with rough initial data. Our approach is based on establishing some smoothing and Lipschitz-type estimates for the $1d$ parabolic equation with piecewise continuous coefficients.
This paper presents a new approach to the local well-posedness of the $1d$ compressible Navier-Stokes systems with rough initial data. Our approach is based on establishing some smoothing and Lipschitz-type estimates for the $1d$ parabolic equation with piecewise continuous coefficients.
△ Less
Submitted 28 June, 2022;
originally announced June 2022.
-
A Novel Multi-Agent Scheduling Mechanism for Adaptation of Production Plans in Case of Supply Chain Disruptions
Authors:
**g Tan,
Lars Braubach,
Kai Jander,
Rongjun Xu,
Kai Chen
Abstract:
Manufacturing companies typically use sophisticated production planning systems optimizing production steps, often delivering near-optimal solutions. As a downside for delivering a near-optimal schedule, planning systems have high computational demands resulting in hours of computation. Under normal circumstances this is not issue if there is enough buffer time before implementation of the schedul…
▽ More
Manufacturing companies typically use sophisticated production planning systems optimizing production steps, often delivering near-optimal solutions. As a downside for delivering a near-optimal schedule, planning systems have high computational demands resulting in hours of computation. Under normal circumstances this is not issue if there is enough buffer time before implementation of the schedule (e.g. at night for the next day). However, in case of unexpected disruptions such as delayed part deliveries or defectively manufactured goods, the planned schedule may become invalid and swift replanning becomes necessary. Such immediate replanning is unsuited for existing optimal planners due to the computational requirements. This paper proposes a novel solution that can effectively and efficiently perform replanning in case of different types of disruptions using an existing plan. The approach is based on the idea to adhere to the existing schedule as much as possible, adapting it based on limited local changes. For that purpose an agent-based scheduling mechanism has been devised, in which agents represent materials and production sites and use local optimization techniques and negotiations to generate an adapted (sufficient, but non-optimal) schedule. The approach has been evaluated using real production data from Huawei, showing that efficient schedules are produced in short time. The system has been implemented as proof of concept and is currently reimplemented and transferred to a production system based on the Jadex agent platform.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Fast Multi-grid Methods for Minimizing Curvature Energy
Authors:
Zhenwei Zhang,
Ke Chen,
Ke Tang,
Yu** Duan
Abstract:
The geometric high-order regularization methods such as mean curvature and Gaussian curvature, have been intensively studied during the last decades due to their abilities in preserving geometric properties including image edges, corners, and contrast. However, the dilemma between restoration quality and computational efficiency is an essential roadblock for high-order methods. In this paper, we p…
▽ More
The geometric high-order regularization methods such as mean curvature and Gaussian curvature, have been intensively studied during the last decades due to their abilities in preserving geometric properties including image edges, corners, and contrast. However, the dilemma between restoration quality and computational efficiency is an essential roadblock for high-order methods. In this paper, we propose fast multi-grid algorithms for minimizing both mean curvature and Gaussian curvature energy functionals without sacrificing accuracy for efficiency. Unlike the existing approaches based on operator splitting and the Augmented Lagrangian method (ALM), no artificial parameters are introduced in our formulation, which guarantees the robustness of the proposed algorithm. Meanwhile, we adopt the domain decomposition method to promote parallel computing and use the fine-to-coarse structure to accelerate convergence. Numerical experiments are presented on image denoising, CT, and MRI reconstruction problems to demonstrate the superiority of our method in preserving geometric structures and fine details. The proposed method is also shown effective in dealing with large-scale image processing problems by recovering an image of size $1024\times 1024$ within $40$s, while the ALM method requires around $200$s.
△ Less
Submitted 11 March, 2023; v1 submitted 17 April, 2022;
originally announced April 2022.
-
Composite Anderson acceleration method with dynamic window-sizes and optimized dam**
Authors:
Kewang Chen,
Cornelis Vuik
Abstract:
In this paper, we propose and analyze a set of fully non-stationary Anderson acceleration algorithms with dynamic window sizes and optimized dam**. Although Anderson acceleration (AA) has been used for decades to speed up nonlinear solvers in many applications, most authors are simply using and analyzing the stationary version of Anderson acceleration (sAA) with fixed window size and a constant…
▽ More
In this paper, we propose and analyze a set of fully non-stationary Anderson acceleration algorithms with dynamic window sizes and optimized dam**. Although Anderson acceleration (AA) has been used for decades to speed up nonlinear solvers in many applications, most authors are simply using and analyzing the stationary version of Anderson acceleration (sAA) with fixed window size and a constant dam** factor. The behavior and potential of the non-stationary version of Anderson acceleration methods remain an open question. Since most efficient linear solvers use composable algorithmic components. Similar ideas can be used for AA to solve nonlinear systems. Thus in the present work, to develop non-stationary Anderson acceleration algorithms, we first propose two systematic ways to dynamically alternate the window size $m$ by composition. One simple way to package sAA(m) with sAA(n) in each iteration is applying sAA(m) and sAA(n) separately and then average their results. It is an additive composite combination. The other more important way is the multiplicative composite combination, which means we apply sAA(m) in the outer loop and apply sAA(n) in the inner loop. By doing this, significant gains can be achieved. Secondly, to make AA to be a fully non-stationary algorithm, we need to combine these strategies with our recent work on the non-stationary Anderson acceleration algorithm with optimized dam** (AAoptD), which is another important direction of producing non-stationary AA and nice performance gains have been observed. Moreover, we also investigate the rate of convergence of these non-stationary AA methods under suitable assumptions. Finally, our numerical results show that some of these proposed non-stationary Anderson acceleration algorithms converge faster than the stationary sAA method and they may significantly reduce the storage and time to find the solution in many cases.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
Weighted Sum Formulas from Shuffle Products of Multiple Zeta-star Values
Authors:
Kwang-Wu Chen,
Minking Eie
Abstract:
In this paper, we are going to perform the shuffle products of $Z_-(n) = \sum_{a+b=m} (-1)^{b} ζ(\{1\}^{a},b+2)$ and $Z_+^\star(n) = \sum_{c+d=n} ζ^{\star}(\{1\}^{c},d+2)$ with $m+n = p$. The resulted shuffle relation is a weighted sum formula given by \begin{equation*}
\frac{(p+1)(p+2)}{2} ζ(p+4)
=\sum_{m+n=p} \sum_{|\boldsymbolα|=p+3}
ζ(α_{0}, α_{1}, \ldots, α_{m}, α_{m+1}+1)
\sum_{a+b+c…
▽ More
In this paper, we are going to perform the shuffle products of $Z_-(n) = \sum_{a+b=m} (-1)^{b} ζ(\{1\}^{a},b+2)$ and $Z_+^\star(n) = \sum_{c+d=n} ζ^{\star}(\{1\}^{c},d+2)$ with $m+n = p$. The resulted shuffle relation is a weighted sum formula given by \begin{equation*}
\frac{(p+1)(p+2)}{2} ζ(p+4)
=\sum_{m+n=p} \sum_{|\boldsymbolα|=p+3}
ζ(α_{0}, α_{1}, \ldots, α_{m}, α_{m+1}+1)
\sum_{a+b+c=m}
\Bigl( W_{\boldsymbolα}(a,b,c) + W_{\boldsymbolα}(a,b,c=0)
+ W_{\boldsymbolα}(a=0,b,c) + W_{\boldsymbolα}(a=0,b=m,c=0) \Bigr), \end{equation*} where $W_{\boldsymbolα}(a,b,c) = 2^{σ(a+b+1)-σ(a)-(b+1)} (1-2^{1-α_{a+b+1}}\ \ )$, with $σ(r) = \sum_{j=0}^{r} α_{j}$.
△ Less
Submitted 26 March, 2022;
originally announced March 2022.
-
Non-stationary Anderson acceleration with optimized dam**
Authors:
Kewang Chen,
Cornelis Vuik
Abstract:
Anderson acceleration (AA) has a long history of use and a strong recent interest due to its potential ability to dramatically improve the linear convergence of the fixed-point iteration. Most authors are simply using and analyzing the stationary version of Anderson acceleration (sAA) with a constant dam** factor or without dam**. Little attention has been paid to nonstationary algorithms. How…
▽ More
Anderson acceleration (AA) has a long history of use and a strong recent interest due to its potential ability to dramatically improve the linear convergence of the fixed-point iteration. Most authors are simply using and analyzing the stationary version of Anderson acceleration (sAA) with a constant dam** factor or without dam**. Little attention has been paid to nonstationary algorithms. However, dam** can be useful and is sometimes crucial for simulations in which the underlying fixed-point operator is not globally contractive. The role of this dam** factor has not been fully understood. In the present work, we consider the non-stationary Anderson acceleration algorithm with optimized dam** (AAoptD) in each iteration to further speed up linear and nonlinear iterations by applying one extra inexpensive optimization. We analyze this procedure and develop an efficient and inexpensive implementation scheme. We also show that, compared with the stationary Anderson acceleration with fixed window size sAA(m), optimizing the dam** factors is related to dynamically packaging sAA(m) and sAA(1) in each iteration (alternating window size $m$ is another direction of producing non-stationary AA). Moreover, we show by extensive numerical experiments that the proposed non-stationary Anderson acceleration with optimized dam** procedure often converges much faster than stationary AA with constant dam** or without dam**.
△ Less
Submitted 10 February, 2022;
originally announced February 2022.
-
On three general forms of multiple zeta(-star) values
Authors:
Kwang-Wu Chen,
Minking Eie
Abstract:
In this paper, we investigate three general forms of multiple zeta(-star) values. We use these values to give three new sum formulas for multiple zeta(-star) values with height $\leq 2$ and the evaluation of $ζ^\star(\{1\}^m,\{2\}^{n+1})$. We also give a new proof of sum formula of multiple zeta values.
In this paper, we investigate three general forms of multiple zeta(-star) values. We use these values to give three new sum formulas for multiple zeta(-star) values with height $\leq 2$ and the evaluation of $ζ^\star(\{1\}^m,\{2\}^{n+1})$. We also give a new proof of sum formula of multiple zeta values.
△ Less
Submitted 8 February, 2022;
originally announced February 2022.
-
Low-rank approximation for multiscale PDEs
Authors:
Ke Chen,
Shi Chen,
Qin Li,
Jianfeng Lu,
Stephen J. Wright
Abstract:
Historically, analysis for multiscale PDEs is largely unified while numerical schemes tend to be equation-specific. In this paper, we propose a unified framework for computing multiscale problems through random sampling. This is achieved by incorporating randomized SVD solvers and manifold learning techniques to numerically reconstruct the low-rank features of multiscale PDEs. We use multiscale ra…
▽ More
Historically, analysis for multiscale PDEs is largely unified while numerical schemes tend to be equation-specific. In this paper, we propose a unified framework for computing multiscale problems through random sampling. This is achieved by incorporating randomized SVD solvers and manifold learning techniques to numerically reconstruct the low-rank features of multiscale PDEs. We use multiscale radiative transfer equation and elliptic equation with rough media to showcase the application of this framework.
△ Less
Submitted 7 March, 2022; v1 submitted 24 November, 2021;
originally announced November 2021.
-
A Generalized Proportionate-Type Normalized Subband Adaptive Filter
Authors:
Kuan-Lin Chen,
Ching-Hua Lee,
Bhaskar D. Rao,
Harinath Garudadri
Abstract:
We show that a new design criterion, i.e., the least squares on subband errors regularized by a weighted norm, can be used to generalize the proportionate-type normalized subband adaptive filtering (PtNSAF) framework. The new criterion directly penalizes subband errors and includes a sparsity penalty term which is minimized using the damped regularized Newton's method. The impact of the proposed g…
▽ More
We show that a new design criterion, i.e., the least squares on subband errors regularized by a weighted norm, can be used to generalize the proportionate-type normalized subband adaptive filtering (PtNSAF) framework. The new criterion directly penalizes subband errors and includes a sparsity penalty term which is minimized using the damped regularized Newton's method. The impact of the proposed generalized PtNSAF (GPtNSAF) is studied for the system identification problem via computer simulations. Specifically, we study the effects of using different numbers of subbands and various sparsity penalty terms for quasi-sparse, sparse, and dispersive systems. The results show that the benefit of increasing the number of subbands is larger than promoting sparsity of the estimated filter coefficients when the target system is quasi-sparse or dispersive. On the other hand, for sparse target systems, promoting sparsity becomes more important. More importantly, the two aspects provide complementary and additive benefits to the GPtNSAF for speeding up convergence.
△ Less
Submitted 17 November, 2021;
originally announced November 2021.
-
ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees
Authors:
Kuan-Lin Chen,
Ching-Hua Lee,
Harinath Garudadri,
Bhaskar D. Rao
Abstract:
Models recently used in the literature proving residual networks (ResNets) are better than linear predictors are actually different from standard ResNets that have been widely used in computer vision. In addition to the assumptions such as scalar-valued output or single residual block, these models have no nonlinearities at the final residual representation that feeds into the final affine layer.…
▽ More
Models recently used in the literature proving residual networks (ResNets) are better than linear predictors are actually different from standard ResNets that have been widely used in computer vision. In addition to the assumptions such as scalar-valued output or single residual block, these models have no nonlinearities at the final residual representation that feeds into the final affine layer. To codify such a difference in nonlinearities and reveal a linear estimation property, we define ResNEsts, i.e., Residual Nonlinear Estimators, by simply drop** nonlinearities at the last residual representation from standard ResNets. We show that wide ResNEsts with bottleneck blocks can always guarantee a very desirable training property that standard ResNets aim to achieve, i.e., adding more blocks does not decrease performance given the same set of basis elements. To prove that, we first recognize ResNEsts are basis function models that are limited by a coupling problem in basis learning and linear prediction. Then, to decouple prediction weights from basis learning, we construct a special architecture termed augmented ResNEst (A-ResNEst) that always guarantees no worse performance with the addition of a block. As a result, such an A-ResNEst establishes empirical risk lower bounds for a ResNEst using corresponding bases. Our results demonstrate ResNEsts indeed have a problem of diminishing feature reuse; however, it can be avoided by sufficiently expanding or widening the input space, leading to the above-mentioned desirable property. Inspired by the DenseNets that have been shown to outperform ResNets, we also propose a corresponding new model called Densely connected Nonlinear Estimator (DenseNEst). We show that any DenseNEst can be represented as a wide ResNEst with bottleneck blocks. Unlike ResNEsts, DenseNEsts exhibit the desirable property without any special architectural re-design.
△ Less
Submitted 15 January, 2022; v1 submitted 9 November, 2021;
originally announced November 2021.
-
libdlr: Efficient imaginary time calculations using the discrete Lehmann representation
Authors:
Jason Kaye,
Kun Chen,
Hugo U. R. Strand
Abstract:
We introduce libdlr, a library implementing the recently introduced discrete Lehmann representation (DLR) of imaginary time Green's functions. The DLR basis consists of a collection of exponentials chosen by the interpolative decomposition to ensure stable and efficient recovery of Green's functions from imaginary time or Matsbuara frequency samples. The library provides subroutines to build the D…
▽ More
We introduce libdlr, a library implementing the recently introduced discrete Lehmann representation (DLR) of imaginary time Green's functions. The DLR basis consists of a collection of exponentials chosen by the interpolative decomposition to ensure stable and efficient recovery of Green's functions from imaginary time or Matsbuara frequency samples. The library provides subroutines to build the DLR basis and grids, and to carry out various standard operations. The simplicity of the DLR makes it straightforward to incorporate into existing codes as a replacement for less efficient representations of imaginary time Green's functions, and libdlr is intended to facilitate this process. libdlr is written in Fortran, provides a C header interface, and contains a Python module pydlr. We also introduce a stand-alone Julia implementation, Lehmann.jl.
△ Less
Submitted 21 June, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
-
On the convolutions of sums of multiple zeta(-star) values of height one
Authors:
Kwang-Wu Chen,
Minking Eie
Abstract:
In this paper, we investigate the sums of mutliple zeta(-star) values of height one: $Z_{\pm}(n)=\sum_{a+b=n} (\pm 1)^bζ(\{1\}^a,b+2)$, $Z_{\pm}^{\star}(n)=\sum_{a+b=n} (\pm 1)^bζ^{\star}(\{1\}^a,b+2)$. In particular, we prove that the weighted sum $\sum_{\substack{0\leq m\leq p\\ m: {\rm even}}} \sum_{\mid\boldsymbolα\mid=p+3} 2^{α_{m+1}\ +1}ζ(α_0,α_1,\ldots,α_m,α_{m+1}+1) $ can be evaluated thro…
▽ More
In this paper, we investigate the sums of mutliple zeta(-star) values of height one: $Z_{\pm}(n)=\sum_{a+b=n} (\pm 1)^bζ(\{1\}^a,b+2)$, $Z_{\pm}^{\star}(n)=\sum_{a+b=n} (\pm 1)^bζ^{\star}(\{1\}^a,b+2)$. In particular, we prove that the weighted sum $\sum_{\substack{0\leq m\leq p\\ m: {\rm even}}} \sum_{\mid\boldsymbolα\mid=p+3} 2^{α_{m+1}\ +1}ζ(α_0,α_1,\ldots,α_m,α_{m+1}+1) $ can be evaluated through the convolution of $Z_{-}(m)$ and $Z_{+}(n)$ with $m+n=p$.
△ Less
Submitted 1 October, 2021;
originally announced October 2021.
-
Global Calderón--Zygmund theory for parabolic $p$-Laplacian system: the case $1<p\leq \frac{2n}{n+2}$
Authors:
Ke Chen,
Quoc-Hung Nguyen,
Na Zhao
Abstract:
The aim of this paper is to establish global Calderón--Zygmund theory to parabolic $p$-Laplacian system:
$$ u_t -\operatorname{div}(|\nabla u|^{p-2}\nabla u) = \operatorname{div} (|F|^{p-2}F)~\text{in}~Ω\times (0,T)\subset \mathbb{R}^{n+1},
$$ proving that $$F\in L^q\Rightarrow \nabla u\in L^q,$$ for any $q>\max\{p,\frac{n(2-p)}{2}\}$ and $p>1$. Acerbi and Mingione \cite{Acerbi07} proved this…
▽ More
The aim of this paper is to establish global Calderón--Zygmund theory to parabolic $p$-Laplacian system:
$$ u_t -\operatorname{div}(|\nabla u|^{p-2}\nabla u) = \operatorname{div} (|F|^{p-2}F)~\text{in}~Ω\times (0,T)\subset \mathbb{R}^{n+1},
$$ proving that $$F\in L^q\Rightarrow \nabla u\in L^q,$$ for any $q>\max\{p,\frac{n(2-p)}{2}\}$ and $p>1$. Acerbi and Mingione \cite{Acerbi07} proved this estimate in the case $p>\frac{2n}{n+2}$. In this article we settle the case $1<p\leq \frac{2n}{n+2}$. We also treat systems with discontinuous coefficients having small BMO (bounded mean oscillation) norm.
△ Less
Submitted 13 September, 2021; v1 submitted 6 September, 2021;
originally announced September 2021.
-
Kashaev--Reshetikhin Invariants of Links
Authors:
Kai-Chieh Chen,
Calvin McPhail-Snyder,
Scott Morrison,
Noah Snyder
Abstract:
Kashaev and Reshetikhin previously described a way to define holonomy invariants of knots using quantum $\mathfrak{sl}_2$ at a root of unity. These are generalized quantum invariants depend both on a knot $K$ and a representation of the fundamental group of its complement into $\mathrm{SL}_2(\mathbb{C})$; equivalently, we can think of $\mathrm{KR}(K)$ as associating to each knot a function on (a s…
▽ More
Kashaev and Reshetikhin previously described a way to define holonomy invariants of knots using quantum $\mathfrak{sl}_2$ at a root of unity. These are generalized quantum invariants depend both on a knot $K$ and a representation of the fundamental group of its complement into $\mathrm{SL}_2(\mathbb{C})$; equivalently, we can think of $\mathrm{KR}(K)$ as associating to each knot a function on (a slight generalization of) its character variety. In this paper we clarify some details of their construction. In particular, we show that for $K$ a hyperbolic knot $\mathrm{KaRe}(K)$ can be viewed as a function on the geometric component of the $A$-polynomial curve of $K$. We compute some examples at a third root of unity.
△ Less
Submitted 14 August, 2021;
originally announced August 2021.
-
The Peskin problem with $\dot B^1_{\infty,\infty}$ initial data
Authors:
Ke Chen,
Quoc-Hung Nguyen
Abstract:
In this paper we study the Peskin problem in 2D, which describes the dynamics of a 1D closed elastic structure immersed in a steady Stokes flow. We prove the local well-posedness for arbitrary initial configuration in $(C^2)^{\dot B^1_{\infty,\infty}}$ satisfying the well-stretched condition, and the global well-posedness when the initial configuration is sufficiently close to an equilibrium in…
▽ More
In this paper we study the Peskin problem in 2D, which describes the dynamics of a 1D closed elastic structure immersed in a steady Stokes flow. We prove the local well-posedness for arbitrary initial configuration in $(C^2)^{\dot B^1_{\infty,\infty}}$ satisfying the well-stretched condition, and the global well-posedness when the initial configuration is sufficiently close to an equilibrium in $\dot B^1_{\infty,\infty}$. Here $(C^2)^{\dot B^1_{\infty,\infty}}$ is the closure of $C^2$ in the Besov space $\dot B^1_{\infty,\infty}$. The global-in-time solution will converge to an equilibrium exponentially as $t\rightarrow+\infty$. This is the first well-posedness result for the Peskin problem with non-Lipschitz initial data.
△ Less
Submitted 22 December, 2021; v1 submitted 29 July, 2021;
originally announced July 2021.
-
Discrete Lehmann representation of imaginary time Green's functions
Authors:
Jason Kaye,
Kun Chen,
Olivier Parcollet
Abstract:
We present an efficient basis for imaginary time Green's functions based on a low rank decomposition of the spectral Lehmann representation. The basis functions are simply a set of well-chosen exponentials, so the corresponding expansion may be thought of as a discrete form of the Lehmann representation using an effective spectral density which is a sum of $δ$ functions. The basis is determined on…
▽ More
We present an efficient basis for imaginary time Green's functions based on a low rank decomposition of the spectral Lehmann representation. The basis functions are simply a set of well-chosen exponentials, so the corresponding expansion may be thought of as a discrete form of the Lehmann representation using an effective spectral density which is a sum of $δ$ functions. The basis is determined only by an upper bound on the product $βω_{\max}$, with $β$ the inverse temperature and $ω_{\max}$ an energy cutoff, and a user-defined error tolerance $ε$. The number $r$ of basis functions scales as $\mathcal{O}\left(\log(βω_{\max}) \log (1/ε)\right)$. The discrete Lehmann representation of a particular imaginary time Green's function can be recovered by interpolation at a set of $r$ imaginary time nodes. Both the basis functions and the interpolation nodes can be obtained rapidly using standard numerical linear algebra routines. Due to the simple form of the basis, the discrete Lehmann representation of a Green's function can be explicitly transformed to the Matsubara frequency domain, or obtained directly by interpolation on a Matsubara frequency grid. We benchmark the efficiency of the representation on simple cases, and with a high precision solution of the Sachdev-Ye-Kitaev equation at low temperature. We compare our approach with the related intermediate representation method, and introduce an improved algorithm to build the intermediate representation basis and a corresponding sampling grid.
△ Less
Submitted 17 February, 2022; v1 submitted 27 July, 2021;
originally announced July 2021.