-
Asymptotic quadratic convergence of the Gauss-Newton method for complex phase retrieval
Authors:
Meng Huang
Abstract:
In this paper, we introduce a Gauss-Newton method for solving the complex phase retrieval problem. In contrast to the real-valued setting, the Gauss-Newton matrix for complex-valued signals is rank-deficient and, thus, non-invertible. To address this, we utilize a Gauss-Newton step that moves orthogonally to certain trivial directions. We establish that this modified Gauss-Newton step has a closed…
▽ More
In this paper, we introduce a Gauss-Newton method for solving the complex phase retrieval problem. In contrast to the real-valued setting, the Gauss-Newton matrix for complex-valued signals is rank-deficient and, thus, non-invertible. To address this, we utilize a Gauss-Newton step that moves orthogonally to certain trivial directions. We establish that this modified Gauss-Newton step has a closed-form solution, which corresponds precisely to the minimal-norm solution of the associated least squares problem. Additionally, using the leave-one-out technique, we demonstrate that $m\ge O( n\log^3 n)$ independent complex Gaussian random measurements ensures that the entire trajectory of the Gauss-Newton iterations remains confined within a specific region of incoherence and contraction with high probability. This finding allows us to establish the asymptotic quadratic convergence rate of the Gauss-Newton method without the need of sample splitting.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Positivity for quantum cluster algebras from orbifolds
Authors:
Min Huang
Abstract:
Let $(S,M,U)$ be a marked orbifold with or without punctures and let $\mathcal A_v$ be a quantum cluster algebra from $(S,M,U)$ with arbitrary coefficients and quantization. We provide combinatorial formulas for quantum Laurent expansion of quantum cluster variables of $\mathcal A_v$ concerning an arbitrary quantum seed. Consequently, the positivity for the quantum cluster algebra $\mathcal A_v$ i…
▽ More
Let $(S,M,U)$ be a marked orbifold with or without punctures and let $\mathcal A_v$ be a quantum cluster algebra from $(S,M,U)$ with arbitrary coefficients and quantization. We provide combinatorial formulas for quantum Laurent expansion of quantum cluster variables of $\mathcal A_v$ concerning an arbitrary quantum seed. Consequently, the positivity for the quantum cluster algebra $\mathcal A_v$ is proved.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
A GPU-accelerated Cartesian grid method for PDEs on irregular domain
Authors:
Liwei Tan,
Minsheng Huang,
Wenjun Ying
Abstract:
The kernel-free boundary integral (KFBI) method has successfully solved partial differential equations (PDEs) on irregular domains. Diverging from traditional boundary integral methods, the computation of boundary integrals in KFBI is executed through the resolution of equivalent simple interface problems on Cartesian grids, utilizing fast algorithms. While existing implementations of KFBI methods…
▽ More
The kernel-free boundary integral (KFBI) method has successfully solved partial differential equations (PDEs) on irregular domains. Diverging from traditional boundary integral methods, the computation of boundary integrals in KFBI is executed through the resolution of equivalent simple interface problems on Cartesian grids, utilizing fast algorithms. While existing implementations of KFBI methods predominantly utilize CPU platforms, GPU architecture's superior computational capabilities and extensive memory bandwidth offer an efficient resolution to computational bottlenecks. This paper delineates the algorithms adapted for both single-GPU and multiple-GPU applications. On a single GPU, assigning individual threads can control correction, interpolation, and jump calculations. The algorithm is expanded to multiple GPUs to enhance the processing of larger-scale problems. The arrowhead decomposition method is employed in multiple-GPU settings, ensuring optimal computational efficiency and load balancing. Numerical examples show that the proposed algorithm is second-order accurate and efficient. Single-GPU solver speeds 50-200 times than traditional CPU while the eight GPUs distributed solver yields up to 60% parallel efficiency.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
A GPU-accelerated Cartesian grid method is proposed for solving the heat, wave, and Schrodinger equations on irregular domains
Authors:
Liwei Tan,
Minsheng Huang,
Wenjun Ying
Abstract:
This paper introduces a second-order method for solving general elliptic partial differential equations (PDEs) on irregular domains using GPU acceleration, based on Ying's kernel-free boundary integral (KFBI) method. The method addresses limitations imposed by CFL conditions in explicit schemes and accuracy issues in fully implicit schemes for the Laplacian operator. To overcome these challenges,…
▽ More
This paper introduces a second-order method for solving general elliptic partial differential equations (PDEs) on irregular domains using GPU acceleration, based on Ying's kernel-free boundary integral (KFBI) method. The method addresses limitations imposed by CFL conditions in explicit schemes and accuracy issues in fully implicit schemes for the Laplacian operator. To overcome these challenges, the paper employs a series of second-order time discrete schemes and splits the Laplacian operator into explicit and implicit components. Specifically, the Crank-Nicolson method discretizes the heat equation in the temporal dimension, while the implicit scheme is used for the wave equation. The Schrodinger equation is treated using the Strang splitting method. By discretizing the temporal dimension implicitly, the heat, wave, and Schrodinger equations are transformed into a sequence of elliptic equations. The Laplacian operator on the right-hand side of the elliptic equation is obtained from the numerical scheme rather than being discretized and corrected by the five-point difference method. A Cartesian grid-based KFBI method is employed to solve the resulting elliptic equations. GPU acceleration, achieved through a parallel Cartesian grid solver, enhances the computational efficiency by exploiting high degrees of parallelism. Numerical results demonstrate that the proposed method achieves second-order accuracy for the heat, wave, and Schrodinger equations. Furthermore, the GPU-accelerated solvers for the three types of time-dependent equations exhibit a speedup of 30 times compared to CPU-based solvers.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Covering convection with thermal blankets: formation of supercontinents
Authors:
**zi Mac Huang
Abstract:
The continental plates of Earth are known to drift over a geophysical timescale, and their interactions have lead to some of the most spectacular geoformations of our planet while also causing natural disasters such as earthquakes and volcanic activity. Understanding the dynamics of interacting continental plates is thus significant. In this work, we present a fluid mechanical investigation of the…
▽ More
The continental plates of Earth are known to drift over a geophysical timescale, and their interactions have lead to some of the most spectacular geoformations of our planet while also causing natural disasters such as earthquakes and volcanic activity. Understanding the dynamics of interacting continental plates is thus significant. In this work, we present a fluid mechanical investigation of the plate motion, interaction, and dynamics. Through numerical experiments, we examine the coupling between a convective fluid and plates floating on top of it. With physical modeling, we show the coupling is both mechanical and thermal, leading to the thermal blanket effect: the floating plate is not only transported by the fluid flow beneath, it also prevents the heat from leaving the fluid, leading to a convective flow that further affects the plate motion. By adding several plates to such a coupled fluid-structure interaction, we also investigate how floating plates interact with each other and show that, under proper conditions, small plates can converge into a supercontinent.
△ Less
Submitted 2 April, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
The 2-character theory for finite 2-groups
Authors:
Mo Huang,
Hao Xu,
Zhi-Hao Zhang
Abstract:
In this work, we generalize the notion of character for 2-representations of finite 2-groups. The properties of 2-characters bear strong similarities to those classical characters of finite groups, including conjugation invariance, additivity, multiplicativity and orthogonality. With a careful analysis using homotopy fixed points and quotients for categories with 2-group actions, we prove that the…
▽ More
In this work, we generalize the notion of character for 2-representations of finite 2-groups. The properties of 2-characters bear strong similarities to those classical characters of finite groups, including conjugation invariance, additivity, multiplicativity and orthogonality. With a careful analysis using homotopy fixed points and quotients for categories with 2-group actions, we prove that the category of class functors on a 2-group $\mathcal G$ is equivalent to the Drinfeld center of the 2-group algebra $\mathrm{Vec}_{\mathcal G}$, which categorifies the Fourier transform on finite abelian groups. After transferring the canonical nondegenerate braided monoidal structure from $\mathfrak Z_1(\mathrm{Vec}_{\mathcal G})$, we discover that irreducible 2-characters of $\mathcal G$ coincide with full centers of the corresponding 2-representations, which are in a one-to-one correspondence with Lagrangian algebras in the category of class functors on $\mathcal G$. In particular, the fusion rule of $2\mathrm{Rep}(\mathcal G)$ can be calculated from the pointwise product of Lagrangian algebras as class functors. From a topological quantum field theory (TQFT) point of view, the commutative Frobenius algebra structure on a 2-character is induced from a 2D topological sigma-model with target space $\lvert \mathrm{B} \mathcal G \rvert$.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Learning Best-in-Class Policies for the Predict-then-Optimize Framework
Authors:
Michael Huang,
Vishal Gupta
Abstract:
We propose a novel family of decision-aware surrogate losses, called Perturbation Gradient (PG) losses, for the predict-then-optimize framework. These losses directly approximate the downstream decision loss and can be optimized using off-the-shelf gradient-based methods. Importantly, unlike existing surrogate losses, the approximation error of our PG losses vanishes as the number of samples grows…
▽ More
We propose a novel family of decision-aware surrogate losses, called Perturbation Gradient (PG) losses, for the predict-then-optimize framework. These losses directly approximate the downstream decision loss and can be optimized using off-the-shelf gradient-based methods. Importantly, unlike existing surrogate losses, the approximation error of our PG losses vanishes as the number of samples grows. This implies that optimizing our surrogate loss yields a best-in-class policy asymptotically, even in misspecified settings. This is the first such result in misspecified settings and we provide numerical evidence confirming our PG losses substantively outperform existing proposals when the underlying model is misspecified and the noise is not centrally symmetric. Insofar as misspecification is commonplace in practice -- especially when we might prefer a simpler, more interpretable model -- PG losses offer a novel, theoretically justified, method for computationally tractable decision-aware learning.
△ Less
Submitted 8 February, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
Last fall degree of semi-local polynomial systems
Authors:
Ming-Deh A. Huang
Abstract:
We study the last fall degrees of {\em semi-local} polynomial systems, and the computational complexity of solving such systems for closed-point and rational-point solutions, where the systems are defined over a finite field. A semi-local polynomial system specifies an algebraic set which is the image of a global linear transformation of a direct product of local affine algebraic sets. As a specia…
▽ More
We study the last fall degrees of {\em semi-local} polynomial systems, and the computational complexity of solving such systems for closed-point and rational-point solutions, where the systems are defined over a finite field. A semi-local polynomial system specifies an algebraic set which is the image of a global linear transformation of a direct product of local affine algebraic sets. As a special but interesting case, polynomial systems that arise from Weil restriction of algebraic sets in an affine space of low dimension are semi-local. Such systems have received considerable attention due to their application in cryptography. Our main results bound the last fall degree of a semi-local polynomial system in terms of the number of closed point solutions, and yield an efficient algorithm for finding all rational-point solutions when the prime characteristic of the finite field and the number of rational solutions are small. Our results on solving semi-local systems imply an improvement on a previously known polynomial-time attack on the HFE (Hidden Field Equations) cryptosystems. The attacks implied in our results extend to public key encryption functions which are based on semi-local systems where either the number of closed point solutions is small, or the characteristic of the field is small. It remains plausible to construct public key cryptosystems based on semi-local systems over a finite field of large prime characteristic with exponential number of closed point solutions. Such a method is presented in the paper, followed by further cryptanalysis involving the isomorphism of polynomials (IP) problem, as well as a concrete public key encryption scheme which is secure against all the attacks discussed in this paper.
△ Less
Submitted 5 November, 2023;
originally announced November 2023.
-
Stochastic Smoothed Gradient Descent Ascent for Federated Minimax Optimization
Authors:
Wei Shen,
Minhui Huang,
Jiawei Zhang,
Cong Shen
Abstract:
In recent years, federated minimax optimization has attracted growing interest due to its extensive applications in various machine learning tasks. While Smoothed Alternative Gradient Descent Ascent (Smoothed-AGDA) has proved its success in centralized nonconvex minimax optimization, how and whether smoothing technique could be helpful in federated setting remains unexplored. In this paper, we pro…
▽ More
In recent years, federated minimax optimization has attracted growing interest due to its extensive applications in various machine learning tasks. While Smoothed Alternative Gradient Descent Ascent (Smoothed-AGDA) has proved its success in centralized nonconvex minimax optimization, how and whether smoothing technique could be helpful in federated setting remains unexplored. In this paper, we propose a new algorithm termed Federated Stochastic Smoothed Gradient Descent Ascent (FESS-GDA), which utilizes the smoothing technique for federated minimax optimization. We prove that FESS-GDA can be uniformly used to solve several classes of federated minimax problems and prove new or better analytical convergence results for these settings. We showcase the practical efficiency of FESS-GDA in practical federated learning tasks of training generative adversarial networks (GANs) and fair classification.
△ Less
Submitted 18 April, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Riesz type theorems for $κ$-pluriharmonic map**s, invariant harmonic quasiregular map**s and harmonic quasiregular map**s
Authors:
Shaolin Chen,
Manzi Huang
Abstract:
The main purpose of this paper is to develop some methods to improve and generalize the main results in a recent paper by Liu and Zhu (Adv. Math., 2023, i.e., \cite{L-Z}). The paper consists of two parts. In the first part, we discuss the Riesz type theorem in the setting of $n$-dimensional complex spaces for all $n\geq 1$. In this part, we first introduce the family of $κ$-pluriharmonic map**s…
▽ More
The main purpose of this paper is to develop some methods to improve and generalize the main results in a recent paper by Liu and Zhu (Adv. Math., 2023, i.e., \cite{L-Z}). The paper consists of two parts. In the first part, we discuss the Riesz type theorem in the setting of $n$-dimensional complex spaces for all $n\geq 1$. In this part, we first introduce the family of $κ$-pluriharmonic map**s of the $n$-dimensional complex unit ball. Then we establish two Riesz type theorems for these map**s, which are the $n$-dimensional versions of Theorems 1.1 and 1.2 in \cite{L-Z}, respectively. Furthermore, even when $n=1$, our first result shows that the assumption of the real parts of the map**s not being negative (or being negative) in \cite[Theorem 1.1]{L-Z} is redundant; and our second result illustrates that the assumption of "quasiconformality" on the map**s in \cite[Theorem 1.2]{L-Z} can be replaced by the weaker one of "quasiregularity". In the second part, we investigate the Riesz type theorem in the setting of $n$-dimensional real spaces for all $n\geq 2$. In this part, first, we prove a Riesz type theorem for invariant harmonic quasiregular map**s of the unit $n$-dimensional real ball. Our result indicates that $(i)$ the range of the parameter $p$ discussed in \cite[Theorem 1.3]{L-Z} can be changed from $(1,2)$ to $(1,\infty)$; $(ii)$ the assumption of the first coordinate functions of the map**s being non-zero in \cite[Theorem 1.3]{L-Z} is redundant. In this way, we complete the discussions carried out in \cite[Theorems 1.3 and 1.4]{L-Z}. Second, we obtain a Riesz type theorem for harmonic $K$-quasiregular map**s of the unit $n$-dimensional real ball. Our result demonstrates that the range of the parameter $p$ discussed in \cite[Theorem 2.1]{K-2023} can be changed from $(1,2)$ to $(1,\infty)$.
△ Less
Submitted 29 October, 2023; v1 submitted 23 October, 2023;
originally announced October 2023.
-
ADI schemes for heat equations with irregular boundaries and interfaces in 3D with applications
Authors:
Han Zhou,
Minsheng Huang,
Wenjun Ying
Abstract:
In this paper, efficient alternating direction implicit (ADI) schemes are proposed to solve three-dimensional heat equations with irregular boundaries and interfaces. Starting from the well-known Douglas-Gunn ADI scheme, a modified ADI scheme is constructed to mitigate the issue of accuracy loss in solving problems with time-dependent boundary conditions. The unconditional stability of the new ADI…
▽ More
In this paper, efficient alternating direction implicit (ADI) schemes are proposed to solve three-dimensional heat equations with irregular boundaries and interfaces. Starting from the well-known Douglas-Gunn ADI scheme, a modified ADI scheme is constructed to mitigate the issue of accuracy loss in solving problems with time-dependent boundary conditions. The unconditional stability of the new ADI scheme is also rigorously proven with the Fourier analysis. Then, by combining the ADI schemes with a 1D kernel-free boundary integral (KFBI) method, KFBI-ADI schemes are developed to solve the heat equation with irregular boundaries. In 1D sub-problems of the KFBI-ADI schemes, the KFBI discretization takes advantage of the Cartesian grid and preserves the structure of the coefficient matrix so that the fast Thomas algorithm can be applied to solve the linear system efficiently. Second-order accuracy and unconditional stability of the KFBI-ADI schemes are verified through several numerical tests for both the heat equation and a reaction-diffusion equation. For the Stefan problem, which is a free boundary problem of the heat equation, a level set method is incorporated into the ADI method to capture the time-dependent interface. Numerical examples for simulating 3D dendritic solidification phenomenons are also presented.
△ Less
Submitted 2 September, 2023;
originally announced September 2023.
-
Friezes of cluster algebras of geometric type
Authors:
Antoine de Saint Germain,
Min Huang,
Jiang-Hua Lu
Abstract:
For a cluster algebra $\mathcal{A}$ over $\mathbb{Q}$ of geometric type, a $\textit{frieze}$ of $\mathcal{A}$ is defined to be a $\mathbb{Q}$-algebra homomorphism from $\mathcal{A}$ to $\mathbb{Q}$ that takes positive integer values on all cluster variables and all frozen variables. We present some basic facts on friezes, including frieze testing criteria, the notion of $\textit{frieze points}$ wh…
▽ More
For a cluster algebra $\mathcal{A}$ over $\mathbb{Q}$ of geometric type, a $\textit{frieze}$ of $\mathcal{A}$ is defined to be a $\mathbb{Q}$-algebra homomorphism from $\mathcal{A}$ to $\mathbb{Q}$ that takes positive integer values on all cluster variables and all frozen variables. We present some basic facts on friezes, including frieze testing criteria, the notion of $\textit{frieze points}$ when $\mathcal{A}$ is finitely generated, and pullbacks of friezes under certain $\mathbb{Q}$-algebra homomorphisms. When the cluster algebra $\mathcal{A}$ is acyclic, we define $\textit{frieze patterns associated to acyclic seeds of }\mathcal{A}$, generalizing the $\textit{ frieze patterns with coefficients of type } A$ studied by J. Propp and by M. Cuntz, T. Holm, and P. Jorgensen, and we give a sufficient condition for such frieze patterns to be equivalent to friezes. For the special cases when $\mathcal{A}$ has an acyclic seed with either trivial coefficients, principal coefficients, or what we call the $\textit{BFZ coefficients}$ (named after A. Berenstein, S. Fomin, and A. Zelevinsky), we identify frieze points of $\mathcal{A}$ both geometrically as certain positive integral points in explicitly described affine varieties and Lie theoretically (in the finite case) in terms of reduced double Bruhat cells and generalized minors on the associated semi-simple Lie groups. Furthermore, extending the gliding symmetry of the classical Coxeter frieze patterns of type $A$, we determine the symmetry of frieze patterns of any finite type with arbitrary coefficients.
△ Less
Submitted 3 October, 2023; v1 submitted 2 September, 2023;
originally announced September 2023.
-
Positive 2-bridge knots and chirally cosmetic surgeries
Authors:
Michael Huang,
Zelong Li,
Rahi Tanaz,
Chengyi Zhang
Abstract:
In this paper we verify that with the exception of the $(2, 2n+1)$ torus knots, positive 2-bridge knots up to 31 crossings do not admit chirally cosmetic surgeries. A knot $K$ admits chirally cosmetic surgeries if there exist surgeries $S^3_r$ and $S^3_{r'}$ with distinct slopes $r$ and $r'$ such that $S^3_r(K) \cong -S^3_{r'}(K)$, where the negative represents an orientation reversal. To verify t…
▽ More
In this paper we verify that with the exception of the $(2, 2n+1)$ torus knots, positive 2-bridge knots up to 31 crossings do not admit chirally cosmetic surgeries. A knot $K$ admits chirally cosmetic surgeries if there exist surgeries $S^3_r$ and $S^3_{r'}$ with distinct slopes $r$ and $r'$ such that $S^3_r(K) \cong -S^3_{r'}(K)$, where the negative represents an orientation reversal. To verify this, we use the obstruction formula from arXiv:2112.03144 which relates classical knot invariants to the existence of chirally cosmetic surgeries. To check the formula, we develop a Python program that computes the classical knot invariants $a_2$, $a_4$, $v_3$, $\det$, and $g$ of a positive 2-bridge knot.
△ Less
Submitted 22 August, 2023; v1 submitted 19 August, 2023;
originally announced August 2023.
-
Fluid pendulum explains reversals of the large-scale circulation in thermal convection
Authors:
Nicholas J. Moore,
**zi Mac Huang
Abstract:
We introduce a low-dimensional dynamical system to describe thermal convection in an annulus. The model derives systematically from a Fourier-Laurent truncation of the governing Navier-Stokes Boussinesq equations with no adjustable parameters and with the ability to generalize to any order. Comparison with fully resolved numerical solutions shows that the leading-order model captures parameter bif…
▽ More
We introduce a low-dimensional dynamical system to describe thermal convection in an annulus. The model derives systematically from a Fourier-Laurent truncation of the governing Navier-Stokes Boussinesq equations with no adjustable parameters and with the ability to generalize to any order. Comparison with fully resolved numerical solutions shows that the leading-order model captures parameter bifurcations and reversals of the large-scale circulation (LSC) with quantitative accuracy, including states of (i) steady circulating flow, (ii) chaotic LSC reversals, and (iii) periodic LSC reversals. Casting the system in terms of the fluid's angular momentum and center of mass (CoM) reveals equivalence to a damped pendulum with forcing that raises the CoM above the fulcrum. This formulation offers a transparent mechanism for LSC reversals, namely the inertial overshoot of a driven pendulum, and it yields accurate predictions for the frequency of regular LSC reversals in the high Rayleigh-number limit.
△ Less
Submitted 25 July, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
A convective fluid pendulum revealing states of order and chaos
Authors:
**zi Mac Huang,
Nicholas J. Moore
Abstract:
We examine thermal convection in a two-dimensional annulus using fully resolved direct numerical simulation (DNS) in conjunction with a low-dimensional model deriving from Galerkin truncation of the governing Navier-Stokes Boussinesq (NSB) equations. The DNS is based on fast and accurate pseudo-spectral discretization of the full NSB system with implicit-explicit time step**. Inspired by the num…
▽ More
We examine thermal convection in a two-dimensional annulus using fully resolved direct numerical simulation (DNS) in conjunction with a low-dimensional model deriving from Galerkin truncation of the governing Navier-Stokes Boussinesq (NSB) equations. The DNS is based on fast and accurate pseudo-spectral discretization of the full NSB system with implicit-explicit time step**. Inspired by the numerical results, we propose a reduced model that is based on a Fourier-Laurent truncation of the NSB system and can generalize to any degree of accuracy. We demonstrate that the lowest-order model capable of satisfying all boundary conditions on the annulus successfully captures reversals of the large-scale circulation (LSC) in certain regimes. Based on both the DNS and stability analysis of the reduced model, we identify a sequence of transitions between (i) a motionless conductive state, (ii) a state of steady circulation, (iii) non-periodic dynamics and chaotic reversals of the LSC, (iv) a high Rayleigh-number state in which LSC reversals are periodic despite turbulent fluctuations at the small scale. The reduced model reveals a link to a damped pendulum system with a particular form of external forcing. The oscillatory pendulum motion provides an accurate prediction for the LSC reversal frequency in the high Rayleigh-number regime.
△ Less
Submitted 25 July, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Tannaka-Krein duality for finite 2-groups
Authors:
Mo Huang,
Zhi-Hao Zhang
Abstract:
Let $\mathcal{G}$ be a finite 2-group. We show that the 2-category $2\mathrm{Rep}(\mathcal{G})$ of finite semisimple 2-representations is a symmetric fusion 2-category. We also relate the auto-equivalence 2-group of the symmetric monoidal forgetful 2-functor $ω: 2\mathrm{Rep}(\mathcal{G}) \to 2\mathrm{Vec}$ to the auto-equivalence 2-group of the regular algebra and show that they are equivalent to…
▽ More
Let $\mathcal{G}$ be a finite 2-group. We show that the 2-category $2\mathrm{Rep}(\mathcal{G})$ of finite semisimple 2-representations is a symmetric fusion 2-category. We also relate the auto-equivalence 2-group of the symmetric monoidal forgetful 2-functor $ω: 2\mathrm{Rep}(\mathcal{G}) \to 2\mathrm{Vec}$ to the auto-equivalence 2-group of the regular algebra and show that they are equivalent to $\mathcal{G}$. This result categorifies the usual Tannaka-Krein duality for finite groups.
△ Less
Submitted 27 September, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
-
On successive minimal bases of division points of Drinfeld modules
Authors:
Maozhou Huang
Abstract:
We define successive minimal bases (SMBs) for the space of $u^{n}$-division points of a Drinfeld $\mathbb{F}_{q}[t]$-module over a local field, where $u$ is a finite prime of $\mathbb{F}_{q}[t]$ and $n$ is a positive integer. These SMBs share similar properties to those of SMBs of the lattices associated to Drinfeld modules. We study the relations between these SMBs and those of the lattices. Fina…
▽ More
We define successive minimal bases (SMBs) for the space of $u^{n}$-division points of a Drinfeld $\mathbb{F}_{q}[t]$-module over a local field, where $u$ is a finite prime of $\mathbb{F}_{q}[t]$ and $n$ is a positive integer. These SMBs share similar properties to those of SMBs of the lattices associated to Drinfeld modules. We study the relations between these SMBs and those of the lattices. Finally, we apply the relations to study the explicit wild ramification subgroup action on an SMB of the space of $u^{n}$-division points and show the function field analogue of Szpiro's conjecture for rank $2$ Drinfeld modules under a certain limited situation.
△ Less
Submitted 16 December, 2023; v1 submitted 6 April, 2023;
originally announced April 2023.
-
Quasi-homomorphisms of quantum cluster algebras
Authors:
Wen Chang,
Min Huang,
Jian-Rong Li
Abstract:
In this paper, we study quasi-homomorphisms of quantum cluster algebras, which are quantum analogy of quasi-homomorphisms of cluster algebras introduced by Fraser.
For a quantum Grassmannian cluster algebra $\mathbb{C}_q[{\rm Gr}(k,n)]$, we show that there is an associated braid group and each generator $σ_i$ of the braid group preserves the quasi-commutative relations of quantum Plücker coordin…
▽ More
In this paper, we study quasi-homomorphisms of quantum cluster algebras, which are quantum analogy of quasi-homomorphisms of cluster algebras introduced by Fraser.
For a quantum Grassmannian cluster algebra $\mathbb{C}_q[{\rm Gr}(k,n)]$, we show that there is an associated braid group and each generator $σ_i$ of the braid group preserves the quasi-commutative relations of quantum Plücker coordinates and exchange relations of the quantum Grassmannian cluster algebra. We conjecture that $σ_i$ also preserves $r$-term ($r \ge 4$) quantum Plücker relations of $\mathbb{C}_q[{\rm Gr}(k,n)]$ and other relations which cannot be derived from quantum quantum Plücker relations (if any). Up to this conjecture, we show that $σ_i$ is a quasi-automorphism of $\mathbb{C}_q[{\rm Gr}(k,n)]$ and the braid group acts on $\mathbb{C}_q[{\rm Gr}(k,n)]$.
△ Less
Submitted 1 September, 2023; v1 submitted 23 March, 2023;
originally announced March 2023.
-
Locally biHölder continuous map**s and their induced embeddings between Besov spaces
Authors:
Manzi Huang,
Xiantao Wang,
Zhuang Wang,
Zhihao Xu
Abstract:
In this paper, we introduce a class of homeomorphisms between metric spaces, which are locally biHölder continuous map**s. Then an embedding result between Besov spaces induced by locally biHölder continuous map**s between Ahlfors regular spaces is established, which extends the corresponding result of Björn-Björn-Gill-Shanmugalingam (J. Reine Angew. Math. 725: 63-114, 2017). Furthermore, an e…
▽ More
In this paper, we introduce a class of homeomorphisms between metric spaces, which are locally biHölder continuous map**s. Then an embedding result between Besov spaces induced by locally biHölder continuous map**s between Ahlfors regular spaces is established, which extends the corresponding result of Björn-Björn-Gill-Shanmugalingam (J. Reine Angew. Math. 725: 63-114, 2017). Furthermore, an example is constructed to show that our embedding result is more general. We also introduce a geometric condition, named as uniform boundedness, to characterize when a quasisymmetric map** between uniformly perfect spaces is locally biHölder continuous.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Isoperimetric type inequalities for map**s induced by weighted Laplace differential operators
Authors:
Jiaolong Chen,
Shaolin Chen,
Manzi Huang,
Huaqing Zheng
Abstract:
The main purpose of this paper is to establish some isoperimetric type inequalities for map**s induced by the weighted Laplace differential operators. The obtained results of this paper provide improvements and extensions of the corresponding known results.
The main purpose of this paper is to establish some isoperimetric type inequalities for map**s induced by the weighted Laplace differential operators. The obtained results of this paper provide improvements and extensions of the corresponding known results.
△ Less
Submitted 16 February, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Achieving Linear Speedup in Non-IID Federated Bilevel Learning
Authors:
Minhui Huang,
Dewei Zhang,
Kaiyi Ji
Abstract:
Federated bilevel optimization has received increasing attention in various emerging machine learning and communication applications. Recently, several Hessian-vector-based algorithms have been proposed to solve the federated bilevel optimization problem. However, several important properties in federated learning such as the partial client participation and the linear speedup for convergence (i.e…
▽ More
Federated bilevel optimization has received increasing attention in various emerging machine learning and communication applications. Recently, several Hessian-vector-based algorithms have been proposed to solve the federated bilevel optimization problem. However, several important properties in federated learning such as the partial client participation and the linear speedup for convergence (i.e., the convergence rate and complexity are improved linearly with respect to the number of sampled clients) in the presence of non-i.i.d.~datasets, still remain open. In this paper, we fill these gaps by proposing a new federated bilevel algorithm named FedMBO with a novel client sampling scheme in the federated hypergradient estimation. We show that FedMBO achieves a convergence rate of $\mathcal{O}\big(\frac{1}{\sqrt{nK}}+\frac{1}{K}+\frac{\sqrt{n}}{K^{3/2}}\big)$ on non-i.i.d.~datasets, where $n$ is the number of participating clients in each round, and $K$ is the total number of iteration. This is the first theoretical linear speedup result for non-i.i.d.~federated bilevel optimization. Extensive experiments validate our theoretical results and demonstrate the effectiveness of our proposed method.
△ Less
Submitted 10 February, 2023;
originally announced February 2023.
-
Toeplitz operators on $\mathcal L^p$-spaces of a tree
Authors:
Mingmei Huang,
Xiaoyan Zhang,
Xianfeng Zhao
Abstract:
Let $T$ be a rooted, countable infinite tree without terminal vertices. In the present paper, we characterize the spectra, self-adjointness and positivity of Toeplitz operators on the spaces of $p$-summable functions on $T$. Moreover, we obtain a necessary and sufficient condition for Toeplitz operators to have finite rank on such function spaces.
Let $T$ be a rooted, countable infinite tree without terminal vertices. In the present paper, we characterize the spectra, self-adjointness and positivity of Toeplitz operators on the spaces of $p$-summable functions on $T$. Moreover, we obtain a necessary and sufficient condition for Toeplitz operators to have finite rank on such function spaces.
△ Less
Submitted 19 January, 2023;
originally announced January 2023.
-
Quasi-symmetries between metric spaces and rough quasi-isometries between their infinite hyperbolic cones
Authors:
Manzi Huang,
Zhihao Xu
Abstract:
In this paper, we first prove that any power quasi-symmetry of two metric spaces induces a rough quasi-isometry between their infinite hyperbolic cones. Second, we prove that for a complete metric space $Z$, there exists a point $ω$ in the Gromov boundary of its infinite hyperbolic cone such that $Z$ can be seen as the Gromov boundary relative to $ω$ of its infinite hyperbolic cone. Third, we prov…
▽ More
In this paper, we first prove that any power quasi-symmetry of two metric spaces induces a rough quasi-isometry between their infinite hyperbolic cones. Second, we prove that for a complete metric space $Z$, there exists a point $ω$ in the Gromov boundary of its infinite hyperbolic cone such that $Z$ can be seen as the Gromov boundary relative to $ω$ of its infinite hyperbolic cone. Third, we prove that for a visual Gromov hyperbolic metric space $X$ and a Gromov boundary point $ω$, $X$ is roughly similar to the infinite hyperbolic cone of its Gromov boundary relative to $ω$. These are the generalizations of Theorem 7.4, Theorem 8.1 and Theorem 8.2 in [3] since the underlying spaces are not assumed to be bounded and the hyperbolic cones are infinite.
△ Less
Submitted 6 April, 2024; v1 submitted 27 November, 2022;
originally announced November 2022.
-
A sequential linear programming (SLP) approach for uncertainty analysis-based data-driven computational mechanics
Authors:
Mengcheng Huang,
Chang Liu,
Zongliang Du,
Shan Tang,
Xu Guo
Abstract:
In this article, an efficient sequential linear programming algorithm (SLP) for uncertainty analysis-based data-driven computational mechanics (UA-DDCM) is presented. By assuming that the uncertain constitutive relationship embedded behind the prescribed data set can be characterized through a convex combination of the local data points, the upper and lower bounds of structural responses pertainin…
▽ More
In this article, an efficient sequential linear programming algorithm (SLP) for uncertainty analysis-based data-driven computational mechanics (UA-DDCM) is presented. By assuming that the uncertain constitutive relationship embedded behind the prescribed data set can be characterized through a convex combination of the local data points, the upper and lower bounds of structural responses pertaining to the given data set, which are more valuable for making decisions in engineering design, can be found by solving a sequential of linear programming problems very efficiently. Numerical examples demonstrate the effectiveness of the proposed approach on sparse data set and its robustness with respect to the existence of noise and outliers in the data set.
△ Less
Submitted 8 November, 2022;
originally announced November 2022.
-
Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity
Authors:
Xuxing Chen,
Minhui Huang,
Shiqian Ma,
Krishnakumar Balasubramanian
Abstract:
Bilevel optimization recently has received tremendous attention due to its great success in solving important machine learning problems like meta learning, reinforcement learning, and hyperparameter optimization. Extending single-agent training on bilevel problems to the decentralized setting is a natural generalization, and there has been a flurry of work studying decentralized bilevel optimizati…
▽ More
Bilevel optimization recently has received tremendous attention due to its great success in solving important machine learning problems like meta learning, reinforcement learning, and hyperparameter optimization. Extending single-agent training on bilevel problems to the decentralized setting is a natural generalization, and there has been a flurry of work studying decentralized bilevel optimization algorithms. However, it remains unknown how to design the distributed algorithm with sample complexity and convergence rate comparable to SGD for stochastic optimization, and at the same time without directly computing the exact Hessian or Jacobian matrices. In this paper we propose such an algorithm. More specifically, we propose a novel decentralized stochastic bilevel optimization (DSBO) algorithm that only requires first order stochastic oracle, Hessian-vector product and Jacobian-vector product oracle. The sample complexity of our algorithm matches the currently best known results for DSBO, and the advantage of our algorithm is that it does not require estimating the full Hessian and Jacobian matrices, thereby having improved per-iteration complexity.
△ Less
Submitted 31 May, 2023; v1 submitted 23 October, 2022;
originally announced October 2022.
-
Measure solutions to piston problem for compressible fluid flow of generalized chaplygin gas
Authors:
Meixiang Huang,
Yuan** Wang,
Zhiqiang Shao
Abstract:
We study the piston problem of the compressible fluid flow with the generalized Chaplygin gas. Depending on the inferential critical value of Mach number, we prove that, there exists an integral weak solution for the proceeding piston problem, consisting of a shock separating constant states ahead of the piston if Mach numbers less than this critical value, while a singular measure solution, with…
▽ More
We study the piston problem of the compressible fluid flow with the generalized Chaplygin gas. Depending on the inferential critical value of Mach number, we prove that, there exists an integral weak solution for the proceeding piston problem, consisting of a shock separating constant states ahead of the piston if Mach numbers less than this critical value, while a singular measure solution, with density containing a Dirac measure supported on the piston, shall be proposed to solve the proceeding piston problem if Mach numbers greater than or equal to the critical value. For the receding piston problem, rarefaction wave solution always exists when the piston recedes from the gas with any constant speed. Moreover, the occurrence of vacuum state and the convergence of solutions, as well as degeneration of equations are analyzed in the receding case as Mach number tends to infinity.
△ Less
Submitted 18 September, 2022;
originally announced September 2022.
-
No existence of linear algorithm for Fourier phase retrieval
Authors:
Meng Huang,
Zhiqiang Xu
Abstract:
Fourier phase retrieval, which seeks to reconstruct a signal from its Fourier magnitude, is of fundamental importance in fields of engineering and science. In this paper, we give a theoretical understanding of algorithms for Fourier phase retrieval. Particularly, we show if there exists an algorithm which could reconstruct an arbitrary signal ${\mathbf x}\in {\mathbb C}^N$ in…
▽ More
Fourier phase retrieval, which seeks to reconstruct a signal from its Fourier magnitude, is of fundamental importance in fields of engineering and science. In this paper, we give a theoretical understanding of algorithms for Fourier phase retrieval. Particularly, we show if there exists an algorithm which could reconstruct an arbitrary signal ${\mathbf x}\in {\mathbb C}^N$ in $ \mbox{Poly}(N) \log(1/ε)$ time to reach $ε$-precision from its magnitude of discrete Fourier transform and its initial value $x(0)$, then $\mathcal{ P}=\mathcal{NP}$. This demystifies the phenomenon that, although almost all signals are determined uniquely by their Fourier magnitude with a prior conditions, there is no algorithm with theoretical guarantees being proposed over the past few decades. Our proofs employ the result in computational complexity theory that Product Partition problem is NP-complete in the strong sense.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
Piston problem to the isentropic Euler equations for modified Chaplygin gas
Authors:
Meixiang Huang,
Yuan** Wang,
Zhiqiang Shao
Abstract:
In this paper, we solve constructively the piston problem for one-dimensional isentropic Euler equations of modified Chaplygin gas. In solutions, we prove rigorously the global existence and uniqueness of a shock wave separating constant states ahead of the piston when the piston pushed forward into the gas. It is quite different from the results of Chaplygin gas or generalized Chaplygin gas in wh…
▽ More
In this paper, we solve constructively the piston problem for one-dimensional isentropic Euler equations of modified Chaplygin gas. In solutions, we prove rigorously the global existence and uniqueness of a shock wave separating constant states ahead of the piston when the piston pushed forward into the gas. It is quite different from the results of Chaplygin gas or generalized Chaplygin gas in which a Radon measure solution is constructed to deal with concentration of mass on the piston. When the piston pulled back from the gas, we strictly confirm only the first family rarefaction wave exists in front of the piston and the concentration will never occur. In addition, by studying the limiting behavior, we show that the piston solutions of modified Chaplygin gas equations tend to the piston solutions of generalized or pure Chaplygin gas equations as a single parameter of pressure state function vanishes.
△ Less
Submitted 9 August, 2022;
originally announced August 2022.
-
Decentralized Bilevel Optimization
Authors:
Xuxing Chen,
Minhui Huang,
Shiqian Ma
Abstract:
Bilevel optimization has been successfully applied to many important machine learning problems. Algorithms for solving bilevel optimization have been studied under various settings. In this paper, we study the nonconvex-strongly-convex bilevel optimization under a decentralized setting. We design decentralized algorithms for both deterministic and stochastic bilevel optimization problems. Moreover…
▽ More
Bilevel optimization has been successfully applied to many important machine learning problems. Algorithms for solving bilevel optimization have been studied under various settings. In this paper, we study the nonconvex-strongly-convex bilevel optimization under a decentralized setting. We design decentralized algorithms for both deterministic and stochastic bilevel optimization problems. Moreover, we analyze the convergence rates of the proposed algorithms in difference scenarios including the case where data heterogeneity is observed across agents. Numerical experiments on both synthetic and real data demonstrate that the proposed methods are efficient.
△ Less
Submitted 12 June, 2022;
originally announced June 2022.
-
Ramification of Tate modules for rank $2$ Drinfeld modules
Authors:
Takuya Asayama,
Maozhou Huang
Abstract:
In this paper, we study the ramification of extensions of a function field generated by division points of rank 2 Drinfeld modules. Also conductors of certain rank 2 Drinfeld modules are defined as analogues of those for elliptic curves. A calculation of these conductors allows us to show an analogue of Szpiro's conjecture under a certain limited situation.
In this paper, we study the ramification of extensions of a function field generated by division points of rank 2 Drinfeld modules. Also conductors of certain rank 2 Drinfeld modules are defined as analogues of those for elliptic curves. A calculation of these conductors allows us to show an analogue of Szpiro's conjecture under a certain limited situation.
△ Less
Submitted 28 February, 2023; v1 submitted 28 April, 2022;
originally announced April 2022.
-
On the monotonicity of the generalized Markov numbers
Authors:
Min Huang
Abstract:
Using the Markov distance and Ptolemy inequality introduced by Lee-Li-Rabideau-Schiffler \cite{LLRS}, we completely determine the monotonicity of the generalized Markov numbers along the lines of a given slope.
Using the Markov distance and Ptolemy inequality introduced by Lee-Li-Rabideau-Schiffler \cite{LLRS}, we completely determine the monotonicity of the generalized Markov numbers along the lines of a given slope.
△ Less
Submitted 26 April, 2022; v1 submitted 25 April, 2022;
originally announced April 2022.
-
Characterizations for the existence of traces of first-order Sobolev spaces on hyperbolic fillings
Authors:
Manzi Huang,
Zhihao Xu
Abstract:
In this paper, we study the existence of traces for Sobolev spaces on the hyperbolic filling $X$ of a compact metric space $Z$ equipped with a doubling measure. Given a suitable metric on $X$, we can regard $Z$ as the boundary of $X$. After equip** $X$ with a weighted measure $μ$ via the measure on $Z$ and the Euclidean arc length, we give characterizations for the existence of traces for first-…
▽ More
In this paper, we study the existence of traces for Sobolev spaces on the hyperbolic filling $X$ of a compact metric space $Z$ equipped with a doubling measure. Given a suitable metric on $X$, we can regard $Z$ as the boundary of $X$. After equip** $X$ with a weighted measure $μ$ via the measure on $Z$ and the Euclidean arc length, we give characterizations for the existence of traces for first-order Sobolev spaces.
△ Less
Submitted 2 May, 2022; v1 submitted 27 March, 2022;
originally announced March 2022.
-
On product decomposition
Authors:
Ming-Deh A. Huang
Abstract:
Given a finite set $W$ in $\bar{k}^n$ where $\bar{k}$ is the algebraic closure of a field $k$ one would like to determine if $W$ can be decomposed as $\prod_{i=1}^n V_i$ where $V_i \subset \bar{k}$ under a linear transformation, that is, $W\stackrelλ{\to} \prod_{i=1}^n V_i$ where $λ\in Gl_n (\bar{k})$. We assume that $W$ is presented as $W=Z(\mathcal{F})$, the zero set of a polynomial system…
▽ More
Given a finite set $W$ in $\bar{k}^n$ where $\bar{k}$ is the algebraic closure of a field $k$ one would like to determine if $W$ can be decomposed as $\prod_{i=1}^n V_i$ where $V_i \subset \bar{k}$ under a linear transformation, that is, $W\stackrelλ{\to} \prod_{i=1}^n V_i$ where $λ\in Gl_n (\bar{k})$. We assume that $W$ is presented as $W=Z(\mathcal{F})$, the zero set of a polynomial system $\mathcal{F}$ in $n$ variables over $k$. We study algebraic characterization of such product decomposition. For decomposition into component sets of the same cardinality we obtain a stronger characterization and show that the decomposition in this case is essentially unique (up to permutation and scalar multiplication of coordinates). We investigate computational problems that arise from the decomposition problem.
△ Less
Submitted 30 December, 2021;
originally announced January 2022.
-
The global landscape of phase retrieval II: quotient intensity models
Authors:
Jian-Feng Cai,
Meng Huang,
Dong Li,
Yang Wang
Abstract:
A fundamental problem in phase retrieval is to reconstruct an unknown signal from a set of magnitude-only measurements. In this work we introduce three novel quotient intensity-based models (QIMs) based a deep modification of the traditional intensity-based models. A remarkable feature of the new loss functions is that the corresponding geometric landscape is benign under the optimal sampling comp…
▽ More
A fundamental problem in phase retrieval is to reconstruct an unknown signal from a set of magnitude-only measurements. In this work we introduce three novel quotient intensity-based models (QIMs) based a deep modification of the traditional intensity-based models. A remarkable feature of the new loss functions is that the corresponding geometric landscape is benign under the optimal sampling complexity. When the measurements $ a_i\in \Rn$ are Gaussian random vectors and the number of measurements $m\ge Cn$, the QIMs admit no spurious local minimizers with high probability, i.e., the target solution $ x$ is the unique global minimizer (up to a global phase) and the loss function has a negative directional curvature around each saddle point. Such benign geometric landscape allows the gradient descent methods to find the global solution $x$ (up to a global phase) without spectral initialization.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
The global landscape of phase retrieval I: perturbed amplitude models
Authors:
Jian-Feng Cai,
Meng Huang,
Dong Li,
Yang Wang
Abstract:
A fundamental task in phase retrieval is to recover an unknown signal $\vx\in \Rn$ from a set of magnitude-only measurements $y_i=\abs{\nj{\va_i,\vx}}, \; i=1,\ldots,m$. In this paper, we propose two novel perturbed amplitude models (PAMs) which have non-convex and quadratic-type loss function. When the measurements $ \va_i \in \Rn$ are Gaussian random vectors and the number of measurements…
▽ More
A fundamental task in phase retrieval is to recover an unknown signal $\vx\in \Rn$ from a set of magnitude-only measurements $y_i=\abs{\nj{\va_i,\vx}}, \; i=1,\ldots,m$. In this paper, we propose two novel perturbed amplitude models (PAMs) which have non-convex and quadratic-type loss function. When the measurements $ \va_i \in \Rn$ are Gaussian random vectors and the number of measurements $m\ge Cn$, we rigorously prove that the PAMs admit no spurious local minimizers with high probability, i.e., the target solution $ \vx$ is the unique global minimizer (up to a global phase) and the loss function has a negative directional curvature around each saddle point. Thanks to the well-tamed benign geometric landscape, one can employ the vanilla gradient descent method to locate the global minimizer $\vx$ (up to a global phase) without spectral initialization. We carry out extensive numerical experiments to show that the gradient descent algorithm with random initialization outperforms state-of-the-art algorithms with spectral initialization in empirical success rate and convergence speed.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
Besicovitch almost periodic solutions of abstract semi-linear differential equations with delay
Authors:
Yongkun Li,
Mei Huang,
Bing Li
Abstract:
In this paper, we first use the Bohr property to give a definition of Besicovitch almost periodic functions, and study some basic properties of Besicovitch almost periodic functions, including the equivalence of the Bohr property and the Bochner property. Then, as an application, we use the contraction principal to obtain the existence and uniqueness of Besicovitch almost periodic solutions for a…
▽ More
In this paper, we first use the Bohr property to give a definition of Besicovitch almost periodic functions, and study some basic properties of Besicovitch almost periodic functions, including the equivalence of the Bohr property and the Bochner property. Then, as an application, we use the contraction principal to obtain the existence and uniqueness of Besicovitch almost periodic solutions for a class of abstract semi-linear differential equations with delay.
△ Less
Submitted 9 December, 2021;
originally announced December 2021.
-
Simplex Initialization: A Survey of Techniques and Trends
Authors:
Mengyu Huang,
Yuxing Zhong,
Huiwen Yang,
Jiazheng Wang,
Fan Zhang,
Bo Bai,
Ling Shi
Abstract:
The simplex method is one of the most fundamental technologies for solving linear programming (LP) problems and has been widely applied to different practical applications. In the past literature, how to improve and accelerate the simplex method has attracted plenty of research. One important way to achieve this goal is to find a better initialization method for the simplex. In this survey, we aim…
▽ More
The simplex method is one of the most fundamental technologies for solving linear programming (LP) problems and has been widely applied to different practical applications. In the past literature, how to improve and accelerate the simplex method has attracted plenty of research. One important way to achieve this goal is to find a better initialization method for the simplex. In this survey, we aim to provide an overview about the initialization methods in the primal and dual simplex, respectively. We also propose several potential future directions about how to improve the existing initialization methods with the help of advanced learning technologies.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.
-
On the Convergence of Projected Alternating Maximization for Equitable and Optimal Transport
Authors:
Minhui Huang,
Shiqian Ma,
Lifeng Lai
Abstract:
This paper studies the equitable and optimal transport (EOT) problem, which has many applications such as fair division problems and optimal transport with multiple agents etc. In the discrete distributions case, the EOT problem can be formulated as a linear program (LP). Since this LP is prohibitively large for general LP solvers, Scetbon \etal \cite{scetbon2021equitable} suggests to perturb the…
▽ More
This paper studies the equitable and optimal transport (EOT) problem, which has many applications such as fair division problems and optimal transport with multiple agents etc. In the discrete distributions case, the EOT problem can be formulated as a linear program (LP). Since this LP is prohibitively large for general LP solvers, Scetbon \etal \cite{scetbon2021equitable} suggests to perturb the problem by adding an entropy regularization. They proposed a projected alternating maximization algorithm (PAM) to solve the dual of the entropy regularized EOT. In this paper, we provide the first convergence analysis of PAM. A novel rounding procedure is proposed to help construct the primal solution for the original EOT problem. We also propose a variant of PAM by incorporating the extrapolation technique that can numerically improve the performance of PAM. Results in this paper may shed lights on block coordinate (gradient) descent methods for general optimization problems.
△ Less
Submitted 30 September, 2021; v1 submitted 29 September, 2021;
originally announced September 2021.
-
Borderline case of traces and extensions for weighted Sobolev spaces
Authors:
Manzi Huang,
Xiantao Wang,
Zhuang Wang,
Zhihao Xu
Abstract:
In this paper, we study the traces and the extensions for weighted Sobolev spaces on upper half spaces when the weights reach to the borderline cases. We first give a full characterization of the existence of trace spaces for these weighted Sobolev spaces, and then study the trace parts and the extension parts between the weighted Sobolev spaces and a new kind of Besov-type spaces (on hyperplanes)…
▽ More
In this paper, we study the traces and the extensions for weighted Sobolev spaces on upper half spaces when the weights reach to the borderline cases. We first give a full characterization of the existence of trace spaces for these weighted Sobolev spaces, and then study the trace parts and the extension parts between the weighted Sobolev spaces and a new kind of Besov-type spaces (on hyperplanes) which are defined by using integral averages over selected layers of dyadic cubes.
△ Less
Submitted 28 September, 2021;
originally announced September 2021.
-
Linear convergence of randomized Kaczmarz method for solving complex-valued phaseless equations
Authors:
Meng Huang,
Yang Wang
Abstract:
A randomized Kaczmarz method was recently proposed for phase retrieval, which has been shown numerically to exhibit empirical performance over other state-of-the-art phase retrieval algorithms both in terms of the sampling complexity and in terms of computation time. While the rate of convergence has been studied well in the real case where the signals and measurement vectors are all real-valued,…
▽ More
A randomized Kaczmarz method was recently proposed for phase retrieval, which has been shown numerically to exhibit empirical performance over other state-of-the-art phase retrieval algorithms both in terms of the sampling complexity and in terms of computation time. While the rate of convergence has been studied well in the real case where the signals and measurement vectors are all real-valued, there is no guarantee for the convergence in the complex case. In fact, the linear convergence of the randomized Kaczmarz method for phase retrieval in the complex setting is left as a conjecture by Tan and Vershynin. In this paper, we provide the first theoretical guarantees for it. We show that for random measurements $\mathbf{a}_j \in \mathbb{C}^n, j=1,\ldots,m $ which are drawn independently and uniformly from the complex unit sphere, or equivalent are independent complex Gaussian random vectors, when $m \ge Cn$ for some universal positive constant $C$, the randomized Kaczmarz scheme with a good initialization converges linearly to the target solution (up to a global phase) in expectation with high probability. This gives a positive answer to that conjecture.
△ Less
Submitted 24 September, 2021;
originally announced September 2021.
-
$p$-harmonic map**s between metric spaces
Authors:
Chang-Yu Guo,
Manzi Huang,
Zhuang Wang,
Haiqing Xu
Abstract:
In this paper, we solve the Dirichlet problem for Sobolev maps between singular metric spaces that extends the corresponding result of Guo and Wenger [Comm. Anal. Geom. 2020]. The main new ingredient in our proofs is a suitable extension of the theory of trace for metric valued Sobolev maps developed by Korevaar and Schoen [Comm. Anal. Geom. 1993]. We also develop a theory of trace in the borderli…
▽ More
In this paper, we solve the Dirichlet problem for Sobolev maps between singular metric spaces that extends the corresponding result of Guo and Wenger [Comm. Anal. Geom. 2020]. The main new ingredient in our proofs is a suitable extension of the theory of trace for metric valued Sobolev maps developed by Korevaar and Schoen [Comm. Anal. Geom. 1993]. We also develop a theory of trace in the borderline case, which investigates a sharp condition to characterize the existence of traces.
△ Less
Submitted 17 September, 2021;
originally announced September 2021.
-
Debiasing In-Sample Policy Performance for Small-Data, Large-Scale Optimization
Authors:
Vishal Gupta,
Michael Huang,
Paat Rusmevichientong
Abstract:
Motivated by the poor performance of cross-validation in settings where data are scarce, we propose a novel estimator of the out-of-sample performance of a policy in data-driven optimization.Our approach exploits the optimization problem's sensitivity analysis to estimate the gradient of the optimal objective value with respect to the amount of noise in the data and uses the estimated gradient to…
▽ More
Motivated by the poor performance of cross-validation in settings where data are scarce, we propose a novel estimator of the out-of-sample performance of a policy in data-driven optimization.Our approach exploits the optimization problem's sensitivity analysis to estimate the gradient of the optimal objective value with respect to the amount of noise in the data and uses the estimated gradient to debias the policy's in-sample performance. Unlike cross-validation techniques, our approach avoids sacrificing data for a test set, utilizes all data when training and, hence, is well-suited to settings where data are scarce. We prove bounds on the bias and variance of our estimator for optimization problems with uncertain linear objectives but known, potentially non-convex, feasible regions. For more specialized optimization problems where the feasible region is "weakly-coupled" in a certain sense, we prove stronger results. Specifically, we provide explicit high-probability bounds on the error of our estimator that hold uniformly over a policy class and depends on the problem's dimension and policy class's complexity. Our bounds show that under mild conditions, the error of our estimator vanishes as the dimension of the optimization problem grows, even if the amount of available data remains small and constant. Said differently, we prove our estimator performs well in the small-data, large-scale regime. Finally, we numerically compare our proposed method to state-of-the-art approaches through a case-study on dispatching emergency medical response services using real data. Our method provides more accurate estimates of out-of-sample performance and learns better-performing policies.
△ Less
Submitted 2 August, 2022; v1 submitted 26 July, 2021;
originally announced July 2021.
-
Linear quadratic mean field games: Decentralized $O(1/N)$-Nash equilibria
Authors:
Minyi Huang,
Xuwei Yang
Abstract:
This paper studies an asymptotic solvability problem for linear quadratic (LQ) mean field games with controlled diffusions and indefinite weights for the state and control in the costs. We employ a rescaling approach to derive a low dimensional Riccati ordinary differential equation (ODE) system, which characterizes a necessary and sufficient condition for asymptotic solvability. The rescaling tec…
▽ More
This paper studies an asymptotic solvability problem for linear quadratic (LQ) mean field games with controlled diffusions and indefinite weights for the state and control in the costs. We employ a rescaling approach to derive a low dimensional Riccati ordinary differential equation (ODE) system, which characterizes a necessary and sufficient condition for asymptotic solvability. The rescaling technique is further used for performance estimates, establishing an $O(1/N)$-Nash equilibrium for the obtained decentralized strategies.
△ Less
Submitted 17 September, 2021; v1 submitted 19 July, 2021;
originally announced July 2021.
-
Uniqueness and stability for the solution of a nonlinear least squares problem
Authors:
Meng Huang,
Zhiqiang Xu
Abstract:
In this paper, we focus on the nonlinear least squares: $\mbox{min}_{\mathbf{x} \in \mathbb{H}^d}\| |A\mathbf{x}|-\mathbf{b}\|$ where $A\in \mathbb{H}^{m\times d}$, $\mathbf{b} \in \mathbb{R}^m$ with $\mathbb{H} \in \{\mathbb{R},\mathbb{C} \}$ and consider the uniqueness and stability of solutions. Such problem arises, for instance, in phase retrieval and absolute value rectification neural networ…
▽ More
In this paper, we focus on the nonlinear least squares: $\mbox{min}_{\mathbf{x} \in \mathbb{H}^d}\| |A\mathbf{x}|-\mathbf{b}\|$ where $A\in \mathbb{H}^{m\times d}$, $\mathbf{b} \in \mathbb{R}^m$ with $\mathbb{H} \in \{\mathbb{R},\mathbb{C} \}$ and consider the uniqueness and stability of solutions. Such problem arises, for instance, in phase retrieval and absolute value rectification neural networks. For the case where $\mathbf{b}=|A\mathbf{x}_0|$ for some $\mathbf{x}_0\in \mathbb{H}^d$, many results have been developed to characterize the uniqueness and stability of solutions. However, for the case where $\mathbf{b} \neq |A\mathbf{x}_0| $ for any $\mathbf{x}_0\in \mathbb{H}^d$, there is no existing result for it to the best of our knowledge. In this paper, we first focus on the uniqueness of solutions and show for any matrix $A\in \mathbb{H}^{m \times d}$ there always exists a vector $\mathbf{b} \in \mathbb{R}^m$ such that the solution is not unique. But, in real case, such ``bad'' vectors $\mathbf{b}$ are negligible, namely, if $\mathbf{b} \in \mathbb{R}_{+}^m$ does not lie in some measure zero set, then the solution is unique. We also present some conditions under which the solution is unique. For the stability of solutions, we prove that the solution is never uniformly stable. But if we restrict the vectors $\mathbf{b}$ to any convex set then it is stable.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
On the last fall degree of Weil descent polynomial systems
Authors:
Ming-Deh Huang
Abstract:
Given a polynomial system $\mathcal{F}$ over a finite field $k$ which is not necessarily of dimension zero, we consider the Weil descent $\mathcal{F}'$ of $\mathcal{F}$ over a subfield $k'$. We prove a theorem which relates the last fall degrees of $\mathcal{F}_1$ and $\mathcal{F}'_1$, where the zero set of $\mathcal{F}_1$ corresponds bijectively to the set of $k$-rational points of $\mathcal{F}$,…
▽ More
Given a polynomial system $\mathcal{F}$ over a finite field $k$ which is not necessarily of dimension zero, we consider the Weil descent $\mathcal{F}'$ of $\mathcal{F}$ over a subfield $k'$. We prove a theorem which relates the last fall degrees of $\mathcal{F}_1$ and $\mathcal{F}'_1$, where the zero set of $\mathcal{F}_1$ corresponds bijectively to the set of $k$-rational points of $\mathcal{F}$, and the zero set of $\mathcal{F}'_1$ is the set of $k'$-rational points of the Weil descent $\mathcal{F}'$. As an application we derive upper bounds on the last fall degree of $\mathcal{F}'_1$ in the case where $\mathcal{F}$ is a set of linearized polynomials.
△ Less
Submitted 8 March, 2021;
originally announced March 2021.
-
Esca** Saddle Points for Nonsmooth Weakly Convex Functions via Perturbed Proximal Algorithms
Authors:
Minhui Huang
Abstract:
We propose perturbed proximal algorithms that can provably escape strict saddles for nonsmooth weakly convex functions. The main results are based on a novel characterization of $ε$-approximate local minimum for nonsmooth functions, and recent developments on perturbed gradient methods for esca** saddle points for smooth problems. Specifically, we show that under standard assumptions, the pertur…
▽ More
We propose perturbed proximal algorithms that can provably escape strict saddles for nonsmooth weakly convex functions. The main results are based on a novel characterization of $ε$-approximate local minimum for nonsmooth functions, and recent developments on perturbed gradient methods for esca** saddle points for smooth problems. Specifically, we show that under standard assumptions, the perturbed proximal point, perturbed proximal gradient and perturbed proximal linear algorithms find $ε$-approximate local minimum for nonsmooth weakly convex functions in $O(ε^{-2}\log(d)^4)$ iterations, where $d$ is the dimension of the problem.
△ Less
Submitted 8 April, 2021; v1 submitted 4 February, 2021;
originally announced February 2021.
-
Solving phase retrieval with random initial guess is nearly as good as by spectral initialization
Authors:
Jianfeng Cai,
Meng Huang,
Dong Li,
Yang Wang
Abstract:
The problem of recovering a signal $\mathbf{x}\in \mathbb{R}^n$ from a set of magnitude measurements $y_i=|\langle \mathbf{a}_i, \mathbf{x} \rangle |, \; i=1,\ldots,m$ is referred as phase retrieval, which has many applications in fields of physical sciences and engineering. In this paper we show that the smoothed amplitude flow model for phase retrieval has benign geometric structure under the op…
▽ More
The problem of recovering a signal $\mathbf{x}\in \mathbb{R}^n$ from a set of magnitude measurements $y_i=|\langle \mathbf{a}_i, \mathbf{x} \rangle |, \; i=1,\ldots,m$ is referred as phase retrieval, which has many applications in fields of physical sciences and engineering. In this paper we show that the smoothed amplitude flow model for phase retrieval has benign geometric structure under the optimal sampling complexity. In particular, we show that when the measurements $\mathbf{a}_i\in \mathbb{R}^n$ are Gaussian random vectors and the number of measurements $m\ge Cn$, our smoothed amplitude flow model has no spurious local minimizers with high probability, ie., the target solution $\mathbf{x}$ is the unique global minimizer (up to a global phase) and the loss function has a negative directional curvature around each saddle point. Due to this benign geometric landscape, the phase retrieval problem can be solved by the gradient descent algorithms without spectral initialization. Numerical experiments show that the gradient descent algorithm with random initialization performs well even comparing with state-of-the-art algorithms with spectral initialization in empirical success rate and convergence speed.
△ Less
Submitted 10 January, 2021;
originally announced January 2021.
-
Binary Mean Field Stochastic Games: Stationary Equilibria and Comparative Statics
Authors:
Minyi Huang,
Yan Ma
Abstract:
This paper considers mean field games in a multi-agent Markov decision process (MDP) framework. Each player has a continuum state and binary action, and benefits from the improvement of the condition of the overall population. Based on an infinite horizon discounted individual cost, we show existence of a stationary equilibrium, and prove its uniqueness under a positive externality condition. We f…
▽ More
This paper considers mean field games in a multi-agent Markov decision process (MDP) framework. Each player has a continuum state and binary action, and benefits from the improvement of the condition of the overall population. Based on an infinite horizon discounted individual cost, we show existence of a stationary equilibrium, and prove its uniqueness under a positive externality condition. We further analyze comparative statics of the stationary equilibrium by quantitatively determining the impact of the effort cost.
△ Less
Submitted 1 January, 2021;
originally announced January 2021.
-
Linear quadratic mean field social optimization: Asymptotic solvability and decentralized control
Authors:
Minyi Huang,
Xuwei Yang
Abstract:
This paper studies asymptotic solvability of a linear quadratic (LQ) mean field social optimization problem with controlled diffusions and indefinite state and control weights. Starting with an $N$-agent model, we employ a rescaling approach to derive a low-dimensional Riccati ordinary differential equation (ODE) system, which characterizes a necessary and sufficient condition for asymptotic solva…
▽ More
This paper studies asymptotic solvability of a linear quadratic (LQ) mean field social optimization problem with controlled diffusions and indefinite state and control weights. Starting with an $N$-agent model, we employ a rescaling approach to derive a low-dimensional Riccati ordinary differential equation (ODE) system, which characterizes a necessary and sufficient condition for asymptotic solvability. The decentralized control obtained from the mean field limit ensures a bounded optimality loss in minimizing the social cost having magnitude $O(N)$, which implies an optimality loss of $O(1/N)$ per agent. We further quantify the efficiency gain of the social optimum with respect to the solution of the mean field game.
△ Less
Submitted 13 September, 2021; v1 submitted 31 December, 2020;
originally announced December 2020.
-
A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance
Authors:
Minhui Huang,
Shiqian Ma,
Lifeng Lai
Abstract:
The Wasserstein distance has become increasingly important in machine learning and deep learning. Despite its popularity, the Wasserstein distance is hard to approximate because of the curse of dimensionality. A recently proposed approach to alleviate the curse of dimensionality is to project the sampled data from the high dimensional probability distribution onto a lower-dimensional subspace, and…
▽ More
The Wasserstein distance has become increasingly important in machine learning and deep learning. Despite its popularity, the Wasserstein distance is hard to approximate because of the curse of dimensionality. A recently proposed approach to alleviate the curse of dimensionality is to project the sampled data from the high dimensional probability distribution onto a lower-dimensional subspace, and then compute the Wasserstein distance between the projected data. However, this approach requires to solve a max-min problem over the Stiefel manifold, which is very challenging in practice. The only existing work that solves this problem directly is the RGAS (Riemannian Gradient Ascent with Sinkhorn Iteration) algorithm, which requires to solve an entropy-regularized optimal transport problem in each iteration, and thus can be costly for large-scale problems. In this paper, we propose a Riemannian block coordinate descent (RBCD) method to solve this problem, which is based on a novel reformulation of the regularized max-min problem over the Stiefel manifold. We show that the complexity of arithmetic operations for RBCD to obtain an $ε$-stationary point is $O(ε^{-3})$. This significantly improves the corresponding complexity of RGAS, which is $O(ε^{-12})$. Moreover, our RBCD has very low per-iteration complexity, and hence is suitable for large-scale problems. Numerical results on both synthetic and real datasets demonstrate that our method is more efficient than existing methods, especially when the number of sampled data is very large.
△ Less
Submitted 27 September, 2021; v1 submitted 9 December, 2020;
originally announced December 2020.