-
Structured and Balanced Multi-component and Multi-layer Neural Networks
Authors:
Shijun Zhang,
Hongkai Zhao,
Yimin Zhong,
Haomin Zhou
Abstract:
In this work, we propose a balanced multi-component and multi-layer neural network (MMNN) structure to approximate functions with complex features with both accuracy and efficiency in terms of degrees of freedom and computation cost. The main idea is motivated by a multi-component, each of which can be approximated effectively by a single-layer network, and multi-layer decomposition in a "divide-a…
▽ More
In this work, we propose a balanced multi-component and multi-layer neural network (MMNN) structure to approximate functions with complex features with both accuracy and efficiency in terms of degrees of freedom and computation cost. The main idea is motivated by a multi-component, each of which can be approximated effectively by a single-layer network, and multi-layer decomposition in a "divide-and-conquer" type of strategy to deal with a complex function. While an easy modification to fully connected neural networks (FCNNs) or multi-layer perceptrons (MLPs) through the introduction of balanced multi-component structures in the network, MMNNs achieve a significant reduction of training parameters, a much more efficient training process, and a much improved accuracy compared to FCNNs or MLPs. Extensive numerical experiments are presented to illustrate the effectiveness of MMNNs in approximating high oscillatory functions and its automatic adaptivity in capturing localized features.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Koebe uniformization for infinitely connected attracting Fatou domains
Authors:
Xiaoguang Wang,
Yi Zhong
Abstract:
This paper works on the structure of infinitely connected Fatou damains of rational maps in terms of Koebe uniformization. Due to the complicated boundary behavior, the existing uniformization results are failed to apply in general. We proved that if the rational map is geometrically finite, then its infinitely connected attracting Fatou damain is conformally homeomorphic to a circle domain.
This paper works on the structure of infinitely connected Fatou damains of rational maps in terms of Koebe uniformization. Due to the complicated boundary behavior, the existing uniformization results are failed to apply in general. We proved that if the rational map is geometrically finite, then its infinitely connected attracting Fatou damain is conformally homeomorphic to a circle domain.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Two-Source and Affine Non-Malleable Extractors for Small Entropy
Authors:
Xin Li,
Yan Zhong
Abstract:
Non-malleable extractors are generalizations and strengthening of standard randomness extractors, that are resilient to adversarial tampering. Such extractors have wide applications in cryptography and explicit construction of extractors. In the well-studied models of two-source and affine non-malleable extractors, the previous best constructions only work for entropy rate $>2/3$ and $1-γ$ respect…
▽ More
Non-malleable extractors are generalizations and strengthening of standard randomness extractors, that are resilient to adversarial tampering. Such extractors have wide applications in cryptography and explicit construction of extractors. In the well-studied models of two-source and affine non-malleable extractors, the previous best constructions only work for entropy rate $>2/3$ and $1-γ$ respectively by Li (FOCS' 23).
We present explicit constructions of two-source and affine non-malleable extractors that match the state-of-the-art constructions of standard ones for small entropy. Our main results include two-source and affine non-malleable extractors (over $\mathsf{F}_2$) for sources on $n$ bits with min-entropy $k \ge \log^C n$ and polynomially small error, matching the parameters of standard extractors by Chattopadhyay and Zuckerman (STOC' 16, Annals of Mathematics' 19) and Li (FOCS' 16), as well as those with min-entropy $k = O(\log n)$ and constant error, matching the parameters of standard extractors by Li (FOCS' 23).
Our constructions significantly improve previous results, and the parameters (entropy requirement and error) are the best possible without first improving the constructions of standard extractors. In addition, our improved affine non-malleable extractors give strong lower bounds for a certain kind of read-once linear branching programs, recently introduced by Gryaznov, Pudlák, and Talebanfard (CCC' 22) as a generalization of several well-studied computational models. These bounds match the previously best-known average-case hardness results given by Chattopadhyay and Liao (CCC' 23) and Li (FOCS' 23), where the branching program size lower bounds are close to optimal, but the explicit functions we use here are different.\ Our results also suggest a possible deeper connection between non-malleable extractors and standard ones.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning
Authors:
Bo Lin,
Yangzheng Zhong,
Weiqing Ren
Abstract:
Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology. The transition pathway plays an important role in characterizing the mechanism underlying the transition, for example, in the study of conformational changes of bio-molecules. In fact, computing the transition pathway is a challengi…
▽ More
Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology. The transition pathway plays an important role in characterizing the mechanism underlying the transition, for example, in the study of conformational changes of bio-molecules. In fact, computing the transition pathway is a challenging task for complex and high-dimensional systems. In this work, we formulate the path-finding task as a cost minimization problem over a particular path space. The cost function is adapted from the Freidlin-Wentzell action functional so that it is able to deal with rough potential landscapes. The path-finding problem is then solved using a actor-critic method based on the deep deterministic policy gradient algorithm (DDPG). The method incorporates the potential force of the system in the policy for generating episodes and combines physical properties of the system with the learning process for molecular systems. The exploitation and exploration nature of reinforcement learning enables the method to efficiently sample the transition events and compute the globally optimal transition pathway. We illustrate the effectiveness of the proposed method using three benchmark systems including an extended Mueller system and the Lennard-Jones system of seven particles.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Well-posedness and no-uniform dependence for the Euler-Poincaré equations in Triebel-Lizorkin spaces
Authors:
Yuanhua Zhong,
Jianzhong Lu,
Min Li,
**lu Li
Abstract:
In this paper, we study the Cauchy problem of the Euler-Poincaré equations in $\R^d$ with initial data belonging to the Triebel-Lizorkin spaces. We prove the local-in-time unique existence of solutions to the Euler-Poincaré equations in $F^s_{p,r}(\R^d)$. Furthermore, we obtain that the data-to-solution of this equation is continuous but not uniformly continuous in these spaces.
In this paper, we study the Cauchy problem of the Euler-Poincaré equations in $\R^d$ with initial data belonging to the Triebel-Lizorkin spaces. We prove the local-in-time unique existence of solutions to the Euler-Poincaré equations in $F^s_{p,r}(\R^d)$. Furthermore, we obtain that the data-to-solution of this equation is continuous but not uniformly continuous in these spaces.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Robustness of the data-driven approach in limited angle tomography
Authors:
Yiran Wang,
Yimin Zhong
Abstract:
The limited angle Radon transform is notoriously difficult to invert due to the ill-posedness. In this work, we give a mathematical explanation that the data-driven approach based on deep neural networks can reconstruct more information in a stable way compared to traditional methods.
The limited angle Radon transform is notoriously difficult to invert due to the ill-posedness. In this work, we give a mathematical explanation that the data-driven approach based on deep neural networks can reconstruct more information in a stable way compared to traditional methods.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Probabilistic Guarantees of Stochastic Recursive Gradient in Non-Convex Finite Sum Problems
Authors:
Yanjie Zhong,
Jiaqi Li,
Soumendra Lahiri
Abstract:
This paper develops a new dimension-free Azuma-Hoeffding type bound on summation norm of a martingale difference sequence with random individual bounds. With this novel result, we provide high-probability bounds for the gradient norm estimator in the proposed algorithm Prob-SARAH, which is a modified version of the StochAstic Recursive grAdient algoritHm (SARAH), a state-of-art variance reduced al…
▽ More
This paper develops a new dimension-free Azuma-Hoeffding type bound on summation norm of a martingale difference sequence with random individual bounds. With this novel result, we provide high-probability bounds for the gradient norm estimator in the proposed algorithm Prob-SARAH, which is a modified version of the StochAstic Recursive grAdient algoritHm (SARAH), a state-of-art variance reduced algorithm that achieves optimal computational complexity in expectation for the finite sum problem. The in-probability complexity by Prob-SARAH matches the best in-expectation result up to logarithmic factors. Empirical experiments demonstrate the superior probabilistic performance of Prob-SARAH on real datasets compared to other popular algorithms.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Error Analysis for the Implicit Boundary Integral Method
Authors:
Yimin Zhong,
Kui Ren,
Olof Runborg,
Richard Tsai
Abstract:
The implicit boundary integral method (IBIM) provides a framework to construct quadrature rules on regular lattices for integrals over irregular domain boundaries. This work provides a systematic error analysis for IBIMs on uniform Cartesian grids for boundaries with different degree of regularities. We first show that the quadrature error gains an addition order of $\frac{d-1}{2}$ from the curvat…
▽ More
The implicit boundary integral method (IBIM) provides a framework to construct quadrature rules on regular lattices for integrals over irregular domain boundaries. This work provides a systematic error analysis for IBIMs on uniform Cartesian grids for boundaries with different degree of regularities. We first show that the quadrature error gains an addition order of $\frac{d-1}{2}$ from the curvature for a strongly convex smooth boundary due to the ``randomness'' in the signed distances. This gain is discounted for degenerated convex surfaces. We then extend the error estimate to general boundaries under some special circumstances, including how quadrature error depends on the boundary's local geometry relative to the underlying grid. Bounds on the variance of the quadrature error under random shifts and rotations of the lattices are also derived.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Inequalities for the $k$-Regular Overpartitions
Authors:
Yi Peng,
Helen W. J. Zhang,
Ying Zhong
Abstract:
Bessenrodt and Ono, Chen, Wang and Jia, DeSalvo and Pak were the first to discover the log-subadditivity, log-concavity, and the third-order Turán inequality of partition function, respectively. Many other important partition statistics are proved to enjoy similar properties. This paper focuses on the partition function $\overline{p}_k(n)$, which counts the number of overpartitions of $n$ with no…
▽ More
Bessenrodt and Ono, Chen, Wang and Jia, DeSalvo and Pak were the first to discover the log-subadditivity, log-concavity, and the third-order Turán inequality of partition function, respectively. Many other important partition statistics are proved to enjoy similar properties. This paper focuses on the partition function $\overline{p}_k(n)$, which counts the number of overpartitions of $n$ with no parts divisible by $k$. We provide a combinatorial proof to establish that for any $k\geq2$, the partition function $\overline{p}_k(n)$ exhibits strict log-subadditivity. Specifically, we show that $\overline{p}_k(a)\overline{p}_k(b)>\overline{p}_k(a+b)$ for integers $a\geq b\geq1$ and $a+b\geq k$. Furthermore, we investigate the log-concavity and the satisfaction of the third-order Turán inequality for $\overline{p}_k(n)$, where $2\leq k\leq9$.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Why Shallow Networks Struggle with Approximating and Learning High Frequency: A Numerical Study
Authors:
Shijun Zhang,
Hongkai Zhao,
Yimin Zhong,
Haomin Zhou
Abstract:
In this work, a comprehensive numerical study involving analysis and experiments shows why a two-layer neural network has difficulties handling high frequencies in approximation and learning when machine precision and computation cost are important factors in real practice. In particular, the following basic computational issues are investigated: (1) the minimal numerical error one can achieve giv…
▽ More
In this work, a comprehensive numerical study involving analysis and experiments shows why a two-layer neural network has difficulties handling high frequencies in approximation and learning when machine precision and computation cost are important factors in real practice. In particular, the following basic computational issues are investigated: (1) the minimal numerical error one can achieve given a finite machine precision, (2) the computation cost to achieve a given accuracy, and (3) stability with respect to perturbations. The key to the study is the conditioning of the representation and its learning dynamics. Explicit answers to the above questions with numerical verifications are presented.
△ Less
Submitted 21 November, 2023; v1 submitted 29 June, 2023;
originally announced June 2023.
-
Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage
Authors:
Yu Gui,
Cong Ma,
Yiqiao Zhong
Abstract:
We investigate the role of projection heads, also known as projectors, within the encoder-projector framework (e.g., SimCLR) used in contrastive learning. We aim to demystify the observed phenomenon where representations learned before projectors outperform those learned after -- measured using the downstream linear classification accuracy, even when the projectors themselves are linear.
In this…
▽ More
We investigate the role of projection heads, also known as projectors, within the encoder-projector framework (e.g., SimCLR) used in contrastive learning. We aim to demystify the observed phenomenon where representations learned before projectors outperform those learned after -- measured using the downstream linear classification accuracy, even when the projectors themselves are linear.
In this paper, we make two significant contributions towards this aim. Firstly, through empirical and theoretical analysis, we identify two crucial effects -- expansion and shrinkage -- induced by the contrastive loss on the projectors. In essence, contrastive loss either expands or shrinks the signal direction in the representations learned by an encoder, depending on factors such as the augmentation strength, the temperature used in contrastive loss, etc. Secondly, drawing inspiration from the expansion and shrinkage phenomenon, we propose a family of linear transformations to accurately model the projector's behavior. This enables us to precisely characterize the downstream linear classification accuracy in the high-dimensional asymptotic limit. Our findings reveal that linear projectors operating in the shrinkage (or expansion) regime hinder (or improve) the downstream classification accuracy. This provides the first theoretical explanation as to why (linear) projectors impact the downstream performance of learned representations. Our theoretical findings are further corroborated by extensive experiments on both synthetic data and real image data.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Online Bootstrap Inference with Nonconvex Stochastic Gradient Descent Estimator
Authors:
Yanjie Zhong,
Todd Kuffner,
Soumendra Lahiri
Abstract:
In this paper, we investigate the theoretical properties of stochastic gradient descent (SGD) for statistical inference in the context of nonconvex optimization problems, which have been relatively unexplored compared to convex settings. Our study is the first to establish provable inferential procedures using the SGD estimator for general nonconvex objective functions, which may contain multiple…
▽ More
In this paper, we investigate the theoretical properties of stochastic gradient descent (SGD) for statistical inference in the context of nonconvex optimization problems, which have been relatively unexplored compared to convex settings. Our study is the first to establish provable inferential procedures using the SGD estimator for general nonconvex objective functions, which may contain multiple local minima.
We propose two novel online inferential procedures that combine SGD and the multiplier bootstrap technique. The first procedure employs a consistent covariance matrix estimator, and we establish its error convergence rate. The second procedure approximates the limit distribution using bootstrap SGD estimators, yielding asymptotically valid bootstrap confidence intervals. We validate the effectiveness of both approaches through numerical experiments.
Furthermore, our analysis yields an intermediate result: the in-expectation error convergence rate for the original SGD estimator in nonconvex settings, which is comparable to existing results for convex problems. We believe this novel finding holds independent interest and enriches the literature on optimization and statistical inference.
△ Less
Submitted 3 June, 2023;
originally announced June 2023.
-
Improving Gradient Computation for Differentiable Physics Simulation with Contacts
Authors:
Yaofeng Desmond Zhong,
Jiequn Han,
Biswadip Dey,
Georgia Olympia Brikis
Abstract:
Differentiable simulation enables gradients to be back-propagated through physics simulations. In this way, one can learn the dynamics and properties of a physics system by gradient-based optimization or embed the whole differentiable simulation as a layer in a deep learning model for downstream tasks, such as planning and control. However, differentiable simulation at its current stage is not per…
▽ More
Differentiable simulation enables gradients to be back-propagated through physics simulations. In this way, one can learn the dynamics and properties of a physics system by gradient-based optimization or embed the whole differentiable simulation as a layer in a deep learning model for downstream tasks, such as planning and control. However, differentiable simulation at its current stage is not perfect and might provide wrong gradients that deteriorate its performance in learning tasks. In this paper, we study differentiable rigid-body simulation with contacts. We find that existing differentiable simulation methods provide inaccurate gradients when the contact normal direction is not fixed - a general situation when the contacts are between two moving objects. We propose to improve gradient computation by continuous collision detection and leverage the time-of-impact (TOI) to calculate the post-collision velocities. We demonstrate our proposed method, referred to as TOI-Velocity, on two optimal control problems. We show that with TOI-Velocity, we are able to learn an optimal control sequence that matches the analytical solution, while without TOI-Velocity, existing differentiable simulation methods fail to do so.
△ Less
Submitted 28 April, 2023;
originally announced May 2023.
-
Some Asymptotic Properties of the Erlang-C Formula in Many-Server Limiting Regimes
Authors:
Ragavendran Gopalakrishnan,
Yueyang Zhong
Abstract:
This paper presents asymptotic properties of the Erlang-C formula in a spectrum of many-server limiting regimes. Specifically, we address an important gap in the literature regarding its limiting value in critically loaded regimes by studying extensions of the well-known square-root safety staffing rule used in the Quality-and-Efficiency-Driven (QED) regime.
This paper presents asymptotic properties of the Erlang-C formula in a spectrum of many-server limiting regimes. Specifically, we address an important gap in the literature regarding its limiting value in critically loaded regimes by studying extensions of the well-known square-root safety staffing rule used in the Quality-and-Efficiency-Driven (QED) regime.
△ Less
Submitted 10 May, 2024; v1 submitted 26 April, 2023;
originally announced April 2023.
-
Explicit Directional Affine Extractors and Improved Hardness for Linear Branching Programs
Authors:
Xin Li,
Yan Zhong
Abstract:
In a recent work, Gryaznov, Pudlák, and Talebanfard (CCC' 22) introduced a stronger version of affine extractors known as directional affine extractors, together with a generalization of $\mathsf{ROBP}$s where each node can make linear queries, and showed that the former implies strong lower bound for a certain type of the latter known as strongly read-once linear branching programs (…
▽ More
In a recent work, Gryaznov, Pudlák, and Talebanfard (CCC' 22) introduced a stronger version of affine extractors known as directional affine extractors, together with a generalization of $\mathsf{ROBP}$s where each node can make linear queries, and showed that the former implies strong lower bound for a certain type of the latter known as strongly read-once linear branching programs ($\mathsf{SROLBP}$s). Their main result gives explicit constructions of directional affine extractors for entropy $k > 2n/3$, which implies average-case complexity $2^{n/3-o(n)}$ against $\mathsf{SROLBP}$s with exponentially small correlation. A follow-up work by Chattopadhyay and Liao (ECCC' 22) improves the hardness to $2^{n-o(n)}$ at the price of increasing the correlation to polynomially large.
In this paper we show:
An explicit construction of directional affine extractors with $k=o(n)$ and exponentially small error, which gives average-case complexity $2^{n-o(n)}$ against $\mathsf{SROLBP}$s with exponentially small correlation, thus answering the two open questions raised in previous works.
An explicit function in $\mathsf{AC}^0$ that gives average-case complexity $2^{(1-δ)n}$ against $\mathsf{ROBP}$s with negligible correlation, for any constant $δ>0$. Previously, no such average-case hardness is known, and the best size lower bound for any function in $\mathsf{AC}^0$ against $\mathsf{ROBP}$s is $2^{Ω(n)}$.
One of the key ingredients in our constructions is a new linear somewhere condenser for affine sources, which is based on dimension expanders. The condenser also leads to an unconditional improvement of the entropy requirement of explicit affine extractors with negligible error. We further show that the condenser also works for general weak random sources, under the Polynomial Freiman-Ruzsa Theorem in $\mathsf{F}_2^n$.
△ Less
Submitted 3 July, 2024; v1 submitted 22 April, 2023;
originally announced April 2023.
-
Asymptotics for $k$-crank of $k$-colored partitions
Authors:
Helen W. J. Zhang,
Ying Zhong
Abstract:
In this paper, we obtain asymptotic formulas for $k$-crank of $k$-colored partitions. Let $M_k(a, c; n)$ denote the number of $k$-colored partitions of $n$ with a $k$-crank congruent to $a$ mod $c$. For the cases $k=2,3,4$, Fu and Tang derived several inequality relations for $M_k(a, c; n)$ using generating functions. We employ the Hardy-Ramanujan Circle Method to extend the results of Fu and Tang…
▽ More
In this paper, we obtain asymptotic formulas for $k$-crank of $k$-colored partitions. Let $M_k(a, c; n)$ denote the number of $k$-colored partitions of $n$ with a $k$-crank congruent to $a$ mod $c$. For the cases $k=2,3,4$, Fu and Tang derived several inequality relations for $M_k(a, c; n)$ using generating functions. We employ the Hardy-Ramanujan Circle Method to extend the results of Fu and Tang. Furthermore, additional inequality relations for $M_k(a, c; n)$ have been established, such as logarithmic concavity and logarithmic subadditivity.
△ Less
Submitted 13 April, 2023;
originally announced April 2023.
-
On the generalized Hausdorff dimension of Besicovitch sets
Authors:
Xianghong Chen,
Lixin Yan,
Yue Zhong
Abstract:
Keich (1999) showed that the sharp gauge function for the generalized Hausdorff dimension of Besicovitch sets in $\mathbb R^2$ is between $r^2\log 1/r$ and $r^2(\log 1/r) (\log\log 1/r)^{2+\varepsilon}$ by refining an argument of Bourgain (1991). It is not known whether the iterated logarithms in Keich's bound are necessary. In this paper we construct a family of Besicovitch line sets whose sharp…
▽ More
Keich (1999) showed that the sharp gauge function for the generalized Hausdorff dimension of Besicovitch sets in $\mathbb R^2$ is between $r^2\log 1/r$ and $r^2(\log 1/r) (\log\log 1/r)^{2+\varepsilon}$ by refining an argument of Bourgain (1991). It is not known whether the iterated logarithms in Keich's bound are necessary. In this paper we construct a family of Besicovitch line sets whose sharp gauge function is smaller than $r^2(\log 1/r) (\log\log 1/r)^{\varepsilon}$. Moreover, these Besicovitch sets are minimal in the sense that there is essentially only one line in the set pointing in each direction.
△ Less
Submitted 7 April, 2024; v1 submitted 7 April, 2023;
originally announced April 2023.
-
Transport models for wave propagation in scattering media with nonlinear absorption
Authors:
Joseph Kraisler,
Wei Li,
Kui Ren,
John C. Schotland,
Yimin Zhong
Abstract:
This work considers the propagation of high-frequency waves in highly-scattering media where physical absorption of a nonlinear nature occurs. Using the classical tools of the Wigner transform and multiscale analysis, we derive semilinear radiative transport models for the phase-space intensity and the diffusive limits of such transport models. As an application, we consider an inverse problem for…
▽ More
This work considers the propagation of high-frequency waves in highly-scattering media where physical absorption of a nonlinear nature occurs. Using the classical tools of the Wigner transform and multiscale analysis, we derive semilinear radiative transport models for the phase-space intensity and the diffusive limits of such transport models. As an application, we consider an inverse problem for the semilinear transport equation, where we reconstruct the absorption coefficients of the equation from a functional of its solution. We obtain a uniqueness result on the inverse problem.
△ Less
Submitted 30 October, 2022;
originally announced October 2022.
-
Corrected Trapezoidal Rule-IBIM for linearized Poisson-Boltzmann equation
Authors:
Federico Izzo,
Yimin Zhong,
Olof Runborg,
Richard Tsai
Abstract:
In this paper, we solve the linearized Poisson-Boltzmann equation, used to model the electric potential of macromolecules in a solvent. We derive a corrected trapezoidal rule with improved accuracy for a boundary integral formulation of the linearized Poisson-Boltzmann equation. More specifically, in contrast to the typical boundary integral formulations, the corrected trapezoidal rule is applied…
▽ More
In this paper, we solve the linearized Poisson-Boltzmann equation, used to model the electric potential of macromolecules in a solvent. We derive a corrected trapezoidal rule with improved accuracy for a boundary integral formulation of the linearized Poisson-Boltzmann equation. More specifically, in contrast to the typical boundary integral formulations, the corrected trapezoidal rule is applied to integrate a system of compacted supported singular integrals using uniform Cartesian grids in $\mathbb{R}^3$, without explicit surface parameterization. A Krylov method, accelerated by a fast multipole method, is used to invert the resulting linear system. We study the efficacy of the proposed method, and compare it to an existing, lower order method. We then apply the method to the computation of electrostatic potential of macromolecules immersed in solvent. The solvent excluded surfaces, defined by a common approach, are merely piecewise smooth, and we study the effectiveness of the method for such surfaces.
△ Less
Submitted 7 October, 2022;
originally announced October 2022.
-
The constant term algebra of type $A$: the Structure
Authors:
Guoce Xin,
Chen Zhang,
Yue Zhou,
Yueming Zhong
Abstract:
In this paper, we discover a new noncommutative algebra. We refer this algebra as the constant term algebra of type $A$, which is generated by certain constant term operators. We characterize a structural result of this algebra by establishing an explicit basis in terms of certain forests. This algebra arises when we apply the method of the iterated Laurent series to investigate Beck and Pixton's…
▽ More
In this paper, we discover a new noncommutative algebra. We refer this algebra as the constant term algebra of type $A$, which is generated by certain constant term operators. We characterize a structural result of this algebra by establishing an explicit basis in terms of certain forests. This algebra arises when we apply the method of the iterated Laurent series to investigate Beck and Pixton's residue computation for the Ehrhart series of the Birkhoff polytope. This algebra seems to be the first structural result in the area of the constant term world since the discovery of the Dyson constant term identity in 1962.
△ Less
Submitted 3 May, 2023; v1 submitted 30 September, 2022;
originally announced September 2022.
-
Strict Log-Subadditivity for Overpartition Rank
Authors:
Helen W. J. Zhang,
Ying Zhong
Abstract:
Bessenrodt and Ono initially found the strict log-subadditivity of partition function $p(n)$, that is, $p(a+b)< p(a)p(b)$ for $a,b>1$ and $a+b>9$. Many other important statistics of partitions are proved to enjoy similar properties. Lovejoy introduced the overpartition rank as an analog of Dyson's rank for partitions from the $q$-series perspective. Let $\overline{N}(a,c,n)$ denote the number of o…
▽ More
Bessenrodt and Ono initially found the strict log-subadditivity of partition function $p(n)$, that is, $p(a+b)< p(a)p(b)$ for $a,b>1$ and $a+b>9$. Many other important statistics of partitions are proved to enjoy similar properties. Lovejoy introduced the overpartition rank as an analog of Dyson's rank for partitions from the $q$-series perspective. Let $\overline{N}(a,c,n)$ denote the number of overpartitions with rank congruent to $a$ modulo $c$. Ciolan computed the asymptotic formula of $\overline{N}(a,c,n)$ and showed that $\overline{N}(a, c, n) > \overline{N}(b, c, n)$ for $c\geq7$ and $n$ large enough. In this paper, we derive an upper bound and a lower bound of $\overline{N}(a,c,n)$ for each $c\geq3$ by using the asymptotics of Ciolan. Consequently, we establish the strict log-subadditivity of $\overline{N}(a,c,n)$ analogous to the partition function $p(n)$.
△ Less
Submitted 26 June, 2022;
originally announced June 2022.
-
Normalized ground state solutions for the fractional Sobolev critical NLSE with an extra mass supercritical nonlinearity
Authors:
Jiabin Zuo,
Yuyou Zhong,
Dušan D. Repovš
Abstract:
This paper is concerned with existence of normalized ground state solutions for the mass supercritical fractional nonlinear Schrödinger equation involving a critical growth in the fractional Sobolev sense. The compactness of Palais-Smale sequences is obtained by a special technique, which borrows from the ideas of Soave (J. Funct. Anal. 279 (6) (2020) 1086102020). This paper represents an extensio…
▽ More
This paper is concerned with existence of normalized ground state solutions for the mass supercritical fractional nonlinear Schrödinger equation involving a critical growth in the fractional Sobolev sense. The compactness of Palais-Smale sequences is obtained by a special technique, which borrows from the ideas of Soave (J. Funct. Anal. 279 (6) (2020) 1086102020). This paper represents an extension of previously known results - in the local and the nonlocal cases.
△ Less
Submitted 28 July, 2023; v1 submitted 25 June, 2022;
originally announced June 2022.
-
How much can one learn from a single solution of a PDE?
Authors:
Hongkai Zhao,
Yimin Zhong
Abstract:
Linear evolution PDE $\partial_t u(x,t) = -\mathcal{L} u$, where $\mathcal{L}$ is a strongly elliptic operator independent of time, is studied as an example to show if one can superpose snapshots of a single (or a finite number of) solution(s) to construct an arbitrary solution. Our study shows that it depends on the growth rate of the eigenvalues, $μ_n$, of $\mathcal{L}$ in terms of $n$. When the…
▽ More
Linear evolution PDE $\partial_t u(x,t) = -\mathcal{L} u$, where $\mathcal{L}$ is a strongly elliptic operator independent of time, is studied as an example to show if one can superpose snapshots of a single (or a finite number of) solution(s) to construct an arbitrary solution. Our study shows that it depends on the growth rate of the eigenvalues, $μ_n$, of $\mathcal{L}$ in terms of $n$. When the statement is true, a simple data-driven approach for model reduction and approximation of an arbitrary solution of a PDE without knowing the underlying PDE is designed. Numerical experiments are presented to corroborate our analysis.
△ Less
Submitted 10 June, 2022;
originally announced June 2022.
-
Asymptotic formula for the $M_2$-ranks of overpartitions
Authors:
Helen W. J. Zhang,
Ying Zhong
Abstract:
Let $\overline{N}_2(a,c,n)$ be the number of overpartitions of $n$ whose the $M_2$-rank is congruent to $a$ modulo $c$. In this paper, we obtain the asymptotic formula of $\overline{N}_2(a,c,n)$ utilizing the Ingham Tauberian Theorem. As applications, we derive inequalities concerning with $\overline{N}_2(a,c,n)$ including its strict concavity and log-concavity.
Let $\overline{N}_2(a,c,n)$ be the number of overpartitions of $n$ whose the $M_2$-rank is congruent to $a$ modulo $c$. In this paper, we obtain the asymptotic formula of $\overline{N}_2(a,c,n)$ utilizing the Ingham Tauberian Theorem. As applications, we derive inequalities concerning with $\overline{N}_2(a,c,n)$ including its strict concavity and log-concavity.
△ Less
Submitted 5 June, 2022;
originally announced June 2022.
-
Multi-block Min-max Bilevel Optimization with Applications in Multi-task Deep AUC Maximization
Authors:
Quanqi Hu,
Yongjian Zhong,
Tianbao Yang
Abstract:
In this paper, we study multi-block min-max bilevel optimization problems, where the upper level is non-convex strongly-concave minimax objective and the lower level is a strongly convex objective, and there are multiple blocks of dual variables and lower level problems. Due to the intertwined multi-block min-max bilevel structure, the computational cost at each iteration could be prohibitively hi…
▽ More
In this paper, we study multi-block min-max bilevel optimization problems, where the upper level is non-convex strongly-concave minimax objective and the lower level is a strongly convex objective, and there are multiple blocks of dual variables and lower level problems. Due to the intertwined multi-block min-max bilevel structure, the computational cost at each iteration could be prohibitively high, especially with a large number of blocks. To tackle this challenge, we present a single-loop randomized stochastic algorithm, which requires updates for only a constant number of blocks at each iteration. Under some mild assumptions on the problem, we establish its sample complexity of $O(1/ε^4)$ for finding an $ε$-stationary point. This matches the optimal complexity for solving stochastic nonconvex optimization under a general unbiased stochastic oracle model. Moreover, we provide two applications of the proposed method in multi-task deep AUC (area under ROC curve) maximization and multi-task deep partial AUC maximization. Experimental results validate our theory and demonstrate the effectiveness of our method on problems with hundreds of tasks.
△ Less
Submitted 17 November, 2022; v1 submitted 1 June, 2022;
originally announced June 2022.
-
Higher order log-concavity of the overpartition function and its consequences
Authors:
Gargi Mukherjee,
Helen W. J. Zhang,
Ying Zhong
Abstract:
Let $\bar{p}(n)$ denote the overpartition function. In this paper, we study the asymptotic higher order $\log$-concavity property of the overpatition function in a similar framework done by Hou and Zhang for the partition function. This will enable us to move on further in order to prove $\log$-concavity of overpartitions, explicitly by studying the asymptotic expansion of the quotient…
▽ More
Let $\bar{p}(n)$ denote the overpartition function. In this paper, we study the asymptotic higher order $\log$-concavity property of the overpatition function in a similar framework done by Hou and Zhang for the partition function. This will enable us to move on further in order to prove $\log$-concavity of overpartitions, explicitly by studying the asymptotic expansion of the quotient $\bar{p}(n-1)\bar{p}(n+1)/\bar{p}(n)^2$ upto a certain order so that one can finally ends up with the phenomena of $2$-$\log$-concavity and higher order Turán property of $\bar{p}(n)$ by following a sort of unified approach.
△ Less
Submitted 17 April, 2022;
originally announced April 2022.
-
How much can one learn a partial differential equation from its solution?
Authors:
Yuchen He,
Hongkai Zhao,
Yimin Zhong
Abstract:
In this work we study the problem about learning a partial differential equation (PDE) from its solution data. PDEs of various types are used as examples to illustrate how much the solution data can reveal the PDE operator depending on the underlying operator and initial data. A data driven and data adaptive approach based on local regression and global consistency is proposed for stable PDE ident…
▽ More
In this work we study the problem about learning a partial differential equation (PDE) from its solution data. PDEs of various types are used as examples to illustrate how much the solution data can reveal the PDE operator depending on the underlying operator and initial data. A data driven and data adaptive approach based on local regression and global consistency is proposed for stable PDE identification. Numerical experiments are provided to verify our analysis and demonstrate the performance of the proposed algorithms.
△ Less
Submitted 9 November, 2022; v1 submitted 10 April, 2022;
originally announced April 2022.
-
On Sombor index of graphs with a given number of cut-vertices
Authors:
Sakander Hayat,
Ansar Rehman,
Yubin Zhong
Abstract:
Introduced by Gutman in 2021, the Sombor index is a novel graph-theoretic topological descriptor possessing potential applications in the modeling of thermodynamic properties of compounds. Let G^k_n be the set of all n-vertex connected graphs with k cut-vertices. In this paper, we present minimum Sombor indices of graphs in G^k_n. The corresponding extremal graphs have been characterized as well.
Introduced by Gutman in 2021, the Sombor index is a novel graph-theoretic topological descriptor possessing potential applications in the modeling of thermodynamic properties of compounds. Let G^k_n be the set of all n-vertex connected graphs with k cut-vertices. In this paper, we present minimum Sombor indices of graphs in G^k_n. The corresponding extremal graphs have been characterized as well.
△ Less
Submitted 30 June, 2022; v1 submitted 16 March, 2022;
originally announced March 2022.
-
Large-scale Stochastic Optimization of NDCG Surrogates for Deep Learning with Provable Convergence
Authors:
Zi-Hao Qiu,
Quanqi Hu,
Yongjian Zhong,
Lijun Zhang,
Tianbao Yang
Abstract:
NDCG, namely Normalized Discounted Cumulative Gain, is a widely used ranking metric in information retrieval and machine learning. However, efficient and provable stochastic methods for maximizing NDCG are still lacking, especially for deep models. In this paper, we propose a principled approach to optimize NDCG and its top-$K$ variant. First, we formulate a novel compositional optimization proble…
▽ More
NDCG, namely Normalized Discounted Cumulative Gain, is a widely used ranking metric in information retrieval and machine learning. However, efficient and provable stochastic methods for maximizing NDCG are still lacking, especially for deep models. In this paper, we propose a principled approach to optimize NDCG and its top-$K$ variant. First, we formulate a novel compositional optimization problem for optimizing the NDCG surrogate, and a novel bilevel compositional optimization problem for optimizing the top-$K$ NDCG surrogate. Then, we develop efficient stochastic algorithms with provable convergence guarantees for the non-convex objectives. Different from existing NDCG optimization methods, the per-iteration complexity of our algorithms scales with the mini-batch size instead of the number of total items. To improve the effectiveness for deep learning, we further propose practical strategies by using initial warm-up and stop gradient operator. Experimental results on multiple datasets demonstrate that our methods outperform prior ranking approaches in terms of NDCG. To the best of our knowledge, this is the first time that stochastic algorithms are proposed to optimize NDCG with a provable convergence guarantee. Our proposed methods are implemented in the LibAUC library at https://libauc.org/.
△ Less
Submitted 2 February, 2023; v1 submitted 24 February, 2022;
originally announced February 2022.
-
Inverse Source Problem for Acoustically-Modulated Electromagnetic Waves
Authors:
Wei Li,
John C. Schotland,
Yang Yang,
Yimin Zhong
Abstract:
We propose a method to reconstruct the electrical current density from acoustically-modulated boundary measurements of time-harmonic electromagnetic fields. We show that the current can be uniquely reconstructed with Lipschitz stability. We also report numerical simulations to illustrate the analytical results.
We propose a method to reconstruct the electrical current density from acoustically-modulated boundary measurements of time-harmonic electromagnetic fields. We show that the current can be uniquely reconstructed with Lipschitz stability. We also report numerical simulations to illustrate the analytical results.
△ Less
Submitted 23 February, 2022;
originally announced February 2022.
-
Proving some conjectures on Kekulé numbers for certain benzenoids by using Chebyshev polynomials
Authors:
Guoce Xin,
Yueming Zhong
Abstract:
In chemistry, Cyvin-Gutman enumerates Kekulé numbers for certain benzenoids and record it as $A050446$ on OEIS. This number is exactly the two variable array $T(n,m)$ defined by the recursion $T(n, m) = T(n, m-1) + \sum^{\lfloor\frac{n-1}{2}\rfloor}_{k=0} T(2k, m-1)T(n-1-2k, m)$, where $T(n,0)=T(0,m)=1$ for all nonnegative integers $m,n$. Interestingly, this number also appeared in the context of…
▽ More
In chemistry, Cyvin-Gutman enumerates Kekulé numbers for certain benzenoids and record it as $A050446$ on OEIS. This number is exactly the two variable array $T(n,m)$ defined by the recursion $T(n, m) = T(n, m-1) + \sum^{\lfloor\frac{n-1}{2}\rfloor}_{k=0} T(2k, m-1)T(n-1-2k, m)$, where $T(n,0)=T(0,m)=1$ for all nonnegative integers $m,n$. Interestingly, this number also appeared in the context of weighted graphs, graph polytopes, magic labellings, and unit primitive matrices, studied by different authors. Several interesting conjectures were made on the OEIS. These conjectures are related to both the row and column generating function of $T(n,m)$. In this paper, give explicit formula of the column generating function, which is also the generating function $F(n,x)$ studied by Bóna, Ju, and Yoshida. We also get trig function representations by using Chebyshev polynomials of the second kind. This allows us to prove all these conjectures.
△ Less
Submitted 7 January, 2022;
originally announced January 2022.
-
Simplex Initialization: A Survey of Techniques and Trends
Authors:
Mengyu Huang,
Yuxing Zhong,
Huiwen Yang,
Jiazheng Wang,
Fan Zhang,
Bo Bai,
Ling Shi
Abstract:
The simplex method is one of the most fundamental technologies for solving linear programming (LP) problems and has been widely applied to different practical applications. In the past literature, how to improve and accelerate the simplex method has attracted plenty of research. One important way to achieve this goal is to find a better initialization method for the simplex. In this survey, we aim…
▽ More
The simplex method is one of the most fundamental technologies for solving linear programming (LP) problems and has been widely applied to different practical applications. In the past literature, how to improve and accelerate the simplex method has attracted plenty of research. One important way to achieve this goal is to find a better initialization method for the simplex. In this survey, we aim to provide an overview about the initialization methods in the primal and dual simplex, respectively. We also propose several potential future directions about how to improve the existing initialization methods with the help of advanced learning technologies.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.
-
Tractability from overparametrization: The example of the negative perceptron
Authors:
Andrea Montanari,
Yiqiao Zhong,
Kangjie Zhou
Abstract:
In the negative perceptron problem we are given $n$ data points $({\boldsymbol x}_i,y_i)$, where ${\boldsymbol x}_i$ is a $d$-dimensional vector and $y_i\in\{+1,-1\}$ is a binary label. The data are not linearly separable and hence we content ourselves to find a linear classifier with the largest possible \emph{negative} margin. In other words, we want to find a unit norm vector ${\boldsymbol θ}$…
▽ More
In the negative perceptron problem we are given $n$ data points $({\boldsymbol x}_i,y_i)$, where ${\boldsymbol x}_i$ is a $d$-dimensional vector and $y_i\in\{+1,-1\}$ is a binary label. The data are not linearly separable and hence we content ourselves to find a linear classifier with the largest possible \emph{negative} margin. In other words, we want to find a unit norm vector ${\boldsymbol θ}$ that maximizes $\min_{i\le n}y_i\langle {\boldsymbol θ},{\boldsymbol x}_i\rangle$. This is a non-convex optimization problem (it is equivalent to finding a maximum norm vector in a polytope), and we study its typical properties under two random models for the data.
We consider the proportional asymptotics in which $n,d\to \infty$ with $n/d\toδ$, and prove upper and lower bounds on the maximum margin $κ_{\text{s}}(δ)$ or -- equivalently -- on its inverse function $δ_{\text{s}}(κ)$. In other words, $δ_{\text{s}}(κ)$ is the overparametrization threshold: for $n/d\le δ_{\text{s}}(κ)-\varepsilon$ a classifier achieving vanishing training error exists with high probability, while for $n/d\ge δ_{\text{s}}(κ)+\varepsilon$ it does not. Our bounds on $δ_{\text{s}}(κ)$ match to the leading order as $κ\to -\infty$. We then analyze a linear programming algorithm to find a solution, and characterize the corresponding threshold $δ_{\text{lin}}(κ)$. We observe a gap between the interpolation threshold $δ_{\text{s}}(κ)$ and the linear programming threshold $δ_{\text{lin}}(κ)$, raising the question of the behavior of other algorithms.
△ Less
Submitted 3 July, 2023; v1 submitted 27 October, 2021;
originally announced October 2021.
-
The Complex Ball-quotient Structure of the Moduli Space of Certain Sextic Curves
Authors:
Zhiwei Zheng,
Yiming Zhong
Abstract:
We study moduli spaces of certain sextic curves with a singularity of multiplicity 3 from both perspectives of Deligne-Mostow theory and periods of K3 surfaces. In both ways we can describe the moduli spaces via arithmetic quotients of complex hyperbolic balls. We show in Theorem 7.4 that the two ball-quotient constructions can be unified in a geometric way.
We study moduli spaces of certain sextic curves with a singularity of multiplicity 3 from both perspectives of Deligne-Mostow theory and periods of K3 surfaces. In both ways we can describe the moduli spaces via arithmetic quotients of complex hyperbolic balls. We show in Theorem 7.4 that the two ball-quotient constructions can be unified in a geometric way.
△ Less
Submitted 20 October, 2021;
originally announced October 2021.
-
A symmetric chain decomposition of $N(m,n)$ of composition
Authors:
Yueming Zhong
Abstract:
A poset is called a symmetric chain decomposition if the poset can be expressed as a disjoint union of symmetric chains. For positive integers $m$ and $n$, let $N(m,n)$ denote the set of all compositions $α=(α_1,\cdots,α_m)$, with $0\le α_i \le n$ for each $i=1,\cdots,m$. Define order $<$ as follow, $\forall α,β\in N(m,n)$, $β< α$ if and only if $β_i \le α_i(i=1,\cdots,m)$ and…
▽ More
A poset is called a symmetric chain decomposition if the poset can be expressed as a disjoint union of symmetric chains. For positive integers $m$ and $n$, let $N(m,n)$ denote the set of all compositions $α=(α_1,\cdots,α_m)$, with $0\le α_i \le n$ for each $i=1,\cdots,m$. Define order $<$ as follow, $\forall α,β\in N(m,n)$, $β< α$ if and only if $β_i \le α_i(i=1,\cdots,m)$ and $\sum\limits_{i=1}^{m}β_i <\sum\limits_{i=1}^{m}α_i$. In this paper, we show that the poset $(N(m,n),<)$ can be expressed as a disjoint of symmetric chains by constructive method.
△ Less
Submitted 24 July, 2021;
originally announced July 2021.
-
Multiplayer Homicidal Chauffeur Reach-Avoid Games via Guaranteed Winning Strategies
Authors:
Rui Yan,
Ruiliang Deng,
Haowen Lai,
Weixian Zhang,
Zongying Shi,
Yisheng Zhong
Abstract:
This paper studies a planar multiplayer Homicidal Chauffeur reach-avoid differential game, where each pursuer is a Dubins car and each evader has simple motion. The pursuers aim to protect a goal region cooperatively from the evaders. Due to the high-dimensional strategy space among pursuers, we decompose the whole game into multiple one-pursuer-one-evader subgames, each of which is solved in an a…
▽ More
This paper studies a planar multiplayer Homicidal Chauffeur reach-avoid differential game, where each pursuer is a Dubins car and each evader has simple motion. The pursuers aim to protect a goal region cooperatively from the evaders. Due to the high-dimensional strategy space among pursuers, we decompose the whole game into multiple one-pursuer-one-evader subgames, each of which is solved in an analytical approach instead of solving Hamilton-Jacobi-Isaacs equations. For each subgame, an evasion region (ER) is introduced, based on which a pursuit strategy guaranteeing the winning of a simple-motion pursuer under specific conditions is proposed. Motivated by the simple-motion pursuer, a strategy for a Dubins-car pursuer is proposed when the pursuer-evader configuration satisfies separation condition (SC) and interception orientation (IO). The necessary and sufficient condition on capture radius, minimum turning radius and speed ratio to guarantee the pursuit winning is derived. When the IO is not satisfied (Non-IO), a heading adjustment pursuit strategy is proposed, and the condition to achieve IO within a finite time, is given. Then, a two-step pursuit strategy is proposed for the SC and Non-IO case. A non-convex optimization problem is introduced to give a condition guaranteeing the winning of the pursuer. A polynomial equation gives a lower bound of the non-convex problem, providing a sufficient and efficient pursuit winning condition. Finally, these pairwise outcomes are collected for the pursuer-evader matching. Simulations are provided to illustrate the theoretical results.
△ Less
Submitted 9 July, 2021;
originally announced July 2021.
-
On Magic Distinct Labellings of Simple Graphs
Authors:
Guoce Xin,
Xinyu Xu,
Chen Zhang,
Yueming Zhong
Abstract:
A magic labelling of a graph $G$ with magic sum $s$ is a labelling of the edges of $G$ by nonnegative integers such that for each vertex $v\in V$, the sum of labels of all edges incident to $v$ is equal to the same number $s$. Stanley gave remarkable results on magic labellings, but the distinct labelling case is much more complicated. We consider the complete construction of all magic labellings…
▽ More
A magic labelling of a graph $G$ with magic sum $s$ is a labelling of the edges of $G$ by nonnegative integers such that for each vertex $v\in V$, the sum of labels of all edges incident to $v$ is equal to the same number $s$. Stanley gave remarkable results on magic labellings, but the distinct labelling case is much more complicated. We consider the complete construction of all magic labellings of a given graph $G$. The idea is illustrated in detail by dealing with three regular graphs. We give combinatorial proofs. The structure result was used to enumerate the corresponding magic distinct labellings.
△ Less
Submitted 7 July, 2021;
originally announced July 2021.
-
Asymptotically Optimal Idling in the GI/GI/N+GI Queue
Authors:
Yueyang Zhong,
Amy R. Ward,
Amber L. Puha
Abstract:
We formulate a control problem for a GI/GI/N+GI queue, whose objective is to trade off the long-run average operational costs (i.e., abandonment costs and holding costs) with server utilization costs. To solve the control problem, we consider an asymptotic regime in which the arrival rate and the number of servers grow large. The solution to an associated fluid control problem motivates that non-i…
▽ More
We formulate a control problem for a GI/GI/N+GI queue, whose objective is to trade off the long-run average operational costs (i.e., abandonment costs and holding costs) with server utilization costs. To solve the control problem, we consider an asymptotic regime in which the arrival rate and the number of servers grow large. The solution to an associated fluid control problem motivates that non-idling service disciplines are not in general optimal, unless some arrivals are turned away. We propose an admission control policy designed to ensure servers have sufficient idle time that we show is asymptotically optimal.
△ Less
Submitted 7 November, 2023; v1 submitted 21 June, 2021;
originally announced June 2021.
-
An explicit order matching for $L(3,n)$ from several approaches and its extension for $L(4,n)$
Authors:
Guoce Xin,
Yueming Zhong
Abstract:
Let $L(m,n)$ denote Young's lattice consisting of all partitions whose Young diagrams are contained in the $m\times n$ rectangle. It is a well-known result that the poset $L(m,n)$ is rank symmetric, rank unimodal, and Sperner. A direct proof of this result by finding an explicit order matching of $L(m,n)$ is an outstanding open problem. In this paper, we present an explicit order matching…
▽ More
Let $L(m,n)$ denote Young's lattice consisting of all partitions whose Young diagrams are contained in the $m\times n$ rectangle. It is a well-known result that the poset $L(m,n)$ is rank symmetric, rank unimodal, and Sperner. A direct proof of this result by finding an explicit order matching of $L(m,n)$ is an outstanding open problem. In this paper, we present an explicit order matching $\varphi$ for $L(3,n)$ by several different approaches, and give chain tableau version of $\varphi$ that is very helpful in finding patterns. It is surprise that the greedy algorithm and a recursive knead process also give the same order matching. Our methods extend for $L(4,n)$.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Inverse Boundary Problem for the Two Photon Absorption Transport Equation
Authors:
Plamen Stefanov,
Yimin Zhong
Abstract:
This work studies the inverse boundary problem for the two photon absorption radiative transport equation. We show that the absorption coefficients and scattering coefficients can be uniquely determined from the \emph{albedo} operator. If scattering is absent, we do not require smallness of the incoming source and the reconstructions of the absorption coefficients are explicit.
This work studies the inverse boundary problem for the two photon absorption radiative transport equation. We show that the absorption coefficients and scattering coefficients can be uniquely determined from the \emph{albedo} operator. If scattering is absent, we do not require smallness of the incoming source and the reconstructions of the absorption coefficients are explicit.
△ Less
Submitted 17 February, 2022; v1 submitted 13 April, 2021;
originally announced April 2021.
-
Acousto-electric Inverse Source Problem
Authors:
Wei Li,
John C. Schotland,
Yang Yang,
Yimin Zhong
Abstract:
We propose a method to reconstruct the electrical current density inside a conducting medium from acoustically-modulated boundary measurements of the electric potential. We show that the current can be uniquely reconstructed with Lipschitz stability. We also perform numerical simulations to illustrate the analytical results, and explore the partial data setting when measurements are taken only on…
▽ More
We propose a method to reconstruct the electrical current density inside a conducting medium from acoustically-modulated boundary measurements of the electric potential. We show that the current can be uniquely reconstructed with Lipschitz stability. We also perform numerical simulations to illustrate the analytical results, and explore the partial data setting when measurements are taken only on part of the boundary.
△ Less
Submitted 20 March, 2021;
originally announced March 2021.
-
Quantitative PAT with simplified $P_N$ approximation
Authors:
Hongkai Zhao,
Yimin Zhong
Abstract:
The photoacoustic tomography (PAT) is a hybrid modality that combines the optics and acoustics to obtain high resolution and high contrast imaging of heterogeneous media. In this work, our objective is to study the inverse problem in the quantitative step of PAT which aims to reconstruct the optical coefficients of the governing radiative transport equation from the ultrasound measurements. In our…
▽ More
The photoacoustic tomography (PAT) is a hybrid modality that combines the optics and acoustics to obtain high resolution and high contrast imaging of heterogeneous media. In this work, our objective is to study the inverse problem in the quantitative step of PAT which aims to reconstruct the optical coefficients of the governing radiative transport equation from the ultrasound measurements. In our analysis, we take the simplified $P_N$ approximation of the radiative transport equation as the physical model and then show the uniqueness and stability for this modified inverse problem. Numerical simulations based on synthetic data are presented to validate our analysis.
△ Less
Submitted 11 March, 2021; v1 submitted 22 December, 2020;
originally announced December 2020.
-
Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data
Authors:
Yaofeng Desmond Zhong,
Biswadip Dey,
Amit Chakraborty
Abstract:
The last few years have witnessed an increased interest in incorporating physics-informed inductive bias in deep learning frameworks. In particular, a growing volume of literature has been exploring ways to enforce energy conservation while using neural networks for learning dynamics from observed time-series data. In this work, we survey ten recently proposed energy-conserving neural network mode…
▽ More
The last few years have witnessed an increased interest in incorporating physics-informed inductive bias in deep learning frameworks. In particular, a growing volume of literature has been exploring ways to enforce energy conservation while using neural networks for learning dynamics from observed time-series data. In this work, we survey ten recently proposed energy-conserving neural network models, including HNN, LNN, DeLaN, SymODEN, CHNN, CLNN and their variants. We provide a compact derivation of the theory behind these models and explain their similarities and differences. Their performance are compared in 4 physical systems. We point out the possibility of leveraging some of these energy-conserving models to design energy-based controllers.
△ Less
Submitted 28 April, 2023; v1 submitted 3 December, 2020;
originally announced December 2020.
-
A Note on the Gaussian Minimum Conjecture
Authors:
Yang-Fan Zhong,
Ting Ma,
Ze-Chun Hu
Abstract:
Let $n\geq 2$ and $(X_i,1\leq i\leq n)$ be a centered Gaussian random vector. The Gaussian minimum conjecture says that $E\left(\min_{1\leq i\leq n}|X_i|\right)\geq E\left(\min_{1\leq i\leq n}|Y_i|\right)$, where $Y_1,\ldots,Y_n$ are independent centered Gaussian random variables with $E(X_i^2)=E(Y_i^2)$ for any $i=1,\ldots,n$. In this note, we will show that this conjecture holds if and only if…
▽ More
Let $n\geq 2$ and $(X_i,1\leq i\leq n)$ be a centered Gaussian random vector. The Gaussian minimum conjecture says that $E\left(\min_{1\leq i\leq n}|X_i|\right)\geq E\left(\min_{1\leq i\leq n}|Y_i|\right)$, where $Y_1,\ldots,Y_n$ are independent centered Gaussian random variables with $E(X_i^2)=E(Y_i^2)$ for any $i=1,\ldots,n$. In this note, we will show that this conjecture holds if and only if $n=2$.
△ Less
Submitted 14 August, 2020;
originally announced August 2020.
-
Influence Spread in the Heterogeneous Multiplex Linear Threshold Model
Authors:
Yaofeng Desmond Zhong,
Vaibhav Srivastava,
Naomi Ehrich Leonard
Abstract:
The linear threshold model (LTM) has been used to study spread on single-layer networks defined by one inter-agent sensing modality and agents homogeneous in protocol. We define and analyze the heterogeneous multiplex LTM to study spread on multi-layer networks with each layer representing a different sensing modality and agents heterogeneous in protocol. Protocols are designed to distinguish sign…
▽ More
The linear threshold model (LTM) has been used to study spread on single-layer networks defined by one inter-agent sensing modality and agents homogeneous in protocol. We define and analyze the heterogeneous multiplex LTM to study spread on multi-layer networks with each layer representing a different sensing modality and agents heterogeneous in protocol. Protocols are designed to distinguish signals from different layers: an agent becomes active if a sufficient number of its neighbors in each of any $a$ of the $m$ layers is active. We focus on Protocol OR, when $a=1$, and Protocol AND, when $a=m$, which model agents that are most and least readily activated, respectively. We develop theory and algorithms to compute the size of the spread at steady state for any set of initially active agents and to analyze the role of distinguished sensing modalities, network structure, and heterogeneity. We show how heterogeneity manages the tension in spreading dynamics between sensitivity to inputs and robustness to disturbances.
△ Less
Submitted 10 August, 2020;
originally announced August 2020.
-
The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training
Authors:
Andrea Montanari,
Yiqiao Zhong
Abstract:
Modern neural networks are often operated in a strongly overparametrized regime: they comprise so many parameters that they can interpolate the training set, even if actual labels are replaced by purely random ones. Despite this, they achieve good prediction error on unseen data: interpolating the training set does not lead to a large generalization error. Further, overparametrization appears to b…
▽ More
Modern neural networks are often operated in a strongly overparametrized regime: they comprise so many parameters that they can interpolate the training set, even if actual labels are replaced by purely random ones. Despite this, they achieve good prediction error on unseen data: interpolating the training set does not lead to a large generalization error. Further, overparametrization appears to be beneficial in that it simplifies the optimization landscape. Here we study these phenomena in the context of two-layers neural networks in the neural tangent (NT) regime. We consider a simple data model, with isotropic covariates vectors in $d$ dimensions, and $N$ hidden neurons. We assume that both the sample size $n$ and the dimension $d$ are large, and they are polynomially related. Our first main result is a characterization of the eigenstructure of the empirical NT kernel in the overparametrized regime $Nd\gg n$. This characterization implies as a corollary that the minimum eigenvalue of the empirical NT kernel is bounded away from zero as soon as $Nd\gg n$, and therefore the network can exactly interpolate arbitrary labels in the same regime.
Our second main result is a characterization of the generalization error of NT ridge regression including, as a special case, min-$\ell_2$ norm interpolation. We prove that, as soon as $Nd\gg n$, the test error is well approximated by the one of kernel ridge regression with respect to the infinite-width kernel. The latter is in turn well approximated by the error of polynomial ridge regression, whereby the regularization parameter is increased by a `self-induced' term related to the high-degree components of the activation function. The polynomial degree depends on the sample size and the dimension (in particular on $\log n/\log d$).
△ Less
Submitted 8 June, 2022; v1 submitted 24 July, 2020;
originally announced July 2020.
-
Unique determination of absorption coefficients in a semilinear transport equation
Authors:
Kui Ren,
Yimin Zhong
Abstract:
Motivated by applications in quantitative photoacoustic imaging, we study inverse problems to a semilinear radiative transport equation (RTE) where we intend to reconstruct absorption coefficients in the equation from single and multiple internal data sets. We derive uniqueness and stability results for the inverse transport problem in the absence of scattering (in which case we also derive some e…
▽ More
Motivated by applications in quantitative photoacoustic imaging, we study inverse problems to a semilinear radiative transport equation (RTE) where we intend to reconstruct absorption coefficients in the equation from single and multiple internal data sets. We derive uniqueness and stability results for the inverse transport problem in the absence of scattering (in which case we also derive some explicit reconstruction methods) and in the presence of known scattering.
△ Less
Submitted 18 July, 2020;
originally announced July 2020.
-
Derivations of the Positive Part of the Two-parameter Quantum Group of type $G_2$
Authors:
Yongyue Zhong,
Xiaomin Tang
Abstract:
In this paper, we compute the derivations of the positive part of the two-parameter quantum group of type $G_2$ by embedding it into a quantum torus. We also show that the Hochschild cohomology group of degree $1$ of this algebra is a two dimensional vector space over the complex field.
In this paper, we compute the derivations of the positive part of the two-parameter quantum group of type $G_2$ by embedding it into a quantum torus. We also show that the Hochschild cohomology group of degree $1$ of this algebra is a two dimensional vector space over the complex field.
△ Less
Submitted 12 March, 2020;
originally announced March 2020.
-
A fast algorithm for time-dependent radiative transport equation based on integral formulation
Authors:
Hongkai Zhao,
Yimin Zhong
Abstract:
In this work, we introduce a fast numerical algorithm to solve the time-dependent radiative transport equation (RTE). Our method uses the integral formulation of RTE and applies the treecode algorithm to reduce the computational complexity from O(M^{2+1/d} ) to O(M^{1+1/d} log M ), where M is the number of points in the physical domain. The error analysis is presented and numerical experiments are…
▽ More
In this work, we introduce a fast numerical algorithm to solve the time-dependent radiative transport equation (RTE). Our method uses the integral formulation of RTE and applies the treecode algorithm to reduce the computational complexity from O(M^{2+1/d} ) to O(M^{1+1/d} log M ), where M is the number of points in the physical domain. The error analysis is presented and numerical experiments are performed to validate our algorithm.
△ Less
Submitted 27 January, 2020;
originally announced January 2020.
-
On Parity Unimodality of $q$-Catalan Polynomials
Authors:
Guoce Xin,
Yueming Zhong
Abstract:
A polynomial $A(q)=\sum_{i=0}^n a_iq^i$ is said to be unimodal if $a_0\le a_1\le \cdots \le a_k\ge a_{k+1} \ge \cdots \ge a_n$. We investigate the unimodality of rational $q$-Catalan polynomials, which is defined to be $C_{m,n}(q)= \frac{1}{[n+m]} \left[ m+n \atop n\right]$ for a coprime pair of positive integers $(m,n)$. We conjecture that they are unimodal with respect to parity, or equivalently…
▽ More
A polynomial $A(q)=\sum_{i=0}^n a_iq^i$ is said to be unimodal if $a_0\le a_1\le \cdots \le a_k\ge a_{k+1} \ge \cdots \ge a_n$. We investigate the unimodality of rational $q$-Catalan polynomials, which is defined to be $C_{m,n}(q)= \frac{1}{[n+m]} \left[ m+n \atop n\right]$ for a coprime pair of positive integers $(m,n)$. We conjecture that they are unimodal with respect to parity, or equivalently, $(1+q)C_{m+n}(q)$ is unimodal. By using generating functions and the constant term method, we verify our conjecture for $m\le 5$ in a straightforward way.
△ Less
Submitted 4 December, 2019;
originally announced December 2019.