Search | arXiv e-print repository

Structured and Balanced Multi-component and Multi-layer Neural Networks

Authors: Shijun Zhang, Hongkai Zhao, Yimin Zhong, Haomin Zhou

Abstract: In this work, we propose a balanced multi-component and multi-layer neural network (MMNN) structure to approximate functions with complex features with both accuracy and efficiency in terms of degrees of freedom and computation cost. The main idea is motivated by a multi-component, each of which can be approximated effectively by a single-layer network, and multi-layer decomposition in a "divide-and-conquer" type of strategy to deal with a complex function. While an easy modification to fully connected neural networks (FCNNs) or multi-layer perceptrons (MLPs) through the introduction of balanced multi-component structures in the network, MMNNs achieve a significant reduction of training parameters, a much more efficient training process, and a much improved accuracy compared to FCNNs or MLPs. Extensive numerical experiments are presented to illustrate the effectiveness of MMNNs in approximating highly oscillatory functions and their automatic adaptivity in capturing localized features.

Submitted 30 June, 2024; originally announced July 2024.

Comments: Our codes and implementation details are available at https://github.com/ShijunZhangMath/MMNN

arXiv:2406.13524 [pdf, ps, other]

Koebe uniformization for infinitely connected attracting Fatou domains

Authors: Xiaoguang Wang, Yi Zhong

Abstract: This paper works on the structure of infinitely connected Fatou domains of rational maps in terms of Koebe uniformization. Due to the complicated boundary behavior, the existing uniformization results fail to apply in general. We prove that if the rational map is geometrically finite, then its infinitely connected attracting Fatou domain is conformally homeomorphic to a circle domain.

Submitted 19 June, 2024; originally announced June 2024.

Comments: 13 pages

MSC Class: 30C20(Primary); 30C35(Secondary)

arXiv:2404.17013 [pdf, ps, other]

Two-Source and Affine Non-Malleable Extractors for Small Entropy

Authors: Xin Li, Yan Zhong

Abstract: Non-malleable extractors are generalizations and strengthenings of standard randomness extractors that are resilient to adversarial tampering. Such extractors have wide applications in cryptography and explicit construction of extractors. In the well-studied models of two-source and affine non-malleable extractors, the previous best constructions only work for entropy rate $>2/3$ and $1-γ$ respectively by Li (FOCS' 23). We present explicit constructions of two-source and affine non-malleable extractors that match the state-of-the-art constructions of standard ones for small entropy. Our main results include two-source and affine non-malleable extractors (over $\mathsf{F}_2$) for sources on $n$ bits with min-entropy $k \ge \log^C n$ and polynomially small error, matching the parameters of standard extractors by Chattopadhyay and Zuckerman (STOC' 16, Annals of Mathematics' 19) and Li (FOCS' 16), as well as those with min-entropy $k = O(\log n)$ and constant error, matching the parameters of standard extractors by Li (FOCS' 23). Our constructions significantly improve previous results, and the parameters (entropy requirement and error) are the best possible without first improving the constructions of standard extractors. In addition, our improved affine non-malleable extractors give strong lower bounds for a certain kind of read-once linear branching programs, recently introduced by Gryaznov, Pudlák, and Talebanfard (CCC' 22) as a generalization of several well-studied computational models. These bounds match the previously best-known average-case hardness results given by Chattopadhyay and Liao (CCC' 23) and Li (FOCS' 23), where the branching program size lower bounds are close to optimal, but the explicit functions we use here are different. Our results also suggest a possible deeper connection between non-malleable extractors and standard ones.

Submitted 25 April, 2024; originally announced April 2024.

Comments: To appear in ICALP 24. Abstract shortened due to arXiv requirement

arXiv:2404.05905 [pdf, other]

Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning

Authors: Bo Lin, Yangzheng Zhong, Weiqing Ren

Abstract: Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology. The transition pathway plays an important role in characterizing the mechanism underlying the transition, for example, in the study of conformational changes of bio-molecules. In fact, computing the transition pathway is a challenging task for complex and high-dimensional systems. In this work, we formulate the path-finding task as a cost minimization problem over a particular path space. The cost function is adapted from the Freidlin-Wentzell action functional so that it is able to deal with rough potential landscapes. The path-finding problem is then solved using an actor-critic method based on the deep deterministic policy gradient algorithm (DDPG). The method incorporates the potential force of the system in the policy for generating episodes and combines physical properties of the system with the learning process for molecular systems. The exploitation and exploration nature of reinforcement learning enables the method to efficiently sample the transition events and compute the globally optimal transition pathway. We illustrate the effectiveness of the proposed method using three benchmark systems including an extended Mueller system and the Lennard-Jones system of seven particles.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2403.12824 [pdf, ps, other]

Well-posedness and no-uniform dependence for the Euler-Poincaré equations in Triebel-Lizorkin spaces

Authors: Yuanhua Zhong, Jianzhong Lu, Min Li, Jinlu Li

Abstract: In this paper, we study the Cauchy problem of the Euler-Poincaré equations in $\mathbb{R}^d$ with initial data belonging to the Triebel-Lizorkin spaces. We prove the local-in-time existence and uniqueness of solutions to the Euler-Poincaré equations in $F^s_{p,r}(\mathbb{R}^d)$. Furthermore, we show that the data-to-solution map of this equation is continuous but not uniformly continuous in these spaces.

Submitted 19 March, 2024; originally announced March 2024.

Comments: 17 pages

MSC Class: 35Q35

arXiv:2403.11350 [pdf, other]

Robustness of the data-driven approach in limited angle tomography

Authors: Yiran Wang, Yimin Zhong

Abstract: The limited angle Radon transform is notoriously difficult to invert due to the ill-posedness. In this work, we give a mathematical explanation that the data-driven approach based on deep neural networks can reconstruct more information in a stable way compared to traditional methods.

Submitted 17 March, 2024; originally announced March 2024.

MSC Class: 35R30

arXiv:2401.15890 [pdf, other]

Probabilistic Guarantees of Stochastic Recursive Gradient in Non-Convex Finite Sum Problems

Authors: Yanjie Zhong, Jiaqi Li, Soumendra Lahiri

Abstract: This paper develops a new dimension-free Azuma-Hoeffding type bound on summation norm of a martingale difference sequence with random individual bounds. With this novel result, we provide high-probability bounds for the gradient norm estimator in the proposed algorithm Prob-SARAH, which is a modified version of the StochAstic Recursive grAdient algoritHm (SARAH), a state-of-the-art variance reduced algorithm that achieves optimal computational complexity in expectation for the finite sum problem. The in-probability complexity by Prob-SARAH matches the best in-expectation result up to logarithmic factors. Empirical experiments demonstrate the superior probabilistic performance of Prob-SARAH on real datasets compared to other popular algorithms.

Submitted 29 January, 2024; originally announced January 2024.

Comments: 41 pages, 3 figures, accepted to PAKDD 2024
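
For readers unfamiliar with the base algorithm named in this abstract, the following is a minimal sketch of the plain SARAH recursion, v_t = ∇f_i(w_t) − ∇f_i(w_{t−1}) + v_{t−1}, on a toy least-squares finite sum. It is not Prob-SARAH and is not taken from the paper: the helper names (`sarah`, `make_least_squares`), the step size, the loop lengths, and the choice to return the last iterate are all assumptions made for illustration.

```python
import random


def sarah(grad_i, n, w0, step, outer_iters, inner_iters, rng):
    """Plain SARAH sketch: one full gradient per outer loop, then recursive estimates.

    Simplification (assumption): the last inner iterate is carried to the next
    outer loop, rather than a randomly chosen inner iterate.
    """
    w = list(w0)
    for _ in range(outer_iters):
        # Full gradient of the finite sum at the current point.
        v = [0.0] * len(w)
        for i in range(n):
            g = grad_i(i, w)
            v = [vj + gj / n for vj, gj in zip(v, g)]
        w_prev, w = w, [wj - step * vj for wj, vj in zip(w, v)]
        for _ in range(inner_iters):
            i = rng.randrange(n)
            g_new, g_old = grad_i(i, w), grad_i(i, w_prev)
            # Recursive estimator: v_t = grad_i(w_t) - grad_i(w_{t-1}) + v_{t-1}
            v = [gn - go + vj for gn, go, vj in zip(g_new, g_old, v)]
            w_prev, w = w, [wj - step * vj for wj, vj in zip(w, v)]
    return w


def make_least_squares(n, d, rng):
    """Hypothetical toy problem: f_i(w) = 0.5 * (a_i . w - b_i)^2 with noiseless targets."""
    rows = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    x_true = [rng.gauss(0.0, 1.0) for _ in range(d)]
    targets = [sum(a * x for a, x in zip(row, x_true)) for row in rows]

    def grad_i(i, w):
        residual = sum(a * x for a, x in zip(rows[i], w)) - targets[i]
        return [residual * a for a in rows[i]]

    def loss(w):
        total = 0.0
        for row, t in zip(rows, targets):
            total += (sum(a * x for a, x in zip(row, w)) - t) ** 2
        return total / (2 * n)

    return grad_i, loss


rng = random.Random(0)
grad_i, loss = make_least_squares(n=50, d=5, rng=rng)
w_start = [0.0] * 5
initial_loss = loss(w_start)
w_final = sarah(grad_i, 50, w_start, step=0.01, outer_iters=20, inner_iters=50, rng=rng)
final_loss = loss(w_final)
```

Comparing `initial_loss` with `final_loss` gives a quick sanity check that the recursion is reducing the objective on this toy problem; it says nothing about the high-probability guarantees the paper studies.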

arXiv:2312.07722 [pdf, other]

Error Analysis for the Implicit Boundary Integral Method

Authors: Yimin Zhong, Kui Ren, Olof Runborg, Richard Tsai

Abstract: The implicit boundary integral method (IBIM) provides a framework to construct quadrature rules on regular lattices for integrals over irregular domain boundaries. This work provides a systematic error analysis for IBIMs on uniform Cartesian grids for boundaries with different degrees of regularity. We first show that the quadrature error gains an additional order of $\frac{d-1}{2}$ from the curvature for a strongly convex smooth boundary due to the "randomness" in the signed distances. This gain is discounted for degenerate convex surfaces. We then extend the error estimate to general boundaries under some special circumstances, including how quadrature error depends on the boundary's local geometry relative to the underlying grid. Bounds on the variance of the quadrature error under random shifts and rotations of the lattices are also derived.

Submitted 12 December, 2023; originally announced December 2023.

MSC Class: 65D32; 41A55

ACM Class: G.1.4

arXiv:2308.04678 [pdf, ps, other]

Inequalities for the $k$-Regular Overpartitions

Authors: Yi Peng, Helen W. J. Zhang, Ying Zhong

Abstract: Bessenrodt and Ono, Chen, Wang and Jia, DeSalvo and Pak were the first to discover the log-subadditivity, log-concavity, and the third-order Turán inequality of the partition function, respectively. Many other important partition statistics are proved to enjoy similar properties. This paper focuses on the partition function $\overline{p}_k(n)$, which counts the number of overpartitions of $n$ with no parts divisible by $k$. We provide a combinatorial proof to establish that for any $k\geq2$, the partition function $\overline{p}_k(n)$ exhibits strict log-subadditivity. Specifically, we show that $\overline{p}_k(a)\overline{p}_k(b)>\overline{p}_k(a+b)$ for integers $a\geq b\geq1$ and $a+b\geq k$. Furthermore, we investigate the log-concavity and the satisfaction of the third-order Turán inequality for $\overline{p}_k(n)$, where $2\leq k\leq9$.

Submitted 8 August, 2023; originally announced August 2023.

Comments: 28 pages

arXiv:2306.17301 [pdf, other]

Why Shallow Networks Struggle with Approximating and Learning High Frequency: A Numerical Study

Authors: Shijun Zhang, Hongkai Zhao, Yimin Zhong, Haomin Zhou

Abstract: In this work, a comprehensive numerical study involving analysis and experiments shows why a two-layer neural network has difficulties handling high frequencies in approximation and learning when machine precision and computation cost are important factors in real practice. In particular, the following basic computational issues are investigated: (1) the minimal numerical error one can achieve given a finite machine precision, (2) the computation cost to achieve a given accuracy, and (3) stability with respect to perturbations. The key to the study is the conditioning of the representation and its learning dynamics. Explicit answers to the above questions with numerical verifications are presented.

Submitted 21 November, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

arXiv:2306.03335 [pdf, other]

Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage

Authors: Yu Gui, Cong Ma, Yiqiao Zhong

Abstract: We investigate the role of projection heads, also known as projectors, within the encoder-projector framework (e.g., SimCLR) used in contrastive learning. We aim to demystify the observed phenomenon where representations learned before projectors outperform those learned after -- measured using the downstream linear classification accuracy, even when the projectors themselves are linear. In this paper, we make two significant contributions towards this aim. Firstly, through empirical and theoretical analysis, we identify two crucial effects -- expansion and shrinkage -- induced by the contrastive loss on the projectors. In essence, contrastive loss either expands or shrinks the signal direction in the representations learned by an encoder, depending on factors such as the augmentation strength, the temperature used in contrastive loss, etc. Secondly, drawing inspiration from the expansion and shrinkage phenomenon, we propose a family of linear transformations to accurately model the projector's behavior. This enables us to precisely characterize the downstream linear classification accuracy in the high-dimensional asymptotic limit. Our findings reveal that linear projectors operating in the shrinkage (or expansion) regime hinder (or improve) the downstream classification accuracy. This provides the first theoretical explanation as to why (linear) projectors impact the downstream performance of learned representations. Our theoretical findings are further corroborated by extensive experiments on both synthetic data and real image data.

Submitted 5 June, 2023; originally announced June 2023.

arXiv:2306.02205 [pdf, other]

Online Bootstrap Inference with Nonconvex Stochastic Gradient Descent Estimator

Authors: Yanjie Zhong, Todd Kuffner, Soumendra Lahiri

Abstract: In this paper, we investigate the theoretical properties of stochastic gradient descent (SGD) for statistical inference in the context of nonconvex optimization problems, which have been relatively unexplored compared to convex settings. Our study is the first to establish provable inferential procedures using the SGD estimator for general nonconvex objective functions, which may contain multiple local minima. We propose two novel online inferential procedures that combine SGD and the multiplier bootstrap technique. The first procedure employs a consistent covariance matrix estimator, and we establish its error convergence rate. The second procedure approximates the limit distribution using bootstrap SGD estimators, yielding asymptotically valid bootstrap confidence intervals. We validate the effectiveness of both approaches through numerical experiments. Furthermore, our analysis yields an intermediate result: the in-expectation error convergence rate for the original SGD estimator in nonconvex settings, which is comparable to existing results for convex problems. We believe this novel finding holds independent interest and enriches the literature on optimization and statistical inference.

Submitted 3 June, 2023; originally announced June 2023.

arXiv:2305.00092 [pdf, other]

Improving Gradient Computation for Differentiable Physics Simulation with Contacts

Authors: Yaofeng Desmond Zhong, Jiequn Han, Biswadip Dey, Georgia Olympia Brikis

Abstract: Differentiable simulation enables gradients to be back-propagated through physics simulations. In this way, one can learn the dynamics and properties of a physics system by gradient-based optimization or embed the whole differentiable simulation as a layer in a deep learning model for downstream tasks, such as planning and control. However, differentiable simulation at its current stage is not perfect and might provide wrong gradients that deteriorate its performance in learning tasks. In this paper, we study differentiable rigid-body simulation with contacts. We find that existing differentiable simulation methods provide inaccurate gradients when the contact normal direction is not fixed - a general situation when the contacts are between two moving objects. We propose to improve gradient computation by continuous collision detection and leverage the time-of-impact (TOI) to calculate the post-collision velocities. We demonstrate our proposed method, referred to as TOI-Velocity, on two optimal control problems. We show that with TOI-Velocity, we are able to learn an optimal control sequence that matches the analytical solution, while without TOI-Velocity, existing differentiable simulation methods fail to do so.

Submitted 28 April, 2023; originally announced May 2023.

Comments: 5th Annual Conference on Learning for Dynamics and Control

Journal ref: Proceedings of Machine Learning Research vol 211, 2023

arXiv:2304.13845 [pdf, other]

Some Asymptotic Properties of the Erlang-C Formula in Many-Server Limiting Regimes

Authors: Ragavendran Gopalakrishnan, Yueyang Zhong

Abstract: This paper presents asymptotic properties of the Erlang-C formula in a spectrum of many-server limiting regimes. Specifically, we address an important gap in the literature regarding its limiting value in critically loaded regimes by studying extensions of the well-known square-root safety staffing rule used in the Quality-and-Efficiency-Driven (QED) regime.

Submitted 10 May, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

Comments: 14 pages
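
As background for this entry, here is a small sketch of the standard Erlang-C delay probability, computed through the Erlang-B recursion, together with the classical square-root safety staffing rule n = R + β√R and the Halfin-Whitt limit it is usually compared against. None of this is taken from the paper itself; the function names and the sample load and β values are assumptions chosen for illustration.

```python
import math


def erlang_b(servers, offered_load):
    """Erlang-B blocking probability via the standard stable recursion."""
    b = 1.0
    for k in range(1, servers + 1):
        b = offered_load * b / (k + offered_load * b)
    return b


def erlang_c(servers, offered_load):
    """Erlang-C probability that an arrival must wait in an M/M/n queue."""
    if offered_load >= servers:
        return 1.0  # unstable system: every arrival waits
    b = erlang_b(servers, offered_load)
    rho = offered_load / servers
    return b / (1.0 - rho * (1.0 - b))


def sqrt_staffing(offered_load, beta):
    """Square-root safety staffing: n = R + beta * sqrt(R), rounded up."""
    return math.ceil(offered_load + beta * math.sqrt(offered_load))


def halfin_whitt(beta):
    """Classical QED limit of the delay probability: 1 / (1 + beta * Phi(beta) / phi(beta))."""
    pdf = math.exp(-beta * beta / 2.0) / math.sqrt(2.0 * math.pi)
    cdf = 0.5 * (1.0 + math.erf(beta / math.sqrt(2.0)))
    return 1.0 / (1.0 + beta * cdf / pdf)


# Illustrative values (assumptions): offered load R = 10000 Erlangs, beta = 1.
load, beta = 10000.0, 1.0
n = sqrt_staffing(load, beta)
delay_probability = erlang_c(n, load)
qed_limit = halfin_whitt(beta)
```

Evaluating `delay_probability` for growing `load` at fixed `beta` shows it settling near `qed_limit`, which is the behavior the square-root rule is designed to produce.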

arXiv:2304.11495 [pdf, ps, other]

Explicit Directional Affine Extractors and Improved Hardness for Linear Branching Programs

Authors: Xin Li, Yan Zhong

Abstract: In a recent work, Gryaznov, Pudlák, and Talebanfard (CCC' 22) introduced a stronger version of affine extractors known as directional affine extractors, together with a generalization of $\mathsf{ROBP}$s where each node can make linear queries, and showed that the former implies strong lower bounds for a certain type of the latter known as strongly read-once linear branching programs ($\mathsf{SROLBP}$s). Their main result gives explicit constructions of directional affine extractors for entropy $k > 2n/3$, which implies average-case complexity $2^{n/3-o(n)}$ against $\mathsf{SROLBP}$s with exponentially small correlation. A follow-up work by Chattopadhyay and Liao (ECCC' 22) improves the hardness to $2^{n-o(n)}$ at the price of increasing the correlation to polynomially large. In this paper we show: (1) An explicit construction of directional affine extractors with $k=o(n)$ and exponentially small error, which gives average-case complexity $2^{n-o(n)}$ against $\mathsf{SROLBP}$s with exponentially small correlation, thus answering the two open questions raised in previous works. (2) An explicit function in $\mathsf{AC}^0$ that gives average-case complexity $2^{(1-δ)n}$ against $\mathsf{ROBP}$s with negligible correlation, for any constant $δ>0$. Previously, no such average-case hardness was known, and the best size lower bound for any function in $\mathsf{AC}^0$ against $\mathsf{ROBP}$s is $2^{Ω(n)}$. One of the key ingredients in our constructions is a new linear somewhere condenser for affine sources, which is based on dimension expanders. The condenser also leads to an unconditional improvement of the entropy requirement of explicit affine extractors with negligible error. We further show that the condenser also works for general weak random sources, under the Polynomial Freiman-Ruzsa Theorem in $\mathsf{F}_2^n$.

Submitted 3 July, 2024; v1 submitted 22 April, 2023; originally announced April 2023.

arXiv:2304.06316 [pdf, ps, other]

Asymptotics for $k$-crank of $k$-colored partitions

Authors: Helen W. J. Zhang, Ying Zhong

Abstract: In this paper, we obtain asymptotic formulas for $k$-crank of $k$-colored partitions. Let $M_k(a, c; n)$ denote the number of $k$-colored partitions of $n$ with a $k$-crank congruent to $a$ mod $c$. For the cases $k=2,3,4$, Fu and Tang derived several inequality relations for $M_k(a, c; n)$ using generating functions. We employ the Hardy-Ramanujan Circle Method to extend the results of Fu and Tang. Furthermore, additional inequality relations for $M_k(a, c; n)$ have been established, such as logarithmic concavity and logarithmic subadditivity.

Submitted 13 April, 2023; originally announced April 2023.

Comments: 40 pages. arXiv admin note: text overlap with arXiv:1311.4344 by other authors

arXiv:2304.03633 [pdf, other]

On the generalized Hausdorff dimension of Besicovitch sets

Authors: Xianghong Chen, Lixin Yan, Yue Zhong

Abstract: Keich (1999) showed that the sharp gauge function for the generalized Hausdorff dimension of Besicovitch sets in $\mathbb R^2$ is between $r^2\log 1/r$ and $r^2(\log 1/r) (\log\log 1/r)^{2+\varepsilon}$ by refining an argument of Bourgain (1991). It is not known whether the iterated logarithms in Keich's bound are necessary. In this paper we construct a family of Besicovitch line sets whose sharp gauge function is smaller than $r^2(\log 1/r) (\log\log 1/r)^{\varepsilon}$. Moreover, these Besicovitch sets are minimal in the sense that there is essentially only one line in the set pointing in each direction.

Submitted 7 April, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

Comments: 50 pages, 10 figures. Submitted for publication

arXiv:2210.17024 [pdf, other]

Transport models for wave propagation in scattering media with nonlinear absorption

Authors: Joseph Kraisler, Wei Li, Kui Ren, John C. Schotland, Yimin Zhong

Abstract: This work considers the propagation of high-frequency waves in highly-scattering media where physical absorption of a nonlinear nature occurs. Using the classical tools of the Wigner transform and multiscale analysis, we derive semilinear radiative transport models for the phase-space intensity and the diffusive limits of such transport models. As an application, we consider an inverse problem for the semilinear transport equation, where we reconstruct the absorption coefficients of the equation from a functional of its solution. We obtain a uniqueness result on the inverse problem.

Submitted 30 October, 2022; originally announced October 2022.

arXiv:2210.03699 [pdf, other]

Corrected Trapezoidal Rule-IBIM for linearized Poisson-Boltzmann equation

Authors: Federico Izzo, Yimin Zhong, Olof Runborg, Richard Tsai

Abstract: In this paper, we solve the linearized Poisson-Boltzmann equation, used to model the electric potential of macromolecules in a solvent. We derive a corrected trapezoidal rule with improved accuracy for a boundary integral formulation of the linearized Poisson-Boltzmann equation. More specifically, in contrast to the typical boundary integral formulations, the corrected trapezoidal rule is applied to integrate a system of compactly supported singular integrals using uniform Cartesian grids in $\mathbb{R}^3$, without explicit surface parameterization. A Krylov method, accelerated by a fast multipole method, is used to invert the resulting linear system. We study the efficacy of the proposed method, and compare it to an existing, lower order method. We then apply the method to the computation of electrostatic potential of macromolecules immersed in solvent. The solvent excluded surfaces, defined by a common approach, are merely piecewise smooth, and we study the effectiveness of the method for such surfaces.

Submitted 7 October, 2022; originally announced October 2022.

Comments: 22 pages, 6 figures

MSC Class: 45A05; 65R20; 6N5D30; 65N80; 78M16; 92E10

arXiv:2209.15495 [pdf, other]

The constant term algebra of type $A$: the Structure

Authors: Guoce Xin, Chen Zhang, Yue Zhou, Yueming Zhong

Abstract: In this paper, we discover a new noncommutative algebra. We refer to this algebra as the constant term algebra of type $A$, which is generated by certain constant term operators. We characterize a structural result of this algebra by establishing an explicit basis in terms of certain forests. This algebra arises when we apply the method of the iterated Laurent series to investigate Beck and Pixton's residue computation for the Ehrhart series of the Birkhoff polytope. This algebra seems to be the first structural result in the area of the constant term world since the discovery of the Dyson constant term identity in 1962.

Submitted 3 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

Comments: 28 pages, 4 figures

MSC Class: Primary 08A05; Secondary 47C05; 05A19

arXiv:2206.12833 [pdf, ps, other]

Strict Log-Subadditivity for Overpartition Rank

Authors: Helen W. J. Zhang, Ying Zhong

Abstract: Bessenrodt and Ono initially found the strict log-subadditivity of the partition function $p(n)$, that is, $p(a+b)< p(a)p(b)$ for $a,b>1$ and $a+b>9$. Many other important statistics of partitions are proved to enjoy similar properties. Lovejoy introduced the overpartition rank as an analog of Dyson's rank for partitions from the $q$-series perspective. Let $\overline{N}(a,c,n)$ denote the number of overpartitions with rank congruent to $a$ modulo $c$. Ciolan computed the asymptotic formula of $\overline{N}(a,c,n)$ and showed that $\overline{N}(a, c, n) > \overline{N}(b, c, n)$ for $c\geq7$ and $n$ large enough. In this paper, we derive an upper bound and a lower bound of $\overline{N}(a,c,n)$ for each $c\geq3$ by using the asymptotics of Ciolan. Consequently, we establish the strict log-subadditivity of $\overline{N}(a,c,n)$ analogous to the partition function $p(n)$.

Submitted 26 June, 2022; originally announced June 2022.

Comments: 19 pages

MSC Class: 05A20; 11P82; 11P83
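
The Bessenrodt and Ono inequality quoted in the abstract above can be checked numerically for small arguments. The following is a minimal sketch, not code from the paper: the function names `partition_numbers` and `log_subadditivity_failures` are hypothetical, and $p(n)$ is computed with the standard "add one part size at a time" recurrence.

```python
def partition_numbers(limit):
    """Return [p(0), ..., p(limit)], the number of integer partitions of each n."""
    p = [1] + [0] * limit
    # Allow parts of size 1, 2, ..., limit one at a time (coin-counting recurrence).
    for part in range(1, limit + 1):
        for n in range(part, limit + 1):
            p[n] += p[n - part]
    return p


def log_subadditivity_failures(limit):
    """Pairs (a, b) with 1 < a <= b, 9 < a + b <= limit where p(a+b) >= p(a) * p(b)."""
    p = partition_numbers(limit)
    return [
        (a, b)
        for a in range(2, limit + 1)
        for b in range(a, limit - a + 1)
        if a + b > 9 and p[a + b] >= p[a] * p[b]
    ]


if __name__ == "__main__":
    print(partition_numbers(10))
    print(log_subadditivity_failures(60))
```

The condition $a+b>9$ cannot be relaxed: at $(a,b)=(2,7)$ the two sides are equal, since $p(2)p(7)=2\cdot 15=30=p(9)$.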

arXiv:2206.12583 [pdf, ps, other]

Normalized ground state solutions for the fractional Sobolev critical NLSE with an extra mass supercritical nonlinearity

Authors: Jiabin Zuo, Yuyou Zhong, Dušan D. Repovš

Abstract: This paper is concerned with existence of normalized ground state solutions for the mass supercritical fractional nonlinear Schrödinger equation involving a critical growth in the fractional Sobolev sense. The compactness of Palais-Smale sequences is obtained by a special technique, which borrows from the ideas of Soave (J. Funct. Anal. 279 (6) (2020) 108610). This paper represents an extension of previously known results - in the local and the nonlocal cases.

Submitted 28 July, 2023; v1 submitted 25 June, 2022; originally announced June 2022.

MSC Class: 26A33; 35A15; 35B33; 35J20

Journal ref: St. Petersburg Math. J. (2023)

arXiv:2206.05336 [pdf, other]

How much can one learn from a single solution of a PDE?

Authors: Hongkai Zhao, Yimin Zhong

Abstract: Linear evolution PDE $\partial_t u(x,t) = -\mathcal{L} u$, where $\mathcal{L}$ is a strongly elliptic operator independent of time, is studied as an example to show if one can superpose snapshots of a single (or a finite number of) solution(s) to construct an arbitrary solution. Our study shows that it depends on the growth rate of the eigenvalues, $μ_n$, of $\mathcal{L}$ in terms of $n$. When the statement is true, a simple data-driven approach for model reduction and approximation of an arbitrary solution of a PDE without knowing the underlying PDE is designed. Numerical experiments are presented to corroborate our analysis.

Submitted 10 June, 2022; originally announced June 2022.

MSC Class: 44A60; 65D05

arXiv:2206.02167 [pdf, ps, other]

Asymptotic formula for the $M_2$-ranks of overpartitions

Authors: Helen W. J. Zhang, Ying Zhong

Abstract: Let $\overline{N}_2(a,c,n)$ be the number of overpartitions of $n$ whose $M_2$-rank is congruent to $a$ modulo $c$. In this paper, we obtain the asymptotic formula of $\overline{N}_2(a,c,n)$ utilizing the Ingham Tauberian Theorem. As applications, we derive inequalities concerning $\overline{N}_2(a,c,n)$, including its strict concavity and log-concavity.

Submitted 5 June, 2022; originally announced June 2022.

MSC Class: 05A17; 11P72; 11P82

arXiv:2206.00260 [pdf, other]

Multi-block Min-max Bilevel Optimization with Applications in Multi-task Deep AUC Maximization

Authors: Quanqi Hu, Yongjian Zhong, Tianbao Yang

Abstract: In this paper, we study multi-block min-max bilevel optimization problems, where the upper level is a non-convex strongly-concave minimax objective and the lower level is a strongly convex objective, and there are multiple blocks of dual variables and lower level problems. Due to the intertwined multi-block min-max bilevel structure, the computational cost at each iteration could be prohibitively high, especially with a large number of blocks. To tackle this challenge, we present a single-loop randomized stochastic algorithm, which requires updates for only a constant number of blocks at each iteration. Under some mild assumptions on the problem, we establish its sample complexity of $O(1/ε^4)$ for finding an $ε$-stationary point. This matches the optimal complexity for solving stochastic nonconvex optimization under a general unbiased stochastic oracle model. Moreover, we provide two applications of the proposed method in multi-task deep AUC (area under ROC curve) maximization and multi-task deep partial AUC maximization. Experimental results validate our theory and demonstrate the effectiveness of our method on problems with hundreds of tasks.

Submitted 17 November, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

arXiv:2204.07961 [pdf, ps, other]

Higher order log-concavity of the overpartition function and its consequences

Authors: Gargi Mukherjee, Helen W. J. Zhang, Ying Zhong

Abstract: Let $\bar{p}(n)$ denote the overpartition function. In this paper, we study the asymptotic higher order $\log$-concavity property of the overpartition function in a framework similar to that of Hou and Zhang for the partition function. This enables us to go further and prove $\log$-concavity of overpartitions explicitly, by studying the asymptotic expansion of the quotient $\bar{p}(n-1)\bar{p}(n+1)/\bar{p}(n)^2$ up to a certain order, so that one finally arrives at $2$-$\log$-concavity and the higher order Turán property of $\bar{p}(n)$ by following a unified approach.

Submitted 17 April, 2022; originally announced April 2022.

MSC Class: 05A20; 11N37; 65G99
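
The quotient $\bar{p}(n-1)\bar{p}(n+1)/\bar{p}(n)^2$ from the abstract above can be tabulated directly for small $n$. The sketch below is illustrative only and is not the paper's asymptotic method: the names `overpartition_numbers` and `log_concavity_quotients` are hypothetical, and $\bar{p}(n)$ is obtained by expanding the standard generating function $\prod_{k\ge 1}(1+q^k)/(1-q^k)$ as a truncated power series.

```python
from fractions import Fraction


def overpartition_numbers(limit):
    """Return [pbar(0), ..., pbar(limit)] from prod_k (1 + q^k) / (1 - q^k)."""
    p = [1] + [0] * limit
    for k in range(1, limit + 1):
        # Multiply the truncated series by 1 / (1 - q^k): ascending in-place update.
        for n in range(k, limit + 1):
            p[n] += p[n - k]
        # Multiply by (1 + q^k): descending update so old coefficients are used.
        for n in range(limit, k - 1, -1):
            p[n] += p[n - k]
    return p


def log_concavity_quotients(limit):
    """Map n -> pbar(n-1) * pbar(n+1) / pbar(n)^2 for 1 <= n < limit, as exact fractions."""
    p = overpartition_numbers(limit)
    return {n: Fraction(p[n - 1] * p[n + 1], p[n] ** 2) for n in range(1, limit)}


if __name__ == "__main__":
    for n, q in log_concavity_quotients(12).items():
        print(n, q, float(q))
```

A quotient of at most 1 at index $n$ is exactly log-concavity of $\bar{p}$ at $n$; exact fractions avoid rounding when the quotient is close to 1.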

arXiv:2204.04602 [pdf, other]

How much can one learn a partial differential equation from its solution?

Authors: Yuchen He, Hongkai Zhao, Yimin Zhong

Abstract: In this work we study the problem of learning a partial differential equation (PDE) from its solution data. PDEs of various types are used as examples to illustrate how much the solution data can reveal the PDE operator depending on the underlying operator and initial data. A data driven and data adaptive approach based on local regression and global consistency is proposed for stable PDE identification. Numerical experiments are provided to verify our analysis and demonstrate the performance of the proposed algorithms.

Submitted 9 November, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

Comments: 44 pages

MSC Class: 35K15; 47A10; 47F10; 62J05; 65J10; 35R30

arXiv:2203.08438

On Sombor index of graphs with a given number of cut-vertices

Authors: Sakander Hayat, Ansar Rehman, Yubin Zhong

Abstract: Introduced by Gutman in 2021, the Sombor index is a novel graph-theoretic topological descriptor possessing potential applications in the modeling of thermodynamic properties of compounds. Let G^k_n be the set of all n-vertex connected graphs with k cut-vertices. In this paper, we present minimum Sombor indices of graphs in G^k_n. The corresponding extremal graphs have been characterized as well.

Submitted 30 June, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: The paper was reviewed by a journal and reviewers found some major errors in the proofs of results regarding trees in Section 2. Moreover, there was a major error in Lemma 3 of Section 3. Since the main results are based on these lemmas, we need to withdraw the paper from arXiv and fix all the errors, as it will take a considerable amount of time

MSC Class: 05C92; 05C09; 05C35

arXiv:2202.12183 [pdf, other]

Large-scale Stochastic Optimization of NDCG Surrogates for Deep Learning with Provable Convergence

Authors: Zi-Hao Qiu, Quanqi Hu, Yongjian Zhong, Lijun Zhang, Tianbao Yang

Abstract: NDCG, namely Normalized Discounted Cumulative Gain, is a widely used ranking metric in information retrieval and machine learning. However, efficient and provable stochastic methods for maximizing NDCG are still lacking, especially for deep models. In this paper, we propose a principled approach to optimize NDCG and its top-$K$ variant. First, we formulate a novel compositional optimization problem for optimizing the NDCG surrogate, and a novel bilevel compositional optimization problem for optimizing the top-$K$ NDCG surrogate. Then, we develop efficient stochastic algorithms with provable convergence guarantees for the non-convex objectives. Different from existing NDCG optimization methods, the per-iteration complexity of our algorithms scales with the mini-batch size instead of the number of total items. To improve the effectiveness for deep learning, we further propose practical strategies by using initial warm-up and stop gradient operator. Experimental results on multiple datasets demonstrate that our methods outperform prior ranking approaches in terms of NDCG. To the best of our knowledge, this is the first time that stochastic algorithms are proposed to optimize NDCG with a provable convergence guarantee. Our proposed methods are implemented in the LibAUC library at https://libauc.org/.

Submitted 2 February, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

Comments: 32 pages, 12 figures; Accepted by ICML2022

arXiv:2202.11888 [pdf, other]

Inverse Source Problem for Acoustically-Modulated Electromagnetic Waves

Authors: Wei Li, John C. Schotland, Yang Yang, Yimin Zhong

Abstract: We propose a method to reconstruct the electrical current density from acoustically-modulated boundary measurements of time-harmonic electromagnetic fields. We show that the current can be uniquely reconstructed with Lipschitz stability. We also report numerical simulations to illustrate the analytical results.

Submitted 23 February, 2022; originally announced February 2022.

Comments: 20 pages, 4 figures

arXiv:2201.02376 [pdf, other]

Proving some conjectures on Kekulé numbers for certain benzenoids by using Chebyshev polynomials

Authors: Guoce Xin, Yueming Zhong

Abstract: In chemistry, Cyvin-Gutman enumerate Kekulé numbers for certain benzenoids, recorded as $A050446$ on the OEIS. This number is exactly the two variable array $T(n,m)$ defined by the recursion $T(n, m) = T(n, m-1) + \sum^{\lfloor\frac{n-1}{2}\rfloor}_{k=0} T(2k, m-1)T(n-1-2k, m)$, where $T(n,0)=T(0,m)=1$ for all nonnegative integers $m,n$. Interestingly, this number also appeared in the context of weighted graphs, graph polytopes, magic labellings, and unit primitive matrices, studied by different authors. Several interesting conjectures were made on the OEIS. These conjectures are related to both the row and column generating functions of $T(n,m)$. In this paper, we give an explicit formula for the column generating function, which is also the generating function $F(n,x)$ studied by Bóna, Ju, and Yoshida. We also get trigonometric function representations by using Chebyshev polynomials of the second kind. This allows us to prove all these conjectures.

Submitted 7 January, 2022; originally announced January 2022.

Comments: 28 pages,3 figures, 1 table

MSC Class: 05A15; 15A18; 05C78; 52B11
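
The recursion for $T(n,m)$ stated in the abstract above translates directly into a memoized function. This is a minimal sketch of the recursion as written there, not the authors' generating-function approach; the name `kekule_T` is hypothetical.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def kekule_T(n, m):
    """T(n, m) = T(n, m-1) + sum_{k=0}^{floor((n-1)/2)} T(2k, m-1) * T(n-1-2k, m),
    with boundary values T(n, 0) = T(0, m) = 1."""
    if n == 0 or m == 0:
        return 1
    total = kekule_T(n, m - 1)
    # k runs over 0, 1, ..., floor((n - 1) / 2), as in the stated sum.
    for k in range((n - 1) // 2 + 1):
        total += kekule_T(2 * k, m - 1) * kekule_T(n - 1 - 2 * k, m)
    return total


if __name__ == "__main__":
    # Print a small corner of the array, one row per value of n.
    for n in range(6):
        print([kekule_T(n, m) for m in range(6)])
```

Unrolling the recursion by hand gives $T(1,m)=m+1$ and $T(2,m)=(m+1)(m+2)/2$, and the column $m=1$ satisfies $T(n,1)=T(n-1,1)+T(n-2,1)$, which gives a quick sanity check on the implementation.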

arXiv:2111.03376 [pdf, other]

Simplex Initialization: A Survey of Techniques and Trends

Authors: Mengyu Huang, Yuxing Zhong, Huiwen Yang, Jiazheng Wang, Fan Zhang, Bo Bai, Ling Shi

Abstract: The simplex method is one of the most fundamental technologies for solving linear programming (LP) problems and has been widely applied to different practical applications. In the past literature, how to improve and accelerate the simplex method has attracted plenty of research. One important way to achieve this goal is to find a better initialization method for the simplex. In this survey, we aim to provide an overview about the initialization methods in the primal and dual simplex, respectively. We also propose several potential future directions about how to improve the existing initialization methods with the help of advanced learning technologies.

Submitted 5 November, 2021; originally announced November 2021.

arXiv:2110.15824 [pdf, other]

Tractability from overparametrization: The example of the negative perceptron

Authors: Andrea Montanari, Yiqiao Zhong, Kangjie Zhou

Abstract: In the negative perceptron problem we are given $n$ data points $({\boldsymbol x}_i,y_i)$, where ${\boldsymbol x}_i$ is a $d$-dimensional vector and $y_i\in\{+1,-1\}$ is a binary label. The data are not linearly separable and hence we content ourselves to find a linear classifier with the largest possible \emph{negative} margin. In other words, we want to find a unit norm vector ${\boldsymbol θ}$ that maximizes $\min_{i\le n}y_i\langle {\boldsymbol θ},{\boldsymbol x}_i\rangle$. This is a non-convex optimization problem (it is equivalent to finding a maximum norm vector in a polytope), and we study its typical properties under two random models for the data. We consider the proportional asymptotics in which $n,d\to \infty$ with $n/d\toδ$, and prove upper and lower bounds on the maximum margin $κ_{\text{s}}(δ)$ or -- equivalently -- on its inverse function $δ_{\text{s}}(κ)$. In other words, $δ_{\text{s}}(κ)$ is the overparametrization threshold: for $n/d\le δ_{\text{s}}(κ)-\varepsilon$ a classifier achieving vanishing training error exists with high probability, while for $n/d\ge δ_{\text{s}}(κ)+\varepsilon$ it does not. Our bounds on $δ_{\text{s}}(κ)$ match to the leading order as $κ\to -\infty$. We then analyze a linear programming algorithm to find a solution, and characterize the corresponding threshold $δ_{\text{lin}}(κ)$. We observe a gap between the interpolation threshold $δ_{\text{s}}(κ)$ and the linear programming threshold $δ_{\text{lin}}(κ)$, raising the question of the behavior of other algorithms.

Submitted 3 July, 2023; v1 submitted 27 October, 2021; originally announced October 2021.

Comments: 107 pages; 7 pdf figures

arXiv:2110.10630 [pdf, ps, other]

The Complex Ball-quotient Structure of the Moduli Space of Certain Sextic Curves

Authors: Zhiwei Zheng, Yiming Zhong

Abstract: We study moduli spaces of certain sextic curves with a singularity of multiplicity 3 from both perspectives of Deligne-Mostow theory and periods of K3 surfaces. In both ways we can describe the moduli spaces via arithmetic quotients of complex hyperbolic balls. We show in Theorem 7.4 that the two ball-quotient constructions can be unified in a geometric way.

Submitted 20 October, 2021; originally announced October 2021.

Comments: 26 pages, comments welcome!

arXiv:2107.11715 [pdf, other]

A symmetric chain decomposition of $N(m,n)$ of composition

Authors: Yueming Zhong

Abstract: A poset is said to have a symmetric chain decomposition if it can be expressed as a disjoint union of symmetric chains. For positive integers $m$ and $n$, let $N(m,n)$ denote the set of all compositions $α=(α_1,\cdots,α_m)$, with $0\le α_i \le n$ for each $i=1,\cdots,m$. Define the order $<$ as follows: $\forall α,β\in N(m,n)$, $β< α$ if and only if $β_i \le α_i(i=1,\cdots,m)$ and $\sum\limits_{i=1}^{m}β_i <\sum\limits_{i=1}^{m}α_i$. In this paper, we show by a constructive method that the poset $(N(m,n),<)$ can be expressed as a disjoint union of symmetric chains.

Submitted 24 July, 2021; originally announced July 2021.

Comments: 10 pages, 7 figures

MSC Class: Primary 05A19; Secondary 05E99

arXiv:2107.04709 [pdf, other]

Multiplayer Homicidal Chauffeur Reach-Avoid Games via Guaranteed Winning Strategies

Authors: Rui Yan, Ruiliang Deng, Haowen Lai, Weixian Zhang, Zongying Shi, Yisheng Zhong

Abstract: This paper studies a planar multiplayer Homicidal Chauffeur reach-avoid differential game, where each pursuer is a Dubins car and each evader has simple motion. The pursuers aim to protect a goal region cooperatively from the evaders. Due to the high-dimensional strategy space among pursuers, we decompose the whole game into multiple one-pursuer-one-evader subgames, each of which is solved in an analytical approach instead of solving Hamilton-Jacobi-Isaacs equations. For each subgame, an evasion region (ER) is introduced, based on which a pursuit strategy guaranteeing the winning of a simple-motion pursuer under specific conditions is proposed. Motivated by the simple-motion pursuer, a strategy for a Dubins-car pursuer is proposed when the pursuer-evader configuration satisfies separation condition (SC) and interception orientation (IO). The necessary and sufficient condition on capture radius, minimum turning radius and speed ratio to guarantee the pursuit winning is derived. When the IO is not satisfied (Non-IO), a heading adjustment pursuit strategy is proposed, and the condition to achieve IO within a finite time, is given. Then, a two-step pursuit strategy is proposed for the SC and Non-IO case. A non-convex optimization problem is introduced to give a condition guaranteeing the winning of the pursuer. A polynomial equation gives a lower bound of the non-convex problem, providing a sufficient and efficient pursuit winning condition. Finally, these pairwise outcomes are collected for the pursuer-evader matching. Simulations are provided to illustrate the theoretical results.

Submitted 9 July, 2021; originally announced July 2021.

Comments: 15 pages, 5 figures

arXiv:2107.03161 [pdf, other]

On Magic Distinct Labellings of Simple Graphs

Authors: Guoce Xin, Xinyu Xu, Chen Zhang, Yueming Zhong

Abstract: A magic labelling of a graph $G$ with magic sum $s$ is a labelling of the edges of $G$ by nonnegative integers such that for each vertex $v\in V$, the sum of labels of all edges incident to $v$ is equal to the same number $s$. Stanley gave remarkable results on magic labellings, but the distinct labelling case is much more complicated. We consider the complete construction of all magic labellings of a given graph $G$. The idea is illustrated in detail by dealing with three regular graphs. We give combinatorial proofs. The structure result was used to enumerate the corresponding magic distinct labellings.

Submitted 7 July, 2021; originally announced July 2021.

Comments: 14 pages, 6 figures

MSC Class: Primary 05A19; Secondary 11D04; 05C78

arXiv:2106.11407 [pdf, ps, other]

Asymptotically Optimal Idling in the GI/GI/N+GI Queue

Authors: Yueyang Zhong, Amy R. Ward, Amber L. Puha

Abstract: We formulate a control problem for a GI/GI/N+GI queue, whose objective is to trade off the long-run average operational costs (i.e., abandonment costs and holding costs) with server utilization costs. To solve the control problem, we consider an asymptotic regime in which the arrival rate and the number of servers grow large. The solution to an associated fluid control problem motivates that non-idling service disciplines are not in general optimal, unless some arrivals are turned away. We propose an admission control policy designed to ensure servers have sufficient idle time that we show is asymptotically optimal.

Submitted 7 November, 2023; v1 submitted 21 June, 2021; originally announced June 2021.

arXiv:2104.11003 [pdf, other]

An explicit order matching for $L(3,n)$ from several approaches and its extension for $L(4,n)$

Authors: Guoce Xin, Yueming Zhong

Abstract: Let $L(m,n)$ denote Young's lattice consisting of all partitions whose Young diagrams are contained in the $m\times n$ rectangle. It is a well-known result that the poset $L(m,n)$ is rank symmetric, rank unimodal, and Sperner. A direct proof of this result by finding an explicit order matching of $L(m,n)$ is an outstanding open problem. In this paper, we present an explicit order matching $\varphi$ for $L(3,n)$ by several different approaches, and give a chain tableau version of $\varphi$ that is very helpful in finding patterns. It is surprising that the greedy algorithm and a recursive knead process also give the same order matching. Our methods extend to $L(4,n)$.

Submitted 22 April, 2021; originally announced April 2021.

Comments: 26 pages, 22 figures

arXiv:2104.06566 [pdf, ps, other]

Inverse Boundary Problem for the Two Photon Absorption Transport Equation

Authors: Plamen Stefanov, Yimin Zhong

Abstract: This work studies the inverse boundary problem for the two photon absorption radiative transport equation. We show that the absorption coefficients and scattering coefficients can be uniquely determined from the \emph{albedo} operator. If scattering is absent, we do not require smallness of the incoming source and the reconstructions of the absorption coefficients are explicit.

Submitted 17 February, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

MSC Class: 35R30; 78A46; 80A23; 92C55

arXiv:2103.11281 [pdf, other]

Acousto-electric Inverse Source Problem

Authors: Wei Li, John C. Schotland, Yang Yang, Yimin Zhong

Abstract: We propose a method to reconstruct the electrical current density inside a conducting medium from acoustically-modulated boundary measurements of the electric potential. We show that the current can be uniquely reconstructed with Lipschitz stability. We also perform numerical simulations to illustrate the analytical results, and explore the partial data setting when measurements are taken only on part of the boundary.

Submitted 20 March, 2021; originally announced March 2021.

MSC Class: 35R30; 35Q60; 78A46

arXiv:2012.12380 [pdf, other]

doi 10.1088/1361-6420/abf318

Quantitative PAT with simplified $P_N$ approximation

Authors: Hongkai Zhao, Yimin Zhong

Abstract: The photoacoustic tomography (PAT) is a hybrid modality that combines the optics and acoustics to obtain high resolution and high contrast imaging of heterogeneous media. In this work, our objective is to study the inverse problem in the quantitative step of PAT which aims to reconstruct the optical coefficients of the governing radiative transport equation from the ultrasound measurements. In our analysis, we take the simplified $P_N$ approximation of the radiative transport equation as the physical model and then show the uniqueness and stability for this modified inverse problem. Numerical simulations based on synthetic data are presented to validate our analysis.

Submitted 11 March, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

Comments: 32 pages

MSC Class: 35R30; 78A46; 80A23; 92C55

arXiv:2012.02334 [pdf, other]

Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data

Authors: Yaofeng Desmond Zhong, Biswadip Dey, Amit Chakraborty

Abstract: The last few years have witnessed an increased interest in incorporating physics-informed inductive bias in deep learning frameworks. In particular, a growing volume of literature has been exploring ways to enforce energy conservation while using neural networks for learning dynamics from observed time-series data. In this work, we survey ten recently proposed energy-conserving neural network models, including HNN, LNN, DeLaN, SymODEN, CHNN, CLNN and their variants. We provide a compact derivation of the theory behind these models and explain their similarities and differences. Their performance is compared in 4 physical systems. We point out the possibility of leveraging some of these energy-conserving models to design energy-based controllers.

Submitted 28 April, 2023; v1 submitted 3 December, 2020; originally announced December 2020.

arXiv:2008.06211 [pdf, ps, other]

A Note on the Gaussian Minimum Conjecture

Authors: Yang-Fan Zhong, Ting Ma, Ze-Chun Hu

Abstract: Let $n\geq 2$ and $(X_i,1\leq i\leq n)$ be a centered Gaussian random vector. The Gaussian minimum conjecture says that $E\left(\min_{1\leq i\leq n}|X_i|\right)\geq E\left(\min_{1\leq i\leq n}|Y_i|\right)$, where $Y_1,\ldots,Y_n$ are independent centered Gaussian random variables with $E(X_i^2)=E(Y_i^2)$ for any $i=1,\ldots,n$. In this note, we will show that this conjecture holds if and only if $n=2$.

Submitted 14 August, 2020; originally announced August 2020.

Comments: 10 pages

arXiv:2008.04383 [pdf, other]

Influence Spread in the Heterogeneous Multiplex Linear Threshold Model

Authors: Yaofeng Desmond Zhong, Vaibhav Srivastava, Naomi Ehrich Leonard

Abstract: The linear threshold model (LTM) has been used to study spread on single-layer networks defined by one inter-agent sensing modality and agents homogeneous in protocol. We define and analyze the heterogeneous multiplex LTM to study spread on multi-layer networks with each layer representing a different sensing modality and agents heterogeneous in protocol. Protocols are designed to distinguish signals from different layers: an agent becomes active if a sufficient number of its neighbors in each of any $a$ of the $m$ layers is active. We focus on Protocol OR, when $a=1$, and Protocol AND, when $a=m$, which model agents that are most and least readily activated, respectively. We develop theory and algorithms to compute the size of the spread at steady state for any set of initially active agents and to analyze the role of distinguished sensing modalities, network structure, and heterogeneity. We show how heterogeneity manages the tension in spreading dynamics between sensitivity to inputs and robustness to disturbances.

Submitted 10 August, 2020; originally announced August 2020.

arXiv:2007.12826 [pdf, other]

The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training

Authors: Andrea Montanari, Yiqiao Zhong

Abstract: Modern neural networks are often operated in a strongly overparametrized regime: they comprise so many parameters that they can interpolate the training set, even if actual labels are replaced by purely random ones. Despite this, they achieve good prediction error on unseen data: interpolating the training set does not lead to a large generalization error. Further, overparametrization appears to be beneficial in that it simplifies the optimization landscape. Here we study these phenomena in the context of two-layer neural networks in the neural tangent (NT) regime. We consider a simple data model, with isotropic covariate vectors in $d$ dimensions, and $N$ hidden neurons. We assume that both the sample size $n$ and the dimension $d$ are large, and they are polynomially related. Our first main result is a characterization of the eigenstructure of the empirical NT kernel in the overparametrized regime $Nd\gg n$. This characterization implies as a corollary that the minimum eigenvalue of the empirical NT kernel is bounded away from zero as soon as $Nd\gg n$, and therefore the network can exactly interpolate arbitrary labels in the same regime. Our second main result is a characterization of the generalization error of NT ridge regression including, as a special case, min-$\ell_2$ norm interpolation. We prove that, as soon as $Nd\gg n$, the test error is well approximated by that of kernel ridge regression with respect to the infinite-width kernel. The latter is in turn well approximated by the error of polynomial ridge regression, whereby the regularization parameter is increased by a `self-induced' term related to the high-degree components of the activation function. The polynomial degree depends on the sample size and the dimension (in particular on $\log n/\log d$).

Submitted 8 June, 2022; v1 submitted 24 July, 2020; originally announced July 2020.

Comments: 83 pages, 5 figures

MSC Class: 62J07; 62H12 ACM Class: I.2.6

arXiv:2007.09516 [pdf, other]

Unique determination of absorption coefficients in a semilinear transport equation

Authors: Kui Ren, Yimin Zhong

Abstract: Motivated by applications in quantitative photoacoustic imaging, we study inverse problems for a semilinear radiative transport equation (RTE) where we intend to reconstruct absorption coefficients in the equation from single and multiple internal data sets. We derive uniqueness and stability results for the inverse transport problem in the absence of scattering (in which case we also derive some explicit reconstruction methods) and in the presence of known scattering.

Submitted 18 July, 2020; originally announced July 2020.

Comments: 30 pages

MSC Class: 35R30; 78A46; 80A23; 85A25; 92C55

arXiv:2003.06078 [pdf, ps, other]

Derivations of the Positive Part of the Two-parameter Quantum Group of type $G_2$

Authors: Yongyue Zhong, Xiaomin Tang

Abstract: In this paper, we compute the derivations of the positive part of the two-parameter quantum group of type $G_2$ by embedding it into a quantum torus. We also show that the Hochschild cohomology group of degree $1$ of this algebra is a two-dimensional vector space over the complex field.

Submitted 12 March, 2020; originally announced March 2020.

Comments: 14 pages

MSC Class: 17B37; 17B62; 17B50

arXiv:2001.10050 [pdf, other]

A fast algorithm for time-dependent radiative transport equation based on integral formulation

Authors: Hongkai Zhao, Yimin Zhong

Abstract: In this work, we introduce a fast numerical algorithm to solve the time-dependent radiative transport equation (RTE). Our method uses the integral formulation of RTE and applies the treecode algorithm to reduce the computational complexity from $O(M^{2+1/d})$ to $O(M^{1+1/d}\log M)$, where $M$ is the number of points in the physical domain. The error analysis is presented and numerical experiments are performed to validate our algorithm.

Submitted 27 January, 2020; originally announced January 2020.

MSC Class: 45K05; 65N22; 65N99; 65R20; 65Y10

arXiv:1912.01829 [pdf, other]

On Parity Unimodality of $q$-Catalan Polynomials

Authors: Guoce Xin, Yueming Zhong

Abstract: A polynomial $A(q)=\sum_{i=0}^n a_iq^i$ is said to be unimodal if $a_0\le a_1\le \cdots \le a_k\ge a_{k+1} \ge \cdots \ge a_n$. We investigate the unimodality of rational $q$-Catalan polynomials, which are defined to be $C_{m,n}(q)= \frac{1}{[n+m]} \left[ m+n \atop n\right]$ for a coprime pair of positive integers $(m,n)$. We conjecture that they are unimodal with respect to parity, or equivalently, $(1+q)C_{m,n}(q)$ is unimodal. By using generating functions and the constant term method, we verify our conjecture for $m\le 5$ in a straightforward way.

Submitted 4 December, 2019; originally announced December 2019.

Comments: 16 pages, 3 figures

MSC Class: 05A15; 05A20; 05E05
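
The unimodality condition quoted in the abstract above can be checked directly on a coefficient list. The following is a minimal illustrative sketch, not code from the paper: the helper names `is_unimodal` and `times_one_plus_q` are hypothetical. The example polynomial $1+q^2$ is what the definition gives for $(m,n)=(2,3)$ under the standard $q$-integer and $q$-binomial reading, which is an assumption about the notation.

```python
from typing import List, Sequence


def is_unimodal(coeffs: Sequence[int]) -> bool:
    """Return True if coeffs rise weakly to some peak and then fall weakly."""
    n = len(coeffs)
    i = 0
    # Climb while the sequence is non-decreasing.
    while i + 1 < n and coeffs[i] <= coeffs[i + 1]:
        i += 1
    # Descend while the sequence is non-increasing.
    while i + 1 < n and coeffs[i] >= coeffs[i + 1]:
        i += 1
    # Unimodal exactly when the two passes consumed the whole list.
    return i >= n - 1


def times_one_plus_q(coeffs: Sequence[int]) -> List[int]:
    """Coefficients of (1 + q) * A(q), given the coefficients of A(q)."""
    out = [0] * (len(coeffs) + 1)
    for i, a in enumerate(coeffs):
        out[i] += a      # contribution of 1 * a_i q^i
        out[i + 1] += a  # contribution of q * a_i q^i
    return out


# Illustrative input: 1 + q^2, stored as [a_0, a_1, a_2].
example = [1, 0, 1]
print(is_unimodal(example))                    # the polynomial itself
print(is_unimodal(times_one_plus_q(example)))  # after multiplying by (1 + q)
```

In this small case the polynomial is not unimodal on its own, but its product with $(1+q)$ is, which is the shape of statement the conjecture makes.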

Showing 1–50 of 86 results for author: Zhong, Y