Search | arXiv e-print repository

A Note on Improved bounds for the Oriented Radius of Mixed Multigraphs

Authors: Hengzhe Li, Zhiwei Ding, Jianbing Liu, Yanhong Gao, Shuli Zhao

Abstract: For a positive integer $r$, let $f(r)$ denote the smallest number such that any 2-edge connected mixed graph with radius $r$ has an oriented radius of at most $f(r)$. Recently, Babu, Benson, and Rajendraprasad significantly improved the upper bound of $f(r)$ by establishing that $f(r) \leq 1.5r^2 + r + 1$, see [Improved bounds for the oriented radius of mixed multigraphs, J. Graph Theory, 103 (2023), 674-689]. Additionally, they demonstrated that if each edge of a graph $G$ is contained within a cycle of length at most $η$, then the oriented radius of $G$ is at most $1.5rη$. The authors' results were derived through Observation 1, which served as the foundation for the development of Algorithm ORIENTOUT and Algorithm ORIENTIN. By integrating these algorithms, they obtained the improved bounds. However, an error has been identified in Observation 1, necessitating revisions to Algorithm ORIENTOUT and Algorithm ORIENTIN. In this note, we address the error and propose the necessary modifications to both algorithms, thereby ensuring the correctness of the conclusions.

Submitted 27 June, 2024; originally announced July 2024.

Comments: 7 pages, 1 figure

MSC Class: 05C12; 05C40

arXiv:2406.05624 [pdf, other]

An arbitrary order Reconstructed Discontinuous Approximation to Fourth-order Curl Problem

Authors: Ruo Li, Qicheng Liu, Shuhai Zhao

Abstract: We present an arbitrary order discontinuous Galerkin finite element method for solving the fourth-order curl problem using a reconstructed discontinuous approximation method. It is based on an arbitrarily high-order approximation space with one unknown per element in each dimension. The discrete problem is based on the symmetric IPDG method. We prove a priori error estimates under the energy norm and the L^2 norm and show numerical results to verify the theoretical analysis.

Submitted 8 June, 2024; originally announced June 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2305.03430

arXiv:2405.14134 [pdf, ps, other]

Sharp convergence rate on Schrödinger type operators

Authors: Meng Wang, Shuijiang Zhao

Abstract: For Schrödinger type operators in one dimension, we consider the relationship between the convergence rate and the regularity for initial data. By establishing the associated frequency-localized maximal estimates, we prove sharp results up to the endpoints. The optimal range for the wave operator in all dimensions is also obtained.

Submitted 22 May, 2024; originally announced May 2024.

Comments: 13 pages, 2 figures

MSC Class: 42B25

arXiv:2405.13785 [pdf, other]

Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsampling

Authors: Shifan Zhao, Jiaying Lu, Ji Yang, Edmond Chow, Yuanzhe Xi

Abstract: Gaussian Process Regression (GPR) is widely used in statistics and machine learning for prediction tasks requiring uncertainty measures. Its efficacy depends on the appropriate specification of the mean function, covariance kernel function, and associated hyperparameters. Severe misspecifications can lead to inaccurate results and problematic consequences, especially in safety-critical applications. However, a systematic approach to handle these misspecifications is lacking in the literature. In this work, we propose a general framework to address these issues. Firstly, we introduce a flexible two-stage GPR framework that separates mean prediction and uncertainty quantification (UQ) to prevent mean misspecification, which can introduce bias into the model. Secondly, kernel function misspecification is addressed through a novel automatic kernel search algorithm, supported by theoretical analysis, that selects the optimal kernel from a candidate set. Additionally, we propose a subsampling-based warm-start strategy for hyperparameter initialization to improve efficiency and avoid hyperparameter misspecification. With much lower computational cost, our subsampling-based strategy can yield competitive or better performance than training exclusively on the full dataset. Combining all these components, we recommend two GPR methods, exact and scalable, designed to match available computational resources and specific UQ requirements. Extensive evaluation on real-world datasets, including UCI benchmarks and a safety-critical medical case study, demonstrates the robustness and precision of our methods.

Submitted 22 May, 2024; originally announced May 2024.

ACM Class: G.3; J.3

arXiv:2404.17250 [pdf, ps, other]

Omega Theorems for Logarithmic Derivatives of Zeta and L-functions Near the 1-line

Authors: Zhonghua Li, Shengbo Zhao

Abstract: We establish an omega theorem for the logarithmic derivative of the Riemann zeta function near the 1-line by the resonance method. We show that the inequality $\left| ζ^{\prime}\left(σ_A+it\right)/ζ\left(σ_A+it\right) \right| \geqslant \left(\left(e^A-1\right)/A\right)\log_2 T + O\left(\log_2 T / \log_3 T\right)$ has a solution $t \in [T^β, T]$ for all sufficiently large $T,$ where $σ_A = 1 - A / \log_2 {T}.$ Furthermore, we give a conditional lower bound for the measure of the set of $t$ for which the logarithmic derivative of the Riemann zeta function is large. Moreover, similar results can be generalized to Dirichlet $L$-functions.

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2404.04588 [pdf, ps, other]

On the biases and asymptotics of partitions with finite choices of parts

Authors: Jiyou Li, Sicheng Zhao

Abstract: Biases in integer partitions have been studied recently. For three disjoint subsets $R,S,I$ of positive integers, let $p_{RSI}(n)$ be the number of partitions of $n$ with parts from $R\cup S\cup I$ and $p_{R>S,I}(n)$ be the number of such partitions with more parts from $R$ than from $S$. In this paper, in the case that $R,S,I$ are finite, we obtain a concrete formula for the asymptotic ratio of $p_{R>S,I}(n)$ to $p_{RSI}(n)$. We also propose a conjecture in the case that $R,S$ are certain infinite arithmetic progressions.

Submitted 6 April, 2024; originally announced April 2024.

Comments: 15 pages

MSC Class: 05A17 11P81

arXiv:2403.19687 [pdf, ps, other]

Weighted low-lying zeros of L-functions attached to Siegel modular forms

Authors: Shifan Zhao

Abstract: In this paper, we study weighted low-lying zeros of spinor and standard $L$-functions attached to degree 2 Siegel modular forms. We show the symmetry type of weighted low-lying zeros of spinor $L$-functions is symplectic, for test functions whose Fourier transforms have support in $(-1,1)$, extending the previous range $(-\frac{4}{15},\frac{4}{15})$ by E. Kowalski, A. Saha and J. Tsimerman. We then show the symmetry type of weighted low-lying zeros of standard $L$-functions is also symplectic. We further extend the range of support by performing an average over weight. As an application, we discuss non-vanishing of central values of those $L$-functions.

Submitted 1 March, 2024; originally announced March 2024.

Comments: 21 pages

MSC Class: Primary 11F46; 11F66; 11F72

arXiv:2403.04361 [pdf, other]

Subsampling for Big Data Linear Models with Measurement Errors

Authors: Jiangshan Ju, Mingqiu Wang, Shengli Zhao

Abstract: Subsampling algorithms for various parametric regression models with massive data have been extensively investigated in recent years. However, all existing studies on subsampling heavily rely on clean massive data. In practical applications, the observed covariates may suffer from inaccuracies due to measurement errors. To address the challenge of large datasets with measurement errors, this study explores two subsampling algorithms based on the corrected likelihood approach: the optimal subsampling algorithm utilizing inverse probability weighting and the perturbation subsampling algorithm employing random weighting assuming a perfectly known distribution. Theoretical properties for both algorithms are provided. Numerical simulations and two real-world examples demonstrate the effectiveness of these proposed methods compared to other uncorrected algorithms.

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2402.13479 [pdf, ps, other]

Some inequalities for adjointable operators on Hilbert $C^*$-modules

Authors: Mohammad Sababheh, Hamid Reza Moradi, Qingxiang Xu, Shuo Zhao

Abstract: The main purpose of this paper is, in the general setting of the adjointable operators on Hilbert $C^*$-modules, to develop two new tools that can be applied to deal with the positive solutions of certain operator equations, the operator norm as well as the numerical radius, respectively. Among other things, the positivity of a $2\times 2$ block operator matrix is clarified without any preconditions on its entries, and a generalized version of the mixed Schwarz inequality with a parameter is derived. Numerical examples are provided to illustrate the non-triviality of this newly obtained inequality.

Submitted 12 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.07210 [pdf, other]

Fukushima Nuclear Wastewater Discharge: An Evolutionary Game Theory Approach to International and Domestic Interaction and Strategic Decision-Making

Authors: Mingyang Li, Han Pengsihua, Songqing Zhao, Zejun Wang, Limin Yang, Weian Liu

Abstract: On August 24, 2023, Japan controversially decided to discharge nuclear wastewater from the Fukushima Daiichi Nuclear Power Plant into the ocean, sparking intense domestic and global debates. This study uses evolutionary game theory to analyze the strategic dynamics between Japan, other countries, and the Japan Fisheries Association. By incorporating economic, legal, international aid, and environmental factors, the research identifies three evolutionarily stable strategies, analyzing them via numerical simulations. The focus is on Japan's shift from wastewater release to its cessation, exploring the myriad factors influencing this transition and their effects on stakeholders' decisions. Key insights highlight the need for international cooperation, rigorous scientific research, public education, and effective wastewater treatment methods. Offering both a fresh theoretical perspective and practical guidance, this study aims to foster global consensus on nuclear wastewater management, crucial for marine conservation and sustainable development.

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2401.06872 [pdf, other]

Disease Transmission on Random Graphs Using Edge-Based Percolation

Authors: S. Zhao, F. M. G. Magpantay

Abstract: Edge-based percolation methods can be used to analyze disease transmission on complex social networks. This allows us to include complex social heterogeneity in our models while maintaining tractability. Here we review the seminal works in this field by Newman et al (2001); Newman (2002, 2003), and Miller et al (2012). We present a systematic discussion of the theoretical background behind these models, including an extensive derivation of the major results. We also connect these results back to the classical literature in random graph theory by Molloy and Reed (1995, 1998). Finally, we also present an accompanying R package that takes epidemic and network parameters as input and generates estimates of the epidemic trajectory and final size. This manuscript and the R package were developed to help researchers easily understand and use network models to investigate the interaction between different community structures and disease transmission.

Submitted 27 May, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

MSC Class: 00A71; 37N25; 92D25; 92D30

arXiv:2311.05505 [pdf, other]

On regular 2-path Hamiltonian graphs

Authors: Xia Li, Weihua Yang, Bo Zhang, Shuang Zhao

Abstract: Kronk introduced the $l$-path hamiltonianicity of graphs in 1969. A graph is $l$-path Hamiltonian if every path of length not exceeding $l$ is contained in a Hamiltonian cycle. We have shown that if $P=uvz$ is a 2-path of a 2-connected, $k$-regular graph on at most $2k$ vertices and $G - V(P)$ is connected, then there must exist a Hamiltonian cycle in $G$ that contains the 2-path $P$. In this paper, we characterize a class of graphs that illustrate the sharpness of the bound $2k$. Additionally, we show that by excluding the class of graphs, both 2-connected, $k$-regular graphs on at most $2k + 1$ vertices and 3-connected, $k$-regular graphs on at most $3k-6$ vertices satisfy that there is a Hamiltonian cycle containing the 2-path $P$ if $G\setminus V(P)$ is connected.

Submitted 1 November, 2023; originally announced November 2023.

Comments: 20. arXiv admin note: text overlap with arXiv:2203.04345

arXiv:2311.02498 [pdf, other]

Berkovich dynamics of twisted rational maps

Authors: Hongming Nie, Shengyuan Zhao

Abstract: A twisted rational map over a non-archimedean field $K$ is the composition of a rational function over $K$ and a continuous automorphism of $K$. We explore the dynamics of some twisted rational maps on the Berkovich projective line.

Submitted 4 November, 2023; originally announced November 2023.

Comments: 24 pages

MSC Class: 37P40; 37P50

arXiv:2310.08333 [pdf, other]

GeNIOS: an (almost) second-order operator-splitting solver for large-scale convex optimization

Authors: Theo Diamandis, Zachary Frangella, Shipu Zhao, Bartolomeo Stellato, Madeleine Udell

Abstract: We introduce the GEneralized Newton Inexact Operator Splitting solver (GeNIOS) for large-scale convex optimization. GeNIOS speeds up ADMM by approximately solving approximate subproblems: it uses a second-order approximation to the most challenging ADMM subproblem and solves it inexactly with a fast randomized solver. Despite these approximations, GeNIOS retains the convergence rate of classic ADMM and can detect primal and dual infeasibility from the algorithm iterates. At each iteration, the algorithm solves a positive-definite linear system that arises from a second-order approximation of the first subproblem and computes an approximate proximal operator. GeNIOS solves the linear system using an indirect solver with a randomized preconditioner, making it particularly useful for large-scale problems with dense data. Our high-performance open-source implementation in Julia allows users to specify convex optimization problems directly (with or without conic reformulation) and allows extensive customization. We illustrate GeNIOS's performance on a variety of problem types. Notably, GeNIOS is more than ten times faster than existing solvers on large-scale, dense problems.

Submitted 12 October, 2023; originally announced October 2023.

arXiv:2309.02014 [pdf, other]

PROMISE: Preconditioned Stochastic Optimization Methods by Incorporating Scalable Curvature Estimates

Authors: Zachary Frangella, Pratik Rathore, Shipu Zhao, Madeleine Udell

Abstract: This paper introduces PROMISE ($\textbf{Pr}$econditioned Stochastic $\textbf{O}$ptimization $\textbf{M}$ethods by $\textbf{I}$ncorporating $\textbf{S}$calable Curvature $\textbf{E}$stimates), a suite of sketching-based preconditioned stochastic gradient algorithms for solving large-scale convex optimization problems arising in machine learning. PROMISE includes preconditioned versions of SVRG, SAGA, and Katyusha; each algorithm comes with a strong theoretical analysis and effective default hyperparameter values. In contrast, traditional stochastic gradient methods require careful hyperparameter tuning to succeed, and degrade in the presence of ill-conditioning, a ubiquitous phenomenon in machine learning. Empirically, we verify the superiority of the proposed algorithms by showing that, using default hyperparameter values, they outperform or match popular tuned stochastic gradient optimizers on a test bed of $51$ ridge and logistic regression problems assembled from benchmark machine learning repositories. On the theoretical side, this paper introduces the notion of quadratic regularity in order to establish linear convergence of all proposed methods even when the preconditioner is updated infrequently. The speed of linear convergence is determined by the quadratic regularity ratio, which often provides a tighter bound on the convergence rate compared to the condition number, both in theory and in practice, and explains the fast global linear convergence of the proposed methods.

Submitted 13 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: 52 pages, 9 Figures

arXiv:2309.00232 [pdf, ps, other]

The Existence of Hamilton Cycle in n-Balanced k-Partite Graphs

Authors: Zongyuan Yang, Yi Zhang, Shichang Zhao

Abstract: Let $G_{k,n}$ be the $n$-balanced $k$-partite graph, whose vertex set can be partitioned into $k$ parts, each of which has $n$ vertices. In this paper, we prove that if $k \geq 2,n \geq 1$, for the edge set $E(G)$ of $G_{k,n}$ $$|E(G)| \geq\left\{\begin{array}{cc} 1 & \text{if } k=2, n=1 \\ n^{2} C_{k}^{2}-(k-1) n+2 & \text{otherwise} \end{array}\right.$$ then $G_{k,n}$ is hamiltonian. And the result may be the best.

Submitted 31 August, 2023; originally announced September 2023.

arXiv:2308.11114 [pdf, ps, other]

On Möbius functions from automorphic forms and a generalized Sarnak's conjecture

Authors: Zhining Wei, Shifan Zhao

Abstract: In this paper, we consider Möbius functions associated with two types of $L$-functions: Rankin-Selberg $L$-functions of symmetric powers of distinct holomorphic cusp forms and $L$-functions of Maass cusp forms. We show that these Möbius functions are weakly orthogonal to bounded sequences. As a direct corollary, a generalized Sarnak's conjecture holds for these two types of Möbius functions.

Submitted 21 August, 2023; originally announced August 2023.

Comments: 13 pages

MSC Class: 11F66; 11F30

arXiv:2307.14776 [pdf, other]

A Variance-Reduced Aggregation Based Gradient Tracking method for Distributed Optimization over Directed Networks

Authors: Shengchao Zhao, Siyuan Song, Yongchao Liu

Abstract: This paper studies the distributed optimization problem over directed networks with noisy information-sharing. To resolve the imperfect communication issue over directed networks, a series of noise-robust variants of the Push-Pull/AB method have been developed. These methods improve the robustness of the Push-Pull method against information-sharing noise by adding small factors to the weight matrices and replacing global gradient tracking with cumulative gradient tracking. Based on these two techniques, we propose a new variant of the Push-Pull method by presenting a novel mechanism of inter-agent information aggregation, named variance-reduced aggregation (VRA). VRA helps us to relax some conditions on the objective function and networks. When the objective function is convex and the information-sharing noise is variance-unbounded, it can be shown that the proposed method converges to the optimal solution almost surely. When the objective function is strongly convex and the information-sharing noise is variance-bounded, the proposed method achieves the convergence rate of $\mathcal{O}\left(k^{-(1-ε)}\right)$ in the mean square sense, where $ε$ can be arbitrarily close to 0. Simulated experiments on ridge regression problems verify the effectiveness of the proposed method.

Submitted 27 July, 2023; originally announced July 2023.

arXiv:2307.10521 [pdf]

Boundary integrated neural networks (BINNs) for acoustic radiation and scattering

Authors: Wenzhen Qu, Yan Gu, Shengdong Zhao, Fajie Wang

Abstract: This paper presents a novel approach called the boundary integrated neural networks (BINNs) for analyzing acoustic radiation and scattering. The method introduces fundamental solutions of the time-harmonic wave equation to encode the boundary integral equations (BIEs) within the neural networks, replacing the conventional use of the governing equation in physics-informed neural networks (PINNs). This approach offers several advantages. Firstly, the input data for the neural networks in the BINNs only require the coordinates of "boundary" collocation points, making it highly suitable for analyzing acoustic fields in unbounded domains. Secondly, the loss function of the BINNs is not of composite form and converges quickly. Thirdly, the BINNs achieve comparable precision to the PINNs using fewer collocation points and hidden layers/neurons. Finally, the semi-analytic characteristic of the BIEs contributes to the higher precision of the BINNs. Numerical examples are presented to demonstrate the performance of the proposed method.

Submitted 19 July, 2023; originally announced July 2023.

arXiv:2306.16883 [pdf, ps, other]

Quantitative stability of a nonlocal Sobolev inequality

Authors: Paolo Piccione, Minbo Yang, Shuneng Zhao

Abstract: In this paper, we study the quantitative stability of the nonlocal Sobolev inequality \begin{equation*} S_{HL}\left(\int_{\mathbb{R}^N}\big(|x|^{-μ} \ast |u|^{2_μ^{\ast}}\big)|u|^{2_μ^{\ast}} dx\right)^{\frac{1}{2_μ^{\ast}}}\leq\int_{\mathbb{R}^N}|\nabla u|^2 dx , \quad \forall~u\in \mathcal{D}^{1,2}(\mathbb{R}^N), \end{equation*} where $2_μ^{\ast}=\frac{2N-μ}{N-2}$ and $S_{HL}$ is a positive constant depending only on $N$ and $μ$. For $N\geq3$ and $0<μ<N$, it is well-known that, up to translation and scaling, the nonlocal Sobolev inequality has a unique extremal function $W[ξ,λ]$ which is positive and radially symmetric. We first prove a result of quantitative stability of the nonlocal Sobolev inequality with the level of gradients. Secondly, we also establish the stability of profile decomposition to the Euler-Lagrange equation of the above inequality for nonnegative functions. Finally, we study the stability of the nonlocal Sobolev inequality \begin{equation*} \Big\|\nabla u-\sum_{i=1}^κ\nabla W[ξ_i,λ_i]\Big\|_{L^2}\leq C\Big\|Δu+\left(\frac{1}{|x|^μ}\ast |u|^{2_μ^{\ast}}\right)|u|^{2_μ^{\ast}-2}u\Big\|_{(\mathcal{D}^{1,2}(\mathbb{R}^N))^{-1}} \end{equation*} with the parameter region $κ\geq2$, $3\leq N<6-μ$, $μ\in(0,N)$ satisfying $0<μ\leq4$, or dimension $N\geq3$ and $κ=1$, $μ\in(0,N)$ satisfying $0<μ\leq4$.

Submitted 29 June, 2023; originally announced June 2023.

arXiv:2306.01757 [pdf, ps, other]

State estimation for one-dimensional agro-hydrological processes with model mismatch

Authors: Zhuangyu Liu, **feng Liu, Shunyi Zhao, Xiaoli Luan, Fei Liu

Abstract: The importance of accurate soil moisture data for the development of modern closed-loop irrigation systems cannot be overstated. Due to the diversity of soil, it is difficult to obtain an accurate model for agro-hydrological system. In this study, soil moisture estimation in 1D agro-hydrological systems with model mismatch is the focus. To address the problem of model mismatch, a nonlinear state-space model derived from the Richards equation is utilized, along with additive unknown inputs. The determination of the number of sensors required is achieved through sensitivity analysis and the orthogonalization projection method. To estimate states and unknown inputs in real-time, a recursive expectation maximization (EM) algorithm derived from the conventional EM algorithm is employed. During the E-step, the extended Kalman filter (EKF) is used to compute states and covariance in the recursive Q-function, while in the M-step, unknown inputs are updated by locally maximizing the recursive Q-function. The estimation performance is evaluated using comprehensive simulations. Through this method, accurate soil moisture estimation can be obtained, even in the presence of model mismatch.

Submitted 24 May, 2023; originally announced June 2023.

arXiv:2306.00325 [pdf, other]

doi 10.1137/23M1576360

NLTGCR: A class of Nonlinear Acceleration Procedures based on Conjugate Residuals

Authors: Huan He, Ziyuan Tang, Shifan Zhao, Yousef Saad, Yuanzhe Xi

Abstract: This paper develops a new class of nonlinear acceleration algorithms based on extending conjugate residual-type procedures from linear to nonlinear equations. The main algorithm has strong similarities with Anderson acceleration as well as with inexact Newton methods - depending on which variant is implemented. We prove theoretically and verify experimentally, on a variety of problems from simulation experiments to deep learning applications, that our method is a powerful accelerated iterative algorithm.

Submitted 30 March, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

Journal ref: SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, pp. 1-827 (2024)

arXiv:2305.16857 [pdf, ps, other]

Remainder terms of a nonlocal Sobolev inequality

Authors: Shengbing Deng, Xingliang Tian, Minbo Yang, Shunneng Zhao

Abstract: In this note we study a nonlocal version of the Sobolev inequality \begin{equation*} \int_{\mathbb{R}^N}|\nabla u|^2 dx \geq S_{HLS}\left(\int_{\mathbb{R}^N}\big(|x|^{-α} \ast u^{2_α^{\ast}}\big)u^{2_α^{\ast}} dx\right)^{\frac{1}{2_α^{\ast}}}, \quad \forall u\in \mathcal{D}^{1,2}(\mathbb{R}^N), \end{equation*} where $S_{HLS}$ is the best constant, $\ast$ denotes the standard convolution and $\mathcal{D}^{1,2}(\mathbb{R}^N)$ denotes the classical Sobolev space with respect to the norm $\|u\|_{\mathcal{D}^{1,2}(\mathbb{R}^N)}=\|\nabla u\|_{L^2(\mathbb{R}^N)}$. By using the nondegeneracy property of the extremal functions, we prove the existence of a gradient type remainder term and a remainder term in the weak $L^{\frac{N}{N-2}}$-norm of the above inequality for all $0<α<N$.

Submitted 26 May, 2023; originally announced May 2023.

Comments: 15 pages

MSC Class: 35P30; 35J20

arXiv:2305.13218 [pdf, other]

Ground truth clustering is not the optimum clustering

Authors: Lucia Absalom Bautista, Timotej Hrga, Janez Povh, Shudian Zhao

Abstract: The clustering of data is one of the most important and challenging topics in data science. The minimum sum-of-squares clustering (MSSC) problem asks to cluster the data points into $k$ clusters such that the sum of squared distances between the data points and their cluster centers (centroids) is minimized. This problem is NP-hard, but there exist exact solvers that can solve such problem to optimality for small or medium size instances. In this paper, we use a branch-and-bound solver based on semidefinite programming relaxations called SOS-SDP to compute the optimum solutions of the MSSC problem for various $k$ and for multiple datasets, with real and artificial data, for which the data provider has provided ground truth clustering. Next, we use several extrinsic and intrinsic measures to evaluate how the optimum clustering and ground truth clustering matches, and how well these clusterings perform with respect to the criteria underlying the intrinsic measures. Our calculations show that the ground truth clusterings are generally far from the optimum solution to the MSSC problem. Moreover, the intrinsic measures evaluated on the ground truth clusterings are generally significantly worse compared to the optimum clusterings. However, when the ground truth clustering is in the form of convex sets, e.g., ellipsoids, that are well separated from each other, the ground truth clustering comes very close to the optimum clustering.

Submitted 22 May, 2023; originally announced May 2023.

Comments: 23 pages; 2 figures, 5 tables
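
As a rough illustration of the MSSC objective named in the abstract above, the following sketch computes the sum of squared distances between data points and their cluster centroids for a given labeling. The function name `mssc_objective`, the toy points, and both labelings are assumptions made for illustration only. This is not the SOS-SDP solver; it only evaluates the quantity such a solver would minimize.

```python
from collections import defaultdict


def mssc_objective(points, labels):
    """Sum of squared Euclidean distances from each point to its cluster centroid."""
    clusters = defaultdict(list)
    for point, label in zip(points, labels):
        clusters[label].append(point)

    total = 0.0
    for members in clusters.values():
        dim = len(members[0])
        # Centroid = coordinate-wise mean of the cluster members.
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        total += sum((p[d] - centroid[d]) ** 2 for p in members for d in range(dim))
    return total


# Hypothetical toy data: four points on the corners of a long, thin rectangle.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
ground_truth = [0, 1, 0, 1]   # assumed "provider" labels (grouped by row)
alternative = [0, 0, 1, 1]    # a different labeling (grouped by column)

print(mssc_objective(points, ground_truth))
print(mssc_objective(points, alternative))
```

On this toy input the assumed ground-truth labeling has a larger objective value than the alternative labeling, which mirrors in miniature the kind of gap the abstract reports.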

arXiv:2305.06785 [pdf, other]

Alternating mixed-integer programming and neural network training for approximating stochastic two-stage problems

Authors: Jan Kronqvist, Boda Li, Jan Rolfes, Shudian Zhao

Abstract: The presented work addresses two-stage stochastic programs (2SPs), a broadly applicable model to capture optimization problems subject to uncertain parameters with adjustable decision variables. In case the adjustable or second-stage variables contain discrete decisions, the corresponding 2SPs are known to be NP-complete. The standard approach of forming a single-stage deterministic equivalent problem can be computationally challenging even for small instances, as the number of variables and constraints scales with the number of scenarios. To avoid forming a potentially huge MILP problem, we build upon an approach of approximating the expected value of the second-stage problem by a neural network (NN) and encoding the resulting NN into the first-stage problem. The proposed algorithm alternates between optimizing the first-stage variables and retraining the NN. We demonstrate the value of our approach with the example of computing operating points in power systems by showing that the alternating approach provides improved first-stage decisions and a tighter approximation between the expected objective and its neural network approximation.

Submitted 19 July, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

Comments: 16 pages, 2 figures

MSC Class: 90C15; 90C11; 90-10

arXiv:2304.05460 [pdf, other]

An Adaptive Factorized Nyström Preconditioner for Regularized Kernel Matrices

Authors: Shifan Zhao, Tianshi Xu, Hua Huang, Edmond Chow, Yuanzhe Xi

Abstract: The spectrum of a kernel matrix significantly depends on the parameter values of the kernel function used to define the kernel matrix. This makes it challenging to design a preconditioner for a regularized kernel matrix that is robust across different parameter values. This paper proposes the Adaptive Factorized Nyström (AFN) preconditioner. The preconditioner is designed for the case where the rank k of the Nyström approximation is large, i.e., for kernel function parameters that lead to kernel matrices with eigenvalues that decay slowly. AFN deliberately chooses a well-conditioned submatrix to solve with and corrects a Nyström approximation with a factorized sparse approximate matrix inverse. This makes AFN efficient for kernel matrices with large numerical ranks. AFN also adaptively chooses the size of this submatrix to balance accuracy and cost.

Submitted 9 April, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

arXiv:2303.05576 [pdf, ps, other]

Characterizing bearing equivalence in directed graphs

Authors: Zhiyong Sun, Shiyu Zhao, Daniel Zelazo

Abstract: In this paper, we study bearing equivalence in directed graphs. We first give a strengthened definition of bearing equivalence based on the \textit{kernel equivalence} relationship between bearing rigidity matrix and bearing Laplacian matrix. We then present several conditions to characterize bearing equivalence for both directed acyclic and cyclic graphs. These conditions involve the spectrum and null space of the associated bearing Laplacian matrix for a directed bearing formation. For directed acyclic graphs, all eigenvalues of the associated bearing Laplacian are real and nonnegative, while for directed graphs containing cycles, the bearing Laplacian can have eigenvalues with negative real parts. Several examples of bearing equivalent and bearing non-equivalent formations are given to illustrate these conditions.

Submitted 9 March, 2023; originally announced March 2023.

Comments: Accepted by the 22nd World Congress of the International Federation of Automatic Control

arXiv:2302.10344 [pdf, other]

Model-based feature selection for neural networks: A mixed-integer programming approach

Authors: Shudian Zhao, Calvin Tsay, Jan Kronqvist

Abstract: In this work, we develop a novel input feature selection framework for ReLU-based deep neural networks (DNNs), which builds upon a mixed-integer optimization approach. While the method is generally applicable to various classification tasks, we focus on finding input features for image classification for clarity of presentation. The idea is to use a trained DNN, or an ensemble of trained DNNs, to identify the salient input features. The input feature selection is formulated as a sequence of mixed-integer linear programming (MILP) problems that find sets of sparse inputs that maximize the classification confidence of each category. These ''inverse'' problems are regularized by the number of inputs selected for each category and by distribution constraints. Numerical results on the well-known MNIST and FashionMNIST datasets show that the proposed input feature selection allows us to drastically reduce the size of the input to $\sim$15\% while maintaining a good classification accuracy. This allows us to design DNNs with significantly fewer connections, reducing computational effort and producing DNNs that are more robust towards adversarial attacks.

Submitted 20 February, 2023; originally announced February 2023.

Comments: 15 pages, 3 figures, 5 tables

arXiv:2302.03863 [pdf, other]

On the (linear) convergence of Generalized Newton Inexact ADMM

Authors: Zachary Frangella, Shipu Zhao, Theo Diamandis, Bartolomeo Stellato, Madeleine Udell

Abstract: This paper presents GeNI-ADMM, a framework for large-scale composite convex optimization, that facilitates theoretical analysis of both existing and new approximate ADMM schemes. GeNI-ADMM encompasses any ADMM algorithm that solves a first- or second-order approximation to the ADMM subproblem inexactly. GeNI-ADMM exhibits the usual $\mathcal O(1/t)$-convergence rate under standard hypotheses and converges linearly under additional hypotheses such as strong convexity. Further, the GeNI-ADMM framework provides explicit convergence rates for ADMM variants accelerated with randomized linear algebra, such as NysADMM and sketch-and-solve ADMM, resolving an important open question on the convergence of these methods. This analysis quantifies the benefit of improved approximations and can inspire the design of new ADMM variants with faster convergence.

Submitted 1 February, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

Comments: 37 pages, 4 figures, 2 tables

arXiv:2301.07869 [pdf, ps, other]

On Siegel Zeros of Symmetric Power L-functions

Authors: Shifan Zhao

Abstract: Let $f$ be a holomorphic cusp form of even weight $k$ for the modular group $SL(2,\mathbb{Z})$, which is assumed to be a common eigenfunction for all Hecke operators. For positive integer $n$, let $\text{Sym}^n(f)$ be the symmetric nth power lifting of $f$ , which was shown by Newton and Thorne to be automorphic and cuspidal. In this paper, we construct certain auxiliary $L$-functions to show that Siegel zeros of $\text{Sym}^n(f)$ do not exist, for each given $n$, utilizing the above functoriality result. As an application, we give a lower bound of those symmetric power $L$-functions at $s=1$ of logarithm power type.

Submitted 20 January, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

Comments: The author was notified that Theorem 1.1 in this note was obtained by Jesse Thorner in his 2021 paper "Effective forms of the Sato-Tate conjecture". The author was not aware of this when he posted it on arXiv. Therefore, the author does not intend to submit this note for publication

MSC Class: 11F11; 11F66; 11F67

arXiv:2211.08597 [pdf, other]

SketchySGD: Reliable Stochastic Optimization via Randomized Curvature Estimates

Authors: Zachary Frangella, Pratik Rathore, Shipu Zhao, Madeleine Udell

Abstract: SketchySGD improves upon existing stochastic gradient methods in machine learning by using randomized low-rank approximations to the subsampled Hessian and by introducing an automated stepsize that works well across a wide range of convex machine learning problems. We show theoretically that SketchySGD with a fixed stepsize converges linearly to a small ball around the optimum. Further, in the ill-conditioned setting we show SketchySGD converges at a faster rate than SGD for least-squares problems. We validate this improvement empirically with ridge regression experiments on real data. Numerical experiments on both ridge and logistic regression problems with dense and sparse data, show that SketchySGD equipped with its default hyperparameters can achieve comparable or better results than popular stochastic gradient methods, even when they have been tuned to yield their best performance. In particular, SketchySGD is able to solve an ill-conditioned logistic regression problem with a data matrix that takes more than $840$GB RAM to store, while its competitors, even when tuned, are unable to make any progress. SketchySGD's ability to work out-of-the box with its default hyperparameters and excel on ill-conditioned problems is an advantage over other stochastic gradient methods, most of which require careful hyperparameter tuning (especially of the learning rate) to obtain good performance and degrade in the presence of ill-conditioning.

Submitted 20 February, 2024; v1 submitted 15 November, 2022; originally announced November 2022.

Comments: 65 pages, 43 figures, 8 tables

arXiv:2211.04532 [pdf, other]

Numerical Methods for Distributed Stochastic Compositional Optimization Problems with Aggregative Structure

Authors: Shengchao Zhao, Yongchao Liu

Abstract: The paper studies the distributed stochastic compositional optimization problems over networks, where all the agents' inner-level function is the sum of each agent's private expectation function. Focusing on the aggregative structure of the inner-level function, we employ the hybrid variance reduction method to obtain the information on each agent's private expectation function, and apply the dynamic consensus mechanism to track the information on each agent's inner-level function. Then by combining with the standard distributed stochastic gradient descent method, we propose a distributed aggregative stochastic compositional gradient descent method. When the objective function is smooth, the proposed method achieves the optimal convergence rate $\mathcal{O}\left(K^{-1/2}\right)$. We further combine the proposed method with the communication compression and propose the communication compressed variant distributed aggregative stochastic compositional gradient descent method. The compressed variant of the proposed method maintains the optimal convergence rate $\mathcal{O}\left(K^{-1/2}\right)$. Simulated experiments on decentralized reinforcement learning verify the effectiveness of the proposed methods.

Submitted 3 November, 2022; originally announced November 2022.

arXiv:2210.16509 [pdf, other]

doi 10.1088/1361-6420/acdaee

Fast Iterative Reconstruction for Multi-spectral CT by a Schmidt Orthogonal Modification Algorithm (SOMA)

Authors: Huiying Pan, Shusen Zhao, Weibin Zhang, Huitao Zhang, Xing Zhao

Abstract: Multi-spectral CT (MSCT) is increasingly used in industrial non-destructive testing and medical diagnosis because of its outstanding performance like material distinguishability. The process of obtaining MSCT data can be modeled as nonlinear equations and the basis material decomposition comes down to the inverse problem of the nonlinear equations. For different spectra data, geometric inconsistent parameters cause geometrical inconsistent rays, which will lead to mismatched nonlinear equations. How to solve the mismatched nonlinear equations accurately and quickly is a hot issue. This paper proposes a general iterative method to invert the mismatched nonlinear equations and develops Schmidt orthogonalization to accelerate convergence. The validity of the proposed method is verified by MSCT basis material decomposition experiments. The results show that the proposed method can decompose the basis material images accurately and improve the convergence speed greatly.

Submitted 29 October, 2022; originally announced October 2022.

arXiv:2210.12573 [pdf, other]

An Efficient Nonlinear Acceleration method that Exploits Symmetry of the Hessian

Authors: Huan He, Shifan Zhao, Ziyuan Tang, Joyce C Ho, Yousef Saad, Yuanzhe Xi

Abstract: Nonlinear acceleration methods are powerful techniques to speed up fixed-point iterations. However, many acceleration methods require storing a large number of previous iterates and this can become impractical if computational resources are limited. In this paper, we propose a nonlinear Truncated Generalized Conjugate Residual method (nlTGCR) whose goal is to exploit the symmetry of the Hessian to reduce memory usage. The proposed method can be interpreted as either an inexact Newton or a quasi-Newton method. We show that, with the help of global strategies like residual check techniques, nlTGCR can converge globally for general nonlinear problems and that under mild conditions, nlTGCR is able to achieve superlinear convergence. We further analyze the convergence of nlTGCR in a stochastic setting. Numerical results demonstrate the superiority of nlTGCR when compared with several other competitive baseline approaches on a few problems. Our code will be available in the future.

Submitted 22 October, 2022; originally announced October 2022.

Comments: Optimization, Short-term recurrence method by exploiting Hessian, Numerical Analysis, Iterative Method, Quasi-Newton, Anderson Acceleration, 31 pages

arXiv:2208.09133 [pdf, ps, other]

Spectrum analysis for the relativistic Boltzmann equation

Authors: Shijia Zhao, Mingying Zhong

Abstract: The spectrum structure of the linearized relativistic Boltzmann equation around a global Maxwellian is studied in this paper. Based on the spectrum analysis, we establish the optimal time-convergence rates of the global solution to the Cauchy problem for the relativistic Boltzmann equation.

Submitted 18 August, 2022; originally announced August 2022.

MSC Class: 76P05; 82C40; 82D05

arXiv:2207.03661 [pdf, ps, other]

Combinatorial meaning of the number of the even parts in a partition of $n$ into distinct parts

Authors: Jiyou Li, Sicheng Zhao

Abstract: In a recent paper, Andrews and Merca investigated the number of even parts in all partitions of $n$ into distinct parts, which arise naturally from the Euler-Glaisher bijective proof. They obtained new combinatorial interpretations for this number by using generating functions. We obtain a new direct combinatorial proof in this note.

Submitted 7 July, 2022; originally announced July 2022.

Comments: 5 pages

MSC Class: 05A17; 11P83
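
To make the counted quantity in the abstract above concrete, here is a minimal brute-force sketch that enumerates all partitions of $n$ into distinct parts and totals their even parts. The helper names `distinct_partitions` and `even_parts_total` are hypothetical, and the enumeration is only a direct check of the definition; it does not reproduce the combinatorial interpretations or the proof discussed in the note.

```python
def distinct_partitions(n, max_part=None):
    """Yield every partition of n into distinct parts as a strictly decreasing tuple."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    # Choose the largest part, then partition the remainder into strictly smaller parts.
    for first in range(min(n, max_part), 0, -1):
        for rest in distinct_partitions(n - first, first - 1):
            yield (first,) + rest


def even_parts_total(n):
    """Total number of even parts over all partitions of n into distinct parts."""
    return sum(1 for partition in distinct_partitions(n)
               for part in partition if part % 2 == 0)


# For n = 6 the partitions are 6, 5+1, 4+2, 3+2+1; the even parts are 6, 4, 2, 2.
print(list(distinct_partitions(6)))
print(even_parts_total(6))
```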

arXiv:2206.14958 [pdf, ps, other]

Construction of infinitely many solutions for a critical Choquard equation via local Pohožaev identities

Authors: Fashun Gao, Vitaly Moroz, Minbo Yang, Shunneng Zhao

Abstract: In this paper, we study a class of the critical Choquard equations with axisymmetric potentials, $$ -Δu+ V(|x'|,x'')u =\Big(|x|^{-4}\ast |u|^{2}\Big)u\hspace{4.14mm}\mbox{in}\hspace{1.14mm} \mathbb{R}^6, $$ where $(x',x'')\in \mathbb{R}^2\times\mathbb{R}^{4}$, $V(|x'|, x'')$ is a bounded nonnegative function in $\mathbb{R}^{+}\times\mathbb{R}^{4}$, and $*$ stands for the standard convolution. The equation is critical in the sense of the Hardy-Littlewood-Sobolev inequality. By applying a finite dimensional reduction argument and developing novel local Pohožaev identities, we prove that if the function $r^2V(r,x'')$ has a topologically nontrivial critical point then the problem admits infinitely many solutions with arbitrary large energies.

Submitted 29 June, 2022; originally announced June 2022.

MSC Class: 35J20; 35J60; 35A15

arXiv:2206.12611 [pdf, ps, other]

Local Uniqueness of blow-up solutions for critical Hartree equations in bounded domain

Authors: Marco Squassina, Minbo Yang, Shunneng Zhao

Abstract: In this paper we are interested in the following critical Hartree equation \begin{equation*} \begin{cases} -Δu =\displaystyle{\Big(\int_Ω\frac{u^{2_μ^\ast} (ξ)}{|x-ξ|^μ}dξ\Big)u^{2_μ^\ast-1}}+\varepsilon u ,~~~\text{in}~Ω,\\ u=0,~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\text{on}~\partialΩ, \end{cases} \end{equation*} where $N\geq4$, $0<μ\leq4$, $\varepsilon>0$ is a small parameter, $Ω$ is a bounded domain in $\mathbb{R}^N$, and $2_μ^\ast=\frac{2N-μ}{N-2}$ is the critical exponent in the sense of the Hardy-Littlewood-Sobolev inequality. By establishing various versions of local Pohozaev identities and applying blow-up analysis, we first investigate the location of the blow-up points for single bubbling solutions to the above Hartree equation. Next we prove the local uniqueness of the blow-up solutions that concentrate at the non-degenerate critical point of the Robin function for $\varepsilon$ small.

Submitted 25 June, 2022; originally announced June 2022.

Comments: 40 pages

MSC Class: 35J61; 35B33

arXiv:2205.10730 [pdf, ps, other]

Orthogonal inner product graphs of odd characteristic and their automorphisms

Authors: Shouxiang Zhao, Hengbin Zhang, Jizhu Nan, Gaohua Tang

Abstract: Let $\mathbb{F}_q$ be a finite field of odd characteristic and $2ν+δ\geq2$ an integer number with $δ=0,1$ or $2$. The orthogonal inner product graph $Oi\big(2ν+δ,q\big)$ over $\mathbb{F}_q$ is defined and the automorphism groups of $Oi\big(2ν+δ,q\big)$ are determined. We show that $Oi\big(2ν+δ,q\big)$ is a disconnected graph if $2ν+δ=2$; otherwise it is not. Moreover, we have two necessary and sufficient conditions for two vertices of $Oi\big(2ν+δ,q\big)$ and two edges of $Oi\big(2ν+δ,q\big)$ respectively are in the same orbit under the action of the automorphism group of $Oi\big(2ν+δ,q\big).$

Submitted 22 May, 2022; originally announced May 2022.

Comments: arXiv admin note: text overlap with arXiv:2205.09426

MSC Class: 05C25; 05C60; 15A63

arXiv:2205.09426 [pdf, ps, other]

Symplectic inner product graphs and their automorphisms

Authors: Hengbin Zhang, Shouxiang Zhao, Jizhu Nan, Gaohua Tang

Abstract: A new graph, called the symplectic inner product graph $Spi\big(2ν,q\big)$, over a finite field $\mathbb{F}_q$ is introduced. We show that $Spi\big(2ν,q\big)$ is connected with diameter $4$ if and only if $ν\geq2$ and the automorphism group of $Spi\big(2ν,q\big)$ is determined. Two necessary and sufficient conditions for two vertices of $Spi\big(2ν,q\big)$ and two edges of $Spi\big(2ν,q\big)$ respectively are in the same orbit under the action of the automorphism group of $Spi\big(2ν,q\big)$ are obtained.

Submitted 24 September, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

MSC Class: 05C25; 05C60; 11E57

arXiv:2205.06788 [pdf, other]

doi 10.1016/j.cor.2022.106088

Partitioning through projections: strong SDP bounds for large graph partition problems

Authors: Frank de Meijer, Renata Sotirov, Angelika Wiegele, Shudian Zhao

Abstract: The graph partition problem (GPP) aims at clustering the vertex set of a graph into a fixed number of disjoint subsets of given sizes such that the sum of weights of edges joining different sets is minimized. This paper investigates the quality of doubly nonnegative (DNN) relaxations, i.e., relaxations having matrix variables that are both positive semidefinite and nonnegative, strengthened by additional polyhedral cuts for two variations of the GPP: the $k$-equipartition and the graph bisection problem. After reducing the size of the relaxations by facial reduction, we solve them by a cutting-plane algorithm that combines an augmented Lagrangian method with Dykstra's projection algorithm. Since many components of our algorithm are general, the algorithm is suitable for solving various DNN relaxations with a large number of cutting planes. We are the first to show the power of DNN relaxations with additional cutting planes for the GPP on large benchmark instances up to 1,024 vertices. Computational results show impressive improvements in strengthened DNN bounds.

Submitted 19 September, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

arXiv:2205.00336 [pdf, other]

A nonparametric regression alternative to empirical Bayes approaches to simultaneous estimation

Authors: Alton Barbehenn, Sihai Dave Zhao

Abstract: The simultaneous estimation of multiple unknown parameters lies at the heart of a broad class of important problems across science and technology. Currently, the state-of-the-art performance in such problems is achieved by nonparametric empirical Bayes methods. However, these approaches still suffer from two major issues. First, they solve a frequentist problem but do so by following Bayesian reasoning, posing a philosophical dilemma that has contributed to somewhat uneasy attitudes toward empirical Bayes methodology. Second, their computation relies on certain density estimates that become extremely unreliable in some complex simultaneous estimation problems. In this paper, we study these issues in the context of the canonical Gaussian sequence problem. We propose an entirely frequentist alternative to nonparametric empirical Bayes methods by establishing a connection between simultaneous estimation and penalized nonparametric regression. We use flexible regularization strategies, such as shape constraints, to derive accurate estimators without appealing to Bayesian arguments. We prove that our estimators achieve asymptotically optimal regret and show that they are competitive with or can outperform nonparametric empirical Bayes methods in simulations and an analysis of spatially resolved gene expression data.

Submitted 29 May, 2023; v1 submitted 30 April, 2022; originally announced May 2022.

arXiv:2204.11633 [pdf, ps, other]

doi 10.1080/03081087.2022.2159920

The polar decomposition of the product of three operators

Authors: Dingyi Du, Qingxiang Xu, Shuo Zhao

Abstract: In the setting of adjointable operators on Hilbert $C^*$-modules, this paper deals with the polar decomposition of the product of three operators. The relationship between the polar decompositions associated with three operators is clarified. Based on this relationship, a formula for the polar decomposition of a multiplicative perturbation of an operator is provided. In addition, some characterizations of the polar decomposition associated with three operators are provided.

Submitted 20 May, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

Comments: 22 pages

MSC Class: 46L08; 47A05

Journal ref: Linear Multilinear Algebra 72 (2024), no. 3, 528--546

arXiv:2204.09639 [pdf, other]

Planar graphs without cycles of length from 4 to 7 are near-bipartite

Authors: Lili Hao, Weihua Yang, Shuang Zhao

Abstract: A graph is near-bipartite if its vertex set can be partitioned into an independent set and a set which induces a forest. In this paper, planar graphs without cycles of length from 4 to 7 are shown to be near-bipartite.

Submitted 10 April, 2022; originally announced April 2022.

Comments: 18

arXiv:2203.11074 [pdf, other]

Distributed Stochastic Compositional Optimization Problems over Directed Networks

Authors: Shengchao Zhao, Yongchao Liu

Abstract: We study the distributed stochastic compositional optimization problems over directed communication networks in which agents privately own a stochastic compositional objective function and collaborate to minimize the sum of all objective functions. We propose a distributed stochastic compositional gradient descent method, where the gradient tracking and the stochastic correction techniques are employed to adapt to the networks' directed structure and increase the accuracy of inner function estimation. When the objective function is smooth, the proposed method achieves the convergence rate $\mathcal{O}\left(k^{-1/2}\right)$ and sample complexity $\mathcal{O}\left(\frac{1}{ε^2}\right)$ for finding the ($ε$)-stationary point. When the objective function is strongly convex, the convergence rate is improved to $\mathcal{O}\left(k^{-1}\right)$. Moreover, the asymptotic normality of Polyak-Ruppert averaged iterates of the proposed method is also presented. We demonstrate the empirical performance of the proposed method on a model-agnostic meta-learning problem and a logistic regression problem.

Submitted 21 March, 2022; originally announced March 2022.

arXiv:2202.11599 [pdf, other]

NysADMM: faster composite convex optimization via low-rank approximation

Authors: Shipu Zhao, Zachary Frangella, Madeleine Udell

Abstract: This paper develops a scalable new algorithm, called NysADMM, to minimize a smooth convex loss function with a convex regularizer. NysADMM accelerates the inexact Alternating Direction Method of Multipliers (ADMM) by constructing a preconditioner for the ADMM subproblem from a randomized low-rank Nyström approximation. NysADMM comes with strong theoretical guarantees: it solves the ADMM subproblem in a constant number of iterations when the rank of the Nyström approximation is the effective dimension of the subproblem regularized Gram matrix. In practice, ranks much smaller than the effective dimension can succeed, so NysADMM uses an adaptive strategy to choose the rank that enjoys analogous guarantees. Numerical experiments on real-world datasets demonstrate that NysADMM can solve important applications, such as the lasso, logistic regression, and support vector machines, in half the time (or less) required by standard solvers. The breadth of problems on which NysADMM beats standard solvers is a surprise: it suggests that ADMM is a dominant paradigm for numerical optimization across a wide range of statistical learning problems that are usually solved with bespoke methods.

Submitted 2 July, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

arXiv:2202.00450 [pdf, other]

Approximation of Images via Generalized Higher Order Singular Value Decomposition over Finite-dimensional Commutative Semisimple Algebra

Authors: Liang Liao, Sen Lin, Lun Li, Xiuwei Zhang, Song Zhao, Yan Wang, Xinqiang Wang, Qi Gao, **gyu Wang

Abstract: Low-rank approximation of images via singular value decomposition is well-received in the era of big data. However, singular value decomposition (SVD) is only for order-two data, i.e., matrices. It is necessary to flatten a higher order input into a matrix or break it into a series of order-two slices to tackle higher order data such as multispectral images and videos with the SVD. Higher order singular value decomposition (HOSVD) extends the SVD and can approximate higher order data using sums of a few rank-one components. We consider the problem of generalizing HOSVD over a finite dimensional commutative algebra. This algebra, referred to as a t-algebra, generalizes the field of complex numbers. The elements of the algebra, called t-scalars, are fixed-size arrays of complex numbers. One can generalize matrices and tensors over t-scalars and then extend many canonical matrix and tensor algorithms, including HOSVD, to obtain higher-performance versions. The generalization of HOSVD is called THOSVD. Its performance in approximating multi-way data can be further improved by an alternating algorithm. THOSVD also unifies a wide range of principal component analysis algorithms. To exploit the potential of generalized algorithms using t-scalars for approximating images, we use a pixel neighborhood strategy to convert each pixel to a "deeper-order" t-scalar. Experiments on publicly available images show that the generalized algorithm over t-scalars, namely THOSVD, compares favorably with its canonical counterparts.

Submitted 25 August, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

Comments: 21 pages, 11 figures, several typos in the appendix corrected

arXiv:2111.02035 [pdf, ps, other]

Classification of pre-Jordan Algebras and Rota-Baxter Operators on Jordan Algebras in Low Dimensions

Authors: Yuze Sun, Zhen Huang, Shilong Zhao, Zheshuai Tian

Abstract: This paper is devoted to the classification of complex pre-Jordan algebras in the sense of isomorphisms in dimensions $\leq$ 3. All Rota-Baxter operators on complex Jordan algebras in dimensions $\leq$ 3 and the induced pre-Jordan algebras are also presented.

Submitted 3 November, 2021; originally announced November 2021.

Comments: The authors thank Chengming Bai for guidance and important discussion. All authors are supported by Innovation and Entrepreneurship Training Program for College Students of Tian** 202010055305

MSC Class: 16T25; 16W10; 17C50; 17C55

arXiv:2110.03978 [pdf, ps, other]

Matching forcing polynomial of generalized Petersen graph GP(n, 2)

Authors: Shuang Zhao

Abstract: Harary et al. and Klein and Randic proposed the forcing number of a perfect matching in mathematics and chemistry, respectively. In detail, the forcing number of a perfect matching M of a graph G is the smallest cardinality of subsets of M that are contained in no other perfect matchings of G. The author and cooperators defined the forcing polynomial of G as the count polynomial for perfect matchings with the same forcing number of G, from which the average forcing number, forcing spectrum, and the maximum and minimum forcing numbers of G can be obtained. Up to now, a few papers have considered the matching forcing problem of non-plane non-bipartite graphs. In this paper, we investigate the forcing polynomials of generalized Petersen graphs GP(n, 2) for n = 5, 6, . . . , 15, which form a typical class of non-plane non-bipartite graphs.

Submitted 8 October, 2021; originally announced October 2021.

Comments: arXiv admin note: text overlap with arXiv:2109.13709

arXiv:2110.02457 [pdf, other]

GDA-AM: On the effectiveness of solving minimax optimization via Anderson Acceleration

Authors: Huan He, Shifan Zhao, Yuanzhe Xi, Joyce C Ho, Yousef Saad

Abstract: Many modern machine learning algorithms such as generative adversarial networks (GANs) and adversarial training can be formulated as minimax optimization. Gradient descent ascent (GDA) is the most commonly used algorithm due to its simplicity. However, GDA can converge to non-optimal minimax points. We propose a new minimax optimization framework, GDA-AM, that views the GDA dynamics as a fixed-point iteration and solves it using Anderson Mixing to converge to the local minimax. It addresses the diverging issue of simultaneous GDA and accelerates the convergence of alternating GDA. We show theoretically that the algorithm can achieve global convergence for bilinear problems under mild conditions. We also empirically show that GDA-AM solves a variety of minimax problems and improves GAN training on several datasets.

Submitted 29 June, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

Comments: 31 Pages, ICLR, minimax, Anderson Acceleration

Showing 1–50 of 123 results for author: Zhao, S