Search | arXiv e-print repository

arXiv:2406.07941 [pdf, other]

Global-in-time energy stability: a powerful analysis tool for the gradient flow problem without maximum principle or Lipschitz assumption

Authors: J. Sun, H. Wang, H. Zhang, X. Qian, S. Song

Abstract: Before proving (unconditional) energy stability for gradient flows, most existing studies either require a strong Lipschitz condition regarding the non-linearity or certain $L^{\infty}$ bounds on the numerical solutions (the maximum principle). However, proving energy stability without such premises is a very challenging task. In this paper, we aim to develop a novel analytical tool, namely global… ▽ More Before proving (unconditional) energy stability for gradient flows, most existing studies either require a strong Lipschitz condition regarding the non-linearity or certain $L^{\infty}$ bounds on the numerical solutions (the maximum principle). However, proving energy stability without such premises is a very challenging task. In this paper, we aim to develop a novel analytical tool, namely global-in-time energy stability, to demonstrate energy dissipation without assuming any strong Lipschitz condition or $L^{\infty}$ boundedness. The fourth-order-in-space Swift-Hohenberg equation is used to elucidate the theoretical results in detail. We also propose a temporal second-order accurate scheme for efficiently solving such a strongly stiff equation. Furthermore, we present the corresponding optimal $L^2$ error estimate and provide several numerical simulations to demonstrate the dynamics. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2404.02476 [pdf, other]

Deep Reinforcement Learning for Traveling Purchaser Problems

Authors: Haofeng Yuan, Rong** Zhu, Wanlu Yang, Shiji Song, Keyou You, Yuli Zhang

Abstract: The traveling purchaser problem (TPP) is an important combinatorial optimization problem with broad applications. Due to the coupling between routing and purchasing, existing works on TPPs commonly address route construction and purchase planning simultaneously, which, however, leads to exact methods with high computational cost and heuristics with sophisticated design but limited performance. In… ▽ More The traveling purchaser problem (TPP) is an important combinatorial optimization problem with broad applications. Due to the coupling between routing and purchasing, existing works on TPPs commonly address route construction and purchase planning simultaneously, which, however, leads to exact methods with high computational cost and heuristics with sophisticated design but limited performance. In sharp contrast, we propose a novel approach based on deep reinforcement learning (DRL), which addresses route construction and purchase planning separately, while evaluating and optimizing the solution from a global perspective. The key components of our approach include a bipartite graph representation for TPPs to capture the market-product relations, and a policy network that extracts information from the bipartite graph and uses it to sequentially construct the route. One significant benefit of our framework is that we can efficiently construct the route using the policy network, and once the route is determined, the associated purchasing plan can be easily derived through linear programming, while, leveraging DRL, we can train the policy network to optimize the global solution objective. Furthermore, by introducing a meta-learning strategy, the policy network can be trained stably on large-sized TPP instances, and generalize well across instances of varying sizes and distributions, even to much larger instances that are never seen during training. Experiments on various synthetic TPP instances and the TPPLIB benchmark demonstrate that our DRL-based approach can significantly outperform well-established TPP heuristics, reducing the optimality gap by 40%-90%, and also showing an advantage in runtime, especially on large-sized instances. △ Less

Submitted 11 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

arXiv:2403.17783 [pdf, ps, other]

Intersecting subsets in finite permutation groups

Authors: CaiHeng Li, Venkata Raghu Tej Pantangi, Shujiao Song, Yilin Xie

Abstract: A subset (subgroup) $S$ of a transitive permutation group $G\leq Sym(Ω)$ is called an intersecting subset (subgroup, resp.) if the ratio $xy^{-1}$ of any elements $x,y\in S$ fixes some point. A transitive group is said to have the EKR property if the size of each intersecting subset is at most the order of the point stabilizer. A nice result of Meagher-Spiga-Tiep (2016) says that 2-transitive perm… ▽ More A subset (subgroup) $S$ of a transitive permutation group $G\leq Sym(Ω)$ is called an intersecting subset (subgroup, resp.) if the ratio $xy^{-1}$ of any elements $x,y\in S$ fixes some point. A transitive group is said to have the EKR property if the size of each intersecting subset is at most the order of the point stabilizer. A nice result of Meagher-Spiga-Tiep (2016) says that 2-transitive permutation groups have the EKR property. In this paper, we systematically study intersecting subsets in more general transitive permutation groups, including primitive (quasiprimitive) groups, rank-3 groups, Suzuki groups, and some special solvable groups. We present new families of groups that have the EKR property, and various families of groups that do not have the EKR property. This paper significantly improves the unpublished version of this paper, and particularly solves Problem 1.4 of it. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: 21 pages, 2 figures

MSC Class: 05E18

arXiv:2312.15421 [pdf, ps, other]

How averaged is the projection?

Authors: Shuang Song

Abstract: Projection operators are important in Analysis, Optimization and Algorithm. It is well known that these operators are firmly nonexpansive. In this paper, we provide an exact result that sharpens this well-known result. We develop the theory of averaged operators and provide a lower bound. We give a result on the avergedness of operator compositions. We also provide some nonlinear examples to illus… ▽ More Projection operators are important in Analysis, Optimization and Algorithm. It is well known that these operators are firmly nonexpansive. In this paper, we provide an exact result that sharpens this well-known result. We develop the theory of averaged operators and provide a lower bound. We give a result on the avergedness of operator compositions. We also provide some nonlinear examples to illustrate our results. △ Less

Submitted 24 December, 2023; originally announced December 2023.

Comments: 9 pages

arXiv:2312.14213 [pdf, other]

doi 10.1609/aaai.v38i8.28661

A Reinforcement-Learning-Based Multiple-Column Selection Strategy for Column Generation

Authors: Haofeng Yuan, Lichang Fang, Shiji Song

Abstract: Column generation (CG) is one of the most successful approaches for solving large-scale linear programming (LP) problems. Given an LP with a prohibitively large number of variables (i.e., columns), the idea of CG is to explicitly consider only a subset of columns and iteratively add potential columns to improve the objective value. While adding the column with the most negative reduced cost can gu… ▽ More Column generation (CG) is one of the most successful approaches for solving large-scale linear programming (LP) problems. Given an LP with a prohibitively large number of variables (i.e., columns), the idea of CG is to explicitly consider only a subset of columns and iteratively add potential columns to improve the objective value. While adding the column with the most negative reduced cost can guarantee the convergence of CG, it has been shown that adding multiple columns per iteration rather than a single column can lead to faster convergence. However, it remains a challenge to design a multiple-column selection strategy to select the most promising columns from a large number of candidate columns. In this paper, we propose a novel reinforcement-learning-based (RL) multiple-column selection strategy. To the best of our knowledge, it is the first RL-based multiple-column selection strategy for CG. The effectiveness of our approach is evaluated on two sets of problems: the cutting stock problem and the graph coloring problem. Compared to several widely used single-column and multiple-column selection strategies, our RL-based multiple-column selection strategy leads to faster convergence and achieves remarkable reductions in the number of CG iterations and runtime. △ Less

Submitted 28 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence 38(8) (2024) 8209-8216

arXiv:2312.06980 [pdf, other]

SPFNO: Spectral operator learning for PDEs with Dirichlet and Neumann boundary conditions

Authors: Ziyuan Liu, Yuhang Wu, Daniel Zhengyu Huang, Hong Zhang, Xu Qian, Songhe Song

Abstract: Neural operators have been validated as promising deep surrogate models for solving partial differential equations (PDEs). Despite the critical role of boundary conditions in PDEs, however, only a limited number of neural operators robustly enforce these conditions. In this paper we introduce semi-periodic Fourier neural operator (SPFNO), a novel spectral operator learning method, to learn the tar… ▽ More Neural operators have been validated as promising deep surrogate models for solving partial differential equations (PDEs). Despite the critical role of boundary conditions in PDEs, however, only a limited number of neural operators robustly enforce these conditions. In this paper we introduce semi-periodic Fourier neural operator (SPFNO), a novel spectral operator learning method, to learn the target operators of PDEs with non-periodic BCs. This method extends our previous work (arXiv:2206.12698), which showed significant improvements by employing enhanced neural operators that precisely satisfy the boundary conditions. However, the previous work is associated with Gaussian grids, restricting comprehensive comparisons across most public datasets. Additionally, we present numerical results for various PDEs such as the viscous Burgers' equation, Darcy flow, incompressible pipe flow, and coupled reactiondiffusion equations. These results demonstrate the computational efficiency, resolution invariant property, and BC-satisfaction behavior of proposed model. An accuracy improvement of approximately 1.7X-4.7X over the non-BC-satisfying baselines is also achieved. Furthermore, our studies on SOL underscore the significance of satisfying BCs as a criterion for deep surrogate models of PDEs. △ Less

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2308.04073 [pdf, other]

doi 10.4208/cicp.OA-2023-0058

Learning Specialized Activation Functions for Physics-informed Neural Networks

Authors: Honghui Wang, Lu Lu, Shiji Song, Gao Huang

Abstract: Physics-informed neural networks (PINNs) are known to suffer from optimization difficulty. In this work, we reveal the connection between the optimization difficulty of PINNs and activation functions. Specifically, we show that PINNs exhibit high sensitivity to activation functions when solving PDEs with distinct properties. Existing works usually choose activation functions by inefficient trial-a… ▽ More Physics-informed neural networks (PINNs) are known to suffer from optimization difficulty. In this work, we reveal the connection between the optimization difficulty of PINNs and activation functions. Specifically, we show that PINNs exhibit high sensitivity to activation functions when solving PDEs with distinct properties. Existing works usually choose activation functions by inefficient trial-and-error. To avoid the inefficient manual selection and to alleviate the optimization difficulty of PINNs, we introduce adaptive activation functions to search for the optimal function when solving different problems. We compare different adaptive activation functions and discuss their limitations in the context of PINNs. Furthermore, we propose to tailor the idea of learning combinations of candidate activation functions to the PINNs optimization, which has a higher requirement for the smoothness and diversity on learned functions. This is achieved by removing activation functions which cannot provide higher-order derivatives from the candidate set and incorporating elementary functions with different properties according to our prior knowledge about the PDE at hand. We further enhance the search space with adaptive slopes. The proposed adaptive activation function can be used to solve different PDE systems in an interpretable way. Its effectiveness is demonstrated on a series of benchmarks. Code is available at https://github.com/LeapLabTHU/AdaAFforPINNs. △ Less

Submitted 8 August, 2023; originally announced August 2023.

Journal ref: Commun. Comput. Phys., 34 (2023), pp. 869-906

arXiv:2307.14776 [pdf, other]

A Variance-Reduced Aggregation Based Gradient Tracking method for Distributed Optimization over Directed Networks

Authors: Shengchao Zhao, Siyuan Song, Yongchao Liu

Abstract: This paper studies the distributed optimization problem over directed networks with noisy information-sharing. To resolve the imperfect communication issue over directed networks, a series of noise-robust variants of Push-Pull/AB method have been developed. These methods improve the robustness of Push-Pull method against the information-sharing noise through adding small factors on weight matrices… ▽ More This paper studies the distributed optimization problem over directed networks with noisy information-sharing. To resolve the imperfect communication issue over directed networks, a series of noise-robust variants of Push-Pull/AB method have been developed. These methods improve the robustness of Push-Pull method against the information-sharing noise through adding small factors on weight matrices and replacing the global gradient tracking with the cumulative gradient tracking. Based on the two techniques, we propose a new variant of the Push-Pull method by presenting a novel mechanism of inter-agent information aggregation, named variance-reduced aggregation (VRA). VRA helps us to release some conditions on the objective function and networks. When the objective function is convex and the sharing-information noise is variance-unbounded, it can be shown that the proposed method converges to the optimal solution almost surely. When the objective function is strongly convex and the sharing-information noise is variance-bounded, the proposed method achieves the convergence rate of $\mathcal{O}\left(k^{-(1-ε)}\right)$ in the mean square sense, where $ε$ could be close to 0 infinitely. Simulated experiments on ridge regression problems verify the effectiveness of the proposed method. △ Less

Submitted 27 July, 2023; originally announced July 2023.

arXiv:2211.09346 [pdf, other]

A class of inexact block factorization preconditioners for indefinite matrices with a three-by-three block structure

Authors: Sheng-Zhong Song, Zheng-Da Huang

Abstract: We consider using the preconditioned-Krylov subspace method to solve the system of linear equations with a three-by-three block structure. By making use of the three-by-three block structure, eight inexact block factorization preconditioners, which can be put into a same theoretical analysis frame, are proposed based on a kind of inexact factorization. By generalizing Bendixson Theorem and develop… ▽ More We consider using the preconditioned-Krylov subspace method to solve the system of linear equations with a three-by-three block structure. By making use of the three-by-three block structure, eight inexact block factorization preconditioners, which can be put into a same theoretical analysis frame, are proposed based on a kind of inexact factorization. By generalizing Bendixson Theorem and develo** a unified technique of spectral equivalence, the bounds of the real and imaginary parts of eigenvalues of the preconditioned matrices are obtained. The comparison to eleven existed exact and inexact preconditioners shows that three of the proposed preconditioners can lead to high-speed and effective preconditioned-GMRES in most tests. △ Less

Submitted 17 November, 2022; originally announced November 2022.

arXiv:2210.08832 [pdf, ps, other]

Mutual Information Density of Massive MIMO Systems over Rayleigh-Product Channels

Authors: Xin Zhang, Shenghui Song

Abstract: The Rayleigh-product channel model is utilized to characterize the rank deficiency caused by keyhole effects. However, the finite blocklength analysis for Rayleigh product channels is not available in the literature. In this paper, we will characterize the mutual information density (MID) and perform the FBL analysis to reveal the impact of rank-deficiency in Rayleigh-product channels. To this end… ▽ More The Rayleigh-product channel model is utilized to characterize the rank deficiency caused by keyhole effects. However, the finite blocklength analysis for Rayleigh product channels is not available in the literature. In this paper, we will characterize the mutual information density (MID) and perform the FBL analysis to reveal the impact of rank-deficiency in Rayleigh-product channels. To this end, we first set up a central limit theorem for the MID over Rayleigh-product MIMO channels in the asymptotic regime where the number of scatterers, number of antennas, and blocklength go to infinity at the same pace. Then, we utilize the CLT to obtain the upper and lower bounds for the packet error probability, whose approximations in the high and low signal to noise ratio regimes are then derived to illustrate the impact of rank deficiency. One interesting observation is that rank-deficiency degrades the performance of MIMO systems with FBL and the fundamental limits of Rayleigh-product channels degenerate to those of the Rayleigh case when the number of scatterers approaches infinity. △ Less

Submitted 9 April, 2024; v1 submitted 17 October, 2022; originally announced October 2022.

arXiv:2207.10853 [pdf, ps, other]

Error Estimate of Multiscale Finite Element Method for Periodic Media Revisited

Authors: **bing Ming, Siqi Song

Abstract: We derive the optimal energy error estimate for multiscale finite element method with oversampling technique applying to elliptic system with rapidly oscillating periodic coefficients under the assumption that the coefficients are bounded and measurable, which may admit rough microstructures. As a by-product of the energy estimate, we derive the rate of convergence in L$^{d/(d-1)}-$norm. We derive the optimal energy error estimate for multiscale finite element method with oversampling technique applying to elliptic system with rapidly oscillating periodic coefficients under the assumption that the coefficients are bounded and measurable, which may admit rough microstructures. As a by-product of the energy estimate, we derive the rate of convergence in L$^{d/(d-1)}-$norm. △ Less

Submitted 19 October, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

arXiv:2206.12698 [pdf, other]

Render unto Numerics: Orthogonal Polynomial Neural Operator for PDEs with Non-periodic Boundary Conditions

Authors: Ziyuan Liu, Haifeng Wang, Hong Zhang, Kaijuna Bao, Xu Qian, Songhe Song

Abstract: By learning the map**s between infinite function spaces using carefully designed neural networks, the operator learning methodology has exhibited significantly more efficiency than traditional methods in solving complex problems such as differential equations, but faces concerns about their accuracy and reliability. To overcomes these limitations, combined with the structures of the spectral num… ▽ More By learning the map**s between infinite function spaces using carefully designed neural networks, the operator learning methodology has exhibited significantly more efficiency than traditional methods in solving complex problems such as differential equations, but faces concerns about their accuracy and reliability. To overcomes these limitations, combined with the structures of the spectral numerical method, a general neural architecture named spectral operator learning (SOL) is introduced, and one variant called the orthogonal polynomial neural operator (OPNO), developed for PDEs with Dirichlet, Neumann and Robin boundary conditions (BCs), is proposed later. The strict BC satisfaction properties and the universal approximation capacity of the OPNO are theoretically proven. A variety of numerical experiments with physical backgrounds show that the OPNO outperforms other existing deep learning methodologies, as well as the traditional 2nd-order finite difference method (FDM) with a considerably fine mesh (with the relative errors reaching the order of 1e-6), and is up to almost 5 magnitudes faster than the traditional method. △ Less

Submitted 3 March, 2023; v1 submitted 25 June, 2022; originally announced June 2022.

arXiv:2205.10241 [pdf, other]

Arbitrary high-order structure-preserving schemes for the generalized Rosenau-type equation

Authors: Chaolong Jiang, Xu Qian, Songhe Song, Chenxuan Zheng

Abstract: In this paper, we are concerned with arbitrarily high-order momentum-preserving and energy-preserving schemes for solving the generalized Rosenau-type equation, respectively. The derivation of the momentum-preserving schemes is made within the symplectic Runge-Kutta method, coupled with the standard Fourier pseudo-spectral method in space. Then, combined with the quadratic auxiliary variable appro… ▽ More In this paper, we are concerned with arbitrarily high-order momentum-preserving and energy-preserving schemes for solving the generalized Rosenau-type equation, respectively. The derivation of the momentum-preserving schemes is made within the symplectic Runge-Kutta method, coupled with the standard Fourier pseudo-spectral method in space. Then, combined with the quadratic auxiliary variable approach and the symplectic Runge-Kutta method, together with the standard Fourier pseudo-spectral method, we present a class of high-order mass- and energy-preserving schemes for the Rosenau equation. Finally, extensive numerical tests and comparisons are also addressed to illustrate the performance of the proposed schemes. △ Less

Submitted 29 January, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

Comments: 23 pages, 31 figures

arXiv:2202.09488 [pdf, other]

An Operator Learning Approach via Function-valued Reproducing Kernel Hilbert Space for Differential Equations

Authors: Kaijun Bao, Xu Qian, Ziyuan Liu, Songhe Song

Abstract: Much recent work has addressed the solution of a family of partial differential equations by computing the inverse operator map between the input and solution space. Toward this end, we incorporate function-valued reproducing kernel Hilbert spaces in our operator learning model. We use neural networks to parameterize Hilbert-Schmidt integral operator and propose an architecture. Experiments includ… ▽ More Much recent work has addressed the solution of a family of partial differential equations by computing the inverse operator map between the input and solution space. Toward this end, we incorporate function-valued reproducing kernel Hilbert spaces in our operator learning model. We use neural networks to parameterize Hilbert-Schmidt integral operator and propose an architecture. Experiments including several typical datasets show that the proposed architecture has desirable accuracy on linear and nonlinear partial differential equations even with a small amount of data. By learning the map**s between function spaces, the proposed method can find the solution of a high-resolution input after learning from lower-resolution data. △ Less

Submitted 2 April, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

Comments: 14 pages, 8 figures, 4 tables

arXiv:2202.07100 [pdf, ps, other]

Locally Finite Vertex-Rotary Maps and Coset Graphs with Finite Valency and Finite Edge Multiplicity

Authors: Cai Heng Li, Cheryl E. Praeger, Shu Jiao Song

Abstract: It is well-known that a simple $G$-arc-transitive graph can be represented as a coset graph for the group $G$. This representation is extended to a construction of $G$-arc-transitive coset graphs $\Cos(G,H,J)$ with finite valency and finite edge-multiplicity, where $H, J$ are stabilisers in $G$ of a vertex and incident edge, respectively. Given a group $G=ła,z\r$ with $|z|=2$ and $|a|$ finite, the… ▽ More It is well-known that a simple $G$-arc-transitive graph can be represented as a coset graph for the group $G$. This representation is extended to a construction of $G$-arc-transitive coset graphs $\Cos(G,H,J)$ with finite valency and finite edge-multiplicity, where $H, J$ are stabilisers in $G$ of a vertex and incident edge, respectively. Given a group $G=ła,z\r$ with $|z|=2$ and $|a|$ finite, the coset graph $\Cos(G,ła\r,łz\r)$ is shown, under suitable finiteness assumptions, to have exactly two different arc-transitive embeddings as a $G$-arc-transitive map $(V,E,F)$, namely, a {\it $G$-rotary} map if $|az|$ is finite, and a {\it $G$-bi-rotary} map if $|zz^a|$ is finite. The $G$-rotary map can be represented as a coset geometry for $G$, extending the notion of a coset graph. However the $G$-bi-rotary map does not have such a representation, and the face boundary cycles must be specified in addition to incidences between faces and edges. We also give a coset geometry construction of a flag-regular map $(V,E,F)$. In all of these constructions we prove that the face boundary cycles are regular cycles which are simple cycles precisely when the given group acts faithfully on $V\cup F$. △ Less

Submitted 14 February, 2022; originally announced February 2022.

arXiv:2111.02615 [pdf, ps, other]

The graphs with a symmetrical Euler cycle

Authors: Jiyong Chen, Cai Heng Li, Cheryl E. Praeger, Shu-Jiao Song

Abstract: The edges surrounding a face of a map $M$ form a cycle $C$, called the boundary cycle of the face, and $C$ is often not a simple cycle. If the map $M$ is arc-transitive, then there is a cyclic subgroup of automorphisms of $M$ which leaves $C$ invariant and is bi-regular on the edges of the induced subgraph $[C]$; that is to say, $C$ is a symmetrical Euler cycle of $[C]$. In this paper we determine… ▽ More The edges surrounding a face of a map $M$ form a cycle $C$, called the boundary cycle of the face, and $C$ is often not a simple cycle. If the map $M$ is arc-transitive, then there is a cyclic subgroup of automorphisms of $M$ which leaves $C$ invariant and is bi-regular on the edges of the induced subgraph $[C]$; that is to say, $C$ is a symmetrical Euler cycle of $[C]$. In this paper we determine the family of graphs (which may have multiple edges) whose edge-sets can be sequenced to form a symmetrical Euler cycle. We first classify all graphs which have a cyclic subgroup of automorphisms acting bi-regularly on edges. We then apply this classification to obtain the graphs possessing a symmetrical Euler cycle, and therefore are the (only) candidates for the induced subgraphs of the boundary cycles of the faces of arc-transitive maps. △ Less

Submitted 3 November, 2021; originally announced November 2021.

Comments: 31 pages

MSC Class: 20B25; 05C25; 05C35

arXiv:2109.15261 [pdf, other]

A simple and flexible test of sample exchangeability with applications to statistical genomics

Authors: Alan J. Aw, Jeffrey P. Spence, Yun S. Song

Abstract: In scientific studies involving analyses of multivariate data, basic but important questions often arise for the researcher: Is the sample exchangeable, meaning that the joint distribution of the sample is invariant to the ordering of the units? Are the features independent of one another, or perhaps the features can be grouped so that the groups are mutually independent? In statistical genomics,… ▽ More In scientific studies involving analyses of multivariate data, basic but important questions often arise for the researcher: Is the sample exchangeable, meaning that the joint distribution of the sample is invariant to the ordering of the units? Are the features independent of one another, or perhaps the features can be grouped so that the groups are mutually independent? In statistical genomics, these considerations are fundamental to downstream tasks such as demographic inference and the construction of polygenic risk scores. We propose a non-parametric approach, which we call the V test, to address these two questions, namely, a test of sample exchangeability given dependency structure of features, and a test of feature independence given sample exchangeability. Our test is conceptually simple, yet fast and flexible. It controls the Type I error across realistic scenarios, and handles data of arbitrary dimensions by leveraging large-sample asymptotics. Through extensive simulations and a comparison against unsupervised tests of stratification based on random matrix theory, we find that our test compares favorably in various scenarios of interest. We apply the test to data from the 1000 Genomes Project, demonstrating how it can be employed to assess exchangeability of the genetic sample, or find optimal linkage disequilibrium (LD) splits for downstream analysis. For exchangeability assessment, we find that removing rare variants can substantially increase the p-value of the test statistic. For optimal LD splitting, the V test reports different optimal splits than previous approaches not relying on hypothesis testing. Software for our methods is available in R (CRAN: flintyR) and Python (PyPI: flintyPy). △ Less

Submitted 30 August, 2023; v1 submitted 30 September, 2021; originally announced September 2021.

Comments: 24 pages. Supplementary Information file (38 pages, contains mathematical proofs) is available at https://github.com/songlab-cal/flinty/

MSC Class: 62G10; 62H15; 62P10 ACM Class: G.3

arXiv:2109.05660 [pdf, ps, other]

doi 10.46298/dmtcs.8484

Asymptotically sharpening the $s$-Hamiltonian index bound

Authors: Sulin Song, Lan Lei, Yehong Shao, Hong-Jian Lai

Abstract: For a non-negative integer $s\le |V(G)|-3$, a graph $G$ is $s$-Hamiltonian if the removal of any $k\le s$ vertices results in a Hamiltonian graph. Given a connected simple graph $G$ that is not isomorphic to a path, a cycle, or a $K_{1,3}$, let $δ(G)$ denote the minimum degree of $G$, let $h_s(G)$ denote the smallest integer $i$ such that the iterated line graph $L^{i}(G)$ is $s$-Hamiltonian, and… ▽ More For a non-negative integer $s\le |V(G)|-3$, a graph $G$ is $s$-Hamiltonian if the removal of any $k\le s$ vertices results in a Hamiltonian graph. Given a connected simple graph $G$ that is not isomorphic to a path, a cycle, or a $K_{1,3}$, let $δ(G)$ denote the minimum degree of $G$, let $h_s(G)$ denote the smallest integer $i$ such that the iterated line graph $L^{i}(G)$ is $s$-Hamiltonian, and let $\ell(G)$ denote the length of the longest non-closed path $P$ in which all internal vertices have degree 2 such that $P$ is not both of length 2 and in a $K_3$. For a simple graph $G$, we establish better upper bounds for $h_s(G)$ as follows. \begin{equation*} h_s(G)\le \left\{ \begin{aligned} & \ell(G)+1, &&\mbox{ if }δ(G)\le 2 \mbox{ and }s=0;\\ & \widetilde d(G)+2+\lceil \lg (s+1)\rceil, &&\mbox{ if }δ(G)\le 2 \mbox{ and }s\ge 1;\\ & 2+\left\lceil\lg\frac{s+1}{δ(G)-2}\right\rceil, && \mbox{ if } 3\leδ(G)\le s+2;\\ & 2, &&{\rm otherwise}, \end{aligned} \right. \end{equation*} where $\widetilde d(G)$ is the smallest integer $i$ such that $δ(L^i(G))\ge 3$. Consequently, when $s \ge 6$, this new upper bound for the $s$-hamiltonian index implies that $h_s(G) = o(\ell(G)+s+1)$ as $s \to \infty$. This sharpens the result, $h_s(G)\le\ell(G)+s+1$, obtained by Zhang et al. in [Discrete Math., 308 (2008) 4779-4785]. △ Less

Submitted 2 June, 2022; v1 submitted 12 September, 2021; originally announced September 2021.

Comments: 9 pages

MSC Class: 05C40; 05C45; 05C76 ACM Class: G.2.2

Journal ref: Discrete Mathematics & Theoretical Computer Science, vol. 24, no. 1, Graph Theory (June 13, 2022) dmtcs:8484

arXiv:2106.11047 [pdf, ps, other]

Partial geometric designs having circulant concurrence matrices

Authors: Sung-Yell Song, Theodore Tranel

Abstract: We survey partial geometric designs and investigate their concurrences of points. The concurrence matrix of a design, which encodes the concurrences of pairs of points, can be used in the classification of designs in some extent. An ordinary 2-$(v,k,λ)$ design has concurrence $λ$ for any pair of distinct points, and its concurrence matrix is circulant. A partial geometry has two concurrences $1$ a… ▽ More We survey partial geometric designs and investigate their concurrences of points. The concurrence matrix of a design, which encodes the concurrences of pairs of points, can be used in the classification of designs in some extent. An ordinary 2-$(v,k,λ)$ design has concurrence $λ$ for any pair of distinct points, and its concurrence matrix is circulant. A partial geometry has two concurrences $1$ and $0$ and a transversal design TD$_λ(k, u)$ has two concurrences $λ$ and $0$. It is also known that the concurrence matrix of a partial geometric design can have at most three distinct eigenvalues, all of which are non-negative integers. In this paper, we show the existence of other partial geometric designs having two or three distinct concurrences, and investigate which symmetric circulant matrices are realized as the concurrence matrices of partial geometric designs. We collect known sources of partial geometric designs and study their structural characteristics and construction methods. We then give a list of feasible parameter sets for partial geometric designs of order up to 12 each of which has a circulant concurrence matrix. We also consider the combinatorial properties and constructions of partial geometric designs satisfying these parameter sets. △ Less

Submitted 3 January, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

Comments: 46 pages

MSC Class: 05B05; 05C50 (Primary) 05B10; 05B20; 05B25; 05B30; 05E30; 62K10 (Secondary)

arXiv:2105.03930 [pdf, other]

Arbitrary high-order linear structure-preserving schemes for the regularized long-wave equation

Authors: Chaolong Jiang, Xu Qian, Songhe Song, ** Cui

Abstract: In this paper, a class of arbitrarily high-order linear momentum-preserving and energy-preserving schemes are proposed, respectively, for solving the regularized long-wave equation. For the momentum-preserving scheme, the key idea is based on the extrapolation/prediction-correction technique and the symplectic Runge-Kutta method in time, together with the standard Fourier pseudo-spectral method in… ▽ More In this paper, a class of arbitrarily high-order linear momentum-preserving and energy-preserving schemes are proposed, respectively, for solving the regularized long-wave equation. For the momentum-preserving scheme, the key idea is based on the extrapolation/prediction-correction technique and the symplectic Runge-Kutta method in time, together with the standard Fourier pseudo-spectral method in space. We show that the scheme is linear, high-order, unconditionally stable and preserves the discrete momentum of the system. For the energy-preserving scheme, it is mainly based on the energy quadratization approach and the analogous linearized strategy used in the construction of the linear momentum-preserving scheme. The proposed scheme is linear, high-order and can preserve a discrete quadratic energy exactly. Numerical results are addressed to demonstrate the accuracy and efficiency of the proposed scheme. △ Less

Submitted 5 December, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

Comments: 26 pages, 52 figures

arXiv:2103.13566 [pdf, other]

A Nitsche Hybrid multiscale method with non-matching grids

Authors: **bing Ming, Siqi Song

Abstract: We propose a Nitsche method for multiscale partial differential equations, which retrieves the macroscopic information and the local microscopic information at one stroke. We prove the convergence of the method for second order elliptic problem with bounded and measurable coefficients. The rate of convergence may be derived for coefficients with further structures such as periodicity and ergodicit… ▽ More We propose a Nitsche method for multiscale partial differential equations, which retrieves the macroscopic information and the local microscopic information at one stroke. We prove the convergence of the method for second order elliptic problem with bounded and measurable coefficients. The rate of convergence may be derived for coefficients with further structures such as periodicity and ergodicity. Extensive numerical results confirm the theoretical predictions. △ Less

Submitted 1 March, 2022; v1 submitted 24 March, 2021; originally announced March 2021.

arXiv:2103.00390 [pdf, other]

High-order linearly implicit structure-preserving exponential integrators for the nonlinear Schrödinger equation

Authors: Chaolong Jiang, ** Cui, Xu Qian, Songhe Song

Abstract: A novel class of high-order linearly implicit energy-preserving integrating factor Runge-Kutta methods are proposed for the nonlinear Schrödinger equation. Based on the idea of the scalar auxiliary variable approach, the original equation is first reformulated into an equivalent form which satisfies a quadratic energy. The spatial derivatives of the system are then approximated with the standard F… ▽ More A novel class of high-order linearly implicit energy-preserving integrating factor Runge-Kutta methods are proposed for the nonlinear Schrödinger equation. Based on the idea of the scalar auxiliary variable approach, the original equation is first reformulated into an equivalent form which satisfies a quadratic energy. The spatial derivatives of the system are then approximated with the standard Fourier pseudo-spectral method. Subsequently, we apply the extrapolation technique/prediction-correction strategy to the nonlinear terms of the semi-discretized system and a linearized energy-conserving system is obtained. A fully discrete scheme is gained by further using the integrating factor Runge-Kutta method to the resulting system. We show that, under certain circumstances for the coefficients of a Runge-Kutta method, the proposed scheme can produce numerical solutions along which the modified energy is precisely conserved, as is the case with the analytical solution and is extremely efficient in the sense that only linear equations with constant coefficients need to be solved at every time step. Numerical results are addressed to demonstrate the remarkable superiority of the proposed schemes in comparison with other existing structure-preserving schemes. △ Less

Submitted 5 December, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

Comments: 28 pages, 49 figures

arXiv:2101.04313 [pdf, ps, other]

On finite subnormal Cayley graphs

Authors: Shu Jiao Song

Abstract: In this paper we introduce and study a type of Cayley graph -- subnormal Cayley graph. We prove that a subnormal 2-arc transitive Cayley graph is a normal Cayley graph or a normal cover of a complete bipartite graph $K_{p^d,p^d}$ with $p$ prime. Then we obtain a generic method for constructing half-symmetric (namely edge transitive but not arc transitive) Cayley graphs. In this paper we introduce and study a type of Cayley graph -- subnormal Cayley graph. We prove that a subnormal 2-arc transitive Cayley graph is a normal Cayley graph or a normal cover of a complete bipartite graph $K_{p^d,p^d}$ with $p$ prime. Then we obtain a generic method for constructing half-symmetric (namely edge transitive but not arc transitive) Cayley graphs. △ Less

Submitted 12 January, 2021; originally announced January 2021.

Comments: 9 pages

MSC Class: 05C25; 20B05

arXiv:2101.04270 [pdf, ps, other]

Arc transitive circulants

Authors: Shu Jiao Song

Abstract: This short paper presents characterisations of normal arc-transitive circulants and arc-transitive normal circulants, that is, for a connected arc-transitive circulant $Γ=\Cay(C,S)$, it is shown that 1. Aut(C,S) is transitive on S if and only if each element of S has order n; 2. $AutΓ\rhd C$ if and only if S does not contain a coset of any subgroup. This completes the classification of arc-trans… ▽ More This short paper presents characterisations of normal arc-transitive circulants and arc-transitive normal circulants, that is, for a connected arc-transitive circulant $Γ=\Cay(C,S)$, it is shown that 1. Aut(C,S) is transitive on S if and only if each element of S has order n; 2. $AutΓ\rhd C$ if and only if S does not contain a coset of any subgroup. This completes the classification of arc-transitive circulants given by Li-Xia-Zhou. △ Less

Submitted 11 January, 2021; originally announced January 2021.

Comments: 8 pages

MSC Class: 20B15; 20B30; 05C25

arXiv:2101.04265 [pdf, ps, other]

Finite permutation groups containing a regular dihedral subgroup

Authors: Shu Jiao Song

Abstract: We present a characterization of finite permutation groups which contain a transitive dihedral subgroup. We present a characterization of finite permutation groups which contain a transitive dihedral subgroup. △ Less

Submitted 11 January, 2021; originally announced January 2021.

Comments: 12 pages

MSC Class: 20B15; 20B30; 05C25

arXiv:2008.06664 [pdf, other]

Exact and arbitrarily accurate non-parametric two-sample tests based on rank spacings

Authors: Dan D. Erdmann-Pham, Jonathan Terhorst, Yun S. Song

Abstract: A common method for deriving non-parametric tests is to reformulate a parametric test in terms of sample ranks. Despite being distribution free (even in finite samples), the resulting tests often display remarkable asymptotic power properties, typically matching the efficiency of their parametric counterpart. Empirically, these favorable power properties have been shown to persist in non-asymptoti… ▽ More A common method for deriving non-parametric tests is to reformulate a parametric test in terms of sample ranks. Despite being distribution free (even in finite samples), the resulting tests often display remarkable asymptotic power properties, typically matching the efficiency of their parametric counterpart. Empirically, these favorable power properties have been shown to persist in non-asymptotic regimes as well, prompting the need for finite-sample characterizations of the corresponding rank-based statistics. Here, we provide such characterization for the family of weighted $p$-norms of rank spacings, which includes the classical tests of Mann-Whitney, Dixon, and various generalizations thereof. For $p=1$, we provide exact expressions for the involved distributions, while for $p>1$ we describe the associated moment sequences and derive an algorithm to recover the distributions of interest from these sequences in a fast and stable manner. We use this framework to develop a new family of non-parametric tests mirroring properties of generalized likelihood-ratios, prove new tail bounds for Dixon's and Greenwood's statistics, and prove a previously formulated conjecture regarding the global efficiency of rank-based tests against the $F$-test in the context of scale-families. △ Less

Submitted 8 August, 2022; v1 submitted 15 August, 2020; originally announced August 2020.

Comments: 33 pages, 6 figures

arXiv:2007.13680 [pdf, ps, other]

High order tensor moments of random vectors

Authors: Yan Feng, Shan Song, Changqing Xu

Abstract: A random vector $\bx\in \R^n$ is a vector whose coordinates are all random variables. A random vector is called a Gaussian vector if it follows Gaussian distribution. These terminology can also be extended to a random (Gaussian) matrix and random (Gaussian) tensor. The classical form of an $k$-order moment (for any positive integer $k$) of a random vector $\bx\in \R^n$ is usually expressed in a ma… ▽ More A random vector $\bx\in \R^n$ is a vector whose coordinates are all random variables. A random vector is called a Gaussian vector if it follows Gaussian distribution. These terminology can also be extended to a random (Gaussian) matrix and random (Gaussian) tensor. The classical form of an $k$-order moment (for any positive integer $k$) of a random vector $\bx\in \R^n$ is usually expressed in a matrix form of size $n\times n^{k-1}$ generated from the $k$th derivative of the characteristic function or the moment generating function of $\bx$ , and the expression of an $k$-order moment is very complicate even for a standard normal distributed vector. With the tensor form, we can simplify all the expressions related to high order moments. The main purpose of this paper is to introduce the high order moments of a random vector in tensor forms and the high order moments of a standard normal distributed vector. Finally we present an expression of high order moments of a random vector that follows a Gaussian distribution. △ Less

Submitted 27 July, 2020; originally announced July 2020.

Comments: 18 pages,0 figures,

MSC Class: 15A69; 15A72

arXiv:2007.09619 [pdf, other]

doi 10.4208/csiam-am.2020-0035

An Efficient Online-Offline Method for Elliptic Homogenization Problems

Authors: Yufang Huang, **bing Ming, Siqi Song

Abstract: We present a new numerical method for solving the elliptic homogenization problem. The main idea is that the missing effective matrix is reconstructed by solving the local least-squares in an offline stage, which shall be served as the input data for the online computation. The accuracy of the proposed method are analyzed with the aid of the refined estimates of the reconstruction operator. Two di… ▽ More We present a new numerical method for solving the elliptic homogenization problem. The main idea is that the missing effective matrix is reconstructed by solving the local least-squares in an offline stage, which shall be served as the input data for the online computation. The accuracy of the proposed method are analyzed with the aid of the refined estimates of the reconstruction operator. Two dimensional and three dimensional numerical tests confirm the efficiency of the proposed method, and illustrate that this online-offline strategy may significantly reduce the cost without loss of accuracy. △ Less

Submitted 4 November, 2020; v1 submitted 19 July, 2020; originally announced July 2020.

arXiv:2007.09235 [pdf, ps, other]

Hadamard diagonalizable graphs of order at most 36

Authors: Jane Breen, Steve Butler, Melissa Fuentes, Bernard Lidický, Michael Phillips, Alexander W. N. Riasanovksy, Sung-Yell Song, Ralihe R. Villagrán, Cedar Wiseman, Xiaohong Zhang

Abstract: If the Laplacian matrix of a graph has a full set of orthogonal eigenvectors with entries $\pm1$, then the matrix formed by taking the columns as the eigenvectors is a Hadamard matrix and the graph is said to be Hadamard diagonalizable. In this article, we prove that if $n=8k+4$ the only possible Hadamard diagonalizable graphs are $K_n$, $K_{n/2,n/2}$, $2K_{n/2}$, and $nK_1$, and we develop an e… ▽ More If the Laplacian matrix of a graph has a full set of orthogonal eigenvectors with entries $\pm1$, then the matrix formed by taking the columns as the eigenvectors is a Hadamard matrix and the graph is said to be Hadamard diagonalizable. In this article, we prove that if $n=8k+4$ the only possible Hadamard diagonalizable graphs are $K_n$, $K_{n/2,n/2}$, $2K_{n/2}$, and $nK_1$, and we develop an efficient computation for determining all graphs diagonalized by a given Hadamard matrix of any order. Using these two tools, we determine and present all Hadamard diagonalizable graphs up to order 36. Note that it is not even known how many Hadamard matrices there are of order 36. △ Less

Submitted 15 August, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

MSC Class: 05C50; 15B34; 05B20; 05C76; 05C85

arXiv:2006.10339 [pdf, ps, other]

Erdős-Ko-Rado problems for permutation groups

Authors: Cai Heng Li, Shu Jiao Song, Venkata Raghu Tej Pantangi

Abstract: In this paper, we study intersecting sets in primitive and quasiprimitive permutation groups. Let $G \leqslant \mathrm{Sym}(Ω)$ be a transitive permutation group, and ${S}$ an intersecting set. Previous results show that if $G$ is either 2-transitive or a Frobenius group, then $|{S}|\leqslant|G_ω|$ (for some $ω\in Ω$). Furthermore, for some 2-transitive groups, $|{S}|=|G_ω|$ if and only if ${S}$ i… ▽ More In this paper, we study intersecting sets in primitive and quasiprimitive permutation groups. Let $G \leqslant \mathrm{Sym}(Ω)$ be a transitive permutation group, and ${S}$ an intersecting set. Previous results show that if $G$ is either 2-transitive or a Frobenius group, then $|{S}|\leqslant|G_ω|$ (for some $ω\in Ω$). Furthermore, for some 2-transitive groups, $|{S}|=|G_ω|$ if and only if ${S}$ is a coset of a stabilizer. In this paper, we prove that these statements are far from the truth for general transitive groups. In particular, we show that in the case of primitive groups, there is even no absolute constant $c$ such that $|{S}|\leqslant c|G_ω|$. In the case $G$ is a primitive permutation group isomorphic to $\mathrm{PSL(2,p)}$, we characterize the subgroups of $G$ which are intersecting sets. We also show that if $G \leqslant \mathrm{Sym}(Ω)$ is a permutation group of prime power degree, then for any intersecting set $S$, we have $|S|\leq |G_ω|$ (for some $ω\in Ω$). This proves a part of a conjecture in \cite{MRS}. △ Less

Submitted 16 January, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

Comments: 19 pages

arXiv:2006.09370 [pdf, ps, other]

Two regularized energy-preserving finite difference methods for the logarithmic Klein-Gordon equation

Authors: **gye Yan, Xu Qian, Hong Zhang, Songhe Song

Abstract: We present and analyze two regularized finite difference methods which preserve energy of the logarithmic Klein-Gordon equation (LogKGE). In order to avoid singularity caused by the logarithmic nonlinearity of the LogKGE, we propose a regularized logarithmic Klein-Gordon equation (RLogKGE) with a small regulation parameter $0<\varepsilon\ll1$ to approximate the LogKGE with the convergence order… ▽ More We present and analyze two regularized finite difference methods which preserve energy of the logarithmic Klein-Gordon equation (LogKGE). In order to avoid singularity caused by the logarithmic nonlinearity of the LogKGE, we propose a regularized logarithmic Klein-Gordon equation (RLogKGE) with a small regulation parameter $0<\varepsilon\ll1$ to approximate the LogKGE with the convergence order $O(\varepsilon)$. By adopting the energy method, the inverse inequality, and the cut-off technique of the nonlinearity to bound the numerical solution, the error bound $O(h^{2}+\frac{τ^{2}}{\varepsilon^{2}})$ of the two schemes with the mesh size $h$, the time step $τ$ and the parameter $\varepsilon$. Numerical results are reported to support our conclusions. △ Less

Submitted 14 June, 2020; originally announced June 2020.

Comments: arXiv admin note: text overlap with arXiv:2006.08079

arXiv:2006.08079 [pdf, ps, other]

Regularized finite difference methods for the logarithmic Klein-Gordon equation

Authors: **gye Yan, Hong Zhang, Xu Qian, Songhe Song

Abstract: We propose and analyze two regularized finite difference methods for the logarithmic Klein-Gordon equation (LogKGE). Due to the blowup phenomena caused by the logarithmic nonlinearity of the LogKGE, it is difficult to construct numerical schemes and establish their error bounds. In order to avoid singularity, we present a regularized logarithmic Klein-Gordon equation (RLogKGE) with a small regular… ▽ More We propose and analyze two regularized finite difference methods for the logarithmic Klein-Gordon equation (LogKGE). Due to the blowup phenomena caused by the logarithmic nonlinearity of the LogKGE, it is difficult to construct numerical schemes and establish their error bounds. In order to avoid singularity, we present a regularized logarithmic Klein-Gordon equation (RLogKGE) with a small regularized parameter $0<\varepsilon\ll1$. Besides, two finite difference methods are adopted to solve the regularized logarithmic Klein-Gordon equation (RLogKGE) and rigorous error bounds are estimated in terms of the mesh size $h$, time step $τ$, and the small regularized parameter $\varepsilon$. Finally, numerical experiments are carried out to verify our error estimates of the two numerical methods and the convergence results from the LogKGE to the RLogKGE with the linear convergence order $O(\varepsilon)$. △ Less

Submitted 14 June, 2020; originally announced June 2020.

arXiv:2006.06783 [pdf, other]

Evading Curse of Dimensionality in Unconstrained Private GLMs via Private Gradient Descent

Authors: Shuang Song, Thomas Steinke, Om Thakkar, Abhradeep Thakurta

Abstract: We revisit the well-studied problem of differentially private empirical risk minimization (ERM). We show that for unconstrained convex generalized linear models (GLMs), one can obtain an excess empirical risk of $\tilde O\left(\sqrt{\texttt{rank}}/εn\right)$, where ${\texttt{rank}}$ is the rank of the feature matrix in the GLM problem, $n$ is the number of data samples, and $ε$ is the privacy para… ▽ More We revisit the well-studied problem of differentially private empirical risk minimization (ERM). We show that for unconstrained convex generalized linear models (GLMs), one can obtain an excess empirical risk of $\tilde O\left(\sqrt{\texttt{rank}}/εn\right)$, where ${\texttt{rank}}$ is the rank of the feature matrix in the GLM problem, $n$ is the number of data samples, and $ε$ is the privacy parameter. This bound is attained via differentially private gradient descent (DP-GD). Furthermore, via the first lower bound for unconstrained private ERM, we show that our upper bound is tight. In sharp contrast to the constrained ERM setting, there is no dependence on the dimensionality of the ambient model space ($p$). (Notice that ${\texttt{rank}}\leq \min\{n, p\}$.) Besides, we obtain an analogous excess population risk bound which depends on ${\texttt{rank}}$ instead of $p$. For the smooth non-convex GLM setting (i.e., where the objective function is non-convex but preserves the GLM structure), we further show that DP-GD attains a dimension-independent convergence of $\tilde O\left(\sqrt{\texttt{rank}}/εn\right)$ to a first-order-stationary-point of the underlying objective. Finally, we show that for convex GLMs, a variant of DP-GD commonly used in practice (which involves clip** the individual gradients) also exhibits the same dimension-independent convergence to the minimum of a well-defined objective. To that end, we provide a structural lemma that characterizes the effect of clip** on the optimization profile of DP-GD. △ Less

Submitted 2 March, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

arXiv:2002.06751 [pdf, other]

Second-order Conic Programming Approach for Wasserstein Distributionally Robust Two-stage Linear Programs

Authors: Zhuolin Wang, Keyou You, Shiji Song, Yuli Zhang

Abstract: This paper proposes a second-order conic programming (SOCP) approach to solve distributionally robust two-stage stochastic linear programs over 1-Wasserstein balls. We start from the case with distribution uncertainty only in the objective function and exactly reformulate it as an SOCP problem. Then, we study the case with distribution uncertainty only in constraints, and show that such a robust p… ▽ More This paper proposes a second-order conic programming (SOCP) approach to solve distributionally robust two-stage stochastic linear programs over 1-Wasserstein balls. We start from the case with distribution uncertainty only in the objective function and exactly reformulate it as an SOCP problem. Then, we study the case with distribution uncertainty only in constraints, and show that such a robust program is generally NP-hard as it involves a norm maximization problem over a polyhedron. However, it is reduced to an SOCP problem if the extreme points of the polyhedron are given as a prior. This motivates to design a constraint generation algorithm with provable convergence to approximately solve the NP-hard problem. In sharp contrast to the exiting literature, the distribution achieving the worst-case cost is given as an "empirical" distribution by simply perturbing each sample for both cases. Finally, experiments illustrate the advantages of the proposed model in terms of the out-of-sample performance and the computational complexity. △ Less

Submitted 28 May, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

arXiv:1910.13700 [pdf, ps, other]

Mass and energy conservative high order diagonally implicit Runge--Kutta schemes for nonlinear Schrödinger equation in one and two dimensions

Authors: Ziyuan Liu, Hong Zhang, Xu Qian, Songhe Song

Abstract: We present and analyze a series of conservative diagonally implicit Runge--Kutta schemes for the nonlinear Schrödiner equation. With the application of the newly developed invariant energy quadratization approach, these schemes possess not only high accuracy , high order convergence (up to fifth order) and efficiency due to diagonally implicity but also mass and energy conservative properties. Bot… ▽ More We present and analyze a series of conservative diagonally implicit Runge--Kutta schemes for the nonlinear Schrödiner equation. With the application of the newly developed invariant energy quadratization approach, these schemes possess not only high accuracy , high order convergence (up to fifth order) and efficiency due to diagonally implicity but also mass and energy conservative properties. Both theoretical analysis and numerical experiments of one- and two-dimensional dynamics are carried out to verify the invariant conservative properties, convergence orders and longtime simulation stability. △ Less

Submitted 30 October, 2019; originally announced October 2019.

arXiv:1908.03676 [pdf, ps, other]

Law of the Iterated Logarithm and Model Selection Consistency for GLMs with Independent and Dependent Responses

Authors: Xiaowei Yang, Shuang Song, Huiming Zhang

Abstract: We study the law of the iterated logarithm (LIL) for the maximum likelihood estimation of the parameters (as a convex optimization problem) in the generalized linear models with independent or weakly dependent ($ρ$-mixing, $m$-dependent) responses under mild conditions. The LIL is useful to derive the asymptotic bounds for the discrepancy between the empirical process of the log-likelihood functio… ▽ More We study the law of the iterated logarithm (LIL) for the maximum likelihood estimation of the parameters (as a convex optimization problem) in the generalized linear models with independent or weakly dependent ($ρ$-mixing, $m$-dependent) responses under mild conditions. The LIL is useful to derive the asymptotic bounds for the discrepancy between the empirical process of the log-likelihood function and the true log-likelihood. As the application of the LIL, the strong consistency of some penalized likelihood based model selection criteria can be shown. Under some regularity conditions, the model selection criterion will be helpful to select the simplest correct model almost surely when the penalty term increases with model dimension and the penalty term has an order higher than $O({\rm{loglog}}n)$ but lower than $O(n)$. Simulation studies are implemented to verify the selection consistency of BIC. △ Less

Submitted 25 April, 2020; v1 submitted 9 August, 2019; originally announced August 2019.

Comments: 25 pages, 1 table

arXiv:1903.00158 [pdf, ps, other]

The Construction of Two Kinds of Bijections in Simple Random Walk Paths

Authors: Sai Song, Qiang Yao

Abstract: It is known that for the 2n-step symmetric simple random walk on Z, two events have the same probability if and only if their sets of paths have the same cardinality. In this article, we construct two kinds of bijections between sets of paths with the same cardinality. The construction is natural and simple. It can be easily realized through programming. More importantly, this construction opens a… ▽ More It is known that for the 2n-step symmetric simple random walk on Z, two events have the same probability if and only if their sets of paths have the same cardinality. In this article, we construct two kinds of bijections between sets of paths with the same cardinality. The construction is natural and simple. It can be easily realized through programming. More importantly, this construction opens a door to prove that two events in the 2n-step symmetric simple random walk on Z have the same probability and some further related results. △ Less

Submitted 11 July, 2021; v1 submitted 1 March, 2019; originally announced March 2019.

Comments: 8 pages, 2 figures

MSC Class: 60C05; 60J10

arXiv:1902.09128 [pdf, other]

Wasserstein Distributionally Robust Shortest Path Problem

Authors: Zhuolin Wang, Keyou You, Shiji Song, Yuli Zhang

Abstract: This paper proposes a data-driven distributionally robust shortest path (DRSP) model where the distribution of the travel time in the transportation network can only be partially observed through a finite number of samples. Specifically, we aim to find an optimal path to minimize the worst-case $α$-reliable mean-excess travel time (METT) over a Wasserstein ball, which is centered at the empirical… ▽ More This paper proposes a data-driven distributionally robust shortest path (DRSP) model where the distribution of the travel time in the transportation network can only be partially observed through a finite number of samples. Specifically, we aim to find an optimal path to minimize the worst-case $α$-reliable mean-excess travel time (METT) over a Wasserstein ball, which is centered at the empirical distribution of the sample dataset and the ball radius quantifies the level of its confidence. In sharp contrast to the existing DRSP models, our model is equivalently reformulated as a tractable mixed 0-1 convex problem, e.g., 0-1 linear program or 0-1 second-order cone program. Moreover, we also explicitly derive the distribution achieving the worst-case METT by simply perturbing each sample. Experiments demonstrate the advantages of our DRSP model in terms of the out-of-sample performance and computational complexity. Finally, our DRSP model is easily extended to solve the DR bi-criteria shortest path problem and the minimum cost flow problem. △ Less

Submitted 17 November, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

arXiv:1901.07491 [pdf]

doi 10.1016/j.ress.2019.106547

Optimization of On-condition Thresholds for a System of Degrading Components with Competing Dependent Failure Processes

Authors: Sanling Song, Nooshin Yousefi, David W. Coit, Qianmei Feng

Abstract: An optimization model has been formulated and solved to determine on-condition failure thresholds and inspection intervals for multi-component systems with each component experiencing multiple failure processes due to simultaneous exposure to degradation and shock loads. In this new model, we consider on-condition maintenance optimization for systems of degrading components, which offers cost bene… ▽ More An optimization model has been formulated and solved to determine on-condition failure thresholds and inspection intervals for multi-component systems with each component experiencing multiple failure processes due to simultaneous exposure to degradation and shock loads. In this new model, we consider on-condition maintenance optimization for systems of degrading components, which offers cost benefits over time-based preventive maintenance or replace-on-failure policies. For systems of degrading components, this can be a particularly difficult problem because of the dependent degradation and dependent failure times. In previous research, preventive maintenance and periodic inspection models have been considered; however, for systems whose costs due to failure are high, it is prudent to avoid the event of failure, i.e., we should repair or replace the components or system before the failure happens. The determination of optimal on-condition thresholds for all components is effective to avoid failure and to minimize cost. Low on-condition thresholds can be inefficient because they waste components life, and high on-condition thresholds are risky because the components are prone to costly failure. In this paper, we formulated and solved a new optimization model to determine optimal on-condition thresholds and inspection intervals. In our model, when the system is inspected, all components are inspected at that time. An inspection interval may be optimal for one component, but might be undesirable for another component, so the optimization requires a compromise. The on-condition maintenance optimization model is demonstrated on several examples. △ Less

Submitted 22 January, 2019; originally announced January 2019.

arXiv:1812.05734 [pdf, other]

Graphs that are cospectral for the distance Laplacian

Authors: Boris Brimkov, Ken Duna, Leslie Hogben, Kate Lorenzen, Carolyn Reinhart, Sung-Yell Song, Mark Yarrow

Abstract: The distance matrix $\mathcal{D}(G)$ of a graph $G$ is the matrix containing the pairwise distances between vertices, and the distance Laplacian matrix is $\mathcal{D}^L(G)=T(G)-\mathcal{D}(G)$, where $T(G)$ is the diagonal matrix of row sums of $\mathcal{D}(G)$. We establish several general methods for producing $\mathcal{D}^L$-cospectral graphs that can be used to construct infinite families. We… ▽ More The distance matrix $\mathcal{D}(G)$ of a graph $G$ is the matrix containing the pairwise distances between vertices, and the distance Laplacian matrix is $\mathcal{D}^L(G)=T(G)-\mathcal{D}(G)$, where $T(G)$ is the diagonal matrix of row sums of $\mathcal{D}(G)$. We establish several general methods for producing $\mathcal{D}^L$-cospectral graphs that can be used to construct infinite families. We provide examples showing that various properties are not preserved by $\mathcal{D}^L$-cospectrality, including examples of $\mathcal{D}^L$-cospectral strongly regular and circulant graphs. We establish that the absolute values of coefficients of the distance Laplacian characteristic polynomial are decreasing, i.e., $|δ^L_{1}|\geq \dots \geq |δ^L_{n}|$ where $δ^L_{k}$ is the coefficient of $x^k$. △ Less

Submitted 13 December, 2018; originally announced December 2018.

Comments: 18 pages

MSC Class: 05C500; 05C12; 15A18; 15B57

arXiv:1803.05609 [pdf, other]

doi 10.1016/j.cels.2019.12.003

The key parameters that govern translation efficiency

Authors: Dan D. Erdmann-Pham, Khanh Dao Duc, Yun S. Song

Abstract: Translation of mRNA into protein is a fundamental yet complex biological process with multiple factors that can potentially affect its efficiency. Here, we study a stochastic model describing the traffic flow of ribosomes along the mRNA (namely, the inhomogeneous $\ell$-TASEP), and identify the key parameters that govern the overall rate of protein synthesis, sensitivity to initiation rate changes… ▽ More Translation of mRNA into protein is a fundamental yet complex biological process with multiple factors that can potentially affect its efficiency. Here, we study a stochastic model describing the traffic flow of ribosomes along the mRNA (namely, the inhomogeneous $\ell$-TASEP), and identify the key parameters that govern the overall rate of protein synthesis, sensitivity to initiation rate changes, and efficiency of ribosome usage. By analyzing a continuum limit of the model, we obtain closed-form expressions for stationary currents and ribosomal densities, which agree well with Monte Carlo simulations. Furthermore, we completely characterize the phase transitions in the system, and by applying our theoretical results, we formulate design principles that detail how to tune the key parameters we identified to optimize translation efficiency. Using ribosome profiling data from S. cerevisiae, we shows that its translation system is generally consistent with these principles. Our theoretical results have implications for evolutionary biology, as well as synthetic biology. △ Less

Submitted 16 January, 2020; v1 submitted 15 March, 2018; originally announced March 2018.

Comments: To appear in Cell Systems. 32 pages, 10 figures, 1 table

arXiv:1712.05035 [pdf, other]

Geometry of the sample frequency spectrum and the perils of demographic inference

Authors: Zvi Rosen, Anand Bhaskar, Sebastien Roch, Yun S. Song

Abstract: The sample frequency spectrum (SFS), which describes the distribution of mutant alleles in a sample of DNA sequences, is a widely used summary statistic in population genetics. The expected SFS has a strong dependence on the historical population demography and this property is exploited by popular statistical methods to infer complex demographic histories from DNA sequence data. Most, if not all,… ▽ More The sample frequency spectrum (SFS), which describes the distribution of mutant alleles in a sample of DNA sequences, is a widely used summary statistic in population genetics. The expected SFS has a strong dependence on the historical population demography and this property is exploited by popular statistical methods to infer complex demographic histories from DNA sequence data. Most, if not all, of these inference methods exhibit pathological behavior, however. Specifically, they often display runaway behavior in optimization, where the inferred population sizes and epoch durations can degenerate to 0 or diverge to infinity, and show undesirable sensitivity of the inferred demography to perturbations in the data. The goal of this paper is to provide theoretical insights into why such problems arise. To this end, we characterize the geometry of the expected SFS for piecewise-constant demographic histories and use our results to show that the aforementioned pathological behavior of popular inference methods is intrinsic to the geometry of the expected SFS. We provide explicit descriptions and visualizations for a toy model with sample size 4, and generalize our intuition to arbitrary sample sizes n using tools from convex and algebraic geometry. We also develop a universal characterization result which shows that the expected SFS of a sample of size n under an arbitrary population history can be recapitulated by a piecewise-constant demography with only k(n) epochs, where k(n) is between n/2 and 2n-1. The set of expected SFS for piecewise-constant demographies with fewer than k(n) epochs is open and non-convex, which causes the above phenomena for inference from data. △ Less

Submitted 13 December, 2017; originally announced December 2017.

Comments: 21 pages, 5 figures

MSC Class: 92D10; 14P10; 52A20; 62P10

arXiv:1711.02944 [pdf, ps, other]

Multi-dimensional BSDEs whose terminal values are bounded and have bounded Malliavin derivatives

Authors: Shiqi Song

Abstract: We consider a class of multi-dimensional BSDEs on a finite time horizon (containing in particular Lipschitzian-quadratic BSDEs), whose terminal values are bounded as well as their corresponding Malliavin derivatives. We prove two results. The first one is an exponential integrability condition which determines when a BSDE in this class has a solution up to a given time horizon. In the second resul… ▽ More We consider a class of multi-dimensional BSDEs on a finite time horizon (containing in particular Lipschitzian-quadratic BSDEs), whose terminal values are bounded as well as their corresponding Malliavin derivatives. We prove two results. The first one is an exponential integrability condition which determines when a BSDE in this class has a solution up to a given time horizon. In the second result, via an ordinary differential equation, we compute a minimum horizon up to which any BSDE of this class has a solution. The combination of these two results leads to a new scheme to solve quadratic BSDEs. △ Less

Submitted 30 August, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

Comments: In this third arXiv version, the Introduction has been rewritten and new applications have been added to better illustrate the main results

arXiv:1703.06616 [pdf, ps, other]

Hall universal group has ample generic automorphisms

Authors: Shichang Song

Abstract: We show that the automorphism group of Philip Hall's universal locally finite group has ample generics,that is, it admits comeager diagonal conjugacy classes in all dimensions.Consequently, it has the small index property, is not the union of a countable chain of non-open subgroups, and has the automatic continuity property. Also, we discuss some algebraic and topological properties of the automor… ▽ More We show that the automorphism group of Philip Hall's universal locally finite group has ample generics,that is, it admits comeager diagonal conjugacy classes in all dimensions.Consequently, it has the small index property, is not the union of a countable chain of non-open subgroups, and has the automatic continuity property. Also, we discuss some algebraic and topological properties of the automorphism group of Hall universal group. For example, we show that every generic automorphism of Hall universal group is conjugate to all of its powers, and hence has roots of all orders. △ Less

Submitted 19 October, 2017; v1 submitted 20 March, 2017; originally announced March 2017.

MSC Class: 20D45 (Primary); 03E15; 20F28; 22A05 (Secondary)

arXiv:1701.05986 [pdf, other]

Distributed Random-Fixed Projected Algorithm for Constrained Optimization Over Digraphs

Authors: Pei Xie, Keyou You, Shiji Song, Cheng Wu

Abstract: This paper is concerned with a constrained optimization problem over a directed graph (digraph) of nodes, in which the cost function is a sum of local objectives, and each node only knows its local objective and constraints. To collaboratively solve the optimization, most of the existing works require the interaction graph to be balanced or "doubly-stochastic", which is quite restrictive and not n… ▽ More This paper is concerned with a constrained optimization problem over a directed graph (digraph) of nodes, in which the cost function is a sum of local objectives, and each node only knows its local objective and constraints. To collaboratively solve the optimization, most of the existing works require the interaction graph to be balanced or "doubly-stochastic", which is quite restrictive and not necessary as shown in this paper. We focus on an epigraph form of the original optimization to resolve the "unbalanced" problem, and design a novel two-step recursive algorithm with a simple structure. Under strongly connected digraphs, we prove that each node asymptotically converges to some common optimal solution. Finally, simulations are performed to illustrate the effectiveness of the proposed algorithms. △ Less

Submitted 21 January, 2017; originally announced January 2017.

Comments: arXiv admin note: substantial text overlap with arXiv:1612.09029

arXiv:1612.09029 [pdf, ps, other]

Distributed Convex Optimization with Inequality Constraints over Time-varying Unbalanced Digraphs

Authors: Pei Xie, Keyou You, Roberto Tempo, Shiji Song, Cheng Wu

Abstract: This paper considers a distributed convex optimization problem with inequality constraints over time-varying unbalanced digraphs, where the cost function is a sum of local objectives, and each node of the graph only knows its local objective and inequality constraints. Although there is a vast literature on distributed optimization, most of them require the graph to be balanced, which is quite res… ▽ More This paper considers a distributed convex optimization problem with inequality constraints over time-varying unbalanced digraphs, where the cost function is a sum of local objectives, and each node of the graph only knows its local objective and inequality constraints. Although there is a vast literature on distributed optimization, most of them require the graph to be balanced, which is quite restrictive and not necessary. Very recently, the unbalanced problem has been resolved only for either time-invariant graphs or unconstrained optimization. This work addresses the unbalancedness by focusing on an epigraph form of the constrained optimization. A striking feature is that this novel idea can be easily used to study time-varying unbalanced digraphs. Under local communications, a simple iterative algorithm is then designed for each node. We prove that if the graph is uniformly jointly strongly connected, each node asymptotically converges to some common optimal solution. △ Less

Submitted 28 December, 2016; originally announced December 2016.

arXiv:1611.04762 [pdf, ps, other]

Stochastic Source Seeking with Forward and Angular Velocity Regulation

Authors: **biao Lin, Shiji Song, Keyou You, Miroslav Krstic

Abstract: This paper studies a stochastic extremum seeking method to steer a nonholonomic vehicle to the unknown source of a static spatially distributed filed in a plane. The key challenge lies in the lack of vehicle's position information and the distribution of the scalar field. Different from the existing stochastic strategy that keeps the forward velocity constant and controls only the angular velocity… ▽ More This paper studies a stochastic extremum seeking method to steer a nonholonomic vehicle to the unknown source of a static spatially distributed filed in a plane. The key challenge lies in the lack of vehicle's position information and the distribution of the scalar field. Different from the existing stochastic strategy that keeps the forward velocity constant and controls only the angular velocity, we design a stochastic extremum seeking controller to regulate both forward and angular velocities simultaneously in this work. Thus, the vehicle decelerates near the source and stays within a small area as if it comes to a full stop, which solves the overshoot problem in the constant forward velocity case. We use the stochastic averaging theory to prove the local exponential convergence, both almost surely and in probability, to a small neighborhood near the source for elliptical level sets. Finally, simulations are included to illustrate the theoretical results. △ Less

Submitted 15 November, 2016; originally announced November 2016.

Comments: 12 pages, 8 figures

arXiv:1610.09745 [pdf, ps, other]

doi 10.4208/jms.v55n3.22.03

A new method for computing the expected hitting time between arbitrary different configurations of the multiple-urn Ehrenfest model

Authors: Sai Song, Qiang Yao

Abstract: We study a multiple-urn version of the Ehrenfest model. In this setting, we denote the n urns by Urn 1 to Urn n, where n>=2. Initially, M balls are randomly placed in the n urns. At each subsequent step, a ball is selected and put into the other n-1 urns with equal probability. The expected hitting time leading to a change of the M balls' status is computed using the method of stop** times. As a… ▽ More We study a multiple-urn version of the Ehrenfest model. In this setting, we denote the n urns by Urn 1 to Urn n, where n>=2. Initially, M balls are randomly placed in the n urns. At each subsequent step, a ball is selected and put into the other n-1 urns with equal probability. The expected hitting time leading to a change of the M balls' status is computed using the method of stop** times. As a corollary, we obtain the expected hitting time of moving all the M balls from Urn 1 to Urn 2. This proves a conjecture which was recently made in Chen et al.(2017). △ Less

Submitted 26 June, 2021; v1 submitted 30 October, 2016; originally announced October 2016.

Comments: 14 pages

MSC Class: 60C05; 60J10

Journal ref: Journal of Mathematical Study, Vol. 55, No. 3, pp. 254-270 (2022)

arXiv:1610.07946 [pdf, ps, other]

An idelic view of ideals

Authors: Shin Eui Song

Abstract: Ideles and adeles can be viewed as a generalization of Minkowski theory, in which embedding of a number field to the Cartesian product of its completions at the archimedean valuation is generalized to an embedding of the Cartesian product of all its completions with some restriction. This paper introduces the basic notions of point-set topology and builds the real numbers from the rational numbers… ▽ More Ideles and adeles can be viewed as a generalization of Minkowski theory, in which embedding of a number field to the Cartesian product of its completions at the archimedean valuation is generalized to an embedding of the Cartesian product of all its completions with some restriction. This paper introduces the basic notions of point-set topology and builds the real numbers from the rational numbers. Then we review concepts from local fields that will lead to the product formula and the approximation theorem. We, then, construct adeles and ideles. The ideles modulo $k^*$ maps surjectively to the ideal class group, and the compactness of $C_S^0$ will give rise to an alternative proof to the Dirichlet's S-unit theorem and the finiteness of ideal class group. The paper assumes that the reader is familiar with Dedekind domain, principal ideal domain, unique factorization of Dedekind domain, and field norm and traces. The first few parts of Chapter 1 in Neukirch's Algebraic Number Theory will be sufficient for the paper. △ Less

Submitted 9 September, 2018; v1 submitted 24 October, 2016; originally announced October 2016.

arXiv:1609.06911 [pdf, other]

On the Wiener index, distance cospectrality and transmission regular graphs

Authors: Aida Abiad, Boris Brimkov, Aysel Erey, Lorinda Leshock, Xavier Martínez-Rivera, Suil O, Sung-Yell Song, Jason Williford

Abstract: In this paper, we investigate various algebraic and graph theoretic properties of the distance matrix of a graph. Two graphs are $D$-cospectral if their distance matrices have the same spectrum. We construct infinite pairs of $D$-cospectral graphs with different diameter and different Wiener index. A graph is $k$-transmission-regular if its distance matrix has constant row sum equal to $k$. We est… ▽ More In this paper, we investigate various algebraic and graph theoretic properties of the distance matrix of a graph. Two graphs are $D$-cospectral if their distance matrices have the same spectrum. We construct infinite pairs of $D$-cospectral graphs with different diameter and different Wiener index. A graph is $k$-transmission-regular if its distance matrix has constant row sum equal to $k$. We establish tight upper and lower bounds for the row sum of a $k$-transmission-regular graph in terms of the number of vertices of the graph. Finally, we determine the Wiener index and its complexity for linear $k$-trees, and obtain a closed form for the Wiener index of block-clique graphs in terms of the Laplacian eigenvalues of the graph. The latter leads to a generalization of a result for trees which was proved independently by Mohar and Merris. △ Less

Submitted 3 July, 2017; v1 submitted 22 September, 2016; originally announced September 2016.

MSC Class: 05C50; 05C12; 94C15

Journal ref: Discrete Applied Mathematics, 230 (2017), 1-10

Showing 1–50 of 107 results for author: Song, S