-
Contractible Subgraphs of Contraction Critically Quasi $5$-Connected Graphs
Authors:
Shuai Kou,
Chengfu Qin,
Weihua Yang
Abstract:
Let $G$ be a contraction critically quasi $5$-connected graph on at least $14$ vertices. If there is a vertex $x\in V_{4}(G)$ such that $G[N_{G}(x)]\cong K_{1,3}$ or $G[N_{G}(x)]\cong C_{4}$, then $G$ has a quasi $5$-contractible subgraph $H$ such that $0<\|V(H)\|<4$.
Let $G$ be a contraction critically quasi $5$-connected graph on at least $14$ vertices. If there is a vertex $x\in V_{4}(G)$ such that $G[N_{G}(x)]\cong K_{1,3}$ or $G[N_{G}(x)]\cong C_{4}$, then $G$ has a quasi $5$-contractible subgraph $H$ such that $0<\|V(H)\|<4$.
△ Less
Submitted 19 June, 2022;
originally announced July 2022.
-
Contextual Information-Directed Sampling
Authors:
Botao Hao,
Tor Lattimore,
Chao Qin
Abstract:
Information-directed sampling (IDS) has recently demonstrated its potential as a data-efficient reinforcement learning algorithm. However, it is still unclear what is the right form of information ratio to optimize when contextual information is available. We investigate the IDS design through two contextual bandit problems: contextual bandits with graph feedback and sparse linear contextual bandi…
▽ More
Information-directed sampling (IDS) has recently demonstrated its potential as a data-efficient reinforcement learning algorithm. However, it is still unclear what is the right form of information ratio to optimize when contextual information is available. We investigate the IDS design through two contextual bandit problems: contextual bandits with graph feedback and sparse linear contextual bandits. We provably demonstrate the advantage of contextual IDS over conditional IDS and emphasize the importance of considering the context distribution. The main message is that an intelligent agent should invest more on the actions that are beneficial for the future unseen contexts while the conditional IDS can be myopic. We further propose a computationally-efficient version of contextual IDS based on Actor-Critic and evaluate it empirically on a neural network contextual bandit.
△ Less
Submitted 9 June, 2022; v1 submitted 22 May, 2022;
originally announced May 2022.
-
Optimal Best Arm Identification in Two-Armed Bandits with a Fixed Budget under a Small Gap
Authors:
Masahiro Kato,
Kaito Ariu,
Masaaki Imaizumi,
Masahiro Nomura,
Chao Qin
Abstract:
We consider fixed-budget best-arm identification in two-armed Gaussian bandit problems. One of the longstanding open questions is the existence of an optimal strategy under which the probability of misidentification matches a lower bound. We show that a strategy following the Neyman allocation rule (Neyman, 1934) is asymptotically optimal when the gap between the expected rewards is small. First,…
▽ More
We consider fixed-budget best-arm identification in two-armed Gaussian bandit problems. One of the longstanding open questions is the existence of an optimal strategy under which the probability of misidentification matches a lower bound. We show that a strategy following the Neyman allocation rule (Neyman, 1934) is asymptotically optimal when the gap between the expected rewards is small. First, we review a lower bound derived by Kaufmann et al. (2016). Then, we propose the "Neyman Allocation (NA)-Augmented Inverse Probability weighting (AIPW)" strategy, which consists of the sampling rule using the Neyman allocation with an estimated standard deviation and the recommendation rule using an AIPW estimator. Our proposed strategy is optimal because the upper bound matches the lower bound when the budget goes to infinity and the gap goes to zero.
△ Less
Submitted 28 December, 2022; v1 submitted 12 January, 2022;
originally announced January 2022.
-
A Majorized-Generalized Alternating Direction Method of Multipliers for Convex Composite Programming
Authors:
Congying Qin,
Yunhai Xiao,
Peili Li
Abstract:
The linearly constrained convex composite programming problems whose objective function contains two blocks with each block being the form of nonsmooth+smooth arises frequently in multiple fields of applications. If both of the smooth terms are quadratic, this problem can be solved efficiently by using the symmetric Gaussian-Seidel (sGS) technique based proximal alternating direction method of mul…
▽ More
The linearly constrained convex composite programming problems whose objective function contains two blocks with each block being the form of nonsmooth+smooth arises frequently in multiple fields of applications. If both of the smooth terms are quadratic, this problem can be solved efficiently by using the symmetric Gaussian-Seidel (sGS) technique based proximal alternating direction method of multipliers (ADMM). However, in the non-quadratic case, the sGS technique can not be used any more, which leads to the separable structure of nonsmooth+smooth had to be ignored. In this paper, we present a generalized ADMM and particularly use a majorization technique to make the corresponding subproblems more amenable to efficient computations. Under some appropriate conditions, we prove its global convergence for the relaxation factor in $(0,2)$. We apply the algorithm to solve a kind of simulated convex composite optimization problems and a type of sparse inverse covariance matrix estimation problems which illustrates that the effectiveness of the algorithm are obvious.
△ Less
Submitted 24 November, 2021;
originally announced November 2021.
-
Motor-imagery classification model for brain-computer interface: a sparse group filter bank representation model
Authors:
Cancheng Li,
Chuanbo Qin,
**g Fang
Abstract:
Background: Common spatial pattern (CSP) has been widely used for feature extraction in the case of motor imagery (MI) electroencephalogram (EEG) recordings and in MI classification of brain-computer interface (BCI) applications. BCI usually requires relatively long EEG data for reliable classifier training. More specifically, before using general spatial patterns for feature extraction, a trainin…
▽ More
Background: Common spatial pattern (CSP) has been widely used for feature extraction in the case of motor imagery (MI) electroencephalogram (EEG) recordings and in MI classification of brain-computer interface (BCI) applications. BCI usually requires relatively long EEG data for reliable classifier training. More specifically, before using general spatial patterns for feature extraction, a training dictionary from two different classes is used to construct a compound dictionary matrix, and the representation of the test samples in the filter band is estimated as a linear combination of the columns in the dictionary matrix. New method: To alleviate the problem of sparse small sample (SS) between frequency bands. We propose a novel sparse group filter bank model (SGFB) for motor imagery in BCI system. Results: We perform a task by representing residuals based on the categories corresponding to the non-zero correlation coefficients. Besides, we also perform joint sparse optimization with constrained filter bands in three different time windows to extract robust CSP features in a multi-task learning framework. To verify the effectiveness of our model, we conduct an experiment on the public EEG dataset of BCI competition to compare it with other competitive methods. Comparison with existing methods: Decent classification performance for different subbands confirms that our algorithm is a promising candidate for improving MI-based BCI performance.
△ Less
Submitted 27 August, 2021;
originally announced August 2021.
-
On Kostant's weight $q$-multiplicity formula for $\mathfrak{sp}_6(\mathbb{C})$
Authors:
Pamela E. Harris,
Peter Hollander,
Daniel C. Qin,
Maria Rodriguez-Hertz
Abstract:
Kostant's weight $q$-multiplicity formula is an alternating sum over a finite group known as the Weyl group, whose terms involve the $q$-analog of Kostant's partition function. The $q$-analog of the partition function is a polynomial-valued function defined by $\wp_q(ξ)=\sum_{i=0}^k c_i q^i$, where $c_i$ is the number of ways the weight $ξ$ can be written as a sum of exactly $i$ positive roots of…
▽ More
Kostant's weight $q$-multiplicity formula is an alternating sum over a finite group known as the Weyl group, whose terms involve the $q$-analog of Kostant's partition function. The $q$-analog of the partition function is a polynomial-valued function defined by $\wp_q(ξ)=\sum_{i=0}^k c_i q^i$, where $c_i$ is the number of ways the weight $ξ$ can be written as a sum of exactly $i$ positive roots of a Lie algebra $\mathfrak{g}$. The evaluation of the $q$-multiplicity formula at $q = 1$ recovers the multiplicity of a weight in an irreducible highest weight representation of $\mathfrak{g}$. In this paper, we specialize to the Lie algebra $\mathfrak{sp}_6(\mathbb{C})$ and we provide a closed formula for the $q$-analog of Kostant's partition function, which extends recent results of Shahi, Refaghat, and Marefat. We also describe the supporting sets of the multiplicity formula (known as the Weyl alternation sets of $\mathfrak{sp}_6(\mathbb{C})$), and use these results to provide a closed formula for the $q$-multiplicity for any pair of dominant integral weights of $\mathfrak{sp}_6(\mathbb{C})$. Throughout this work, we provide code to facilitate these computations.
△ Less
Submitted 16 August, 2021;
originally announced August 2021.
-
Efficiently Solving High-Order and Nonlinear ODEs with Rational Fraction Polynomial: the Ratio Net
Authors:
Chenxin Qin,
Ruhao Liu,
Maocai Li,
Shengyuan Li,
Yi Liu,
Chichun Zhou
Abstract:
Recent advances in solving ordinary differential equations (ODEs) with neural networks have been remarkable. Neural networks excel at serving as trial functions and approximating solutions within functional spaces, aided by gradient backpropagation algorithms. However, challenges remain in solving complex ODEs, including high-order and nonlinear cases, emphasizing the need for improved efficiency…
▽ More
Recent advances in solving ordinary differential equations (ODEs) with neural networks have been remarkable. Neural networks excel at serving as trial functions and approximating solutions within functional spaces, aided by gradient backpropagation algorithms. However, challenges remain in solving complex ODEs, including high-order and nonlinear cases, emphasizing the need for improved efficiency and effectiveness. Traditional methods have typically relied on established knowledge integration to improve problem-solving efficiency. In contrast, this study takes a different approach by introducing a new neural network architecture for constructing trial functions, known as ratio net. This architecture draws inspiration from rational fraction polynomial approximation functions, specifically the Pade approximant. Through empirical trials, it demonstrated that the proposed method exhibits higher efficiency compared to existing approaches, including polynomial-based and multilayer perceptron (MLP) neural network-based methods. The ratio net holds promise for advancing the efficiency and effectiveness of solving differential equations.
△ Less
Submitted 31 January, 2024; v1 submitted 18 May, 2021;
originally announced May 2021.
-
A note on $Oct_{1}^{+}$-free graphs and $Oct_{2}^{+}$-free graphs
Authors:
Wenjian Jia,
Shuai Kou,
Chengfu Qin,
Weihua Yang
Abstract:
Let $Oct_{1}^{+}$ and $Oct_{2}^{+}$ be the planar and non-planar graphs that obtained from the Octahedron by 3-splitting a vertex respectively. For $Oct_{1}^{+}$, we prove that a 4-connected graph is $Oct_{1}^{+}$-free if and only if it is $C_{6}^{2}$, $C_{2k+1}^{2}$ $(k \geq 2)$ or it is obtained from $C_{5}^{2}$ by repeatedly 4-splitting vertices. We also show that a planar graph is…
▽ More
Let $Oct_{1}^{+}$ and $Oct_{2}^{+}$ be the planar and non-planar graphs that obtained from the Octahedron by 3-splitting a vertex respectively. For $Oct_{1}^{+}$, we prove that a 4-connected graph is $Oct_{1}^{+}$-free if and only if it is $C_{6}^{2}$, $C_{2k+1}^{2}$ $(k \geq 2)$ or it is obtained from $C_{5}^{2}$ by repeatedly 4-splitting vertices. We also show that a planar graph is $Oct_{1}^{+}$-free if and only if it is constructed by repeatedly taking 0-, 1-, 2-sums starting from $\{K_{1}, K_{2} ,K_{3}\} \cup \mathscr{K} \cup \{Oct,L_{5} \}$, where $\mathscr{K}$ is the set of graphs obtained by repeatedly taking the special 3-sums of $K_{4}$. For $Oct_{2}^{+}$, we prove that a 4-connected graph is $Oct_{2}^{+}$-free if and only if it is planar, $C_{2k+1}^{2}$ $(k \geq 2)$, $L(K_{3,3})$ or it is obtained from $C_{5}^{2}$ by repeatedly 4-splitting vertices.
△ Less
Submitted 1 February, 2021;
originally announced February 2021.
-
Adam revisited: a weighted past gradients perspective
Authors:
Hui Zhong,
Zaiyi Chen,
Chuan Qin,
Zai Huang,
Vincent W. Zheng,
Tong Xu,
Enhong Chen
Abstract:
Adaptive learning rate methods have been successfully applied in many fields, especially in training deep neural networks. Recent results have shown that adaptive methods with exponential increasing weights on squared past gradients (i.e., ADAM, RMSPROP) may fail to converge to the optimal solution. Though many algorithms, such as AMSGRAD and ADAMNC, have been proposed to fix the non-convergence i…
▽ More
Adaptive learning rate methods have been successfully applied in many fields, especially in training deep neural networks. Recent results have shown that adaptive methods with exponential increasing weights on squared past gradients (i.e., ADAM, RMSPROP) may fail to converge to the optimal solution. Though many algorithms, such as AMSGRAD and ADAMNC, have been proposed to fix the non-convergence issues, achieving a data-dependent regret bound similar to or better than ADAGRAD is still a challenge to these methods. In this paper, we propose a novel adaptive method weighted adaptive algorithm (WADA) to tackle the non-convergence issues. Unlike AMSGRAD and ADAMNC, we consider using a milder growing weighting strategy on squared past gradient, in which weights grow linearly. Based on this idea, we propose weighted adaptive gradient method framework (WAGMF) and implement WADA algorithm on this framework. Moreover, we prove that WADA can achieve a weighted data-dependent regret bound, which could be better than the original regret bound of ADAGRAD when the gradients decrease rapidly. This bound may partially explain the good performance of ADAM in practice. Finally, extensive experiments demonstrate the effectiveness of WADA and its variants in comparison with several variants of ADAM on training convex problems and deep neural networks.
△ Less
Submitted 1 January, 2021;
originally announced January 2021.
-
Strengthened chain theorems for different versions of 4-connectivity
Authors:
Guoli Ding,
Chengfu Qin
Abstract:
The chain theorem of Tutte states that every 3-connected graph can be constructed from a wheel $W_n$ by repeatedly adding edges and splitting vertices. It is not difficult to prove the following strengthening of this theorem: every non-wheel 3-connected graph can be constructed from $W_4$ by repeatedly adding edges and splitting vertices. In this paper we similarly strengthen several chain theorem…
▽ More
The chain theorem of Tutte states that every 3-connected graph can be constructed from a wheel $W_n$ by repeatedly adding edges and splitting vertices. It is not difficult to prove the following strengthening of this theorem: every non-wheel 3-connected graph can be constructed from $W_4$ by repeatedly adding edges and splitting vertices. In this paper we similarly strengthen several chain theorems for various versions of 4-connectivity.
△ Less
Submitted 27 December, 2020;
originally announced December 2020.
-
On the solvability of regular subgroups in the holomorph of a finite solvable group
Authors:
Cindy Tsang,
Chao Qin
Abstract:
We exhibit infinitely many natural numbers $n$ for which there exists at least one insolvable group of order $n$, and yet the holomorph of any solvable group of order $n$ has no insolvable regular subgroup. We also solve Problem 19.90 (d) in the Kourovka notebook.
We exhibit infinitely many natural numbers $n$ for which there exists at least one insolvable group of order $n$, and yet the holomorph of any solvable group of order $n$ has no insolvable regular subgroup. We also solve Problem 19.90 (d) in the Kourovka notebook.
△ Less
Submitted 2 September, 2019; v1 submitted 29 January, 2019;
originally announced January 2019.
-
Skeleton-stabilized ImmersoGeometric Analysis for incompressible viscous flow problems
Authors:
Tuong Hoang,
Clemens V. Verhoosel,
Chao-Zhong Qin,
Ferdinando Auricchio,
Alessandro Reali,
E. Harald van Brummelen
Abstract:
A Skeleton-stabilized ImmersoGeometric Analysis technique is proposed for incompressible viscous flow problems with moderate Reynolds number. The proposed formulation fits within the framework of the finite cell method, where essential boundary conditions are imposed weakly using a Nitsche-type method. The key idea of the proposed formulation is to stabilize the jumps of high-order derivatives of…
▽ More
A Skeleton-stabilized ImmersoGeometric Analysis technique is proposed for incompressible viscous flow problems with moderate Reynolds number. The proposed formulation fits within the framework of the finite cell method, where essential boundary conditions are imposed weakly using a Nitsche-type method. The key idea of the proposed formulation is to stabilize the jumps of high-order derivatives of variables over the skeleton of the background mesh. The formulation allows the use of identical finite-dimensional spaces for the approximation of the pressure and velocity fields in immersed domains. The stability issues observed for inf-sup stable discretizations of immersed incompressible flow problems are avoided with this formulation. For B-spline basis functions of degree $k$ with highest regularity, only the derivative of order $k$ has to be controlled, which requires specification of only a single stabilization parameter for the pressure field. The Stokes and Navier-Stokes equations are studied numerically in two and three dimensions using various immersed test cases. Oscillation-free solutions and high-order optimal convergence rates can be obtained. The formulation is shown to be stable even in limit cases where almost every elements of the physical domain is cut, and hence it does not require the existence of interior cells. In terms of the sparsity pattern, the algebraic system has a considerably smaller stencil than counterpart approaches based on Lagrange basis functions. This important property makes the proposed skeleton-stabilized technique computationally practical. To demonstrate the stability and robustness of the method, we perform a simulation of fluid flow through a porous medium, of which the geometry is directly extracted from 3D $μ{CT}$ scan data.
△ Less
Submitted 19 July, 2018;
originally announced July 2018.
-
Weighted sum formulas of multiple zeta values with even arguments
Authors:
Zhonghua Li,
Chen Qin
Abstract:
We obtain a weighted sum formula of the zeta values at even arguments, and a weighted sum formula of the multiple zeta values with even arguments and its zeta-star analogue. The weight coefficients are given by (symmetric) polynomials of the arguments. These weighted sum formulas for the zeta values and for the multiple zeta values were conjectured by L. Guo, P. Lei and J. Zhao.
We obtain a weighted sum formula of the zeta values at even arguments, and a weighted sum formula of the multiple zeta values with even arguments and its zeta-star analogue. The weight coefficients are given by (symmetric) polynomials of the arguments. These weighted sum formulas for the zeta values and for the multiple zeta values were conjectured by L. Guo, P. Lei and J. Zhao.
△ Less
Submitted 31 October, 2018; v1 submitted 20 December, 2016;
originally announced December 2016.
-
Some relations deduced from regularized double shuffle relations of multiple zeta values
Authors:
Zhonghua Li,
Chen Qin
Abstract:
It is conjectured that the regularized double shuffle relations give all algebraic relations among the multiple zeta values, and hence all other algebraic relations should be deduced from the regularized double shuffle relations. In this paper, we provide as many as the relations which can be derived from the regularized double shuffle relations, for example, the weighted sum formula of L. Guo and…
▽ More
It is conjectured that the regularized double shuffle relations give all algebraic relations among the multiple zeta values, and hence all other algebraic relations should be deduced from the regularized double shuffle relations. In this paper, we provide as many as the relations which can be derived from the regularized double shuffle relations, for example, the weighted sum formula of L. Guo and B. Xie, some evaluation formulas with even arguments and the restricted sum formulas of M. E. Hoffman and their generalizations.
△ Less
Submitted 14 August, 2019; v1 submitted 18 October, 2016;
originally announced October 2016.
-
The social network model on infinite graphs
Authors:
Jonathan Hermon,
Ben Morris,
Chuan Qin,
Allan Sly
Abstract:
Given an infinite connected regular graph $G=(V,E)$, place at each vertex Pois($λ$) walkers performing independent lazy simple random walks on $G$ simultaneously. When two walkers visit the same vertex at the same time they are declared to be acquainted. We show that when $G$ is vertex-transitive and amenable, for all $λ>0$ a.s. any pair of walkers will eventually have a path of acquaintances betw…
▽ More
Given an infinite connected regular graph $G=(V,E)$, place at each vertex Pois($λ$) walkers performing independent lazy simple random walks on $G$ simultaneously. When two walkers visit the same vertex at the same time they are declared to be acquainted. We show that when $G$ is vertex-transitive and amenable, for all $λ>0$ a.s. any pair of walkers will eventually have a path of acquaintances between them. In contrast, we show that when $G$ is non-amenable (not necessarily transitive) there is always a phase transition at some $λ_{c}(G)>0$. We give general bounds on $λ_{c}(G)$ and study the case that $G$ is the $d$-regular tree in more details. Finally, we show that in the non-amenable setup, for every $λ$ there exists a finite time $t_λ(G)$ such that a.s. there exists an infinite set of walkers having a path of acquaintances between them by time $t_λ(G)$.
△ Less
Submitted 22 June, 2019; v1 submitted 13 October, 2016;
originally announced October 2016.
-
Some relations of interpolated multiple zeta values
Authors:
Zhonghua Li,
Chen Qin
Abstract:
In this paper, the extended double shuffle relations for interpolated multiple zeta values are established. As an application, Hoffman's relations for interpolated multiple zeta values are proved. Furthermore, a generating function for sums of interpolated multiple zeta values of fixed weight, depth and height is represented by hypergeometric functions, and we discuss some special cases.
In this paper, the extended double shuffle relations for interpolated multiple zeta values are established. As an application, Hoffman's relations for interpolated multiple zeta values are proved. Furthermore, a generating function for sums of interpolated multiple zeta values of fixed weight, depth and height is represented by hypergeometric functions, and we discuss some special cases.
△ Less
Submitted 29 March, 2017; v1 submitted 16 August, 2016;
originally announced August 2016.
-
Stuffle product formulas of multiple zeta values
Authors:
Zhonghua Li,
Chen Qin
Abstract:
We obtain recursive formulas for the stuffle product of multiple zeta values and of multiple zeta-star values. Then we apply the formulas to prove several stuffle product formulas with one or two strings of $z_p$'s. We also describe how to use our formulas in general cases.
We obtain recursive formulas for the stuffle product of multiple zeta values and of multiple zeta-star values. Then we apply the formulas to prove several stuffle product formulas with one or two strings of $z_p$'s. We also describe how to use our formulas in general cases.
△ Less
Submitted 4 September, 2017; v1 submitted 28 March, 2016;
originally announced March 2016.
-
Shuffle product formulas of multiple zeta values
Authors:
Zhonghua Li,
Chen Qin
Abstract:
Using the combinatorial description of shuffle product, we prove or reformulate several shuffle product formulas of multiple zeta values, including a general formula of the shuffle product of two multiple zeta values, some restricted shuffle product formulas of the product of two multiple zeta values, and a restricted shuffle product formula of the product of $n$ multiple zeta values.
Using the combinatorial description of shuffle product, we prove or reformulate several shuffle product formulas of multiple zeta values, including a general formula of the shuffle product of two multiple zeta values, some restricted shuffle product formulas of the product of two multiple zeta values, and a restricted shuffle product formula of the product of $n$ multiple zeta values.
△ Less
Submitted 6 September, 2016; v1 submitted 18 March, 2016;
originally announced March 2016.
-
Improved bounds for the mixing time of the random-to-random insertion shuffle
Authors:
Ben Morris,
Chuan Qin
Abstract:
We prove an upper bound of $1.5324 n \log n$ for the mixing time of the random-to-random insertion shuffle, improving on the best known upper bound of $2 n \log n$. Our proof is based on the analysis of a non-Markovian coupling.
We prove an upper bound of $1.5324 n \log n$ for the mixing time of the random-to-random insertion shuffle, improving on the best known upper bound of $2 n \log n$. Our proof is based on the analysis of a non-Markovian coupling.
△ Less
Submitted 28 November, 2014;
originally announced December 2014.