-
Symplectic structure, product structures and complex structures on Leibniz algebras
Authors:
Rong Tang,
Nanyan Xu,
Yunhe Sheng
Abstract:
In this paper, a symplectic structure on a Leibniz algebra is defined to be a {\em symmetric} nondegenerate bilinear form satisfying a certain compatibility condition, and a phase space of a Leibniz algebra is defined to be a symplectic Leibniz algebra satisfying certain conditions. We show that a Leibniz algebra has a phase space if and only if there is a compatible Leibniz-dendriform algebra, and that phase spaces of Leibniz algebras correspond one-to-one to Manin triples of Leibniz-dendriform algebras. Product (paracomplex) structures and complex structures on Leibniz algebras are studied in terms of decompositions of Leibniz algebras. A para-Kähler structure on a Leibniz algebra is defined to be a symplectic structure and a paracomplex structure satisfying a compatibility condition. We show that a symplectic Leibniz algebra admits a para-Kähler structure if and only if the Leibniz algebra is the direct sum of two isotropic subalgebras as vector spaces. A complex product structure on a Leibniz algebra consists of a complex structure and a product structure satisfying a compatibility condition. A pseudo-Kähler structure on a Leibniz algebra is defined to be a symplectic structure and a complex structure satisfying a compatibility condition. Various properties and relations of complex product structures and pseudo-Kähler structures are studied. In particular, Leibniz-dendriform algebras give rise to complex product structures and pseudo-Kähler structures naturally.
Submitted 2 October, 2023;
originally announced October 2023.
-
Normalized solutions for the Choquard equation with mass supercritical nonlinearity
Authors:
Na Xu,
Shiwang Ma
Abstract:
We consider the nonlinear Choquard equation $$\begin{cases} -Δu = (I_α \ast F(u))F'(u) - μu \quad \text{in}\ \mathbb{R}^N, \\ u \in H^1(\mathbb{R}^N), \quad \int_{\mathbb{R}^N} |u|^2\,dx = m, \end{cases}$$ where $α\in(0,N)$, $m>0$ is prescribed, $μ\in \mathbb{R}$ is a Lagrange multiplier, and $I_α$ is the Riesz potential.
Under general assumptions on the nonlinearity $F$, we prove the existence and multiplicity of normalized solutions.
Submitted 27 December, 2022; v1 submitted 26 October, 2022;
originally announced October 2022.
-
The Distribution of Error Terms of Smoothed Summatory Totient Functions
Authors:
Sanjana Das,
Hannah Lang,
Hamilton Wan,
Nancy Xu
Abstract:
We consider the summatory function of the totient function after applications of a suitable smoothing operator and study the limiting behavior of the associated error term. Under several conditional assumptions, we show that the smoothed error term possesses a limiting logarithmic distribution through a framework consolidated by Akbary--Ng--Shahabi. To obtain this result, we prove a truncated version of Perron's inversion formula for arbitrary Riesz typical means. We conclude with a conditional proof that at least two applications of the smoothing operator are necessary and sufficient to bound the growth of the error term by $\sqrt{x}$.
Submitted 15 July, 2022;
originally announced July 2022.
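As a rough illustration of the object studied above, the following Python sketch computes the unsmoothed summatory totient function $Φ(x)=\sum_{n\le x} φ(n)$ and its error against the classical main term $3x^2/π^2$. It does not implement the smoothing operator or the Riesz means used in the paper, and the function names are hypothetical.

```python
import math


def totients_up_to(limit):
    """Return a list phi with phi[n] = Euler's totient of n for 0 <= n <= limit."""
    phi = list(range(limit + 1))
    for p in range(2, limit + 1):
        if phi[p] == p:  # still untouched, so p is prime
            for multiple in range(p, limit + 1, p):
                phi[multiple] -= phi[multiple] // p
    return phi


def summatory_totient(x):
    """Unsmoothed summatory function: sum of phi(n) for 1 <= n <= x."""
    return sum(totients_up_to(x)[1:])


def totient_error(x):
    """Error term against the classical main term 3 x^2 / pi^2."""
    return summatory_totient(x) - 3 * x * x / math.pi ** 2
```

Evaluating `totient_error` over a range of `x` gives a feel for the fluctuations whose smoothed analogue the abstract describes.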
-
Distributions of Hook Lengths Divisible by Two or Three
Authors:
Hannah Lang,
Hamilton Wan,
Nancy Xu
Abstract:
For fixed $t = 2$ or $3$, we investigate the statistical properties of $\{Y_t(n)\}$, the sequence of random variables corresponding to the number of hook lengths divisible by $t$ among the partitions of $n$. We characterize the support of $Y_t(n)$ and show, in accordance with empirical observations, that the support is vanishingly small for large $n$. Moreover, we demonstrate that the nonzero values of the mass functions of $Y_2(n)$ and $Y_3(n)$ approximate continuous functions. Finally, we prove that although the mass functions fail to converge, the cumulative distribution functions of $\{Y_2(n)\}$ and $\{Y_3(n)\}$ converge pointwise to shifted Gamma distributions, completing a characterization initiated by Griffin--Ono--Tsai for $t \geq 4$.
Submitted 11 November, 2022; v1 submitted 13 July, 2022;
originally announced July 2022.
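To make the random variable $Y_t(n)$ concrete, here is a minimal brute-force sketch in Python that enumerates the partitions of $n$, computes the hook lengths of each Young diagram, and tallies how many are divisible by $t$. It is only practical for small $n$, it is not the method of the paper, and all function names are hypothetical.

```python
from collections import Counter


def partitions(n, max_part=None):
    """Yield the partitions of n as non-increasing tuples."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, max_part), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest


def hook_lengths(partition):
    """Return the hook lengths of all cells of the Young diagram."""
    if not partition:
        return []
    # conjugate[j] = number of rows that extend past column j
    conjugate = [sum(1 for part in partition if part > j)
                 for j in range(partition[0])]
    # hook = arm + leg + 1 with 0-indexed row i and column j
    return [partition[i] - j + conjugate[j] - i - 1
            for i in range(len(partition))
            for j in range(partition[i])]


def hook_count_distribution(n, t):
    """Map k to the number of partitions of n with exactly k hook lengths divisible by t."""
    counts = Counter()
    for partition in partitions(n):
        counts[sum(1 for h in hook_lengths(partition) if h % t == 0)] += 1
    return counts
```

For example, every one of the five partitions of 4 has exactly two even hook lengths, so `hook_count_distribution(4, 2)` has a single key; this gives a small-scale view of the sparse support mentioned in the abstract.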
-
The Distribution of $k$-Free Effective Divisors and the Summatory Totient Function in Function Fields
Authors:
Sanjana Das,
Hannah Lang,
Hamilton Wan,
Nancy Xu
Abstract:
Motivated by the study of the summatory $k$-free indicator and totient functions in the classical setting, we investigate their function field analogues. First, we derive an expression for the error terms of the summatory functions in terms of the zeros of the associated zeta function. Under the Linear Independence hypothesis, we explicitly construct the limiting distributions of these error terms and compute the frequency with which they occur in an interval $[-β, β]$ for a real $β> 0$. We also show that these error terms are unbiased, that is, they are positive and negative equally often. Finally, we examine the average behavior of these error terms across families of hyperelliptic curves of fixed genus. We obtain these results by following a general framework initiated by Cha and Humphries.
Submitted 25 July, 2023; v1 submitted 30 June, 2022;
originally announced July 2022.
-
A New Sequential Optimality Condition of Cardinality-Constrained Optimization Problems and Application
Authors:
Li** Pang,
Menglong Xue,
Na Xu
Abstract:
In this paper, we consider cardinality-constrained optimization problems and propose a new sequential optimality condition for the continuous relaxation reformulation, which has recently become popular. It is stronger than the existing results and is still a first-order necessary condition for cardinality-constrained problems without any additional assumptions. Meanwhile, we provide a problem-tailored weaker constraint qualification, which can guarantee that points satisfying the new sequential condition are Mordukhovich-type stationary points. On the other hand, we improve the theoretical results of the augmented Lagrangian algorithm. Under the same condition as the existing results, we prove that any feasible accumulation point of the iterative sequence generated by the algorithm satisfies the new sequential optimality condition. Furthermore, the algorithm can converge to the Mordukhovich-type (essentially strong) stationary point if the problem-tailored constraint qualification is satisfied.
Submitted 4 October, 2021;
originally announced October 2021.
-
Isoparametric hypersurfaces induced by navigation in Lorentz Finsler geometry
Authors:
Ming Xu,
Ju Tan,
Na Xu
Abstract:
Using a navigation process with the datum $(F,V)$, in which $F$ is a Finsler metric and the smooth tangent vector field $V$ satisfies $F(-V(x))>1$ everywhere, a Lorentz Finsler metric $\tilde{F}$ can be induced. Isoparametric functions and isoparametric hypersurfaces with or without involving a smooth measure can be defined for $\tilde{F}$. When the vector field $V$ in the navigation datum is homothetic, we prove the local correspondences between isoparametric functions and isoparametric hypersurfaces before and after this navigation process. Using these correspondences, we provide some examples of isoparametric functions and isoparametric hypersurfaces on a Funk space of Lorentz Randers type.
Submitted 11 June, 2021; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Analytics and Machine Learning in Vehicle Routing Research
Authors:
Ruibin Bai,
Xinan Chen,
Zhi-Long Chen,
Tianxiang Cui,
Shuhui Gong,
Wentao He,
** Jiang,
Huan **,
Jiahuan **,
Graham Kendall,
Jiawei Li,
Zheng Lu,
Jianfeng Ren,
Paul Weng,
Ning Xue,
Huayan Zhang
Abstract:
The Vehicle Routing Problem (VRP) is one of the most intensively studied combinatorial optimisation problems for which numerous models and algorithms have been proposed. To tackle the complexities, uncertainties and dynamics involved in real-world VRP applications, Machine Learning (ML) methods have been used in combination with analytical approaches to enhance problem formulations and algorithmic performance across different problem solving scenarios. However, the relevant papers are scattered in several traditional research fields with very different, sometimes confusing, terminologies. This paper presents a first, comprehensive review of hybrid methods that combine analytical techniques with ML tools in addressing VRP problems. Specifically, we review the emerging research streams on ML-assisted VRP modelling and ML-assisted VRP optimisation. We conclude that ML can be beneficial in enhancing VRP modelling, and improving the performance of algorithms for both online and offline VRP optimisations. Finally, challenges and future opportunities of VRP research are discussed.
Submitted 19 February, 2021;
originally announced February 2021.
-
Ehrhart-Equivalence, Equidecomposability, and Unimodular Equivalence of Integral Polytopes
Authors:
Fiona Abney-McPeek,
Sanket Biswas,
Senjuti Dutta,
Yongyuan Huang,
Deyuan Li,
Nancy Xu
Abstract:
Ehrhart polynomials are extensively-studied structures that interpolate the discrete volume of the dilations of integral $n$-polytopes. The coefficients of Ehrhart polynomials, however, are still not fully understood, and it is not known when two polytopes have equivalent Ehrhart polynomials. In this paper, we establish a relationship between Ehrhart-equivalence and other forms of equivalence: the $\operatorname{GL}_n(\mathbb{Z})$-equidecomposability and unimodular equivalence of two integral $n$-polytopes in $\mathbb{R}^n$. We conjecture that any two Ehrhart-equivalent integral $n$-polytopes $P,Q\subset\mathbb{R}^n$ are $\operatorname{GL}_n(\mathbb{Z})$-equidecomposable into $\frac{1}{(n-1)!}$-th unimodular simplices, thereby generalizing the known cases of $n=1, 2, 3$. We also create an algorithm to check for unimodular equivalence of any two integral $n$-simplices in $\mathbb{R}^n$. We then find and prove a new one-to-one correspondence between unimodular equivalence of integral $2$-simplices and the unimodular equivalence of their $n$-dimensional pyramids. Finally, we prove the existence of integral $n$-simplices in $\mathbb{R}^n$ that are not unimodularly equivalent for all $n \ge 2$.
Submitted 21 January, 2021;
originally announced January 2021.
-
A Hybrid Pricing and Cutting Approach for the Multi-Shift Full Truckload Vehicle Routing Problem
Authors:
Ning Xue,
Ruibin Bai,
Rong Qu,
Uwe Aickelin
Abstract:
Full truckload transportation (FTL) in the form of freight containers represents one of the most important transportation modes in international trade. Due to large volume and scale, in FTL, delivery time is often less critical but cost and service quality are crucial. Therefore, efficiently solving large scale multiple shift FTL problems is becoming more and more important and requires further research. In one of our earlier studies, a set covering model and a three-stage solution method were developed for a multi-shift FTL problem. This paper extends the previous work and presents a significantly more efficient approach by hybridising pricing and cutting strategies with metaheuristics (a variable neighbourhood search and a genetic algorithm). The metaheuristics were adopted to find promising columns (vehicle routes) guided by pricing, and cuts are dynamically generated to eliminate infeasible flow assignments caused by incompatible commodities. Computational experiments on real-life and artificial benchmark FTL problems showed superior performance in terms of both computational time and solution quality, when compared with previous MIP based three-stage methods and two existing metaheuristics. The proposed cutting and heuristic pricing approach can efficiently solve large scale real-life FTL problems.
Submitted 2 December, 2020;
originally announced December 2020.
-
Essential Norms of difference of generalized composition Operators from $α$-Bloch spaces to $β$-Bloch spaces
Authors:
Ning Xu,
Ze-Hua Zhou
Abstract:
In this paper, we study the boundedness and essential norms of the differences of two generalized composition operators acting from $α$-Bloch space to $β$-Bloch space on the open unit disk. From essential norms, we get the compactness of the differences of two generalized composition operators. This study has a relationship to the topological structure of generalized composition operators acting from $α$-Bloch space to $β$-Bloch space.
Submitted 2 April, 2020;
originally announced April 2020.
-
Difference of Weighted Composition Operators from $α$-Bloch Spaces to $β$-Bloch Spaces
Authors:
Ning Xu,
Ze-Hua Zhou
Abstract:
In this paper, we study the boundedness and compactness of the differences of two weighted composition operators acting from $α$-Bloch space to $β$-Bloch space on the open unit disk. This study has a relationship to the topological structure of weighted composition operators from $α$-Bloch space to $β$-Bloch space.
Submitted 2 April, 2020;
originally announced April 2020.
-
Realizing Artin-Schreier Covers with Minimal $a$-numbers in Positive Characteristic
Authors:
Fiona Abney-McPeek,
Hugo Berg,
Jeremy Booher,
Sun Mee Choi,
Viktor Fukala,
Miroslav Marinov,
Theo Müller,
Paweł Narkiewicz,
Rachel Pries,
Nancy Xu,
Andrew Yuan
Abstract:
Suppose $X$ is a smooth projective connected curve defined over an algebraically closed field of characteristic $p>0$ and $B \subset X$ is a finite, possibly empty, set of points. Booher and Cais determined a lower bound for the $a$-number of a $\mathbf{Z}/p \mathbf{Z}$-cover of $X$ with branch locus $B$. For odd primes $p$, in most cases it is not known if this lower bound is realized. In this note, when $X$ is ordinary, we use formal patching to reduce that question to a computational question about $a$-numbers of $\mathbf{Z}/p\mathbf{Z}$-covers of the affine line. As an application, when $p=3$ or $p=5$, for any ordinary curve $X$ and any choice of $B$, we prove that the lower bound is realized for Artin-Schreier covers of $X$ with branch locus $B$.
Submitted 5 December, 2022; v1 submitted 19 March, 2020;
originally announced March 2020.
-
Adaptive RBF Interpolation for Estimating Missing Values in Geographical Data
Authors:
Kaifeng Gao,
Gang Mei,
Salvatore Cuomo,
Francesco Piccialli,
Nengxiong Xu
Abstract:
The quality of datasets is a critical issue in big data mining. More interesting things could be mined from datasets with higher quality. The existence of missing values in geographical data would worsen the quality of big datasets. To improve the data quality, the missing values generally need to be estimated using various machine learning algorithms or mathematical methods such as approximations and interpolations. In this paper, we propose an adaptive Radial Basis Function (RBF) interpolation algorithm for estimating missing values in geographical data. In the proposed method, the samples with known values are considered as the data points, while the samples with missing values are considered as the interpolated points. For each interpolated point, first, a local set of data points is adaptively determined. Then, the missing value of the interpolated point is imputed using RBF interpolation based on the local set of data points. Moreover, the shape factors of the RBF are also adaptively determined by considering the distribution of the local set of data points. To evaluate the performance of the proposed method, we compare our method with the commonly used k Nearest Neighbors (kNN) interpolation and Adaptive Inverse Distance Weighted (AIDW) methods, and conduct three groups of benchmark experiments. Experimental results indicate that the proposed method outperforms the kNN interpolation and AIDW in terms of accuracy, but performs worse than the kNN interpolation and AIDW in terms of efficiency.
Submitted 10 August, 2019;
originally announced August 2019.
-
GeoMFree3D: An Under-Development Meshfree Software Package for Geomechanics
Authors:
Gang Mei,
Nengxiong Xu,
Liangliang Xu,
Yazhe Li
Abstract:
This paper briefly reports on GeoMFree3D, a meshfree / meshless software package designed for analyzing the problems of large deformations and crack propagations of rock and soil masses in geotechnics. GeoMFree3D is developed based on the meshfree RPIM, and accelerated by exploiting parallel computing on multi-core CPU and many-core GPU. GeoMFree3D is currently under intensive development. To demonstrate the correctness and effectiveness of GeoMFree3D, several simple verification examples are presented in this paper. Moreover, future work on the development of GeoMFree3D is introduced.
Submitted 11 February, 2018;
originally announced February 2018.
-
The reducibility of quasi-periodic linear Hamiltonian systems and its application to Hill's equation
Authors:
Nina Xue,
Xiong Li
Abstract:
In this paper, we consider the reducibility of the quasi-periodic linear Hamiltonian system $$\dot{x}=(A+\varepsilon Q(t))x, $$ where $A$ is a constant matrix with possibly multiple eigenvalues, $Q(t)$ is analytic quasi-periodic with respect to $t$, and $\varepsilon$ is a sufficiently small parameter. Under some non-resonant conditions, it is proved that, for most sufficiently small $\varepsilon$, the Hamiltonian system can be reduced to a constant coefficient Hamiltonian system by means of a quasi-periodic symplectic change of variables with the same basic frequencies as $Q(t)$. An application to the quasi-periodic Hill's equation is also given.
Submitted 18 June, 2017;
originally announced June 2017.
-
The stability of equilibrium solutions of periodic Hamiltonian systems in the case of degeneracy
Authors:
Nina Xue,
Xiong Li
Abstract:
In this paper we are concerned with the stability of equilibrium solutions of periodic Hamiltonian systems with one degree of freedom in the case of degeneracy, which means that the characteristic exponents of the linearized system have zero real part, and the high order terms must be considered to solve the stability problem. For almost all degenerate cases, sufficient conditions for the stability and instability are obtained.
Submitted 30 May, 2017;
originally announced May 2017.
-
The linearization of periodic Hamiltonian systems with one degree of freedom under the Diophantine condition
Authors:
Nina Xue,
Xiong Li
Abstract:
In this paper we are concerned with the periodic Hamiltonian system with one degree of freedom, where the origin is a trivial solution. We assume that the corresponding linearized system at the origin is elliptic, and that the characteristic exponents of the linearized system are $\pm iω$ with $ω$ a Diophantine number. Under these assumptions, if the system is formally linearizable, then it is analytically linearizable. As a result, the origin is always stable in the sense of Liapunov in this case.
Submitted 24 May, 2017;
originally announced May 2017.
-
$\left( β, \varpi \right)$-stability for cross-validation and the choice of the number of folds
Authors:
Ning Xu,
Jian Hong,
Timothy C. G. Fisher
Abstract:
In this paper, we introduce a new concept of stability for cross-validation, called the $\left( β, \varpi \right)$-stability, and use it as a new perspective to build the general theory for cross-validation. The $\left( β, \varpi \right)$-stability mathematically connects the generalization ability and the stability of the cross-validated model via the Rademacher complexity. Our result reveals mathematically the effect of cross-validation from two sides: on one hand, cross-validation picks the model with the best empirical generalization ability by validating all the alternatives on test sets; on the other hand, cross-validation may compromise the stability of the model selection by causing subsampling error. Moreover, the difference between training and test errors in the $q$-th round, sometimes referred to as the generalization error, might be autocorrelated in $q$. Guided by the ideas above, the $\left( β, \varpi \right)$-stability helps us derive a new class of Rademacher bounds, referred to as the one-round/convoluted Rademacher bounds, for the stability of cross-validation in both the i.i.d.\ and non-i.i.d.\ cases. For both light-tail and heavy-tail losses, the new bounds quantify the stability of the one-round/average test error of the cross-validated model in terms of its one-round/average training error, the sample size $n$, the number of folds $K$, the tail property of the loss (encoded as Orlicz-$Ψ_ν$ norms) and the Rademacher complexity of the model class $Λ$. The new class of bounds not only quantitatively reveals the stability of the generalization ability of the cross-validated model, but also shows empirically the optimal choice for the number of folds $K$, at which the upper bound of the one-round/average test error is lowest, or, to put it another way, where the test error is most stable.
Submitted 5 July, 2017; v1 submitted 20 May, 2017;
originally announced May 2017.
-
Computation of real-valued basis functions which transform as irreducible representations of the polyhedral groups
Authors:
Nan Xu,
Peter C. Doerschuk
Abstract:
Basis functions which are invariant under the operations of a rotational point group $G$ are able to describe any 3-D object which exhibits the rotational point group symmetry. However, in order to characterize the spatial statistics of an ensemble of objects in which each object is different but the statistics exhibit the symmetry, a complete set of basis functions is required. In particular, for each irreducible representation (irrep) of $G$, it is necessary to include basis functions that transform according to that irrep. This complete set of basis functions is a basis for square-integrable functions on the surface of the sphere in 3-D. Because the objects are real-valued, it is convenient to have real-valued basis functions. In this paper, the existence of such real-valued bases is proven and an algorithm for their computation is provided for the icosahedral $I$ and the octahedral $O$ symmetries. Furthermore, it is proven that such a real-valued basis does not exist for the tetrahedral $T$ symmetry because some irreps of $T$ are essentially complex. The importance of these basis functions to computations in single-particle cryo-electron microscopy is described.
Submitted 6 March, 2021; v1 submitted 3 January, 2017;
originally announced January 2017.
-
Real basis functions of polyhedral groups
Authors:
Nan Xu
Abstract:
The basis of the identity representation of a polyhedral group is able to describe functions with symmetries of a Platonic solid, i.e., 3-D objects which geometrically obey the cubic symmetries. However, to describe the dynamics of ensembles of heterogeneous 3-D structures, a situation in which each object lacks the symmetries but obeys the symmetries on a level of statistics, the basis of all representations of a group is required. While those 3-D objects are often transformed to real functions on $L_2$ space, it is desirable to generate a complete basis on real space. This paper deduces the existence of a basis on real space for each polyhedral group, and introduces a novel approach to explicitly compute these real basis functions, whose properties are further explored.
Submitted 20 November, 2016;
originally announced November 2016.
-
Generalization error minimization: a new approach to model evaluation and selection with an application to penalized regression
Authors:
Ning Xu,
Jian Hong,
Timothy C. G. Fisher
Abstract:
We study model evaluation and model selection from the perspective of generalization ability (GA): the ability of a model to predict outcomes in new samples from the same population. We believe that GA is one way to formally address concerns about the external validity of a model. The GA of a model estimated on a sample can be measured by its empirical out-of-sample errors, called the generalization errors (GE). We derive upper bounds for the GE, which depend on sample sizes, model complexity and the distribution of the loss function. The upper bounds can be used to evaluate the GA of a model, ex ante. We propose using generalization error minimization (GEM) as a framework for model selection. Using GEM, we are able to unify a big class of penalized regression estimators, including lasso, ridge and bridge, under the same set of assumptions. We establish finite-sample and asymptotic properties (including $\mathcal{L}_2$-consistency) of the GEM estimator for both the $n \geqslant p$ and the $n < p$ cases. We also derive the $\mathcal{L}_2$-distance between the penalized and corresponding unpenalized regression estimates. In practice, GEM can be implemented by validation or cross-validation. We show that the GE bounds can be used for selecting the optimal number of folds in $K$-fold cross-validation. We propose a variant of $R^2$, the $GR^2$, as a measure of GA, which considers both in-sample and out-of-sample goodness of fit. Simulations are used to demonstrate our key results.
Submitted 18 October, 2016;
originally announced October 2016.
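The abstract above notes that GEM can be implemented in practice by validation or cross-validation. The following is a minimal sketch of $K$-fold cross-validation used to pick a ridge penalty, assuming a no-intercept model with a single predictor so that the estimator has a closed form. The function names, the synthetic data, and the penalty grid are illustrative assumptions, not taken from the paper, and the sketch does not reproduce the paper's GE bounds.

```python
import random


def ridge_fit_1d(xs, ys, lam):
    """Closed-form ridge slope for a no-intercept, single-predictor model."""
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    return sxy / (sxx + lam)


def kfold_cv_error(xs, ys, lam, k):
    """Mean squared out-of-sample error, averaged over k folds."""
    n = len(xs)
    fold_errors = []
    for fold in range(k):
        # Assign observation i to fold (i mod k); hold that fold out.
        test_idx = [i for i in range(n) if i % k == fold]
        train_idx = [i for i in range(n) if i % k != fold]
        beta = ridge_fit_1d(
            [xs[i] for i in train_idx], [ys[i] for i in train_idx], lam
        )
        mse = sum((ys[i] - beta * xs[i]) ** 2 for i in test_idx) / len(test_idx)
        fold_errors.append(mse)
    return sum(fold_errors) / k


# Synthetic data (assumption): y = 2x + noise.
rng = random.Random(0)
xs = [rng.gauss(0, 1) for _ in range(60)]
ys = [2.0 * x + rng.gauss(0, 0.5) for x in xs]

# Hypothetical penalty grid; choose the value with the smallest CV error.
lambdas = [0.0, 0.1, 1.0, 10.0, 100.0]
cv_errors = {lam: kfold_cv_error(xs, ys, lam, k=5) for lam in lambdas}
best_lam = min(cv_errors, key=cv_errors.get)
```

Here `k=5` is an arbitrary choice; the abstract's point is that the GE bounds can guide the choice of the number of folds, which this sketch leaves fixed.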
-
Finite-sample and asymptotic analysis of generalization ability with an application to penalized regression
Authors:
Ning Xu,
Jian Hong,
Timothy C. G. Fisher
Abstract:
In this paper, we study the performance of extremum estimators from the perspective of generalization ability (GA): the ability of a model to predict outcomes in new samples from the same population. By adapting the classical concentration inequalities, we derive upper bounds on the empirical out-of-sample prediction errors as a function of the in-sample errors, in-sample data size, heaviness in the tails of the error distribution, and model complexity. We show that the error bounds may be used for tuning key estimation hyper-parameters, such as the number of folds $K$ in cross-validation. We also show how $K$ affects the bias-variance trade-off for cross-validation. We demonstrate that the $\mathcal{L}_2$-norm difference between the penalized and the corresponding unpenalized regression estimates is directly explained by the GA of the estimates and the GA of empirical moment conditions. Lastly, we prove that all penalized regression estimates are $\mathcal{L}_2$-consistent for both the $n \geqslant p$ and the $n < p$ cases. Simulations are used to demonstrate key results.
Keywords: generalization ability, upper bound of generalization error, penalized regression, cross-validation, bias-variance trade-off, $\mathcal{L}_2$ difference between penalized and unpenalized regression, lasso, high-dimensional data.
Submitted 13 September, 2016; v1 submitted 12 September, 2016;
originally announced September 2016.
-
$L^p$ $(p\geq 1)$ solutions of multidimensional BSDEs with monotone generators in general time intervals
Authors:
Lishun Xiao,
Shengjun Fan,
Na Xu
Abstract:
In this paper, we are interested in solving general time interval multidimensional backward stochastic differential equations in $L^p$ $(p\geq 1)$. We first study the existence and uniqueness for $L^p$ $(p>1)$ solutions by the method of convolution and weak convergence when the generator is monotonic in $y$ and Lipschitz continuous in $z$ both non-uniformly with respect to $t$. Then we obtain the existence and uniqueness for $L^1$ solutions with an additional assumption that the generator has a sublinear growth in $z$ non-uniformly with respect to $t$.
Submitted 27 September, 2013;
originally announced September 2013.
-
The Modified Direct Method: an Approach for Smoothing Planar and Surface Meshes
Authors:
Gang Mei,
John C. Tipper,
Nengxiong Xu
Abstract:
The Modified Direct Method (MDM) is an iterative mesh smoothing method for smoothing planar and surface meshes, which is developed from the non-iterative smoothing method originated by Balendran [1]. When smoothing planar meshes, the performance of the MDM is effectively identical to that of Laplacian smoothing for triangular and quadrilateral meshes; however, the MDM outperforms Laplacian smoothing for tri-quad meshes. When smoothing surface meshes, for triangular, quadrilateral and quad-dominant mixed meshes, the mean quality (MQ) of all mesh elements always increases and the mean square error (MSE) decreases during smoothing; for tri-dominant mixed meshes, the quality of triangles always decreases while that of quads increases. Test examples show that the MDM is convergent for both planar and surface triangular, quadrilateral and tri-quad meshes.
Submitted 13 December, 2012;
originally announced December 2012.
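The abstract above uses Laplacian smoothing as its baseline for planar meshes. For orientation, here is a minimal sketch of that baseline only (not of the MDM itself, whose update rule is not given in the abstract): each free vertex is moved to the average position of its neighbours, while boundary vertices stay fixed. The function name, the data layout, and the 3x3 test grid are illustrative assumptions.

```python
def laplacian_smooth(points, neighbors, fixed, iterations=1):
    """Plain Laplacian smoothing of a planar mesh.

    points:    dict vertex -> (x, y)
    neighbors: dict vertex -> list of adjacent vertices
    fixed:     set of vertices that must not move (e.g. the boundary)
    """
    pts = dict(points)
    for _ in range(iterations):
        new_pts = {}
        for v, (x, y) in pts.items():
            if v in fixed:
                new_pts[v] = (x, y)
                continue
            nbrs = neighbors[v]
            # Move the vertex to the centroid of its neighbours,
            # using positions from the previous iteration.
            new_pts[v] = (
                sum(pts[u][0] for u in nbrs) / len(nbrs),
                sum(pts[u][1] for u in nbrs) / len(nbrs),
            )
        pts = new_pts
    return pts


# A 3x3 grid of vertices; the centre vertex is displaced on purpose.
points = {(i, j): (float(i), float(j)) for i in range(3) for j in range(3)}
points[(1, 1)] = (1.4, 0.7)
neighbors = {(1, 1): [(0, 1), (2, 1), (1, 0), (1, 2)]}
fixed = {v for v in points if v != (1, 1)}

smoothed = laplacian_smooth(points, neighbors, fixed)
```

In this toy grid only the centre vertex is free, so a single pass returns it to the centroid of its four neighbours.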
-
Products of radial derivative and integral-type operators from Zygmund spaces to Bloch spaces
Authors:
Ning Xu
Abstract:
Let $H(\mathbb{B})$ denote the space of all holomorphic functions on the unit ball $\mathbb{B}\subset \mathbb{C}^n$. In this paper we investigate the boundedness and compactness, between Zygmund spaces and Bloch spaces, of the products of the radial derivative operator and the following integral-type operator $$ I_φ^g f(z)=\int_0^1 \Re f(φ(tz))g(tz)\frac{dt}{t},\ z\in\mathbb{B}, $$ where $g\in H(\mathbb{B})$, $g(0)=0$, and $φ$ is a holomorphic self-map of $\mathbb{B}$.
Submitted 22 November, 2011;
originally announced November 2011.
-
Connectivity of Direct Products of Graphs
Authors:
Wei Wang,
Ni-Ni Xue
Abstract:
Let $κ(G)$ be the connectivity of $G$ and $G\times H$ the direct product of $G$ and $H$. We prove that for any graphs $G$ and $K_n$ with $n\ge 3$, $κ(G\times K_n)=\min\{nκ(G),(n-1)δ(G)\}$, which was conjectured by Guji and Vumar.
Submitted 25 February, 2011;
originally announced February 2011.
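The formula in the abstract above can be checked by brute force on very small graphs. The sketch below builds the direct product $G\times K_n$, computes its vertex connectivity by trying every vertex subset, and compares the result with $\min\{nκ(G),(n-1)δ(G)\}$. All function names are illustrative assumptions, and the exhaustive search is only practical for a handful of vertices; it is a sanity check, not the proof technique of the paper.

```python
from itertools import combinations


def complete_graph(n):
    """Adjacency sets of K_n on vertices 0..n-1."""
    return {i: {j for j in range(n) if j != i} for i in range(n)}


def direct_product(adj_g, adj_h):
    """(u, i) is adjacent to (v, j) iff u ~ v in G and i ~ j in H."""
    return {
        (u, i): {(v, j) for v in adj_g[u] for j in adj_h[i]}
        for u in adj_g
        for i in adj_h
    }


def is_connected(adj, removed):
    """Depth-first search on the graph with the `removed` vertices deleted."""
    remaining = [v for v in adj if v not in removed]
    seen = {remaining[0]}
    stack = [remaining[0]]
    while stack:
        v = stack.pop()
        for w in adj[v]:
            if w not in removed and w not in seen:
                seen.add(w)
                stack.append(w)
    return len(seen) == len(remaining)


def connectivity(adj):
    """Smallest k such that deleting some k vertices disconnects the graph
    or leaves a single vertex (exhaustive search, small graphs only)."""
    vertices = list(adj)
    for k in range(len(vertices)):
        for subset in combinations(vertices, k):
            removed = set(subset)
            if len(vertices) - k == 1 or not is_connected(adj, removed):
                return k
    return 0


def min_degree(adj):
    return min(len(nbrs) for nbrs in adj.values())


def formula(adj_g, n):
    """Right-hand side of the identity stated in the abstract."""
    return min(n * connectivity(adj_g), (n - 1) * min_degree(adj_g))


n = 3
results = {}
for name, g in {"K_2": complete_graph(2), "K_3": complete_graph(3)}.items():
    product = direct_product(g, complete_graph(n))
    results[name] = (connectivity(product), formula(g, n))
```

For $G=K_2$ the product $K_2\times K_3$ is a 6-cycle, so both sides equal 2; for $G=K_3$ both sides equal 4.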