Search | arXiv e-print repository

SymmPI: Predictive Inference for Data with Group Symmetries

Abstract: Quantifying the uncertainty of predictions is a core problem in modern statistics. Methods for predictive inference have been developed under a variety of assumptions, often -- for instance, in standard conformal prediction -- relying on the invariance of the distribution of the data under special groups of transformations such as permutation groups. Moreover, many existing methods for predictive… ▽ More Quantifying the uncertainty of predictions is a core problem in modern statistics. Methods for predictive inference have been developed under a variety of assumptions, often -- for instance, in standard conformal prediction -- relying on the invariance of the distribution of the data under special groups of transformations such as permutation groups. Moreover, many existing methods for predictive inference aim to predict unobserved outcomes in sequences of feature-outcome observations. Meanwhile, there is interest in predictive inference under more general observation models (e.g., for partially observed features) and for data satisfying more general distributional symmetries (e.g., rotationally invariant or coordinate-independent observations in physics). Here we propose SymmPI, a methodology for predictive inference when data distributions have general group symmetries in arbitrary observation models. Our methods leverage the novel notion of distributional equivariant transformations, which process the data while preserving their distributional invariances. We show that SymmPI has valid coverage under distributional invariance and characterize its performance under distribution shift, recovering recent results as special cases. We apply SymmPI to predict unobserved values associated to vertices in a network, where the distribution is unchanged under relabelings that keep the network structure unchanged. In several simulations in a two-layer hierarchical model, and in an empirical data analysis example, SymmPI performs favorably compared to existing methods. △ Less

Submitted 28 December, 2023; v1 submitted 26 December, 2023; originally announced December 2023.

Comments: 45 pages

arXiv:2311.12413 [pdf, ps, other]

On the calculation of upper variance under multiple probabilities

Authors: Xinpeng Li, Miao Yu, Shiyi Zheng

Abstract: The notion of upper variance under multiple probabilities is defined by a corresponding minimax optimization problem. This paper proposes a simple algorithm to solve the related minimax optimization problem exactly. As an application, we provide the probabilistic representation for a class of quadratic programming problems. The notion of upper variance under multiple probabilities is defined by a corresponding minimax optimization problem. This paper proposes a simple algorithm to solve the related minimax optimization problem exactly. As an application, we provide the probabilistic representation for a class of quadratic programming problems. △ Less

Submitted 21 November, 2023; originally announced November 2023.

Comments: 8 pages

arXiv:2308.02918 [pdf, other]

Spectral Ranking Inferences based on General Multiway Comparisons

Authors: Jianqing Fan, Zhipeng Lou, Weichen Wang, Mengxin Yu

Abstract: This paper studies the performance of the spectral method in the estimation and uncertainty quantification of the unobserved preference scores of compared entities in a general and more realistic setup. Specifically, the comparison graph consists of hyper-edges of possible heterogeneous sizes, and the number of comparisons can be as low as one for a given hyper-edge. Such a setting is pervasive in… ▽ More This paper studies the performance of the spectral method in the estimation and uncertainty quantification of the unobserved preference scores of compared entities in a general and more realistic setup. Specifically, the comparison graph consists of hyper-edges of possible heterogeneous sizes, and the number of comparisons can be as low as one for a given hyper-edge. Such a setting is pervasive in real applications, circumventing the need to specify the graph randomness and the restrictive homogeneous sampling assumption imposed in the commonly used Bradley-Terry-Luce (BTL) or Plackett-Luce (PL) models. Furthermore, in scenarios where the BTL or PL models are appropriate, we unravel the relationship between the spectral estimator and the Maximum Likelihood Estimator (MLE). We discover that a two-step spectral method, where we apply the optimal weighting estimated from the equal weighting vanilla spectral method, can achieve the same asymptotic efficiency as the MLE. Given the asymptotic distributions of the estimated preference scores, we also introduce a comprehensive framework to carry out both one-sample and two-sample ranking inferences, applicable to both fixed and random graph settings. It is noteworthy that this is the first time effective two-sample rank testing methods have been proposed. Finally, we substantiate our findings via comprehensive numerical simulations and subsequently apply our developed methodologies to perform statistical inferences for statistical journals and movie rankings. △ Less

Submitted 1 March, 2024; v1 submitted 5 August, 2023; originally announced August 2023.

Comments: 62 pages, 4 figures

arXiv:2307.16734 [pdf, other]

Stochastic Filtering of Reaction Networks Partially Observed in Time Snapshots

Authors: Muruhan Rathinam, Mingkai Yu

Abstract: Stochastic reaction network models arise in intracellular chemical reactions, epidemiological models and other population process models, and are a class of continuous time Markov chains which have the nonnegative integer lattice as state space. We consider the problem of estimating the conditional probability distribution of a stochastic reaction network given exact partial state observations in… ▽ More Stochastic reaction network models arise in intracellular chemical reactions, epidemiological models and other population process models, and are a class of continuous time Markov chains which have the nonnegative integer lattice as state space. We consider the problem of estimating the conditional probability distribution of a stochastic reaction network given exact partial state observations in time snapshots. We propose a particle filtering method called the targeting method. Our approach takes into account that the reaction counts in between two observation snapshots satisfy linear constraints and also uses inhomogeneous Poisson processes as proposals for the reaction counts to facilitate exact interpolation. We provide rigorous analysis as well as numerical examples to illustrate our method and compare it with other alternatives. △ Less

Submitted 31 July, 2023; originally announced July 2023.

MSC Class: 65C05; 65C35

arXiv:2306.08117 [pdf, ps, other]

Fiber 2-Functors and Tambara-Yamagami Fusion 2-Categories

Authors: Thibault D. Décoppet, Matthew Yu

Abstract: We introduce group-theoretical fusion 2-categories, a strong categorification of the notion of a group-theoretical fusion 1-category. Physically speaking, such fusion 2-categories arise by gauging subgroups of a global symmetry. We show that group-theoretical fusion 2-categories are completely characterized by the property that the braided fusion 1-category of endomorphisms of the monoidal unit is… ▽ More We introduce group-theoretical fusion 2-categories, a strong categorification of the notion of a group-theoretical fusion 1-category. Physically speaking, such fusion 2-categories arise by gauging subgroups of a global symmetry. We show that group-theoretical fusion 2-categories are completely characterized by the property that the braided fusion 1-category of endomorphisms of the monoidal unit is Tannakian. Then, we describe the underlying finite semisimple 2-category of group-theoretical fusion 2-categories, and, more generally, of certain 2-categories of bimodules. We also partially describe the fusion rules of group-theoretical fusion 2-categories, and investigate the group gradings of such fusion 2-categories. Using our previous results, we classify fusion 2-categories admitting a fiber 2-functor. Next, we study fusion 2-categories with a Tambara-Yamagami defect, that is $\mathbb{Z}/2$-graded fusion 2-categories whose non-trivially graded factor is $\mathbf{2Vect}$. We classify these fusion 2-categories, and examine more closely the more restrictive notion of Tambara-Yamagami fusion 2-categories. Throughout, we give many examples to illustrate our various results. △ Less

Submitted 13 June, 2023; originally announced June 2023.

MSC Class: 16D90; 18M20; 18M25; 18N10

arXiv:2306.01039 [pdf, other]

Semi-Chiral Operators in 4d ${\cal N}=1$ Gauge Theories

Authors: Kasia Budzik, Davide Gaiotto, Justin Kulp, Brian R. Williams, **gxiang Wu, Matthew Yu

Abstract: We discuss the properties of quarter-BPS local operators in four dimensional ${\cal N}=1$ supersymmetric Yang-Mills theory using the formalism of holomorphic twists. We study loop corrections both to the space of local operators and to algebraic operations which endow the twisted theory with an infinite symmetry algebra. We classify all single-trace quarter-BPS operators in the planar approximatio… ▽ More We discuss the properties of quarter-BPS local operators in four dimensional ${\cal N}=1$ supersymmetric Yang-Mills theory using the formalism of holomorphic twists. We study loop corrections both to the space of local operators and to algebraic operations which endow the twisted theory with an infinite symmetry algebra. We classify all single-trace quarter-BPS operators in the planar approximation for $SU(N)$ gauge theory and propose a holographic dual description for the twisted theory. We classify perturbative quarter-BPS operators in $SU(2)$ and $SU(3)$ gauge theories with sufficiently small quantum numbers and discuss possible non-perturbative corrections to the answer. We set up analogous calculations for some theories with matter. △ Less

Submitted 1 June, 2023; originally announced June 2023.

Comments: 55+20 pages, 61 footnotes, comments welcome

arXiv:2305.01678 [pdf, other]

Adams spectral sequences for non-vector-bundle Thom spectra

Authors: Arun Debray, Matthew Yu

Abstract: When $R$ is one of the spectra $\mathit{ku}$, $\mathit{ko}$, $\mathit{tmf}$, $\mathit{MTSpin}^c$, $\mathit{MTSpin}$, or $\mathit{MTString}$, there is a standard approach to computing twisted $R$-homology groups of a space $X$ with the Adams spectral sequence, by using a change-of-rings isomorphism to simplify the $E_2$-page. This approach requires the assumption that the twist comes from a vector… ▽ More When $R$ is one of the spectra $\mathit{ku}$, $\mathit{ko}$, $\mathit{tmf}$, $\mathit{MTSpin}^c$, $\mathit{MTSpin}$, or $\mathit{MTString}$, there is a standard approach to computing twisted $R$-homology groups of a space $X$ with the Adams spectral sequence, by using a change-of-rings isomorphism to simplify the $E_2$-page. This approach requires the assumption that the twist comes from a vector bundle, i.e. the twist map $X\to B\mathrm{GL}_1(R)$ factors through $B\mathrm{O}$. We show this assumption is unnecessary by working with Baker-Lazarev's Adams spectral sequence of $R$-modules and computing its $E_2$-page for a large class of twists of these spectra. We then work through two example computations motivated by anomaly cancellation for supergravity theories. △ Less

Submitted 2 May, 2023; originally announced May 2023.

Comments: 41 pages, comments welcome

arXiv:2303.09125 [pdf, ps, other]

The distribution of the cokernel of a polynomial evaluated at a random integral matrix

Authors: Gilyoung Cheong, Myungjun Yu

Abstract: Given a prime $p$, let $P(t)$ be a non-constant monic polynomial in $t$ over the ring $\mathbb{Z}_{p}$ of $p$-adic integers. Let $X_{n}$ be an $n \times n$ random matrix over $\mathbb{Z}_{p}$ with independent entries that lie in any residue class modulo $p$ with probability at most $1 - ε$ for a fixed real number $0 < ε< 1$. We prove that as $n \rightarrow \infty$, the distribution of the cokernel… ▽ More Given a prime $p$, let $P(t)$ be a non-constant monic polynomial in $t$ over the ring $\mathbb{Z}_{p}$ of $p$-adic integers. Let $X_{n}$ be an $n \times n$ random matrix over $\mathbb{Z}_{p}$ with independent entries that lie in any residue class modulo $p$ with probability at most $1 - ε$ for a fixed real number $0 < ε< 1$. We prove that as $n \rightarrow \infty$, the distribution of the cokernel $\mathrm{cok}(P(X_{n}))$ of $P(X_{n})$ converges to the distribution given by a finite product of some explicit measures that resemble Cohen--Lenstra measures. For example, the random matrix $X_{n}$ can be taken as a Haar-random matrix or a uniformly random $(0,1)$-matrix. We consider the distribution of $\mathrm{cok}(P(X_{n}))$ as a distribution of modules over $\mathbb{Z}_{p}[t]/(P(t))$, which gives us a clearer formulation in comparison to considering the distribution as that of abelian groups. For the proof, we first reduce our problem into a problem over $\mathbb{Z}/p^{k}\mathbb{Z}$, for large enough positive integer $k$, in place of $\mathbb{Z}_{p}$. Then we use a result of Sawin and Wood to reduce our problem into another problem of computing the limit of the expected number of surjective $(\mathbb{Z}/p^{k}\mathbb{Z})[t]/(P(t))$-linear maps from $\mathrm{cok}(P(X_{n}))$ modulo $p^{k}$ to a fixed finite size $(\mathbb{Z}/p^{k}\mathbb{Z})[t]/(P(t))$-module $G$. To estimate the expected number and compute the desired limit, we carefully adopt subtle techniques developed by Wood, which were originally used to compute the asymptotic distribution of the $p$-part of the sandpile group of a random graph. △ Less

Submitted 21 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

Comments: 22 pages. Proof of Lemma 5.5 had a subtle error, so we had to make various changes to correct the proof. We made changes to Sections 3.2, 5.3, and 6.2 for definitions of codes and depth; 3.2 has new technique: it explains how to throw away F's that do not contribute to the moment computation. Proof of Lemmas 6.6 and 6.8 have been changed

arXiv:2301.12095 [pdf, other]

MetaNO: How to Transfer Your Knowledge on Learning Hidden Physics

Authors: Lu Zhang, Huaiqian You, Tian Gao, Mo Yu, Chung-Hao Lee, Yue Yu

Abstract: Gradient-based meta-learning methods have primarily been applied to classical machine learning tasks such as image classification. Recently, PDE-solving deep learning methods, such as neural operators, are starting to make an important impact on learning and predicting the response of a complex physical system directly from observational data. Since the data acquisition in this context is commonly… ▽ More Gradient-based meta-learning methods have primarily been applied to classical machine learning tasks such as image classification. Recently, PDE-solving deep learning methods, such as neural operators, are starting to make an important impact on learning and predicting the response of a complex physical system directly from observational data. Since the data acquisition in this context is commonly challenging and costly, the call of utilization and transfer of existing knowledge to new and unseen physical systems is even more acute. Herein, we propose a novel meta-learning approach for neural operators, which can be seen as transferring the knowledge of solution operators between governing (unknown) PDEs with varying parameter fields. Our approach is a provably universal solution operator for multiple PDE solving tasks, with a key theoretical observation that underlying parameter fields can be captured in the first layer of neural operator models, in contrast to typical final-layer transfer in existing meta-learning methods. As applications, we demonstrate the efficacy of our proposed approach on PDE-based datasets and a real-world material modeling problem, illustrating that our method can handle complex and nonlinear physical response learning tasks while greatly improving the sampling efficiency in unseen tasks. △ Less

Submitted 3 February, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

arXiv:2212.09961 [pdf, other]

Uncertainty Quantification of MLE for Entity Ranking with Covariates

Authors: Jianqing Fan, Jikai Hou, Mengxin Yu

Abstract: This paper concerns with statistical estimation and inference for the ranking problems based on pairwise comparisons with additional covariate information such as the attributes of the compared items. Despite extensive studies, few prior literatures investigate this problem under the more realistic setting where covariate information exists. To tackle this issue, we propose a novel model, Covariat… ▽ More This paper concerns with statistical estimation and inference for the ranking problems based on pairwise comparisons with additional covariate information such as the attributes of the compared items. Despite extensive studies, few prior literatures investigate this problem under the more realistic setting where covariate information exists. To tackle this issue, we propose a novel model, Covariate-Assisted Ranking Estimation (CARE) model, that extends the well-known Bradley-Terry-Luce (BTL) model, by incorporating the covariate information. Specifically, instead of assuming every compared item has a fixed latent score $\{θ_i^*\}_{i=1}^n$, we assume the underlying scores are given by $\{α_i^*+{x}_i^\topβ^*\}_{i=1}^n$, where $α_i^*$ and ${x}_i^\topβ^*$ represent latent baseline and covariate score of the $i$-th item, respectively. We impose natural identifiability conditions and derive the $\ell_{\infty}$- and $\ell_2$-optimal rates for the maximum likelihood estimator of $\{α_i^*\}_{i=1}^{n}$ and $β^*$ under a sparse comparison graph, using a novel `leave-one-out' technique (Chen et al., 2019) . To conduct statistical inferences, we further derive asymptotic distributions for the MLE of $\{α_i^*\}_{i=1}^n$ and $β^*$ with minimal sample complexity. This allows us to answer the question whether some covariates have any explanation power for latent scores and to threshold some sparse parameters to improve the ranking performance. We improve the approximation method used in (Gao et al., 2021) for the BLT model and generalize it to the CARE model. Moreover, we validate our theoretical results through large-scale numerical studies and an application to the mutual fund stock holding dataset. △ Less

Submitted 24 March, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

Comments: 81 pages, 3 figures

arXiv:2211.11959 [pdf, ps, other]

Robust High-dimensional Tuning Free Multiple Testing

Authors: Jianqing Fan, Zhipeng Lou, Mengxin Yu

Abstract: A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-… ▽ More A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-scale problems. To liberate these constraints, this paper revisits the celebrated Hodges-Lehmann (HL) estimator for estimating location parameters in both the one- and two-sample problems, from a non-asymptotic perspective. Our study develops Berry-Esseen inequality and Cramér type moderate deviation for the HL estimator based on newly developed non-asymptotic Bahadur representation, and builds data-driven confidence intervals via a weighted bootstrap approach. These results allow us to extend the HL estimator to large-scale studies and propose \emph{tuning-free} and \emph{moment-free} high-dimensional inference procedures for testing global null and for large-scale multiple testing with false discovery proportion control. It is convincingly shown that the resulting tuning-free and moment-free methods control false discovery proportion at a prescribed level. The simulation studies lend further support to our developed theory. △ Less

Submitted 23 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

Comments: In this paper, we develop tuning-free and moment-free high dimensional inference procedures;

arXiv:2211.11957 [pdf, other]

Ranking Inferences Based on the Top Choice of Multiway Comparisons

Authors: Jianqing Fan, Zhipeng Lou, Weichen Wang, Mengxin Yu

Abstract: This paper considers ranking inference of $n$ items based on the observed data on the top choice among $M$ randomly selected items at each trial. This is a useful modification of the Plackett-Luce model for $M$-way ranking with only the top choice observed and is an extension of the celebrated Bradley-Terry-Luce model that corresponds to $M=2$. Under a uniform sampling scheme in which any $M$ dist… ▽ More This paper considers ranking inference of $n$ items based on the observed data on the top choice among $M$ randomly selected items at each trial. This is a useful modification of the Plackett-Luce model for $M$-way ranking with only the top choice observed and is an extension of the celebrated Bradley-Terry-Luce model that corresponds to $M=2$. Under a uniform sampling scheme in which any $M$ distinguished items are selected for comparisons with probability $p$ and the selected $M$ items are compared $L$ times with multinomial outcomes, we establish the statistical rates of convergence for underlying $n$ preference scores using both $\ell_2$-norm and $\ell_\infty$-norm, with the minimum sampling complexity. In addition, we establish the asymptotic normality of the maximum likelihood estimator that allows us to construct confidence intervals for the underlying scores. Furthermore, we propose a novel inference framework for ranking items through a sophisticated maximum pairwise difference statistic whose distribution is estimated via a valid Gaussian multiplier bootstrap. The estimated distribution is then used to construct simultaneous confidence intervals for the differences in the preference scores and the ranks of individual items. They also enable us to address various inference questions on the ranks of these items. Extensive simulation studies lend further support to our theoretical results. A real data application illustrates the usefulness of the proposed methods convincingly. △ Less

Submitted 5 January, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

Comments: In this paper, we build simultaneous confidence intervals for ranks through multiway comparisons

arXiv:2211.08436 [pdf, other]

doi 10.1007/s11005-023-01655-1

Gauging Noninvertible Defects: A 2-Categorical Perspective

Authors: Thibault D. Décoppet, Matthew Yu

Abstract: We generalize the notion of an anomaly for a symmetry to a noninvertible symmetry enacted by surface operators using the framework of condensation in 2-categories. Given a multifusion 2-category, potentially with some additional levels of monoidality, we prove theorems about the structure of the 2-category obtained by condensing a suitable algebra object. We give examples where the resulting categ… ▽ More We generalize the notion of an anomaly for a symmetry to a noninvertible symmetry enacted by surface operators using the framework of condensation in 2-categories. Given a multifusion 2-category, potentially with some additional levels of monoidality, we prove theorems about the structure of the 2-category obtained by condensing a suitable algebra object. We give examples where the resulting category displays grouplike fusion rules and through a cohomology computation, find the obstruction to condensing further to the vacuum theory. △ Less

Submitted 25 November, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

Comments: 26 pages, v2 a new theorem about symmetric fusion 2-categories is added

MSC Class: 18M15; 18M20; 18N10

Journal ref: Lett. Math. Phys. 113, 36 (2023)

arXiv:2210.04911 [pdf, other]

What bordism-theoretic anomaly cancellation can do for U

Authors: Arun Debray, Matthew Yu

Abstract: We perform a bordism computation to show that the $E_{7(7)}(\mathbb{R})$ U-duality symmetry of 4d $\mathcal N = 8$ supergravity could have an anomaly invisible to perturbative methods; then we show that this anomaly is trivial. We compute the relevant bordism group using the Adams and Atiyah-Hirzebruch spectral sequences, and we show the anomaly vanishes by computing $η$-invariants on the Wu manif… ▽ More We perform a bordism computation to show that the $E_{7(7)}(\mathbb{R})$ U-duality symmetry of 4d $\mathcal N = 8$ supergravity could have an anomaly invisible to perturbative methods; then we show that this anomaly is trivial. We compute the relevant bordism group using the Adams and Atiyah-Hirzebruch spectral sequences, and we show the anomaly vanishes by computing $η$-invariants on the Wu manifold, which generates the bordism group. △ Less

Submitted 10 October, 2022; originally announced October 2022.

Comments: 29 pages, 4 figures

arXiv:2208.11040 [pdf, other]

Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments

Authors: Mengxin Yu, Zhuoran Yang, Jianqing Fan

Abstract: We study offline reinforcement learning under a novel model called strategic MDP, which characterizes the strategic interactions between a principal and a sequence of myopic agents with private types. Due to the bilevel structure and private types, strategic MDP involves information asymmetry between the principal and the agents. We focus on the offline RL problem, where the goal is to learn the o… ▽ More We study offline reinforcement learning under a novel model called strategic MDP, which characterizes the strategic interactions between a principal and a sequence of myopic agents with private types. Due to the bilevel structure and private types, strategic MDP involves information asymmetry between the principal and the agents. We focus on the offline RL problem, where the goal is to learn the optimal policy of the principal concerning a target population of agents based on a pre-collected dataset that consists of historical interactions. The unobserved private types confound such a dataset as they affect both the rewards and observations received by the principal. We propose a novel algorithm, Pessimistic policy Learning with Algorithmic iNstruments (PLAN), which leverages the ideas of instrumental variable regression and the pessimism principle to learn a near-optimal principal's policy in the context of general function approximation. Our algorithm is based on the critical observation that the principal's actions serve as valid instrumental variables. In particular, under a partial coverage assumption on the offline dataset, we prove that PLAN outputs a $1 / \sqrt{K}$-optimal policy with $K$ being the number of collected trajectories. We further apply our framework to some special cases of strategic MDP, including strategic regression, strategic bandit, and noncompliance in recommendation systems. △ Less

Submitted 23 August, 2022; originally announced August 2022.

Comments: 62 pages

arXiv:2207.14321 [pdf, other]

doi 10.1007/JHEP07(2023)127

Feynman Diagrams in Four-Dimensional Holomorphic Theories and the Operatope

Authors: Kasia Budzik, Davide Gaiotto, Justin Kulp, **gxiang Wu, Matthew Yu

Abstract: We study a class of universal Feynman integrals which appear in four-dimensional holomorphic theories. We recast the integrals as the Fourier transform of a certain polytope in the space of loop momenta (aka the ``Operatope''). We derive a set of quadratic recursion relations which appear to fully determine the final answer. Our strategy can be applied to a very general class of twisted supersymme… ▽ More We study a class of universal Feynman integrals which appear in four-dimensional holomorphic theories. We recast the integrals as the Fourier transform of a certain polytope in the space of loop momenta (aka the ``Operatope''). We derive a set of quadratic recursion relations which appear to fully determine the final answer. Our strategy can be applied to a very general class of twisted supersymmetric quantum field theories. △ Less

Submitted 28 July, 2022; originally announced July 2022.

Comments: 45 pages, Mathematica code attached

arXiv:2203.01219 [pdf, other]

Are Latent Factor Regression and Sparse Regression Adequate?

Authors: Jianqing Fan, Zhipeng Lou, Mengxin Yu

Abstract: We propose the Factor Augmented sparse linear Regression Model (FARM) that not only encompasses both the latent factor regression and sparse linear regression as special cases but also bridges dimension reduction and sparse regression together. We provide theoretical guarantees for the estimation of our model under the existence of sub-Gaussian and heavy-tailed noises (with bounded (1+x)-th moment… ▽ More We propose the Factor Augmented sparse linear Regression Model (FARM) that not only encompasses both the latent factor regression and sparse linear regression as special cases but also bridges dimension reduction and sparse regression together. We provide theoretical guarantees for the estimation of our model under the existence of sub-Gaussian and heavy-tailed noises (with bounded (1+x)-th moment, for all x>0), respectively. In addition, the existing works on supervised learning often assume the latent factor regression or the sparse linear regression is the true underlying model without justifying its adequacy. To fill in such an important gap, we also leverage our model as the alternative model to test the sufficiency of the latent factor regression and the sparse linear regression models. To accomplish these goals, we propose the Factor-Adjusted de-Biased Test (FabTest) and a two-stage ANOVA type test respectively. We also conduct large-scale numerical experiments including both synthetic and FRED macroeconomics data to corroborate the theoretical properties of our methods. Numerical results illustrate the robustness and effectiveness of our model against latent factor regression and sparse linear regression models. △ Less

Submitted 2 March, 2022; originally announced March 2022.

arXiv:2110.08844 [pdf, other]

doi 10.1109/TCYB.2022.3195361

Nash Equilibrium Seeking for General Linear Systems with Disturbance Rejection

Authors: Xin Cai, Feng Xiao, Bo Wei, Mei Yu, Fang Fang

Abstract: This paper explores aggregative games in a network of general linear systems subject to external disturbances. To deal with external disturbances, distributed strategy-updating rules based on internal model are proposed for the case with perfect and imperfect information, respectively. Different from existing algorithms based on gradient dynamics, by introducing the integral of gradient of cost fu… ▽ More This paper explores aggregative games in a network of general linear systems subject to external disturbances. To deal with external disturbances, distributed strategy-updating rules based on internal model are proposed for the case with perfect and imperfect information, respectively. Different from existing algorithms based on gradient dynamics, by introducing the integral of gradient of cost functions on the basis of passive theory, the rules are proposed to force the strategies of all players to evolve to Nash equilibrium regardless the effect of disturbances. The convergence of the two strategy-updating rules is analyzed via Lyapunov stability theory, passive theory and singular perturbation theory. Simulations are presented to verify the obtained results. △ Less

Submitted 11 December, 2021; v1 submitted 17 October, 2021; originally announced October 2021.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

MSC Class: 91A10; 93A14; 93A16; 93D50

arXiv:2109.06368 [pdf, other]

Policy Optimization Using Semi-parametric Models for Dynamic Pricing

Authors: Jianqing Fan, Yongyi Guo, Mengxin Yu

Abstract: In this paper, we study the contextual dynamic pricing problem where the market value of a product is linear in its observed features plus some market noise. Products are sold one at a time, and only a binary response indicating success or failure of a sale is observed. Our model setting is similar to Javanmard and Nazerzadeh [2019] except that we expand the demand curve to a semiparametric model… ▽ More In this paper, we study the contextual dynamic pricing problem where the market value of a product is linear in its observed features plus some market noise. Products are sold one at a time, and only a binary response indicating success or failure of a sale is observed. Our model setting is similar to Javanmard and Nazerzadeh [2019] except that we expand the demand curve to a semiparametric model and need to learn dynamically both parametric and nonparametric components. We propose a dynamic statistical learning and decision-making policy that combines semiparametric estimation from a generalized linear model with an unknown link and online decision-making to minimize regret (maximize revenue). Under mild conditions, we show that for a market noise c.d.f. $F(\cdot)$ with $m$-th order derivative ($m\geq 2$), our policy achieves a regret upper bound of $\tilde{O}_{d}(T^{\frac{2m+1}{4m-1}})$, where $T$ is time horizon and $\tilde{O}_{d}$ is the order that hides logarithmic terms and the dimensionality of feature $d$. The upper bound is further reduced to $\tilde{O}_{d}(\sqrt{T})$ if $F$ is super smooth whose Fourier transform decays exponentially. In terms of dependence on the horizon $T$, these upper bounds are close to $Ω(\sqrt{T})$, the lower bound where $F$ belongs to a parametric class. We further generalize these results to the case with dynamically dependent product features under the strong mixing condition. △ Less

Submitted 3 May, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

Comments: 71 pages, Major Revision

arXiv:2109.00174 [pdf, ps, other]

The rational cuspidal subgroup of $J_0(p^2M)$ with $M$ squarefree

Authors: Jia-Wei Guo, Yifan Yang, Hwajong Yoo, Myungjun Yu

Abstract: For a positive integer $N$, let $\mathscr{C}_N(\mathbb{Q})$ be the rational cuspidal subgroup of $J_0(N)$ and $\mathscr{C}(N)$ be the rational cuspidal divisor class group of $X_0(N)$, which are both subgroups of the rational torsion subgroup of $J_0(N)$. We prove that two groups $\mathscr{C}_N(\mathbb{Q})$ and $\mathscr{C}(N)$ are equal when $N=p^2M$ for any prime $p$ and any squarefree integer… ▽ More For a positive integer $N$, let $\mathscr{C}_N(\mathbb{Q})$ be the rational cuspidal subgroup of $J_0(N)$ and $\mathscr{C}(N)$ be the rational cuspidal divisor class group of $X_0(N)$, which are both subgroups of the rational torsion subgroup of $J_0(N)$. We prove that two groups $\mathscr{C}_N(\mathbb{Q})$ and $\mathscr{C}(N)$ are equal when $N=p^2M$ for any prime $p$ and any squarefree integer $M$. To achieve this we show that all modular units on $X_0(N)$ can be written as products of certain functions $F_{m, h}$, which are constructed from generalized Dedekind eta functions. Also, we determine the necessary and sufficient conditions for such products to be modular units on $X_0(N)$ under a mild assumption. △ Less

Submitted 1 December, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

Comments: to appear in Mathematische Nachrichten

MSC Class: 11G18; 14G05; 14G35

arXiv:2104.04534 [pdf, ps, other]

doi 10.21468/SciPostPhys.13.3.068

Topological Orders in (4+1)-Dimensions

Authors: Theo Johnson-Freyd, Matthew Yu

Abstract: We investigate the Morita equivalences of (4+1)-dimensional topological orders. We show that any (4+1)-dimensional super (fermionic) topological order admits a gapped boundary condition -- in other words, all (4+1)-dimensional super topological orders are Morita trivial. As a result, there are no inherently gapless super (3+1)-dimensional theories. On the other hand, we show that there are infinit… ▽ More We investigate the Morita equivalences of (4+1)-dimensional topological orders. We show that any (4+1)-dimensional super (fermionic) topological order admits a gapped boundary condition -- in other words, all (4+1)-dimensional super topological orders are Morita trivial. As a result, there are no inherently gapless super (3+1)-dimensional theories. On the other hand, we show that there are infinitely many algebraically Morita-inequivalent bosonic (4+1)-dimensional topological orders. △ Less

Submitted 19 July, 2022; v1 submitted 9 April, 2021; originally announced April 2021.

Comments: 18 pages, 2 figures

Journal ref: SciPost Phys. 13, 068 (2022)

arXiv:2011.03179 [pdf, ps, other]

doi 10.2140/involve.2022.15.271

Filtering cohomology of ordinary and Lagrangian Grassmannians

Authors: The 2020 Polymath Jr. REU "q-binomials, the Grassmannian group", :, Huda Ahmed, Rasiel Chishti, Yu-Cheng Chiu, Galen Dorpalen-Barry, Jeremy Ellis, David Fang, Michael Feigen, Jonathan Feigert, Mabel González, Dylan Harker, Jiaye Wei, Bhavna Joshi, Gandhar Kulkarni, Kapil Lad, Zhen Liu, Ma Mingyang, Lance Myers, Arjun Nigam, Tudor Popescu, Victor Reiner, Zijian Rong, Eunice Sukarto , et al. (9 additional authors not shown)

Abstract: This paper studies, for a positive integer $m$, the subalgebra of the cohomology ring of the complex Grassmannians generated by the elements of degree at most $m$. We build in two ways upon a conjecture for the Hilbert series of this subalgebra due to Reiner and Tudose. The first reinterprets it in terms of the operation of $k$-conjugation, suggesting two conjectural bases for the subalgebras that… ▽ More This paper studies, for a positive integer $m$, the subalgebra of the cohomology ring of the complex Grassmannians generated by the elements of degree at most $m$. We build in two ways upon a conjecture for the Hilbert series of this subalgebra due to Reiner and Tudose. The first reinterprets it in terms of the operation of $k$-conjugation, suggesting two conjectural bases for the subalgebras that would imply their conjecture. The second introduces an analogous conjecture for the cohomology of Lagrangian Grassmannians. △ Less

Submitted 12 September, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

Comments: Version to appear in Involve

MSC Class: 05E14; 05E05; 14N15

Journal ref: Involve 15 (2022) 271-288

arXiv:2010.13904 [pdf, ps, other]

Relative Contrast Estimation and Inference for Treatment Recommendation

Authors: Muxuan Liang, Menggang Yu

Abstract: When there are resource constraints, it is important to rank or estimate treatment benefits according to patient characteristics. This facilitates prioritization of assigning different treatments. Most existing literature on individualized treatment rules targets absolute conditional treatment effect differences as the metric for benefits. However, there can be settings where relative differences… ▽ More When there are resource constraints, it is important to rank or estimate treatment benefits according to patient characteristics. This facilitates prioritization of assigning different treatments. Most existing literature on individualized treatment rules targets absolute conditional treatment effect differences as the metric for benefits. However, there can be settings where relative differences may better represent such benefits. In this paper, we consider modeling such relative differences that form scale-invariant contrasts between conditional treatment effects. We show that all scale-invariant contrasts are monotonic transformations of each other. Therefore we posit a single index model for a particular relative contrast. Identifiability of the model is enforced via an intuitive $l_2$ norm constraint on index parameters. We then derive estimating equations and efficient scores via semiparametric efficiency theory. Based on the efficient score and its variant, we propose a two-step approach that consists of minimizing a doubly robust loss function and a subsequent one-step efficiency augmentation procedure to achieve efficiency bound. Careful theoretical and numerical studies are provided to show the superiority of the proposed approach. △ Less

Submitted 3 May, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

Comments: 19 pages, 3 figures

arXiv:2010.07950 [pdf, ps, other]

doi 10.1017/S0004972721000095

Fusion 2-categories with no line operators are grouplike

Authors: Theo Johnson-Freyd, Matthew Yu

Abstract: We show that if $\mathcal{C}$ is a fusion $2$-category in which the endomorphism category of the unit object is $\rm{Vec}$ or $\rm{SVec}$, then the indecomposable objects of $\mathcal{C}$ form a finite group. We show that if $\mathcal{C}$ is a fusion $2$-category in which the endomorphism category of the unit object is $\rm{Vec}$ or $\rm{SVec}$, then the indecomposable objects of $\mathcal{C}$ form a finite group. △ Less

Submitted 15 October, 2020; originally announced October 2020.

Comments: 7 pages, 5 figures

Journal ref: Bull. Aust. Math. Soc. 104 (2021) 434-442

arXiv:2010.04346 [pdf, ps, other]

doi 10.1063/5.0032539

State and parameter estimation from exact partial state observation in stochastic reaction networks

Authors: Muruhan Rathinam, Mingkai Yu

Abstract: We consider chemical reaction networks modeled by a discrete state and continuous in time Markov process for the vector copy number of the species and provide a novel particle filter method for state and parameter estimation based on exact observation of some of the species in continuous time. The conditional probability distribution of the unobserved states is shown to satisfy a system of differe… ▽ More We consider chemical reaction networks modeled by a discrete state and continuous in time Markov process for the vector copy number of the species and provide a novel particle filter method for state and parameter estimation based on exact observation of some of the species in continuous time. The conditional probability distribution of the unobserved states is shown to satisfy a system of differential equations with jumps. We provide a method of simulating a process that is a proxy for the vector copy number of the unobserved species along with a weight. The resulting weighted Monte Carlo simulation is then used to compute the conditional probability distribution of the unobserved species. We also show how our algorithm can be adapted for a Bayesian estimation of parameters and for the estimation of a past state value based on observations up to a future time. △ Less

Submitted 9 December, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

arXiv:2007.08322 [pdf, other]

Understanding Implicit Regularization in Over-Parameterized Single Index Model

Authors: Jianqing Fan, Zhuoran Yang, Mengxin Yu

Abstract: In this paper, we leverage over-parameterization to design regularization-free algorithms for the high-dimensional single index model and provide theoretical guarantees for the induced implicit regularization phenomenon. Specifically, we study both vector and matrix single index models where the link function is nonlinear and unknown, the signal parameter is either a sparse vector or a low-rank sy… ▽ More In this paper, we leverage over-parameterization to design regularization-free algorithms for the high-dimensional single index model and provide theoretical guarantees for the induced implicit regularization phenomenon. Specifically, we study both vector and matrix single index models where the link function is nonlinear and unknown, the signal parameter is either a sparse vector or a low-rank symmetric matrix, and the response variable can be heavy-tailed. To gain a better understanding of the role played by implicit regularization without excess technicality, we assume that the distribution of the covariates is known a priori. For both the vector and matrix settings, we construct an over-parameterized least-squares loss function by employing the score function transform and a robust truncation step designed specifically for heavy-tailed data. We propose to estimate the true parameter by applying regularization-free gradient descent to the loss function. When the initialization is close to the origin and the stepsize is sufficiently small, we prove that the obtained solution achieves minimax optimal statistical rates of convergence in both the vector and matrix cases. In addition, our experimental results support our theoretical findings and also demonstrate that our methods empirically outperform classical methods with explicit regularization in terms of both $\ell_2$-statistical rate and variable selection consistency. △ Less

Submitted 15 November, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

Comments: major revision

arXiv:2005.07846 [pdf, ps, other]

Jordan--Landau theorem for matrices over finite fields

Authors: Gilyoung Cheong, Jungin Lee, Hayan Nam, Myungjun Yu

Abstract: Given a positive integer $r$ and a prime power $q$, we estimate the probability that the characteristic polynomial $f_{A}(t)$ of a random matrix $A$ in $\mathrm{GL}_{n}(\mathbb{F}_{q})$ is square-free with $r$ (monic) irreducible factors when $n$ is large. We also estimate the analogous probability that $f_{A}(t)$ has $r$ irreducible factors counting with multiplicity. In either case, the main ter… ▽ More Given a positive integer $r$ and a prime power $q$, we estimate the probability that the characteristic polynomial $f_{A}(t)$ of a random matrix $A$ in $\mathrm{GL}_{n}(\mathbb{F}_{q})$ is square-free with $r$ (monic) irreducible factors when $n$ is large. We also estimate the analogous probability that $f_{A}(t)$ has $r$ irreducible factors counting with multiplicity. In either case, the main term $(\log n)^{r-1}((r-1)!n)^{-1}$ and the error term $O((\log n)^{r-2}n^{-1})$, whose implied constant only depends on $r$ but not on $q$ nor $n$, coincide with the probability that a random permutation on $n$ letters is a product of $r$ disjoint cycles. The main ingredient of our proof is a recursion argument due to S. D. Cohen, which was previously used to estimate the probability that a random degree $n$ monic polynomial in $\mathbb{F}_{q}[t]$ is square-free with $r$ irreducible factors and the analogous probability that the polynomial has $r$ irreducible factors counting with multiplicity. We obtain our result by carefully modifying Cohen's recursion argument in the matrix setting, using Reiner's theorem that counts the number of $n \times n$ matrices with a fixed characteristic polynomial over $\mathbb{F}_{q}$. △ Less

Submitted 8 September, 2022; v1 submitted 15 May, 2020; originally announced May 2020.

Comments: 19 pages. A conjecture in the previous draft has been resolved and its proof is included, and another author has been added

arXiv:2005.00194 [pdf, ps, other]

Bounds for 2-Selmer ranks in terms of seminarrow class groups

Authors: Hwajong Yoo, Myungjun Yu

Abstract: Let $E$ be an elliptic curve over a number field $K$ defined by a monic irreducible cubic polynomial $F(x)$. When $E$ is \textit{nice} at all finite primes of $K$, we bound its $2$-Selmer rank in terms of the $2$-rank of a modified ideal class group of the field $L=K[x]/{(F(x))}$, which we call the \textit{semi-narrow class group} of $L$. We then provide several sufficient conditions for $E$ being… ▽ More Let $E$ be an elliptic curve over a number field $K$ defined by a monic irreducible cubic polynomial $F(x)$. When $E$ is \textit{nice} at all finite primes of $K$, we bound its $2$-Selmer rank in terms of the $2$-rank of a modified ideal class group of the field $L=K[x]/{(F(x))}$, which we call the \textit{semi-narrow class group} of $L$. We then provide several sufficient conditions for $E$ being nice at a finite prime. As an application, when $K$ is a real quadratic field, $E/K$ is semistable and the discriminant of $F$ is totally negative, then we frequently determine the $2$-Selmer rank of $E$ by computing the root number of $E$ and the $2$-rank of the narrow class group of $L$. △ Less

Submitted 4 December, 2022; v1 submitted 30 April, 2020; originally announced May 2020.

Comments: To appear in Pacific Journal of Mathematics

MSC Class: 11G05; 14G05

arXiv:2002.08856 [pdf, ps, other]

Bounding the expected run-time of nonconvex optimization with early stop**

Authors: Thomas Flynn, Kwang Min Yu, Abid Malik, Nicolas D'Imperio, Shinjae Yoo

Abstract: This work examines the convergence of stochastic gradient-based optimization algorithms that use early stop** based on a validation function. The form of early stop** we consider is that optimization terminates when the norm of the gradient of a validation function falls below a threshold. We derive conditions that guarantee this stop** rule is well-defined, and provide bounds on the expecte… ▽ More This work examines the convergence of stochastic gradient-based optimization algorithms that use early stop** based on a validation function. The form of early stop** we consider is that optimization terminates when the norm of the gradient of a validation function falls below a threshold. We derive conditions that guarantee this stop** rule is well-defined, and provide bounds on the expected number of iterations and gradient evaluations needed to meet this criterion. The guarantee accounts for the distance between the training and validation sets, measured with the Wasserstein distance. We develop the approach in the general setting of a first-order optimization algorithm, with possibly biased update directions subject to a geometric drift condition. We then derive bounds on the expected running time for early stop** variants of several algorithms, including stochastic gradient descent (SGD), decentralized SGD (DSGD), and the stochastic variance reduced gradient (SVRG) algorithm. Finally, we consider the generalization properties of the iterate returned by early stop**. △ Less

Submitted 22 July, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

Comments: Camera ready version for UAI 2020

arXiv:1909.07079 [pdf, other]

Fast Large-Scale Discrete Optimization Based on Principal Coordinate Descent

Authors: Huan Xiong, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, Ling Shao

Abstract: Binary optimization, a representative subclass of discrete optimization, plays an important role in mathematical optimization and has various applications in computer vision and machine learning. Usually, binary optimization problems are NP-hard and difficult to solve due to the binary constraints, especially when the number of variables is very large. Existing methods often suffer from high compu… ▽ More Binary optimization, a representative subclass of discrete optimization, plays an important role in mathematical optimization and has various applications in computer vision and machine learning. Usually, binary optimization problems are NP-hard and difficult to solve due to the binary constraints, especially when the number of variables is very large. Existing methods often suffer from high computational costs or large accumulated quantization errors, or are only designed for specific tasks. In this paper, we propose a fast algorithm to find effective approximate solutions for general binary optimization problems. The proposed algorithm iteratively solves minimization problems related to the linear surrogates of loss functions, which leads to the updating of some binary variables most impacting the value of loss functions in each step. Our method supports a wide class of empirical objective functions with/without restrictions on the numbers of $1$s and $-1$s in the binary variables. Furthermore, the theoretical convergence of our algorithm is proven, and the explicit convergence rates are derived, for objective functions with Lipschitz continuous gradients, which are commonly adopted in practice. Extensive experiments on several binary optimization tasks and large-scale datasets demonstrate the superiority of the proposed algorithm over several state-of-the-art methods in terms of both effectiveness and efficiency. △ Less

Submitted 15 May, 2021; v1 submitted 16 September, 2019; originally announced September 2019.

Comments: 14 pages

arXiv:1904.03372 [pdf, other]

A robust bootstrap change point test for high-dimensional location parameter

Authors: Mengjia Yu, Xiaohui Chen

Abstract: We consider the problem of change point detection for high-dimensional distributions in a location family when the dimension can be much larger than the sample size. In change point analysis, the widely used cumulative sum (CUSUM) statistics are sensitive to outliers and heavy-tailed distributions. In this paper, we propose a robust, tuning-free (i.e., fully data-dependent), and easy-to-implement… ▽ More We consider the problem of change point detection for high-dimensional distributions in a location family when the dimension can be much larger than the sample size. In change point analysis, the widely used cumulative sum (CUSUM) statistics are sensitive to outliers and heavy-tailed distributions. In this paper, we propose a robust, tuning-free (i.e., fully data-dependent), and easy-to-implement change point test that enjoys strong theoretical guarantees. To achieve the robust purpose in a nonparametric setting, we formulate the change point detection in the multivariate $U$-statistics framework with anti-symmetric and nonlinear kernels. Specifically, the within-sample noise is canceled out by anti-symmetry of the kernel, while the signal distortion under certain nonlinear kernels can be controlled such that the between-sample change point signal is magnitude preserving. A (half) jackknife multiplier bootstrap (JMB) tailored to the change point detection setting is proposed to calibrate the distribution of our $\ell^{\infty}$-norm aggregated test statistic. Subject to mild moment conditions on kernels, we derive the uniform rates of convergence for the JMB to approximate the sampling distribution of the test statistic, and analyze its size and power properties. Extensions to multiple change point testing and estimation are discussed with illustration from numerical studies. △ Less

Submitted 13 October, 2021; v1 submitted 6 April, 2019; originally announced April 2019.

MSC Class: 62F40; 62G35; 62E17

arXiv:1903.05274 [pdf, other]

Forecasting Spatio-Temporal Renewable Scenarios: a Deep Generative Approach

Authors: Congmei Jiang, Yize Chen, Yongfang Mao, Yi Chai, Mingbiao Yu

Abstract: The operation and planning of large-scale power systems are becoming more challenging with the increasing penetration of stochastic renewable generation. In order to minimize the decision risks in power systems with large amount of renewable resources, there is a growing need to model the short-term generation uncertainty. By producing a group of possible future realizations for certain set of ren… ▽ More The operation and planning of large-scale power systems are becoming more challenging with the increasing penetration of stochastic renewable generation. In order to minimize the decision risks in power systems with large amount of renewable resources, there is a growing need to model the short-term generation uncertainty. By producing a group of possible future realizations for certain set of renewable generation plants, scenario approach has become one popular way for renewables uncertainty modeling. However, due to the complex spatial and temporal correlations underlying in renewable generations, traditional model-based approaches for forecasting future scenarios often require extensive knowledge, while fitted models are often hard to scale. To address such modeling burdens, we propose a learning-based, data-driven scenario forecasts method based on generative adversarial networks (GANs), which is a class of deep-learning generative algorithms used for modeling unknown distributions. We firstly utilize an improved GANs with convergence guarantees to learn the intrinsic patterns and model the unknown distributions of (multiple-site) renewable generation time-series. Then by solving an optimization problem, we are able to generate forecasted scenarios without any scenario number and forecasting horizon restrictions. Our method is totally model-free, and could forecast scenarios under different level of forecast uncertainties. Extensive numerical simulations using real-world data from NREL wind and solar integration datasets validate the performance of proposed method in forecasting both wind and solar power scenarios. △ Less

Submitted 12 March, 2019; originally announced March 2019.

arXiv:1801.10536 [pdf, ps, other]

Large Shafarevich-Tate groups over quadratic number fields

Authors: Myungjun Yu

Abstract: Let $E$ be an elliptic curve over the rational field $\mathbf{Q}$. Let $K$ be a quadratic extension over $\mathbf{Q}$. Let $\mathrm{ST}(E/K)$ dente the Shafarevich-Tate group of $E$ over $K$. We show that (under mild conditions on $E$) for every $r>0$, there are infinitely many quadratic twists $E^d/\mathbf{Q}$ of $E/\mathbf{Q}$ such that $\mathrm{dim}_{\mathbf{F}_2}(\mathrm{ST}(E^d/K)[2]) > r$ Let $E$ be an elliptic curve over the rational field $\mathbf{Q}$. Let $K$ be a quadratic extension over $\mathbf{Q}$. Let $\mathrm{ST}(E/K)$ dente the Shafarevich-Tate group of $E$ over $K$. We show that (under mild conditions on $E$) for every $r>0$, there are infinitely many quadratic twists $E^d/\mathbf{Q}$ of $E/\mathbf{Q}$ such that $\mathrm{dim}_{\mathbf{F}_2}(\mathrm{ST}(E^d/K)[2]) > r$ △ Less

Submitted 31 January, 2018; originally announced January 2018.

arXiv:1711.08747 [pdf, other]

doi 10.1111/rssb.12406

Finite sample change point inference and identification for high-dimensional mean vectors

Authors: Mengjia Yu, Xiaohui Chen

Abstract: Cumulative sum (CUSUM) statistics are widely used in the change point inference and identification. For the problem of testing for existence of a change point in an independent sample generated from the mean-shift model, we introduce a Gaussian multiplier bootstrap to calibrate critical values of the CUSUM test statistics in high dimensions. The proposed bootstrap CUSUM test is fully data-dependen… ▽ More Cumulative sum (CUSUM) statistics are widely used in the change point inference and identification. For the problem of testing for existence of a change point in an independent sample generated from the mean-shift model, we introduce a Gaussian multiplier bootstrap to calibrate critical values of the CUSUM test statistics in high dimensions. The proposed bootstrap CUSUM test is fully data-dependent and it has strong theoretical guarantees under arbitrary dependence structures and mild moment conditions. Specifically, we show that with a boundary removal parameter the bootstrap CUSUM test enjoys the uniform validity in size under the null and it achieves the minimax separation rate under the sparse alternatives when the dimension $p$ can be larger than the sample size $n$. Once a change point is detected, we estimate the change point location by maximizing the $\ell^{\infty}$-norm of the generalized CUSUM statistics at two different weighting scales corresponding to covariance stationary and non-stationary CUSUM statistics. For both estimators, we derive their rates of convergence and show that dimension impacts the rates only through logarithmic factors, which implies that consistency of the CUSUM estimators is possible when $p$ is much larger than $n$. In the presence of multiple change points, we propose a principled bootstrap-assisted binary segmentation (BABS) algorithm to dynamically adjust the change point detection rule and recursively estimate their locations. We derive its rate of convergence under suitable signal separation and strength conditions. The results derived in this paper are non-asymptotic and we provide extensive simulation studies to assess the finite sample performance. The empirical evidence shows an encouraging agreement with our theoretical results. △ Less

Submitted 2 January, 2021; v1 submitted 23 November, 2017; originally announced November 2017.

MSC Class: 62F40; 62E17; 60F05; 37M10

arXiv:1711.01469 [pdf, other]

Johnson's bijections and their application to counting simultaneous core partitions

Authors: **eon Baek, Hayan Nam, Myungjun Yu

Abstract: Johnson recently proved Armstrong's conjecture which states that the average size of an $(a,b)$-core partition is $(a+b+1)(a-1)(b-1)/24$. He used various coordinate changes and one-to-one correspondences that are useful for counting problems about simultaneous core partitions. We give an expression for the number of $(b_1,b_2,\cdots, b_n)$-core partitions where $\{b_1,b_2,\cdots,b_n\}$ contains at… ▽ More Johnson recently proved Armstrong's conjecture which states that the average size of an $(a,b)$-core partition is $(a+b+1)(a-1)(b-1)/24$. He used various coordinate changes and one-to-one correspondences that are useful for counting problems about simultaneous core partitions. We give an expression for the number of $(b_1,b_2,\cdots, b_n)$-core partitions where $\{b_1,b_2,\cdots,b_n\}$ contains at least one pair of relatively prime numbers. We also evaluate the largest size of a self-conjugate $(s,s+1,s+2)$-core partition. △ Less

Submitted 4 November, 2017; originally announced November 2017.

arXiv:1705.02691 [pdf, other]

A bijective proof of Amdeberhan's conjecture on the number of $(s, s+2)$-core partitions with distinct parts

Authors: **eon Baek, Hayan Nam, Myungjun Yu

Abstract: Amdeberhan conjectured that the number of $(s,s+2)$-core partitions with distinct parts for an odd integer $s$ is $2^{s-1}$. This conjecture was first proved by Yan, Qin, ** and Zhou, then subsequently by Zaleski and Zeilberger. Since the formula for the number of such core partitions is so simple one can hope for a bijective proof. We give the first direct bijective proof of this fact by establi… ▽ More Amdeberhan conjectured that the number of $(s,s+2)$-core partitions with distinct parts for an odd integer $s$ is $2^{s-1}$. This conjecture was first proved by Yan, Qin, ** and Zhou, then subsequently by Zaleski and Zeilberger. Since the formula for the number of such core partitions is so simple one can hope for a bijective proof. We give the first direct bijective proof of this fact by establishing a bijection between the set of $(s, s+2)$-core partitions with distinct parts and a set of lattice paths. △ Less

Submitted 9 May, 2017; v1 submitted 7 May, 2017; originally announced May 2017.

Comments: 9 pages

arXiv:1612.03068 [pdf, other]

Generalized Algorithm for Wythoff's Game with Basis Vector $(2^b,2^b)$

Authors: Shubham Aggarwal, Jared Geller, Shuvom Sadhuka, Max Yu

Abstract: Wythoff's Game is a variation of Nim in which players may take an equal number of stones from each pile or make valid Nim moves. W. A. Wythoff proved that the set of P-Positions (losing position), $C$, for Wythoff's Game is given by $C := \left\{ (\lfloor kφ\rfloor, \lfloor kφ^2 \rfloor), (\lfloor kφ^2 \rfloor, \lfloor kφ\rfloor) : k \in \mathbb Z_{\geq 0} \right\}$. An open Wythoff problem remain… ▽ More Wythoff's Game is a variation of Nim in which players may take an equal number of stones from each pile or make valid Nim moves. W. A. Wythoff proved that the set of P-Positions (losing position), $C$, for Wythoff's Game is given by $C := \left\{ (\lfloor kφ\rfloor, \lfloor kφ^2 \rfloor), (\lfloor kφ^2 \rfloor, \lfloor kφ\rfloor) : k \in \mathbb Z_{\geq 0} \right\}$. An open Wythoff problem remains where players make the valid Nim moves or remove $kb$ stones from each pile, where $b$ is a fixed integer. We denote this as the $(b,b)$ game. For example, regular Wythoff's Game is just the $(1,1)$ game. In 2009, Duch${ê}$ne and Gravier proved an algorithm to generate the set of P-Positions for the $(2,2)$ game by exploiting the periodic nature of the differences of stones between the two piles modulo $4$. We observe similar cyclic behaviour for any $b$, where $b$ is a power of $2$, modulo $b^2$, and construct an algorithm to generate the set of P-Positions for this game. Let $a$ be a power of $2$. We prove our algorithm works by first showing that it holds for the first $a^2$ terms in the $(a,a)$ game. Next, we construct an ordered multiset for the $(2a,2a)$ game from the $a^2$ terms, and an inductive proof follows. Moreover, we conjecture that all cyclic games require $a$ to be a power of $2$, suggesting that there is no similar structure in the generalised $(b,b)$ game where $b$ isn't a power of $2$. Future directions for generalising this result would likely utilise numeration systems, particularly the PV numbers. △ Less

Submitted 15 February, 2017; v1 submitted 9 December, 2016; originally announced December 2016.

arXiv:1611.09133 [pdf, ps, other]

A Liouville Theorem for a Class of Fractional Systems in $\mathbb{R}^n_+$

Authors: Lizhi Zhang, Mei Yu, Jianming He

Abstract: Let $0<α,β<2$ be any real number. In this paper, we investigate the following semilinear system involving the fractional Laplacian \begin{equation*} \left\{\begin{array}{lll} (-\lap)^{α/2} u(x)=f(v(x)), & (-\lap)^{β/2} v(x)=g(u(x)), & \qquad x\in\mathbb{R}^n_+, u,v\geq0, & \qquad x\in\mathbb{R}^n\setminus\mathbb{R}^n_+. \end{array}\right. \end{equation*} Applying a direct method of moving planes f… ▽ More Let $0<α,β<2$ be any real number. In this paper, we investigate the following semilinear system involving the fractional Laplacian \begin{equation*} \left\{\begin{array}{lll} (-\lap)^{α/2} u(x)=f(v(x)), & (-\lap)^{β/2} v(x)=g(u(x)), & \qquad x\in\mathbb{R}^n_+, u,v\geq0, & \qquad x\in\mathbb{R}^n\setminus\mathbb{R}^n_+. \end{array}\right. \end{equation*} Applying a direct method of moving planes for the fractional Laplacian, without any decay assumption on the solutions at infinity, we prove Liouville theorems of nonnegative solutions under some natural conditions on $f$ and $g$. △ Less

Submitted 24 January, 2017; v1 submitted 28 November, 2016; originally announced November 2016.

arXiv:1610.01195 [pdf, ps, other]

2-Selmer near-companion curves

Authors: Myungjun Yu

Abstract: Let $E$ and $A$ be elliptic curves over a number field $K$. Let $χ$ be a quadratic character of $K$. We prove the conjecture posed by Mazur and Rubin on $n$-Selmer near-companion curves in the case $n=2$. Namely, we show if the difference of the $2$-Selmer ranks of $E^χ$ and $A^χ$ is bounded independent of $χ$, there is a $G_K$-isomorphism $E[2] \cong A[2]$. Let $E$ and $A$ be elliptic curves over a number field $K$. Let $χ$ be a quadratic character of $K$. We prove the conjecture posed by Mazur and Rubin on $n$-Selmer near-companion curves in the case $n=2$. Namely, we show if the difference of the $2$-Selmer ranks of $E^χ$ and $A^χ$ is bounded independent of $χ$, there is a $G_K$-isomorphism $E[2] \cong A[2]$. △ Less

Submitted 8 November, 2016; v1 submitted 4 October, 2016; originally announced October 2016.

Comments: 16 pages

arXiv:1608.08371 [pdf, ps, other]

Solutions of Fully Nonlinear Nonlocal Systems

Authors: Pengyan Wang, Mei Yu

Abstract: In this paper we consider the system involving fully nonlinear nonlocal operators: $$ \left\{ \begin{array}{ll} F_α(u(x)) = C_{n,α} PV \int_{{R}^n} \frac{G(u(x)-u(y))}{|x-y|^{n+α}} dy=f(v(x)), F_β(v(x)) = C_{n,β} PV \int_{{R}^n} \frac{G(v(x)-v(y))}{|x-y|^{n+β}} dy=g(u(x)). \end{array} \right. $$ A \textit{narrow region principle} and a \textit{decay at infinity} for the system for carrying on th… ▽ More In this paper we consider the system involving fully nonlinear nonlocal operators: $$ \left\{ \begin{array}{ll} F_α(u(x)) = C_{n,α} PV \int_{{R}^n} \frac{G(u(x)-u(y))}{|x-y|^{n+α}} dy=f(v(x)), F_β(v(x)) = C_{n,β} PV \int_{{R}^n} \frac{G(v(x)-v(y))}{|x-y|^{n+β}} dy=g(u(x)). \end{array} \right. $$ A \textit{narrow region principle} and a \textit{decay at infinity} for the system for carrying on the method of moving planes are established. Then we prove the radial symmetry and monotonicity for positive solutions to the nonlinear system in the whole space. Non-existence of positive solutions to the nonlinear system on a half space is proved. △ Less

Submitted 30 August, 2016; originally announced August 2016.

Comments: arXiv admin note: text overlap with arXiv:1604.04806 by other authors

arXiv:1605.02254 [pdf, ps, other]

doi 10.1007/s00208-015-1262-4

Slopes for higher rank Artin-Schreier-Witt Towers

Authors: Rufei Ren, Daqing Wan, Liang Xiao, Myungjun Yu

Abstract: We fix a monic polynomial $\bar f(x) \in \mathbb{F}_q[x]$ over a finite field of characteristic $p$, and consider the $\mathbb{Z}_{p^{\ell}}$-Artin-Schreier-Witt tower defined by $\bar f(x)$; this is a tower of curves $\cdots \to C_m \to C_{m-1} \to \cdots \to C_0 =\mathbb{A}^1$, whose Galois group is canonically isomorphic to $\mathbb{Z}_{p^\ell}$, the degree $\ell$ unramified extension of… ▽ More We fix a monic polynomial $\bar f(x) \in \mathbb{F}_q[x]$ over a finite field of characteristic $p$, and consider the $\mathbb{Z}_{p^{\ell}}$-Artin-Schreier-Witt tower defined by $\bar f(x)$; this is a tower of curves $\cdots \to C_m \to C_{m-1} \to \cdots \to C_0 =\mathbb{A}^1$, whose Galois group is canonically isomorphic to $\mathbb{Z}_{p^\ell}$, the degree $\ell$ unramified extension of $\mathbb{Z}_p$, which is abstractly isomorphic to $(\mathbb{Z}_p)^\ell$ as a topological group. We study the Newton slopes of zeta functions of this tower of curves. This reduces to the study of the Newton slopes of L-functions associated to characters of the Galois group of this tower. We prove that, when the conductor of the character is large enough, the Newton slopes of the L-function asymptotically form a finite union of arithmetic progressions. As a corollary, we prove the spectral halo property of the spectral variety associated to the $\mathbb{Z}_{p^{\ell}}$-Artin-Schreier-Witt tower. This extends the main result in [DWX] from rank one case $\ell=1$ to the higher rank case $\ell\geq 1$. △ Less

Submitted 1 January, 2017; v1 submitted 7 May, 2016; originally announced May 2016.

Comments: 20 pages

arXiv:1602.01463 [pdf, other]

Interactions between discontinuities for binary mixture separation problem and hodograph method

Authors: M. S. Elaeva, E. V. Shiryaeva, Zhukov M. Yu

Abstract: The Cauchy problem for first-order PDE with the initial data which have a piecewise discontinuities localized in different spatial points is completely solved. The interactions between discontinuities arising after breakup of initial discontinuities are studied with the help of the hodograph method. The solution is constructed in analytical implicit form. To recovery the explicit form of solution… ▽ More The Cauchy problem for first-order PDE with the initial data which have a piecewise discontinuities localized in different spatial points is completely solved. The interactions between discontinuities arising after breakup of initial discontinuities are studied with the help of the hodograph method. The solution is constructed in analytical implicit form. To recovery the explicit form of solution we propose the transformation of the PDEs into some ODEs on the level lines (isochrones) of implicit solution. In particular, this method allows us to solve the Goursat problem with initial data on characteristics. The paper describes a specific problem for zone electrophoresis (method of the mixture separation). However, the method proposed allows to solve any system of two first-order quasilinear PDEs for which the second order linear PDE, arising after the hodograph transformation, has the Riemann-Green function in explicit form. △ Less

Submitted 3 February, 2016; originally announced February 2016.

Comments: 19 pages, 11 figures

MSC Class: 35Lxx; 35L67; 35L40; 35L45; 35L50; 35L65

arXiv:1511.07512 [pdf, ps, other]

On 2-Selmer ranks of quadratic twists of elliptic curves

Authors: Myungjun Yu

Abstract: We study the $2$-Selmer ranks of elliptic curves. We prove that for an arbitrary elliptic curve $E$ over an arbitrary number field $K$, if the set $A_E$ of 2-Selmer ranks of quadratic twists of $E$ contains an integer $c$, it contains all integers larger than $c$ and having the same parity as $c$. We also find sufficient conditions on $A_E$ such that $A_E$ is equal to $\Z_{\ge t_E}$ for some numbe… ▽ More We study the $2$-Selmer ranks of elliptic curves. We prove that for an arbitrary elliptic curve $E$ over an arbitrary number field $K$, if the set $A_E$ of 2-Selmer ranks of quadratic twists of $E$ contains an integer $c$, it contains all integers larger than $c$ and having the same parity as $c$. We also find sufficient conditions on $A_E$ such that $A_E$ is equal to $\Z_{\ge t_E}$ for some number $t_E$. When all points in $E[2]$ are rational, we give an upper bound for $t_E$. △ Less

Submitted 27 January, 2016; v1 submitted 23 November, 2015; originally announced November 2015.

Comments: 13 pages

arXiv:1511.07511 [pdf, ps, other]

Selmer Ranks of twists of hyperelliptic curves and superelliptic curves

Authors: Myungjun Yu

Abstract: We study the variation of Selmer ranks of Jacobians of twists of hyperelliptic curves and superelliptic curves. We find sufficient conditions for such curves to have infinitely many twists whose Jacobians have Selmer ranks equal to $r$, for any given nonnegative integer $r$. This generalizes earlier results of Mazur-Rubin on elliptic curves. We study the variation of Selmer ranks of Jacobians of twists of hyperelliptic curves and superelliptic curves. We find sufficient conditions for such curves to have infinitely many twists whose Jacobians have Selmer ranks equal to $r$, for any given nonnegative integer $r$. This generalizes earlier results of Mazur-Rubin on elliptic curves. △ Less

Submitted 23 November, 2015; originally announced November 2015.

Comments: 30 pages, to appear in J. Number Theory

arXiv:1510.02862 [pdf, other]

doi 10.1080/02331888.2023.2278034

Moderate deviations for the mildly stationary autoregressive models with dependent errors

Authors: Hui Jiang, Guangyu Yang, Mingming Yu

Abstract: In this paper, we consider the normalized least squares estimator of the parameter in a mildly stationary first-order autoregressive (AR(1)) model with dependent errors which are modeled as a mildly stationary AR(1) process. By martingale methods, we establish the moderate deviations for the least squares estimators of the regressor and error, which can be applied to understand the near-integrated… ▽ More In this paper, we consider the normalized least squares estimator of the parameter in a mildly stationary first-order autoregressive (AR(1)) model with dependent errors which are modeled as a mildly stationary AR(1) process. By martingale methods, we establish the moderate deviations for the least squares estimators of the regressor and error, which can be applied to understand the near-integrated second order autoregressive processes. As an application, we also obtain the moderate deviations for the Durbin-Watson statistic. △ Less

Submitted 7 November, 2023; v1 submitted 9 October, 2015; originally announced October 2015.

Comments: 31 pages,8 figures, to be published by Statistics

arXiv:1506.07437 [pdf, ps, other]

doi 10.2140/involve.2018.11.243

The Truncated & Supplemented Pascal Matrix and Applications

Authors: M. Hua, S. B. Damelin, J. Sun, M. Yu

Abstract: In this paper, we introduce the $k\times n$ (with $k\leq n$) truncated, supplemented Pascal matrix which has the property that any $k$ columns form a linearly independent set. This property is also present in Reed-Solomon codes; however, Reed-Solomon codes are completely dense, whereas the truncated, supplemented Pascal matrix has multiple zeros. If the maximal-distance separable code conjecture i… ▽ More In this paper, we introduce the $k\times n$ (with $k\leq n$) truncated, supplemented Pascal matrix which has the property that any $k$ columns form a linearly independent set. This property is also present in Reed-Solomon codes; however, Reed-Solomon codes are completely dense, whereas the truncated, supplemented Pascal matrix has multiple zeros. If the maximal-distance separable code conjecture is correct, then our matrix has the maximal number of columns (with the aformentioned property) that the conjecture allows. This matrix has applications in coding, network coding, and matroid theory. △ Less

Submitted 5 February, 2016; v1 submitted 24 June, 2015; originally announced June 2015.

MSC Class: 05B20; 05B35

Journal ref: Involve 11 (2018) 243-251

arXiv:1409.8571 [pdf, ps, other]

Asymptotic distributions related to mildly-explosive second order autoregressive models

Authors: Hui Jiang, Mingming Yu, Guangyu Yang

Abstract: In this paper, we consider the normalized least squares estimator of the parameter in a mildly-explosive first-order autoregressive model with dependent errors which are modeled as a mildly-explosive AR(1) process. We prove that the estimator has a Cauchy limit law which provides a bridge between moderate deviation asymptotics and the earlier results on the local to unity and explosive autoregress… ▽ More In this paper, we consider the normalized least squares estimator of the parameter in a mildly-explosive first-order autoregressive model with dependent errors which are modeled as a mildly-explosive AR(1) process. We prove that the estimator has a Cauchy limit law which provides a bridge between moderate deviation asymptotics and the earlier results on the local to unity and explosive autoregressive models. In particular, the results can be applied to understand the near-integrated second order autoregressive processes. Simulation studies are also carried out to assess the performance of least squares estimation in finite samples. △ Less

Submitted 30 September, 2014; originally announced September 2014.

Comments: 27pages,10 figures

arXiv:0908.3135 [pdf, ps, other]

doi 10.1214/08-AOS657

Asymptotic theory for the semiparametric accelerated failure time model with missing data

Authors: Bin Nan, John D. Kalbfleisch, Menggang Yu

Abstract: We consider a class of doubly weighted rank-based estimating methods for the transformation (or accelerated failure time) model with missing data as arise, for example, in case-cohort studies. The weights considered may not be predictable as required in a martingale stochastic process formulation. We treat the general problem as a semiparametric estimating equation problem and provide proofs of… ▽ More We consider a class of doubly weighted rank-based estimating methods for the transformation (or accelerated failure time) model with missing data as arise, for example, in case-cohort studies. The weights considered may not be predictable as required in a martingale stochastic process formulation. We treat the general problem as a semiparametric estimating equation problem and provide proofs of asymptotic properties for the weighted estimators, with either true weights or estimated weights, by using empirical process theory where martingale theory may fail. Simulations show that the outcome-dependent weighted method works well for finite samples in case-cohort studies and improves efficiency compared to methods based on predictable weights. Further, it is seen that the method is even more efficient when estimated weights are used, as is commonly the case in the missing data literature. The Gehan censored data Wilcoxon weights are found to be surprisingly efficient in a wide class of problems. △ Less

Submitted 21 August, 2009; originally announced August 2009.

Comments: Published in at http://dx.doi.org/10.1214/08-AOS657 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS657 MSC Class: 62E20; 62N01 (Primary) 62D05 (Secondary)

Journal ref: Annals of Statistics 2009, Vol. 37, No. 5A, 2351-2376

arXiv:hep-th/9304122 [pdf, ps, other]

Modules Over Affine Lie Superalgebras

Authors: Jiang-Bei Fan, Ming Yu

Abstract: Modules over affine Lie superalgebras ${\cal G}$ are studied, in particular, for ${\cal G}=\hat{OSP(1,2)}$. It is shown that on studying Verma modules, much of the results in Kac-Moody algebra can be generalized to the super case. Of most importance are the generalized Kac-Kazhdan formula and the Malikov-Feigin-Fuchs construction, which give the weights and the explicit form of the singular vect… ▽ More Modules over affine Lie superalgebras ${\cal G}$ are studied, in particular, for ${\cal G}=\hat{OSP(1,2)}$. It is shown that on studying Verma modules, much of the results in Kac-Moody algebra can be generalized to the super case. Of most importance are the generalized Kac-Kazhdan formula and the Malikov-Feigin-Fuchs construction, which give the weights and the explicit form of the singular vectors in the Verma module over affine Kac-Moody superalgebras. We have also considered the decomposition of the admissible representation of $\hat{OSP(1,2)}$ into that of $\hat{SL(2)}\otimes$Virasoro algebra, through which we get the modular transformations on the torus and the fusion rules. Different boundary conditions on the torus correspond to the different modings of the current superalgebra and characters or super-characters, which might be relevant to the Hamiltonian reduction resulting in Neveu-Schwarz or Ramond superconformal algebras. Finally, the Felder BRST complex, which consists of Wakimoto modules by the free field realization, is constructed. △ Less

Submitted 26 April, 1993; originally announced April 1993.

Comments: 35 pages

Report number: ASITP-93-14

Showing 1–49 of 49 results for author: Yu, M