Search | arXiv e-print repository

Spatially Randomized Designs Can Enhance Policy Evaluation

Authors: Ying Yang, Chengchun Shi, Fang Yao, Shouyang Wang, Hongtu Zhu

Abstract: This article studies the benefits of using spatially randomized experimental designs which partition the experimental area into distinct, non-overlap** units with treatments assigned randomly. Such designs offer improved policy evaluation in online experiments by providing more precise policy value estimators and more effective A/B testing algorithms than traditional global designs, which apply… ▽ More This article studies the benefits of using spatially randomized experimental designs which partition the experimental area into distinct, non-overlap** units with treatments assigned randomly. Such designs offer improved policy evaluation in online experiments by providing more precise policy value estimators and more effective A/B testing algorithms than traditional global designs, which apply the same treatment across all units simultaneously. We examine both parametric and nonparametric methods for estimating and inferring policy values based on this randomized approach. Our analysis includes evaluating the mean squared error of the treatment effect estimator and the statistical power of the associated tests. Additionally, we extend our findings to experiments with spatio-temporal dependencies, where treatments are allocated sequentially over time, and account for potential temporal carryover effects. Our theoretical insights are supported by comprehensive numerical experiments. △ Less

Submitted 17 March, 2024; originally announced March 2024.

arXiv:2403.04815 [pdf, ps, other]

The Smoluchowski-Kramers approximation for a McKean-Vlasov equation subject to environmental noise with state-dependent friction

Authors: Chungang Shi, Yan Lv, Wei Wang

Abstract: The small mass limit is derived for a McKean-Vlasov equation subject to environmental noise with state-dependent friction. By applying the averaging approach to a non-autonomous stochastic slow-fast system with the microscopic and macroscopic scales, the convergence in distribution is obtained. The small mass limit is derived for a McKean-Vlasov equation subject to environmental noise with state-dependent friction. By applying the averaging approach to a non-autonomous stochastic slow-fast system with the microscopic and macroscopic scales, the convergence in distribution is obtained. △ Less

Submitted 7 March, 2024; originally announced March 2024.

Comments: arXiv admin note: text overlap with arXiv:2403.04275

arXiv:2403.04323 [pdf, ps, other]

Second-order McKean-Vlasov stochastic evolution equation driven by Poisson jumps: existence, uniqueness and averaging principle

Authors: Chungang Shi

Abstract: In the paper, a class of second-order McKean-Vlasov stochastic evolution equation driven by Poisson jumps with non-Lipschitz conditions is considered. The existence and uniqueness of the mild solution is established by means of the Carath${\rm \acute{e}}$odory approximation technique. Furthermore, an averaging principle is obtained between the solution of the second-order McKean-Vlasov stochastic… ▽ More In the paper, a class of second-order McKean-Vlasov stochastic evolution equation driven by Poisson jumps with non-Lipschitz conditions is considered. The existence and uniqueness of the mild solution is established by means of the Carath${\rm \acute{e}}$odory approximation technique. Furthermore, an averaging principle is obtained between the solution of the second-order McKean-Vlasov stochastic evolution equation and that of the simplified equation in mean square sense. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2403.04275 [pdf, ps, other]

The small mass limit for a McKean-Vlasov equation with state-dependent friction

Authors: Chungang Shi, Mengmeng Wang, Yan Lv, Wei Wang

Abstract: The small mass limit is derived for a McKean-Vlasov equation with state-dependent friction in $d$-dimensional space. By applying the averaging approach to a non-autonomous slow-fast system with the microscopic and macroscopic scales, the convergence in distribution is obtained. The small mass limit is derived for a McKean-Vlasov equation with state-dependent friction in $d$-dimensional space. By applying the averaging approach to a non-autonomous slow-fast system with the microscopic and macroscopic scales, the convergence in distribution is obtained. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2403.04237 [pdf, ps, other]

The Smoluchowski-Kramers approximation with distribution-dependent potential and highly oscillating force

Authors: Chungang Shi, Wei Wang

Abstract: An approximation is derived for a Langevin equation with distribution-dependent potential and state-dependent, randomly fast oscillation. By some estimates and a diffusion approximation the limiting equation is shown to be distribution-dependent stochastic differential equation (SDEs) driven by white noise. An approximation is derived for a Langevin equation with distribution-dependent potential and state-dependent, randomly fast oscillation. By some estimates and a diffusion approximation the limiting equation is shown to be distribution-dependent stochastic differential equation (SDEs) driven by white noise. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2403.03877 [pdf, ps, other]

Convergence rate of the Smoluchowski-Kramers approximation for diffusions with jumps

Authors: Chungang Shi

Abstract: In the paper, the Kolmogorov distance is used to study the Smoluchowski-Kramers approximation for diffusions with jumps. The convergence rate is derived by Malliavin calculus. In the paper, the Kolmogorov distance is used to study the Smoluchowski-Kramers approximation for diffusions with jumps. The convergence rate is derived by Malliavin calculus. △ Less

Submitted 6 March, 2024; originally announced March 2024.

arXiv:2401.10710 [pdf, other]

Classification with neural networks with quadratic decision functions

Authors: Leon Frischauf, Otmar Scherzer, Cong Shi

Abstract: Neural networks with quadratic decision functions have been introduced as alternatives to standard neural networks with affine linear ones. They are advantageous when the objects or classes to be identified are compact and of basic geometries like circles, ellipses etc. In this paper we investigate the use of such ansatz functions for classification. In particular we test and compare the algorithm… ▽ More Neural networks with quadratic decision functions have been introduced as alternatives to standard neural networks with affine linear ones. They are advantageous when the objects or classes to be identified are compact and of basic geometries like circles, ellipses etc. In this paper we investigate the use of such ansatz functions for classification. In particular we test and compare the algorithm on the MNIST dataset for classification of handwritten digits and for classification of subspecies. We also show, that the implementation can be based on the neural network structure in the software Tensorflow and Keras, respectively. △ Less

Submitted 25 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

MSC Class: 49N45; 41A30; 65XX; 68TXX

arXiv:2401.09445 [pdf, other]

Quadratic neural networks for solving inverse problems

Authors: Leon Frischauf, Otmar Scherzer, Cong Shi

Abstract: In this paper we investigate the solution of inverse problems with neural network ansatz functions with generalized decision functions. The relevant observation for this work is that such functions can approximate typical test cases, such as the Shepp-Logan phantom, better, than standard neural networks. Moreover, we show that the convergence analysis of numerical methods for solving inverse probl… ▽ More In this paper we investigate the solution of inverse problems with neural network ansatz functions with generalized decision functions. The relevant observation for this work is that such functions can approximate typical test cases, such as the Shepp-Logan phantom, better, than standard neural networks. Moreover, we show that the convergence analysis of numerical methods for solving inverse problems with shallow generalized neural network functions leads to more intuitive convergence conditions, than for deep affine linear neural networks. △ Less

Submitted 20 December, 2023; originally announced January 2024.

Comments: arXiv admin note: text overlap with arXiv:2110.01536

MSC Class: 65J22; 45Q05; 42C40; 65F20

arXiv:2401.05517 [pdf, other]

On Efficient Inference of Causal Effects with Multiple Mediators

Authors: Haoyu Wei, Hengrui Cai, Chengchun Shi, Rui Song

Abstract: This paper provides robust estimators and efficient inference of causal effects involving multiple interacting mediators. Most existing works either impose a linear model assumption among the mediators or are restricted to handle conditionally independent mediators given the exposure. To overcome these limitations, we define causal and individual mediation effects in a general setting, and employ… ▽ More This paper provides robust estimators and efficient inference of causal effects involving multiple interacting mediators. Most existing works either impose a linear model assumption among the mediators or are restricted to handle conditionally independent mediators given the exposure. To overcome these limitations, we define causal and individual mediation effects in a general setting, and employ a semiparametric framework to develop quadruply robust estimators for these causal effects. We further establish the asymptotic normality of the proposed estimators and prove their local semiparametric efficiencies. The proposed method is empirically validated via simulated and real datasets concerning psychiatric disorders in trauma survivors. △ Less

Submitted 10 January, 2024; originally announced January 2024.

MSC Class: 62A09; 62G05; 62G35

arXiv:2305.13856 [pdf, ps, other]

On the Optimal Batch Size for Byzantine-Robust Distributed Learning

Authors: Yi-Rui Yang, Chang-Wei Shi, Wu-Jun Li

Abstract: Byzantine-robust distributed learning (BRDL), in which computing devices are likely to behave abnormally due to accidental failures or malicious attacks, has recently become a hot research topic. However, even in the independent and identically distributed (i.i.d.) case, existing BRDL methods will suffer from a significant drop on model accuracy due to the large variance of stochastic gradients. I… ▽ More Byzantine-robust distributed learning (BRDL), in which computing devices are likely to behave abnormally due to accidental failures or malicious attacks, has recently become a hot research topic. However, even in the independent and identically distributed (i.i.d.) case, existing BRDL methods will suffer from a significant drop on model accuracy due to the large variance of stochastic gradients. Increasing batch sizes is a simple yet effective way to reduce the variance. However, when the total number of gradient computation is fixed, a too-large batch size will lead to a too-small iteration number (update number), which may also degrade the model accuracy. In view of this challenge, we mainly study the optimal batch size when the total number of gradient computation is fixed in this work. In particular, we theoretically and empirically show that when the total number of gradient computation is fixed, the optimal batch size in BRDL increases with the fraction of Byzantine workers. Therefore, compared to the case without attacks, the batch size should be set larger when under Byzantine attacks. However, for existing BRDL methods, large batch sizes will lead to a drop on model accuracy, even if there is no Byzantine attack. To deal with this problem, we propose a novel BRDL method, called Byzantine-robust stochastic gradient descent with normalized momentum (ByzSGDnm), which can alleviate the drop on model accuracy in large-batch cases. Moreover, we theoretically prove the convergence of ByzSGDnm for general non-convex cases under Byzantine attacks. Empirical results show that ByzSGDnm has a comparable performance to existing BRDL methods under bit-flip** failure, but can outperform existing BRDL methods under deliberately crafted attacks. △ Less

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2302.03821 [pdf, other]

PASTA: Pessimistic Assortment Optimization

Authors: Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan X. Fang, Vahid Tarokh

Abstract: We consider a class of assortment optimization problems in an offline data-driven setting. A firm does not know the underlying customer choice model but has access to an offline dataset consisting of the historically offered assortment set, customer choice, and revenue. The objective is to use the offline dataset to find an optimal assortment. Due to the combinatorial nature of assortment optimiza… ▽ More We consider a class of assortment optimization problems in an offline data-driven setting. A firm does not know the underlying customer choice model but has access to an offline dataset consisting of the historically offered assortment set, customer choice, and revenue. The objective is to use the offline dataset to find an optimal assortment. Due to the combinatorial nature of assortment optimization, the problem of insufficient data coverage is likely to occur in the offline dataset. Therefore, designing a provably efficient offline learning algorithm becomes a significant challenge. To this end, we propose an algorithm referred to as Pessimistic ASsortment opTimizAtion (PASTA for short) designed based on the principle of pessimism, that can correctly identify the optimal assortment by only requiring the offline data to cover the optimal assortment under general settings. In particular, we establish a regret bound for the offline assortment optimization problem under the celebrated multinomial logit model. We also propose an efficient computational procedure to solve our pessimistic assortment optimization problem. Numerical studies demonstrate the superiority of the proposed method over the existing baseline method. △ Less

Submitted 7 February, 2023; originally announced February 2023.

arXiv:2301.05416 [pdf, other]

Two spectral extremal results for graphs with given order and rank

Authors: Xiuqing Li, Xian'an **, Chao Shi, Ruiling Zheng

Abstract: The spectral radius and rank of a graph are defined to be the spectral radius and rank of its adjacency matrix, respectively. It is an important problem in spectral extremal graph theory to determine the extremal graph that has the maximum or minimum spectral radius over certain families of graphs. Monsalve and Rada [Extremal spectral radius of graphs with rank 4, Linear Algebra Appl. 609 (2021) 1… ▽ More The spectral radius and rank of a graph are defined to be the spectral radius and rank of its adjacency matrix, respectively. It is an important problem in spectral extremal graph theory to determine the extremal graph that has the maximum or minimum spectral radius over certain families of graphs. Monsalve and Rada [Extremal spectral radius of graphs with rank 4, Linear Algebra Appl. 609 (2021) 1-11] obtained the extremal graphs with maximum and minimum spectral radii among all graphs with order n and rank 4. In this paper, we first determine the extremal graph which attains the maximum spectral radius among all graphs with any given order n and rank r, and further determine the extremal graph which attains the minimum spectral radius among all graphs with order n and rank 5. △ Less

Submitted 13 January, 2023; originally announced January 2023.

Comments: 26 pages, 3 figures

MSC Class: 05C35 ACM Class: G.2.2

arXiv:2212.06355 [pdf, ps, other]

A Review of Off-Policy Evaluation in Reinforcement Learning

Authors: Masatoshi Uehara, Chengchun Shi, Nathan Kallus

Abstract: Reinforcement learning (RL) is one of the most vibrant research frontiers in machine learning and has been recently applied to solve a number of challenging problems. In this paper, we primarily focus on off-policy evaluation (OPE), one of the most fundamental topics in RL. In recent years, a number of OPE methods have been developed in the statistics and computer science literature. We provide a… ▽ More Reinforcement learning (RL) is one of the most vibrant research frontiers in machine learning and has been recently applied to solve a number of challenging problems. In this paper, we primarily focus on off-policy evaluation (OPE), one of the most fundamental topics in RL. In recent years, a number of OPE methods have been developed in the statistics and computer science literature. We provide a discussion on the efficiency bound of OPE, some of the existing state-of-the-art OPE methods, their statistical properties and some other related research directions that are currently actively explored. △ Less

Submitted 12 December, 2022; originally announced December 2022.

Comments: Still under revision

arXiv:2209.15448 [pdf, other]

Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments

Authors: Jiayi Wang, Zhengling Qi, Chengchun Shi

Abstract: As AI becomes more prevalent throughout society, effective methods of integrating humans and AI systems that leverage their respective strengths and mitigate risk have become an important priority. In this paper, we introduce the paradigm of super reinforcement learning that takes advantage of Human-AI interaction for data driven sequential decision making. This approach utilizes the observed acti… ▽ More As AI becomes more prevalent throughout society, effective methods of integrating humans and AI systems that leverage their respective strengths and mitigate risk have become an important priority. In this paper, we introduce the paradigm of super reinforcement learning that takes advantage of Human-AI interaction for data driven sequential decision making. This approach utilizes the observed action, either from AI or humans, as input for achieving a stronger oracle in policy learning for the decision maker (humans or AI). In the decision process with unmeasured confounding, the actions taken by past agents can offer valuable insights into undisclosed information. By including this information for the policy search in a novel and legitimate manner, the proposed super reinforcement learning will yield a super-policy that is guaranteed to outperform both the standard optimal policy and the behavior one (e.g., past agents' actions). We call this stronger oracle a blessing from human-AI interaction. Furthermore, to address the issue of unmeasured confounding in finding super-policies using the batch data, a number of nonparametric and causal identifications are established. Building upon on these novel identification results, we develop several super-policy learning algorithms and systematically study their theoretical properties such as finite-sample regret guarantee. Finally, we illustrate the effectiveness of our proposal through extensive simulations and real-world applications. △ Less

Submitted 20 October, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

arXiv:2207.13701 [pdf, other]

Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning

Authors: Zeren Huang, Wenhao Chen, Weinan Zhang, Chuhan Shi, Furui Liu, Hui-Ling Zhen, Mingxuan Yuan, Jianye Hao, Yong Yu, Jun Wang

Abstract: Deriving a good variable selection strategy in branch-and-bound is essential for the efficiency of modern mixed-integer programming (MIP) solvers. With MIP branching data collected during the previous solution process, learning to branch methods have recently become superior over heuristics. As branch-and-bound is naturally a sequential decision making task, one should learn to optimize the utilit… ▽ More Deriving a good variable selection strategy in branch-and-bound is essential for the efficiency of modern mixed-integer programming (MIP) solvers. With MIP branching data collected during the previous solution process, learning to branch methods have recently become superior over heuristics. As branch-and-bound is naturally a sequential decision making task, one should learn to optimize the utility of the whole MIP solving process instead of being myopic on each step. In this work, we formulate learning to branch as an offline reinforcement learning (RL) problem, and propose a long-sighted hybrid search scheme to construct the offline MIP dataset, which values the long-term utilities of branching decisions. During the policy training phase, we deploy a ranking-based reward assignment scheme to distinguish the promising samples from the long-term or short-term view, and train the branching model named Branch Ranking via offline policy learning. Experiments on synthetic MIP benchmarks and real-world tasks demonstrate that Branch Rankink is more efficient and robust, and can better generalize to large scales of MIP instances compared to the widely used heuristics and state-of-the-art learning-based branching models. △ Less

Submitted 26 July, 2022; originally announced July 2022.

arXiv:2204.04373 [pdf, other]

Tight toughness, isolated toughness and binding number bounds for the $\{K_2,C_n\}$-factors

Authors: Xiaxia Guan, Tianlong Ma, Chao Shi

Abstract: The $\{K_2,C_n\}$-factor of a graph is a spanning subgraph whose each component is either $K_2$ or $C_n$. In this paper, a sufficient condition with regard to tight toughness, isolated toughness and binding number bounds to guarantee the existence of the $\{K_2,C_{2i+1}| i\geq 2 \}$-factor for any graph is obtained, which answers a problem due to Gao and Wang (J. Oper. Res. Soc. China (2021), http… ▽ More The $\{K_2,C_n\}$-factor of a graph is a spanning subgraph whose each component is either $K_2$ or $C_n$. In this paper, a sufficient condition with regard to tight toughness, isolated toughness and binding number bounds to guarantee the existence of the $\{K_2,C_{2i+1}| i\geq 2 \}$-factor for any graph is obtained, which answers a problem due to Gao and Wang (J. Oper. Res. Soc. China (2021), https://doi.org/10.1007/s40305-021-00357-6). △ Less

Submitted 8 April, 2022; originally announced April 2022.

arXiv:2201.04429 [pdf, other]

Constraint Learning to Define Trust Regions in Predictive-Model Embedded Optimization

Authors: Chenbo Shi, Mohsen Emadikhiav, Leonardo Lozano, David Bergman

Abstract: There is a recent proliferation of research on the integration of machine learning and optimization. One expansive area within this research stream is predictive-model embedded optimization, which proposes the use of pre-trained predictive models as surrogates for uncertain or highly complex objective functions. In this setting, features of the predictive models become decision variables in the op… ▽ More There is a recent proliferation of research on the integration of machine learning and optimization. One expansive area within this research stream is predictive-model embedded optimization, which proposes the use of pre-trained predictive models as surrogates for uncertain or highly complex objective functions. In this setting, features of the predictive models become decision variables in the optimization problem. Despite a recent surge in publications in this area, only a few papers note the importance of incorporating trust region considerations in this decision-making pipeline, i.e., enforcing solutions to be similar to the data used to train the predictive models. Without such constraints, the evaluation of the predictive model at solutions obtained from optimization cannot be trusted and the practicality of the solutions may be unreasonable. In this paper, we provide an overview of the approaches appearing in the literature to construct a trust region, and propose three alternative approaches. Our numerical evaluation highlights that trust-region constraints learned through isolation forests, one of the newly proposed approaches, outperform all previously suggested approaches, both in terms of solution quality and computational time. △ Less

Submitted 19 October, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

arXiv:2111.08885 [pdf, other]

Jump Interval-Learning for Individualized Decision Making

Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

Abstract: An individualized decision rule (IDR) is a decision function that assigns each individual a given treatment based on his/her observed characteristics. Most of the existing works in the literature consider settings with binary or finitely many treatment options. In this paper, we focus on the continuous treatment setting and propose a jump interval-learning to develop an individualized interval-val… ▽ More An individualized decision rule (IDR) is a decision function that assigns each individual a given treatment based on his/her observed characteristics. Most of the existing works in the literature consider settings with binary or finitely many treatment options. In this paper, we focus on the continuous treatment setting and propose a jump interval-learning to develop an individualized interval-valued decision rule (I2DR) that maximizes the expected outcome. Unlike IDRs that recommend a single treatment, the proposed I2DR yields an interval of treatment options for each individual, making it more flexible to implement in practice. To derive an optimal I2DR, our jump interval-learning method estimates the conditional mean of the outcome given the treatment and the covariates via jump penalized regression, and derives the corresponding optimal I2DR based on the estimated outcome regression function. The regressor is allowed to be either linear for clear interpretation or deep neural network to model complex treatment-covariates interactions. To implement jump interval-learning, we develop a searching algorithm based on dynamic programming that efficiently computes the outcome regression function. Statistical properties of the resulting I2DR are established when the outcome regression function is either a piecewise or continuous function over the treatment space. We further develop a procedure to infer the mean outcome under the (estimated) optimal policy. Extensive simulations and a real data application to a warfarin study are conducted to demonstrate the empirical validity of the proposed I2DR. △ Less

Submitted 28 January, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

arXiv:2111.03908 [pdf, other]

An Online Sequential Test for Qualitative Treatment Effects

Authors: Chengchun Shi, Shikai Luo, Hongtu Zhu, Rui Song

Abstract: Tech companies (e.g., Google or Facebook) often use randomized online experiments and/or A/B testing primarily based on the average treatment effects to compare their new product with an old one. However, it is also critically important to detect qualitative treatment effects such that the new one may significantly outperform the existing one only under some specific circumstances. The aim of this… ▽ More Tech companies (e.g., Google or Facebook) often use randomized online experiments and/or A/B testing primarily based on the average treatment effects to compare their new product with an old one. However, it is also critically important to detect qualitative treatment effects such that the new one may significantly outperform the existing one only under some specific circumstances. The aim of this paper is to develop a powerful testing procedure to efficiently detect such qualitative treatment effects. We propose a scalable online updating algorithm to implement our test procedure. It has three novelties including adaptive randomization, sequential monitoring, and online updating with guaranteed type-I error control. We also thoroughly examine the theoretical properties of our testing procedure including the limiting distribution of test statistics and the justification of an efficient bootstrap method. Extensive empirical studies are conducted to examine the finite sample performance of our test procedure. △ Less

Submitted 6 November, 2021; originally announced November 2021.

arXiv:2110.01536 [pdf, other]

Universal approximation properties of shallow quadratic neural networks

Authors: Leon Frischauf, Otmar Scherzer, Cong Shi

Abstract: In this paper we study shallow neural network functions which are linear combinations of compositions of activation and quadratic functions, replacing standard affine linear functions, often called neurons. We show the universality of this approximation and prove convergence rates results based on the theory of wavelets and statistical learning. We show for simple test cases that this ansatz requi… ▽ More In this paper we study shallow neural network functions which are linear combinations of compositions of activation and quadratic functions, replacing standard affine linear functions, often called neurons. We show the universality of this approximation and prove convergence rates results based on the theory of wavelets and statistical learning. We show for simple test cases that this ansatz requires a smaller numbers of neurons than standard affine linear neural networks. Moreover, we investigate the efficiency of this approach for clustering tasks with the MNIST data set. Similar observations are made when comparing deep (multi-layer) networks. △ Less

Submitted 11 May, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

MSC Class: 41A30; 65XX; 68TXX

arXiv:2009.09396 [pdf, ps, other]

On non-empty cross-intersecting families

Authors: Chao Shi, Peter Frankl, Jianguo Qian

Abstract: Let $2^{[n]}$ and $\binom{[n]}{i}$ be the power set and the class of all $i$-subsets of $\{1,2,\cdots,n\}$, respectively. We call two families $\mathscr{A}$ and $\mathscr{B}$ cross-intersecting if $A\cap B\neq \emptyset$ for any $A\in \mathscr{A}$ and $B\in \mathscr{B}$. In this paper we show that, for $n\geq k+l,l\geq r\geq 1,c>0$ and… ▽ More Let $2^{[n]}$ and $\binom{[n]}{i}$ be the power set and the class of all $i$-subsets of $\{1,2,\cdots,n\}$, respectively. We call two families $\mathscr{A}$ and $\mathscr{B}$ cross-intersecting if $A\cap B\neq \emptyset$ for any $A\in \mathscr{A}$ and $B\in \mathscr{B}$. In this paper we show that, for $n\geq k+l,l\geq r\geq 1,c>0$ and $\mathscr{A}\subseteq \binom{[n]}{k},\mathscr{B}\subseteq \binom{[n]}{l}$, if $\mathscr{A}$ and $\mathscr{B}$ are cross-intersecting and $\binom{n-r}{l-r}\leq|\mathscr{B}|\leq \binom{n-1}{l-1}$, then $$|\mathscr{A}|+c|\mathscr{B}|\leq \max\left\{\binom{n}{k}-\binom{n-r}{k}+c\binom{n-r}{l-r},\ \binom{n-1}{k-1}+c\binom{n-1}{l-1}\right\}$$ and the families $\mathscr{A}$ and $\mathscr{B}$ attaining the upper bound are also characterized. This generalizes the corresponding result of Hilton and Milner for $c=1$ and $r=k=l$, and implies a result of Tokushige and the second author (Theorem 1.3). △ Less

Submitted 7 October, 2020; v1 submitted 20 September, 2020; originally announced September 2020.

Comments: 12 pages

MSC Class: 05D05

arXiv:2005.06134 [pdf, other]

Exponential stability for time-delay neural networks via new weighted integral inequalities

Authors: Seakweng Vong, Kachon Hoi, Chenyang Shi

Abstract: We study exponential stability for a kind of neural networks having time-varying delay. By extending the auxiliary function-based integral inequality, a novel integral inequality is derived by using weighted orthogonal functions of which one is discontinuous. Then, the new inequality is applied to investigate the exponential stability of time-delay neural networks via Lyapunov-Krasovskii functiona… ▽ More We study exponential stability for a kind of neural networks having time-varying delay. By extending the auxiliary function-based integral inequality, a novel integral inequality is derived by using weighted orthogonal functions of which one is discontinuous. Then, the new inequality is applied to investigate the exponential stability of time-delay neural networks via Lyapunov-Krasovskii functional (LKF) method. Numerical examples are given to verify the advantages of the proposed criterion. △ Less

Submitted 12 May, 2020; originally announced May 2020.

Comments: 22 pages, 2 figures

MSC Class: 13P25 ACM Class: G.1.0

arXiv:2001.11712 [pdf, ps, other]

doi 10.3390/math8050831

Locating arrays with mixed alphabet sizes

Authors: Ce Shi, Hao **, Tatsuhiro Tsuchiya

Abstract: Locating arrays (LAs) can be used to detect and identify interaction faults among factors in a component-based system. The optimality and constructions of LAs with a single fault have been investigated extensively under the assumption that all the factors have the same values. However, in real life, different factors in a system have different numbers of possible values. Thus, it is necessary for… ▽ More Locating arrays (LAs) can be used to detect and identify interaction faults among factors in a component-based system. The optimality and constructions of LAs with a single fault have been investigated extensively under the assumption that all the factors have the same values. However, in real life, different factors in a system have different numbers of possible values. Thus, it is necessary for LAs to satisfy such requirements. We herein establish a general lower bound on the size of mixed-level $(\bar{1},t)$-locating arrays. Some methods for constructing LAs including direct and recursive constructions are provided. In particular, constructions that produce optimal LAs satisfying the lower bound are described. Additionally, some series of optimal LAs satisfying the lower bound are presented. △ Less

Submitted 31 January, 2020; originally announced January 2020.

MSC Class: 05B15; 05B20; 94C12; 62K15

arXiv:1909.00611 [pdf, ps, other]

Two deformed Pascal's triangles and its new properties

Authors: Jishe Feng, Cunqin Shi, Huani Zhao

Abstract: In this paper, firstly, by a determinant of deformed Pascal's triangle, namely the normalized Hessenberg matrix determinant, to count Dyck paths, we give another combinatorial proof of the theorems which are of Catalan numbers determinant representations and the recurrence formula. Secondly, a determinant of normalized Toeplitz-Hessenberg matrix, whose entries are binomials, arising in power serie… ▽ More In this paper, firstly, by a determinant of deformed Pascal's triangle, namely the normalized Hessenberg matrix determinant, to count Dyck paths, we give another combinatorial proof of the theorems which are of Catalan numbers determinant representations and the recurrence formula. Secondly, a determinant of normalized Toeplitz-Hessenberg matrix, whose entries are binomials, arising in power series, we derive new four properties of Pascal's triangle. △ Less

Submitted 26 September, 2020; v1 submitted 2 September, 2019; originally announced September 2019.

Comments: 7 pages

MSC Class: 05A15; 11B65

arXiv:1907.03438 [pdf, ps, other]

doi 10.1088/1361-6420/ab539a

Density Matrix Reconstructions in Ultrafast Transmission Electron Microscopy: Uniqueness, Stability, and Convergence Rates

Authors: Cong Shi, Claus Ropers, Thorsten Hohage

Abstract: In the recent paper [17] the first experimental determination of the density matrix of a free electron beam has been reported. The employed method leads to a linear inverse problem with a positive semidefinite operator as unknown. The purpose of this paper is to complement the experimental and algorithmic results in the work mentioned above by a mathematical analysis of the inverse problem concern… ▽ More In the recent paper [17] the first experimental determination of the density matrix of a free electron beam has been reported. The employed method leads to a linear inverse problem with a positive semidefinite operator as unknown. The purpose of this paper is to complement the experimental and algorithmic results in the work mentioned above by a mathematical analysis of the inverse problem concerning uniqueness, stability, and rates of convergence under different types of a-priori information. △ Less

Submitted 9 July, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

MSC Class: 35R30; 47A52

arXiv:1905.10914 [pdf, ps, other]

doi 10.1007/s00373-020-02176-7

Consecutive Detecting Arrays for Interaction Faults

Authors: Ce Shi, Ling Jiang, Aiyuan Tao

Abstract: The concept of detecting arrays was developed to locate and detect interaction faults arising between the factors in a component-based system during software testing. In this paper, we propose a family of consecutive detecting arrays (CDAs) in which the interactions between factors are considered to be ordered. CDAs can be used to generate test suites for locating and detecting interaction faults… ▽ More The concept of detecting arrays was developed to locate and detect interaction faults arising between the factors in a component-based system during software testing. In this paper, we propose a family of consecutive detecting arrays (CDAs) in which the interactions between factors are considered to be ordered. CDAs can be used to generate test suites for locating and detecting interaction faults between neighboring factors. We establish a general criterion for measuring the optimality of CDAs in terms of their size. Based on this optimality criterion, the equivalence between optimum CDAs and consecutive orthogonal arrays with prescribed properties is explored. Using the advantages of this equivalence, a great number of optimum CDAs are presented. In particular, the existence of optimum CDAs with few factors is almost completely determined. △ Less

Submitted 25 January, 2024; v1 submitted 26 May, 2019; originally announced May 2019.

MSC Class: 05B15; 05B20; 62K15; 94C12

Journal ref: Graphs and Combinatorics 36, 1203-1218 (2020)

arXiv:1711.06333 [pdf, other]

Generation of unstructured meshes in 2-D, 3-D, and spherical geometries with embedded high resolution sub-regions

Authors: J. M. Taramón, J. P. Morgan, C. Shi, J. Hasenclever

Abstract: We present 2-D, 3-D, and spherical mesh generators for the Finite Element Method (FEM) using triangular and tetrahedral elements. The mesh nodes are treated as if they were linked by virtual springs that obey Hooke's law. Given the desired length for the springs, the FEM is used to solve for the optimal nodal positions for the static equilibrium of this spring system. A 'guide-mesh' approach allow… ▽ More We present 2-D, 3-D, and spherical mesh generators for the Finite Element Method (FEM) using triangular and tetrahedral elements. The mesh nodes are treated as if they were linked by virtual springs that obey Hooke's law. Given the desired length for the springs, the FEM is used to solve for the optimal nodal positions for the static equilibrium of this spring system. A 'guide-mesh' approach allows the user to create embedded high resolution sub-regions within a coarser mesh. The method converges rapidly. For example, in 3-D, the algorithm is able to refine a specific region within an unstructured tetrahedral spherical shell so that the edge-length factor $l_{0r}/l_{0c} = 1/33$ within a few iterations, where $l_{0r}$ and $l_{0c}$ are the desired spring length for elements inside the refined and coarse regions respectively. One use for this type of mesh is to model regional problems as a fine region within a global mesh that has no fictitious boundaries, at only a small additional computational cost. The algorithm also includes routines to locally improve the quality of the mesh and to avoid badly shaped 'slivers-like' tetrahedra. △ Less

Submitted 14 November, 2017; originally announced November 2017.

Comments: 20 pages + supplement, submitted to SIAM J. Sci. Comp

arXiv:1705.07466 [pdf, other]

doi 10.1088/1361-6420/aa9ade

Reconstruction formulas for Photoacoustic Imaging in Attenuating Media

Authors: Otmar Scherzer, Cong Shi

Abstract: In this paper we study the problem of photoacoustic inversion in a weakly attenuating medium. We present explicit reconstruction formulas in such media and show that the inversion based on such formulas is moderately ill--posed. Moreover, we present a numerical algorithm for imaging and demonstrate in numerical experiments the feasibility of this approach. In this paper we study the problem of photoacoustic inversion in a weakly attenuating medium. We present explicit reconstruction formulas in such media and show that the inversion based on such formulas is moderately ill--posed. Moreover, we present a numerical algorithm for imaging and demonstrate in numerical experiments the feasibility of this approach. △ Less

Submitted 21 May, 2017; originally announced May 2017.

MSC Class: 35R30 65R32 92C55 46F12

arXiv:1611.05807 [pdf, other]

Singular Values of the Attenuated Photoacoustic Imaging Operator

Authors: Peter Elbau, Otmar Scherzer, Cong Shi

Abstract: We analyse the ill-posedness of the photoacoustic imaging problem in the case of an attenuating medium. To this end, we introduce an attenuated photoacoustic operator and determine the asymptotic behaviour of its singular values. Dividing the known attenuation models into strong and weak attenuation classes, we show that for strong attenuation, the singular values of the attenuated photoacoustic o… ▽ More We analyse the ill-posedness of the photoacoustic imaging problem in the case of an attenuating medium. To this end, we introduce an attenuated photoacoustic operator and determine the asymptotic behaviour of its singular values. Dividing the known attenuation models into strong and weak attenuation classes, we show that for strong attenuation, the singular values of the attenuated photoacoustic operator decay exponentially, and in the weak attenuation case the singular values of the attenuated photoacoustic operator decay with the same rate as the singular values of the non-attenuated photoacoustic operator. △ Less

Submitted 17 November, 2016; originally announced November 2016.

MSC Class: 35S05; 35P15; 92C55; 47G10

arXiv:1608.04382 [pdf, other]

A signal separation technique for sub-cellular imaging using dynamic optical coherence tomography

Authors: Habib Ammari, Francisco Romero, Cong Shi

Abstract: This paper aims at imaging the dynamics of metabolic activity of cells. Using dynamic optical coherence tomography, we introduce a new multi-particle dynamical model to simulate the movements of the collagen and the cell metabolic activity and develop an efficient signal separation technique for sub-cellular imaging. We perform a singular-value decomposition of the dynamic optical images to isolat… ▽ More This paper aims at imaging the dynamics of metabolic activity of cells. Using dynamic optical coherence tomography, we introduce a new multi-particle dynamical model to simulate the movements of the collagen and the cell metabolic activity and develop an efficient signal separation technique for sub-cellular imaging. We perform a singular-value decomposition of the dynamic optical images to isolate the intensity of the metabolic activity. We prove that the largest eigenvalue of the associated Casorati matrix corresponds to the collagen. We present several numerical simulations to illustrate and validate our approach. △ Less

Submitted 15 August, 2016; originally announced August 2016.

MSC Class: 92C55; 78A46; 65Z05

arXiv:1605.07249 [pdf, ps, other]

A Massive Data Framework for M-Estimators with Cubic-Rate

Authors: Chengchun Shi, Wenbin Lu, Rui Song

Abstract: The divide and conquer method is a common strategy for handling massive data. In this article, we study the divide and conquer method for cubic-rate estimators under the massive data framework. We develop a general theory for establishing the asymptotic distribution of the aggregated M-estimators using a simple average. Under certain condition on the growing rate of the number of subgroups, the re… ▽ More The divide and conquer method is a common strategy for handling massive data. In this article, we study the divide and conquer method for cubic-rate estimators under the massive data framework. We develop a general theory for establishing the asymptotic distribution of the aggregated M-estimators using a simple average. Under certain condition on the growing rate of the number of subgroups, the resulting aggregated estimators are shown to have faster convergence rate and asymptotic normal distribution, which are more tractable in both computation and inference than the original M-estimators based on pooled data. Our theory applies to a wide class of M-estimators with cube root convergence rate, including the location estimator, maximum score estimator and value search estimator. Empirical performance via simulations also validate our theoretical findings. △ Less

Submitted 4 April, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

Comments: 55 pages

arXiv:1512.01295 [pdf, other]

Local commensurability graphs of solvable groups

Authors: Khalid Bou-Rabee, Chen Shi

Abstract: The commensurability index between two subgroups $A, B$ of a group $G$ is $[A : A \cap B] [B : A\cap B]$. This gives a notion of distance amongst finite-index subgroups of $G$, which is encoded in the p-local commensurability graphs of $G$. We show that for any metabelian group, any component of the $p$-local commensurabilty graph of $G$ has diameter bounded above by 4. However, no universal upper… ▽ More The commensurability index between two subgroups $A, B$ of a group $G$ is $[A : A \cap B] [B : A\cap B]$. This gives a notion of distance amongst finite-index subgroups of $G$, which is encoded in the p-local commensurability graphs of $G$. We show that for any metabelian group, any component of the $p$-local commensurabilty graph of $G$ has diameter bounded above by 4. However, no universal upper bound on diameters of components exists for the class of finite solvable groups. In the appendix we give a complete classification of components for upper triangular matrix groups in $\text{GL}(2, \mathbb{F}_q)$. △ Less

Submitted 3 December, 2015; originally announced December 2015.

Comments: 10 pages, 2 figures

MSC Class: 20F65; 20E26; 05C25

arXiv:1505.03774 [pdf, ps, other]

Dynamic Allocation Problems in Loss Network Systems with Advanced Reservation

Authors: Retsef Levi, Cong Shi

Abstract: We consider a class of well-known dynamic resource allocation models in loss network systems with advanced reservation. The most important performance measure in any loss network system is to compute its blocking probability, i.e., the probability of an arriving customer in equilibrium finds a fully utilized system (thereby getting rejected by the system). In this paper, we derive upper bounds on… ▽ More We consider a class of well-known dynamic resource allocation models in loss network systems with advanced reservation. The most important performance measure in any loss network system is to compute its blocking probability, i.e., the probability of an arriving customer in equilibrium finds a fully utilized system (thereby getting rejected by the system). In this paper, we derive upper bounds on the asymptotic blocking probabilities for such systems in high-volume regimes. There have been relatively few results on loss network systems with advanced reservation due to its inherent complexity. The theoretical results find applications in a wide class of revenue management problems in systems with reusable resources and advanced reservation, e.g., hotel room, car rental and workforce management. We propose a simple control policy called the improved class selection policy (ICSP) based on solving a continuous knapsack problem, similar in spirit to the one proposed in Levi and Radovanovic (2010). Using our results derived for loss network systems with advanced reservation, we show the ICSP performs asymptotically near-optimal in high-volume regimes. △ Less

Submitted 14 May, 2015; originally announced May 2015.

arXiv:1408.3205 [pdf, ps, other]

doi 10.1214/14-AOS1228

Optimum mixed level detecting arrays

Authors: Ce Shi, Yu Tang, Jianxing Yin

Abstract: As a type of search design, a detecting array can be used to generate test suites to identify and detect faults caused by interactions of factors in a component-based system. Recently, the construction and optimality of detecting arrays have been investigated in depth in the case where all the factors are assumed to have the same number of levels. However, for real world applications, it is more d… ▽ More As a type of search design, a detecting array can be used to generate test suites to identify and detect faults caused by interactions of factors in a component-based system. Recently, the construction and optimality of detecting arrays have been investigated in depth in the case where all the factors are assumed to have the same number of levels. However, for real world applications, it is more desirable to use detecting arrays in which the various factors may have different numbers of levels. This paper gives a general criterion to measure the optimality of a mixed level detecting array in terms of its size. Based on this optimality criterion, the combinatorial characteristics of mixed level detecting arrays of optimum size are investigated. This enables us to construct optimum mixed level detecting arrays with a heuristic optimization algorithm and combinatorial methods. As a result, some existence results for optimum mixed level detecting arrays achieving a lower bound are provided for practical use. △ Less

Submitted 14 August, 2014; originally announced August 2014.

Comments: Published in at http://dx.doi.org/10.1214/14-AOS1228 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS1228

Journal ref: Annals of Statistics 2014, Vol. 42, No. 4, 1546-1563

arXiv:1307.3019 [pdf, ps, other]

doi 10.1016/j.disc.2016.11.008

Large sets of Kirkman triple systems with order $q^n+2$

Authors: Chen Wang, Cong Shi

Abstract: The existence of Large sets of Kirkman Triple Systems (LKTS) is an old problem in combinatorics. Known results are very limited, and a lot of them are based on the works of Denniston \cite{MR0349416, MR0369086, MR535159, MR539718}. The only known recursive constructions are an tripling construction by Denniston \cite{MR535159}and a product construction by Lei \cite{MR1931492}, both constructs an L… ▽ More The existence of Large sets of Kirkman Triple Systems (LKTS) is an old problem in combinatorics. Known results are very limited, and a lot of them are based on the works of Denniston \cite{MR0349416, MR0369086, MR535159, MR539718}. The only known recursive constructions are an tripling construction by Denniston \cite{MR535159}and a product construction by Lei \cite{MR1931492}, both constructs an LKTS($uv$) on the basis of an LKTS($v$). In this paper, we describe an construction of LKTS$(q^n+2)$ from LKTS$(q+2)$, where $q$ is a prime power of the form $6t+1$. We could construct previous unknown LKTS($v$) by this result, the smallest among them have $v=171,345,363$. △ Less

Submitted 11 July, 2013; originally announced July 2013.

MSC Class: 05B07

Showing 1–35 of 35 results for author: Shi, C