Search | arXiv e-print repository

An improved version of Kac's Central Limit Theorem

Authors: Suprio Bhar, Ritwik Mukherjee, Prathmesh Patil

Abstract: The classical Central Limit Theorem (CLT) states that for a sequence of independent and identically distributed (i.i.d.) random variables with finite mean and variance, the normalized sample mean converges to the standard normal distribution. In $1946$, Victor Kac proved a Central Limit type theorem for a sequence of random variables that were not independent. The random variables under consideration were obtained from the angle-doubling map. The idea behind Kac's proof was to show that although the random variables under consideration were not independent, they were what he calls \textit{statistically independent} (in modern terminology, this concept is called long range independence). The final conclusion of his paper was that the sample averages of the random variables, suitably normalized, converge to the standard normal distribution. In the 1970s, Charles Stein revolutionized the field of probability by discovering a new method to obtain the limiting distribution for a sequence of random variables. Among other things, his method gave an alternative proof of the classical Central Limit Theorem. We obtain an improvement of Victor Kac's result by applying Stein's method. We show that the normalized sample averages converge to the standard normal distribution in the Wasserstein metric, which is stronger than the convergence in distribution.

Submitted 10 May, 2024; originally announced May 2024.

MSC Class: 60F05; 37A99

arXiv:2402.16541 [pdf, other]

Integer Programming Using A Single Atom

Authors: Kapil Goswami, Peter Schmelcher, Rick Mukherjee

Abstract: Integer programming (IP), as the name suggests, is an integer-variable-based approach commonly used to formulate real-world optimization problems with constraints. Currently, quantum algorithms reformulate the IP into an unconstrained form through the use of binary variables, which is an indirect and resource-consuming way of solving it. We develop an algorithm that maps and solves an IP problem in its original form to any quantum system that possesses a large number of accessible internal degrees of freedom that can be controlled with sufficient accuracy. Using a single Rydberg atom as an example, we associate the integer values to electronic states belonging to different manifolds and implement a selective superposition of these different states to solve the full IP problem. The optimal solution is found within a few microseconds for prototypical IP problems with up to eight variables and a maximum number of four constraints. This also includes non-linear IP problems, which are usually harder to solve with classical algorithms when compared to their linear counterparts. Our algorithm for solving IP outperforms a well-known classical algorithm (branch and bound) in terms of the number of steps needed for convergence to the solution. Our approach carries the potential to improve bounds on the solution for larger problems when compared to the classical algorithms.

Submitted 28 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

Comments: 12 pages, 7 figures

arXiv:2402.03249 [pdf, other]

Nonsense associations in Markov random fields with pairwise dependence

Authors: Sohom Bhattacharya, Rajarshi Mukherjee, Elizabeth Ogburn

Abstract: Yule (1926) identified the issue of "nonsense correlations" in time series data, where dependence within each of two random vectors causes overdispersion -- i.e. variance inflation -- for measures of dependence between the two. During the near century since then, much has been written about nonsense correlations -- but nearly all of it confined to the time series literature. In this paper we provide the first, to our knowledge, rigorous study of this phenomenon for more general forms of (positive) dependence, specifically for Markov random fields on lattices and graphs. We consider both binary and continuous random vectors and three different measures of association: correlation, covariance, and the ordinary least squares coefficient that results from projecting one random vector onto the other. In some settings we find variance inflation consistent with Yule's nonsense correlation. However, surprisingly, we also find variance deflation in some settings, and in others the variance is unchanged under dependence. Perhaps most notably, we find general conditions under which OLS inference that ignores dependence is valid despite positive dependence in the regression errors, contradicting the presentation of OLS in countless textbooks and courses.

Submitted 5 February, 2024; originally announced February 2024.

MSC Class: 62E20; 62F03

arXiv:2312.10759 [pdf, other]

Counting curves with tangencies

Authors: Indranil Biswas, Apratim Choudhury, Ritwik Mukherjee, Anantadulal Paul

Abstract: Interpreting tangency as a limit of two transverse intersections, we obtain a concrete formula to enumerate smooth degree d plane curves tangent to a given line at multiple points with arbitrary order of tangency. One-nodal curves with multiple tangencies of any order are enumerated. Also, one-cuspidal curves that are tangent to first order to a given line at multiple points are enumerated. We also present a new way to enumerate curves with one node; it is interpreted as a degeneration of a curve tangent to a given line. That method is extended to enumerate curves with two nodes, and also curves with one tacnode are enumerated. In the final part of the paper, we show how this idea can be applied in the setting of stable maps and perform a concrete computation to enumerate rational curves with first order tangency. A large number of low degree cases have been worked out explicitly.

Submitted 31 March, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

Comments: Changed the title of the paper. 48 pages

MSC Class: 14N35; 14J45; 53D45

arXiv:2306.10590 [pdf, other]

Assumption-lean falsification tests of rate double-robustness of double-machine-learning estimators

Authors: Lin Liu, Rajarshi Mukherjee, James M. Robins

Abstract: The class of doubly-robust (DR) functionals studied by Rotnitzky et al. (2021) is of central importance in economics and biostatistics. It strictly includes both (i) the class of mean-square continuous functionals that can be written as an expectation of an affine functional of a conditional expectation studied by Chernozhukov et al. (2022b) and (ii) the class of functionals studied by Robins et al. (2008). The present state-of-the-art estimators for DR functionals $ψ$ are double-machine-learning (DML) estimators (Chernozhukov et al., 2018). A DML estimator $\widehatψ_{1}$ of $ψ$ depends on estimates $\widehat{p} (x)$ and $\widehat{b} (x)$ of a pair of nuisance functions $p(x)$ and $b(x)$, and is said to satisfy "rate double-robustness" if the Cauchy--Schwarz upper bound of its bias is $o (n^{- 1/2})$. Were it achievable, our scientific goal would have been to construct valid, assumption-lean (i.e. no complexity-reducing assumptions on $b$ or $p$) tests of the validity of a nominal $(1 - α)$ Wald confidence interval (CI) centered at $\widehatψ_{1}$. But this would require a test of the bias to be $o (n^{-1/2})$, which can be shown not to exist. We therefore adopt the less ambitious goal of falsifying, when possible, an analyst's justification for her claim that the reported $(1 - α)$ Wald CI is valid. In many instances, an analyst justifies her claim by imposing complexity-reducing assumptions on $b$ and $p$ to ensure "rate double-robustness". Here we exhibit valid, assumption-lean tests of $H_{0}$: "rate double-robustness holds", with non-trivial power against certain alternatives. If $H_{0}$ is rejected, we will have falsified her justification. However, no assumption-lean test of $H_{0}$, including ours, can be a consistent test. Thus, the failure of our test to reject is not meaningful evidence in favor of $H_{0}$.

Submitted 28 August, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

Comments: corrected several extra typos and references

arXiv:2212.14857 [pdf, other]

Nuisance Function Tuning for Optimal Doubly Robust Estimation

Authors: Sean McGrath, Rajarshi Mukherjee

Abstract: Estimators of doubly robust functionals typically rely on estimating two complex nuisance functions, such as the propensity score and conditional outcome mean for the average treatment effect functional. We consider the problem of how to estimate nuisance functions to obtain optimal rates of convergence for a doubly robust nonparametric functional that has witnessed applications across the causal inference and conditional independence testing literature. For several plug-in type estimators and a one-step type estimator, we illustrate the interplay between different tuning parameter choices for the nuisance function estimators and sample splitting strategies on the optimal rate of estimating the functional of interest. For each of these estimators and each sample splitting strategy, we show the necessity to undersmooth the nuisance function estimators under low regularity conditions to obtain optimal rates of convergence for the functional of interest. By performing suitable nuisance function tuning and sample splitting strategies, we show that some of these estimators can achieve minimax rates of convergence in all Hölder smoothness classes of the nuisance functions.

Submitted 29 May, 2024; v1 submitted 30 December, 2022; originally announced December 2022.

arXiv:2212.01664 [pdf, other]

doi 10.1016/j.aim.2023.109258

Counting rational curves with an $m$-fold point

Authors: Indranil Biswas, Chitrabhanu Chaudhuri, Apratim Choudhury, Ritwik Mukherjee, Anantadulal Paul

Abstract: We obtain a recursive formula for the number of rational degree $d$ curves in $\mathbb{CP}^2$ that pass through $3d+1-m$ generic points and that have an $m$-fold singular point. The special case of counting curves with a triple point was solved earlier by other authors. We obtain the formula by considering a family version of Kontsevich's recursion formula, in contrast to the excess intersection theoretic approach of others. A large number of low degree cases have been worked out explicitly.

Submitted 3 August, 2023; v1 submitted 3 December, 2022; originally announced December 2022.

Comments: 16 pages, 4 figures

MSC Class: 14N35 (Primary) 14J45; 53D45 (Secondary)

Journal ref: Advances in Mathematics, 2023

arXiv:2211.08580 [pdf, ps, other]

Sparse Signal Detection in Heteroscedastic Gaussian Sequence Models: Sharp Minimax Rates

Authors: Julien Chhor, Rajarshi Mukherjee, Subhabrata Sen

Abstract: Given a heterogeneous Gaussian sequence model with unknown mean $θ\in \mathbb R^d$ and known covariance matrix $Σ= \operatorname{diag}(σ_1^2,\dots, σ_d^2)$, we study the signal detection problem against sparse alternatives, for known sparsity $s$. Namely, we characterize how large $ε^*>0$ should be, in order to distinguish with high probability the null hypothesis $θ=0$ from the alternative composed of $s$-sparse vectors in $\mathbb R^d$, separated from $0$ in $L^t$ norm ($t \in [1,\infty]$) by at least $ε^*$. We find minimax upper and lower bounds over the minimax separation radius $ε^*$ and prove that they are always matching. We also derive the corresponding minimax tests achieving these bounds. Our results reveal new phase transitions regarding the behavior of $ε^*$ with respect to the level of sparsity, to the $L^t$ metric, and to the heteroscedasticity profile of $Σ$. In the case of the Euclidean (i.e. $L^2$) separation, we bridge the remaining gaps in the literature.

Submitted 1 August, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

MSC Class: 62G10

arXiv:2209.10774 [pdf, other]

PC Adjusted Testing for Low Dimensional Parameters

Authors: Sohom Bhattacharya, Rounak Dey, Rajarshi Mukherjee

Abstract: In this paper we consider the effect of high dimensional Principal Component (PC) adjustments while inferring the effects of variables on outcomes. This problem is particularly motivated by applications in genetic association studies where one performs PC adjustment to account for population stratification. We consider simple statistical models to obtain asymptotically precise understanding of when such PC adjustments are supposed to work in terms of providing valid tests with controlled Type I errors. We also verify these results through a class of numerical experiments.

Submitted 22 September, 2022; originally announced September 2022.

MSC Class: 62G10; 62G20; 62C20

arXiv:2205.10198 [pdf, other]

A New Central Limit Theorem for the Augmented IPW Estimator: Variance Inflation, Cross-Fit Covariance and Beyond

Authors: Kuanhao Jiang, Rajarshi Mukherjee, Subhabrata Sen, Pragya Sur

Abstract: Estimation of the average treatment effect (ATE) is a central problem in causal inference. In recent times, inference for the ATE in the presence of high-dimensional covariates has been extensively studied. Among the diverse approaches that have been proposed, augmented inverse probability weighting (AIPW) with cross-fitting has emerged as a popular choice in practice. In this work, we study this cross-fit AIPW estimator under well-specified outcome regression and propensity score models in a high-dimensional regime where the number of features and samples are both large and comparable. Under assumptions on the covariate distribution, we establish a new central limit theorem for the suitably scaled cross-fit AIPW that applies without any sparsity assumptions on the underlying high-dimensional parameters. Our CLT uncovers two crucial phenomena among others: (i) the AIPW exhibits a substantial variance inflation that can be precisely quantified in terms of the signal-to-noise ratio and other problem parameters, (ii) the asymptotic covariance between the pre-cross-fit estimators is non-negligible even on the root-n scale. These findings are strikingly different from their classical counterparts. On the technical front, our work utilizes a novel interplay between three distinct tools--approximate message passing theory, the theory of deterministic equivalents, and the leave-one-out approach. We believe our proof techniques should be useful for analyzing other two-stage estimators in this high-dimensional regime. Finally, we complement our theoretical results with simulations that demonstrate both the finite sample efficacy of our CLT and its robustness to our assumptions.

Submitted 28 October, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

Comments: 132 pages, 7 figures; In V2, we added extensive comparisons with the classical variance formula (cf. Sec 3, Fig 2, Fig 4) and elaborated on the non-trivial cross-fit covariance phenomenon further

arXiv:2111.02826 [pdf, other]

Finding the Optimal Dynamic Treatment Regime Using Smooth Fisher Consistent Surrogate Loss

Authors: Nilanjana Laha, Aaron Sonabend-W, Rajarshi Mukherjee, Tianxi Cai

Abstract: Large health care data repositories such as electronic health records (EHR) open new opportunities to derive individualized treatment strategies for complicated diseases such as sepsis. In this paper, we consider the problem of estimating sequential treatment rules tailored to a patient's individual characteristics, often referred to as dynamic treatment regimes (DTRs). Our main objective is to find the optimal DTR that maximizes a discontinuous value function through direct maximization of Fisher consistent surrogate loss functions. In this regard, we demonstrate that a large class of concave surrogates fails to be Fisher consistent -- a behavior that differs from the classical binary classification problems. We further characterize a non-concave family of Fisher consistent smooth surrogate functions, which is amenable to gradient-descent type optimization algorithms. Compared to the existing direct search approach under the support vector machine framework (Zhao et al., 2015), our proposed DTR estimation via surrogate loss optimization (DTRESLO) method is more computationally scalable to large sample sizes and allows for broader functional classes for treatment policies. We establish theoretical properties for our proposed DTR estimator and obtain a sharp upper bound on the regret corresponding to our DTRESLO method. The finite sample performance of our proposed estimator is evaluated through extensive simulations. Finally, we illustrate the working principles and benefits of our method for estimating an optimal DTR for treating sepsis using EHR data from sepsis patients admitted to intensive care units.

Submitted 30 September, 2023; v1 submitted 3 November, 2021; originally announced November 2021.

MSC Class: 62G20

ACM Class: G.3

arXiv:2110.12336 [pdf, other]

Efficient and Robust Semi-supervised Estimation of ATE with Partially Annotated Treatment and Response

Authors: Jue Hou, Rajarshi Mukherjee, Tianxi Cai

Abstract: A notable challenge of leveraging Electronic Health Records (EHR) for treatment effect assessment is the lack of precise information on important clinical variables, including the treatment received and the response. Both treatment information and response often cannot be accurately captured by readily available EHR features and require labor intensive manual chart review to precisely annotate, which limits the number of available gold standard labels on these key variables. We consider average treatment effect (ATE) estimation under such a semi-supervised setting with a large number of unlabeled samples containing both confounders and imperfect EHR features for treatment and response. We derive the efficient influence function for ATE and use it to construct a semi-supervised multiple machine learning (SMMAL) estimator. We showcase that our SMMAL estimator is semi-parametric efficient with B-spline regression under low-dimensional smooth models. We develop the adaptive sparsity/model doubly robust estimation under high-dimensional logistic propensity score and outcome regression models. Results from simulation studies support the validity of our SMMAL method and its superiority over supervised benchmarks.

Submitted 23 October, 2021; originally announced October 2021.

arXiv:2110.02949 [pdf, other]

Sharp Signal Detection Under Ferromagnetic Ising Models

Authors: Sohom Bhattacharya, Rajarshi Mukherjee, Gourab Ray

Abstract: In this paper we study the effect of dependence on detecting a class of structured signals in Ferromagnetic Ising models. Natural examples of our class include Ising Models on lattices, and Mean-Field type Ising Models such as dense Erdős-Rényi, and dense random regular graphs. Our results not only provide sharp constants of detection in each of these cases and thereby pinpoint the precise relationship of the detection problem with the underlying dependence, but also demonstrate how to be agnostic over the strength of dependence present in the respective models.

Submitted 6 October, 2021; originally announced October 2021.

MSC Class: 62G10; 62G20; 62C20

arXiv:2109.11997 [pdf, other]

On Statistical Inference with High Dimensional Sparse CCA

Authors: Nilanjana Laha, Nathan Huey, Brent Coull, Rajarshi Mukherjee

Abstract: We consider asymptotically exact inference on the leading canonical correlation directions and strengths between two high dimensional vectors under sparsity restrictions. In this regard, our main contribution is the development of a loss function, based on which, one can operationalize a one-step bias-correction on reasonable initial estimators. Our analytic results in this regard are adaptive over suitable structural restrictions of the high dimensional nuisance parameters, which, in this set-up, correspond to the covariance matrices of the variables of interest. We further supplement the theoretical guarantees behind our procedures with extensive numerical studies.

Submitted 9 February, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

MSC Class: 62G20; 62G05

arXiv:2109.10481 [pdf, other]

Sparse Uniformity Testing

Authors: Bhaswar B. Bhattacharya, Rajarshi Mukherjee

Abstract: In this paper we consider the uniformity testing problem for high-dimensional discrete distributions (multinomials) under sparse alternatives. More precisely, we derive sharp detection thresholds for testing, based on $n$ samples, whether a discrete distribution supported on $d$ elements differs from the uniform distribution only in $s$ (out of the $d$) coordinates and is $\varepsilon$-far (in total variation distance) from uniformity. Our results reveal various interesting phase transitions which depend on the interplay of the sample size $n$ and the signal strength $\varepsilon$ with the dimension $d$ and the sparsity level $s$. For instance, if the sample size is less than a threshold (which depends on $d$ and $s$), then all tests are asymptotically powerless, irrespective of the magnitude of the signal strength. On the other hand, if the sample size is above the threshold, then the detection boundary undergoes a further phase transition depending on the signal strength. Here, a $χ^2$-type test attains the detection boundary in the dense regime, whereas in the sparse regime a Bonferroni correction of two maximum-type tests and a version of the Higher Criticism test is optimal up to sharp constants. These results combined provide a complete description of the phase diagram for the sparse uniformity testing problem across all regimes of the parameters $n$, $d$, and $s$. One of the challenges in dealing with multinomials is that the parameters are always constrained to lie in the simplex. This results in the aforementioned two-layered phase transition, a new phenomenon which does not arise in classical high-dimensional sparse testing problems.

Submitted 16 February, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

Comments: 33 pages, 1 figure

arXiv:2108.06463 [pdf, other]

On Support Recovery with Sparse CCA: Information Theoretic and Computational Limits

Authors: Nilanjana Laha, Rajarshi Mukherjee

Abstract: In this paper we consider asymptotically exact support recovery in the context of high dimensional and sparse Canonical Correlation Analysis (CCA). Our main results describe four regimes of interest based on information theoretic and computational considerations. In regimes of "low" sparsity we describe a simple, general, and computationally easy method for support recovery, whereas in a regime of "high" sparsity, it turns out that support recovery is information theoretically impossible. For the sake of information theoretic lower bounds, our results also demonstrate a non-trivial requirement on the "minimal" size of the non-zero elements of the canonical vectors that is required for asymptotically consistent support recovery. Subsequently, the regime of "moderate" sparsity is further divided into two sub-regimes. In the lower of the two sparsity regimes, using a sharp analysis of a coordinate thresholding (Deshpande and Montanari, 2014) type method, we show that polynomial time support recovery is possible. In contrast, in the higher end of the moderate sparsity regime, appealing to the "Low Degree Polynomial" Conjecture (Kunisky et al., 2019), we provide evidence that polynomial time support recovery methods are inconsistent. Finally, we carry out numerical experiments to compare the efficacy of various methods discussed.

Submitted 11 October, 2022; v1 submitted 14 August, 2021; originally announced August 2021.

MSC Class: 62G05

arXiv:2106.02589 [pdf, other]

On Ensembling vs Merging: Least Squares and Random Forests under Covariate Shift

Authors: Maya Ramchandran, Rajarshi Mukherjee

Abstract: It has been postulated and observed in practice that for prediction problems in which covariate data can be naturally partitioned into clusters, ensembling algorithms based on suitably aggregating models trained on individual clusters often perform substantially better than methods that ignore the clustering structure in the data. In this paper, we provide theoretical support to these empirical observations by asymptotically analyzing linear least squares and random forest regressions under a linear model. Our main results demonstrate that the benefit of ensembling compared to training a single model on the entire data, often termed 'merging', might depend on the underlying bias and variance interplay of the individual predictors to be aggregated. In particular, under both fixed and high dimensional linear models, we show that merging is asymptotically superior to optimal ensembling techniques for linear least squares regression due to the unbiased nature of least squares prediction. In contrast, for random forest regression under fixed dimensional linear models, our bounds imply a strict benefit of ensembling over merging. Finally, we also present numerical experiments to verify the validity of our asymptotic results across different situations.

Submitted 4 June, 2021; originally announced June 2021.

Comments: 9 pages, 2 figures, 1 table

arXiv:2012.05784 [pdf, ps, other]

Detecting Structured Signals in Ising Models

Authors: Nabarun Deb, Rajarshi Mukherjee, Sumit Mukherjee, Ming Yuan

Abstract: In this paper, we study the effect of dependence on detecting a class of signals in Ising models, where the signals are present in a structured way. Examples include Ising Models on lattices, and Mean-Field type Ising Models (Erdős-Rényi, Random regular, and dense graphs). Our results rely on correlation decay and mixing type behavior for Ising Models, and demonstrate the beneficial behavior of criticality in the detection of strictly lower signals. As a by-product of our proof technique, we develop sharp control on mixing and spin-spin correlation for several Mean-Field type Ising Models in all regimes of temperature -- which might be of independent interest.

Submitted 10 December, 2020; originally announced December 2020.

Comments: 43 pages

MSC Class: 62G10; 62G20; 62C20

arXiv:2007.11933 [pdf, ps, other]

Counting planar curves in $\mathbb{P}^3$ with degenerate singularities

Authors: Nilkantha Das, Ritwik Mukherjee

Abstract: In this paper, we consider the following question: how many degree $d$ curves are there in $\mathbb{P}^3$ (passing through the right number of generic lines and points), whose image lies inside a $\mathbb{P}^2$, having $δ$ nodes and one singularity of codimension $k$. We obtain an explicit formula for this number when $δ+k \leq 4$ (i.e. the total codimension of the singularities is not more than four). We use a topological method to compute the degenerate contribution to the Euler class; it is an extension of the method that originates in a paper by A. Zinger and which is further pursued by S. Basu and the second author. Using this method, we have obtained formulas when the singularities present are more degenerate than nodes (such as cusps, tacnodes and triple points). When the singularities are only nodes, we have verified that our answers are consistent with those obtained by S. Kleiman and R. Piene and by T. Laarakker. We also verify that our answers for the characteristic number of planar cubics with a cusp and the number of planar quartics with two nodes and one cusp are consistent with the answer obtained by R. Singh and the second author, where they compute the characteristic number of rational planar curves in $\mathbb{P}^3$ with a cusp. We also verify some of the numbers predicted by the conjecture made by Pandharipande, regarding the enumerativity of BPS numbers for $\mathbb{P}^3$.

Submitted 23 July, 2020; originally announced July 2020.

Comments: 37 Pages, 3 figures. Comments are welcome

MSC Class: 14N35

arXiv:2005.10664 [pdf, ps, other]

Rational Cuspidal Curves in a moving family of $\mathbb{P}^2$

Authors: Ritwik Mukherjee, Rahul Kumar Singh

Abstract: In this paper we obtain a formula for the number of rational degree $d$ curves in $\mathbb{P}^3$ having a cusp, whose image lies in a $\mathbb{P}^2$ and that passes through $r$ lines and $s$ points (where $r + 2s = 3d + 1$). This problem can be viewed as a family version of the classical question of counting rational cuspidal curves in $\mathbb{P}^2$, which has been studied earlier by Z. Ran, R. Pandharipande and A. Zinger. We obtain this number by computing the Euler class of a relevant bundle and then finding out the corresponding degenerate contribution to the Euler class. The method we use is closely based on the method followed by A. Zinger and I. Biswas, S. D'Mello, R. Mukherjee and V. Pingali. We also verify that our answer for the characteristic numbers of rational cuspidal planar cubics and quartics is consistent with the answer obtained by N. Das and the first author, where they compute the characteristic number of $δ$-nodal planar curves in $\mathbb{P}^3$ with one cusp (for $δ\leq 2$).

Submitted 29 October, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

Comments: 13 pages. Comments are welcome

MSC Class: 14N35; 14J45

arXiv:2003.00570 [pdf, other]

On Minimax Exponents of Sparse Testing

Authors: Rajarshi Mukherjee, Subhabrata Sen

Abstract: We consider exact asymptotics of the minimax risk for global testing against sparse alternatives in the context of high dimensional linear regression. Our results characterize the leading order behavior of this minimax risk in several regimes, uncovering new phase transitions in its behavior. This complements a vast literature characterizing asymptotic consistency in this problem, and provides a useful benchmark, against which the performance of specific tests may be compared. Finally, we provide some preliminary evidence that popular sparsity adaptive procedures might be sub-optimal in terms of the minimax risk.

Submitted 1 March, 2020; originally announced March 2020.

Comments: 53 pages, 2 figures

MSC Class: 62C20; 62F03

arXiv:1909.00772 [pdf, ps, other]

Counting curves in a linear system with upto eight singular points

Authors: Somnath Basu, Ritwik Mukherjee

Abstract: In this paper, we develop a systematic approach to enumerate curves with a certain number of nodes and one further singularity which may be more degenerate. As a result, we obtain an explicit formula for the number of curves in a sufficiently ample linear system, passing through the right number of generic points, that have $δ$ nodes and one singularity of codimension $k$, for all $δ+k \leq 8$. In particular, we recover the formulas for curves with up to six nodal points obtained by Vainsencher. Moreover, all the codimension seven numbers we have obtained agree with the formulas obtained by Kazarian. Finally, in codimension eight, we recover the formula of A. Weber, M. Mikosz and P. Pragacz for curves with one singular point and we also recover the formula of Kleiman and Piene for eight nodal curves. All the other codimension eight numbers we have obtained are new.

Submitted 2 September, 2019; originally announced September 2019.

Comments: 52 pages, 10 figures. Comments are welcome

MSC Class: 14N10; 14H20; 55R55; 57R20; 57R22; 57R45

arXiv:1906.00456 [pdf, other]

On Testing for Parameters in Ising Models

Authors: Rajarshi Mukherjee, Gourab Ray

Abstract: We consider testing for the parameters of Ferromagnetic Ising models. While testing for the presence of possibly sparse magnetizations, we provide a general lower bound of minimax separation rates which yields sharp results in high temperature regimes. Our matching upper bounds are adaptive over both underlying dependence graph and temperature parameter. Moreover our results include the nearest neighbor model on lattices, the sparse Erdős-Rényi random graphs, and regular rooted trees -- right up to the critical parameter in the high temperature regime. We also provide parallel results for the entire low temperature regime in nearest neighbor model on lattices -- however in the plus boundary pure phase. Our results for the nearest neighbor model crucially depend on finite volume analogues of correlation decay property for both high and low temperature regimes -- the derivation of which borrows crucial ideas from FK-percolation theory and might be of independent interest. Finally, we also derive lower bounds for estimation and testing rates in two parameter Ising models -- which turn out to be optimal according to several recent results in this area.

Submitted 2 June, 2019; originally announced June 2019.

Comments: 28 pages, 2 figures

arXiv:1904.04276 [pdf, other]

On nearly assumption-free tests of nominal confidence interval coverage for causal parameters estimated by machine learning

Authors: Lin Liu, Rajarshi Mukherjee, James M. Robins

Abstract: For many causal effect parameters of interest, doubly robust machine learning (DRML) estimators $\hatψ_{1}$ are the state-of-the-art, incorporating the good prediction performance of machine learning; the decreased bias of doubly robust estimators; and the analytic tractability and bias reduction of sample splitting with cross fitting. Nonetheless, even in the absence of confounding by unmeasured factors, the nominal $(1 - α)$ Wald confidence interval $\hatψ_{1} \pm z_{α/ 2} \widehat{\mathsf{se}} [\hatψ_{1}]$ may still undercover even in large samples, because the bias of $\hatψ_{1}$ may be of the same or even larger order than its standard error of order $n^{-1/2}$. In this paper, we introduce essentially assumption-free tests that (i) can falsify the null hypothesis that the bias of $\hatψ_{1}$ is of smaller order than its standard error, (ii) can provide an upper confidence bound on the true coverage of the Wald interval, and (iii) are valid under the null under no smoothness/sparsity assumptions on the nuisance parameters. The tests, which we refer to as \underline{A}ssumption \underline{F}ree \underline{E}mpirical \underline{C}overage \underline{T}ests (AFECTs), are based on a U-statistic that estimates part of the bias of $\hatψ_{1}$.

Submitted 12 July, 2020; v1 submitted 8 April, 2019; originally announced April 2019.

Comments: Significant updates from the previous version. In press in Statistical Science

arXiv:1808.07915 [pdf, other]

On Efficiency of the Plug-in Principle for Estimating Smooth Integrated Functionals of a Nonincreasing Density

Authors: Rajarshi Mukherjee, Bodhisattva Sen

Abstract: We consider the problem of estimating smooth integrated functionals of a monotone nonincreasing density $f$ on $[0,\infty)$ using the nonparametric maximum likelihood based plug-in estimator. We find the exact asymptotic distribution of this natural (tuning parameter-free) plug-in estimator, properly normalized. In particular, we show that the simple plug-in estimator is always $\sqrt{n}$-consistent, and is additionally asymptotically normal with zero mean and the semiparametric efficient variance for estimating a subclass of integrated functionals. Compared to the previous results on this topic (see e.g., Nickl (2007), Gine and Nickl (2008), Jankowski (2014), and Sohl (2015)) our results hold for a much larger class of functionals (which include linear and non-linear functionals) under less restrictive assumptions on the underlying $f$ --- we do not require $f$ to be (i) smooth, (ii) bounded away from $0$, or (iii) compactly supported. Further, when $f$ is the uniform distribution on a compact interval we explicitly characterize the asymptotic distribution of the plug-in estimator --- which now converges at a non-standard rate --- thereby extending the results in Groeneboom and Pyke (1983) for the case of the quadratic functional.

Submitted 14 April, 2019; v1 submitted 23 August, 2018; originally announced August 2018.

arXiv:1808.04237 [pdf, ps, other]

Enumeration of rational curves in a moving family of $\mathbb{P}^2$

Authors: Ritwik Mukherjee, Anantadulal Paul, Rahul Kumar Singh

Abstract: We obtain a recursive formula for the number of rational degree $d$ curves in $\mathbb{P}^3$, whose image lies in a $\mathbb{P}^2$, passing through $r$ lines and $s$ points, where $r + 2s = 3d+2$. This can be viewed as a family version of the classical question of counting rational curves in $\mathbb{P}^2$. We verify that our numbers are consistent with those obtained by T. Laarakker, where he studies the parallel question of counting $δ$-nodal degree $d$ curves in $\mathbb{P}^3$ whose image lies inside a $\mathbb{P}^2$. Our numbers give evidence to support the conjecture, that the polynomials obtained by T. Laarakker are enumerative when $d \geq 1 + [\frac{δ}{2}]$, which is analogous to the Göttsche threshold for counting nodal curves in $\mathbb{P}^2$.

Submitted 13 August, 2018; originally announced August 2018.

Comments: 10 pages. Comments are welcome

MSC Class: 14N35; 14J45

arXiv:1710.03863 [pdf, ps, other]

doi 10.1007/s00440-020-00982-x

On Estimation of $L_{r}$-Norms in Gaussian White Noise Models

Authors: Yanjun Han, Jiantao Jiao, Rajarshi Mukherjee

Abstract: We provide a complete picture of asymptotically minimax estimation of $L_r$-norms (for any $r\ge 1$) of the mean in Gaussian white noise model over Nikolskii-Besov spaces. In this regard, we complement the work of Lepski, Nemirovski and Spokoiny (1999), who considered the cases of $r=1$ (with poly-logarithmic gap between upper and lower bounds) and $r$ even (with asymptotically sharp upper and lower bounds) over Hölder spaces. We additionally consider the case of asymptotically adaptive minimax estimation and demonstrate a difference between even and non-even $r$ in terms of an investigator's ability to produce asymptotically adaptive minimax estimators without paying a penalty.

Submitted 3 March, 2021; v1 submitted 10 October, 2017; originally announced October 2017.

Comments: This version (v6) fixed an error in the proof of Lemma 5.6, and corrected some typos

Journal ref: Published in Probability Theory and Related Fields, vol. 177, no. 3-4, pp. 1243-1294, 2020

arXiv:1705.07577 [pdf, other]

Semiparametric Efficient Empirical Higher Order Influence Function Estimators

Authors: Lin Liu, Rajarshi Mukherjee, Whitney K. Newey, James M. Robins

Abstract: Robins et al. (2008, 2017) applied the theory of higher order influence functions (HOIFs) to derive an estimator of the mean $ψ$ of an outcome Y in a missing data model with Y missing at random conditional on a vector X of continuous covariates; their estimator, in contrast to previous estimators, is semiparametric efficient under the minimal conditions of Robins et al. (2009b), together with an additional (non-minimal) smoothness condition on the density g of X, because the Robins et al. (2008, 2017) estimator depends on a nonparametric estimate of g. In this paper, we introduce a new HOIF estimator that has the same asymptotic properties as the original one, but does not impose any smoothness requirement on g. This is important for two reasons. First, one rarely has the knowledge about the properties of g. Second, even when g is smooth, if the dimension of X is even moderate, accurate nonparametric estimation of its density is not feasible at the sample sizes often encountered in applications. In fact, to the best of our knowledge, this new HOIF estimator remains the only semiparametric efficient estimator of $ψ$ under minimal conditions, despite the rapidly growing literature on causal effect estimation. We also show that our estimator can be generalized to the entire class of functionals considered by Robins et al. (2008) which include the average effect of a treatment on a response Y when a vector X suffices to control confounding and the expected conditional variance of a response Y given a vector X. Simulation experiments are also conducted, which demonstrate that our new estimator outperforms those of Robins et al. (2008, 2017) in finite samples, when g is not very smooth.

Submitted 25 December, 2023; v1 submitted 22 May, 2017; originally announced May 2017.

Comments: 42 pages

arXiv:1705.07527 [pdf, other]

Testing Degree Corrections in Stochastic Block Models

Authors: Rajarshi Mukherjee, Subhabrata Sen

Abstract: We study sharp detection thresholds for degree corrections in Stochastic Block Models in the context of a goodness of fit problem, and explore the effect of the unknown community assignment (a high dimensional nuisance parameter) and the graph density on testing for degree corrections. When degree corrections are relatively dense, a simple test based on the total number of edges is asymptotically optimal. For sparse degree corrections, the results undergo several changes in behavior depending on density of the underlying Stochastic Block Model. For graphs which are not extremely sparse, optimal tests are based on Higher Criticism or Maximum Degree type tests based on a linear combination of within and across (estimated) community degrees. In the special case of balanced communities, a simple degree based Higher Criticism Test (Mukherjee, Mukherjee, Sen 2016) is optimal in case the graph is not completely dense, while the more complicated linear combination based procedure is required in the completely dense setting. The "necessity" of the two step procedure is demonstrated for the case of balanced communities by the failure of the ordinary Maximum Degree Test in achieving sharp constants. Finally for extremely sparse graphs the optimal rates change, and a version of the maximum degree test with a different rejection region is shown to be optimal.

Submitted 15 July, 2019; v1 submitted 21 May, 2017; originally announced May 2017.

Comments: Major re-write; Determines detection thresholds below log n graph density; 61 pages, 1 Fig

arXiv:1611.08293 [pdf, other]

Global Testing Against Sparse Alternatives under Ising Models

Authors: Rajarshi Mukherjee, Sumit Mukherjee, Ming Yuan

Abstract: In this paper, we study the effect of dependence on detecting sparse signals. In particular, we focus on global testing against sparse alternatives for the means of binary outcomes following an Ising model, and establish how the interplay between the strength and sparsity of a signal determines its detectability under various notions of dependence. The profound impact of dependence is best illustrated under the Curie-Weiss model where we observe the effect of a "thermodynamic" phase transition. In particular, the critical state exhibits a subtle "blessing of dependence" phenomenon in that one can detect much weaker signals at criticality than otherwise. Furthermore, we develop a testing procedure that is broadly applicable to account for dependence and show that it is asymptotically minimax optimal under fairly general regularity conditions.

Submitted 5 October, 2017; v1 submitted 24 November, 2016; originally announced November 2016.

Comments: 41 pages

arXiv:1608.01801 [pdf, other]

Detection Thresholds for the $β$-Model on Sparse Graphs

Authors: Rajarshi Mukherjee, Sumit Mukherjee, Subhabrata Sen

Abstract: In this paper we study sharp thresholds for detecting sparse signals in $β$-models for potentially sparse random graphs. The results demonstrate interesting interplay between graph sparsity, signal sparsity, and signal strength. In regimes of moderately dense signals, irrespective of graph sparsity, the detection thresholds mirror corresponding results in independent Gaussian sequence problems. For sparser signals, extreme graph sparsity implies that all tests are asymptotically powerless, irrespective of the signal strength. On the other hand, sharp detection thresholds are obtained, up to matching constants, on denser graphs. The phase transitions mentioned above are sharp. As a crucial ingredient, we study a version of the Higher Criticism Test which is provably sharp up to optimal constants in the regime of sparse signals. The theoretical results are further verified by numerical simulations.

Submitted 27 May, 2017; v1 submitted 5 August, 2016; originally announced August 2016.

Comments: 37 pages, 2 figures, minor corrections

arXiv:1608.01364 [pdf, ps, other]

Adaptive Estimation of Nonparametric Functionals

Authors: Lin Liu, Rajarshi Mukherjee, James Robins, Eric Tchetgen Tchetgen

Abstract: We provide general adaptive upper bounds for estimating nonparametric functionals based on second order U-statistics arising from finite dimensional approximation of the infinite dimensional models. We then provide examples of functionals for which the theory produces rate optimally matching adaptive upper and lower bounds. Our results are automatically adaptive in both parametric and nonparametric regimes of estimation and are automatically adaptive and semiparametric efficient in the regime of parametric convergence rate.

Submitted 3 June, 2021; v1 submitted 3 August, 2016; originally announced August 2016.

Comments: 61 pages, polished writing and added some discussion on numerical issues of wavelets and potential connections to deep neural networks

Journal ref: Journal of Machine Learning Research, 2021, 22

arXiv:1601.05842 [pdf, other]

Asymptotic Normality of Scrambled Geometric Net Quadrature

Authors: Kinjal Basu, Rajarshi Mukherjee

Abstract: In a very recent work, Basu and Owen (2015) propose the use of scrambled geometric nets in numerical integration when the domain is a product of $s$ arbitrary spaces of dimension $d$ having a certain partitioning constraint. It was shown that for a class of smooth functions, the integral estimate has variance $O( n^{-1 -2/d} (\log n)^{s-1})$ for scrambled geometric nets, compared to $O(n^{-1})$ for ordinary Monte Carlo. The main idea of this paper is to develop on the work by Loh (2003), to show that the scrambled geometric net estimate has an asymptotic normal distribution for certain smooth functions defined on products of suitable subsets of $\mathbb{R}^d$.

Submitted 26 April, 2016; v1 submitted 21 January, 2016; originally announced January 2016.

Comments: 41 pages, 6 figures

MSC Class: 62E20; 62D05; 65D30

arXiv:1512.03479 [pdf, ps, other]

Optimal Adaptive Inference in Random Design Binary Regression

Authors: Rajarshi Mukherjee, Subhabrata Sen

Abstract: We construct confidence sets for the regression function in nonparametric binary regression with an unknown design density. These confidence sets are adaptive in $L^2$ loss over a continuous class of Sobolev type spaces. Adaptation holds in the smoothness of the regression function, over the maximal parameter spaces where adaptation is possible, provided the design density is smooth enough. We identify two key regimes --- one where adaptation is possible, and one where some critical regions must be removed. We address related questions about goodness of fit testing and adaptive estimation of relevant parameters.

Submitted 2 August, 2016; v1 submitted 10 December, 2015; originally announced December 2015.

Comments: 37 pages

arXiv:1511.04900 [pdf, other]

Genus two enumerative invariants in del-Pezzo surfaces with a fixed complex structure

Authors: Indranil Biswas, Ritwik Mukherjee, Varun Thakre

Abstract: We obtain a formula for the number of genus two curves with a fixed complex structure of a given degree on a del-Pezzo surface that pass through an appropriate number of generic points of the surface. This is done by extending the symplectic approach of Aleksey Zinger. This enumerative problem is expressed as the difference between the symplectic invariant and an intersection number on the moduli space of rational curves on the surface.

Submitted 2 November, 2017; v1 submitted 16 November, 2015; originally announced November 2015.

Comments: 21 pages; to appear in Geometriae Dedicata. Comments are welcome

MSC Class: 53D45; 14N35; 14J45

arXiv:1509.08284 [pdf, ps, other]

Genus one enumerative invariants in del-Pezzo surfaces with a fixed complex structure

Authors: Indranil Biswas, Ritwik Mukherjee, Varun Thakre

Abstract: We obtain a formula for the number of genus one curves with a fixed complex structure of a given degree on a del-Pezzo surface that pass through an appropriate number of generic points of the surface. This enumerative problem is expressed as the difference between the symplectic invariant and an intersection number on the moduli space of rational curves.

Submitted 25 February, 2016; v1 submitted 28 September, 2015; originally announced September 2015.

Comments: Seven pages. To appear in Comptes Rendus Mathematique

MSC Class: 53D45; 14N35; 14J45

arXiv:1509.06300 [pdf, ps, other]

Rational cuspidal curves on del-Pezzo surfaces

Authors: Indranil Biswas, Shane D'Mello, Ritwik Mukherjee, Vamsi Pingali

Abstract: We obtain an explicit formula for the number of rational cuspidal curves of a given degree on a del-Pezzo surface that pass through an appropriate number of generic points of the surface. This enumerative problem is expressed as an Euler class computation on the moduli space of curves. A topological method is employed in computing the contribution of the degenerate locus to this Euler class.

Submitted 21 September, 2015; originally announced September 2015.

Comments: Comments are welcome

MSC Class: 14N35; 14J45

arXiv:1508.00249 [pdf, ps, other]

Lepski's Method and Adaptive Estimation of Nonlinear Integral Functionals of Density

Authors: Rajarshi Mukherjee, Eric Tchetgen Tchetgen, James Robins

Abstract: We study the adaptive minimax estimation of non-linear integral functionals of a density and extend the results obtained for linear and quadratic functionals to general functionals. The typical rate optimal non-adaptive minimax estimators of "smooth" non-linear functionals are higher order U-statistics. Since Lepski's method requires tight control of tails of such estimators, we bypass such calculations by a modification of Lepski's method which is applicable in such situations. As a necessary ingredient, we also provide a method to control higher order moments of minimax estimator of cubic integral functionals. Following a standard constrained risk inequality method, we also show the optimality of our adaptation rates.

Submitted 11 January, 2016; v1 submitted 2 August, 2015; originally announced August 2015.

Comments: 52 pages

arXiv:1501.01557 [pdf, ps, other]

Counting curves on a general linear system with up to two singular points

Authors: Somnath Basu, Ritwik Mukherjee

Abstract: In this paper we obtain an explicit formula for the number of curves in a compact complex surface $X$ (passing through the right number of generic points) that have up to one node and one singularity of codimension $k$, provided the total codimension is at most $7$. We use a classical fact from differential topology: the number of zeros of a generic smooth section of a vector bundle $V$ over $M$, counted with signs, is the Euler class of $V$ evaluated on the fundamental class of $M$.

Submitted 7 January, 2015; originally announced January 2015.

Comments: 22 pages; generalizes results of our previous papers to curves on any linear system. We welcome comments and suggestions

MSC Class: 14N10; 14H20; 55R55; 57R20; 57R22; 57R45

arXiv:1412.3902 [pdf, ps, other]

Probability distribution of constrained Random Walks

Authors: Ritwik Mukherjee

Abstract: In this paper we consider a sequence of n coin tosses, whose outcome depends on the previous n-1 tosses. In particular, their distribution is not i.i.d. We compute the limiting distribution of this sequence using the method of images.

Submitted 12 December, 2014; originally announced December 2014.

Comments: 8 pages

arXiv:1410.4142 [pdf, ps, other]

Enumeration of singular hypersurfaces on arbitrary complex manifolds

Authors: Ritwik Mukherjee

Abstract: In this paper we obtain an explicit formula for the number of hypersurfaces in a compact complex manifold X (passing through the right number of points) that have a simple node, a cusp or a tacnode. The hypersurfaces belong to a linear system, which is obtained by considering a holomorphic line bundle L over X. Our main tool is a classical fact from differential topology: the number of zeros of a generic smooth section of a vector bundle V over M, counted with a sign, is the Euler class of V evaluated on the fundamental class of M.

Submitted 15 October, 2014; originally announced October 2014.

Comments: 6 pages; comments are welcome

MSC Class: 14N10; 14H20; 55R55; 57R20; 57R22; 57R45

arXiv:1409.6702 [pdf, ps, other]

doi 10.1016/j.bulsci.2014.11.006

Enumeration of curves with two singular points

Authors: Somnath Basu, Ritwik Mukherjee

Abstract: In this paper we obtain an explicit formula for the number of curves in two dimensional complex projective space, of degree d, passing through d(d+3)/2-(k+1) generic points and having one node and one codimension k singularity, where k is at most 6. Our main tool is a classical fact from differential topology: the number of zeros of a generic smooth section of a vector bundle V over M, counted with a sign, is the Euler class of V evaluated on the fundamental class of M.

Submitted 23 September, 2014; originally announced September 2014.

Comments: 56 pages, 1 figure; comments are welcome. arXiv admin note: text overlap with arXiv:1308.2902

MSC Class: 14N10; 14H20; 55R55; 57R20; 57R22; 57R45

Journal ref: Bulletin des Sciences Mathematiques Volume 139, Issue 6, September 2015, pp 667-735

arXiv:1405.0136 [pdf, ps, other]

Value sharing by an entire function with its derivatives

Authors: Indrajit Lahiri, Rajib Mukherjee

Abstract: We prove a uniqueness theorem for an entire function, which shares certain values with its higher order derivatives.

Submitted 1 May, 2014; originally announced May 2014.

Comments: Accepted for publication in Acta Math. Vietnam

arXiv:1308.2902 [pdf, ps, other]

Enumeration of curves with one singular point

Authors: Somnath Basu, Ritwik Mukherjee

Abstract: In this paper we obtain an explicit formula for the number of degree d curves in two dimensional complex projective space, passing through (d(d+3)/2 -k) generic points and having a codimension k singularity, where k is at most 7. In the past, many of these numbers were computed using techniques from algebraic geometry. In this paper we use purely topological methods to count curves. Our main tool is a classical fact from differential topology: the number of zeros of a generic smooth section of a vector bundle V over M, counted with a sign, is the Euler class of V evaluated on the fundamental class of M.

Submitted 7 January, 2015; v1 submitted 13 August, 2013; originally announced August 2013.

Comments: 61 pages; changed to larger version (this one) to facilitate reference regarding details

MSC Class: 14N10; 14H20; 55R55; 57R20; 57R22; 57R45

arXiv:1308.0764 [pdf, ps, other]

doi 10.1214/14-AOS1279

Hypothesis testing for high-dimensional sparse binary regression

Authors: Rajarshi Mukherjee, Natesh S. Pillai, Xihong Lin

Abstract: In this paper, we study the detection boundary for minimax hypothesis testing in the context of high-dimensional, sparse binary regression models. Motivated by genetic sequencing association studies for rare variant effects, we investigate the complexity of the hypothesis testing problem when the design matrix is sparse. We observe a new phenomenon in the behavior of the detection boundary which does not occur in the case of Gaussian linear regression. We derive the detection boundary as a function of two components: a design matrix sparsity index and signal strength, each of which is a function of the sparsity of the alternative. For any alternative, if the design matrix sparsity index is too high, any test is asymptotically powerless irrespective of the magnitude of signal strength. For binary design matrices with a sparsity index that is not too high, our results are parallel to those in the Gaussian case. In this context, we derive detection boundaries for both dense and sparse regimes. For the dense regime, we show that the generalized likelihood ratio is rate optimal; for the sparse regime, we propose an extended Higher Criticism Test and show it is rate optimal and sharp. We illustrate the finite sample properties of the theoretical results using simulation studies.

Submitted 5 March, 2015; v1 submitted 3 August, 2013; originally announced August 2013.

Comments: Published at http://dx.doi.org/10.1214/14-AOS1279 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS1279

Journal ref: Annals of Statistics 2015, Vol. 43, No. 1, 352-381

Showing 1–45 of 45 results for author: Mukherjee, R