Showing 1–50 of 131 results for author: Ramdas, A

  1. arXiv:2404.15586  [pdf, other]

    stat.ME

    Multiple testing with anytime-valid Monte-Carlo p-values

    Authors: Lasse Fischer, Aaditya Ramdas

    Abstract: In contemporary problems involving genetic or neuroimaging data, thousands of hypotheses need to be tested. Due to their high power, and finite sample guarantees on type-1 error under weak assumptions, Monte-Carlo permutation tests are often considered as gold standard for these settings. However, the enormous computational effort required for (thousands of) permutation tests is a major burden. Re…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 22 pages, 2 figures

  2. arXiv:2403.15527  [pdf, other]

    stat.ML cs.LG

    Conformal online model aggregation

    Authors: Matteo Gasparin, Aaditya Ramdas

    Abstract: Conformal prediction equips machine learning models with a reasonable notion of uncertainty quantification without making strong distributional assumptions. It wraps around any black-box prediction model and converts point predictions into set predictions that have a predefined marginal coverage guarantee. However, conformal prediction only works if we fix the underlying machine learning model in…

    Submitted 2 May, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 22 pages, 12 figures. arXiv admin note: text overlap with arXiv:2401.09379

  3. arXiv:2402.18810  [pdf, ps, other]

    math.ST stat.ME

    The numeraire e-variable and reverse information projection

    Authors: Martin Larsson, Aaditya Ramdas, Johannes Ruf

    Abstract: We consider testing a composite null hypothesis $\mathcal{P}$ against a point alternative $\mathsf{Q}$ using e-variables, which are nonnegative random variables $X$ such that $\mathbb{E}_\mathsf{P}[X] \leq 1$ for every $\mathsf{P} \in \mathcal{P}$. This paper establishes a fundamental result: under no conditions whatsoever on $\mathcal{P}$ or $\mathsf{Q}$, there exists a special e-variable $X^*$ t…

    Submitted 4 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  4. arXiv:2402.09698  [pdf, other]

    stat.ME cs.LG math.PR math.ST stat.ML

    Combining Evidence Across Filtrations Using Adjusters

    Authors: Yo Joong Choe, Aaditya Ramdas

    Abstract: In anytime-valid sequential inference, it is known that any admissible procedure must be based on e-processes, which are composite generalizations of test martingales that quantify the accumulated evidence against a composite null hypothesis at any arbitrary stopping time. This paper studies methods for combining e-processes constructed using different information sets (filtrations) for the same n…

    Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Substantially revised with new results in Sections 5 and 6. Code is available at https://github.com/yjchoe/CombiningEvidenceAcrossFiltrations

  5. arXiv:2401.15567  [pdf, other]

    math.PR math.FA math.ST stat.ME stat.ML

    Positive Semidefinite Supermartingales and Randomized Matrix Concentration Inequalities

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: We present new concentration inequalities for either martingale dependent or exchangeable random symmetric matrices under a variety of tail conditions, encompassing now-standard Chernoff bounds to self-normalized heavy-tailed settings. These inequalities are often randomized in a way that renders them strictly tighter than existing deterministic results in the literature, are typically expressed i…

    Submitted 26 February, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    MSC Class: 60B20; 60G48; 62L10

  6. arXiv:2401.15063  [pdf, other]

    stat.ME math.ST stat.OT

    Graph fission and cross-validation

    Authors: James Leiner, Aaditya Ramdas

    Abstract: We introduce a technique called graph fission which takes in a graph which potentially contains only one observation per node (whose distribution lies in a known class) and produces two (or more) independent graphs with the same node/edge set in a way that splits the original graph's information amongst them in any desired proportion. Our proposal builds on data fission/thinning, a method that use…

    Submitted 29 January, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 19 pages, 9 figures

  7. arXiv:2401.09379  [pdf, other]

    stat.ME

    Merging uncertainty sets via majority vote

    Authors: Matteo Gasparin, Aaditya Ramdas

    Abstract: Given $K$ uncertainty sets that are arbitrarily dependent -- for example, confidence intervals for an unknown parameter obtained with $K$ different estimators, or prediction sets obtained via conformal prediction based on $K$ different algorithms on shared data -- we address the question of how to efficiently combine them in a black-box manner to produce a single uncertainty set. We present a simp…

    Submitted 22 March, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Removed COMA (developed in separate paper), added derandomization examples. 34 pages, 8 figures, 2 tables

  8. arXiv:2401.07365  [pdf, other]

    stat.ME

    Sequential Monte-Carlo testing by betting

    Authors: Lasse Fischer, Aaditya Ramdas

    Abstract: In a Monte-Carlo test, the observed dataset is fixed, and several resampled or permuted versions of the dataset are generated in order to test a null hypothesis that the original dataset is exchangeable with the resampled/permuted ones. Sequential Monte-Carlo tests aim to save computational resources by generating these additional datasets sequentially one by one, and potentially stopping early. W…

    Submitted 18 March, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: 33 pages, 8 figures

  9. arXiv:2311.18274  [pdf, other]

    stat.ML cs.LG stat.ME

    Semiparametric Efficient Inference in Adaptive Experiments

    Authors: Thomas Cook, Alan Mishler, Aaditya Ramdas

    Abstract: We consider the problem of efficient inference of the Average Treatment Effect in a sequential experiment where the policy governing the assignment of subjects to treatment or control can change over time. We first provide a central limit theorem for the Adaptive Augmented Inverse-Probability Weighted estimator, which is semiparametric efficient, under weaker assumptions than those previously made…

    Submitted 4 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 24 pages, 6 figures. To appear at CLeaR 2024

  10. arXiv:2311.08168  [pdf, other]

    math.ST cs.IT stat.ME stat.ML

    Time-Uniform Confidence Spheres for Means of Random Vectors

    Authors: Ben Chugg, Hongjian Wang, Aaditya Ramdas

    Abstract: We derive and study time-uniform confidence spheres -- confidence sphere sequences (CSSs) -- which contain the mean of random vectors with high probability simultaneously across all sample sizes. Inspired by the original work of Catoni and Giulini, we unify and extend their analysis to cover both the sequential setting and to handle a variety of distributional assumptions. Our results include an e…

    Submitted 28 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 46 pages, 1 figure

  11. arXiv:2311.06412  [pdf, other]

    stat.ME stat.ML

    Online multiple testing with e-values

    Authors: Ziyu Xu, Aaditya Ramdas

    Abstract: A scientist tests a continuous stream of hypotheses over time in the course of her investigation -- she does not test a predetermined, fixed number of hypotheses. The scientist wishes to make as many discoveries as possible while ensuring the number of false discoveries is controlled -- a well recognized way for accomplishing this is to control the false discovery rate (FDR). Prior methods for FDR…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 26 pages, 4 figures

  12. arXiv:2311.03343  [pdf, other]

    math.ST stat.ME

    Distribution-uniform anytime-valid sequential inference

    Authors: Ian Waudby-Smith, Edward H. Kennedy, Aaditya Ramdas

    Abstract: Are asymptotic confidence sequences and anytime $p$-values uniformly valid for a nontrivial class of distributions $\mathcal{P}$? We give a positive answer to this question by deriving distribution-uniform anytime-valid inference procedures. Historically, anytime-valid methods -- including confidence sequences, anytime $p$-values, and sequential hypothesis tests that enable inference at stopping t…

    Submitted 18 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  13. arXiv:2310.19384  [pdf, other]

    stat.ML cs.LG

    Deep anytime-valid hypothesis testing

    Authors: Teodora Pandeva, Patrick Forré, Aaditya Ramdas, Shubhanshu Shekhar

    Abstract: We propose a general framework for constructing powerful, sequential hypothesis tests for a large class of nonparametric testing problems. The null hypothesis for these problems is defined in an abstract form using the action of two known operators on the data distribution. This abstraction allows for a unified treatment of several classical tasks, such as two-sample testing, independence testing,…

    Submitted 30 October, 2023; originally announced October 2023.

  14. arXiv:2310.16626  [pdf, other]

    stat.ME stat.AP

    Scalable Causal Structure Learning via Amortized Conditional Independence Testing

    Authors: James Leiner, Brian Manzo, Aaditya Ramdas, Wesley Tansey

    Abstract: Controlling false positives (Type I errors) through statistical hypothesis testing is a foundation of modern scientific data analysis. Existing causal structure discovery algorithms either do not provide Type I error control or cannot scale to the size of modern scientific datasets. We consider a variant of the causal discovery problem with two sets of nodes, where the only edges of interest form…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 10 figures, 24 pages

  15. arXiv:2310.14293  [pdf, other]

    stat.ME

    Testing exchangeability by pairwise betting

    Authors: Aytijhya Saha, Aaditya Ramdas

    Abstract: In this paper, we address the problem of testing exchangeability of a sequence of random variables, $X_1, X_2,\cdots$. This problem has been studied under the recently popular framework of testing by betting. But the mapping of testing problems to game is not one to one: many games can be designed for the same test. Past work established that it is futile to play single game betting on every obser…

    Submitted 30 December, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

  16. arXiv:2310.09100  [pdf, other]

    math.PR math.ST stat.ME

    Time-Uniform Self-Normalized Concentration for Vector-Valued Processes

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: Self-normalized processes arise naturally in many statistical tasks. While self-normalized concentration has been extensively studied for scalar-valued processes, there is less work on multidimensional processes outside of the sub-Gaussian setting. In this work, we construct a general, self-normalized inequality for $\mathbb{R}^d$-valued processes that satisfy a simple yet broad "sub-$ψ$" tail con…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 50 pages, 3 figures

  17. arXiv:2310.03722  [pdf, other]

    math.ST cs.LG stat.ME stat.ML

    Anytime-valid t-tests and confidence sequences for Gaussian means with unknown variance

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: In 1976, Lai constructed a nontrivial confidence sequence for the mean $μ$ of a Gaussian distribution with unknown variance $σ^2$. Curiously, he employed both an improper (right Haar) mixture over $σ$ and an improper (flat) mixture over $μ$. Here, we elaborate carefully on the details of his construction, which use generalized nonintegrable martingales and an extended Ville's inequality. While thi…

    Submitted 14 May, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Substantive revision in v3 (Apr 23 2024)

  18. arXiv:2310.01547  [pdf, other]

    math.ST cs.IT cs.LG stat.AP stat.ML

    On the near-optimality of betting confidence sets for bounded means

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: Constructing nonasymptotic confidence intervals (CIs) for the mean of a univariate distribution from independent and identically distributed (i.i.d.) observations is a fundamental task in statistics. For bounded observations, a classical nonparametric approach proceeds by inverting standard concentration bounds, such as Hoeffding's or Bernstein's inequalities. Recently, an alternative betting-base…

    Submitted 24 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 53 pages, 2 figures

  19. arXiv:2309.09111  [pdf, ps, other]

    math.ST cs.LG stat.ME stat.ML

    Reducing sequential change detection to sequential estimation

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We consider the problem of sequential change detection, where the goal is to design a scheme for detecting any changes in a parameter or functional $θ$ of the data stream distribution that has small detection delay, but guarantees control on the frequency of false alarms in the absence of changes. In this paper, we describe a simple reduction from sequential change detection to sequential estimati…

    Submitted 24 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 11 pages

  20. arXiv:2309.04002  [pdf, other]

    stat.ME

    Total Variation Floodgate for Variable Importance Inference in Classification

    Authors: Wenshuo Wang, Lucas Janson, Lihua Lei, Aaditya Ramdas

    Abstract: Inferring variable importance is the key problem of many scientific studies, where researchers seek to learn the effect of a feature $X$ on the outcome $Y$ in the presence of confounding variables $Z$. Focusing on classification problems, we define the expected total variation (ETV), which is an intuitive and deterministic measure of variable importance that does not rely on any model context. We…

    Submitted 7 September, 2023; originally announced September 2023.

  21. arXiv:2307.07539  [pdf, ps, other]

    cs.LG math.ST stat.ML

    On the Sublinear Regret of GP-UCB

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: In the kernelized bandit problem, a learner aims to sequentially compute the optimum of a function lying in a reproducing kernel Hilbert space given only noisy evaluations at sequentially chosen points. In particular, the learner aims to minimize regret, which is a measure of the suboptimality of the choices made. Arguably the most popular algorithm is the Gaussian Process Upper Confidence Bound (…

    Submitted 14 August, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 20 pages, 0 figures

  22. arXiv:2306.06721  [pdf, other]

    stat.ML cs.CR cs.LG

    Differentially Private Conditional Independence Testing

    Authors: Iden Kalemaj, Shiva Prasad Kasiviswanathan, Aaditya Ramdas

    Abstract: Conditional independence (CI) tests are widely used in statistical data analysis, e.g., they are the building block of many algorithms for causal graph discovery. The goal of a CI test is to accept or reject the null hypothesis that $X \perp \!\!\! \perp Y \mid Z$, where $X \in \mathbb{R}, Y \in \mathbb{R}, Z \in \mathbb{R}^d$. In this work, we investigate conditional independence testing under th…

    Submitted 22 March, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

  23. arXiv:2305.17570  [pdf, other]

    stat.ML cs.AI cs.CY cs.LG stat.AP stat.ME

    Auditing Fairness by Betting

    Authors: Ben Chugg, Santiago Cortes-Gomez, Bryan Wilder, Aaditya Ramdas

    Abstract: We provide practical, efficient, and nonparametric methods for auditing the fairness of deployed classification and regression models. Whereas previous work relies on a fixed-sample size, our methods are sequential and allow for the continuous monitoring of incoming data, making them highly amenable to tracking the fairness of real-world systems. We also allow the data to be collected by a probabi…

    Submitted 29 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023. 29 pages, 5 figures

  24. arXiv:2305.16539  [pdf, other]

    math.ST cs.IT math.PR stat.ME

    On the existence of powerful p-values and e-values for composite hypotheses

    Authors: Zhenyuan Zhang, Aaditya Ramdas, Ruodu Wang

    Abstract: Given a composite null $ \mathcal P$ and composite alternative $ \mathcal Q$, when and how can we construct a p-value whose distribution is exactly uniform under the null, and stochastically smaller than uniform under the alternative? Similarly, when and how can we construct an e-value whose expectation exactly equals one under the null, but its expected logarithm under the alternative is positive…

    Submitted 23 May, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 47 pages, 6 figures

  25. arXiv:2305.11126  [pdf, other]

    stat.ME

    More powerful multiple testing under dependence via randomization

    Authors: Ziyu Xu, Aaditya Ramdas

    Abstract: We show that two procedures for false discovery rate (FDR) control -- the Benjamini-Yekutieli procedure for dependent p-values, and the e-Benjamini-Hochberg procedure for dependent e-values -- can both be made more powerful by a simple randomization involving one independent uniform random variable. As a corollary, the Hommel test under arbitrary dependence is also improved. Importantly, our rando…

    Submitted 24 April, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 45 pages, 9 figures

  26. arXiv:2305.10564  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    Counterfactually Comparing Abstaining Classifiers

    Authors: Yo Joong Choe, Aditya Gangrade, Aaditya Ramdas

    Abstract: Abstaining classifiers have the option to abstain from making predictions on inputs that they are unsure about. These classifiers are becoming increasingly popular in high-stakes decision-making problems, as they can withhold uncertain predictions to improve their reliability and safety. When evaluating black-box abstaining classifier(s), however, we lack a principled approach that accounts for wh…

    Submitted 9 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023. Preliminary work presented at the ICML 2023 Workshop on Counterfactuals in Minds and Machines. Code available at https://github.com/yjchoe/ComparingAbstainingClassifiers

  27. arXiv:2305.06884  [pdf, ps, other]

    stat.ME cs.AI cs.LG math.ST stat.AP stat.ML

    Risk-limiting Financial Audits via Weighted Sampling without Replacement

    Authors: Shubhanshu Shekhar, Ziyu Xu, Zachary C. Lipton, Pierre J. Liang, Aaditya Ramdas

    Abstract: We introduce the notion of a risk-limiting financial auditing (RLFA): given $N$ transactions, the goal is to estimate the total misstated monetary fraction ($m^*$) to a given accuracy $ε$, with confidence $1-δ$. We do this by constructing new confidence sequences (CSs) for the weighted average of $N$ unknown values, based on samples drawn without replacement according to a (randomized) weighted sa…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 23 pages, 8 figures, to appear in the Proceedings of Uncertainty in Artificial Intelligence (UAI) 2023

  28. arXiv:2305.00143  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Sequential Predictive Two-Sample and Independence Testing

    Authors: Aleksandr Podkopaev, Aaditya Ramdas

    Abstract: We study the problems of sequential nonparametric two-sample and independence testing. Sequential tests process data online and allow using observed data to decide whether to stop and reject the null hypothesis or to collect more data, while maintaining type I error control. We build upon the principle of (nonparametric) testing by betting, where a gambler places bets on future observations and th…

    Submitted 19 July, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

  29. arXiv:2305.00070  [pdf, other]

    cs.LG cs.AI math.ST stat.ME stat.ML

    Online Platt Scaling with Calibeating

    Authors: Chirag Gupta, Aaditya Ramdas

    Abstract: We present an online post-hoc calibration method, called Online Platt Scaling (OPS), which combines the Platt scaling technique with online logistic regression. We demonstrate that OPS smoothly adapts between i.i.d. and non-i.i.d. settings with distribution drift. Further, in scenarios where the best Platt scaling model is itself miscalibrated, we enhance OPS by incorporating a recently developed…

    Submitted 16 August, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: ICML 2023; 24 pages and 16 figures

  30. arXiv:2304.13237  [pdf, other]

    stat.ME stat.ML

    An Efficient Doubly-Robust Test for the Kernel Treatment Effect

    Authors: Diego Martinez-Taboada, Aaditya Ramdas, Edward H. Kennedy

    Abstract: The average treatment effect, which is the difference in expectation of the counterfactuals, is probably the most popular target effect in causal inference with binary treatments. However, treatments may have effects beyond the mean, for instance decreasing or increasing the variance. We propose a new kernel-based test for distributional effects of the treatment. It is, to the best of our knowledg…

    Submitted 31 October, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  31. arXiv:2304.02611  [pdf, other]

    math.ST cs.IT math.PR stat.ME

    Randomized and Exchangeable Improvements of Markov's, Chebyshev's and Chernoff's Inequalities

    Authors: Aaditya Ramdas, Tudor Manole

    Abstract: We present simple randomized and exchangeable improvements of Markov's inequality, as well as Chebyshev's inequality and Chernoff bounds. Our variants are never worse and typically strictly more powerful than the original inequalities. The proofs are short and elementary, and can easily yield similarly randomized or exchangeable versions of a host of other inequalities that employ Markov's inequal…

    Submitted 9 May, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  32. arXiv:2304.01163  [pdf, ps, other]

    math.PR math.ST stat.ME stat.ML

    The extended Ville's inequality for nonintegrable nonnegative supermartingales

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: Following the initial work by Robbins, we rigorously present an extended theory of nonnegative supermartingales, requiring neither integrability nor finiteness. In particular, we derive a key maximal inequality foreshadowed by Robbins, which we call the extended Ville's inequality, that strengthens the classical Ville's inequality (for integrable nonnegative supermartingales), and also applies to…

    Submitted 15 April, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

  33. Anytime-Valid Confidence Sequences in an Enterprise A/B Testing Platform

    Authors: Akash V. Maharaj, Ritwik Sinha, David Arbour, Ian Waudby-Smith, Simon Z. Liu, Moumita Sinha, Raghavendra Addanki, Aaditya Ramdas, Manas Garg, Viswanathan Swaminathan

    Abstract: A/B tests are the gold standard for evaluating digital experiences on the web. However, traditional "fixed-horizon" statistical methods are often incompatible with the needs of modern industry practitioners as they do not permit continuous monitoring of experiments. Frequent evaluation of fixed-horizon tests ("peeking") leads to inflated type-I error and can result in erroneous conclusions. We hav…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 15 pages, 12 figures. Expanded version of ACM Web Conference Proceedings paper

    ACM Class: G.3

    Journal ref: Companion Proceedings of the ACM Web Conference 2023 (WWW '23 Companion)

  34. arXiv:2302.03421  [pdf, ps, other]

    stat.ML cs.IT cs.LG math.ST

    A unified recipe for deriving (time-uniform) PAC-Bayes bounds

    Authors: Ben Chugg, Hongjian Wang, Aaditya Ramdas

    Abstract: We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike most previous literature on this topic, our bounds are anytime-valid (i.e., time-uniform), meaning that they hold at all stopping times, not only for a fixed sample size. Our approach combines four tools in the following order: (a) nonnegative supermartingales or reverse submartingales, (b) the method of mixture…

    Submitted 3 January, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 56 pages. Published in the Journal of Machine Learning Research, Volume 24 Issue 372

  35. arXiv:2302.02544  [pdf, other]

    math.ST cs.IT cs.LG stat.ME stat.ML

    Sequential change detection via backward confidence sequences

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We present a simple reduction from sequential estimation to sequential changepoint detection (SCD). In short, suppose we are interested in detecting changepoints in some parameter or functional $θ$ of the underlying distribution. We demonstrate that if we can construct a confidence sequence (CS) for $θ$, then we can also successfully perform SCD for $θ$. This is accomplished by checking if two CSs…

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: 24 pages, 10 figures

  36. arXiv:2301.09573  [pdf, other]

    math.ST cs.LG stat.ME stat.ML

    Huber-Robust Confidence Sequences

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: Confidence sequences are confidence intervals that can be sequentially tracked, and are valid at arbitrary data-dependent stopping times. This paper presents confidence sequences for a univariate mean of an unknown distribution with a known upper bound on the $p$-th central moment ($p$ > 1), but allowing for (at most) $ε$ fraction of arbitrary distribution corruption, as in Huber's contamination m…

    Submitted 7 February, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted for publication at the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  37. arXiv:2301.03542  [pdf, ps, other]

    math.ST stat.ME

    A Sequential Test for Log-Concavity

    Authors: Aditya Gangrade, Alessandro Rinaldo, Aaditya Ramdas

    Abstract: On observing a sequence of i.i.d. data with distribution $P$ on $\mathbb{R}^d$, we ask the question of how one can test the null hypothesis that $P$ has a log-concave density. This paper proves one interesting negative and positive result: the non-existence of test (super)martingales, and the consistency of universal inference. To elaborate, the set of log-concave distributions $\mathcal{L}$ is a…

    Submitted 9 January, 2023; originally announced January 2023.

  38. arXiv:2212.09706  [pdf, ps, other]

    math.ST math.PR stat.ME

    Multiple testing under negative dependence

    Authors: Ziyu Chi, Aaditya Ramdas, Ruodu Wang

    Abstract: The multiple testing literature has primarily dealt with three types of dependence assumptions between p-values: independence, positive regression dependence, and arbitrary dependence. In this paper, we provide what we believe are the first theoretical results under various notions of negative dependence (negative Gaussian dependence, negative regression dependence, negative association, negative…

    Submitted 8 May, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 28 pages, 5 figures

  39. arXiv:2212.09108  [pdf, ps, other]

    stat.ME cs.LG math.ST stat.ML

    A Permutation-Free Kernel Independence Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: In nonparametric independence testing, we observe i.i.d. data $\{(X_i,Y_i)\}_{i=1}^n$, where $X \in \mathcal{X}, Y \in \mathcal{Y}$ lie in any general spaces, and we wish to test the null that $X$ is independent of $Y$. Modern test statistics such as the kernel Hilbert-Schmidt Independence Criterion (HSIC) and Distance Covariance (dCov) have intractable null distributions due to the degeneracy of…

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: 52 pages, 4 figures

  40. arXiv:2212.07383  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Sequential Kernelized Independence Testing

    Authors: Aleksandr Podkopaev, Patrick Blöbaum, Shiva Prasad Kasiviswanathan, Aaditya Ramdas

    Abstract: Independence testing is a classical statistical problem that has been extensively studied in the batch setting when one fixes the sample size before collecting data. However, practitioners often prefer procedures that adapt to the complexity of a problem at hand instead of setting sample size in advance. Ideally, such procedures should (a) stop earlier on easy tasks (and later on harder tasks), he…

    Submitted 19 July, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: To appear at ICML 2023

  41. arXiv:2211.14908  [pdf, other]

    stat.ME cs.LG math.ST stat.ML

    A Permutation-free Kernel Two-Sample Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: The kernel Maximum Mean Discrepancy (MMD) is a popular multivariate distance metric between distributions that has found utility in two-sample testing. The usual kernel-MMD test statistic is a degenerate U-statistic under the null, and thus it has an intractable limiting distribution. Hence, to design a level-$α$ test, one usually selects the rejection threshold as the $(1-α)$-quantile of the perm…

    Submitted 4 February, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Published at the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), with an oral presentation

  42. arXiv:2210.10768  [pdf, other]

    stat.ME cs.LG math.ST stat.ML

    Anytime-valid off-policy inference for contextual bandits

    Authors: Ian Waudby-Smith, Lili Wu, Aaditya Ramdas, Nikos Karampatziakis, Paul Mineiro

    Abstract: Contextual bandit algorithms are ubiquitous tools for active sequential experimentation in healthcare and the tech industry. They involve online learning algorithms that adaptively learn policies over time to map observed contexts $X_t$ to actions $A_t$ in an attempt to maximize stochastic rewards $R_t$. This adaptivity raises interesting but hard statistical inference questions, especially counte…

    Submitted 17 November, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 40 pages, 6 figures

  43. arXiv:2210.04334  [pdf, other]

    stat.ME cs.LG eess.SP

    QuTE: decentralized multiple testing on sensor networks with false discovery rate control

    Authors: Aaditya Ramdas, Jianbo Chen, Martin J. Wainwright, Michael I. Jordan

    Abstract: This paper designs methods for decentralized multiple hypothesis testing on graphs that are equipped with provable guarantees on the false discovery rate (FDR). We consider the setting where distinct agents reside on the nodes of an undirected graph, and each agent possesses p-values corresponding to one or more hypotheses local to its node. Each agent must individually decide whether to reject on…

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: This paper appeared in the IEEE CDC'17 conference proceedings. The last two sections were then developed in 2018, and it is now being put on arXiv simply for easier access

  44. arXiv:2210.01948  [pdf, ps, other]

    math.ST cs.GT cs.IT stat.ME

    Game-theoretic statistics and safe anytime-valid inference

    Authors: Aaditya Ramdas, Peter Grünwald, Vladimir Vovk, Glenn Shafer

    Abstract: Safe anytime-valid inference (SAVI) provides measures of statistical evidence and certainty -- e-processes for testing and confidence sequences for estimation -- that remain valid at all stopping times, accommodating continuous monitoring and analysis of accumulating data and optional stopping or continuation for any reason. These measures crucially rely on test martingales, which are nonnegative…

    Submitted 17 June, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 25 pages. Under review. ArXiv does not compile/space some references properly

  45. arXiv:2208.11418  [pdf, other]

    stat.ME stat.AP

    Online multiple hypothesis testing

    Authors: David S. Robertson, James M. S. Wason, Aaditya Ramdas

    Abstract: Modern data analysis frequently involves large-scale hypothesis testing, which naturally gives rise to the problem of maintaining control of a suitable type I error rate, such as the false discovery rate (FDR). In many biomedical and technological applications, an additional complexity is that hypotheses are tested in an online manner, one-by-one over time. However, traditional procedures that con…

    Submitted 24 July, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Updated in response to reviewer comments

    MSC Class: 62-02

  46. arXiv:2206.07234  [pdf, other]

    cs.LG cs.CR cs.DS stat.ML

    Brownian Noise Reduction: Maximizing Privacy Subject to Accuracy Constraints

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas, Ryan Rogers

    Abstract: There is a disconnect between how researchers and practitioners handle privacy-utility tradeoffs. Researchers primarily operate from a privacy first perspective, setting strict privacy requirements and minimizing risk subject to these constraints. Practitioners often desire an accuracy first perspective, possibly satisfied with the greatest privacy they can get subject to obtaining sufficiently sm…

    Submitted 10 November, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 26 pages, 4 figures

  47. arXiv:2204.13581  [pdf, ps, other]

    stat.ME

    Permutation tests using arbitrary permutation distributions

    Authors: Aaditya Ramdas, Rina Foygel Barber, Emmanuel J. Candes, Ryan J. Tibshirani

    Abstract: Permutation tests date back nearly a century to Fisher's randomized experiments, and remain an immensely popular statistical tool, used for testing hypotheses of independence between variables and other common inferential questions. Much of the existing literature has emphasized that, for the permutation p-value to be valid, one must first pick a subgroup $G$ of permutations (which could equal the…

    Submitted 2 December, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

  48. arXiv:2204.13087  [pdf, ps, other]

    cs.LG cs.DS cs.GT stat.ME stat.ML

    Faster online calibration without randomization: interval forecasts and the power of two choices

    Authors: Chirag Gupta, Aaditya Ramdas

    Abstract: We study the problem of making calibrated probabilistic forecasts for a binary sequence generated by an adversarial nature. Following the seminal paper of Foster and Vohra (1998), nature is often modeled as an adaptive adversary who sees all activity of the forecaster except the randomization that the forecaster may deploy. A number of papers have proposed randomized forecasting strategies that ac…

    Submitted 26 July, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: 28 pages, published at Conference on Learning Theory (COLT) 2022

    Journal ref: PMLR 178:4283-4309, 2022

  49. arXiv:2204.12447  [pdf, other]

    stat.ME math.ST

    E-values as unnormalized weights in multiple testing

    Authors: Nikolaos Ignatiadis, Ruodu Wang, Aaditya Ramdas

    Abstract: We study how to combine p-values and e-values, and design multiple testing procedures where both p-values and e-values are available for every hypothesis. Our results provide a new perspective on multiple testing with data-driven weights: while standard weighted multiple testing methods require the weights to deterministically add up to the number of hypotheses being tested, we show that this norm…

    Submitted 18 July, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

  50. arXiv:2203.12572  [pdf, other]

    math.ST stat.ME

    Post-selection inference for e-value based confidence intervals

    Authors: Ziyu Xu, Ruodu Wang, Aaditya Ramdas

    Abstract: Suppose that one can construct a valid $(1-δ)$-confidence interval (CI) for each of $K$ parameters of potential interest. If a data analyst uses an arbitrary data-dependent criterion to select some subset $S$ of parameters, then the aforementioned CIs for the selected parameters are no longer valid due to selection bias. We design a new method to adjust the intervals in order to control the false…

    Submitted 27 February, 2024; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: 46 pages, 6 figures

    Journal ref: Electronic Journal of Statistics 18(1): 2292-2338 (2024)