Skip to main content

Showing 1–50 of 154 results for author: Ramdas, A

.
  1. arXiv:2405.17694  [pdf, ps, other

    cs.GT

    Bias Detection Via Signaling

    Authors: Yiling Chen, Tao Lin, Ariel D. Procaccia, Aaditya Ramdas, Itai Shapira

    Abstract: We introduce and study the problem of detecting whether an agent is updating their prior beliefs given new evidence in an optimal way that is Bayesian, or whether they are biased towards their own prior. In our model, biased agents form posterior beliefs that are a convex combination of their prior and the Bayesian posterior, where the more biased an agent is, the closer their posterior is to the… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2404.15586  [pdf, other

    stat.ME

    Multiple testing with anytime-valid Monte-Carlo p-values

    Authors: Lasse Fischer, Aaditya Ramdas

    Abstract: In contemporary problems involving genetic or neuroimaging data, thousands of hypotheses need to be tested. Due to their high power, and finite sample guarantees on type-1 error under weak assumptions, Monte-Carlo permutation tests are often considered as gold standard for these settings. However, the enormous computational effort required for (thousands of) permutation tests is a major burden. Re… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 22 pages, 2 figures

  3. arXiv:2404.03484  [pdf, other

    math.ST

    Combining exchangeable p-values

    Authors: Matteo Gasparin, Ruodu Wang, Aaditya Ramdas

    Abstract: The problem of combining p-values is an old and fundamental one, and the classic assumption of independence is often violated or unverifiable in many applications. There are many well-known rules that can combine a set of arbitrarily dependent p-values (for the same hypothesis) into a single p-value. We show that essentially all these existing rules can be strictly improved when the p-values are e… ▽ More

    Submitted 28 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 35 pages, 4 figures

  4. arXiv:2403.15527  [pdf, other

    stat.ML cs.LG

    Conformal online model aggregation

    Authors: Matteo Gasparin, Aaditya Ramdas

    Abstract: Conformal prediction equips machine learning models with a reasonable notion of uncertainty quantification without making strong distributional assumptions. It wraps around any black-box prediction model and converts point predictions into set predictions that have a predefined marginal coverage guarantee. However, conformal prediction only works if we fix the underlying machine learning model in… ▽ More

    Submitted 2 May, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 22 pages, 12 figures. arXiv admin note: text overlap with arXiv:2401.09379

  5. arXiv:2402.18810  [pdf, ps, other

    math.ST stat.ME

    The numeraire e-variable and reverse information projection

    Authors: Martin Larsson, Aaditya Ramdas, Johannes Ruf

    Abstract: We consider testing a composite null hypothesis $\mathcal{P}$ against a point alternative $\mathsf{Q}$ using e-variables, which are nonnegative random variables $X$ such that $\mathbb{E}_\mathsf{P}[X] \leq 1$ for every $\mathsf{P} \in \mathcal{P}$. This paper establishes a fundamental result: under no conditions whatsoever on $\mathcal{P}$ or $\mathsf{Q}$, there exists a special e-variable $X^*$ t… ▽ More

    Submitted 4 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  6. arXiv:2402.09698  [pdf, other

    stat.ME cs.LG math.PR math.ST stat.ML

    Combining Evidence Across Filtrations Using Adjusters

    Authors: Yo Joong Choe, Aaditya Ramdas

    Abstract: In anytime-valid sequential inference, it is known that any admissible procedure must be based on e-processes, which are composite generalizations of test martingales that quantify the accumulated evidence against a composite null hypothesis at any arbitrary stop** time. This paper studies methods for combining e-processes constructed using different information sets (filtrations) for the same n… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Substantially revised with new results in Sections 5 and 6. Code is available at https://github.com/yjchoe/CombiningEvidenceAcrossFiltrations

  7. arXiv:2402.00713  [pdf, ps, other

    math.PR math.ST

    Distribution-uniform strong laws of large numbers

    Authors: Ian Waudby-Smith, Martin Larsson, Aaditya Ramdas

    Abstract: We revisit the question of whether the strong law of large numbers (SLLN) holds uniformly in a rich family of distributions, culminating in a distribution-uniform generalization of the Marcinkiewicz-Zygmund SLLN. These results can be viewed as extensions of Chung's distribution-uniform SLLN to random variables with uniformly integrable $q^\text{th}$ absolute central moments for… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 35 pages

  8. arXiv:2401.15567  [pdf, other

    math.PR math.FA math.ST stat.ME stat.ML

    Positive Semidefinite Supermartingales and Randomized Matrix Concentration Inequalities

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: We present new concentration inequalities for either martingale dependent or exchangeable random symmetric matrices under a variety of tail conditions, encompassing now-standard Chernoff bounds to self-normalized heavy-tailed settings. These inequalities are often randomized in a way that renders them strictly tighter than existing deterministic results in the literature, are typically expressed i… ▽ More

    Submitted 26 February, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    MSC Class: 60B20; 60G48; 62L10

  9. arXiv:2401.15063  [pdf, other

    stat.ME math.ST stat.OT

    Graph fission and cross-validation

    Authors: James Leiner, Aaditya Ramdas

    Abstract: We introduce a technique called graph fission which takes in a graph which potentially contains only one observation per node (whose distribution lies in a known class) and produces two (or more) independent graphs with the same node/edge set in a way that splits the original graph's information amongst them in any desired proportion. Our proposal builds on data fission/thinning, a method that use… ▽ More

    Submitted 29 January, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 19 pages, 9 figures

  10. arXiv:2401.09379  [pdf, other

    stat.ME

    Merging uncertainty sets via majority vote

    Authors: Matteo Gasparin, Aaditya Ramdas

    Abstract: Given $K$ uncertainty sets that are arbitrarily dependent -- for example, confidence intervals for an unknown parameter obtained with $K$ different estimators, or prediction sets obtained via conformal prediction based on $K$ different algorithms on shared data -- we address the question of how to efficiently combine them in a black-box manner to produce a single uncertainty set. We present a simp… ▽ More

    Submitted 22 March, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Removed COMA (developed in separate paper), added derandomization examples. 34 pages, 8 figures, 2 tables

  11. arXiv:2401.07365  [pdf, other

    stat.ME

    Sequential Monte-Carlo testing by betting

    Authors: Lasse Fischer, Aaditya Ramdas

    Abstract: In a Monte-Carlo test, the observed dataset is fixed, and several resampled or permuted versions of the dataset are generated in order to test a null hypothesis that the original dataset is exchangeable with the resampled/permuted ones. Sequential Monte-Carlo tests aim to save computational resources by generating these additional datasets sequentially one by one, and potentially stop** early. W… ▽ More

    Submitted 18 March, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: 33 pages, 8 figures

  12. arXiv:2312.16078  [pdf, other

    cond-mat.mtrl-sci physics.data-an

    Targeted materials discovery using Bayesian algorithm execution

    Authors: Sathya Chitturi, Akash Ramdas, Yue Wu, Brian Rohr, Stefano Ermon, Jennifer Dionne, Felipe H. da Jornada, Mike Dunne, Christopher Tassone, Willie Neiswanger, Daniel Ratner

    Abstract: Rapid discovery and synthesis of new materials requires intelligent data acquisition strategies to navigate large design spaces. A popular strategy is Bayesian optimization, which aims to find candidates that maximize material properties; however, materials design often requires finding specific subsets of the design space which meet more complex or specialized goals. We present a framework that c… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 29 pages; 12 figures

  13. arXiv:2311.18274  [pdf, other

    stat.ML cs.LG stat.ME

    Semiparametric Efficient Inference in Adaptive Experiments

    Authors: Thomas Cook, Alan Mishler, Aaditya Ramdas

    Abstract: We consider the problem of efficient inference of the Average Treatment Effect in a sequential experiment where the policy governing the assignment of subjects to treatment or control can change over time. We first provide a central limit theorem for the Adaptive Augmented Inverse-Probability Weighted estimator, which is semiparametric efficient, under weaker assumptions than those previously made… ▽ More

    Submitted 4 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 24 pages, 6 figures. To appear at CLeaR 2024

  14. arXiv:2311.08168  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Time-Uniform Confidence Spheres for Means of Random Vectors

    Authors: Ben Chugg, Hongjian Wang, Aaditya Ramdas

    Abstract: We derive and study time-uniform confidence spheres -- confidence sphere sequences (CSSs) -- which contain the mean of random vectors with high probability simultaneously across all sample sizes. Inspired by the original work of Catoni and Giulini, we unify and extend their analysis to cover both the sequential setting and to handle a variety of distributional assumptions. Our results include an e… ▽ More

    Submitted 28 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 46 pages, 1 figure

  15. arXiv:2311.06412  [pdf, other

    stat.ME stat.ML

    Online multiple testing with e-values

    Authors: Ziyu Xu, Aaditya Ramdas

    Abstract: A scientist tests a continuous stream of hypotheses over time in the course of her investigation -- she does not test a predetermined, fixed number of hypotheses. The scientist wishes to make as many discoveries as possible while ensuring the number of false discoveries is controlled -- a well recognized way for accomplishing this is to control the false discovery rate (FDR). Prior methods for FDR… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 26 pages, 4 figures

  16. arXiv:2311.03343  [pdf, other

    math.ST stat.ME

    Distribution-uniform anytime-valid sequential inference

    Authors: Ian Waudby-Smith, Edward H. Kennedy, Aaditya Ramdas

    Abstract: Are asymptotic confidence sequences and anytime $p$-values uniformly valid for a nontrivial class of distributions $\mathcal{P}$? We give a positive answer to this question by deriving distribution-uniform anytime-valid inference procedures. Historically, anytime-valid methods -- including confidence sequences, anytime $p$-values, and sequential hypothesis tests that enable inference at stop** t… ▽ More

    Submitted 18 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  17. arXiv:2310.19384  [pdf, other

    stat.ML cs.LG

    Deep anytime-valid hypothesis testing

    Authors: Teodora Pandeva, Patrick Forré, Aaditya Ramdas, Shubhanshu Shekhar

    Abstract: We propose a general framework for constructing powerful, sequential hypothesis tests for a large class of nonparametric testing problems. The null hypothesis for these problems is defined in an abstract form using the action of two known operators on the data distribution. This abstraction allows for a unified treatment of several classical tasks, such as two-sample testing, independence testing,… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  18. arXiv:2310.16626  [pdf, other

    stat.ME stat.AP

    Scalable Causal Structure Learning via Amortized Conditional Independence Testing

    Authors: James Leiner, Brian Manzo, Aaditya Ramdas, Wesley Tansey

    Abstract: Controlling false positives (Type I errors) through statistical hypothesis testing is a foundation of modern scientific data analysis. Existing causal structure discovery algorithms either do not provide Type I error control or cannot scale to the size of modern scientific datasets. We consider a variant of the causal discovery problem with two sets of nodes, where the only edges of interest form… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 10 figures, 24 pages

  19. arXiv:2310.14293  [pdf, other

    stat.ME

    Testing exchangeability by pairwise betting

    Authors: Aytijhya Saha, Aaditya Ramdas

    Abstract: In this paper, we address the problem of testing exchangeability of a sequence of random variables, $X_1, X_2,\cdots$. This problem has been studied under the recently popular framework of testing by betting. But the map** of testing problems to game is not one to one: many games can be designed for the same test. Past work established that it is futile to play single game betting on every obser… ▽ More

    Submitted 30 December, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

  20. arXiv:2310.09100  [pdf, other

    math.PR math.ST stat.ME

    Time-Uniform Self-Normalized Concentration for Vector-Valued Processes

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: Self-normalized processes arise naturally in many statistical tasks. While self-normalized concentration has been extensively studied for scalar-valued processes, there is less work on multidimensional processes outside of the sub-Gaussian setting. In this work, we construct a general, self-normalized inequality for $\mathbb{R}^d$-valued processes that satisfy a simple yet broad "sub-$ψ$" tail con… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 50 pages, 3 figures

  21. arXiv:2310.03722  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Anytime-valid t-tests and confidence sequences for Gaussian means with unknown variance

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: In 1976, Lai constructed a nontrivial confidence sequence for the mean $μ$ of a Gaussian distribution with unknown variance $σ^2$. Curiously, he employed both an improper (right Haar) mixture over $σ$ and an improper (flat) mixture over $μ$. Here, we elaborate carefully on the details of his construction, which use generalized nonintegrable martingales and an extended Ville's inequality. While thi… ▽ More

    Submitted 14 May, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Substantive revision in v3 (Apr 23 2024)

  22. arXiv:2310.01547  [pdf, other

    math.ST cs.IT cs.LG stat.AP stat.ML

    On the near-optimality of betting confidence sets for bounded means

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: Constructing nonasymptotic confidence intervals (CIs) for the mean of a univariate distribution from independent and identically distributed (i.i.d.) observations is a fundamental task in statistics. For bounded observations, a classical nonparametric approach proceeds by inverting standard concentration bounds, such as Hoeffding's or Bernstein's inequalities. Recently, an alternative betting-base… ▽ More

    Submitted 24 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 53 pages, 2 figures

  23. arXiv:2309.09111  [pdf, ps, other

    math.ST cs.LG stat.ME stat.ML

    Reducing sequential change detection to sequential estimation

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We consider the problem of sequential change detection, where the goal is to design a scheme for detecting any changes in a parameter or functional $θ$ of the data stream distribution that has small detection delay, but guarantees control on the frequency of false alarms in the absence of changes. In this paper, we describe a simple reduction from sequential change detection to sequential estimati… ▽ More

    Submitted 24 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 11 pages

  24. arXiv:2309.04002  [pdf, other

    stat.ME

    Total Variation Floodgate for Variable Importance Inference in Classification

    Authors: Wenshuo Wang, Lucas Janson, Lihua Lei, Aaditya Ramdas

    Abstract: Inferring variable importance is the key problem of many scientific studies, where researchers seek to learn the effect of a feature $X$ on the outcome $Y$ in the presence of confounding variables $Z$. Focusing on classification problems, we define the expected total variation (ETV), which is an intuitive and deterministic measure of variable importance that does not rely on any model context. We… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  25. arXiv:2307.07539  [pdf, ps, other

    cs.LG math.ST stat.ML

    On the Sublinear Regret of GP-UCB

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: In the kernelized bandit problem, a learner aims to sequentially compute the optimum of a function lying in a reproducing kernel Hilbert space given only noisy evaluations at sequentially chosen points. In particular, the learner aims to minimize regret, which is a measure of the suboptimality of the choices made. Arguably the most popular algorithm is the Gaussian Process Upper Confidence Bound (… ▽ More

    Submitted 14 August, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 20 pages, 0 figures

  26. arXiv:2306.13824  [pdf, other

    cs.CR cs.DS cs.LG

    Adaptive Privacy Composition for Accuracy-first Mechanisms

    Authors: Ryan Rogers, Gennady Samorodnitsky, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: In many practical applications of differential privacy, practitioners seek to provide the best privacy guarantees subject to a target level of accuracy. A recent line of work by Ligett et al. '17 and Whitehouse et al. '22 has developed such accuracy-first mechanisms by leveraging the idea of noise reduction that adds correlated noise to the sufficient statistic in a private computation and produce… ▽ More

    Submitted 5 December, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  27. arXiv:2306.06721  [pdf, other

    stat.ML cs.CR cs.LG

    Differentially Private Conditional Independence Testing

    Authors: Iden Kalemaj, Shiva Prasad Kasiviswanathan, Aaditya Ramdas

    Abstract: Conditional independence (CI) tests are widely used in statistical data analysis, e.g., they are the building block of many algorithms for causal graph discovery. The goal of a CI test is to accept or reject the null hypothesis that $X \perp \!\!\! \perp Y \mid Z$, where $X \in \mathbb{R}, Y \in \mathbb{R}, Z \in \mathbb{R}^d$. In this work, we investigate conditional independence testing under th… ▽ More

    Submitted 22 March, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

  28. arXiv:2305.17570  [pdf, other

    stat.ML cs.AI cs.CY cs.LG stat.AP stat.ME

    Auditing Fairness by Betting

    Authors: Ben Chugg, Santiago Cortes-Gomez, Bryan Wilder, Aaditya Ramdas

    Abstract: We provide practical, efficient, and nonparametric methods for auditing the fairness of deployed classification and regression models. Whereas previous work relies on a fixed-sample size, our methods are sequential and allow for the continuous monitoring of incoming data, making them highly amenable to tracking the fairness of real-world systems. We also allow the data to be collected by a probabi… ▽ More

    Submitted 29 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023. 29 pages, 5 figures

  29. arXiv:2305.16539  [pdf, other

    math.ST cs.IT math.PR stat.ME

    On the existence of powerful p-values and e-values for composite hypotheses

    Authors: Zhenyuan Zhang, Aaditya Ramdas, Ruodu Wang

    Abstract: Given a composite null $ \mathcal P$ and composite alternative $ \mathcal Q$, when and how can we construct a p-value whose distribution is exactly uniform under the null, and stochastically smaller than uniform under the alternative? Similarly, when and how can we construct an e-value whose expectation exactly equals one under the null, but its expected logarithm under the alternative is positive… ▽ More

    Submitted 23 May, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 47 pages, 6 figures

  30. arXiv:2305.11126  [pdf, other

    stat.ME

    More powerful multiple testing under dependence via randomization

    Authors: Ziyu Xu, Aaditya Ramdas

    Abstract: We show that two procedures for false discovery rate (FDR) control -- the Benjamini-Yekutieli procedure for dependent p-values, and the e-Benjamini-Hochberg procedure for dependent e-values -- can both be made more powerful by a simple randomization involving one independent uniform random variable. As a corollary, the Hommel test under arbitrary dependence is also improved. Importantly, our rando… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 45 pages, 9 figures

  31. arXiv:2305.10564  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Counterfactually Comparing Abstaining Classifiers

    Authors: Yo Joong Choe, Aditya Gangrade, Aaditya Ramdas

    Abstract: Abstaining classifiers have the option to abstain from making predictions on inputs that they are unsure about. These classifiers are becoming increasingly popular in high-stakes decision-making problems, as they can withhold uncertain predictions to improve their reliability and safety. When evaluating black-box abstaining classifier(s), however, we lack a principled approach that accounts for wh… ▽ More

    Submitted 9 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023. Preliminary work presented at the ICML 2023 Workshop on Counterfactuals in Minds and Machines. Code available at https://github.com/yjchoe/ComparingAbstainingClassifiers

  32. arXiv:2305.06884  [pdf, ps, other

    stat.ME cs.AI cs.LG math.ST stat.AP stat.ML

    Risk-limiting Financial Audits via Weighted Sampling without Replacement

    Authors: Shubhanshu Shekhar, Ziyu Xu, Zachary C. Lipton, Pierre J. Liang, Aaditya Ramdas

    Abstract: We introduce the notion of a risk-limiting financial auditing (RLFA): given $N$ transactions, the goal is to estimate the total misstated monetary fraction~($m^*$) to a given accuracy $ε$, with confidence $1-δ$. We do this by constructing new confidence sequences (CSs) for the weighted average of $N$ unknown values, based on samples drawn without replacement according to a (randomized) weighted sa… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 23 pages, 8 figures, to appear in the Proceedings of Uncertainty in Artificial Intelligence (UAI) 2023

  33. arXiv:2305.00143  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Sequential Predictive Two-Sample and Independence Testing

    Authors: Aleksandr Podkopaev, Aaditya Ramdas

    Abstract: We study the problems of sequential nonparametric two-sample and independence testing. Sequential tests process data online and allow using observed data to decide whether to stop and reject the null hypothesis or to collect more data, while maintaining type I error control. We build upon the principle of (nonparametric) testing by betting, where a gambler places bets on future observations and th… ▽ More

    Submitted 19 July, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

  34. arXiv:2305.00070  [pdf, other

    cs.LG cs.AI math.ST stat.ME stat.ML

    Online Platt Scaling with Calibeating

    Authors: Chirag Gupta, Aaditya Ramdas

    Abstract: We present an online post-hoc calibration method, called Online Platt Scaling (OPS), which combines the Platt scaling technique with online logistic regression. We demonstrate that OPS smoothly adapts between i.i.d. and non-i.i.d. settings with distribution drift. Further, in scenarios where the best Platt scaling model is itself miscalibrated, we enhance OPS by incorporating a recently developed… ▽ More

    Submitted 16 August, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: ICML 2023; 24 pages and 16 figures

  35. arXiv:2304.13237  [pdf, other

    stat.ME stat.ML

    An Efficient Doubly-Robust Test for the Kernel Treatment Effect

    Authors: Diego Martinez-Taboada, Aaditya Ramdas, Edward H. Kennedy

    Abstract: The average treatment effect, which is the difference in expectation of the counterfactuals, is probably the most popular target effect in causal inference with binary treatments. However, treatments may have effects beyond the mean, for instance decreasing or increasing the variance. We propose a new kernel-based test for distributional effects of the treatment. It is, to the best of our knowledg… ▽ More

    Submitted 31 October, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  36. arXiv:2304.03927  [pdf, ps, other

    math.ST math.PR

    De Finetti's theorem and related results for infinite weighted exchangeable sequences

    Authors: Rina Foygel Barber, Emmanuel J. Candes, Aaditya Ramdas, Ryan J. Tibshirani

    Abstract: De Finetti's theorem, also called the de Finetti-Hewitt-Savage theorem, is a foundational result in probability and statistics. Roughly, it says that an infinite sequence of exchangeable random variables can always be written as a mixture of independent and identically distributed (i.i.d.) sequences of random variables. In this paper, we consider a weighted generalization of exchangeability that a… ▽ More

    Submitted 27 November, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

  37. arXiv:2304.02611  [pdf, other

    math.ST cs.IT math.PR stat.ME

    Randomized and Exchangeable Improvements of Markov's, Chebyshev's and Chernoff's Inequalities

    Authors: Aaditya Ramdas, Tudor Manole

    Abstract: We present simple randomized and exchangeable improvements of Markov's inequality, as well as Chebyshev's inequality and Chernoff bounds. Our variants are never worse and typically strictly more powerful than the original inequalities. The proofs are short and elementary, and can easily yield similarly randomized or exchangeable versions of a host of other inequalities that employ Markov's inequal… ▽ More

    Submitted 9 May, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  38. arXiv:2304.01163  [pdf, ps, other

    math.PR math.ST stat.ME stat.ML

    The extended Ville's inequality for nonintegrable nonnegative supermartingales

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: Following the initial work by Robbins, we rigorously present an extended theory of nonnegative supermartingales, requiring neither integrability nor finiteness. In particular, we derive a key maximal inequality foreshadowed by Robbins, which we call the extended Ville's inequality, that strengthens the classical Ville's inequality (for integrable nonnegative supermartingales), and also applies to… ▽ More

    Submitted 15 April, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

  39. arXiv:2303.16350  [pdf, other

    cond-mat.mtrl-sci

    Electrolyte Coatings for High Adhesion Interfaces in Solid-state Batteries from First Principles

    Authors: Brandi Ransom, Akash Ramdas, Eder Lomeli, Jad Fidawi, Austin Sendek, Thomas Devereaux, Evan Reed, Peter Schindler

    Abstract: We introduce an adhesion parameter that enables rapid screening for materials interfaces with high adhesion. This parameter is obtained by density functional theory calculations of individual single-material slabs rather than slabs consisting of combinations of two materials, eliminating the need to calculate all configurations of a prohibitively vast space of possible interface configurations. Cl… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  40. Anytime-Valid Confidence Sequences in an Enterprise A/B Testing Platform

    Authors: Akash V. Maharaj, Ritwik Sinha, David Arbour, Ian Waudby-Smith, Simon Z. Liu, Moumita Sinha, Raghavendra Addanki, Aaditya Ramdas, Manas Garg, Viswanathan Swaminathan

    Abstract: A/B tests are the gold standard for evaluating digital experiences on the web. However, traditional "fixed-horizon" statistical methods are often incompatible with the needs of modern industry practitioners as they do not permit continuous monitoring of experiments. Frequent evaluation of fixed-horizon tests ("peeking") leads to inflated type-I error and can result in erroneous conclusions. We hav… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 15 pages, 12 figures. Expanded version of ACM Web Conference Proceedings paper

    ACM Class: G.3

    Journal ref: Companion Proceedings of the ACM Web Conference 2023 (WWW '23 Companion)

  41. arXiv:2302.03421  [pdf, ps, other

    stat.ML cs.IT cs.LG math.ST

    A unified recipe for deriving (time-uniform) PAC-Bayes bounds

    Authors: Ben Chugg, Hongjian Wang, Aaditya Ramdas

    Abstract: We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike most previous literature on this topic, our bounds are anytime-valid (i.e., time-uniform), meaning that they hold at all stop** times, not only for a fixed sample size. Our approach combines four tools in the following order: (a) nonnegative supermartingales or reverse submartingales, (b) the method of mixture… ▽ More

    Submitted 3 January, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 56 pages. Published in the Journal of Machine Learning Research, Volume 24 Issue 372

  42. arXiv:2302.02544  [pdf, other

    math.ST cs.IT cs.LG stat.ME stat.ML

    Sequential change detection via backward confidence sequences

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We present a simple reduction from sequential estimation to sequential changepoint detection (SCD). In short, suppose we are interested in detecting changepoints in some parameter or functional $θ$ of the underlying distribution. We demonstrate that if we can construct a confidence sequence (CS) for $θ$, then we can also successfully perform SCD for $θ$. This is accomplished by checking if two CSs… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: 24 pages, 10 figures

  43. arXiv:2301.09573  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Huber-Robust Confidence Sequences

    Authors: Hongjian Wang, Aaditya Ramdas

    Abstract: Confidence sequences are confidence intervals that can be sequentially tracked, and are valid at arbitrary data-dependent stop** times. This paper presents confidence sequences for a univariate mean of an unknown distribution with a known upper bound on the $p$-th central moment ($p$ > 1), but allowing for (at most) $ε$ fraction of arbitrary distribution corruption, as in Huber's contamination m… ▽ More

    Submitted 7 February, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted for publication at the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  44. arXiv:2301.03542  [pdf, ps, other

    math.ST stat.ME

    A Sequential Test for Log-Concavity

    Authors: Aditya Gangrade, Alessandro Rinaldo, Aaditya Ramdas

    Abstract: On observing a sequence of i.i.d.\ data with distribution $P$ on $\mathbb{R}^d$, we ask the question of how one can test the null hypothesis that $P$ has a log-concave density. This paper proves one interesting negative and positive result: the non-existence of test (super)martingales, and the consistency of universal inference. To elaborate, the set of log-concave distributions $\mathcal{L}$ is a… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

  45. arXiv:2212.09706  [pdf, ps, other

    math.ST math.PR stat.ME

    Multiple testing under negative dependence

    Authors: Ziyu Chi, Aaditya Ramdas, Ruodu Wang

    Abstract: The multiple testing literature has primarily dealt with three types of dependence assumptions between p-values: independence, positive regression dependence, and arbitrary dependence. In this paper, we provide what we believe are the first theoretical results under various notions of negative dependence (negative Gaussian dependence, negative regression dependence, negative association, negative… ▽ More

    Submitted 8 May, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 28 pages, 5 figures

  46. arXiv:2212.09108  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-Free Kernel Independence Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: In nonparametric independence testing, we observe i.i.d.\ data $\{(X_i,Y_i)\}_{i=1}^n$, where $X \in \mathcal{X}, Y \in \mathcal{Y}$ lie in any general spaces, and we wish to test the null that $X$ is independent of $Y$. Modern test statistics such as the kernel Hilbert-Schmidt Independence Criterion (HSIC) and Distance Covariance (dCov) have intractable null distributions due to the degeneracy of… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: 52 pages, 4 figures

  47. arXiv:2212.07383  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Sequential Kernelized Independence Testing

    Authors: Aleksandr Podkopaev, Patrick Blöbaum, Shiva Prasad Kasiviswanathan, Aaditya Ramdas

    Abstract: Independence testing is a classical statistical problem that has been extensively studied in the batch setting when one fixes the sample size before collecting data. However, practitioners often prefer procedures that adapt to the complexity of a problem at hand instead of setting sample size in advance. Ideally, such procedures should (a) stop earlier on easy tasks (and later on harder tasks), he… ▽ More

    Submitted 19 July, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: To appear at ICML 2023

  48. arXiv:2211.14908  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-free Kernel Two-Sample Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: The kernel Maximum Mean Discrepancy~(MMD) is a popular multivariate distance metric between distributions that has found utility in two-sample testing. The usual kernel-MMD test statistic is a degenerate U-statistic under the null, and thus it has an intractable limiting distribution. Hence, to design a level-$α$ test, one usually selects the rejection threshold as the $(1-α)$-quantile of the perm… ▽ More

    Submitted 4 February, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Published at the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), with an oral presentation

  49. arXiv:2210.10768  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Anytime-valid off-policy inference for contextual bandits

    Authors: Ian Waudby-Smith, Lili Wu, Aaditya Ramdas, Nikos Karampatziakis, Paul Mineiro

    Abstract: Contextual bandit algorithms are ubiquitous tools for active sequential experimentation in healthcare and the tech industry. They involve online learning algorithms that adaptively learn policies over time to map observed contexts $X_t$ to actions $A_t$ in an attempt to maximize stochastic rewards $R_t$. This adaptivity raises interesting but hard statistical inference questions, especially counte… ▽ More

    Submitted 17 November, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 40 pages, 6 figures

  50. arXiv:2210.04334  [pdf, other

    stat.ME cs.LG eess.SP

    QuTE: decentralized multiple testing on sensor networks with false discovery rate control

    Authors: Aaditya Ramdas, Jianbo Chen, Martin J. Wainwright, Michael I. Jordan

    Abstract: This paper designs methods for decentralized multiple hypothesis testing on graphs that are equipped with provable guarantees on the false discovery rate (FDR). We consider the setting where distinct agents reside on the nodes of an undirected graph, and each agent possesses p-values corresponding to one or more hypotheses local to its node. Each agent must individually decide whether to reject on… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: This paper appeared in the IEEE CDC'17 conference proceedings. The last two sections were then developed in 2018, and it is now being put on arXiv simply for easier access