Showing 1–50 of 268 results for author: Jordan, M I

  1. arXiv:2404.18490  [pdf, other]

    cs.LG stat.ML

    Reduced-Rank Multi-objective Policy Learning and Optimization

    Authors: Ezinne Nwankwo, Michael I. Jordan, Angela Zhou

    Abstract: Evaluating the causal impacts of possible interventions is crucial for informing decision-making, especially towards improving access to opportunity. However, if causal effects are heterogeneous and predictable from covariates, personalized treatment decisions can improve individual outcomes and contribute to both efficiency and equity. In practice, however, causal researchers do not have a single…

    Submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2404.15746  [pdf, other]

    stat.ML cs.CR cs.LG

    Collaborative Heterogeneous Causal Inference Beyond Meta-analysis

    Authors: Tianyu Guo, Sai Praneeth Karimireddy, Michael I. Jordan

    Abstract: Collaboration between different data centers is often challenged by heterogeneity across sites. To account for the heterogeneity, the state-of-the-art method is to re-weight the covariate distributions in each site to match the distribution of the target population. Nevertheless, this method could easily fail when a certain site couldn't cover the entire population. Moreover, it still relies on th…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: submitted to ICML

  3. arXiv:2403.19605  [pdf, other]

    stat.ME cs.LG

    Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction

    Authors: Drew T. Nguyen, Reese Pathak, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan

    Abstract: Decision-making pipelines are generally characterized by tradeoffs among various risk functions. It is often desirable to manage such tradeoffs in a data-adaptive manner. As we demonstrate, if this is done naively, state-of-the-art uncertainty quantification methods can lead to significant violations of putative risk guarantees. To address this issue, we develop methods that permit valid control…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 27 pages, 10 figures

  4. arXiv:2403.07008  [pdf, other]

    cs.LG cs.AI cs.CL stat.ME

    AutoEval Done Right: Using Synthetic Data for Model Evaluation

    Authors: Pierre Boyeau, Anastasios N. Angelopoulos, Nir Yosef, Jitendra Malik, Michael I. Jordan

    Abstract: The evaluation of machine learning models using human-labeled validation data can be expensive and time-consuming. AI-labeled synthetic data can be used to decrease the number of human annotations required for this purpose in a process called autoevaluation. We suggest efficient and statistically principled algorithms for this purpose that improve sample efficiency while remaining unbiased. These…

    Submitted 28 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: New experiments, fix fig 1

  5. arXiv:2403.03811  [pdf, other]

    stat.ML cs.GT cs.LG

    Incentivized Learning in Principal-Agent Bandit Games

    Authors: Antoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, El Mahdi El Mhamdi, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent. The principal and the agent have misaligned objectives and the choice of action is only left to the agent. However, the principal can influence the agent's decisions by offering incentives which add up to his rewards. The principal aims to iteratively learn an i…

    Submitted 6 March, 2024; originally announced March 2024.

  6. arXiv:2401.16335  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

    Authors: Banghua Zhu, Michael I. Jordan, Jiantao Jiao

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is a pivotal technique that aligns language models closely with human-centric values. The initial phase of RLHF involves learning human values using a reward model from ranking data. It is observed that the performance of the reward model degrades after one epoch of training, and optimizing too much against the learned reward model eventually hinde…

    Submitted 29 January, 2024; originally announced January 2024.

  7. arXiv:2312.07930  [pdf, other]

    cs.LG cs.CL cs.CR cs.IT stat.ML

    Towards Optimal Statistical Watermarking

    Authors: Baihe Huang, Hanlin Zhu, Banghua Zhu, Kannan Ramchandran, Michael I. Jordan, Jason D. Lee, Jiantao Jiao

    Abstract: We study statistical watermarking by formulating it as a hypothesis testing problem, a general framework which subsumes all previous statistical watermarking methods. Key to our formulation is a coupling of the output tokens and the rejection region, realized by pseudo-random generators in practice, that allows non-trivial trade-offs between the Type I error and Type II error. We characterize the…

    Submitted 6 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  8. arXiv:2310.05921  [pdf, other]

    stat.ML cs.LG cs.RO stat.ME

    Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions

    Authors: Jordan Lekeufack, Anastasios N. Angelopoulos, Andrea Bajcsy, Michael I. Jordan, Jitendra Malik

    Abstract: We introduce Conformal Decision Theory, a framework for producing safe autonomous decisions despite imperfect machine learning predictions. Examples of such decisions are ubiquitous, from robot planning algorithms that rely on pedestrian predictions, to calibrating autonomous manufacturing to exhibit high throughput and low error, to the choice of trusting a nominal policy versus switching to a sa…

    Submitted 2 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures

  9. arXiv:2309.04877  [pdf, other]

    cs.LG stat.ML

    A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning

    Authors: Neha S. Wadia, Yatin Dandi, Michael I. Jordan

    Abstract: The rapid progress in machine learning in recent years has been based on a highly productive connection to gradient-based optimization. Further progress hinges in part on a shift in focus from pattern recognition to decision-making and multi-agent problems. In these broader settings, new mathematical challenges emerge that involve equilibria and game theory instead of optima. Gradient-based method…

    Submitted 26 February, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: 36 pages, 7 figures; minor corrections

  10. arXiv:2309.01837  [pdf, other]

    cs.LG stat.ML

    Delegating Data Collection in Decentralized Machine Learning

    Authors: Nivasini Ananthakrishnan, Stephen Bates, Michael I. Jordan, Nika Haghtalab

    Abstract: Motivated by the emergence of decentralized machine learning (ML) ecosystems, we study the delegation of data collection. Taking the field of contract theory as our starting point, we design optimal and near-optimal contracts that deal with two fundamental information asymmetries that arise in decentralized ML: uncertainty in the assessment of model quality and uncertainty regarding the optimal pe…

    Submitted 2 May, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

  11. arXiv:2307.13381  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Scaff-PD: Communication Efficient Fair and Robust Federated Learning

    Authors: Yaodong Yu, Sai Praneeth Karimireddy, Yi Ma, Michael I. Jordan

    Abstract: We present Scaff-PD, a fast and communication-efficient algorithm for distributionally robust federated learning. Our approach improves fairness by optimizing a family of distributionally robust objectives tailored to heterogeneous clients. We leverage the special structure of these objectives, and design an accelerated primal dual (APD) algorithm which uses bias corrected local steps (as in Scaff…

    Submitted 25 July, 2023; originally announced July 2023.

    MSC Class: 68W40; 68W15; 90C25; 90C06 ACM Class: G.1.6; F.2.1; E.4

  12. arXiv:2307.03748  [pdf, other]

    stat.ME cs.GT cs.LG stat.ML

    Incentive-Theoretic Bayesian Inference for Collaborative Science

    Authors: Stephen Bates, Michael I. Jordan, Michael Sklar, Jake A. Soloff

    Abstract: Contemporary scientific research is a distributed, collaborative endeavor, carried out by teams of researchers, regulatory institutions, funding agencies, commercial partners, and scientific bodies, all interacting with each other and facing different incentives. To maintain scientific rigor, statistical methods should acknowledge this state of affairs. To this end, we study hypothesis testing whe…

    Submitted 8 February, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  13. arXiv:2307.00126  [pdf, other]

    math.OC cs.LG stat.ML

    Accelerating Inexact HyperGradient Descent for Bilevel Optimization

    Authors: Haikuo Yang, Luo Luo, Chris Junchi Li, Michael I. Jordan

    Abstract: We present a method for solving general nonconvex-strongly-convex bilevel optimization problems. Our method -- the \emph{Restarted Accelerated HyperGradient Descent} (\texttt{RAHGD}) method -- finds an $ε$-first-order stationary point of the objective with $\tilde{\mathcal{O}}(κ^{3.25}ε^{-1.75})$ oracle complexity, where $κ$ is the condition number of the lower-level objective and $ε$ is the desir…

    Submitted 30 June, 2023; originally announced July 2023.

  14. arXiv:2306.14670  [pdf, other]

    cs.GT cs.CY cs.LG stat.ML

    Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition

    Authors: Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt, Nika Haghtalab

    Abstract: As the scale of machine learning models increases, trends such as scaling laws anticipate consistent downstream improvements in predictive accuracy. However, these trends take the perspective of a single model-provider in isolation, while in reality providers often compete with each other for users. In this work, we demonstrate that competition can fundamentally alter the behavior of these scaling…

    Submitted 6 February, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Appeared at NeurIPS 2023; this is the full version

  15. arXiv:2306.09335  [pdf, other]

    stat.ML cs.CV cs.LG stat.ME

    Class-Conditional Conformal Prediction with Many Classes

    Authors: Tiffany Ding, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: Standard conformal prediction methods provide a marginal coverage guarantee, which means that for a random test point, the conformal prediction set contains the true label with a user-specified probability. In many classification problems, we would like to obtain a stronger guarantee--that for test points of a specific class, the prediction set contains the true label with the same user-chosen pro…

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  16. arXiv:2306.07479  [pdf, ps, other]

    cs.GT cs.IR cs.LG stat.ML

    Incentivizing High-Quality Content in Online Recommender Systems

    Authors: Xinyan Hu, Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt

    Abstract: In content recommender systems such as TikTok and YouTube, the platform's recommendation algorithm shapes content producer incentives. Many platforms employ online learning, which generates intertemporal incentives, since content produced today affects recommendations of future content. We study the game between producers and analyze the content created at equilibrium. We show that standard online…

    Submitted 21 June, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: Updated version with revised and expanded content

  17. arXiv:2306.02003  [pdf, other]

    cs.LG cs.AI cs.PF eess.SY stat.ML

    On Optimal Caching and Model Multiplexing for Large Model Inference

    Authors: Banghua Zhu, Ying Sheng, Lianmin Zheng, Clark Barrett, Michael I. Jordan, Jiantao Jiao

    Abstract: Large Language Models (LLMs) and other large foundation models have achieved noteworthy success, but their size exacerbates existing resource consumption and latency challenges. In particular, the large-scale deployment of these models is hindered by the significant resource requirements during inference. In this paper, we study two approaches for mitigating these challenges: employing a cache to…

    Submitted 28 August, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  18. arXiv:2303.06317  [pdf, ps, other]

    stat.ME

    Evaluating Sensitivity to the Stick-Breaking Prior in Bayesian Nonparametrics (Rejoinder)

    Authors: Ryan Giordano, Runjing Liu, Michael I. Jordan, Tamara Broderick

    Abstract: One can typically form a local robustness metric for a particular problem quite directly, for Markov chain Monte Carlo applications as well as optimization problems such as variational Bayes. However, we argue that simply forming a local robustness metric is not enough: the hard work is showing that it is useful. Computability, interpretability, and the ability of a local robustness metric to extr…

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: Rejoinder for the discussion article "Evaluating Sensitivity to the Stick-Breaking Prior in Bayesian Nonparametrics" in Bayesian Analysis

  19. arXiv:2302.00316  [pdf, other]

    math.OC cs.LG eess.SP stat.ML

    Accelerated First-Order Optimization under Nonlinear Constraints

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We exploit analogies between first-order algorithms for constrained optimization and non-smooth dynamical systems to design a new class of accelerated first-order algorithms for constrained optimization. Unlike Frank-Wolfe or projected gradients, these algorithms avoid optimization over the entire feasible set at each iteration. We prove convergence to stationary points even in a nonconvex setting…

    Submitted 2 January, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 44 pages, 6 figures

  20. arXiv:2301.11270  [pdf, other]

    cs.LG cs.AI cs.HC math.ST stat.ML

    Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons

    Authors: Banghua Zhu, Jiantao Jiao, Michael I. Jordan

    Abstract: We provide a theoretical framework for Reinforcement Learning with Human Feedback (RLHF). Our analysis shows that when the true reward function is linear, the widely used maximum likelihood estimator (MLE) converges under both the Bradley-Terry-Luce (BTL) model and the Plackett-Luce (PL) model. However, we show that when training a policy based on the learned reward model, MLE fails while a pessim…

    Submitted 7 February, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  21. arXiv:2301.09633  [pdf, other]

    stat.ML cs.AI cs.LG q-bio.QM stat.ME

    Prediction-Powered Inference

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Clara Fannjiang, Michael I. Jordan, Tijana Zrnic

    Abstract: Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system. The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients, without making any assumptions on the ma…

    Submitted 9 November, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Code is available at https://github.com/aangelopoulos/ppi_py

  22. arXiv:2211.15381  [pdf, other]

    cs.IR cs.LG stat.ML

    Incentive-Aware Recommender Systems in Two-Sided Markets

    Authors: Xiaowu Dai, Wenlu Xu, Yuan Qi, Michael I. Jordan

    Abstract: Online platforms in the Internet Economy commonly incorporate recommender systems that recommend products (or "arms") to users (or "agents"). A key challenge in this domain arises from myopic agents who are naturally incentivized to exploit by choosing the optimal arm based on current information, rather than exploring various alternatives to gather information that benefits the collective. We pro…

    Submitted 18 June, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

  23. arXiv:2210.17550  [pdf, other]

    math.OC cs.GT cs.LG stat.ML

    Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization

    Authors: Chris Junchi Li, Angela Yuan, Gauthier Gidel, Quanquan Gu, Michael I. Jordan

    Abstract: We propose a new first-order optimization algorithm -- AcceleratedGradient-OptimisticGradient (AG-OG) Descent Ascent -- for separable convex-concave minimax optimization. The main idea of our algorithm is to carefully leverage the structure of the minimax problem, performing Nesterov acceleration on the individual component and optimistic gradient on the coupling component. Equipped with proper re…

    Submitted 14 August, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 44 pages. This version matches the camera-ready that appeared at ICML 2023 under the same title

  24. arXiv:2210.15659  [pdf, other]

    stat.ML cs.LG

    A Primal-dual Approach for Solving Variational Inequalities with General-form Constraints

    Authors: Tatjana Chavdarova, Matteo Pagliardini, Tong Yang, Michael I. Jordan

    Abstract: Yang et al. (2023) recently addressed the open problem of solving Variational Inequalities (VIs) with equality and inequality constraints through a first-order gradient method. However, the proposed primal-dual method called ACVI is applicable when we can compute analytic solutions of its subproblems; thus, the general case remains an open problem. In this paper, we adopt a warm-starting technique…

    Submitted 29 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2206.10575

  25. arXiv:2210.10278  [pdf, other]

    cs.LG cs.GT stat.ML

    A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

    Authors: Rui Ai, Boxiang Lyu, Zhaoran Wang, Zhuoran Yang, Michael I. Jordan

    Abstract: We study reserve price optimization in multi-phase second price auctions, where the seller's prior actions affect the bidders' later valuations through a Markov Decision Process (MDP). Compared to the bandit setting in existing works, the setting in ours involves three challenges. First, from the seller's perspective, we need to efficiently explore the environment in the presence of potentially nontru…

    Submitted 18 October, 2022; originally announced October 2022.

  26. arXiv:2210.04334  [pdf, other]

    stat.ME cs.LG eess.SP

    QuTE: decentralized multiple testing on sensor networks with false discovery rate control

    Authors: Aaditya Ramdas, Jianbo Chen, Martin J. Wainwright, Michael I. Jordan

    Abstract: This paper designs methods for decentralized multiple hypothesis testing on graphs that are equipped with provable guarantees on the false discovery rate (FDR). We consider the setting where distinct agents reside on the nodes of an undirected graph, and each agent possesses p-values corresponding to one or more hypotheses local to its node. Each agent must individually decide whether to reject on…

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: This paper appeared in the IEEE CDC'17 conference proceedings. The last two sections were then developed in 2018, and it is now being put on arXiv simply for easier access

  27. arXiv:2209.15634  [pdf, other]

    cs.LG cs.AI stat.ML

    A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

    Authors: Zixiang Chen, Chris Junchi Li, Angela Yuan, Quanquan Gu, Michael I. Jordan

    Abstract: With the increasing need for handling large state and action spaces, general function approximation has become a key technique in reinforcement learning (RL). In this paper, we propose a general framework that unifies model-based and model-free RL, and an Admissible Bellman Characterization (ABC) class that subsumes nearly all Markov Decision Process (MDP) models in the literature for tractable RL…

    Submitted 30 September, 2022; originally announced September 2022.

  28. arXiv:2208.13701  [pdf, other]

    stat.ME cs.LG math.OC stat.ML

    Data-Driven Influence Functions for Optimization-Based Causal Inference

    Authors: Michael I. Jordan, Yixin Wang, Angela Zhou

    Abstract: We study a constructive algorithm that approximates Gateaux derivatives for statistical functionals by finite differencing, with a focus on functionals that arise in causal inference. We study the case where probability distributions are not known a priori but need to be estimated from data. These estimated distributions lead to empirical Gateaux derivatives, and we study the relationships betwe…

    Submitted 15 June, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: Extended version of conference version "Empirical Gateaux Derivatives for Causal Inference" accepted at NeurIPS 2022; new results on optimization and sensitivity analysis

  29. arXiv:2208.05949  [pdf, other]

    stat.ME cs.LG stat.ML

    Valid Inference after Causal Discovery

    Authors: Paula Gradu, Tijana Zrnic, Yixin Wang, Michael I. Jordan

    Abstract: Causal discovery and causal effect estimation are two fundamental tasks in causal inference. While many methods have been developed for each task individually, statistical challenges arise when applying these methods jointly: estimating causal effects after running causal discovery algorithms on the same data leads to "double dipping," invalidating the coverage guarantees of classical confidence i…

    Submitted 20 March, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

  30. arXiv:2208.05363  [pdf, ps, other]

    cs.LG cs.AI cs.GT math.OC stat.ML

    Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

    Authors: Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael I. Jordan

    Abstract: We consider learning Nash equilibria in two-player zero-sum Markov Games with nonlinear function approximation, where the action-value function is approximated by a function in a Reproducing Kernel Hilbert Space (RKHS). The key challenge is how to do exploration in the high-dimensional function space. We propose a novel online learning algorithm to find a Nash equilibrium by minimizing the duality…

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 42 pages

  31. arXiv:2207.07105  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Continuous-time Analysis for Variational Inequalities: An Overview and Desiderata

    Authors: Tatjana Chavdarova, Ya-Ping Hsieh, Michael I. Jordan

    Abstract: Algorithms that solve zero-sum games, multi-objective agent objectives, or, more generally, variational inequality (VI) problems are notoriously unstable on general problems. Owing to the increasing need for solving such problems in machine learning, this instability has been highlighted in recent years as a significant research challenge. In this paper, we provide an overview of recent progress i…

    Submitted 14 July, 2022; originally announced July 2022.

  32. arXiv:2207.06343  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

    Authors: Yaodong Yu, Alexander Wei, Sai Praneeth Karimireddy, Yi Ma, Michael I. Jordan

    Abstract: State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions. For neural networks, even when centralized SGD easily finds a solution that is simultaneously performant for all clients, current federated optimization methods fail to converge to a comparable solution. We show that this performance disparity can l…

    Submitted 5 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS) 2022. V2 releases code

    MSC Class: 68W40; 68W15; 90C25; 90C06 ACM Class: G.1.6; F.2.1; E.4

  33. arXiv:2207.01616  [pdf, other]

    cs.IR cs.LG stat.ML

    Breaking Feedback Loops in Recommender Systems with Causal Inference

    Authors: Karl Krauth, Yixin Wang, Michael I. Jordan

    Abstract: Recommender systems play a key role in shaping modern web ecosystems. These systems alternate between (1) making recommendations, (2) collecting user responses to these recommendations, and (3) retraining the recommendation algorithm based on this feedback. During this process the recommender system influences the user behavioral data that is subsequently used to update it, thus creating a feedback…

    Submitted 14 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

  34. arXiv:2207.01609  [pdf, other]

    cs.IR cs.LG stat.ML

    Recommendation Systems with Distribution-Free Reliability Guarantees

    Authors: Anastasios N. Angelopoulos, Karl Krauth, Stephen Bates, Yixin Wang, Michael I. Jordan

    Abstract: When building recommendation systems, we seek to output a helpful set of items to the user. Under the hood, a ranking model predicts which of two candidate items is better, and we must distill these pairwise comparisons into the user-facing output. However, a learned ranking model is never perfect, so taking its predictions at face value gives no guarantee that the user-facing output is reliable.…

    Submitted 4 July, 2022; originally announced July 2022.

  35. arXiv:2206.14276  [pdf, other]

    cs.DC cs.LG cs.MS stat.AP

    NumS: Scalable Array Programming for the Cloud

    Authors: Melih Elibol, Vinamra Benara, Samyu Yagati, Lianmin Zheng, Alvin Cheung, Michael I. Jordan, Ion Stoica

    Abstract: Scientists increasingly rely on Python tools to perform scalable distributed memory array operations using rich, NumPy-like expressions. However, many of these tools rely on dynamic schedulers optimized for abstract task graphs, which often encounter memory and network bandwidth-related bottlenecks due to sub-optimal data and operator placement decisions. Tools built on the message passing interfa…

    Submitted 12 July, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

  36. arXiv:2206.13102  [pdf, other]

    cs.GT cs.CY cs.IR cs.LG stat.ML

    Modeling Content Creator Incentives on Algorithm-Curated Platforms

    Authors: Jiri Hron, Karl Krauth, Michael I. Jordan, Niki Kilbertus, Sarah Dean

    Abstract: Content creators compete for user attention. Their reach crucially depends on algorithmic choices made by developers on online platforms. To maximize exposure, many creators adapt strategically, as evidenced by examples like the sprawling search engine optimization industry. This begets competition for the finite user attention pool. We formalize these dynamics in what we call an exposure game, a…

    Submitted 6 July, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: presented at ICLR 2023 (top 5%)

  37. arXiv:2206.10575  [pdf, other]

    stat.ML cs.LG math.OC

    Solving Constrained Variational Inequalities via a First-order Interior Point-based Method

    Authors: Tong Yang, Michael I. Jordan, Tatjana Chavdarova

    Abstract: We develop an interior-point approach to solve constrained variational inequality (cVI) problems. Inspired by the efficacy of the alternating direction method of multipliers (ADMM) method in the single-objective context, we generalize ADMM to derive a first-order method for cVIs, that we refer to as ADMM-based interior-point method for constrained VIs (ACVI). We provide convergence guarantees for…

    Submitted 4 March, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Journal ref: International Conference on Learning Representations 2023, Kigali, Rwanda

  38. arXiv:2206.02757  [pdf, other]

    cs.LG cs.AI stat.ML

    Robust Calibration with Multi-domain Temperature Scaling

    Authors: Yaodong Yu, Stephen Bates, Yi Ma, Michael I. Jordan

    Abstract: Uncertainty quantification is essential for the reliable deployment of machine learning models to high-stakes application domains. Uncertainty quantification is all the more challenging when training distribution and test distribution are different, even when the distribution shifts are mild. Despite the ubiquity of distribution shifts in real-world applications, existing uncertainty quantification app…

    Submitted 6 June, 2022; originally announced June 2022.

  39. arXiv:2205.11765  [pdf, ps, other]

    cs.LG cs.AI cs.CR cs.DC stat.ML

    Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees

    Authors: Banghua Zhu, Lun Wang, Qi Pang, Shuai Wang, Jiantao Jiao, Dawn Song, Michael I. Jordan

    Abstract: We propose Byzantine-robust federated learning protocols with nearly optimal statistical rates. In contrast to prior work, our proposed protocols improve the dimension dependence and achieve a tight statistical rate in terms of all the parameters for strongly convex losses. We benchmark against competing protocols and show the empirical superiority of the proposed protocols. Finally, we remark tha…

    Submitted 18 March, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

  40. arXiv:2205.06812  [pdf, other]

    cs.GT cs.LG cs.MA math.ST stat.ME

    Principal-Agent Hypothesis Testing

    Authors: Stephen Bates, Michael I. Jordan, Michael Sklar, Jake A. Soloff

    Abstract: Consider the relationship between a regulator (the principal) and an experimenter (the agent) such as a pharmaceutical company. The pharmaceutical company wishes to sell a drug for profit, whereas the regulator wishes to allow only efficacious drugs to be marketed. The efficacy of the drug is not known to the regulator, so the pharmaceutical company must run a costly trial to prove efficacy to the…

    Submitted 15 April, 2024; v1 submitted 13 May, 2022; originally announced May 2022.

  41. arXiv:2203.10592  [pdf, other]

    stat.ML cs.LG math.DG math.OC math.ST

    Geometric Methods for Sampling, Optimisation, Inference and Adaptive Agents

    Authors: Alessandro Barp, Lancelot Da Costa, Guilherme França, Karl Friston, Mark Girolami, Michael I. Jordan, Grigorios A. Pavliotis

    Abstract: In this chapter, we identify fundamental geometric structures that underlie the problems of sampling, optimisation, inference and adaptive decision-making. Based on this identification, we derive algorithms that exploit these geometric structures to solve these problems efficiently. We show that a wide range of geometric theories emerge naturally in these fields, ranging from measure-preserving pr…

    Submitted 25 July, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: 30 pages, 4 figures; 42 pages including table of contents and references

    Journal ref: Handbook of Statistics, vol. 46, pp. 21--78 (2022)

  42. arXiv:2202.12797  [pdf, other]

    cs.LG cs.GT math.OC stat.ML

    Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach

    Authors: Shuang Qiu, Boxiang Lyu, Qinglin Meng, Zhaoran Wang, Zhuoran Yang, Michael I. Jordan

    Abstract: Dynamic mechanism design studies how mechanism designers should allocate resources among agents in a time-varying environment. We consider the problem where the agents interact with the mechanism designer according to an unknown Markov Decision Process (MDP), where agent rewards and the mechanism designer's state evolve according to an episodic MDP with unknown reward functions and transition kern…

    Submitted 25 February, 2024; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: Minor Revision for JMLR. The first three authors contribute equally

  43. arXiv:2202.10665  [pdf, ps, other]

    cs.LG stat.ME

    Partial Identification with Noisy Covariates: A Robust Optimization Approach

    Authors: Wenshuo Guo, Mingzhang Yin, Yixin Wang, Michael I. Jordan

    Abstract: Causal inference from observational datasets often relies on measuring and adjusting for covariates. In practice, measurements of the covariates can often be noisy and/or biased, or only measurements of their proxies may be available. Directly adjusting for these imperfect measurements of the covariates can lead to biased causal estimates. Moreover, without additional assumptions, the causal effec…

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: Proceedings of Conference on Causal Learning and Reasoning (CLeaR) 2022

  44. arXiv:2202.05265  [pdf, other]

    cs.LG cs.CV eess.IV q-bio.QM stat.ML

    Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging

    Authors: Anastasios N Angelopoulos, Amit P Kohli, Stephen Bates, Michael I Jordan, Jitendra Malik, Thayer Alshaabi, Srigokul Upadhyayula, Yaniv Romano

    Abstract: Image-to-image regression is an important learning task, used frequently in biological imaging. Current algorithms, however, do not generally offer statistical guarantees that protect against a model's mistakes and hallucinations. To address this, we develop uncertainty quantification techniques with rigorous statistical guarantees for image-to-image regression problems. In particular, we show how…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Code available at https://github.com/aangelopoulos/im2im-uq

  45. arXiv:2202.04709  [pdf, other]

    cs.LG stat.ME

    Transferred Q-learning

    Authors: Elynn Y. Chen, Michael I. Jordan, Sai Li

    Abstract: We consider $Q$-learning with knowledge transfer, using samples from a target reinforcement learning (RL) task as well as source samples from different but related RL tasks. We propose transfer learning algorithms for both batch and online $Q$-learning with offline source studies. The proposed transferred $Q$-learning algorithm contains a novel re-targeting step that enables vertical information-c…

    Submitted 9 February, 2022; originally announced February 2022.

  46. arXiv:2202.03613  [pdf, other]

    cs.LG q-bio.QM stat.ME

    Conformal prediction for the design problem

    Authors: Clara Fannjiang, Stephen Bates, Anastasios N. Angelopoulos, Jennifer Listgarten, Michael I. Jordan

    Abstract: Many applications of machine learning methods involve an iterative protocol in which data are collected, a model is trained, and then outputs of that model are used to choose what data to consider next. For example, one data-driven approach for designing proteins is to train a regression model to predict the fitness of protein sequences, then use it to propose new sequences believed to exhibit gre…

    Submitted 31 May, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: for associated code, see https://github.com/clarafy/conformal-for-design

    Journal ref: Proc. Natl. Acad. Sci. 119 (43) e2204569119 (2022)

  47. arXiv:2202.01269  [pdf, ps, other]

    cs.LG eess.SP math.ST stat.CO stat.ML

    Robust Estimation for Nonparametric Families via Generative Adversarial Networks

    Authors: Banghua Zhu, Jiantao Jiao, Michael I. Jordan

    Abstract: We provide a general framework for designing Generative Adversarial Networks (GANs) to solve high dimensional robust statistics problems, which aim at estimating an unknown parameter of the true distribution given adversarially corrupted samples. Prior work focuses on the problem of robust mean and covariance estimation when the true distribution lies in the family of Gaussian distributions or elliptic…

    Submitted 2 February, 2022; originally announced February 2022.

  48. arXiv:2202.00088  [pdf, other]

    cs.LG stat.ME

    Reinforcement Learning with Heterogeneous Data: Estimation and Inference

    Authors: Elynn Y. Chen, Rui Song, Michael I. Jordan

    Abstract: Reinforcement Learning (RL) has the promise of providing data-driven support for decision-making in a wide range of problems in healthcare, education, business, and other domains. Classical RL methods focus on the mean of the total return and, thus, may provide misleading results in the setting of the heterogeneous populations that commonly underlie large-scale datasets. We introduce the K-Heterog…

    Submitted 31 January, 2022; originally announced February 2022.

  49. arXiv:2201.08536  [pdf, other]

    stat.ML cs.LG

    Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

    Authors: Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

    Abstract: Various algorithms for reinforcement learning (RL) exhibit dramatic variation in their convergence rates as a function of problem structure. Such problem-dependent behavior is not captured by worst-case analyses and has accordingly inspired a growing effort in obtaining instance-dependent guarantees and deriving instance-optimal algorithms for RL problems. This research has been carried out, howev…

    Submitted 20 January, 2022; originally announced January 2022.

  50. arXiv:2201.08518  [pdf, ps, other]

    math.ST cs.LG math.OC stat.ML

    Optimal variance-reduced stochastic approximation in Banach spaces

    Authors: Wenlong Mou, Koulik Khamaru, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

    Abstract: We study the problem of estimating the fixed point of a contractive operator defined on a separable Banach space. Focusing on a stochastic query model that provides noisy evaluations of the operator, we analyze a variance-reduced stochastic approximation scheme, and establish non-asymptotic bounds for both the operator defect and the estimation error, measured in an arbitrary semi-norm. In contras…

    Submitted 29 November, 2022; v1 submitted 20 January, 2022; originally announced January 2022.