Skip to main content

Showing 1–42 of 42 results for author: Simchowitz, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.11428  [pdf, other

    cs.LG math.OC stat.ML

    Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

    Authors: Adam Block, Dylan J. Foster, Akshay Krishnamurthy, Max Simchowitz, Cyril Zhang

    Abstract: This work studies training instabilities of behavior cloning with deep neural networks. We observe that minibatch SGD updates to the policy network during training result in sharp oscillations in long-horizon rewards, despite negligibly affecting the behavior cloning loss. We empirically disentangle the statistical and computational causes of these oscillations, and find them to stem from the chao… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  2. arXiv:2307.14619  [pdf, other

    cs.LG math.ST stat.ML

    Provable Guarantees for Generative Behavior Cloning: Bridging Low-Level Stability and High-Level Behavior

    Authors: Adam Block, Ali Jadbabaie, Daniel Pfrommer, Max Simchowitz, Russ Tedrake

    Abstract: We propose a theoretical framework for studying behavior cloning of complex expert demonstrations using generative modeling. Our framework invokes low-level controllers - either learned or implicit in position-command control - to stabilize imitation around expert demonstrations. We show that with (a) a suitable low-level stability guarantee and (b) a powerful enough generative model as our imitat… ▽ More

    Submitted 24 October, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: updated figures, minor notational change for readability

  3. arXiv:2307.06457  [pdf, other

    cs.LG cs.DS stat.ML

    Tackling Combinatorial Distribution Shift: A Matrix Completion Perspective

    Authors: Max Simchowitz, Abhishek Gupta, Kaiqing Zhang

    Abstract: Obtaining rigorous statistical guarantees for generalization under distribution shift remains an open and active research area. We study a setting we call combinatorial distribution shift, where (a) under the test- and training-distributions, the labels $z$ are determined by pairs of features $(x,y)$, (b) the training distribution has coverage of certain marginal distributions over $x$ and $y$ sep… ▽ More

    Submitted 28 July, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: The 36th Annual Conference on Learning Theory (COLT 2023)

  4. arXiv:2305.09619  [pdf, other

    cs.LG math.OC stat.ML

    The Power of Learned Locally Linear Models for Nonlinear Policy Optimization

    Authors: Daniel Pfrommer, Max Simchowitz, Tyler Westenbroek, Nikolai Matni, Stephen Tu

    Abstract: A common pipeline in learning-based control is to iteratively estimate a model of system dynamics, and apply a trajectory optimization algorithm - e.g.~$\mathtt{iLQR}$ - on the learned model to minimize a target cost. This paper conducts a rigorous analysis of a simplified variant of this strategy for general nonlinear systems. We analyze an algorithm which iterates between estimating local linear… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  5. arXiv:2302.13934  [pdf, other

    cs.LG stat.ML

    Statistical Learning under Heterogeneous Distribution Shift

    Authors: Max Simchowitz, Anurag Ajay, Pulkit Agrawal, Akshay Krishnamurthy

    Abstract: This paper studies the prediction of a target $\mathbf{z}$ from a pair of random variables $(\mathbf{x},\mathbf{y})$, where the ground-truth predictor is additive $\mathbb{E}[\mathbf{z} \mid \mathbf{x},\mathbf{y}] = f_\star(\mathbf{x}) +g_{\star}(\mathbf{y})$. We study the performance of empirical risk minimization (ERM) over functions $f+g$, $f \in F$ and $g \in G$, fit on a given training distri… ▽ More

    Submitted 27 October, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  6. arXiv:2302.05430  [pdf, other

    stat.ML cs.LG

    Oracle-Efficient Smoothed Online Learning for Piecewise Continuous Decision Making

    Authors: Adam Block, Alexander Rakhlin, Max Simchowitz

    Abstract: Smoothed online learning has emerged as a popular framework to mitigate the substantial loss in statistical and computational complexity that arises when one moves from classical to adversarial learning. Unfortunately, for some spaces, it has been shown that efficient algorithms suffer an exponentially worse regret than that which is minimax optimal, even when the learner has access to an optimiza… ▽ More

    Submitted 19 March, 2024; v1 submitted 10 February, 2023; originally announced February 2023.

  7. arXiv:2301.11187  [pdf, ps, other

    stat.ML cs.LG

    Smoothed Online Learning for Prediction in Piecewise Affine Systems

    Authors: Adam Block, Max Simchowitz, Russ Tedrake

    Abstract: The problem of piecewise affine (PWA) regression and planning is of foundational importance to the study of online learning, control, and robotics, where it provides a theoretically and empirically tractable setting to study systems undergoing sharp changes in the dynamics. Unfortunately, due to the discontinuities that arise when crossing into different ``pieces,'' learning in general sequential… ▽ More

    Submitted 19 March, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  8. arXiv:2205.13056  [pdf, ps, other

    stat.ML cs.LG

    Efficient and Near-Optimal Smoothed Online Learning for Generalized Linear Functions

    Authors: Adam Block, Max Simchowitz

    Abstract: Due to the drastic gap in complexity between sequential and batch statistical learning, recent work has studied a smoothed sequential learning setting, where Nature is constrained to select contexts with density bounded by 1/σ with respect to a known measure μ. Unfortunately, for some function classes, there is an exponential gap between the statistically optimal regret and that which can be achie… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  9. arXiv:2202.11659  [pdf, other

    math.OC cs.DS cs.LG stat.ML

    Globally Convergent Policy Search over Dynamic Filters for Output Estimation

    Authors: Jack Umenberger, Max Simchowitz, Juan C. Perdomo, Kaiqing Zhang, Russ Tedrake

    Abstract: We introduce the first direct policy search algorithm which provably converges to the globally optimal $\textit{dynamic}$ filter for the classical problem of predicting the outputs of a linear dynamical system, given noisy, partial observations. Despite the ubiquity of partial observability in practice, theoretical guarantees for direct policy search algorithms, one of the backbones of modern rein… ▽ More

    Submitted 25 February, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

  10. arXiv:2202.07890  [pdf, other

    cs.LG math.OC stat.ML

    Online Control of Unknown Time-Varying Dynamical Systems

    Authors: Edgar Minasyan, Paula Gradu, Max Simchowitz, Elad Hazan

    Abstract: We study online control of time-varying linear systems with unknown dynamics in the nonstochastic control model. At a high level, we demonstrate that this setting is \emph{qualitatively harder} than that of either unknown time-invariant or known time-varying dynamics, and complement our negative results with algorithmic upper bounds in regimes where sublinear regret is possible. More specifically,… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

  11. arXiv:2201.11206  [pdf, other

    cs.LG stat.ML

    Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes

    Authors: Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson

    Abstract: Reward-free reinforcement learning (RL) considers the setting where the agent does not have access to a reward function during exploration, but must propose a near-optimal policy for an arbitrary reward function revealed only after exploring. In the the tabular setting, it is well known that this is a more difficult problem than reward-aware (PAC) RL -- where the agent has access to the reward fun… ▽ More

    Submitted 18 June, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

  12. arXiv:2112.03432  [pdf, other

    cs.LG stat.ML

    First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach

    Authors: Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson

    Abstract: Obtaining first-order regret bounds -- regret bounds scaling not as the worst-case but with some measure of the performance of the optimal policy on a given instance -- is a core question in sequential decision-making. While such bounds exist in many settings, they have proven elusive in reinforcement learning with large state spaces. In this work we address this gap, and show that it is possible… ▽ More

    Submitted 20 October, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  13. arXiv:2110.06418  [pdf, other

    eess.SY cs.LG stat.ML

    Stabilizing Dynamical Systems via Policy Gradient Methods

    Authors: Juan C. Perdomo, Jack Umenberger, Max Simchowitz

    Abstract: Stabilizing an unknown control system is one of the most fundamental problems in control systems engineering. In this paper, we provide a simple, model-free algorithm for stabilizing fully observed dynamical systems. While model-free methods have become increasingly popular in practice due to their simplicity and flexibility, stabilization via direct policy search has received surprisingly little… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: accepted for publication at Neurips 2021

  14. arXiv:2108.02717  [pdf, other

    cs.LG stat.ML

    Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

    Authors: Andrew Wagenmaker, Max Simchowitz, Kevin Jamieson

    Abstract: The theory of reinforcement learning has focused on two fundamental problems: achieving low regret, and identifying $ε$-optimal policies. While a simple reduction allows one to apply a low-regret algorithm to obtain an $ε$-optimal policy and achieve the worst-case optimal rate, it is unknown whether low-regret algorithms can obtain the instance-optimal rate for policy identification. We show this… ▽ More

    Submitted 21 June, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

  15. arXiv:2107.01509  [pdf, other

    cs.LG math.ST stat.ML

    Bayesian decision-making under misspecified priors with applications to meta-learning

    Authors: Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu, Thodoris Lykouris, Miroslav Dudík, Robert E. Schapire

    Abstract: Thompson sampling and other Bayesian sequential decision-making algorithms are among the most popular approaches to tackle explore/exploit trade-offs in (contextual) bandits. The choice of prior in these algorithms offers flexibility to encode domain knowledge but can also lead to poor performance when misspecified. In this paper, we demonstrate that performance degrades gracefully with misspecifi… ▽ More

    Submitted 3 July, 2021; originally announced July 2021.

  16. arXiv:2103.10620  [pdf, other

    math.OC cs.LG stat.ML

    Towards a Dimension-Free Understanding of Adaptive Linear Control

    Authors: Juan C. Perdomo, Max Simchowitz, Alekh Agarwal, Peter Bartlett

    Abstract: We study the problem of adaptive control of the linear quadratic regulator for systems in very high, or even infinite dimension. We demonstrate that while sublinear regret requires finite dimensional inputs, the ambient state dimension of the system need not be bounded in order to perform online control. We provide the first regret bounds for LQR which hold for infinite dimensional systems, replac… ▽ More

    Submitted 15 July, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: presented at COLT 2021

  17. arXiv:2102.05214  [pdf, other

    cs.LG math.OC stat.ML

    Task-Optimal Exploration in Linear Dynamical Systems

    Authors: Andrew Wagenmaker, Max Simchowitz, Kevin Jamieson

    Abstract: Exploration in unknown environments is a fundamental problem in reinforcement learning and control. In this work, we study task-guided exploration and determine what precisely an agent must learn about their environment in order to complete a particular task. Formally, we study a broad class of decision-making problems in the setting of linear dynamical systems, a class that includes the linear qu… ▽ More

    Submitted 9 July, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

  18. arXiv:2010.03799  [pdf, ps, other

    cs.LG math.OC math.ST stat.ML

    Learning the Linear Quadratic Regulator from Nonlinear Observations

    Authors: Zakaria Mhammedi, Dylan J. Foster, Max Simchowitz, Dipendra Misra, Wen Sun, Akshay Krishnamurthy, Alexander Rakhlin, John Langford

    Abstract: We introduce a new problem setting for continuous control called the LQR with Rich Observations, or RichLQR. In our setting, the environment is summarized by a low-dimensional continuous latent state with linear dynamics and quadratic costs, but the agent operates on high-dimensional, nonlinear observations such as images from a camera. To enable sample-efficient learning, we assume that the learn… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: To appear at NeurIPS 2020

  19. arXiv:2006.05910  [pdf, other

    cs.LG math.OC stat.ML

    Making Non-Stochastic Control (Almost) as Easy as Stochastic

    Authors: Max Simchowitz

    Abstract: Recent literature has made much progress in understanding \emph{online LQR}: a modern learning-theoretic take on the classical control problem in which a learner attempts to optimally control an unknown linear dynamical system with fully observed state, perturbed by i.i.d. Gaussian noise. It is now understood that the optimal regret on time horizon $T$ against the optimal control law scales as… ▽ More

    Submitted 4 October, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

  20. arXiv:2006.05051  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Constrained episodic reinforcement learning in concave-convex and knapsack settings

    Authors: Kianté Brantley, Miroslav Dudik, Thodoris Lykouris, Sobhan Miryoosefi, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

    Abstract: We propose an algorithm for tabular episodic reinforcement learning with constraints. We provide a modular analysis with strong theoretical guarantees for settings with concave rewards and convex constraints, and for settings with hard constraints (knapsacks). Most of the previous work in constrained reinforcement learning is limited to linear constraints, and the remaining work focuses on either… ▽ More

    Submitted 5 June, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: The NeurIPS 2020 version of this paper includes a small bug, leading to an incorrect dependence on H in Theorem 3.4. This version fixes it by adjusting Eq. (9), Theorem 3.4 and the relevant proofs. Changes in the main text are noted in red. Changes in the appendix are limited to Appendices B.1, B.5, and B.6 and the statement of Lemma F.3

  21. arXiv:2003.06740  [pdf, other

    cs.LG stat.ML

    Balancing Competing Objectives with Noisy Data: Score-Based Classifiers for Welfare-Aware Machine Learning

    Authors: Esther Rolf, Max Simchowitz, Sarah Dean, Lydia T. Liu, Daniel Björkegren, Moritz Hardt, Joshua Blumenstock

    Abstract: While real-world decisions involve many competing objectives, algorithmic decisions are often evaluated with a single objective function. In this paper, we study algorithmic policies which explicitly trade off between a private objective (such as profit) and a public objective (such as social welfare). We analyze a natural class of policies which trace an empirical Pareto frontier based on learned… ▽ More

    Submitted 15 July, 2020; v1 submitted 14 March, 2020; originally announced March 2020.

  22. arXiv:2003.00189  [pdf, ps, other

    cs.LG math.OC stat.ML

    Logarithmic Regret for Adversarial Online Control

    Authors: Dylan J. Foster, Max Simchowitz

    Abstract: We introduce a new algorithm for online linear-quadratic control in a known system subject to adversarial disturbances. Existing regret bounds for this setting scale as $\sqrt{T}$ unless strong stochastic assumptions are imposed on the disturbance process. We give the first algorithm with logarithmic regret for arbitrary adversarial disturbance sequences, provided the state and control costs are g… ▽ More

    Submitted 23 June, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: ICML 2020

  23. arXiv:2002.02794  [pdf, ps, other

    cs.LG stat.ML

    Reward-Free Exploration for Reinforcement Learning

    Authors: Chi **, Akshay Krishnamurthy, Max Simchowitz, Tiancheng Yu

    Abstract: Exploration is widely regarded as one of the most challenging aspects of reinforcement learning (RL), with many naive approaches succumbing to exponential sample complexity. To isolate the challenges of exploration, we propose a new "reward-free RL" framework. In the exploration phase, the agent first collects trajectories from an MDP $\mathcal{M}$ without a pre-specified reward function. After ex… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  24. arXiv:2001.09576  [pdf, other

    cs.LG math.OC stat.ML

    Naive Exploration is Optimal for Online LQR

    Authors: Max Simchowitz, Dylan J. Foster

    Abstract: We consider the problem of online adaptive control of the linear quadratic regulator, where the true system parameters are unknown. We prove new upper and lower bounds demonstrating that the optimal regret scales as $\widetildeΘ({\sqrt{d_{\mathbf{u}}^2 d_{\mathbf{x}} T}})$, where $T$ is the number of time steps, $d_{\mathbf{u}}$ is the dimension of the input space, and $d_{\mathbf{x}}$ is the dime… ▽ More

    Submitted 3 October, 2023; v1 submitted 26 January, 2020; originally announced January 2020.

  25. arXiv:2001.09254  [pdf, other

    cs.LG math.OC stat.ML

    Improper Learning for Non-Stochastic Control

    Authors: Max Simchowitz, Karan Singh, Elad Hazan

    Abstract: We consider the problem of controlling a possibly unknown linear dynamical system with adversarial perturbations, adversarially chosen convex loss functions, and partially observed states, known as non-stochastic control. We introduce a controller parametrization based on the denoised observations, and prove that applying online gradient descent to this parametrization yields a new controller whic… ▽ More

    Submitted 24 June, 2020; v1 submitted 24 January, 2020; originally announced January 2020.

  26. arXiv:1911.08689  [pdf, ps, other

    cs.LG cs.AI cs.DS stat.ML

    Corruption-robust exploration in episodic reinforcement learning

    Authors: Thodoris Lykouris, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

    Abstract: We initiate the study of multi-stage episodic reinforcement learning under adversarial corruptions in both the rewards and the transition probabilities of the underlying system extending recent results for the special case of stochastic bandits. We provide a framework which modifies the aggressive exploration enjoyed by existing reinforcement learning approaches based on "optimism in the face of u… ▽ More

    Submitted 31 October, 2023; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted in Mathematics of Operations Research. Preliminary version was accepted for presentation at COLT'21

  27. arXiv:1911.02212  [pdf, other

    cs.LG cs.DS math.OC stat.ML

    The gradient complexity of linear regression

    Authors: Mark Braverman, Elad Hazan, Max Simchowitz, Blake Woodworth

    Abstract: We investigate the computational complexity of several basic linear algebra primitives, including largest eigenvector computation and linear regression, in the computational model that allows access to the data via a matrix-vector product oracle. We show that for polynomial accuracy, $Θ(d)$ calls to the oracle are necessary and sufficient even for a randomized algorithm. Our lower bound is based… ▽ More

    Submitted 23 May, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

  28. arXiv:1905.03814  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs

    Authors: Max Simchowitz, Kevin Jamieson

    Abstract: This paper establishes that optimistic algorithms attain gap-dependent and non-asymptotic logarithmic regret for episodic MDPs. In contrast to prior work, our bounds do not suffer a dependence on diameter-like quantities or ergodicity, and smoothly interpolate between the gap dependent logarithmic-regret, and the $\widetilde{\mathcal{O}}(\sqrt{HSAT})$-minimax rate. The key technique in our analysi… ▽ More

    Submitted 28 October, 2019; v1 submitted 9 May, 2019; originally announced May 2019.

  29. arXiv:1902.00768  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Learning Linear Dynamical Systems with Semi-Parametric Least Squares

    Authors: Max Simchowitz, Ross Boczar, Benjamin Recht

    Abstract: We analyze a simple prefiltered variation of the least squares estimator for the problem of estimation with biased, semi-parametric noise, an error model studied more broadly in causal statistics and active learning. We prove an oracle inequality which demonstrates that this procedure provably mitigates the variance introduced by long-term dependencies. We then demonstrate that prefiltered least s… ▽ More

    Submitted 2 February, 2019; originally announced February 2019.

  30. arXiv:1809.10611  [pdf, other

    cs.LG cs.RO stat.ML

    A Successive-Elimination Approach to Adaptive Robotic Sensing

    Authors: Esther Rolf, David Fridovich-Keil, Max Simchowitz, Benjamin Recht, Claire Tomlin

    Abstract: We study an adaptive source seeking problem, in which a mobile robot must identify the strongest emitter(s) of a signal in an environment with background emissions. Background signals may be highly heterogeneous and can mislead algorithms that are based on receding horizon control. We propose AdaSearch, a general algorithm for adaptive source seeking in the face of heterogeneous background noise.… ▽ More

    Submitted 23 June, 2020; v1 submitted 27 September, 2018; originally announced September 2018.

    Journal ref: IEEE Transactions on Robotics Research, 2020

  31. arXiv:1808.10013  [pdf, other

    cs.LG stat.ML

    The implicit fairness criterion of unconstrained learning

    Authors: Lydia T. Liu, Max Simchowitz, Moritz Hardt

    Abstract: We clarify what fairness guarantees we can and cannot expect to follow from unconstrained machine learning. Specifically, we characterize when unconstrained learning on its own implies group calibration, that is, the outcome variable is conditionally independent of group membership given the score. We show that under reasonable conditions, the deviation from satisfying group calibration is upper b… ▽ More

    Submitted 25 January, 2019; v1 submitted 29 August, 2018; originally announced August 2018.

    Comments: 37 pages, 9 figures

  32. arXiv:1808.04523  [pdf, other

    cs.LG math.ST stat.ML

    Adaptive Sampling for Convex Regression

    Authors: Max Simchowitz, Kevin Jamieson, Jordan W. Suchow, Thomas L. Griffiths

    Abstract: In this paper, we introduce the first principled adaptive-sampling procedure for learning a convex function in the $L_\infty$ norm, a problem that arises often in the behavioral and social sciences. We present a function-specific measure of complexity and use it to prove that, for each convex function $f_{\star}$, our algorithm nearly attains the information-theoretically optimal, function-specifi… ▽ More

    Submitted 26 August, 2018; v1 submitted 14 August, 2018; originally announced August 2018.

  33. arXiv:1807.09386  [pdf, other

    cs.LG cs.DS math.OC stat.ML

    On the Randomized Complexity of Minimizing a Convex Quadratic Function

    Authors: Max Simchowitz

    Abstract: Minimizing a convex, quadratic objective of the form $f_{\mathbf{A},\mathbf{b}}(x) := \frac{1}{2}x^\top \mathbf{A} x - \langle \mathbf{b}, x \rangle$ for $\mathbf{A} \succ 0 $ is a fundamental problem in machine learning and optimization. In this work, we prove gradient-query complexity lower bounds for minimizing convex quadratic functions which apply to both deterministic and \emph{randomized} a… ▽ More

    Submitted 16 April, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

  34. arXiv:1804.01221  [pdf, other

    cs.LG cs.DS cs.IT math.OC stat.ML

    Tight Query Complexity Lower Bounds for PCA via Finite Sample Deformed Wigner Law

    Authors: Max Simchowitz, Ahmed El Alaoui, Benjamin Recht

    Abstract: We prove a \emph{query complexity} lower bound for approximating the top $r$ dimensional eigenspace of a matrix. We consider an oracle model where, given a symmetric matrix $\mathbf{M} \in \mathbb{R}^{d \times d}$, an algorithm $\mathsf{Alg}$ is allowed to make $\mathsf{T}$ exact queries of the form $\mathsf{w}^{(i)} = \mathbf{M} \mathsf{v}^{(i)}$ for $i$ in $\{1,...,\mathsf{T}\}$, where… ▽ More

    Submitted 27 June, 2020; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: To appear in STOC 2018

  35. arXiv:1803.04383  [pdf, other

    cs.LG stat.ML

    Delayed Impact of Fair Machine Learning

    Authors: Lydia T. Liu, Sarah Dean, Esther Rolf, Max Simchowitz, Moritz Hardt

    Abstract: Fairness in machine learning has predominantly been studied in static classification settings without concern for how decisions change the underlying population over time. Conventional wisdom suggests that fairness criteria promote the long-term well-being of those groups they aim to protect. We study how static fairness criteria interact with temporal indicators of well-being, such as long-term… ▽ More

    Submitted 7 April, 2018; v1 submitted 12 March, 2018; originally announced March 2018.

    Comments: 37 pages, 6 figures

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:3150-3158, 2018

  36. arXiv:1802.08334  [pdf, other

    cs.LG math.OC stat.ML

    Learning Without Mixing: Towards A Sharp Analysis of Linear System Identification

    Authors: Max Simchowitz, Horia Mania, Stephen Tu, Michael I. Jordan, Benjamin Recht

    Abstract: We prove that the ordinary least-squares (OLS) estimator attains nearly minimax optimal performance for the identification of linear dynamical systems from a single observed trajectory. Our upper bound relies on a generalization of Mendelson's small-ball method to dependent data, eschewing the use of standard mixing-time arguments. Our lower bounds reveal that these upper bounds match up to logari… ▽ More

    Submitted 24 May, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

  37. arXiv:1801.01253  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Approximate Ranking from Pairwise Comparisons

    Authors: Reinhard Heckel, Max Simchowitz, Kannan Ramchandran, Martin J. Wainwright

    Abstract: A common problem in machine learning is to rank a set of n items based on pairwise comparisons. Here ranking refers to partitioning the items into sets of pre-specified sizes according to their scores, which includes identification of the top-k items as the most prominent special case. The score of a given item is defined as the probability that it beats a randomly chosen other item. Finding an ex… ▽ More

    Submitted 4 January, 2018; originally announced January 2018.

    Comments: AISTATS 2017

  38. arXiv:1710.07406  [pdf, ps, other

    stat.ML cs.LG math.OC

    First-order Methods Almost Always Avoid Saddle Points

    Authors: Jason D. Lee, Ioannis Panageas, Georgios Piliouras, Max Simchowitz, Michael I. Jordan, Benjamin Recht

    Abstract: We establish that first-order methods avoid saddle points for almost all initializations. Our results apply to a wide variety of first-order methods, including gradient descent, block coordinate descent, mirror descent and variants thereof. The connecting thread is that such algorithms can be studied from a dynamical systems perspective in which appropriate instantiations of the Stable Manifold Th… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

  39. arXiv:1704.04548  [pdf, other

    cs.LG cs.DS cs.IT math.CO stat.ML

    On the Gap Between Strict-Saddles and True Convexity: An Omega(log d) Lower Bound for Eigenvector Approximation

    Authors: Max Simchowitz, Ahmed El Alaoui, Benjamin Recht

    Abstract: We prove a \emph{query complexity} lower bound on rank-one principal component analysis (PCA). We consider an oracle model where, given a symmetric matrix $M \in \mathbb{R}^{d \times d}$, an algorithm is allowed to make $T$ \emph{exact} queries of the form $w^{(i)} = Mv^{(i)}$ for $i \in \{1,\dots,T\}$, where $v^{(i)}$ is drawn from a distribution which depends arbitrarily on the past queries and… ▽ More

    Submitted 14 April, 2017; originally announced April 2017.

  40. arXiv:1702.05186  [pdf, other

    cs.LG stat.ML

    The Simulator: Understanding Adaptive Sampling in the Moderate-Confidence Regime

    Authors: Max Simchowitz, Kevin Jamieson, Benjamin Recht

    Abstract: We propose a novel technique for analyzing adaptive sampling called the {\em Simulator}. Our approach differs from the existing methods by considering not how much information could be gathered by any fixed sampling strategy, but how difficult it is to distinguish a good sampling strategy from a bad one given the limited amount of data collected up to any given time. This change of perspective all… ▽ More

    Submitted 23 April, 2023; v1 submitted 16 February, 2017; originally announced February 2017.

  41. arXiv:1603.02752  [pdf, ps, other

    cs.LG stat.ML

    Best-of-K Bandits

    Authors: Max Simchowitz, Kevin Jamieson, Benjamin Recht

    Abstract: This paper studies the Best-of-K Bandit game: At each time the player chooses a subset S among all N-choose-K possible options and observes reward max(X(i) : i in S) where X is a random vector drawn from a joint distribution. The objective is to identify the subset that achieves the highest expected reward with high probability using as few queries as possible. We present distribution-dependent lo… ▽ More

    Submitted 18 March, 2016; v1 submitted 8 March, 2016; originally announced March 2016.

  42. arXiv:1602.04915  [pdf, ps, other

    stat.ML cs.LG math.OC

    Gradient Descent Converges to Minimizers

    Authors: Jason D. Lee, Max Simchowitz, Michael I. Jordan, Benjamin Recht

    Abstract: We show that gradient descent converges to a local minimizer, almost surely with random initialization. This is proved by applying the Stable Manifold Theorem from dynamical systems theory.

    Submitted 4 March, 2016; v1 submitted 16 February, 2016; originally announced February 2016.

    Comments: Submitted to COLT 2016