Skip to main content

Showing 1–41 of 41 results for author: Golowich, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13668  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Improved bounds for calibration via stronger sign preservation games

    Authors: Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich, Robert Kleinberg, Princewill Okoroafor

    Abstract: A set of probabilistic forecasts is calibrated if each prediction of the forecaster closely approximates the empirical distribution of outcomes on the subset of timesteps where that prediction was made. We study the fundamental problem of online calibrated forecasting of binary sequences, which was initially studied by Foster & Vohra (1998). They derived an algorithm with $O(T^{2/3})$ calibration… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.13116  [pdf, ps, other

    cs.GT

    A Lower Bound on Swap Regret in Extensive-Form Games

    Authors: Constantinos Daskalakis, Gabriele Farina, Noah Golowich, Tuomas Sandholm, Brian Hu Zhang

    Abstract: Recent simultaneous works by Peng and Rubinstein [2024] and Dagan et al. [2024] have demonstrated the existence of a no-swap-regret learning algorithm that can reach $ε$ average swap regret against an adversary in any extensive-form game within $m^{\tilde{\mathcal O}(1/ε)}$ rounds, where $m$ is the number of nodes in the game tree. However, the question of whether a $\mathrm{poly}(m, 1/ε)$-round a… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.11686  [pdf, ps, other

    cs.LG cs.AI stat.ML

    The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation

    Authors: Noah Golowich, Ankur Moitra

    Abstract: In this paper, we study the offline RL problem with linear function approximation. Our main structural assumption is that the MDP has low inherent Bellman error, which stipulates that linear value functions have linear Bellman backups with respect to the greedy policy. This assumption is natural in that it is essentially the minimal assumption required for value iteration to succeed. We give a com… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: RLC 2024

  4. arXiv:2406.11667  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Is Efficient PAC Learning Possible with an Oracle That Responds 'Yes' or 'No'?

    Authors: Constantinos Daskalakis, Noah Golowich

    Abstract: The empirical risk minimization (ERM) principle has been highly impactful in machine learning, leading both to near-optimal theoretical guarantees for ERM-based learning algorithms as well as driving many of the recent empirical successes in deep learning. In this paper, we investigate the question of whether the ability to perform ERM, which computes a hypothesis minimizing empirical risk on a gi… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  5. arXiv:2406.11640  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions

    Authors: Noah Golowich, Ankur Moitra

    Abstract: One of the most natural approaches to reinforcement learning (RL) with function approximation is value iteration, which inductively generates approximations to the optimal value function by solving a sequence of regression problems. To ensure the success of value iteration, it is typically assumed that Bellman completeness holds, which ensures that these regression problems are well-specified. We… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  6. arXiv:2406.07920  [pdf, ps, other

    cs.LG cs.AI cs.CC math.ST stat.ML

    Near-Optimal Learning and Planning in Separated Latent MDPs

    Authors: Fan Chen, Constantinos Daskalakis, Noah Golowich, Alexander Rakhlin

    Abstract: We study computational and statistical aspects of learning Latent Markov Decision Processes (LMDPs). In this model, the learner interacts with an MDP drawn at the beginning of each epoch from an unknown mixture of MDPs. To sidestep known impossibility results, we consider several notions of separation of the constituent MDPs. The main thrust of this paper is in establishing a nearly-sharp *statist… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  7. arXiv:2406.02633  [pdf, ps, other

    cs.CR cs.AI cs.LG

    Edit Distance Robust Watermarks for Language Models

    Authors: Noah Golowich, Ankur Moitra

    Abstract: Motivated by the problem of detecting AI-generated text, we consider the problem of watermarking the output of language models with provable guarantees. We aim for watermarks which satisfy: (a) undetectability, a cryptographic notion introduced by Christ, Gunn & Zamir (2024) which stipulates that it is computationally hard to distinguish watermarked language model outputs from the model's actual o… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  8. arXiv:2406.01799  [pdf, other

    cs.LG math.OC stat.ML

    Online Control in Population Dynamics

    Authors: Noah Golowich, Elad Hazan, Zhou Lu, Dhruv Rohatgi, Y. Jennifer Sun

    Abstract: The study of population dynamics originated with early sociological works but has since extended into many fields, including biology, epidemiology, evolutionary game theory, and economics. Most studies on population dynamics focus on the problem of prediction rather than control. Existing mathematical models for control in population dynamics are often restricted to specific, noise-free dynamics,… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2404.11325  [pdf, ps, other

    cs.CR cs.DS

    On Learning Parities with Dependent Noise

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: In this expository note we show that the learning parities with noise (LPN) assumption is robust to weak dependencies in the noise distribution of small batches of samples. This provides a partial converse to the linearization technique of [AG11]. The material in this note is drawn from a recent work by the authors [GMR24], where the robustness guarantee was a key component in a cryptographic sepa… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: This note draws heavily from arXiv:2404.03774

  10. arXiv:2404.03774  [pdf, other

    cs.LG cs.CC cs.CR cs.DS

    Exploration is Harder than Prediction: Cryptographically Separating Reinforcement Learning from Supervised Learning

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: Supervised learning is often computationally easy in practice. But to what extent does this mean that other modes of learning, such as reinforcement learning (RL), ought to be computationally easy by extension? In this work we show the first cryptographic separation between RL and supervised learning, by exhibiting a class of block MDPs and associated decoding functions where reward-free explorati… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 112 pages, 3 figures

  11. arXiv:2310.19786  [pdf, ps, other

    cs.LG cs.AI cs.GT

    From External to Swap Regret 2.0: An Efficient Reduction and Oblivious Adversary for Large Action Spaces

    Authors: Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich

    Abstract: We provide a novel reduction from swap-regret minimization to external-regret minimization, which improves upon the classical reductions of Blum-Mansour [BM07] and Stolz-Lugosi [SL05] in that it does not require finiteness of the space of actions. We show that, whenever there exists a no-external-regret algorithm for some hypothesis class, there must also exist a no-swap-regret algorithm for that… ▽ More

    Submitted 6 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  12. arXiv:2309.12226  [pdf, ps, other

    cs.GT cs.DS cs.LG

    Smooth Nash Equilibria: Algorithms and Complexity

    Authors: Constantinos Daskalakis, Noah Golowich, Nika Haghtalab, Abhishek Shetty

    Abstract: A fundamental shortcoming of the concept of Nash equilibrium is its computational intractability: approximating Nash equilibria in normal-form games is PPAD-hard. In this paper, inspired by the ideas of smoothed analysis, we introduce a relaxed variant of Nash equilibrium called $σ$-smooth Nash equilibrium, for a smoothness parameter $σ$. In a $σ$-smooth Nash equilibrium, players only need to achi… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  13. arXiv:2309.09457  [pdf, ps, other

    cs.LG cs.AI cs.DS math.OC stat.ML

    Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: The key assumption underlying linear Markov Decision Processes (MDPs) is that the learner has access to a known feature map $φ(x, a)$ that maps state-action pairs to $d$-dimensional vectors, and that the rewards and transitions are linear functions in this representation. But where do these features come from? In the absence of expert domain knowledge, a tempting strategy is to use the ``kitchen s… ▽ More

    Submitted 18 September, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

  14. arXiv:2305.00684  [pdf, other

    cs.LG cs.AI cs.GT cs.MA stat.ML

    On the Complexity of Multi-Agent Decision Making: From Learning in Games to Partial Monitoring

    Authors: Dylan J. Foster, Dean P. Foster, Noah Golowich, Alexander Rakhlin

    Abstract: A central problem in the theory of multi-agent reinforcement learning (MARL) is to understand what structural conditions and algorithmic principles lead to sample-efficient learning guarantees, and how these considerations change as we move from few to many agents. We study this question in a general framework for interactive decision making with multiple agents, encompassing Markov games with fun… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 95 pages

  15. arXiv:2303.12287  [pdf, ps, other

    cs.LG cs.AI cs.GT stat.ML

    Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games

    Authors: Dylan J. Foster, Noah Golowich, Sham M. Kakade

    Abstract: We consider the problem of decentralized multi-agent reinforcement learning in Markov games. A fundamental question is whether there exist algorithms that, when adopted by all agents and run independently in a decentralized fashion, lead to no-regret for each player, analogous to celebrated convergence results in normal-form games. While recent work has shown that such algorithms exist for restric… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 51 pages

  16. arXiv:2301.08215  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient

    Authors: Dylan J. Foster, Noah Golowich, Yanjun Han

    Abstract: A foundational problem in reinforcement learning and interactive decision making is to understand what modeling assumptions lead to sample-efficient learning guarantees, and what algorithm design principles achieve optimal sample complexity. Recently, Foster et al. (2021) introduced the Decision-Estimation Coefficient (DEC), a measure of statistical complexity which leads to upper and lower bounds… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  17. arXiv:2211.14250  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Model-Free Reinforcement Learning with the Decision-Estimation Coefficient

    Authors: Dylan J. Foster, Noah Golowich, Jian Qian, Alexander Rakhlin, Ayush Sekhari

    Abstract: We consider the problem of interactive decision making, encompassing structured bandits and reinforcement learning with general function approximation. Recently, Foster et al. (2021) introduced the Decision-Estimation Coefficient, a measure of statistical complexity that lower bounds the optimal regret for interactive decision making, as well as a meta-algorithm, Estimation-to-Decisions, which ach… ▽ More

    Submitted 12 August, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: V2 changes: Improved writing and added more examples

  18. arXiv:2210.09769  [pdf, other

    cs.LG cs.GT math.OC

    STay-ON-the-Ridge: Guaranteed Convergence to Local Minimax Equilibrium in Nonconvex-Nonconcave Games

    Authors: Constantinos Daskalakis, Noah Golowich, Stratis Skoulakis, Manolis Zampetakis

    Abstract: Min-max optimization problems involving nonconvex-nonconcave objectives have found important applications in adversarial training and other multi-agent learning settings. Yet, no known gradient descent-based method is guaranteed to converge to (even local notions of) min-max equilibrium in the nonconvex-nonconcave setting. For all known methods, there exist relatively simple objectives for which t… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

  19. arXiv:2206.03446  [pdf, ps, other

    cs.LG cs.AI cs.DS math.OC stat.ML

    Learning in Observable POMDPs, without Computationally Intractable Oracles

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: Much of reinforcement learning theory is built on top of oracles that are computationally hard to implement. Specifically for learning near-optimal policies in Partially Observable Markov Decision Processes (POMDPs), existing algorithms either need to make strong assumptions about the model dynamics (e.g. deterministic transitions) or assume access to an oracle for solving a hard optimistic planni… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  20. arXiv:2204.03991  [pdf, ps, other

    cs.LG cs.GT

    The Complexity of Markov Equilibrium in Stochastic Games

    Authors: Constantinos Daskalakis, Noah Golowich, Kaiqing Zhang

    Abstract: We show that computing approximate stationary Markov coarse correlated equilibria (CCE) in general-sum stochastic games is computationally intractable, even when there are two players, the game is turn-based, the discount factor is an absolute constant, and the approximation is an absolute constant. Our intractability results stand in sharp contrast to normal-form games where exact CCEs are effici… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 50 pages

  21. arXiv:2202.04690  [pdf, ps, other

    stat.ML cs.LG

    Smoothed Online Learning is as Easy as Statistical Learning

    Authors: Adam Block, Yuval Dagan, Noah Golowich, Alexander Rakhlin

    Abstract: Much of modern learning theory has been split between two regimes: the classical offline setting, where data arrive independently, and the online setting, where data arrive adversarially. While the former model is often both computationally and statistically tractable, the latter requires no distributional assumptions. In an attempt to achieve the best of both worlds, previous work proposed the sm… ▽ More

    Submitted 31 May, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

  22. arXiv:2201.04735  [pdf, ps, other

    cs.LG cs.DS math.OC stat.ML

    Planning in Observable POMDPs in Quasipolynomial Time

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: Partially Observable Markov Decision Processes (POMDPs) are a natural and general model in reinforcement learning that take into account the agent's uncertainty about its current state. In the literature on POMDPs, it is customary to assume access to a planning oracle that computes an optimal policy when the parameters are known, even though the problem is known to be computationally hard. Almost… ▽ More

    Submitted 23 March, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: 52 pages

  23. arXiv:2111.12786  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Differentially Private Nonparametric Regression Under a Growth Condition

    Authors: Noah Golowich

    Abstract: Given a real-valued hypothesis class $\mathcal{H}$, we investigate under what conditions there is a differentially private algorithm which learns an optimal hypothesis from $\mathcal{H}$ given i.i.d. data. Inspired by recent results for the related setting of binary classification (Alon et al., 2019; Bun et al., 2020), where it was shown that online learnability of a binary class is necessary and… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 41 pages; appeared in COLT 2021

  24. arXiv:2111.08911  [pdf, ps, other

    cs.LG cs.DS cs.GT stat.ML

    Fast Rates for Nonparametric Online Learning: From Realizability to Learning in Games

    Authors: Constantinos Daskalakis, Noah Golowich

    Abstract: We study fast rates of convergence in the setting of nonparametric online regression, namely where regret is defined with respect to an arbitrary function class which has bounded complexity. Our contributions are two-fold: - In the realizable setting of nonparametric online regression with the absolute loss, we propose a randomized proper learning algorithm which gets a near-optimal cumulative l… ▽ More

    Submitted 12 April, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 61 pages

  25. Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-Player General-Sum Games

    Authors: Ioannis Anagnostides, Constantinos Daskalakis, Gabriele Farina, Maxwell Fishelson, Noah Golowich, Tuomas Sandholm

    Abstract: Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a multi-player general-sum normal-form game employ Optimistic Multiplicative Weights Update (OMWU), the external regret of every player is $O(\textrm{polylog}(T))$ after $T$ repetitions of the game. We extend their result from external regret to internal regret and swap regret, thereby establishing uncoup… ▽ More

    Submitted 24 January, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: Appeared at STOC 2022

  26. arXiv:2110.13052  [pdf, ps, other

    cs.LG cs.AI cs.DS math.OC stat.ML

    Can Q-Learning be Improved with Advice?

    Authors: Noah Golowich, Ankur Moitra

    Abstract: Despite rapid progress in theoretical reinforcement learning (RL) over the last few years, most of the known guarantees are worst-case in nature, failing to take advantage of structure that may be known a priori about a given RL problem at hand. In this paper we address the question of whether worst-case lower bounds for regret in online learning of Markov decision processes (MDPs) can be circumve… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  27. arXiv:2108.06924  [pdf, other

    cs.LG

    Near-Optimal No-Regret Learning in General Games

    Authors: Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich

    Abstract: We show that Optimistic Hedge -- a common variant of multiplicative-weights-updates with recency bias -- attains ${\rm poly}(\log T)$ regret in multi-player general-sum games. In particular, when every player of the game uses Optimistic Hedge to iteratively update her strategy in response to the history of play so far, then after $T$ rounds of interaction, each player experiences total regret that… ▽ More

    Submitted 24 January, 2023; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: 40 pages

  28. arXiv:2106.13513  [pdf, ps, other

    cs.LG cs.CR cs.DS

    Littlestone Classes are Privately Online Learnable

    Authors: Noah Golowich, Roi Livni

    Abstract: We consider the problem of online classification under a privacy constraint. In this setting a learner observes sequentially a stream of labelled examples $(x_t, y_t)$, for $1 \leq t \leq T$, and returns at each iteration $t$ a hypothesis $h_t$ which is used to predict the label of each new example $x_t$. The learner's performance is measured by her regret against a known hypothesis class… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

  29. arXiv:2102.06062  [pdf, other

    cs.LG cs.DS

    Deep Learning with Label Differential Privacy

    Authors: Badih Ghazi, Noah Golowich, Ravi Kumar, Pasin Manurangsi, Chiyuan Zhang

    Abstract: The Randomized Response (RR) algorithm is a classical technique to improve robustness in survey aggregation, and has been widely adopted in applications with differential privacy guarantees. We propose a novel algorithm, Randomized Response with Prior (RRWithPrior), which can provide more accurate results while maintaining the same level of privacy guaranteed by RR. We then apply RRWithPrior to le… ▽ More

    Submitted 26 October, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021; 29 pages, 6 figures

  30. arXiv:2101.04233  [pdf, other

    cs.LG

    Independent Policy Gradient Methods for Competitive Reinforcement Learning

    Authors: Constantinos Daskalakis, Dylan J. Foster, Noah Golowich

    Abstract: We obtain global, non-asymptotic convergence guarantees for independent learning algorithms in competitive reinforcement learning settings with two agents (i.e., zero-sum stochastic games). We consider an episodic setting where in each episode, each player independently selects a policy and observes only their own actions and rewards, along with the state. We show that if both players run policy g… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: Appeared at NeurIPS 2020

  31. arXiv:2012.03893  [pdf, other

    cs.LG cs.CR

    Sample-efficient proper PAC learning with approximate differential privacy

    Authors: Badih Ghazi, Noah Golowich, Ravi Kumar, Pasin Manurangsi

    Abstract: In this paper we prove that the sample complexity of properly learning a class of Littlestone dimension $d$ with approximate differential privacy is $\tilde O(d^6)$, ignoring privacy and accuracy parameters. This result answers a question of Bun et al. (FOCS 2020) by improving upon their upper bound of $2^{O(d)}$ on the sample complexity. Prior to our work, finiteness of the sample complexity for… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 40 pages

  32. arXiv:2010.13724  [pdf, ps, other

    cs.LG math.OC

    Tight last-iterate convergence rates for no-regret learning in multi-player games

    Authors: Noah Golowich, Sarath Pattathil, Constantinos Daskalakis

    Abstract: We study the question of obtaining last-iterate convergence rates for no-regret learning algorithms in multi-player games. We show that the optimistic gradient (OG) algorithm with a constant step-size, which is no-regret, achieves a last-iterate rate of $O(1/\sqrt{T})$ with respect to the gap function in smooth monotone games. This result addresses a question of Mertikopoulos & Zhou (2018), who as… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: To appear at NeurIPS 2020. 41 pages

  33. arXiv:2007.03668  [pdf, ps, other

    cs.LG math.CO stat.ML

    Near-tight closure bounds for Littlestone and threshold dimensions

    Authors: Badih Ghazi, Noah Golowich, Ravi Kumar, Pasin Manurangsi

    Abstract: We study closure properties for the Littlestone and threshold dimensions of binary hypothesis classes. Given classes $\mathcal{H}_1, \ldots, \mathcal{H}_k$ of Boolean functions with bounded Littlestone (respectively, threshold) dimension, we establish an upper bound on the Littlestone (respectively, threshold) dimension of the class defined by applying an arbitrary binary aggregation rule to… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 7 pages

  34. arXiv:2002.01919  [pdf, other

    cs.CR cs.DS

    Pure Differentially Private Summation from Anonymous Messages

    Authors: Badih Ghazi, Noah Golowich, Ravi Kumar, Pasin Manurangsi, Rasmus Pagh, Ameya Velingker

    Abstract: The shuffled (aka anonymous) model has recently generated significant interest as a candidate distributed privacy framework with trust assumptions better than the central model but with achievable errors smaller than the local model. We study pure differentially private (DP) protocols in the shuffled model for summation, a basic and widely used primitive: - For binary summation where each of n u… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: 40 pages, 3 figures

  35. arXiv:2002.00057  [pdf, ps, other

    cs.LG math.OC stat.ML

    Last Iterate is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems

    Authors: Noah Golowich, Sarath Pattathil, Constantinos Daskalakis, Asuman Ozdaglar

    Abstract: In this paper we study the smooth convex-concave saddle point problem. Specifically, we analyze the last iterate convergence properties of the Extragradient (EG) algorithm. It is well known that the ergodic (averaged) iterates of EG converge at a rate of $O(1/T)$ (Nemirovski, 2004). In this paper, we show that the last iterate of EG converges at a rate of $O(1/\sqrt{T})$. To the best of our knowle… ▽ More

    Submitted 6 July, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

    Comments: 27 pages

  36. arXiv:1909.00323  [pdf, other

    cs.IT

    Round Complexity of Common Randomness Generation: The Amortized Setting

    Authors: Noah Golowich, Madhu Sudan

    Abstract: We study the effect of rounds of interaction on the common randomness generation (CRG) problem. In the CRG problem, two parties, Alice and Bob, receive samples $X_i$ and $Y_i$, respectively, drawn jointly from a source distribution $μ$. The two parties wish to agree on a common random key consisting of many bits of randomness, by exchanging messages that depend on each party's input and the previo… ▽ More

    Submitted 1 September, 2019; originally announced September 2019.

    Comments: 41 pages

  37. arXiv:1908.11358  [pdf, other

    cs.CR cs.DS cs.LG stat.ML

    On the Power of Multiple Anonymous Messages

    Authors: Badih Ghazi, Noah Golowich, Ravi Kumar, Rasmus Pagh, Ameya Velingker

    Abstract: An exciting new development in differential privacy is the shuffled model, in which an anonymous channel enables non-interactive, differentially private protocols with error much smaller than what is possible in the local model, while relying on weaker trust assumptions than in the central model. In this paper, we study basic counting problems in the shuffled model and establish separations betwee… ▽ More

    Submitted 19 May, 2020; v1 submitted 29 August, 2019; originally announced August 2019.

    Comments: 70 pages, 2 figures, 3 tables

  38. arXiv:1810.02281  [pdf, other

    cs.LG cs.NE stat.ML

    A Convergence Analysis of Gradient Descent for Deep Linear Neural Networks

    Authors: Sanjeev Arora, Nadav Cohen, Noah Golowich, Wei Hu

    Abstract: We analyze speed of convergence to global optimum for gradient descent training a deep linear neural network (parameterized as $x \mapsto W_N W_{N-1} \cdots W_1 x$) by minimizing the $\ell_2$ loss over whitened data. Convergence at a linear rate is guaranteed when the following hold: (i) dimensions of hidden layers are at least the minimum of the input and output dimensions; (ii) weight matrices a… ▽ More

    Submitted 26 October, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: Published as a conference paper at ICLR 2019

  39. arXiv:1808.08907  [pdf, other

    cs.IT

    Communication-Rounds Tradeoffs for Common Randomness and Secret Key Generation

    Authors: Mitali Bafna, Badih Ghazi, Noah Golowich, Madhu Sudan

    Abstract: We study the role of interaction in the Common Randomness Generation (CRG) and Secret Key Generation (SKG) problems. In the CRG problem, two players, Alice and Bob, respectively get samples $X_1,X_2,\dots$ and $Y_1,Y_2,\dots$ with the pairs $(X_1,Y_1)$, $(X_2, Y_2)$, $\dots$ being drawn independently from some known probability distribution $μ$. They wish to communicate so as to agree on $L$ bits… ▽ More

    Submitted 27 August, 2018; originally announced August 2018.

    Comments: 41 pages, 3 figures

  40. arXiv:1801.02254  [pdf, other

    cs.LG

    Theory of Deep Learning IIb: Optimization Properties of SGD

    Authors: Chiyuan Zhang, Qianli Liao, Alexander Rakhlin, Brando Miranda, Noah Golowich, Tomaso Poggio

    Abstract: In Theory IIb we characterize with a mix of theory and experiments the optimization of deep convolutional networks by Stochastic Gradient Descent. The main new result in this paper is theoretical and experimental evidence for the following conjecture about SGD: SGD concentrates in probability -- like the classical Langevin equation -- on large volume, "flat" minima, selecting flat minimizers which… ▽ More

    Submitted 7 January, 2018; originally announced January 2018.

  41. arXiv:1712.06541  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Size-Independent Sample Complexity of Neural Networks

    Authors: Noah Golowich, Alexander Rakhlin, Ohad Shamir

    Abstract: We study the sample complexity of learning neural networks, by providing new bounds on their Rademacher complexity assuming norm constraints on the parameter matrix of each layer. Compared to previous work, these complexity bounds have improved dependence on the network depth, and under some additional assumptions, are fully independent of the network size (both depth and width). These results are… ▽ More

    Submitted 17 November, 2019; v1 submitted 18 December, 2017; originally announced December 2017.

    Comments: Fixed a bug in the proof of theorem 7 (not affecting theorem statement), by slightly changing the construction