
Showing 1–50 of 105 results for author: Daskalakis, C

Searching in archive cs.
  1. arXiv:2406.13668  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Improved bounds for calibration via stronger sign preservation games

    Authors: Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich, Robert Kleinberg, Princewill Okoroafor

    Abstract: A set of probabilistic forecasts is calibrated if each prediction of the forecaster closely approximates the empirical distribution of outcomes on the subset of timesteps where that prediction was made. We study the fundamental problem of online calibrated forecasting of binary sequences, which was initially studied by Foster & Vohra (1998). They derived an algorithm with $O(T^{2/3})$ calibration…

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.13116  [pdf, ps, other]

    cs.GT

    A Lower Bound on Swap Regret in Extensive-Form Games

    Authors: Constantinos Daskalakis, Gabriele Farina, Noah Golowich, Tuomas Sandholm, Brian Hu Zhang

    Abstract: Recent simultaneous works by Peng and Rubinstein [2024] and Dagan et al. [2024] have demonstrated the existence of a no-swap-regret learning algorithm that can reach $ε$ average swap regret against an adversary in any extensive-form game within $m^{\tilde{\mathcal O}(1/ε)}$ rounds, where $m$ is the number of nodes in the game tree. However, the question of whether a $\mathrm{poly}(m, 1/ε)$-round a…

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.11667  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Is Efficient PAC Learning Possible with an Oracle That Responds 'Yes' or 'No'?

    Authors: Constantinos Daskalakis, Noah Golowich

    Abstract: The empirical risk minimization (ERM) principle has been highly impactful in machine learning, leading both to near-optimal theoretical guarantees for ERM-based learning algorithms as well as driving many of the recent empirical successes in deep learning. In this paper, we investigate the question of whether the ability to perform ERM, which computes a hypothesis minimizing empirical risk on a gi…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  4. arXiv:2406.07920  [pdf, ps, other]

    cs.LG cs.AI cs.CC math.ST stat.ML

    Near-Optimal Learning and Planning in Separated Latent MDPs

    Authors: Fan Chen, Constantinos Daskalakis, Noah Golowich, Alexander Rakhlin

    Abstract: We study computational and statistical aspects of learning Latent Markov Decision Processes (LMDPs). In this model, the learner interacts with an MDP drawn at the beginning of each epoch from an unknown mixture of MDPs. To sidestep known impossibility results, we consider several notions of separation of the constituent MDPs. The main thrust of this paper is in establishing a nearly-sharp *statist…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  5. arXiv:2404.10177  [pdf, other]

    cs.CV cs.AI cs.LG

    Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data

    Authors: Giannis Daras, Alexandros G. Dimakis, Constantinos Daskalakis

    Abstract: Ambient diffusion is a recently proposed framework for training diffusion models using corrupted data. Both Ambient Diffusion and alternative SURE-based approaches for learning diffusion models from corrupted data resort to approximations which deteriorate performance. We present the first framework for training diffusion models that provably sample from the uncorrupted distribution given only noi…

    Submitted 20 March, 2024; originally announced April 2024.

    Comments: Preprint, work in progress. 19 pages, 9 figures

  6. arXiv:2403.08171  [pdf, other]

    cs.GT cs.LG

    Tractable Local Equilibria in Non-Concave Games

    Authors: Yang Cai, Constantinos Daskalakis, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng

    Abstract: While Online Gradient Descent and other no-regret learning procedures are known to efficiently converge to coarse correlated equilibrium in games where each agent's utility is concave in their own strategy, this is not the case when the utilities are non-concave, a situation that is common in machine learning applications where the agents' strategies are parameterized by deep neural networks, or t…

    Submitted 12 March, 2024; originally announced March 2024.

  7. arXiv:2310.19786  [pdf, ps, other]

    cs.LG cs.AI cs.GT

    From External to Swap Regret 2.0: An Efficient Reduction and Oblivious Adversary for Large Action Spaces

    Authors: Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich

    Abstract: We provide a novel reduction from swap-regret minimization to external-regret minimization, which improves upon the classical reductions of Blum-Mansour [BM07] and Stoltz-Lugosi [SL05] in that it does not require finiteness of the space of actions. We show that, whenever there exists a no-external-regret algorithm for some hypothesis class, there must also exist a no-swap-regret algorithm for that…

    Submitted 6 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  8. arXiv:2309.12226  [pdf, ps, other]

    cs.GT cs.DS cs.LG

    Smooth Nash Equilibria: Algorithms and Complexity

    Authors: Constantinos Daskalakis, Noah Golowich, Nika Haghtalab, Abhishek Shetty

    Abstract: A fundamental shortcoming of the concept of Nash equilibrium is its computational intractability: approximating Nash equilibria in normal-form games is PPAD-hard. In this paper, inspired by the ideas of smoothed analysis, we introduce a relaxed variant of Nash equilibrium called $σ$-smooth Nash equilibrium, for a smoothness parameter $σ$. In a $σ$-smooth Nash equilibrium, players only need to achi…

    Submitted 21 September, 2023; originally announced September 2023.

  9. arXiv:2307.01689  [pdf, ps, other]

    cs.LG cs.AI cs.GT stat.ML

    Online Learning and Solving Infinite Games with an ERM Oracle

    Authors: Angelos Assos, Idan Attias, Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson

    Abstract: While ERM suffices to attain near-optimal generalization error in the stochastic learning setting, this is not known to be the case in the online learning setting, where algorithms for general concept classes rely on computationally inefficient oracles such as the Standard Optimal Algorithm (SOA). In this work, we propose an algorithm for online binary classification setting that relies solely on…

    Submitted 10 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: In COLT2023

  10. arXiv:2302.09057  [pdf, other]

    cs.LG cs.AI cs.CV cs.IT

    Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent

    Authors: Giannis Daras, Yuval Dagan, Alexandros G. Dimakis, Constantinos Daskalakis

    Abstract: Imperfect score-matching leads to a shift between the training and the sampling distribution of diffusion models. Due to the recursive nature of the generation process, errors in previous steps yield sampling iterates that drift away from the training distribution. Yet, the standard training objective via Denoising Score Matching (DSM) is only designed to optimize over non-drifted data. To train o…

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: 29 pages, 8 figures

  11. arXiv:2302.03663  [pdf, other]

    cs.LG math.DS math.NA physics.data-an stat.ML

    SDYN-GANs: Adversarial Learning Methods for Multistep Generative Models for General Order Stochastic Dynamics

    Authors: Panos Stinis, Constantinos Daskalakis, Paul J. Atzberger

    Abstract: We introduce adversarial learning methods for data-driven generative modeling of the dynamics of $n^{th}$-order stochastic systems. Our approach builds on Generative Adversarial Networks (GANs) with generative model classes based on stable $m$-step stochastic numerical integrators. We introduce different formulations and training methods for learning models of stochastic dynamics based on observat…

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 7 figures

    Report number: PNNL-SA-181736

  12. arXiv:2211.13291  [pdf, ps, other]

    cs.LG cs.DS math.PR math.ST

    Learning and Testing Latent-Tree Ising Models Efficiently

    Authors: Davin Choo, Yuval Dagan, Constantinos Daskalakis, Anthimos Vardis Kandiros

    Abstract: We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide…

    Submitted 10 July, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  13. arXiv:2211.11904  [pdf, ps, other]

    cs.LG math.ST stat.ML

    EM's Convergence in Gaussian Latent Tree Models

    Authors: Yuval Dagan, Constantinos Daskalakis, Anthimos Vardis Kandiros

    Abstract: We study the optimization landscape of the log-likelihood function and the convergence of the Expectation-Maximization (EM) algorithm in latent Gaussian tree models, i.e. tree-structured Gaussian graphical models whose leaf nodes are observable and non-leaf nodes are unobservable. We show that the unique non-trivial stationary point of the population log-likelihood is its global maximum, and estab…

    Submitted 23 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

  14. arXiv:2210.09769  [pdf, other]

    cs.LG cs.GT math.OC

    STay-ON-the-Ridge: Guaranteed Convergence to Local Minimax Equilibrium in Nonconvex-Nonconcave Games

    Authors: Constantinos Daskalakis, Noah Golowich, Stratis Skoulakis, Manolis Zampetakis

    Abstract: Min-max optimization problems involving nonconvex-nonconcave objectives have found important applications in adversarial training and other multi-agent learning settings. Yet, no known gradient descent-based method is guaranteed to converge to (even local notions of) min-max equilibrium in the nonconvex-nonconcave setting. For all known methods, there exist relatively simple objectives for which t…

    Submitted 18 October, 2022; originally announced October 2022.

  15. arXiv:2208.12042  [pdf, other]

    stat.ME cs.LG

    Efficient Truncated Linear Regression with Unknown Noise Variance

    Authors: Constantinos Daskalakis, Patroklos Stefanou, Rui Yao, Manolis Zampetakis

    Abstract: Truncated linear regression is a classical challenge in Statistics, wherein a label, $y = w^T x + \varepsilon$, and its corresponding feature vector, $x \in \mathbb{R}^k$, are only observed if the label falls in some subset $S \subseteq \mathbb{R}$; otherwise the existence of the pair $(x, y)$ is hidden from observation. Linear regression with truncated observations has remained a challenge, in it…

    Submitted 25 August, 2022; originally announced August 2022.

  16. arXiv:2206.09104  [pdf, other]

    cs.LG cs.AI

    Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems

    Authors: Giannis Daras, Yuval Dagan, Alexandros G. Dimakis, Constantinos Daskalakis

    Abstract: We prove fast mixing and characterize the stationary distribution of the Langevin Algorithm for inverting random weighted DNN generators. This result extends the work of Hand and Voroninski from efficient inversion to efficient posterior sampling. In practice, to allow for increased expressivity, we propose to do posterior sampling in the latent space of a pre-trained generative model. To achieve…

    Submitted 22 June, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022. 32 pages, 9 Figures

  17. arXiv:2205.03246  [pdf, other]

    math.ST cs.DS cs.LG stat.ML

    What Makes A Good Fisherman? Linear Regression under Self-Selection Bias

    Authors: Yeshwanth Cherapanamjeri, Constantinos Daskalakis, Andrew Ilyas, Manolis Zampetakis

    Abstract: In the classical setting of self-selection, the goal is to learn $k$ models, simultaneously from observations $(x^{(i)}, y^{(i)})$ where $y^{(i)}$ is the output of one of $k$ underlying models on input $x^{(i)}$. In contrast to mixture models, where we observe the output of a randomly selected model, here the observed model depends on the outputs themselves, and is determined by some known selecti…

    Submitted 10 December, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

  18. arXiv:2205.02060  [pdf, ps, other]

    cs.GT cs.DS math.ST stat.ML

    Estimation of Standard Auction Models

    Authors: Yeshwanth Cherapanamjeri, Constantinos Daskalakis, Andrew Ilyas, Manolis Zampetakis

    Abstract: We provide efficient estimation methods for first- and second-price auctions under independent (asymmetric) private values and partial observability. Given a finite set of observations, each comprising the identity of the winner and the price they paid in a sequence of identical auctions, we provide algorithms for non-parametrically estimating the bid distribution of each bidder, as well as their…

    Submitted 4 May, 2022; originally announced May 2022.

  19. arXiv:2204.03991  [pdf, ps, other]

    cs.LG cs.GT

    The Complexity of Markov Equilibrium in Stochastic Games

    Authors: Constantinos Daskalakis, Noah Golowich, Kaiqing Zhang

    Abstract: We show that computing approximate stationary Markov coarse correlated equilibria (CCE) in general-sum stochastic games is computationally intractable, even when there are two players, the game is turn-based, the discount factor is an absolute constant, and the approximation is an absolute constant. Our intractability results stand in sharp contrast to normal-form games where exact CCEs are effici…

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 50 pages

  20. arXiv:2112.06410   

    stat.ML cs.LG math.ST

    How Good are Low-Rank Approximations in Gaussian Process Regression?

    Authors: Constantinos Daskalakis, Petros Dellaportas, Aristeidis Panos

    Abstract: We provide guarantees for approximate Gaussian Process (GP) regression resulting from two common low-rank kernel approximations: based on random Fourier features, and based on truncating the kernel's Mercer expansion. In particular, we bound the Kullback-Leibler divergence between an exact GP and one resulting from one of the afore-described low-rank approximations to its kernel, as well as betwee…

    Submitted 21 February, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: The arxiv ID of the correct article is arXiv:2004.01584. This arXiv article is redundant and it is not needed anymore

  21. arXiv:2111.08911  [pdf, ps, other]

    cs.LG cs.DS cs.GT stat.ML

    Fast Rates for Nonparametric Online Learning: From Realizability to Learning in Games

    Authors: Constantinos Daskalakis, Noah Golowich

    Abstract: We study fast rates of convergence in the setting of nonparametric online regression, namely where regret is defined with respect to an arbitrary function class which has bounded complexity. Our contributions are two-fold: - In the realizable setting of nonparametric online regression with the absolute loss, we propose a randomized proper learning algorithm which gets a near-optimal cumulative l…

    Submitted 12 April, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 61 pages

  22. Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-Player General-Sum Games

    Authors: Ioannis Anagnostides, Constantinos Daskalakis, Gabriele Farina, Maxwell Fishelson, Noah Golowich, Tuomas Sandholm

    Abstract: Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS'21) showed that if all agents in a multi-player general-sum normal-form game employ Optimistic Multiplicative Weights Update (OMWU), the external regret of every player is $O(\textrm{polylog}(T))$ after $T$ repetitions of the game. We extend their result from external regret to internal regret and swap regret, thereby establishing uncoup…

    Submitted 24 January, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: Appeared at STOC 2022

  23. arXiv:2110.12558  [pdf, ps, other]

    cs.GT cs.IR cs.LG stat.ML

    Recommender Systems meet Mechanism Design

    Authors: Yang Cai, Constantinos Daskalakis

    Abstract: Machine learning has developed a variety of tools for learning and representing high-dimensional distributions with structure. Recent years have also seen big advances in designing multi-item mechanisms. Akin to overfitting, however, these mechanisms can be extremely sensitive to the Bayesian prior that they target, which becomes problematic when that prior is only approximately known. At the same…

    Submitted 31 March, 2022; v1 submitted 24 October, 2021; originally announced October 2021.

  24. arXiv:2108.06924  [pdf, other]

    cs.LG

    Near-Optimal No-Regret Learning in General Games

    Authors: Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich

    Abstract: We show that Optimistic Hedge -- a common variant of multiplicative-weights-updates with recency bias -- attains ${\rm poly}(\log T)$ regret in multi-player general-sum games. In particular, when every player of the game uses Optimistic Hedge to iteratively update her strategy in response to the history of play so far, then after $T$ rounds of interaction, each player experiences total regret that…

    Submitted 24 January, 2023; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: 40 pages

  25. arXiv:2107.09773  [pdf, other]

    cs.LG math.ST stat.ML

    Statistical Estimation from Dependent Data

    Authors: Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Surbhi Goel, Anthimos Vardis Kandiros

    Abstract: We consider a general statistical estimation problem wherein binary labels across different observations are not independent conditioned on their feature vectors, but dependent, capturing settings where e.g. these observations are collected on a spatial domain, a temporal domain, or a social network, which induce dependencies. We model these dependencies in the language of Markov Random Fields and…

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: 41 pages, ICML 2021

  26. arXiv:2101.04233  [pdf, other]

    cs.LG

    Independent Policy Gradient Methods for Competitive Reinforcement Learning

    Authors: Constantinos Daskalakis, Dylan J. Foster, Noah Golowich

    Abstract: We obtain global, non-asymptotic convergence guarantees for independent learning algorithms in competitive reinforcement learning settings with two agents (i.e., zero-sum stochastic games). We consider an episodic setting where in each episode, each player independently selects a policy and observes only their own actions and rewards, along with the state. We show that if both players run policy g…

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: Appeared at NeurIPS 2020

  27. arXiv:2011.00364  [pdf, ps, other]

    math.OC cs.DS cs.LG stat.ML

    Efficient Methods for Structured Nonconvex-Nonconcave Min-Max Optimization

    Authors: Jelena Diakonikolas, Constantinos Daskalakis, Michael I. Jordan

    Abstract: The use of min-max optimization in adversarial training of deep neural network classifiers and training of generative adversarial networks has motivated the study of nonconvex-nonconcave optimization objectives, which frequently arise in these applications. Unfortunately, recent results have established that even approximate first-order stationary points of such objectives are intractable, even un…

    Submitted 27 February, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

    Comments: in Proc. AISTATS'21

  28. arXiv:2010.14864  [pdf, other]

    cs.LG cs.DS cs.IT stat.ML

    Sample-Optimal and Efficient Learning of Tree Ising models

    Authors: Constantinos Daskalakis, Qinxuan Pan

    Abstract: We show that $n$-variable tree-structured Ising models can be learned computationally-efficiently to within total variation distance $ε$ from an optimal $O(n \ln n/ε^2)$ samples, where $O(\cdot)$ hides an absolute constant which, importantly, does not depend on the model being learned - neither its tree nor the magnitude of its edge strengths, on which we place no assumptions. Our guarantees hold,…

    Submitted 29 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

  29. arXiv:2010.13724  [pdf, ps, other]

    cs.LG math.OC

    Tight last-iterate convergence rates for no-regret learning in multi-player games

    Authors: Noah Golowich, Sarath Pattathil, Constantinos Daskalakis

    Abstract: We study the question of obtaining last-iterate convergence rates for no-regret learning algorithms in multi-player games. We show that the optimistic gradient (OG) algorithm with a constant step-size, which is no-regret, achieves a last-iterate rate of $O(1/\sqrt{T})$ with respect to the gap function in smooth monotone games. This result addresses a question of Mertikopoulos & Zhou (2018), who as…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: To appear at NeurIPS 2020. 41 pages

  30. arXiv:2010.12000  [pdf, other]

    math.ST cs.DS cs.LG

    Computationally and Statistically Efficient Truncated Regression

    Authors: Constantinos Daskalakis, Themis Gouleakis, Christos Tzamos, Manolis Zampetakis

    Abstract: We provide a computationally and statistically efficient estimator for the classical problem of truncated linear regression, where the dependent variable $y = w^T x + ε$ and its corresponding vector of covariates $x \in R^k$ are only revealed if the dependent variable falls in some subset $S \subseteq R$; otherwise the existence of the pair $(x, y)$ is hidden. This problem has remained a challenge…

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2019

  31. arXiv:2009.09623  [pdf, other]

    cs.CC cs.LG math.OC

    The Complexity of Constrained Min-Max Optimization

    Authors: Constantinos Daskalakis, Stratis Skoulakis, Manolis Zampetakis

    Abstract: Despite its important applications in Machine Learning, min-max optimization of nonconvex-nonconcave objectives remains elusive. Not only are there no known first-order methods converging even to approximate local min-max points, but the computational complexity of identifying them is also poorly understood. In this paper, we provide a characterization of the computational complexity of the proble…

    Submitted 21 September, 2020; originally announced September 2020.

  32. arXiv:2008.01915  [pdf, other]

    cs.LG stat.ML

    Generative Ensemble Regression: Learning Particle Dynamics from Observations of Ensembles with Physics-Informed Deep Generative Models

    Authors: Liu Yang, Constantinos Daskalakis, George Em Karniadakis

    Abstract: We propose a new method for inferring the governing stochastic ordinary differential equations (SODEs) by observing particle ensembles at discrete and sparse time instants, i.e., multiple "snapshots". Particle coordinates at a single time instant, possibly noisy or truncated, are recorded in each snapshot but are unpaired across the snapshots. By training a physics-informed generative model that g…

    Submitted 20 March, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

  33. arXiv:2007.14539  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Truncated Linear Regression in High Dimensions

    Authors: Constantinos Daskalakis, Dhruv Rohatgi, Manolis Zampetakis

    Abstract: As in standard linear regression, in truncated linear regression, we are given access to observations $(A_i, y_i)_i$ whose dependent variable equals $y_i= A_i^{\rm T} \cdot x^* + η_i$, where $x^*$ is some fixed unknown vector of interest and $η_i$ is independent noise; except we are only given an observation if its dependent variable $y_i$ lies in some "truncation set" $S \subset \mathbb{R}$. The…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 30 pages, 1 figure

  34. arXiv:2006.04237  [pdf, other]

    cs.IT cs.LG math.OC math.PR

    Constant-Expansion Suffices for Compressed Sensing with Generative Priors

    Authors: Constantinos Daskalakis, Dhruv Rohatgi, Manolis Zampetakis

    Abstract: Generative neural networks have been empirically found very promising in providing effective structural priors for compressed sensing, since they can be trained to span low-dimensional data manifolds in high-dimensional signal spaces. Despite the non-convexity of the resulting optimization problem, it has also been shown theoretically that, for neural networks with random Gaussian weights, a signa…

    Submitted 26 June, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: 21 pages, 1 figure; added an application

  35. arXiv:2004.09370  [pdf, ps, other]

    math.ST cs.LG

    Learning Ising models from one or multiple samples

    Authors: Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Anthimos Vardis Kandiros

    Abstract: There have been two separate lines of work on estimating Ising models: (1) estimating them from multiple independent samples under minimal assumptions about the model's interaction matrix; and (2) estimating them from one sample in restrictive settings. We propose a unified framework that smoothly interpolates between these two settings, enabling significantly richer estimation guarantees from one…

    Submitted 10 December, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  36. arXiv:2004.01584  [pdf, other]

    stat.ML cs.LG

    How Good are Low-Rank Approximations in Gaussian Process Regression?

    Authors: Constantinos Daskalakis, Petros Dellaportas, Aristeidis Panos

    Abstract: We provide guarantees for approximate Gaussian Process (GP) regression resulting from two common low-rank kernel approximations: based on random Fourier features, and based on truncating the kernel's Mercer expansion. In particular, we bound the Kullback-Leibler divergence between an exact GP and one resulting from one of the afore-described low-rank approximations to its kernel, as well as betwee…

    Submitted 14 December, 2021; v1 submitted 3 April, 2020; originally announced April 2020.

  37. arXiv:2003.08259  [pdf, ps, other]

    stat.ML cs.LG

    Logistic-Regression with peer-group effects via inference in higher order Ising models

    Authors: Constantinos Daskalakis, Nishanth Dikkala, Ioannis Panageas

    Abstract: Spin glass models, such as the Sherrington-Kirkpatrick, Hopfield and Ising models, are all well-studied members of the exponential family of discrete distributions, and have been influential in a number of application domains where they are used to model correlation phenomena on networks. Conventionally these models have quadratic sufficient statistics and consequently capture correlations arising…

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: 16 pages

  38. arXiv:2003.00652  [pdf, other]

    cs.LG stat.ML

    GANs with Conditional Independence Graphs: On Subadditivity of Probability Divergences

    Authors: Mucong Ding, Constantinos Daskalakis, Soheil Feizi

    Abstract: Generative Adversarial Networks (GANs) are modern methods to learn the underlying distribution of a data set. GANs have been widely used in sample synthesis, de-noising, domain transfer, etc. GANs, however, are designed in a model-free fashion where no additional information about the underlying distribution is available. In many applications, however, practitioners have access to the underlying i…

    Submitted 25 February, 2021; v1 submitted 1 March, 2020; originally announced March 2020.

  39. arXiv:2002.06702  [pdf, ps, other]

    cs.GT econ.TH

    Multi-item Non-truthful Auctions Achieve Good Revenue

    Authors: Constantinos Daskalakis, Maxwell Fishelson, Brendan Lucier, Vasilis Syrgkanis, Santhoshini Velusamy

    Abstract: We present a general framework for designing approximately revenue-optimal mechanisms for multi-item additive auctions, which applies to both truthful and non-truthful auctions. Given a (not necessarily truthful) single-item auction format $A$ satisfying certain technical conditions, we run simultaneous item auctions augmented with a personalized entry fee for each bidder that must be paid before…

    Submitted 21 September, 2022; v1 submitted 16 February, 2020; originally announced February 2020.

  40. arXiv:2002.00057  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Last Iterate is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems

    Authors: Noah Golowich, Sarath Pattathil, Constantinos Daskalakis, Asuman Ozdaglar

    Abstract: In this paper we study the smooth convex-concave saddle point problem. Specifically, we analyze the last iterate convergence properties of the Extragradient (EG) algorithm. It is well known that the ergodic (averaged) iterates of EG converge at a rate of $O(1/T)$ (Nemirovski, 2004). In this paper, we show that the last iterate of EG converges at a rate of $O(1/\sqrt{T})$. To the best of our knowle…

    Submitted 6 July, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

    Comments: 27 pages

  41. arXiv:1911.02146  [pdf, other]

    cs.GT cs.DS cs.LG

    Multi-Item Mechanisms without Item-Independence: Learnability via Robustness

    Authors: Johaness Brustle, Yang Cai, Constantinos Daskalakis

    Abstract: We study the sample complexity of learning revenue-optimal multi-item auctions. We obtain the first set of positive results that go beyond the standard but unrealistic setting of item-independence. In particular, we consider settings where bidders' valuations are drawn from correlated distributions that can be captured by Markov Random Fields or Bayesian Networks -- two of the most prominent graph…

    Submitted 31 May, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

  42. arXiv:1910.07030  [pdf, other]

    cs.LG stat.ML

    SGD Learns One-Layer Networks in WGANs

    Authors: Qi Lei, Jason D. Lee, Alexandros G. Dimakis, Constantinos Daskalakis

    Abstract: Generative adversarial networks (GANs) are a widely used framework for learning generative models. Wasserstein GANs (WGANs), one of the most successful variants of GANs, require solving a minmax optimization problem to global optimality, but are in practice successfully trained using stochastic gradient descent-ascent. In this paper, we show that, when the generator is a one-layer network, stochas…

    Submitted 1 July, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

    Comments: 24 pages, 4 figures, ICML2020

  43. arXiv:1906.09247  [pdf, ps, other]

    cs.LG stat.ML

    Learning from weakly dependent data under Dobrushin's condition

    Authors: Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Siddhartha Jayanti

    Abstract: Statistical learning theory has largely focused on learning and generalization given independent and identically distributed (i.i.d.) samples. Motivated by applications involving time-series data, there has been a growing literature on learning and generalization in settings where data is sampled from an ergodic process. This work has also developed complexity measures, which appropriately extend…

    Submitted 21 June, 2019; originally announced June 2019.

  44. arXiv:1905.03353  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Regression from Dependent Observations

    Authors: Constantinos Daskalakis, Nishanth Dikkala, Ioannis Panageas

    Abstract: The standard linear and logistic regression models assume that the response variables are independent, but share the same linear relationship to their corresponding vectors of covariates. The assumption that the response variables are independent is, however, too strong. In many applications, these responses are collected on nodes of a network, or some spatial or temporal domain, and are dependent…

    Submitted 8 October, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: 33 pages, in proceedings of STOC 2019

  45. arXiv:1811.10581  [pdf, other]

    cs.LG stat.ML

    HOGWILD!-Gibbs can be PanAccurate

    Authors: Constantinos Daskalakis, Nishanth Dikkala, Siddhartha Jayanti

    Abstract: Asynchronous Gibbs sampling has been recently shown to be fast-mixing and an accurate method for estimating probabilities of events on a small number of variables of a graphical model satisfying Dobrushin's condition [DeSaOR16]. We investigate whether it can be used to accurately estimate expectations of functions of all the variables of the model. Under the same condition, we show that…

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: 19 pages, 3 figures, published at NeurIPS 2018

  46. arXiv:1810.11896  [pdf, ps, other]

    cs.DS cs.LG cs.NE stat.ML

    Smoothed Analysis of Discrete Tensor Decomposition and Assemblies of Neurons

    Authors: Nima Anari, Constantinos Daskalakis, Wolfgang Maass, Christos H. Papadimitriou, Amin Saberi, Santosh Vempala

    Abstract: We analyze linear independence of rank one tensors produced by tensor powers of randomly perturbed vectors. This enables efficient decomposition of sums of high-order tensors. Our analysis builds upon [BCMV14] but allows for a wider range of perturbation models, including discrete ones. We give an application to recovering assemblies of neurons. Assemblies are large sets of neurons representing…

    Submitted 28 October, 2018; originally announced October 2018.

    Comments: To appear in NIPS 2018

  47. arXiv:1809.03986  [pdf, other]

    math.ST cs.DS cs.LG stat.CO stat.ML

    Efficient Statistics, in High Dimensions, from Truncated Samples

    Authors: Constantinos Daskalakis, Themis Gouleakis, Christos Tzamos, Manolis Zampetakis

    Abstract: We provide an efficient algorithm for the classical problem, going back to Galton, Pearson, and Fisher, of estimating, with arbitrary accuracy, the parameters of a multivariate normal distribution from truncated samples. Truncated samples from a $d$-variate normal $\mathcal{N}(\boldsymbol{\mu}, \boldsymbol{\Sigma})$ means a sample is only revealed if it falls in some subset $S \subseteq \mathbb{R}^d$; otherwise the samp…

    Submitted 22 October, 2020; v1 submitted 11 September, 2018; originally announced September 2018.

    Comments: Appeared at 59th Annual IEEE Symposium on Foundations of Computer Science (FOCS), 2018

  48. arXiv:1807.04252  [pdf, other]

    math.OC cs.GT stat.ML

    Last-Iterate Convergence: Zero-Sum Games and Constrained Min-Max Optimization

    Authors: Constantinos Daskalakis, Ioannis Panageas

    Abstract: Motivated by applications in Game Theory, Optimization, and Generative Adversarial Networks, recent work of Daskalakis et al. [DISZ17] and follow-up work of Liang and Stokes [LiangS18] have established that a variant of the widely used Gradient Descent/Ascent procedure, called "Optimistic Gradient Descent/Ascent (OGDA)", exhibits last-iterate convergence to saddle points in unconstra…

    Submitted 2 December, 2020; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: Appeared in ITCS 2019

  49. arXiv:1807.03907  [pdf, other]

    math.OC cs.LG stat.ML

    The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization

    Authors: Constantinos Daskalakis, Ioannis Panageas

    Abstract: Motivated by applications in Optimization, Game Theory, and the training of Generative Adversarial Networks, the convergence properties of first order methods in min-max problems have received extensive study. It has been recognized that they may cycle, and there is no good understanding of their limit points when they do not. When they converge, do they converge to local min-max solutions? We cha…

    Submitted 10 July, 2018; originally announced July 2018.

  50. arXiv:1805.09697  [pdf, ps, other]

    cs.DS cs.AI stat.ML

    Learning and Testing Causal Models with Interventions

    Authors: Jayadev Acharya, Arnab Bhattacharyya, Constantinos Daskalakis, Saravanan Kandasamy

    Abstract: We consider testing and learning problems on causal Bayesian networks as defined by Pearl (Pearl, 2009). Given a causal Bayesian network $\mathcal{M}$ on a graph with $n$ discrete variables and bounded in-degree and bounded 'confounded components', we show that $O(\log n)$ interventions on an unknown causal Bayesian network $\mathcal{X}$ on the same graph, and $\tilde{O}(n/ε^2)$ samples per interv…

    Submitted 24 May, 2018; originally announced May 2018.