Skip to main content

Showing 1–25 of 25 results for author: Dagan, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13668  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Improved bounds for calibration via stronger sign preservation games

    Authors: Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich, Robert Kleinberg, Princewill Okoroafor

    Abstract: A set of probabilistic forecasts is calibrated if each prediction of the forecaster closely approximates the empirical distribution of outcomes on the subset of timesteps where that prediction was made. We study the fundamental problem of online calibrated forecasting of binary sequences, which was initially studied by Foster & Vohra (1998). They derived an algorithm with $O(T^{2/3})$ calibration… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.03855  [pdf, other

    cs.CL

    Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As

    Authors: Eden Avnat, Michal Levy, Daniel Herstain, Elia Yanko, Daniel Ben Joya, Michal Tzuchman Katz, Dafna Eshel, Sahar Laros, Yael Dagan, Shahar Barami, Joseph Mermelstein, Shahar Ovadia, Noam Shomron, Varda Shalev, Raja-Elie E. Abdulnour

    Abstract: Clinical problem-solving requires processing of semantic medical knowledge such as illness scripts and numerical medical knowledge of diagnostic tests for evidence-based decision-making. As large language models (LLMs) show promising results in many aspects of language-based clinical practice, their ability to generate non-language evidence-based answers to clinical questions is inherently limited… ▽ More

    Submitted 1 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2310.19786  [pdf, ps, other

    cs.LG cs.AI cs.GT

    From External to Swap Regret 2.0: An Efficient Reduction and Oblivious Adversary for Large Action Spaces

    Authors: Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson, Noah Golowich

    Abstract: We provide a novel reduction from swap-regret minimization to external-regret minimization, which improves upon the classical reductions of Blum-Mansour [BM07] and Stolz-Lugosi [SL05] in that it does not require finiteness of the space of actions. We show that, whenever there exists a no-external-regret algorithm for some hypothesis class, there must also exist a no-swap-regret algorithm for that… ▽ More

    Submitted 6 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  4. arXiv:2307.01689  [pdf, ps, other

    cs.LG cs.AI cs.GT stat.ML

    Online Learning and Solving Infinite Games with an ERM Oracle

    Authors: Angelos Assos, Idan Attias, Yuval Dagan, Constantinos Daskalakis, Maxwell Fishelson

    Abstract: While ERM suffices to attain near-optimal generalization error in the stochastic learning setting, this is not known to be the case in the online learning setting, where algorithms for general concept classes rely on computationally inefficient oracles such as the Standard Optimal Algorithm (SOA). In this work, we propose an algorithm for online binary classification setting that relies solely on… ▽ More

    Submitted 10 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: In COLT2023

  5. arXiv:2305.19256  [pdf, other

    cs.LG cs.AI cs.CV cs.IT

    Ambient Diffusion: Learning Clean Distributions from Corrupted Data

    Authors: Giannis Daras, Kulin Shah, Yuval Dagan, Aravind Gollakota, Alexandros G. Dimakis, Adam Klivans

    Abstract: We present the first diffusion-based framework that can learn an unknown distribution using only highly-corrupted samples. This problem arises in scientific applications where access to uncorrupted samples is impossible or expensive to acquire. Another benefit of our approach is the ability to train generative models that are less likely to memorize individual training samples since they never obs… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 24 pages, 11 figures

  6. arXiv:2302.09057  [pdf, other

    cs.LG cs.AI cs.CV cs.IT

    Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent

    Authors: Giannis Daras, Yuval Dagan, Alexandros G. Dimakis, Constantinos Daskalakis

    Abstract: Imperfect score-matching leads to a shift between the training and the sampling distribution of diffusion models. Due to the recursive nature of the generation process, errors in previous steps yield sampling iterates that drift away from the training distribution. Yet, the standard training objective via Denoising Score Matching (DSM) is only designed to optimize over non-drifted data. To train o… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: 29 pages, 8 figures

  7. arXiv:2211.13291  [pdf, ps, other

    cs.LG cs.DS math.PR math.ST

    Learning and Testing Latent-Tree Ising Models Efficiently

    Authors: Davin Choo, Yuval Dagan, Constantinos Daskalakis, Anthimos Vardis Kandiros

    Abstract: We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide… ▽ More

    Submitted 10 July, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  8. arXiv:2211.11904  [pdf, ps, other

    cs.LG math.ST stat.ML

    EM's Convergence in Gaussian Latent Tree Models

    Authors: Yuval Dagan, Constantinos Daskalakis, Anthimos Vardis Kandiros

    Abstract: We study the optimization landscape of the log-likelihood function and the convergence of the Expectation-Maximization (EM) algorithm in latent Gaussian tree models, i.e. tree-structured Gaussian graphical models whose leaf nodes are observable and non-leaf nodes are unobservable. We show that the unique non-trivial stationary point of the population log-likelihood is its global maximum, and estab… ▽ More

    Submitted 23 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

  9. arXiv:2206.09104  [pdf, other

    cs.LG cs.AI

    Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems

    Authors: Giannis Daras, Yuval Dagan, Alexandros G. Dimakis, Constantinos Daskalakis

    Abstract: We prove fast mixing and characterize the stationary distribution of the Langevin Algorithm for inverting random weighted DNN generators. This result extends the work of Hand and Voroninski from efficient inversion to efficient posterior sampling. In practice, to allow for increased expressivity, we propose to do posterior sampling in the latent space of a pre-trained generative model. To achieve… ▽ More

    Submitted 22 June, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022. 32 pages, 9 Figures

  10. arXiv:2202.04690  [pdf, ps, other

    stat.ML cs.LG

    Smoothed Online Learning is as Easy as Statistical Learning

    Authors: Adam Block, Yuval Dagan, Noah Golowich, Alexander Rakhlin

    Abstract: Much of modern learning theory has been split between two regimes: the classical offline setting, where data arrive independently, and the online setting, where data arrive adversarially. While the former model is often both computationally and statistically tractable, the latter requires no distributional assumptions. In an attempt to achieve the best of both worlds, previous work proposed the sm… ▽ More

    Submitted 31 May, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

  11. arXiv:2107.09773  [pdf, other

    cs.LG math.ST stat.ML

    Statistical Estimation from Dependent Data

    Authors: Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Surbhi Goel, Anthimos Vardis Kandiros

    Abstract: We consider a general statistical estimation problem wherein binary labels across different observations are not independent conditioned on their feature vectors, but dependent, capturing settings where e.g. these observations are collected on a spatial domain, a temporal domain, or a social network, which induce dependencies. We model these dependencies in the language of Markov Random Fields and… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: 41 pages, ICML 2021

  12. arXiv:2102.01729  [pdf, ps, other

    stat.ML cs.LG

    Majorizing Measures, Sequential Complexities, and Online Learning

    Authors: Adam Block, Yuval Dagan, Sasha Rakhlin

    Abstract: We introduce the technique of generic chaining and majorizing measures for controlling sequential Rademacher complexity. We relate majorizing measures to the notion of fractional covering numbers, which we show to be dominated in terms of sequential scale-sensitive dimensions in a horizon-independent way, and, under additional complexity assumptions establish a tight control on worst-case sequenti… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

  13. arXiv:2101.09054  [pdf, other

    cs.LG cs.CR cs.DS math.ST stat.ML

    Adversarial Laws of Large Numbers and Optimal Regret in Online Classification

    Authors: Noga Alon, Omri Ben-Eliezer, Yuval Dagan, Shay Moran, Moni Naor, Eylon Yogev

    Abstract: Laws of large numbers guarantee that given a large enough sample from some population, the measure of any fixed sub-population is well-estimated by its frequency in the sample. We study laws of large numbers in sampling processes that can affect the environment they are acting upon and interact with it. Specifically, we consider the sequential sampling model proposed by Ben-Eliezer and Yogev (2020… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

  14. arXiv:2012.03817  [pdf, other

    cs.DS cs.CR cs.LG

    A bounded-noise mechanism for differential privacy

    Authors: Yuval Dagan, Gil Kur

    Abstract: We present an asymptotically optimal $(ε,δ)$ differentially private mechanism for answering multiple, adaptively asked, $Δ$-sensitive queries, settling the conjecture of Steinke and Ullman [2020]. Our algorithm has a significant advantage that it adds independent bounded noise to each query, thus providing an absolute error bound. Additionally, we apply our algorithm in adaptive data analysis, obt… ▽ More

    Submitted 6 November, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

  15. arXiv:2004.09370  [pdf, ps, other

    math.ST cs.LG

    Learning Ising models from one or multiple samples

    Authors: Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Anthimos Vardis Kandiros

    Abstract: There have been two separate lines of work on estimating Ising models: (1) estimating them from multiple independent samples under minimal assumptions about the model's interaction matrix; and (2) estimating them from one sample in restrictive settings. We propose a unified framework that smoothly interpolates between these two settings, enabling significantly richer estimation guarantees from one… ▽ More

    Submitted 10 December, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  16. arXiv:1911.10541  [pdf, ps, other

    cs.LG cs.CR cs.DS stat.ML

    PAC learning with stable and private predictions

    Authors: Yuval Dagan, Vitaly Feldman

    Abstract: We study binary classification algorithms for which the prediction on any point is not too sensitive to individual examples in the dataset. Specifically, we consider the notions of uniform stability (Bousquet and Elisseeff, 2001) and prediction privacy (Dwork and Feldman, 2018). Previous work on these notions shows how they can be achieved in the standard PAC model via simple aggregation of models… ▽ More

    Submitted 23 September, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

  17. arXiv:1911.04014  [pdf, ps, other

    cs.LG cs.CR cs.DS stat.ML

    Interaction is necessary for distributed learning with privacy or communication constraints

    Authors: Yuval Dagan, Vitaly Feldman

    Abstract: Local differential privacy (LDP) is a model where users send privatized data to an untrusted central server whose goal it to solve some data analysis task. In the non-interactive version of this model the protocol consists of a single round in which a server sends requests to all users then receives their responses. This version is deployed in industry due to its practical advantages and has attra… ▽ More

    Submitted 23 September, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

  18. arXiv:1906.09247  [pdf, ps, other

    cs.LG stat.ML

    Learning from weakly dependent data under Dobrushin's condition

    Authors: Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Siddhartha Jayanti

    Abstract: Statistical learning theory has largely focused on learning and generalization given independent and identically distributed (i.i.d.) samples. Motivated by applications involving time-series data, there has been a growing literature on learning and generalization in settings where data is sampled from an ergodic process. This work has also developed complexity measures, which appropriately extend… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

  19. arXiv:1903.05315  [pdf, ps, other

    math.ST cs.LG

    Optimality of Maximum Likelihood for Log-Concave Density Estimation and Bounded Convex Regression

    Authors: Gil Kur, Yuval Dagan, Alexander Rakhlin

    Abstract: In this paper, we study two problems: (1) estimation of a $d$-dimensional log-concave distribution and (2) bounded multivariate convex regression with random design with an underlying log-concave density or a compactly supported distribution with a continuous density. First, we show that for all $d \ge 4$ the maximum likelihood estimators of both problems achieve an optimal risk of… ▽ More

    Submitted 20 February, 2020; v1 submitted 13 March, 2019; originally announced March 2019.

    MSC Class: 62G07 62G08

  20. arXiv:1902.03498  [pdf, ps, other

    cs.LG stat.ML

    Space lower bounds for linear prediction in the streaming model

    Authors: Yuval Dagan, Gil Kur, Ohad Shamir

    Abstract: We show that fundamental learning tasks, such as finding an approximate linear separator or linear regression, require memory at least \emph{quadratic} in the dimension, in a natural streaming setting. This implies that such problems cannot be solved (at least in this setting) by scalable memory-efficient streaming algorithms. Our results build on a memory lower bound for a simple linear-algebraic… ▽ More

    Submitted 11 June, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: Added a minor correction in referencing the prior work

  21. arXiv:1811.02177  [pdf, ps, other

    cs.DS cs.DM math.CO

    The entropy of lies: playing twenty questions with a liar

    Authors: Yuval Dagan, Yuval Filmus, Daniel Kane, Shay Moran

    Abstract: `Twenty questions' is a guessing game played by two players: Bob thinks of an integer between $1$ and $n$, and Alice's goal is to recover it using a minimal number of Yes/No questions. Shannon's entropy has a natural interpretation in this context. It characterizes the average number of questions used by an optimal strategy in the distributional variant of the game: let $μ$ be a distribution over… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

  22. arXiv:1803.10415  [pdf, other

    cs.LG stat.ML

    A Better Resource Allocation Algorithm with Semi-Bandit Feedback

    Authors: Yuval Dagan, Koby Crammer

    Abstract: We study a sequential resource allocation problem between a fixed number of arms. On each iteration the algorithm distributes a resource among the arms in order to maximize the expected success rate. Allocating more of the resource to a given arm increases the probability that it succeeds, yet with a cut-off. We follow Lattimore et al. (2014) and assume that the probability increases linearly unti… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

  23. arXiv:1803.01420  [pdf, ps, other

    cs.LG stat.ML

    Detecting Correlations with Little Memory and Communication

    Authors: Yuval Dagan, Ohad Shamir

    Abstract: We study the problem of identifying correlations in multivariate data, under information constraints: Either on the amount of memory that can be used by the algorithm, or the amount of communication when the data is distributed across several machines. We prove a tight trade-off between the memory/communication complexity and the sample complexity, implying (for example) that to detect pairwise co… ▽ More

    Submitted 6 June, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

    Comments: Accepted for presentation at Conference on Learning Theory (COLT) 2018. Changes: Added a comparison to Raz [2016]; Corrected typos; Added references

  24. arXiv:1611.06650  [pdf, ps, other

    cs.CC

    Trading information complexity for error

    Authors: Yuval Dagan, Yuval Filmus, Hamed Hatami, Yaqiao Li

    Abstract: We consider the standard two-party communication model. The central problem studied in this article is how much one can save in information complexity by allowing an error of $ε$. For arbitrary functions, we obtain lower bounds and upper bounds indicating a gain that is of order $Ω(h(ε))$ and $O(h(\sqrtε))$. Here $h$ denotes the binary entropy function. We analyze the case of the two-bit AND fun… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

  25. arXiv:1611.01655  [pdf, ps, other

    cs.DM cs.DS cs.IT cs.LG math.CO

    Twenty (simple) questions

    Authors: Yuval Dagan, Yuval Filmus, Ariel Gabizon, Shay Moran

    Abstract: A basic combinatorial interpretation of Shannon's entropy function is via the "20 questions" game. This cooperative game is played by two players, Alice and Bob: Alice picks a distribution $π$ over the numbers $\{1,\ldots,n\}$, and announces it to Bob. She then chooses a number $x$ according to $π$, and Bob attempts to identify $x$ using as few Yes/No queries as possible, on average. An optimal… ▽ More

    Submitted 25 April, 2017; v1 submitted 5 November, 2016; originally announced November 2016.

    Comments: 33 pages; to appear in STOC 2017