Skip to main content

Showing 1–22 of 22 results for author: Levy, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.02951  [pdf, other

    cs.LG cs.DC stat.ML

    Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine Workers

    Authors: Ron Dorfman, Naseem Yehya, Kfir Y. Levy

    Abstract: Byzantine-robust learning has emerged as a prominent fault-tolerant distributed machine learning framework. However, most techniques focus on the static setting, wherein the identity of Byzantine workers remains unchanged throughout the learning process. This assumption fails to capture real-world dynamic Byzantine behaviors, which may include intermittent malfunctions or targeted, time-limited at… ▽ More

    Submitted 16 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  2. arXiv:2307.02295  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-Learning Adversarial Bandit Algorithms

    Authors: Mikhail Khodak, Ilya Osadchiy, Keegan Harris, Maria-Florina Balcan, Kfir Y. Levy, Ron Meir, Zhiwei Steven Wu

    Abstract: We study online meta-learning with bandit feedback, with the goal of improving performance across multiple tasks if they are similar according to some natural similarity measure. As the first to target the adversarial online-within-online partial-information setting, we design meta-algorithms that combine outer learners to simultaneously tune the initialization and other hyperparameters of an inne… ▽ More

    Submitted 1 November, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Merger of arXiv:2205.14128 and arXiv:2205.15921, with some additional improvements; to appear in NeurIPS 2023

  3. arXiv:2302.00543  [pdf, other

    cs.LG cs.AI stat.ML

    DoCoFL: Downlink Compression for Cross-Device Federated Learning

    Authors: Ron Dorfman, Shay Vargaftik, Yaniv Ben-Itzhak, Kfir Y. Levy

    Abstract: Many compression techniques have been proposed to reduce the communication overhead of Federated Learning training procedures. However, these are typically designed for compressing model updates, which are expected to decay throughout training. As a result, such methods are inapplicable to downlink (i.e., from the parameter server to clients) compression in the cross-device setting, where heteroge… ▽ More

    Submitted 13 July, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  4. arXiv:2205.15921  [pdf, ps, other

    cs.LG stat.ML

    Online Meta-Learning in Adversarial Multi-Armed Bandits

    Authors: Ilya Osadchiy, Kfir Y. Levy, Ron Meir

    Abstract: We study meta-learning for adversarial multi-armed bandits. We consider the online-within-online setup, in which a player (learner) encounters a sequence of multi-armed bandit episodes. The player's performance is measured as regret against the best arm in each episode, according to the losses generated by an adversary. The difficulty of the problem depends on the empirical distribution of the per… ▽ More

    Submitted 12 July, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: v1: The paper is submitted to NeurIPS 2022. An older version was rejected from ICML 2022 v2: Added a reference to concurrent work in Prior Art section

  5. arXiv:2202.04428  [pdf, other

    cs.LG stat.ML

    Adapting to Mixing Time in Stochastic Optimization with Markovian Data

    Authors: Ron Dorfman, Kfir Y. Levy

    Abstract: We consider stochastic optimization problems where data is drawn from a Markov chain. Existing methods for this setting crucially rely on knowing the mixing time of the chain, which in real-world applications is usually unknown. We propose the first optimization method that does not require the knowledge of the mixing time, yet obtains the optimal asymptotic convergence rate when applied to convex… ▽ More

    Submitted 13 July, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  6. arXiv:2002.03419  [pdf, other

    q-bio.PE stat.AP

    The Alzheimer's Disease Prediction Of Longitudinal Evolution (TADPOLE) Challenge: Results after 1 Year Follow-up

    Authors: Razvan V. Marinescu, Neil P. Oxtoby, Alexandra L. Young, Esther E. Bron, Arthur W. Toga, Michael W. Weiner, Frederik Barkhof, Nick C. Fox, Arman Eshaghi, Tina Toni, Marcin Salaterski, Veronika Lunina, Manon Ansart, Stanley Durrleman, Pascal Lu, Samuel Iddi, Dan Li, Wesley K. Thompson, Michael C. Donohue, Aviv Nahon, Yarden Levy, Dan Halbersberg, Mariya Cohen, Huiling Liao, Tengfei Li , et al. (71 additional authors not shown)

    Abstract: We present the findings of "The Alzheimer's Disease Prediction Of Longitudinal Evolution" (TADPOLE) Challenge, which compared the performance of 92 algorithms from 33 international teams at predicting the future trajectory of 219 individuals at risk of Alzheimer's disease. Challenge participants were required to make a prediction, for each month of a 5-year future time period, of three key outcome… ▽ More

    Submitted 27 December, 2021; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: Presents final results of the TADPOLE competition. 60 pages, 7 tables, 14 figures

    Journal ref: Machine Learning for Biomedical Imaging (MELBA), Dec 2021

  7. arXiv:1910.12511  [pdf, other

    cs.LG stat.ML

    Adaptive Sampling for Stochastic Risk-Averse Learning

    Authors: Sebastian Curi, Kfir. Y. Levy, Stefanie Jegelka, Andreas Krause

    Abstract: In high-stakes machine learning applications, it is crucial to not only perform well on average, but also when restricted to difficult examples. To address this, we consider the problem of training models in a risk-averse manner. We propose an adaptive sampling algorithm for stochastically optimizing the Conditional Value-at-Risk (CVaR) of a loss distribution, which measures its performance on the… ▽ More

    Submitted 6 November, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

  8. arXiv:1903.12416  [pdf, other

    cs.LG stat.ML

    Online Variance Reduction with Mixtures

    Authors: Zalán Borsos, Sebastian Curi, Kfir Y. Levy, Andreas Krause

    Abstract: Adaptive importance sampling for stochastic optimization is a promising approach that offers improved convergence through variance reduction. In this work, we propose a new framework for variance reduction that enables the use of mixtures over predefined sampling distributions, which can naturally encode prior knowledge about the data. While these sampling distributions are fixed, the mixture weig… ▽ More

    Submitted 29 March, 2019; originally announced March 2019.

  9. arXiv:1902.08036  [pdf, other

    cs.LG stat.ML

    Multi-Player Bandits: The Adversarial Case

    Authors: Pragnya Alatur, Kfir Y. Levy, Andreas Krause

    Abstract: We consider a setting where multiple players sequentially choose among a common set of actions (arms). Motivated by a cognitive radio networks application, we assume that players incur a loss upon colliding, and that communication between players is not possible. Existing approaches assume that the system is stationary. Yet this assumption is often violated in practice, e.g., due to signal strengt… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

  10. arXiv:1902.01637  [pdf, ps, other

    cs.LG math.OC stat.ML

    A Universal Algorithm for Variational Inequalities Adaptive to Smoothness and Noise

    Authors: Francis Bach, Kfir Y. Levy

    Abstract: We consider variational inequalities coming from monotone operators, a setting that includes convex minimization and convex-concave saddle-point problems. We assume an access to potentially noisy unbiased values of the monotone operators and assess convergence through a compatible gap function which corresponds to the standard optimality criteria in the aforementioned subcases. We present a univer… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

  11. arXiv:1811.05512  [pdf, other

    cs.LG stat.ML

    A domain agnostic measure for monitoring and evaluating GANs

    Authors: Paulina Grnarova, Kfir Y Levy, Aurelien Lucchi, Nathanael Perraudin, Ian Goodfellow, Thomas Hofmann, Andreas Krause

    Abstract: Generative Adversarial Networks (GANs) have shown remarkable results in modeling complex distributions, but their evaluation remains an unsettled issue. Evaluations are essential for: (i) relative assessment of different models and (ii) monitoring the progress of a single model throughout training. The latter cannot be determined by simply inspecting the generator and discriminator loss curves as… ▽ More

    Submitted 15 July, 2020; v1 submitted 13 November, 2018; originally announced November 2018.

  12. arXiv:1809.02864  [pdf, other

    cs.LG math.OC stat.ML

    Online Adaptive Methods, Universality and Acceleration

    Authors: Kfir Y. Levy, Alp Yurtsever, Volkan Cevher

    Abstract: We present a novel method for convex unconstrained optimization that, without any modifications, ensures: (i) accelerated convergence rate for smooth objectives, (ii) standard convergence rate in the general (non-smooth) setting, and (iii) standard convergence rate in the stochastic optimization setting. To the best of our knowledge, this is the first method that simultaneously applies to all of t… ▽ More

    Submitted 8 September, 2018; originally announced September 2018.

  13. arXiv:1806.07200  [pdf, other

    cs.LG eess.SY stat.ML

    Adaptive Input Estimation in Linear Dynamical Systems with Applications to Learning-from-Observations

    Authors: Sebastian Curi, Kfir Y. Levy, Andreas Krause

    Abstract: We address the problem of estimating the inputs of a dynamical system from measurements of the system's outputs. To this end, we introduce a novel estimation algorithm that explicitly trades off bias and variance to optimally reduce the overall estimation error. This optimal trade-off is done efficiently and adaptively in every time step. Experimentally, we show that our method often produces esti… ▽ More

    Submitted 19 September, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: CDC 2019

  14. arXiv:1805.08079  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Faster Neural Network Training with Approximate Tensor Operations

    Authors: Menachem Adelman, Kfir Y. Levy, Ido Hakimi, Mark Silberstein

    Abstract: We propose a novel technique for faster deep neural network training which systematically applies sample-based approximation to the constituent tensor operations, i.e., matrix multiplications and convolutions. We introduce new sampling techniques, study their theoretical properties, and prove that they provide the same convergence guarantees when applied to SGD training. We apply approximate tenso… ▽ More

    Submitted 25 October, 2021; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2021 camera ready

  15. arXiv:1805.06792  [pdf, ps, other

    cs.LG stat.ML

    Faster Rates for Convex-Concave Games

    Authors: Jacob Abernethy, Kevin A. Lai, Kfir Y. Levy, Jun-Kun Wang

    Abstract: We consider the use of no-regret algorithms to compute equilibria for particular classes of convex-concave games. While standard regret bounds would lead to convergence rates on the order of $O(T^{-1/2})$, recent work \citep{RS13,SALS15} has established $O(1/T)$ rates by taking advantage of a particular class of optimistic prediction algorithms. In this work we go further, showing that for a parti… ▽ More

    Submitted 17 May, 2018; originally announced May 2018.

    Comments: COLT 2018

  16. arXiv:1802.04715  [pdf, other

    stat.ML cs.LG

    Online Variance Reduction for Stochastic Optimization

    Authors: Zalán Borsos, Andreas Krause, Kfir Y. Levy

    Abstract: Modern stochastic optimization methods often rely on uniform sampling which is agnostic to the underlying characteristics of the data. This might degrade the convergence by yielding estimates that suffer from a high variance. A possible remedy is to employ non-uniform importance sampling techniques, which take the structure of the dataset into account. In this work, we investigate a recently propo… ▽ More

    Submitted 6 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: COLT 2018

  17. arXiv:1711.02515  [pdf, other

    cs.LG cs.AI stat.ML

    Continuous DR-submodular Maximization: Structure and Algorithms

    Authors: An Bian, Kfir Y. Levy, Andreas Krause, Joachim M. Buhmann

    Abstract: DR-submodular continuous functions are important objectives with wide real-world applications spanning MAP inference in determinantal point processes (DPPs), and mean-field inference for probabilistic submodular models, amongst others. DR-submodularity captures a subclass of non-convex functions that enables both exact minimization and approximate maximization in polynomial time. In this work we… ▽ More

    Submitted 24 May, 2019; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Published in NIPS 2017

  18. arXiv:1706.03269  [pdf, other

    cs.LG stat.ML

    An Online Learning Approach to Generative Adversarial Networks

    Authors: Paulina Grnarova, Kfir Y. Levy, Aurelien Lucchi, Thomas Hofmann, Andreas Krause

    Abstract: We consider the problem of training generative models with a Generative Adversarial Network (GAN). Although GANs can accurately model complex distributions, they are known to be difficult to train due to instabilities caused by a difficult minimax optimization problem. In this paper, we view the problem of training GANs as finding a mixed strategy in a zero-sum game. Building on ideas from online… ▽ More

    Submitted 10 June, 2017; originally announced June 2017.

  19. arXiv:1705.10499  [pdf, other

    cs.LG math.OC stat.ML

    Online to Offline Conversions, Universality and Adaptive Minibatch Sizes

    Authors: Kfir Y. Levy

    Abstract: We present an approach towards convex optimization that relies on a novel scheme which converts online adaptive algorithms into offline methods. In the offline optimization setting, our derived methods are shown to obtain favourable adaptive guarantees which depend on the harmonic sum of the queried gradients. We further show that our methods implicitly adapt to the objective's structure: in the s… ▽ More

    Submitted 31 May, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

  20. arXiv:1701.07266  [pdf, other

    stat.ML cs.LG

    k*-Nearest Neighbors: From Global to Local

    Authors: Oren Anava, Kfir Y. Levy

    Abstract: The weighted k-nearest neighbors algorithm is one of the most fundamental non-parametric methods in pattern recognition and machine learning. The question of setting the optimal number of neighbors as well as the optimal weights has received much attention throughout the years, nevertheless this problem seems to have remained unsettled. In this paper we offer a simple approach to locally weighted… ▽ More

    Submitted 25 January, 2017; originally announced January 2017.

  21. arXiv:1611.04831  [pdf, other

    cs.LG math.OC stat.ML

    The Power of Normalization: Faster Evasion of Saddle Points

    Authors: Kfir Y. Levy

    Abstract: A commonly used heuristic in non-convex optimization is Normalized Gradient Descent (NGD) - a variant of gradient descent in which only the direction of the gradient is taken into account and its magnitude ignored. We analyze this heuristic and show that with carefully chosen parameters and noise injection, this method can provably evade saddle points. We establish the convergence of NGD to a loca… ▽ More

    Submitted 15 November, 2016; originally announced November 2016.

  22. arXiv:1602.05399  [pdf, other

    stat.AP

    Modeling CD4+ T cells dynamics in HIV-infected patients receiving repeated cycles of exogenous Interleukin 7

    Authors: Ana Jarne, Daniel Commenges, Mélanie Prague, Yves Levy, Rodolphe Thiébaut

    Abstract: Combination Antiretroviral Therapy (cART) succeeds to control viral replication in most HIV infected patients. This is normally followed by a reconstitution of the CD4$^+$ T cells pool; however, this does not happen for a substantial proportion of patients. For these patients, an immunotherapy based on injections of Interleukin 7 (IL-7) has been recently proposed as a co-adjutant treatment in the… ▽ More

    Submitted 17 February, 2016; originally announced February 2016.