Skip to main content

Showing 1–10 of 10 results for author: Katz-Samuels, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2111.04915  [pdf, other

    cs.LG stat.ML

    Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

    Authors: Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

    Abstract: We consider interactive learning in the realizable setting and develop a general framework to handle problems ranging from best arm identification to active classification. We begin our investigation with the observation that agnostic algorithms \emph{cannot} be minimax-optimal in the realizable setting. Hence, we design novel computationally efficient algorithms for the realizable setting that ma… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

  2. arXiv:2109.05131  [pdf, other

    stat.ML cs.LG

    Near Instance Optimal Model Selection for Pure Exploration Linear Bandits

    Authors: Yinglun Zhu, Julian Katz-Samuels, Robert Nowak

    Abstract: We introduce the model selection problem in pure exploration linear bandits, where the learner needs to adapt to the instance-dependent complexity measure of the smallest hypothesis class containing the true model. We design algorithms in both fixed confidence and fixed budget settings with near instance optimal guarantees. The core of our algorithms is a new optimization problem based on experime… ▽ More

    Submitted 17 March, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

  3. arXiv:2105.06499  [pdf, other

    cs.LG stat.ML

    Improved Algorithms for Agnostic Pool-based Active Classification

    Authors: Julian Katz-Samuels, Jifan Zhang, Lalit Jain, Kevin Jamieson

    Abstract: We consider active learning for binary classification in the agnostic pool-based setting. The vast majority of works in active learning in the agnostic setting are inspired by the CAL algorithm where each query is uniformly sampled from the disagreement region of the current version space. The sample complexity of such algorithms is described by a quantity known as the disagreement coefficient whi… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

  4. arXiv:2011.00576  [pdf, other

    cs.LG stat.ML

    Experimental Design for Regret Minimization in Linear Bandits

    Authors: Andrew Wagenmaker, Julian Katz-Samuels, Kevin Jamieson

    Abstract: In this paper we propose a novel experimental design-based algorithm to minimize regret in online stochastic linear and combinatorial bandits. While existing literature tends to focus on optimism-based algorithms--which have been shown to be suboptimal in many cases--our approach carefully plans which action to take by balancing the tradeoff between information gain and reward, overcoming the fail… ▽ More

    Submitted 26 February, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

  5. arXiv:2007.00077  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Similarity Search for Efficient Active Learning and Search of Rare Concepts

    Authors: Cody Coleman, Edward Chou, Julian Katz-Samuels, Sean Culatana, Peter Bailis, Alexander C. Berg, Robert Nowak, Roshan Sumbaly, Matei Zaharia, I. Zeki Yalniz

    Abstract: Many active learning and search approaches are intractable for large-scale industrial settings with billions of unlabeled examples. Existing approaches search globally for the optimal examples to label, scaling linearly or even quadratically with the unlabeled data. In this paper, we improve the computational efficiency of active learning and search methods by restricting the candidate pool for la… ▽ More

    Submitted 22 July, 2021; v1 submitted 30 June, 2020; originally announced July 2020.

  6. arXiv:2006.11685  [pdf, other

    cs.LG stat.ML

    An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

    Authors: Julian Katz-Samuels, Lalit Jain, Zohar Karnin, Kevin Jamieson

    Abstract: This paper proposes near-optimal algorithms for the pure-exploration linear bandit problem in the fixed confidence and fixed budget settings. Leveraging ideas from the theory of suprema of empirical processes, we provide an algorithm whose sample complexity scales with the geometry of the instance and avoids an explicit union bound over the number of arms. Unlike previous approaches which sample b… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

  7. arXiv:1906.06594  [pdf, other

    stat.ML cs.LG

    The True Sample Complexity of Identifying Good Arms

    Authors: Julian Katz-Samuels, Kevin Jamieson

    Abstract: We consider two multi-armed bandit problems with $n$ arms: (i) given an $ε> 0$, identify an arm with mean that is within $ε$ of the largest mean and (ii) given a threshold $μ_0$ and integer $k$, identify $k$ arms with means larger than $μ_0$. Existing lower bounds and algorithms for the PAC framework suggest that both of these problems require $Ω(n)$ samples. However, we argue that these definitio… ▽ More

    Submitted 15 June, 2019; originally announced June 2019.

  8. arXiv:1710.01167  [pdf, other

    stat.ML

    Decontamination of Mutual Contamination Models

    Authors: Julian Katz-Samuels, Gilles Blanchard, Clayton Scott

    Abstract: Many machine learning problems can be characterized by mutual contamination models. In these problems, one observes several random samples from different convex combinations of a set of unknown base distributions and the goal is to infer these base distributions. This paper considers the general setting where the base distributions are defined on arbitrary probability spaces. We examine three popu… ▽ More

    Submitted 11 April, 2019; v1 submitted 30 September, 2017; originally announced October 2017.

    Comments: Published in JMLR. Subsumes arXiv:1602.06235

  9. arXiv:1705.08621  [pdf, ps, other

    stat.ML cs.LG

    Nonparametric Preference Completion

    Authors: Julian Katz-Samuels, Clayton Scott

    Abstract: We consider the task of collaborative preference completion: given a pool of items, a pool of users and a partially observed item-user rating matrix, the goal is to recover the \emph{personalized ranking} of each user over all of the items. Our approach is nonparametric: we assume that each item $i$ and each user $u$ have unobserved features $x_i$ and $y_u$, and that the associated rating is given… ▽ More

    Submitted 10 April, 2018; v1 submitted 24 May, 2017; originally announced May 2017.

    Comments: AISTATS 2018

  10. arXiv:1602.06235  [pdf, other

    stat.ML

    A Mutual Contamination Analysis of Mixed Membership and Partial Label Models

    Authors: Julian Katz-Samuels, Clayton Scott

    Abstract: Many machine learning problems can be characterized by mutual contamination models. In these problems, one observes several random samples from different convex combinations of a set of unknown base distributions. It is of interest to decontaminate mutual contamination models, i.e., to recover the base distributions either exactly or up to a permutation. This paper considers the general setting wh… ▽ More

    Submitted 19 February, 2016; originally announced February 2016.