Skip to main content

Showing 1–3 of 3 results for author: Kazerouni, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2005.11442  [pdf, other

    cs.LG stat.ML

    Active Learning for Skewed Data Sets

    Authors: Abbas Kazerouni, Qi Zhao, **g Xie, Sandeep Tata, Marc Najork

    Abstract: Consider a sequential active learning problem where, at each round, an agent selects a batch of unlabeled data points, queries their labels and updates a binary classifier. While there exists a rich body of work on active learning in this general form, in this paper, we focus on problems with two distinguishing characteristics: severe class imbalance (skew) and small amounts of initial training da… ▽ More

    Submitted 22 May, 2020; originally announced May 2020.

  2. arXiv:1905.08224  [pdf, other

    cs.LG stat.ML

    Best Arm Identification in Generalized Linear Bandits

    Authors: Abbas Kazerouni, Lawrence M. Wein

    Abstract: Motivated by drug design, we consider the best-arm identification problem in generalized linear bandits. More specifically, we assume each arm has a vector of covariates, there is an unknown vector of parameters that is common across the arms, and a generalized linear model captures the dependence of rewards on the covariate and parameter vectors. The problem is to minimize the number of arm pulls… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

  3. arXiv:1611.06426  [pdf, other

    stat.ML cs.LG

    Conservative Contextual Linear Bandits

    Authors: Abbas Kazerouni, Mohammad Ghavamzadeh, Yasin Abbasi-Yadkori, Benjamin Van Roy

    Abstract: Safety is a desirable property that can immensely increase the applicability of learning algorithms in real-world decision-making problems. It is much easier for a company to deploy an algorithm that is safe, i.e., guaranteed to perform at least as well as a baseline. In this paper, we study the issue of safety in contextual linear bandits that have application in many different fields including p… ▽ More

    Submitted 3 March, 2017; v1 submitted 19 November, 2016; originally announced November 2016.