Showing 1–6 of 6 results for author: Kagrecha, A

  1. arXiv:2401.13239  [pdf, other]

    cs.LG cs.HC

    Adaptive Crowdsourcing Via Self-Supervised Learning

    Authors: Anmol Kagrecha, Henrik Marklund, Benjamin Van Roy, Hong Jun Jeon, Richard Zeckhauser

    Abstract: Common crowdsourcing systems average estimates of a latent quantity of interest provided by many crowdworkers to produce a group estimate. We develop a new approach -- predict-each-worker -- that leverages self-supervised learning and a novel aggregation scheme. This approach adapts weights assigned to crowdworkers based on estimates they provided for previous quantities. When skills vary across c…

    Submitted 1 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 33 pages, 3 figures

  2. arXiv:2008.13629  [pdf, other]

    cs.LG stat.ML

    Statistically Robust, Risk-Averse Best Arm Identification in Multi-Armed Bandits

    Authors: Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

    Abstract: Traditional multi-armed bandit (MAB) formulations usually make certain assumptions about the underlying arms' distributions, such as bounds on the support or their tail behaviour. Moreover, such parametric information is usually 'baked' into the algorithms. In this paper, we show that specialized algorithms that exploit such parametric information are prone to inconsistent learning performance whe…

    Submitted 27 March, 2022; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: 21 pages. Preliminary version appeared in NeurIPS 2019. Accepted for publication at IEEE Transactions on Information Theory. arXiv admin note: text overlap with arXiv:1906.00569

  3. arXiv:2006.12038  [pdf, ps, other]

    cs.LG stat.ML

    Bandit algorithms: Letting go of logarithmic regret for statistical robustness

    Authors: Kumar Ashutosh, Jayakrishnan Nair, Anmol Kagrecha, Krishna Jagannathan

    Abstract: We study regret minimization in a stochastic multi-armed bandit setting and establish a fundamental trade-off between the regret suffered under an algorithm, and its statistical robustness. Considering broad classes of underlying arms' distributions, we show that bandit learning algorithms with logarithmic regret are always inconsistent and that consistent learning algorithms always suffer a super…

    Submitted 22 June, 2020; originally announced June 2020.

  4. arXiv:2006.09649  [pdf, other]

    cs.LG stat.ML

    Constrained regret minimization for multi-criterion multi-armed bandits

    Authors: Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

    Abstract: We consider a stochastic multi-armed bandit setting and study the problem of constrained regret minimization over a given time horizon. Each arm is associated with an unknown, possibly multi-dimensional distribution, and the merit of an arm is determined by several, possibly conflicting attributes. The aim is to optimize a 'primary' attribute subject to user-provided constraints on other 'secondar…

    Submitted 3 January, 2023; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: 26 pages

  5. arXiv:1910.12894  [pdf, other]

    cs.PF

    Please come back later: Benefiting from deferrals in service systems

    Authors: Anmol Kagrecha, Jayakrishnan Nair

    Abstract: The performance evaluation of loss service systems, where customers who cannot be served upon arrival get dropped, has a long history going back to the classical Erlang B model. In this paper, we consider the performance benefits arising from the possibility of deferring customers who cannot be served upon arrival. Specifically, we consider an Erlang B type loss system where the system operator ca…

    Submitted 28 October, 2019; originally announced October 2019.

  6. arXiv:1906.00569  [pdf, other]

    cs.LG stat.ML

    Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards

    Authors: Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

    Abstract: Classical multi-armed bandit problems use the expected value of an arm as a metric to evaluate its goodness. However, the expected value is a risk-neutral metric. In many applications like finance, one is interested in balancing the expected return of an arm (or portfolio) with the risk associated with that return. In this paper, we consider the problem of selecting the arm that optimizes a linear…

    Submitted 3 June, 2019; originally announced June 2019.