Showing 1–3 of 3 results for author: Emmenegger, N

  1. arXiv:2311.04402  [pdf, other]

    cs.LG stat.ML

    Likelihood Ratio Confidence Sets for Sequential Decision Making

    Authors: Nicolas Emmenegger, Mojmír Mutný, Andreas Krause

    Abstract: Certifiable, adaptive uncertainty estimates for unknown quantities are an essential ingredient of sequential decision-making algorithms. Standard approaches rely on problem-dependent concentration results and are limited to a specific combination of parameterization, noise family, and estimator. In this paper, we revisit the likelihood-based inference principle and propose to use likelihood ratios…

    Submitted 7 November, 2023; originally announced November 2023.

  2. arXiv:2307.12897  [pdf, other]

    stat.ML cs.AI cs.LG

    Anytime Model Selection in Linear Bandits

    Authors: Parnian Kassraie, Nicolas Emmenegger, Andreas Krause, Aldo Pacchiano

    Abstract: Model selection in the context of bandit optimization is a challenging problem, as it requires balancing exploration and exploitation not only for action selection, but also for model selection. One natural approach is to rely on online learning algorithms that treat different models as experts. Existing methods, however, scale poorly ($\text{poly}M$) with the number of models $M$ in terms of thei…

    Submitted 12 November, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023, 37 pages

  3. arXiv:2103.05138  [pdf, other]

    math.OC cs.DS cs.LG stat.ML

    On the Oracle Complexity of Higher-Order Smooth Non-Convex Finite-Sum Optimization

    Authors: Nicolas Emmenegger, Rasmus Kyng, Ahad N. Zehmakan

    Abstract: We prove lower bounds for higher-order methods in smooth non-convex finite-sum optimization. Our contribution is threefold: We first show that a deterministic algorithm cannot profit from the finite-sum structure of the objective, and that simulating a pth-order regularized method on the whole function by constructing exact gradient information is optimal up to constant factors. We further show lo…

    Submitted 2 July, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Added missing upper bound assumption on n in Theorems 4.7 and 4.10