Skip to main content

Showing 1–9 of 9 results for author: Kassraie, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16745  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Bandits with Preference Feedback: A Stackelberg Game Perspective

    Authors: Barna Pásztor, Parnian Kassraie, Andreas Krause

    Abstract: Bandits with preference feedback present a powerful tool for optimizing unknown target functions when only pairwise comparisons are allowed instead of direct value queries. This model allows for incorporating human feedback into online inference and optimization and has been employed in systems for fine-tuning large language models. The problem is well understood in simplified settings with linear… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 30 pages, 8 figures

  2. arXiv:2406.05061  [pdf, other

    stat.ML cs.LG

    Progressive Entropic Optimal Transport Solvers

    Authors: Parnian Kassraie, Aram-Alexandre Pooladian, Michal Klein, James Thornton, Jonathan Niles-Weed, Marco Cuturi

    Abstract: Optimal transport (OT) has profoundly impacted machine learning by providing theoretical and computational tools to realign datasets. In this context, given two large point clouds of sizes $n$ and $m$ in $\mathbb{R}^d$, entropic OT (EOT) solvers have emerged as the most reliable tool to either solve the Kantorovich problem and output a $n\times m$ coupling matrix, or to solve the Monge problem and… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  3. arXiv:2307.12897  [pdf, other

    stat.ML cs.AI cs.LG

    Anytime Model Selection in Linear Bandits

    Authors: Parnian Kassraie, Nicolas Emmenegger, Andreas Krause, Aldo Pacchiano

    Abstract: Model selection in the context of bandit optimization is a challenging problem, as it requires balancing exploration and exploitation not only for action selection, but also for model selection. One natural approach is to rely on online learning algorithms that treat different models as experts. Existing methods, however, scale poorly ($\text{poly}M$) with the number of models $M$ in terms of thei… ▽ More

    Submitted 12 November, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023, 37 pages

  4. arXiv:2303.01076  [pdf, other

    cs.LG cs.AI stat.ML

    Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

    Authors: Jonas Rothfuss, Bhavya Sukhija, Tobias Birchler, Parnian Kassraie, Andreas Krause

    Abstract: We study the problem of conservative off-policy evaluation (COPE) where given an offline dataset of environment interactions, collected by other agents, we seek to obtain a (tight) lower bound on a policy's performance. This is crucial when deciding whether a given policy satisfies certain minimal performance/safety criteria before it can be deployed in the real world. To this end, we introduce HA… ▽ More

    Submitted 26 May, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI) 2023, first three authors contributed equally

  5. arXiv:2211.01258  [pdf, other

    stat.ML cs.LG

    Instance-Dependent Generalization Bounds via Optimal Transport

    Authors: Songyan Hou, Parnian Kassraie, Anastasis Kratsios, Andreas Krause, Jonas Rothfuss

    Abstract: Existing generalization bounds fail to explain crucial factors that drive the generalization of modern neural networks. Since such bounds often hold uniformly over all parameters, they suffer from over-parametrization and fail to account for the strong inductive bias of initialization and stochastic gradient descent. As an alternative, we propose a novel optimal transport interpretation of the gen… ▽ More

    Submitted 13 November, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Journal of Machine Learning Research (JMLR), 51 pages

  6. arXiv:2210.15513  [pdf, other

    stat.ML cs.AI cs.LG

    Lifelong Bandit Optimization: No Prior and No Regret

    Authors: Felix Schur, Parnian Kassraie, Jonas Rothfuss, Andreas Krause

    Abstract: Machine learning algorithms are often repeatedly applied to problems with similar structure over and over again. We focus on solving a sequence of bandit optimization tasks and develop LIBO, an algorithm which adapts to the environment by learning from past experience and becomes more sample-efficient in the process. We assume a kernelized structure where the kernel is unknown but shared across al… ▽ More

    Submitted 20 June, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 35 pages, 6 figures, In Proceedings of UAI 2023

  7. arXiv:2207.06456  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Neural Network Bandits

    Authors: Parnian Kassraie, Andreas Krause, Ilija Bogunovic

    Abstract: We consider the bandit optimization problem with the reward function defined over graph-structured data. This problem has important applications in molecule design and drug discovery, where the reward is naturally invariant to graph permutations. The key challenges in this setting are scaling to large domains, and to graphs with many nodes. We resolve these challenges by embedding the permutation… ▽ More

    Submitted 11 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted to Neurips2022, 37 pages, 8 figures

  8. arXiv:2202.00602  [pdf, other

    stat.ML cs.AI cs.LG

    Meta-Learning Hypothesis Spaces for Sequential Decision-making

    Authors: Parnian Kassraie, Jonas Rothfuss, Andreas Krause

    Abstract: Obtaining reliable, adaptive confidence sets for prediction functions (hypotheses) is a central challenge in sequential decision-making tasks, such as bandits and model-based reinforcement learning. These confidence sets typically rely on prior assumptions on the hypothesis space, e.g., the known kernel of a Reproducing Kernel Hilbert Space (RKHS). Hand-designing such kernels is error prone, and m… ▽ More

    Submitted 17 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: 23 pages, 11 figures

  9. arXiv:2107.03144  [pdf, other

    stat.ML cs.AI cs.LG

    Neural Contextual Bandits without Regret

    Authors: Parnian Kassraie, Andreas Krause

    Abstract: Contextual bandits are a rich model for sequential decision making given side information, with important applications, e.g., in recommender systems. We propose novel algorithms for contextual bandits harnessing neural networks to approximate the unknown reward function. We resolve the open problem of proving sublinear regret bounds in this setting for general context sequences, considering both f… ▽ More

    Submitted 28 February, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: 37 pages, 6 figures