Skip to main content

Showing 1–5 of 5 results for author: Marjani, A A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.06408  [pdf, other

    stat.ML cs.CR cs.LG math.ST

    Differentially Private Best-Arm Identification

    Authors: Achraf Azize, Marc Jourdan, Aymen Al Marjani, Debabrota Basu

    Abstract: Best Arm Identification (BAI) problems are progressively used for data-sensitive applications, such as designing adaptive clinical trials, tuning hyper-parameters, and conducting user studies. Motivated by the data privacy concerns invoked by these applications, we study the problem of BAI with fixed confidence in both the local and central models, i.e. $ε$-local and $ε$-global Differential Privac… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.02202

  2. arXiv:2309.02202  [pdf, other

    stat.ML cs.CR cs.LG math.ST

    On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence

    Authors: Achraf Azize, Marc Jourdan, Aymen Al Marjani, Debabrota Basu

    Abstract: Best Arm Identification (BAI) problems are progressively used for data-sensitive applications, such as designing adaptive clinical trials, tuning hyper-parameters, and conducting user studies to name a few. Motivated by the data privacy concerns invoked by these applications, we study the problem of BAI with fixed confidence under $ε$-global Differential Privacy (DP). First, to quantify the cost o… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  3. arXiv:2202.06280  [pdf, other

    stat.ML cs.LG

    On the complexity of All $\varepsilon$-Best Arms Identification

    Authors: Aymen Al Marjani, Tomáš Kocák, Aurélien Garivier

    Abstract: We consider the question introduced by \cite{Mason2020} of identifying all the $\varepsilon$-optimal arms in a finite stochastic multi-armed bandit with Gaussian rewards. We give two lower bounds on the sample complexity of any algorithm solving the problem with a confidence at least $1-δ$. The first, unimprovable in the asymptotic regime, motivates the design of a Track-and-Stop strategy whose av… ▽ More

    Submitted 6 April, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

  4. arXiv:2106.02847  [pdf, other

    stat.ML cs.LG

    Navigating to the Best Policy in Markov Decision Processes

    Authors: Aymen Al Marjani, Aurélien Garivier, Alexandre Proutiere

    Abstract: We investigate the classical active pure exploration problem in Markov Decision Processes, where the agent sequentially selects actions and, from the resulting system trajectory, aims at identifying the best policy as fast as possible. We propose a problem-dependent lower bound on the average number of steps required before a correct answer can be given with probability at least $1-δ$. We further… ▽ More

    Submitted 25 October, 2021; v1 submitted 5 June, 2021; originally announced June 2021.

  5. arXiv:2009.13405  [pdf, other

    stat.ML cs.LG

    Adaptive Sampling for Best Policy Identification in Markov Decision Processes

    Authors: Aymen Al Marjani, Alexandre Proutiere

    Abstract: We investigate the problem of best-policy identification in discounted Markov Decision Processes (MDPs) when the learner has access to a generative model. The objective is to devise a learning algorithm returning the best policy as early as possible. We first derive a problem-specific lower bound of the sample complexity satisfied by any learning algorithm. This lower bound corresponds to an optim… ▽ More

    Submitted 10 May, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 43 pages