Showing 1–4 of 4 results for author: Genalti, G

  1. arXiv:2310.02975  [pdf, ps, other]

    cs.LG cs.AI

    $(ε, u)$-Adaptive Regret Minimization in Heavy-Tailed Bandits

    Authors: Gianmarco Genalti, Lupo Marsigli, Nicola Gatti, Alberto Maria Metelli

    Abstract: Heavy-tailed distributions naturally arise in several settings, from finance to telecommunications. While regret minimization under subgaussian or bounded rewards has been widely studied, learning with heavy-tailed distributions only gained popularity over the last decade. In this paper, we consider the setting in which the reward distributions have finite absolute raw moments of maximum order…

    Submitted 12 February, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  2. arXiv:2304.14326  [pdf, ps, other]

    cs.LG

    A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

    Authors: Jacopo Germano, Francesco Emanuele Stradi, Gianmarco Genalti, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study online learning in episodic constrained Markov decision processes (CMDPs), where the goal of the learner is to collect as much reward as possible over the episodes, while guaranteeing that some long-term constraints are satisfied during the learning process. Rewards and constraints can be selected either stochastically or adversarially, and the transition function is not known to the lear…

    Submitted 27 April, 2023; originally announced April 2023.

  3. arXiv:2212.06251  [pdf, other]

    cs.LG stat.ML

    Autoregressive Bandits

    Authors: Francesco Bacchiocchi, Gianmarco Genalti, Davide Maran, Marco Mussi, Marcello Restelli, Nicola Gatti, Alberto Maria Metelli

    Abstract: Autoregressive processes naturally arise in a large variety of real-world scenarios, including stock markets, sales forecasting, weather prediction, advertising, and pricing. When facing a sequential decision-making problem in such a context, the temporal dependence between consecutive observations should be properly accounted for guaranteeing convergence to the optimal policy. In this work, we pr…

    Submitted 19 February, 2024; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: Accepted to AISTATS 2024

  4. arXiv:2211.09612  [pdf, other]

    cs.LG

    Dynamic Pricing with Volume Discounts in Online Settings

    Authors: Marco Mussi, Gianmarco Genalti, Alessandro Nuara, Francesco Trovò, Marcello Restelli, Nicola Gatti

    Abstract: According to the main international reports, more pervasive industrial and business-process automation, thanks to machine learning and advanced analytic tools, will unlock more than 14 trillion USD worldwide annually by 2030. In the specific case of pricing problems, which constitute the class of problems we investigate in this paper, the estimated unlocked value will be about 0.5 trillion USD per…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted to IAAI 2023