Skip to main content

Showing 1–50 of 55 results for author: Gaillard, P

.
  1. arXiv:2406.12366  [pdf, ps, other

    cs.LG math.ST stat.ML

    Structured Prediction in Online Learning

    Authors: Pierre Boudart, Alessandro Rudi, Pierre Gaillard

    Abstract: We study a theoretical and algorithmic framework for structured prediction in the online learning setting. The problem of structured prediction, i.e. estimating function where the output space lacks a vectorial structure, is well studied in the literature of supervised statistical learning. We show that our algorithm is a generalisation of optimal algorithms from the supervised learning setting, a… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 29 pages

  2. arXiv:2405.19807  [pdf, ps, other

    cs.LG math.PR math.ST stat.ML

    MetaCURL: Non-stationary Concave Utility Reinforcement Learning

    Authors: Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane

    Abstract: We explore online learning in episodic loop-free Markov decision processes on non-stationary environments (changing losses and probability transitions). Our focus is on the Concave Utility Reinforcement Learning problem (CURL), an extension of classical RL for handling convex performance criteria in state-action distributions induced by agent policies. While various machine learning problems can b… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2403.07460  [pdf, other

    cs.LG

    Experimental Comparison of Ensemble Methods and Time-to-Event Analysis Models Through Integrated Brier Score and Concordance Index

    Authors: Camila Fernandez, Chung Shue Chen, Chen Pierre Gaillard, Alonso Silva

    Abstract: Time-to-event analysis is a branch of statistics that has increased in popularity during the last decades due to its many application fields, such as predictive maintenance, customer churn prediction and population lifetime estimation. In this paper, we review and compare the performance of several prediction models for time-to-event analysis. These consist of semi-parametric and parametric statis… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  4. arXiv:2402.18917  [pdf, other

    cs.LG cs.IR

    Stop Relying on No-Choice and Do not Repeat the Moves: Optimal, Efficient and Practical Algorithms for Assortment Optimization

    Authors: Aadirupa Saha, Pierre Gaillard

    Abstract: We address the problem of active online assortment optimization problem with preference feedback, which is a framework for modeling user choices and subsetwise utility maximization. The framework is useful in various real-world applications including ad placement, online retail, recommender systems, fine-tuning language models, amongst many. The problem, although has been studied in the past, lack… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  5. arXiv:2402.15171  [pdf, ps, other

    cs.LG math.ST stat.ML

    Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits

    Authors: Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel

    Abstract: We address the problem of stochastic combinatorial semi-bandits, where a player selects among $P$ actions from the power set of a set containing $d$ base items. Adaptivity to the problem's structure is essential in order to obtain optimal regret upper bounds. As estimating the coefficients of a covariance matrix can be manageable in practice, leveraging them should improve the regret. We design ``… ▽ More

    Submitted 3 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  6. arXiv:2402.05145  [pdf, other

    cs.LG physics.data-an stat.ML

    Online Learning Approach for Survival Analysis

    Authors: Camila Fernandez, Pierre Gaillard, Joseph de Vilmarest, Olivier Wintenberger

    Abstract: We introduce an online mathematical framework for survival analysis, allowing real time adaptation to dynamic environments and censored data. This framework enables the estimation of event time distributions through an optimal second order online convex optimization algorithm-Online Newton Step (ONS). This approach, previously unexplored, presents substantial advantages, including explicit algorit… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  7. arXiv:2311.18346  [pdf, other

    math.OC physics.data-an stat.ML

    Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent

    Authors: Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane

    Abstract: Many machine learning tasks can be solved by minimizing a convex function of an occupancy measure over the policies that generate them. These include reinforcement learning, imitation learning, among others. This more general paradigm is called the Concave Utility Reinforcement Learning problem (CURL). Since CURL invalidates classical Bellman equations, it requires new algorithms. We introduce MD-… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  8. arXiv:2309.07530  [pdf, other

    cs.LG math.NA

    Adaptive approximation of monotone functions

    Authors: Pierre Gaillard, Sébastien Gerchinovitz, Étienne de Montbrun

    Abstract: We study the classical problem of approximating a non-decreasing function $f: \mathcal{X} \to \mathcal{Y}$ in $L^p(μ)$ norm by sequentially querying its values, for known compact real intervals $\mathcal{X}$, $\mathcal{Y}$ and a known probability measure $μ$ on $\cX$. For any function~$f$ we characterize the minimum number of evaluations of $f$ that algorithms need to guarantee an approximation… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  9. arXiv:2302.12120  [pdf, other

    cs.LG

    Sequential Counterfactual Risk Minimization

    Authors: Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard

    Abstract: Counterfactual Risk Minimization (CRM) is a framework for dealing with the logged bandit feedback problem, where the goal is to improve a logging policy using offline data. In this paper, we explore the case where it is possible to deploy learned policies multiple times and acquire new data. We extend the CRM principle and its theory to this scenario, which we call "Sequential Counterfactual Risk… ▽ More

    Submitted 25 May, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: To appear at ICML23

  10. arXiv:2302.08190  [pdf, other

    math.OC cs.LG math.PR stat.AP stat.ML

    Reimagining Demand-Side Management with Mean Field Learning

    Authors: Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane

    Abstract: Integrating renewable energy into the power grid while balancing supply and demand is a complex issue, given its intermittent nature. Demand side management (DSM) offers solutions to this challenge. We propose a new method for DSM, in particular the problem of controlling a large population of electrical devices to follow a desired consumption signal. We model it as a finite horizon Markovian mean… ▽ More

    Submitted 25 May, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  11. arXiv:2210.14998  [pdf, other

    cs.LG

    One Arrow, Two Kills: An Unified Framework for Achieving Optimal Regret Guarantees in Slee** Bandits

    Authors: Pierre Gaillard, Aadirupa Saha, Soham Dan

    Abstract: We address the problem of \emph{`Internal Regret'} in \emph{Slee** Bandits} in the fully adversarial setup, as well as draw connections between different existing notions of slee** regrets in the multiarmed bandits (MAB) literature and consequently analyze the implications: Our first contribution is to propose the new notion of \emph{Internal Regret} for slee** MAB. We then proposed an algor… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  12. arXiv:2209.13932  [pdf, ps, other

    math.OC q-fin.CP q-fin.PM

    Efficient and Near-Optimal Online Portfolio Selection

    Authors: Rémi Jézéquel, Dmitrii M. Ostrovskii, Pierre Gaillard

    Abstract: In the problem of online portfolio selection as formulated by Cover (1991), the trader repeatedly distributes her capital over $ d $ assets in each of $ T > 1 $ rounds, with the goal of maximizing the total return. Cover proposed an algorithm, termed Universal Portfolios, that performs nearly as well as the best (in hindsight) static assignment of a portfolio, with an $ O(d\log(T)) $ regret in ter… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  13. arXiv:2202.06694  [pdf, other

    cs.LG stat.ML

    Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences

    Authors: Aadirupa Saha, Pierre Gaillard

    Abstract: We study the problem of $K$-armed dueling bandit for both stochastic and adversarial environments, where the goal of the learner is to aggregate information through relative preferences of pair of decisions points queried in an online sequential manner. We first propose a novel reduction from any (general) dueling bandits to multi-armed bandits and despite the simplicity, it allows us to improve m… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  14. arXiv:2202.05638  [pdf, other

    cs.LG

    Efficient Kernel UCB for Contextual Bandits

    Authors: Houssam Zenati, Alberto Bietti, Eustache Diemert, Julien Mairal, Matthieu Martin, Pierre Gaillard

    Abstract: In this paper, we tackle the computational efficiency of kernelized UCB algorithms in contextual bandits. While standard methods require a O(CT^3) complexity where T is the horizon and the constant C is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems. Specifically, our method relies on incremental Nystrom approximations of the joint kernel… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: To appear at AISTATS2022

  15. arXiv:2110.09133  [pdf, other

    cs.LG

    Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits

    Authors: Reda Ouhamma, Rémy Degenne, Pierre Gaillard, Vianney Perchet

    Abstract: In the fixed budget thresholding bandit problem, an algorithm sequentially allocates a budgeted number of samples to different distributions. It then predicts whether the mean of each distribution is larger or lower than a given threshold. We introduce a large family of algorithms (containing most existing relevant ones), inspired by the Frank-Wolfe algorithm, and provide a thorough yet generic an… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 10+15 pages. To be published in the proceedings of NeurIPS 2021

  16. arXiv:2110.03960  [pdf, other

    cs.LG math.ST stat.ML

    Mixability made efficient: Fast online multiclass logistic regression

    Authors: Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

    Abstract: Mixability has been shown to be a powerful tool to obtain algorithms with optimal regret. However, the resulting methods often suffer from high computational complexity which has reduced their practical applicability. For example, in the case of multiclass logistic regression, the aggregating forecaster (Foster et al. (2018)) achieves a regret of $O(\log(Bn))$ whereas Online Newton Step achieves… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  17. arXiv:2107.02274  [pdf, other

    cs.LG cs.AI

    Dueling Bandits with Adversarial Slee**

    Authors: Aadirupa Saha, Pierre Gaillard

    Abstract: We introduce the problem of slee** dueling bandits with stochastic preferences and adversarial availabilities (DB-SPAA). In almost all dueling bandit applications, the decision space often changes over time; eg, retail store management, online shop**, restaurant recommendation, search engine optimization, etc. Surprisingly, this `slee** aspect' of dueling bandits has never been studied in th… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

  18. arXiv:2106.07644  [pdf, other

    math.OC cs.LG cs.MA math.PR stat.ML

    A Continuized View on Nesterov Acceleration for Stochastic Gradient Descent and Randomized Gossip

    Authors: Mathieu Even, Raphaël Berthier, Francis Bach, Nicolas Flammarion, Pierre Gaillard, Hadrien Hendrikx, Laurent Massoulié, Adrien Taylor

    Abstract: We introduce the continuized Nesterov acceleration, a close variant of Nesterov acceleration whose variables are indexed by a continuous time parameter. The two variables continuously mix following a linear ordinary differential equation and take gradient steps at random times. This continuized variant benefits from the best of the continuous and the discrete frameworks: as a continuous process, o… ▽ More

    Submitted 27 October, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.06035

  19. arXiv:2102.06035  [pdf, other

    cs.DC math.OC

    A Continuized View on Nesterov Acceleration

    Authors: Raphaël Berthier, Francis Bach, Nicolas Flammarion, Pierre Gaillard, Adrien Taylor

    Abstract: We introduce the "continuized" Nesterov acceleration, a close variant of Nesterov acceleration whose variables are indexed by a continuous time parameter. The two variables continuously mix following a linear ordinary differential equation and take gradient steps at random times. This continuized variant benefits from the best of the continuous and the discrete frameworks: as a continuous process,… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  20. arXiv:2102.03594  [pdf, other

    math.ST cs.LG stat.ML

    Online nonparametric regression with Sobolev kernels

    Authors: Oleksandr Zadorozhnyi, Pierre Gaillard, Sebastien Gerschinovitz, Alessandro Rudi

    Abstract: In this work we investigate the variation of the online kernelized ridge regression algorithm in the setting of $d-$dimensional adversarial nonparametric regression. We derive the regret upper bounds on the classes of Sobolev spaces $W_{p}^β(\mathcal{X})$, $p\geq 2, β>\frac{d}{p}$. The upper bounds are supported by the minimax regret analysis, which reveals that in the cases $β> \frac{d}{2}$ or… ▽ More

    Submitted 13 July, 2021; v1 submitted 6 February, 2021; originally announced February 2021.

    Comments: 40 pages, 5 figures, 3 tables (version 2)

  21. arXiv:2011.06957  [pdf, other

    cs.LG math.ST

    Non-stationary Online Regression

    Authors: Anant Raj, Pierre Gaillard, Christophe Saad

    Abstract: Online forecasting under a changing environment has been a problem of increasing importance in many real-world applications. In this paper, we consider the meta-algorithm presented in \citet{zhang2017dynamic} combined with different subroutines. We show that an expected cumulative error of order $\tilde{O}(n^{1/3} C_n^{2/3})$ can be obtained for non-stationary online linear regression where the to… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

  22. arXiv:2006.08212  [pdf, other

    cs.LG cs.MA math.OC stat.ML

    Tight Nonparametric Convergence Rates for Stochastic Gradient Descent under the Noiseless Linear Model

    Authors: Raphaël Berthier, Francis Bach, Pierre Gaillard

    Abstract: In the context of statistical supervised learning, the noiseless linear model assumes that there exists a deterministic linear relation $Y = \langle θ_*, X \rangle$ between the random output $Y$ and the random feature vector $Φ(U)$, a potentially non-linear transformation of the inputs $U$. We analyze the convergence of single-pass, fixed step-size stochastic gradient descent on the least-square r… ▽ More

    Submitted 27 October, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  23. arXiv:2004.11722  [pdf, other

    stat.ML cs.LG

    Counterfactual Learning of Stochastic Policies with Continuous Actions: from Models to Offline Evaluation

    Authors: Houssam Zenati, Alberto Bietti, Matthieu Martin, Eustache Diemert, Pierre Gaillard, Julien Mairal

    Abstract: Counterfactual reasoning from logged data has become increasingly important for many applications such as web advertising or healthcare. In this paper, we address the problem of learning stochastic policies with continuous actions from the viewpoint of counterfactual risk minimization (CRM). While the CRM framework is appealing and well studied for discrete actions, the continuous action case rais… ▽ More

    Submitted 14 December, 2022; v1 submitted 22 April, 2020; originally announced April 2020.

  24. arXiv:2004.06248  [pdf, other

    cs.LG stat.ML

    Improved Slee** Bandits with Stochastic Actions Sets and Adversarial Rewards

    Authors: Aadirupa Saha, Pierre Gaillard, Michal Valko

    Abstract: In this paper, we consider the problem of slee** bandits with stochastic action sets and adversarial rewards. In this setting, in contrast to most work in bandits, the actions may not be available at all times. For instance, some products might be out of stock in item recommendation. The best existing efficient (i.e., polynomial-time) algorithms for this problem only guarantee an $O(T^{2/3})$ up… ▽ More

    Submitted 8 August, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: Accepted to ICML 2020

  25. arXiv:2003.08820  [pdf, other

    cs.LG stat.ML

    Experimental Comparison of Semi-parametric, Parametric, and Machine Learning Models for Time-to-Event Analysis Through the Concordance Index

    Authors: Camila Fernandez, Chung Shue Chen, Pierre Gaillard, Alonso Silva

    Abstract: In this paper, we make an experimental comparison of semi-parametric (Cox proportional hazards model, Aalen's additive regression model), parametric (Weibull AFT model), and machine learning models (Random Survival Forest, Gradient Boosting with Cox Proportional Hazards Loss, DeepSurv) through the concordance index on two different datasets (PBC and GBCSG2). We present two comparisons: one with th… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

  26. arXiv:2003.08109  [pdf, other

    cs.LG math.ST stat.ML

    Efficient improper learning for online logistic regression

    Authors: Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

    Abstract: We consider the setting of online logistic regression and consider the regret with respect to the 2-ball of radius B. It is known (see [Hazan et al., 2014]) that any proper algorithm which has logarithmic regret in the number of samples (denoted n) necessarily suffers an exponential multiplicative constant in B. In this work, we design an efficient improper algorithm that avoids this exponential c… ▽ More

    Submitted 3 November, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Journal ref: Conference on Learning Theory 2020, Jul 2020, Graz, Austria

  27. arXiv:1902.09917  [pdf, other

    stat.ML cs.LG math.ST

    Efficient online learning with kernels for adversarial large scale problems

    Authors: Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

    Abstract: We are interested in a framework of online learning with kernels for low-dimensional but large-scale and potentially adversarial datasets. We study the computational and theoretical performance of online variations of kernel Ridge regression. Despite its simplicity, the algorithm we study is the first to achieve the optimal regret for a wide range of kernels with a per-round complexity of order… ▽ More

    Submitted 29 May, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

  28. Bayesian inference and non-linear extensions of the CIRCE method for quantifying the uncertainty of closure relationships integrated into thermal-hydraulic system codes

    Authors: Guillaume Damblin, Pierre Gaillard

    Abstract: Uncertainty Quantification of closure relationships integrated into thermal-hydraulic system codes is a critical prerequisite in applying the Best-Estimate Plus Uncertainty (BEPU) methodology for nuclear safety and licensing processes.The purpose of the CIRCE method is to estimate the (log)-Gaussian probability distribution of a multiplicative factor applied to a reference closure relationship in… ▽ More

    Submitted 9 March, 2020; v1 submitted 13 February, 2019; originally announced February 2019.

    Comments: 37 pages, 5 figures

    MSC Class: 62F15

    Journal ref: Nuclear Engineering and Design, 2020, Volume 359, 1 April 2020, 110391

  29. arXiv:1901.09532  [pdf, other

    cs.LG stat.ML

    Target Tracking for Contextual Bandits: Application to Demand Side Management

    Authors: Margaux Brégère, Pierre Gaillard, Yannig Goude, Gilles Stoltz

    Abstract: We propose a contextual-bandit approach for demand side management by offering price incentives. More precisely, a target mean consumption is set at each round and the mean consumption is modeled as a complex function of the distribution of prices sent and of some contextual variables such as the temperature, weather, and so on. The performance of our strategies is measured in quadratic losses thr… ▽ More

    Submitted 13 May, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

    Journal ref: ICML 2019 (Thirty-sixth International Conference on Machine Learning), Jun 2019, Long Beach, United States

  30. arXiv:1805.11386  [pdf, ps, other

    stat.ML cs.LG math.ST

    Uniform regret bounds over $R^d$ for the sequential linear regression problem with the square loss

    Authors: Pierre Gaillard, Sébastien Gerchinovitz, Malo Huard, Gilles Stoltz

    Abstract: We consider the setting of online linear regression for arbitrary deterministic sequences, with the square loss. We are interested in the aim set by Bartlett et al. (2015): obtain regret bounds that hold uniformly over all competitor vectors. When the feature sequence is known at the beginning of the game, they provided closed-form regret bounds of $2d B^2 \ln T + \mathcal{O}_T(1)$, where $T$ is t… ▽ More

    Submitted 25 February, 2019; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: Proceedings of ALT'2019

  31. arXiv:1805.09174  [pdf, other

    math.ST cs.LG

    Efficient online algorithms for fast-rate regret bounds under sparsity

    Authors: Pierre Gaillard, Olivier Wintenberger

    Abstract: We consider the online convex optimization problem. In the setting of arbitrary sequences and finite set of parameters, we establish a new fast-rate quantile regret bound. Then we investigate the optimization into the L1-ball by discretizing the parameter space. Our algorithm is projection free and we propose an efficient solution by restarting the algorithm on adaptive discretization grids. In th… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

  32. arXiv:1805.08531  [pdf, other

    cs.MA cs.DC stat.ML

    Accelerated Gossip in Networks of Given Dimension using Jacobi Polynomial Iterations

    Authors: Raphaël Berthier, Francis Bach, Pierre Gaillard

    Abstract: Consider a network of agents connected by communication links, where each agent holds a real value. The gossip problem consists in estimating the average of the values diffused in the network in a distributed manner. We develop a method solving the gossip problem that depends only on the spectral dimension of the network, that is, in the communication network set-up, the dimension of the space in… ▽ More

    Submitted 11 June, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

  33. arXiv:1702.08211  [pdf, ps, other

    stat.ML cs.LG math.ST

    Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

    Authors: Nicolò Cesa-Bianchi, Pierre Gaillard, Claudio Gentile, Sébastien Gerchinovitz

    Abstract: We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipschitz… ▽ More

    Submitted 30 June, 2017; v1 submitted 27 February, 2017; originally announced February 2017.

    Comments: This document is the full version of an extended abstract accepted for presentation at COLT 2017

  34. arXiv:1610.05022  [pdf, other

    math.ST

    Sparse Accelerated Exponential Weights

    Authors: Pierre Gaillard, Olivier Wintenberger

    Abstract: We consider the stochastic optimization problem where a convex function is minimized observing recursively the gradients. We introduce SAEW, a new procedure that accelerates exponential weights procedures with the slow rate $1/\sqrt{T}$ to procedures achieving the fast rate $1/T$. Under the strong convexity of the risk, we achieve the optimal rate of convergence for approximating sparse parameters… ▽ More

    Submitted 17 October, 2016; originally announced October 2016.

  35. arXiv:1503.07899  [pdf, ps, other

    math-ph nlin.SI

    Multi-parametric solutions to the NLS equation

    Authors: Pierre Gaillard

    Abstract: The structure of the solutions to the one dimensional focusing nonlin-ear Schr{ö}dinger equation (NLS) for the order N in terms of quasi rational functions is given here. We first give the proof that the solutions can be expressed as a ratio of two wronskians of order 2N and then two determinants by an exponential depending on t with 2N -- 2 parameters. It also is proved that for the order N , the… ▽ More

    Submitted 26 March, 2015; originally announced March 2015.

  36. arXiv:1502.07697  [pdf, other

    stat.ML cs.LG

    A Chaining Algorithm for Online Nonparametric Regression

    Authors: Pierre Gaillard, Sébastien Gerchinovitz

    Abstract: We consider the problem of online nonparametric regression with arbitrary deterministic sequences. Using ideas from the chaining technique, we design an algorithm that achieves a Dudley-type regret bound similar to the one obtained in a non-constructive fashion by Rakhlin and Sridharan (2014). Our regret bound is expressed in terms of the metric entropy in the sup norm, which yields optimal guaran… ▽ More

    Submitted 1 July, 2015; v1 submitted 26 February, 2015; originally announced February 2015.

    Comments: Published in the proceedings of COLT 2015: http://jmlr.org/proceedings/papers/v40/Gaillard15.html

  37. arXiv:1405.1533  [pdf, ps, other

    math.ST cs.LG stat.ML

    A consistent deterministic regression tree for non-parametric prediction of time series

    Authors: Pierre Gaillard, Paul Baudin

    Abstract: We study online prediction of bounded stationary ergodic processes. To do so, we consider the setting of prediction of individual sequences and build a deterministic regression tree that performs asymptotically as well as the best L-Lipschitz constant predictors. Then, we show why the obtained regret bound entails the asymptotical optimality with respect to the class of bounded stationary ergodic… ▽ More

    Submitted 8 May, 2014; v1 submitted 7 May, 2014; originally announced May 2014.

  38. arXiv:1402.2044  [pdf, ps, other

    stat.ML cs.LG math.ST

    A Second-order Bound with Excess Losses

    Authors: Pierre Gaillard, Gilles Stoltz, Tim Van Erven

    Abstract: We study online aggregation of the predictions of experts, and first show new second-order regret bounds in the standard setting, which are obtained via a version of the Prod algorithm (and also a version of the polynomially weighted average algorithm) with multiple learning rates. These bounds are in terms of excess losses, the differences between the instantaneous losses suffered by the algorith… ▽ More

    Submitted 10 February, 2014; originally announced February 2014.

  39. arXiv:1207.1965  [pdf, other

    stat.ML cs.LG stat.AP

    Forecasting electricity consumption by aggregating specialized experts

    Authors: Marie Devaine, Pierre Gaillard, Yannig Goude, Gilles Stoltz

    Abstract: We consider the setting of sequential prediction of arbitrary sequences based on specialized experts. We first provide a review of the relevant literature and present two theoretical contributions: a general analysis of the specialist aggregation rule of Freund et al. (1997) and an adaptation of fixed-share rules of Herbster and Warmuth (1998) in this setting. We then apply these rules to the sequ… ▽ More

    Submitted 9 July, 2012; originally announced July 2012.

    Comments: 33 pages

  40. arXiv:1202.3323  [pdf, ps, other

    cs.LG stat.ML

    Mirror Descent Meets Fixed Share (and feels no regret)

    Authors: Nicolò Cesa-Bianchi, Pierre Gaillard, Gabor Lugosi, Gilles Stoltz

    Abstract: Mirror descent with an entropic regularizer is known to achieve shifting regret bounds that are logarithmic in the dimension. This is done using either a carefully designed projection or by a weight sharing technique. Via a novel unified analysis, we show that these two approaches deliver essentially equivalent bounds on a notion of regret generalizing shifting, adaptive, discounted, and other rel… ▽ More

    Submitted 27 September, 2012; v1 submitted 15 February, 2012; originally announced February 2012.

    Journal ref: NIPS 2012, Lake Tahoe : United States (2012)

  41. arXiv:0809.1918  [pdf, ps, other

    math.GM

    The Gauss-Dirichlet Orbit Number

    Authors: Pierre-Yves Gaillard

    Abstract: Dirichlet computed in some particular cases the number of equivalence classes of representations of a nonzero integer by a representative system for the integral binary quadratic forms of a given discriminant. We complete this computation.

    Submitted 20 September, 2008; v1 submitted 11 September, 2008; originally announced September 2008.

    Comments: I changed one word in the abstract

  42. arXiv:0809.0550  [pdf, ps, other

    math.GM

    Hurwitz's Freeness Property

    Authors: Pierre-Yves Gaillard

    Abstract: The groupoid attached to the action of PSL(2,Z) on the irrational reals by linear fractional transformations is free.

    Submitted 27 October, 2008; v1 submitted 3 September, 2008; originally announced September 2008.

    Comments: LaTeX, 3 pages. Minor change

  43. arXiv:math/0510369  [pdf, ps, other

    math.NT

    Integral Congruences

    Authors: Pierre-Yves Gaillard

    Abstract: To each i, j belonging to some set of integers, attach the integer a(i,j). Are there integers x(i) such that x(j)-x(i) is congruent to a(i,j) mod (i,j)? A necessary condition is that a(i,j)+a(j,k) be congruent to a(i,k) mod (i,j,k). This condition is sufficient.

    Submitted 29 October, 2005; v1 submitted 18 October, 2005; originally announced October 2005.

    Comments: 9 pages, LaTeX. Results have been improved

  44. arXiv:math/0502574  [pdf, ps, other

    math.HO math.NT

    The functional equation of the zeta function of a global field

    Authors: Pierre-Yves Gaillard

    Abstract: We write down the functional equation of the zeta function of a global field. This equation is implicit in Weil's ``Basic Number Theory''.

    Submitted 28 February, 2005; originally announced February 2005.

    Comments: 2 pages, LaTeX

  45. arXiv:math/0412133  [pdf, ps, other

    math.GM

    Around the Chinese Remainder Theorem

    Authors: Jean-Marie Didry, Pierre-Yves Gaillard

    Abstract: We prove an explicit Chinese Remainder Theorem for one variable polynomials with complex coefficients, and derive some consequences.

    Submitted 24 December, 2008; v1 submitted 7 December, 2004; originally announced December 2004.

    Comments: New section, titled "Wronski"; 22 pages, LaTeX. Last version available at http://www.iecn.u-nancy.fr/~gaillard/DIVERS/Chinese.Remainder.Theorem/

  46. arXiv:math/0405053  [pdf, ps, other

    math.GM

    There are only countably many sets

    Authors: Pierre-Yves Gaillard

    Abstract: We prove that Bourbaki's mathematics is incomplete.

    Submitted 4 May, 2004; originally announced May 2004.

    Comments: 4 pages, LaTeX

  47. arXiv:math/0309296  [pdf, ps, other

    math.CT

    Grothendieck categories and support conditions

    Authors: Pierre-Yves Gaillard

    Abstract: We give examples of pairs (G1,G2) where G1 is a Grothendieck category and G2 a full Grothendieck subcategory of G1, the inclusion G2 --> G1 being denoted i, for which R^+i : D^+G2 --> D^+G1 (or even Ri : DG2 --> DG1) is a full embedding. This yields generalizations of some results of Bernstein and Lunts, and of Cline, Parshall and Scott.

    Submitted 16 March, 2004; v1 submitted 18 September, 2003; originally announced September 2003.

    Comments: 11 pages, LaTeX, minor changes

  48. arXiv:math/0303285  [pdf, ps, other

    math.RT

    About a Theorem of Cline, Parshall and Scott

    Authors: Pierre-Yves Gaillard

    Abstract: We give a simple proof of a Theorem of Cline, Parshall and Scott about the category O of BGG and suggest an analog for Harish-Chandra modules.

    Submitted 24 March, 2003; originally announced March 2003.

    Comments: 6 pages, LaTeX. Related material is available at http://www.iecn.u-nancy.fr/~gaillard

  49. arXiv:math/0004006  [pdf, ps, other

    math.QA

    A naive question about quantum groups

    Authors: Pierre-Yves Gaillard

    Abstract: The category O of BGG can be thought of as a category of sheaves over the flag variety F in the sense that the algebra E of self-extensions of the trivial object of O is isomorphic to the cohomology algebra of the flag variety. A deformation of O' - giving rise to a "new" algebra E' - can be thought of as a (possibly noncommutative) deformation F' of F. The mythic variety F', being a deformation… ▽ More

    Submitted 22 July, 2009; v1 submitted 2 April, 2000; originally announced April 2000.

    Comments: 4 pages, TeX

  50. arXiv:math/0003183  [pdf, ps, other

    math.RT

    A simple question about a complicated object

    Authors: Pierre-Yves Gaillard

    Abstract: Let n and k be positive integers with and k < n. Then of course SU(k,1) is contained into SU(n,1). Moreover, which is less clear - but proved by Khoroshkin -, the representation theory of SU(k,1) at the generalized infinitesimal character of the trivial module can be fully (and even Ext-fully) embedded into that of SU(n,1). Here is the obvious bet: This embedding is implemented by the cohomo… ▽ More

    Submitted 22 July, 2009; v1 submitted 28 March, 2000; originally announced March 2000.

    Comments: 4 pages, TeX