Skip to main content

Showing 1–35 of 35 results for author: Biau, G

.
  1. arXiv:2402.07514  [pdf, other

    cs.AI math.ST

    Physics-informed machine learning as a kernel method

    Authors: Nathan Doumèche, Francis Bach, Gérard Biau, Claire Boyer

    Abstract: Physics-informed machine learning combines the expressiveness of data-based approaches with the interpretability of physical models. In this context, we consider a general regression problem where the empirical risk is regularized by a partial differential equation that quantifies the physical inconsistency. We prove that for linear differential priors, the problem can be formulated as a kernel re… ▽ More

    Submitted 19 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  2. arXiv:2309.01213  [pdf, other

    stat.ML cs.LG

    Implicit regularization of deep residual networks towards neural ODEs

    Authors: Pierre Marion, Yu-Han Wu, Michael E. Sander, Gérard Biau

    Abstract: Residual neural networks are state-of-the-art deep learning models. Their continuous-depth analog, neural ordinary differential equations (ODEs), are also widely used. Despite their success, the link between the discrete and continuous models still lacks a solid mathematical foundation. In this article, we take a step in this direction by establishing an implicit regularization of deep residual ne… ▽ More

    Submitted 5 July, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: ICLR 2024 (spotlight). 40 pages, 3 figures

  3. arXiv:2305.01240  [pdf, other

    math.ST

    Convergence and error analysis of PINNs

    Authors: Nathan Doumèche, Gérard Biau, Claire Boyer

    Abstract: Physics-informed neural networks (PINNs) are a promising approach that combines the power of neural networks with the interpretability of physical modeling. PINNs have shown good practical performance in solving partial differential equations (PDEs) and in hybrid modeling scenarios, where physical models enhance data-driven approaches. However, it is essential to establish their theoretical proper… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  4. arXiv:2304.01862  [pdf, other

    stat.ME

    The insertion method to invert the signature of a path

    Authors: Adeline Fermanian, Jiawei Chang, Terry Lyons, Gérard Biau

    Abstract: The signature is a representation of a path as an infinite sequence of its iterated integrals. Under certain assumptions, the signature characterizes the path, up to translation and reparameterization. Therefore, a crucial question of interest is the development of efficient algorithms to invert the signature, i.e., to reconstruct the path from the information of its (truncated) signature. In this… ▽ More

    Submitted 19 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  5. arXiv:2206.06929  [pdf, other

    cs.LG stat.ML

    Scaling ResNets in the Large-depth Regime

    Authors: Pierre Marion, Adeline Fermanian, Gérard Biau, Jean-Philippe Vert

    Abstract: Deep ResNets are recognized for achieving state-of-the-art results in complex machine learning tasks. However, the remarkable performance of these architectures relies on a training procedure that needs to be carefully crafted to avoid vanishing or exploding gradients, particularly as the depth $L$ increases. No consensus has been reached on how to mitigate this issue, although a widely discussed… ▽ More

    Submitted 10 June, 2024; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 44 pages, 9 figures. Updated with clarifications and additional references

  6. arXiv:2201.02824  [pdf, other

    stat.ML cs.LG math.ST

    Optimal 1-Wasserstein Distance for WGANs

    Authors: Arthur Stéphanovitch, Ugo Tanielian, Benoît Cadre, Nicolas Klutchnikoff, Gérard Biau

    Abstract: The mathematical forces at work behind Generative Adversarial Networks raise challenging theoretical issues. Motivated by the important question of characterizing the geometrical properties of the generated distributions, we provide a thorough analysis of Wasserstein GANs (WGANs) in both the finite sample and asymptotic regimes. We study the specific case where the latent space is univariate and d… ▽ More

    Submitted 5 October, 2023; v1 submitted 8 January, 2022; originally announced January 2022.

  7. arXiv:2106.01202  [pdf, other

    stat.ML cs.LG

    Framing RNN as a kernel method: A neural ODE approach

    Authors: Adeline Fermanian, Pierre Marion, Jean-Philippe Vert, Gérard Biau

    Abstract: Building on the interpretation of a recurrent neural network (RNN) as a continuous-time neural differential equation, we show, under appropriate conditions, that the solution of a RNN can be viewed as a linear function of a specific feature set of the input sequence, known as the signature. This connection allows us to frame a RNN as a kernel method in a suitable reproducing kernel Hilbert space.… ▽ More

    Submitted 29 October, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: 33 pages, 7 figures, accepted for an oral presentation at NeurIPS 2021

  8. arXiv:2105.11724  [pdf, other

    stat.ML cs.LG

    SHAFF: Fast and consistent SHApley eFfect estimates via random Forests

    Authors: Clément Bénard, Gérard Biau, Sébastien da Veiga, Erwan Scornet

    Abstract: Interpretability of learning algorithms is crucial for applications involving critical decisions, and variable importance is one of the main interpretation tools. Shapley effects are now widely used to interpret both tree ensembles and neural networks, as they can efficiently handle dependence and interactions in the data, as opposed to most other variable importance measures. However, estimating… ▽ More

    Submitted 2 February, 2022; v1 submitted 25 May, 2021; originally announced May 2021.

  9. arXiv:2006.05254  [pdf, other

    stat.ML cs.LG

    Approximating Lipschitz continuous functions with GroupSort neural networks

    Authors: Ugo Tanielian, Maxime Sangnier, Gerard Biau

    Abstract: Recent advances in adversarial attacks and Wasserstein GANs have advocated for use of neural networks with restricted Lipschitz constants. Motivated by these observations, we study the recently introduced GroupSort neural networks, with constraints on the weights, and make a theoretical step towards a better understanding of their expressive power. We show in particular how these networks can repr… ▽ More

    Submitted 8 February, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: 16 pages

  10. arXiv:2006.04709  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Wasserstein Random Forests and Applications in Heterogeneous Treatment Effects

    Authors: Qiming Du, Gérard Biau, François Petit, Raphaël Porcher

    Abstract: We present new insights into causal inference in the context of Heterogeneous Treatment Effects by proposing natural variants of Random Forests to estimate the key conditional distributions. To achieve this, we recast Breiman's original splitting criterion in terms of Wasserstein distances between empirical measures. This reformulation indicates that Random Forests are well adapted to estimate con… ▽ More

    Submitted 15 February, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

  11. arXiv:2006.02682  [pdf, other

    cs.LG stat.ML

    Some Theoretical Insights into Wasserstein GANs

    Authors: Gérard Biau, Maxime Sangnier, Ugo Tanielian

    Abstract: Generative Adversarial Networks (GANs) have been successful in producing outstanding results in areas as diverse as image, video, and text generation. Building on these successes, a large number of empirical studies have validated the benefits of the cousin approach called Wasserstein GANs (WGANs), which brings stabilization in the training process. In the present paper, we add a new stone to the… ▽ More

    Submitted 18 June, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Journal ref: Journal of Machine Learning Research, Microtome Publishing, 2021

  12. arXiv:2004.14841  [pdf, other

    stat.ML cs.AI cs.LG

    Interpretable Random Forests via Rule Extraction

    Authors: Clément Bénard, Gérard Biau, Sébastien da Veiga, Erwan Scornet

    Abstract: We introduce SIRUS (Stable and Interpretable RUle Set) for regression, a stable rule learning algorithm which takes the form of a short and simple list of rules. State-of-the-art learning algorithms are often referred to as "black boxes" because of the high number of operations involved in their prediction process. Despite their powerful predictivity, this lack of interpretability may be highly re… ▽ More

    Submitted 10 February, 2021; v1 submitted 29 April, 2020; originally announced April 2020.

  13. arXiv:1908.06852  [pdf, other

    stat.ML cs.LG math.ST

    SIRUS: Stable and Interpretable RUle Set for Classification

    Authors: Clément Bénard, Gérard Biau, Sébastien da Veiga, Erwan Scornet

    Abstract: State-of-the-art learning algorithms, such as random forests or neural networks, are often qualified as "black-boxes" because of the high number and complexity of operations involved in their prediction mechanism. This lack of interpretability is a strong limitation for applications involving critical decisions, typically the analysis of production processes in the manufacturing industry. In such… ▽ More

    Submitted 16 December, 2020; v1 submitted 19 August, 2019; originally announced August 2019.

  14. arXiv:1803.07819  [pdf, other

    stat.ML cs.LG

    Some Theoretical Properties of GANs

    Authors: G. Biau, B. Cadre, M. Sangnier, U. Tanielian

    Abstract: Generative Adversarial Networks (GANs) are a class of generative algorithms that have been shown to produce state-of-the art samples, especially in the domain of image creation. The fundamental principle of GANs is to approximate the unknown distribution of a given data set by optimizing an objective function through an adversarial game between a family of generators and a family of discriminators… ▽ More

    Submitted 21 March, 2018; originally announced March 2018.

  15. arXiv:1803.02042  [pdf, other

    stat.ML cs.LG

    Accelerated Gradient Boosting

    Authors: Gérard Biau, Benoît Cadre, Laurent Rouvìère

    Abstract: Gradient tree boosting is a prediction algorithm that sequentially produces a model in the form of linear combinations of decision trees, by solving an infinite-dimensional optimization problem. We combine gradient boosting and Nesterov's accelerated descent to design a new algorithm, which we call AGB (for Accelerated Gradient Boosting). Substantial numerical evidence is provided on both syntheti… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

  16. arXiv:1707.05023  [pdf, other

    math.ST cs.LG

    Optimization by gradient boosting

    Authors: Gérard Biau, Benoît Cadre

    Abstract: Gradient boosting is a state-of-the-art prediction technique that sequentially produces a model in the form of linear combinations of simple predictors---typically decision trees---by solving an infinite-dimensional convex optimization problem. We provide in the present paper a thorough analysis of two widespread versions of gradient boosting, and introduce a general framework for studying these a… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

  17. arXiv:1604.07143  [pdf, other

    stat.ML cs.LG math.ST

    Neural Random Forests

    Authors: Gérard Biau, Erwan Scornet, Johannes Welbl

    Abstract: Given an ensemble of randomized regression trees, it is possible to restructure them as a collection of multilayered neural networks with particular connection weights. Following this principle, we reformulate the random forest method of Breiman (2001) into a neural network setting, and in turn propose two new hybrid procedures that we call neural random forests. Both predictors exploit prior know… ▽ More

    Submitted 3 April, 2018; v1 submitted 25 April, 2016; originally announced April 2016.

  18. arXiv:1511.05741  [pdf, other

    math.ST stat.ML

    A Random Forest Guided Tour

    Authors: Gérard Biau, Erwan Scornet

    Abstract: The random forest algorithm, proposed by L. Breiman in 2001, has been extremely successful as a general-purpose classification and regression method. The approach, which combines several randomized decision trees and aggregates their predictions by averaging, has shown excellent performance in settings where the number of variables is much larger than the number of observations. Moreover, it is ve… ▽ More

    Submitted 18 November, 2015; originally announced November 2015.

  19. arXiv:1507.00171  [pdf, other

    math.ST stat.AP stat.ME

    The Statistical Performance of Collaborative Inference

    Authors: Gérard Biau, Kevin Bleakley, Benoit Cadre

    Abstract: The statistical analysis of massive and complex data sets will require the development of algorithms that depend on distributed computing and collaborative inference. Inspired by this, we propose a collaborative framework that aims to estimate the unknown mean $θ$ of a random variable $X$. In the model we present, a certain number of calculation units, distributed across a communication network re… ▽ More

    Submitted 1 July, 2015; originally announced July 2015.

  20. arXiv:1504.01702  [pdf, other

    math.ST stat.AP

    Long signal change-point detection

    Authors: Gérard Biau, Kevin Bleakley, David Mason

    Abstract: The detection of change-points in a spatially or time ordered data sequence is an important problem in many fields such as genetics and finance. We derive the asymptotic distribution of a statistic recently suggested for detecting change-points. Simulation of its estimated limit distribution leads to a new and computationally efficient change-point detection algorithm, which can be used on very lo… ▽ More

    Submitted 30 September, 2015; v1 submitted 7 April, 2015; originally announced April 2015.

  21. arXiv:1410.4029  [pdf, ps, other

    math.ST

    Cox process functional learning

    Authors: Gérard Biau, Benoît Cadre, Quentin Paris

    Abstract: This article addresses the problem of functional supervised classification of Cox process trajectories, whose random intensity is driven by some exogenous random covariable. The classification task is achieved through a regularized convex empirical risk minimization procedure, and a nonasymptotic oracle inequality is derived. We show that the algorithm provides a Bayes-risk consistent classifier.… ▽ More

    Submitted 15 October, 2014; originally announced October 2014.

    MSC Class: 62G05; 62G20

  22. arXiv:1407.4373  [pdf, other

    math.ST stat.ML

    Online Asynchronous Distributed Regression

    Authors: Gérard Biau, Ryad Zenine

    Abstract: Distributed computing offers a high degree of flexibility to accommodate modern learning constraints and the ever increasing size of datasets involved in massive data issues. Drawing inspiration from the theory of distributed computation models developed in the context of gradient-type optimization algorithms, we present a consensus-based asynchronous distributed approach for nonparametric online… ▽ More

    Submitted 16 July, 2014; originally announced July 2014.

  23. arXiv:1405.2881  [pdf, ps, other

    math.ST stat.ML

    Consistency of random forests

    Authors: Erwan Scornet, Gérard Biau, Jean-Philippe Vert

    Abstract: Random forests are a learning algorithm proposed by Breiman [Mach. Learn. 45 (2001) 5--32] that combines several randomized decision trees and aggregates their predictions by averaging. Despite its wide usage and outstanding practical performance, little is known about the mathematical properties of the procedure. This disparity between theory and practice originates in the difficulty to simultane… ▽ More

    Submitted 8 August, 2015; v1 submitted 12 May, 2014; originally announced May 2014.

    Journal ref: Annals of Statistics, Institute of Mathematical Statistics (IMS), 2015, 43 (4), pp.1716-1741

  24. arXiv:1311.0587  [pdf, ps, other

    math.ST

    High-dimensional $p$-norms

    Authors: Gérard Biau, David D. M. Mason

    Abstract: Let $\bX=(X_1, \hdots, X_d)$ be a $\mathbb R^d$-valued random vector with i.i.d. components, and let $\Vert\bX\Vert_p= (\sum_{j=1}^d|X_j|^p)^{1/p}$ be its $p$-norm, for $p>0$. The impact of letting $d$ go to infinity on $\Vert\bX\Vert_p$ has surprising consequences, which may dramatically affect high-dimensional data processing. This effect is usually referred to as the {\it distance concentration… ▽ More

    Submitted 4 November, 2013; originally announced November 2013.

    Comments: 19 pages

  25. COBRA: A Combined Regression Strategy

    Authors: Gérard Biau, Aurélie Fischer, Benjamin Guedj, James Malley

    Abstract: A new method for combining several initial estimators of the regression function is introduced. Instead of building a linear or convex optimized combination over a collection of basic estimators $r_1,\dots,r_M$, we use them as a collective indicator of the proximity between the training data and a test observation. This local distance approach is model-free and very fast. More specifically, the re… ▽ More

    Submitted 23 May, 2019; v1 submitted 9 March, 2013; originally announced March 2013.

    Comments: 42 pages

    Journal ref: Journal of Multivariate Analysis (2016), vol. 146, 18--28

  26. arXiv:1301.4679  [pdf, ps, other

    stat.ML cs.LG math.ST

    Cellular Tree Classifiers

    Authors: Gérard Biau, Luc Devroye

    Abstract: The cellular tree classifier model addresses a fundamental problem in the design of classifiers for a parallel or distributed computing world: Given a data set, is it sufficient to apply a majority rule for classification, or shall one split the data into two or more parts and send each part to a potentially different computer (or cell) for further processing? At first sight, it seems impossible t… ▽ More

    Submitted 25 June, 2013; v1 submitted 20 January, 2013; originally announced January 2013.

  27. arXiv:1207.6461  [pdf, ps, other

    math.ST

    New Insights Into Approximate Bayesian Computation

    Authors: Gérard Biau, Frédéric Cérou, Arnaud Guyader

    Abstract: Approximate Bayesian Computation (ABC for short) is a family of computational techniques which offer an almost automated solution in situations where evaluation of the posterior likelihood is computationally prohibitive, or whenever suitable likelihoods are not available. In the present paper, we analyze the procedure from the point of view of k-nearest neighbor theory and explore the statistical… ▽ More

    Submitted 3 June, 2013; v1 submitted 27 July, 2012; originally announced July 2012.

  28. arXiv:1201.0586  [pdf, ps, other

    math.ST

    An Affine Invariant $k$-Nearest Neighbor Regression Estimate

    Authors: Gérard Biau, Luc Devroye, Vida Dujmovic, Adam Krzyzak

    Abstract: We design a data-dependent metric in $\mathbb R^d$ and use it to define the $k$-nearest neighbors of a given point. Our metric is invariant under all affine transformations. We show that, with this metric, the standard $k$-nearest neighbor regression estimate is asymptotically consistent under the usual conditions on $k$, and minimal requirements on the input data.

    Submitted 18 May, 2012; v1 submitted 3 January, 2012; originally announced January 2012.

  29. arXiv:1101.3229  [pdf, other

    math.ST

    Sparse single-index model

    Authors: Pierre Alquier, Gérard Biau

    Abstract: Let $(\bX, Y)$ be a random pair taking values in $\mathbb R^p \times \mathbb R$. In the so-called single-index model, one has $Y=f^{\star}(θ^{\star T}\bX)+\bW$, where $f^{\star}$ is an unknown univariate measurable function, $θ^{\star}$ is an unknown vector in $\mathbb R^d$, and $W$ denotes a random noise satisfying $\mathbb E[\bW|\bX]=0$. The single-index model is known to offer a flexible way to… ▽ More

    Submitted 6 October, 2011; v1 submitted 17 January, 2011; originally announced January 2011.

    Journal ref: Journal of Machine Learning Research 14 (2013) 243-280

  30. Statistical analysis of $k$-nearest neighbor collaborative recommendation

    Authors: Gérard Biau, Benoît Cadre, Laurent Rouvière

    Abstract: Collaborative recommendation is an information-filtering technique that attempts to present information items that are likely of interest to an Internet user. Traditionally, collaborative systems deal with situations with two types of variables, users and items. In its most common form, the problem is framed as trying to estimate ratings for items that have not yet been consumed by a user. Despite… ▽ More

    Submitted 4 October, 2010; originally announced October 2010.

    Comments: Published in at http://dx.doi.org/10.1214/09-AOS759 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS759

    Journal ref: Annals of Statistics 2010, Vol. 38, No. 3, 1568-1592

  31. arXiv:1005.0208  [pdf, other

    stat.ML math.ST

    Analysis of a Random Forests Model

    Authors: Gérard Biau

    Abstract: Random forests are a scheme proposed by Leo Breiman in the 2000's for building a predictor ensemble with a set of decision trees that grow in randomly selected subspaces of data. Despite growing interest and practical use, there has been little exploration of the statistical properties of random forests, and little is known about the mathematical forces driving the algorithm. In this paper, we off… ▽ More

    Submitted 26 March, 2012; v1 submitted 3 May, 2010; originally announced May 2010.

  32. arXiv:1003.5089  [pdf, ps, other

    math.ST

    PCA-Kernel Estimation

    Authors: Gérard Biau, André Mas

    Abstract: Many statistical estimation techniques for high-dimensional or functional data are based on a preliminary dimension reduction step, which consists in projecting the sample $\bX_1, \hdots, \bX_n$ onto the first $D$ eigenvectors of the Principal Component Analysis (PCA) associated with the empirical projector $\hat Π_D$. Classical nonparametric inference methods such as kernel density estimation or… ▽ More

    Submitted 26 March, 2010; originally announced March 2010.

  33. arXiv:0910.2340  [pdf, ps, other

    stat.ML math.ST

    A Stochastic Model for Collaborative Recommendation

    Authors: Gérard Biau, Benoit Cadre, Laurent Rouvière

    Abstract: Collaborative recommendation is an information-filtering technique that attempts to present information items (movies, music, books, news, images, Web pages, etc.) that are likely of interest to the Internet user. Traditionally, collaborative systems deal with situations with two types of variables, users and items. In its most common form, the problem is framed as trying to estimate ratings for… ▽ More

    Submitted 13 October, 2009; originally announced October 2009.

  34. arXiv:0908.2503  [pdf, ps, other

    stat.ME math.ST

    Sequential Quantile Prediction of Time Series

    Authors: Gérard Biau, Benoît Patra

    Abstract: Motivated by a broad range of potential applications, we address the quantile prediction problem of real-valued time series. We present a sequential quantile forecasting model based on the combination of a set of elementary nearest neighbor-type predictors called "experts" and show its consistency under a minimum of conditions. Our approach builds on the methodology developed in recent years for p… ▽ More

    Submitted 31 May, 2010; v1 submitted 18 August, 2009; originally announced August 2009.

  35. arXiv:0801.0327  [pdf, ps, other

    stat.ME math.PR

    Nonparametric sequential prediction of time series

    Authors: Gérard Biau, Kevin Bleakley, László Györfi, György Ottucsák

    Abstract: Time series prediction covers a vast field of every-day statistical applications in medical, environmental and economic domains. In this paper we develop nonparametric prediction strategies based on the combination of a set of 'experts' and show the universal consistency of these strategies under a minimum of conditions. We perform an in-depth analysis of real-world data sets and show that these… ▽ More

    Submitted 1 January, 2008; originally announced January 2008.

    Comments: article + 2 figures

    MSC Class: 62G99