Showing 1–26 of 26 results for author: d'Alché-Buc, F

Searching in archive stat.
  1. arXiv:2406.09253

    stat.ML cs.LG

    Deep Sketched Output Kernel Regression for Structured Prediction

    Authors: Tamim El Ahmad, Junjie Yang, Pierre Laforgue, Florence d'Alché-Buc

    Abstract: By leveraging the kernel trick in the output space, kernel-induced losses provide a principled way to define structured output prediction tasks for a wide variety of output modalities. In particular, they have been successfully used in the context of surrogate non-parametric regression, where the kernel trick is typically exploited in the input space as well. However, when inputs are images or tex…

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2312.16139

    stat.ME cs.LG stat.ML

    Anomaly component analysis

    Authors: Romain Valla, Pavlo Mozharovskyi, Florence d'Alché-Buc

    Abstract: At the crossroads of machine learning and data analysis, anomaly detection aims at identifying observations that exhibit abnormal behaviour. Be it measurement errors, disease development, severe weather, production quality default(s) (items) or failed equipment, financial frauds or crisis events, their on-time identification and isolation constitute an important task in almost any area of industry a…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 41 pages, 25 figures, 13 tables

  3. arXiv:2312.14136

    stat.ML cs.LG

    Fast kernel half-space depth for data with non-convex supports

    Authors: Arturo Castellanos, Pavlo Mozharovskyi, Florence d'Alché-Buc, Hicham Janati

    Abstract: Data depth is a statistical function that generalizes order and quantiles to the multivariate setting and beyond, with applications spanning descriptive and visual statistics, anomaly detection, testing, etc. The celebrated halfspace depth exploits data geometry via an optimization program to deliver properties of invariances, robustness, and non-parametricity. Nevertheless, it implicitly ass…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 30 pages

  4. arXiv:2311.01434

    cs.LG cs.AI stat.ML

    Tailoring Mixup to Data for Calibration

    Authors: Quentin Bouniot, Pavlo Mozharovskyi, Florence d'Alché-Buc

    Abstract: Among all data augmentation techniques proposed so far, linear interpolation of training samples, also called Mixup, has been found to be effective for a wide range of applications. Along with improved performance, Mixup is also a good technique for improving calibration and predictive uncertainty. However, mixing data carelessly can lead to manifold intrusion, i.e., conflicts between the synthetic la…

    Submitted 11 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  5. arXiv:2309.16604

    stat.ML cs.LG

    Exploiting Edge Features in Graphs with Fused Network Gromov-Wasserstein Distance

    Authors: Junjie Yang, Matthieu Labeau, Florence d'Alché-Buc

    Abstract: Pairwise comparison of graphs is key to many applications in machine learning, ranging from clustering and kernel-based classification/regression to, more recently, supervised graph prediction. Distances between graphs usually rely on informative representations of these structured objects such as bag of substructures or other graph embeddings. A recently popular solution consists in representing graph…

    Submitted 28 September, 2023; originally announced September 2023.

  6. arXiv:2302.10128

    stat.ML cs.LG

    Sketch In, Sketch Out: Accelerating both Learning and Inference for Structured Prediction with Kernels

    Authors: Tamim El Ahmad, Luc Brogat-Motte, Pierre Laforgue, Florence d'Alché-Buc

    Abstract: Leveraging the kernel trick in both the input and output spaces, surrogate kernel methods are a flexible and theoretically grounded solution to structured output prediction. While they provide state-of-the-art performance on complex data sets of moderate size (e.g., in chemoinformatics), these approaches fail to scale. We propose to equip surrogate kernel methods with sketching-based approxim…

    Submitted 6 May, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

    Journal ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:109-117, 2024

  7. arXiv:2211.08958

    stat.ML cs.LG

    Vector-Valued Least-Squares Regression under Output Regularity Assumptions

    Authors: Luc Brogat-Motte, Alessandro Rudi, Céline Brouard, Juho Rousu, Florence d'Alché-Buc

    Abstract: We propose and analyse a reduced-rank method for solving least-squares regression problems with infinite dimensional output. We derive learning bounds for our method, and study the settings in which statistical performance improves over the full-rank method. Our analysis extends the interest of reduced-rank regression beyond the standard low-rank setting to more general output regularity…

    Submitted 16 November, 2022; originally announced November 2022.

  8. arXiv:2206.08220

    stat.ML cs.LG

    Functional Output Regression with Infimal Convolution: Exploring the Huber and $ε$-insensitive Losses

    Authors: Alex Lambert, Dimitri Bouche, Zoltan Szabo, Florence d'Alché-Buc

    Abstract: The focus of the paper is functional output regression (FOR) with convoluted losses. While most existing works consider the square loss setting, we leverage extensions of the Huber and the $ε$-insensitive loss (induced by infimal convolution) and propose a flexible framework capable of handling various forms of outliers and sparsity in the FOR family. We derive computationally tractable algorithms…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 24 pages, ICML 2022

  9. arXiv:2206.03827

    stat.ML cs.LG

    Fast Kernel Methods for Generic Lipschitz Losses via $p$-Sparsified Sketches

    Authors: Tamim El Ahmad, Pierre Laforgue, Florence d'Alché-Buc

    Abstract: Kernel methods are learning algorithms that enjoy solid theoretical foundations while suffering from important computational limitations. Sketching, which consists of looking for solutions in a subspace of reduced dimension, is a well-studied approach to alleviate these computational burdens. However, statistically-accurate sketches, such as the Gaussian one, usually contain few null entries, s…

    Submitted 6 November, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Journal ref: Transactions on Machine Learning Research (2023)

  10. arXiv:2204.09362

    cs.LG stat.AP stat.ML

    Wind power predictions from nowcasts to 4-hour forecasts: a learning approach with variable selection

    Authors: Dimitri Bouche, Rémi Flamary, Florence d'Alché-Buc, Riwal Plougonven, Marianne Clausel, Jordi Badosa, Philippe Drobinski

    Abstract: We study short-term prediction of wind speed and wind power (every 10 minutes up to 4 hours ahead). Accurate forecasts for these quantities are crucial to mitigate the negative effects of wind farms' intermittent production on energy systems and markets. We use machine learning to combine outputs from numerical weather prediction models with local observations. The former provide valuable informat…

    Submitted 13 December, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  11. arXiv:2202.03813

    stat.ML cs.LG

    Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters

    Authors: Luc Brogat-Motte, Rémi Flamary, Céline Brouard, Juho Rousu, Florence d'Alché-Buc

    Abstract: This paper introduces a novel and generic framework to solve the flagship task of supervised labeled graph prediction by leveraging Optimal Transport tools. We formulate the problem as regression with the Fused Gromov-Wasserstein (FGW) loss and propose a predictive model relying on an FGW barycenter whose weights depend on inputs. First, we introduce a non-parametric estimator based on kernel ridge…

    Submitted 24 June, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  12. arXiv:2103.12711

    stat.ML cs.LG

    A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions

    Authors: Guillaume Staerman, Pavlo Mozharovskyi, Pierre Colombo, Stéphan Clémençon, Florence d'Alché-Buc

    Abstract: The design of a metric between probability distributions is a longstanding problem motivated by numerous applications in Machine Learning. Focusing on continuous probability distributions on the Euclidean space $\mathbb{R}^d$, we introduce a novel pseudo-metric between probability distributions by leveraging the extension of univariate quantiles to multivariate spaces. Data depth is a nonparametri…

    Submitted 10 October, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

  13. arXiv:2102.05075

    stat.ML cs.LG

    Emotion Transfer Using Vector-Valued Infinite Task Learning

    Authors: Alex Lambert, Sanjeel Parekh, Zoltán Szabó, Florence d'Alché-Buc

    Abstract: Style transfer is a significant problem of machine learning with numerous successful applications. In this work, we present a novel style transfer framework building upon infinite task learning and vector-valued reproducing kernel Hilbert spaces. We instantiate the idea in emotion transfer where the goal is to transform facial images to different target emotions. The proposed approach provides a p…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 17 pages, 10 figures

  14. arXiv:2010.09345

    cs.LG stat.ML

    A Framework to Learn with Interpretation

    Authors: Jayneel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc

    Abstract: To tackle interpretability in deep learning, we present a novel framework to jointly learn a predictive model and its associated interpretation model. The interpreter provides both local and global interpretability about the predictive model in terms of human-understandable high-level attribute functions, with minimal loss of accuracy. This is achieved by a dedicated architecture and well-chosen r…

    Submitted 23 February, 2022; v1 submitted 19 October, 2020; originally announced October 2020.

  15. arXiv:2007.14703

    stat.ML cs.LG

    Learning Output Embeddings in Structured Prediction

    Authors: Luc Brogat-Motte, Alessandro Rudi, Céline Brouard, Juho Rousu, Florence d'Alché-Buc

    Abstract: A powerful and flexible approach to structured prediction consists of embedding the structured objects to be predicted into a feature space of possibly infinite dimension by means of output kernels, and then solving a regression problem in this output space. A prediction in the original space is computed by solving a pre-image problem. In such an approach, the embedding, linked to the target loss…

    Submitted 2 November, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

  16. arXiv:2006.10325

    stat.ML cs.LG

    When OT meets MoM: Robust estimation of Wasserstein Distance

    Authors: Guillaume Staerman, Pierre Laforgue, Pavlo Mozharovskyi, Florence d'Alché-Buc

    Abstract: Stemming from Optimal Transport, the Wasserstein distance has gained importance in Machine Learning due to its appealing geometrical properties and the increasing availability of efficient approximations. In this work, we consider the problem of estimating the Wasserstein distance between two probability distributions when observations are polluted by outliers. To that end, we investigate how to lev…

    Submitted 18 February, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021

  17. arXiv:2003.12206

    cs.LG stat.ML

    Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)

    Authors: Joelle Pineau, Philippe Vincent-Lamarre, Koustuv Sinha, Vincent Larivière, Alina Beygelzimer, Florence d'Alché-Buc, Emily Fox, Hugo Larochelle

    Abstract: One of the challenges in machine learning research is to ensure that presented and published results are sound and reliable. Reproducibility, that is, obtaining results similar to those presented in a paper or talk, using the same code and data (when available), is a necessary step to verify the reliability of research findings. Reproducibility is also an important step to promote open and accessible res…

    Submitted 30 December, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: To appear at JMLR, 16 pages + Appendix

  18. arXiv:2003.01432

    stat.ML cs.LG

    Nonlinear Functional Output Regression: a Dictionary Approach

    Authors: Dimitri Bouche, Marianne Clausel, François Roueff, Florence d'Alché-Buc

    Abstract: To address functional-output regression, we introduce projection learning (PL), a novel dictionary-based approach that learns to predict a function that is expanded on a dictionary while minimizing an empirical risk based on a functional loss. PL makes it possible to use non-orthogonal dictionaries and can then be combined with dictionary learning; it is thus much more flexible than expansion-base…

    Submitted 26 February, 2021; v1 submitted 3 March, 2020; originally announced March 2020.

  19. arXiv:1910.04621

    stat.ML cs.LG

    Duality in RKHSs with Infinite Dimensional Outputs: Application to Robust Losses

    Authors: Pierre Laforgue, Alex Lambert, Luc Brogat-Motte, Florence d'Alché-Buc

    Abstract: Operator-Valued Kernels (OVKs) and associated vector-valued Reproducing Kernel Hilbert Spaces provide an elegant way to extend scalar kernel methods when the output space is a Hilbert space. Although primarily used in finite dimension for problems like multi-task regression, the ability of this framework to deal with infinite dimensional output spaces unlocks many more applications, such as functi…

    Submitted 21 August, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

  20. arXiv:1904.04573

    stat.ML cs.LG

    Functional Isolation Forest

    Authors: Guillaume Staerman, Pavlo Mozharovskyi, Stephan Clémençon, Florence d'Alché-Buc

    Abstract: For the purpose of monitoring the behavior of complex infrastructures (e.g. aircraft, transport or energy networks), high-rate sensors are deployed to capture multivariate data, generally unlabeled, in quasi continuous-time to quickly detect the occurrence of anomalies that may jeopardize the smooth operation of the system of interest. The statistical analysis of such massive data of functional n…

    Submitted 9 October, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

  21. arXiv:1805.11028

    stat.ML cs.LG

    Autoencoding any Data through Kernel Autoencoders

    Authors: Pierre Laforgue, Stephan Clémençon, Florence d'Alché-Buc

    Abstract: This paper investigates a novel algorithmic approach to data representation based on kernel methods. Assuming that the observations lie in a Hilbert space X, the introduced Kernel Autoencoder (KAE) is the composition of mappings from vector-valued Reproducing Kernel Hilbert Spaces (vv-RKHSs) that minimizes the expected reconstruction error. Beyond a first extension of the autoencoding scheme to po…

    Submitted 2 December, 2020; v1 submitted 28 May, 2018; originally announced May 2018.

  22. arXiv:1805.08809

    cs.LG stat.ML

    Infinite-Task Learning with RKHSs

    Authors: Romain Brault, Alex Lambert, Zoltán Szabó, Maxime Sangnier, Florence d'Alché-Buc

    Abstract: Machine learning has witnessed tremendous success in solving tasks depending on a single hyperparameter. When considering simultaneously a finite number of tasks, multi-task learning enables one to account for the similarities of the tasks via appropriate regularizers. A step further consists of learning a continuum of tasks for various loss functions. A promising approach, called \emph{Parametric…

    Submitted 11 October, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: 23 pages, 6 figures, 3 tables

    MSC Class: 46E22; 62G08; 65D15; 47B32 ACM Class: I.2.6

  23. arXiv:1803.08355

    cs.LG cs.AI stat.ML

    Structured Output Learning with Abstention: Application to Accurate Opinion Prediction

    Authors: Alexandre Garcia, Slim Essid, Chloé Clavel, Florence d'Alché-Buc

    Abstract: Motivated by Supervised Opinion Analysis, we propose a novel framework devoted to Structured Output Learning with Abstention (SOLA). The structure prediction model is able to abstain from predicting some labels in the structured output at a cost chosen by the user in a flexible way. For that purpose, we decompose the problem into the learning of a pair of predictors, one devoted to structured abst…

    Submitted 8 June, 2018; v1 submitted 22 March, 2018; originally announced March 2018.

    Journal ref: Proceedings of Machine Learning Research 80 (2018) 1695-1703

  24. arXiv:1605.02536

    cs.LG stat.ML

    Random Fourier Features for Operator-Valued Kernels

    Authors: Romain Brault, Florence d'Alché-Buc, Markus Heinonen

    Abstract: Devoted to multi-task learning and structured output learning, operator-valued kernels provide a flexible tool to build vector-valued functions in the context of Reproducing Kernel Hilbert Spaces. To scale up these methods, we extend the celebrated Random Fourier Feature methodology to get an approximation of operator-valued kernels. We propose a general principle for Operator-valued Random Fourie…

    Submitted 24 May, 2016; v1 submitted 9 May, 2016; originally announced May 2016.

    Comments: 32 pages, 6 figures

    Report number: PMLR 63:110-125

    Journal ref: ACML, Hamilton, New-Zealand, JMLR Workshop and Conference Proceedings, November 2016, vol. 63, pp. 110-125

  25. arXiv:1411.5172

    cs.LG stat.ML

    Learning nonparametric differential equations with operator-valued kernels and gradient matching

    Authors: Markus Heinonen, Florence d'Alché-Buc

    Abstract: Modeling dynamical systems with ordinary differential equations implies a mechanistic view of the process underlying the dynamics. However, in many cases, this knowledge is not available. To overcome this issue, we introduce a general framework for nonparametric ODE models using penalized regression in Reproducing Kernel Hilbert Spaces (RKHS) based on operator-valued kernels. Moreover, we extend th…

    Submitted 19 November, 2014; originally announced November 2014.

  26. Parametric Estimation of Ordinary Differential Equations with Orthogonality Conditions

    Authors: Nicolas J-B Brunel, Quentin Clairon, Florence d'Alche-Buc

    Abstract: Differential equations are commonly used to model dynamical deterministic systems in applications. When statistical parameter estimation is required to calibrate theoretical models to data, classical statistical estimators are often confronted with complex and potentially ill-posed optimization problems. As a consequence, alternative estimators to classical parametric estimators are needed for obtain…

    Submitted 28 October, 2014; originally announced October 2014.

    Comments: 35 pages, 5 figures

    Journal ref: Brunel, N. JB, Clairon, Q., d Alche Buc, F. (2014). Parametric Estimation of Ordinary Differential Equations With Orthogonality Conditions. Journal of the American Statistical Association, 109(505), 173-185