Showing 1–12 of 12 results for author: Nadjahi, K

  1. arXiv:2406.04047  [pdf, other]

    stat.ML cs.LG

    Slicing Mutual Information Generalization Bounds for Neural Networks

    Authors: Kimia Nadjahi, Kristjan Greenewald, Rickard Brüel Gabrielsson, Justin Solomon

    Abstract: The ability of machine learning (ML) algorithms to generalize well to unseen data has been studied through the lens of information theory, by bounding the generalization error with the input-output mutual information (MI), i.e., the MI between the training data and the learned hypothesis. Yet, these bounds have limited practicality for modern ML applications (e.g., deep learning), due to the diffi…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024

  2. arXiv:2402.16842  [pdf, other]

    cs.LG

    Asymmetry in Low-Rank Adapters of Foundation Models

    Authors: Jiacheng Zhu, Kristjan Greenewald, Kimia Nadjahi, Haitz Sáez de Ocáriz Borde, Rickard Brüel Gabrielsson, Leshem Choshen, Marzyeh Ghassemi, Mikhail Yurochkin, Justin Solomon

    Abstract: Parameter-efficient fine-tuning optimizes large, pre-trained foundation models by updating a subset of parameters; in this class, Low-Rank Adaptation (LoRA) is particularly effective. Inspired by an effort to investigate the different roles of LoRA matrices during fine-tuning, this paper characterizes and leverages unexpected asymmetry in the importance of low-rank adapter matrices. Specifically,…

    Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 17 pages, 2 figures, 9 tables

  3. arXiv:2310.01973  [pdf, other]

    cs.LG cs.DC

    Federated Wasserstein Distance

    Authors: Alain Rakotomamonjy, Kimia Nadjahi, Liva Ralaivola

    Abstract: We introduce a principled way of computing the Wasserstein distance between two distributions in a federated manner. Namely, we show how to estimate the Wasserstein distance between two samples stored and kept on different devices/clients whilst a central entity/server orchestrates the computations (again, without having access to the samples). To achieve this feat, we take advantage of the geomet…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 23 pages

  4. arXiv:2306.07176  [pdf, other]

    cs.LG math.OC

    Unbalanced Optimal Transport meets Sliced-Wasserstein

    Authors: Thibault Séjourné, Clément Bonet, Kilian Fatras, Kimia Nadjahi, Nicolas Courty

    Abstract: Optimal transport (OT) has emerged as a powerful framework to compare probability measures, a fundamental task in many statistical and machine learning problems. Substantial advances have been made over the last decade in designing OT variants which are either computationally and statistically more efficient, or more robust to the measures and datasets to compare. Among them, sliced OT distances h…

    Submitted 12 June, 2023; originally announced June 2023.

  5. arXiv:2206.03230  [pdf, other]

    stat.ML cs.LG

    Shedding a PAC-Bayesian Light on Adaptive Sliced-Wasserstein Distances

    Authors: Ruben Ohana, Kimia Nadjahi, Alain Rakotomamonjy, Liva Ralaivola

    Abstract: The Sliced-Wasserstein distance (SW) is a computationally efficient and theoretically grounded alternative to the Wasserstein distance. Yet, the literature on its statistical properties -- or, more accurately, its generalization properties -- with respect to the distribution of slices, beyond the uniform measure, is scarce. To bring new contributions to this line of research, we leverage the PAC-B…

    Submitted 31 May, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  6. arXiv:2106.15427  [pdf, other]

    stat.ML cs.LG

    Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections

    Authors: Kimia Nadjahi, Alain Durmus, Pierre E. Jacob, Roland Badeau, Umut Şimşekli

    Abstract: The Sliced-Wasserstein distance (SW) is being increasingly used in machine learning applications as an alternative to the Wasserstein distance and offers significant computational and statistical benefits. Since it is defined as an expectation over random projections, SW is commonly approximated by Monte Carlo. We adopt a new perspective to approximate SW by making use of the concentration of meas…

    Submitted 4 January, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2021

  7. arXiv:2003.05783  [pdf, other]

    stat.ML cs.LG

    Statistical and Topological Properties of Sliced Probability Divergences

    Authors: Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli

    Abstract: The idea of slicing divergences has been proven to be successful when comparing two probability measures in various machine learning applications including generative modeling, and consists in computing the expected value of a `base divergence' between one-dimensional random projections of the two measures. However, the topological, statistical, and computational consequences of this technique hav…

    Submitted 4 January, 2022; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Published at NeurIPS 2020 (Spotlight)

  8. arXiv:2002.12537  [pdf, other]

    stat.ML cs.LG

    Generalized Sliced Distances for Probability Distributions

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Shahin Shahrampour

    Abstract: Probability metrics have become an indispensable part of modern statistics and machine learning, and they play a quintessential role in various applications, including statistical hypothesis testing and generative modeling. However, in a practical setting, the convergence behavior of the algorithms built upon these distances have not been well established, except for a few specific cases. In this…

    Submitted 27 February, 2020; originally announced February 2020.

  9. arXiv:1910.12815  [pdf, other]

    stat.CO stat.ME stat.ML

    Approximate Bayesian Computation with the Sliced-Wasserstein Distance

    Authors: Kimia Nadjahi, Valentin De Bortoli, Alain Durmus, Roland Badeau, Umut Şimşekli

    Abstract: Approximate Bayesian Computation (ABC) is a popular method for approximate inference in generative models with intractable but easy-to-sample likelihood. It constructs an approximate posterior distribution by finding parameters for which the simulated data are close to the observations in terms of summary statistics. These statistics are defined beforehand and might induce a loss of information, w…

    Submitted 6 March, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Accepted at ICASSP 2020 (publication and oral presentation)

  10. arXiv:1907.05079  [pdf, other]

    cs.LG cs.AI stat.ML

    Safe Policy Improvement with Soft Baseline Bootstrapping

    Authors: Kimia Nadjahi, Romain Laroche, Rémi Tachet des Combes

    Abstract: Batch Reinforcement Learning (Batch RL) consists in training a policy using trajectories collected with another policy, called the behavioural policy. Safe policy improvement (SPI) provides guarantees with high probability that the trained policy performs better than the behavioural policy, also called baseline in this setting. Previous work shows that the SPI objective improves mean performance a…

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: Accepted paper at ECML-PKDD 2019

  11. arXiv:1906.04516  [pdf, other]

    stat.ML cs.LG

    Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance

    Authors: Kimia Nadjahi, Alain Durmus, Umut Şimşekli, Roland Badeau

    Abstract: Minimum expected distance estimation (MEDE) algorithms have been widely used for probabilistic models with intractable likelihood functions and they have become increasingly popular due to their use in implicit generative modeling (e.g. Wasserstein generative adversarial networks, Wasserstein autoencoders). Emerging from computational optimal transport, the Sliced-Wasserstein (SW) distance has bec…

    Submitted 24 March, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted at NeurIPS 2019 (publication and spotlight presentation)

  12. arXiv:1902.00434  [pdf, other]

    cs.LG stat.ML

    Generalized Sliced Wasserstein Distances

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Roland Badeau, Gustavo K. Rohde

    Abstract: The Wasserstein distance and its variations, e.g., the sliced-Wasserstein (SW) distance, have recently drawn attention from the machine learning community. The SW distance, specifically, was shown to have similar properties to the Wasserstein distance, while being much simpler to compute, and is therefore used in various applications including generative modeling and general supervised/unsupervise…

    Submitted 1 February, 2019; originally announced February 2019.