Showing 1–20 of 20 results for author: Boudiaf, M

  1. arXiv:2404.02285

    cs.CV

    LP++: A Surprisingly Strong Linear Probe for Few-Shot CLIP

    Authors: Yunshi Huang, Fereshteh Shakeri, Jose Dolz, Malik Boudiaf, Houda Bahig, Ismail Ben Ayed

    Abstract: In a recent, strongly emergent literature on few-shot CLIP adaptation, Linear Probe (LP) has been often reported as a weak baseline. This has motivated intensive research building convoluted prompt learning or feature adaptation strategies. In this work, we propose and examine from convex-optimization perspectives a generalization of the standard LP baseline, in which the linear classifier weights…

    Submitted 2 April, 2024; originally announced April 2024.

  2. arXiv:2403.03883

    cs.CL

    SaulLM-7B: A pioneering Large Language Model for Law

    Authors: Pierre Colombo, Telmo Pessoa Pires, Malik Boudiaf, Dominic Culver, Rui Melo, Caio Corro, Andre F. T. Martins, Fabrizio Esposito, Vera Lúcia Raposo, Sofia Morgado, Michael Desa

    Abstract: In this paper, we introduce SaulLM-7B, a large language model (LLM) tailored for the legal domain. With 7 billion parameters, SaulLM-7B is the first LLM designed explicitly for legal text comprehension and generation. Leveraging the Mistral 7B architecture as its foundation, SaulLM-7B is trained on an English legal corpus of over 30 billion tokens. SaulLM-7B exhibits state-of-the-art proficiency i…

    Submitted 7 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2310.13998

    cs.CL

    Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models

    Authors: Pierre Colombo, Victor Pellegrain, Malik Boudiaf, Victor Storchan, Myriam Tami, Ismail Ben Ayed, Celine Hudelot, Pablo Piantanida

    Abstract: Proprietary and closed APIs are becoming increasingly common to process natural language, and are impacting the practical applications of natural language processing, including few-shot classification. Few-shot classification involves training a model to perform a new classification task with a handful of labeled data. This paper presents three contributions. First, we introduce a scenario where t…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  4. arXiv:2310.02416

    cs.LG cs.CV

    Bag of Tricks for Fully Test-Time Adaptation

    Authors: Saypraseuth Mounsaveng, Florent Chiaroni, Malik Boudiaf, Marco Pedersoli, Ismail Ben Ayed

    Abstract: Fully Test-Time Adaptation (TTA), which aims at adapting models to data drifts, has recently attracted wide interest. Numerous tricks and techniques have been proposed to ensure robust learning on arbitrary streams of unlabeled data. However, assessing the true impact of each individual technique and obtaining a fair comparison still constitutes a significant challenge. To help consolidate the com…

    Submitted 9 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted at WACV 2024

  5. arXiv:2302.06658

    cs.LG

    In Search for a Generalizable Method for Source Free Domain Adaptation

    Authors: Malik Boudiaf, Tom Denton, Bart van Merriënboer, Vincent Dumoulin, Eleni Triantafillou

    Abstract: Source-free domain adaptation (SFDA) is compelling because it allows adapting an off-the-shelf model to a new domain using only unlabelled data. In this work, we apply existing SFDA techniques to a challenging set of naturally-occurring distribution shifts in bioacoustics, which are very different from the ones commonly studied in computer vision. We find existing methods perform differently relat…

    Submitted 24 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  6. arXiv:2301.08390

    cs.CV cs.LG

    Open-Set Likelihood Maximization for Few-Shot Learning

    Authors: Malik Boudiaf, Etienne Bennequin, Myriam Tami, Antoine Toubhans, Pablo Piantanida, Céline Hudelot, Ismail Ben Ayed

    Abstract: We tackle the Few-Shot Open-Set Recognition (FSOSR) problem, i.e. classifying instances among a set of classes for which we only have a few labeled samples, while simultaneously detecting instances that do not belong to any known class. We explore the popular transductive setting, which leverages the unlabelled query instances at inference. Motivated by the observation that existing transductive m…

    Submitted 19 May, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: CVPR 2023. Supersedes arXiv:2206.09236

  7. arXiv:2211.14126

    cs.CV

    A Strong Baseline for Generalized Few-Shot Semantic Segmentation

    Authors: Sina Hajimiri, Malik Boudiaf, Ismail Ben Ayed, Jose Dolz

    Abstract: This paper introduces a generalized few-shot segmentation framework with a straightforward training process and an easy-to-optimize inference phase. In particular, we propose a simple yet effective model based on the well-known InfoMax principle, where the Mutual Information (MI) between the learned feature representations and their corresponding predictions is maximized. In addition, the terms de…

    Submitted 3 April, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted to CVPR 2023

  8. arXiv:2210.14545

    cs.LG math.OC

    Towards Practical Few-Shot Query Sets: Transductive Minimum Description Length Inference

    Authors: Ségolène Martin, Malik Boudiaf, Emilie Chouzenoux, Jean-Christophe Pesquet, Ismail Ben Ayed

    Abstract: Standard few-shot benchmarks are often built upon simplifying assumptions on the query sets, which may not always hold in practice. In particular, for each task at testing time, the classes effectively present in the unlabeled query set are known a priori, and correspond exactly to the set of classes represented in the labeled support set. We relax these assumptions and extend current benchmarks,…

    Submitted 26 October, 2022; originally announced October 2022.

  9. arXiv:2208.00287

    cs.CV cs.AI cs.LG

    Simplex Clustering via sBeta with Applications to Online Adjustment of Black-Box Predictions

    Authors: Florent Chiaroni, Malik Boudiaf, Amar Mitiche, Ismail Ben Ayed

    Abstract: We explore clustering the softmax predictions of deep neural networks and introduce a novel probabilistic clustering method, referred to as k-sBetas. In the general context of clustering discrete distributions, the existing methods focused on exploring distortion measures tailored to simplex data, such as the KL divergence, as alternatives to the standard Euclidean distance. We provide a general m…

    Submitted 30 June, 2024; v1 submitted 30 July, 2022; originally announced August 2022.

  10. arXiv:2206.09236

    cs.LG

    Model-Agnostic Few-Shot Open-Set Recognition

    Authors: Malik Boudiaf, Etienne Bennequin, Myriam Tami, Celine Hudelot, Antoine Toubhans, Pablo Piantanida, Ismail Ben Ayed

    Abstract: We tackle the Few-Shot Open-Set Recognition (FSOSR) problem, i.e. classifying instances among a set of classes for which we only have few labeled samples, while simultaneously detecting instances that do not belong to any known class. Departing from existing literature, we focus on developing model-agnostic inference methods that can be plugged into any existing model, regardless of its architectu…

    Submitted 18 June, 2022; originally announced June 2022.

    Comments: Under review. Code available at https://github.com/ebennequin/few-shot-open-set

  11. arXiv:2206.00092

    cs.CV

    FHIST: A Benchmark for Few-shot Classification of Histological Images

    Authors: Fereshteh Shakeri, Malik Boudiaf, Sina Mohammadi, Ivaxi Sheth, Mohammad Havaei, Ismail Ben Ayed, Samira Ebrahimi Kahou

    Abstract: Few-shot learning has recently attracted wide interest in image classification, but almost all the current public benchmarks are focused on natural images. The few-shot paradigm is highly relevant in medical-imaging applications due to the scarcity of labeled data, as annotations are expensive and require specialized expertise. However, in medical imaging, few-shot learning research is sparse, lim…

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: Code available at: https://github.com/mboudiaf/Few-shot-histology

  12. arXiv:2204.11181

    cs.LG cs.CV

    Realistic Evaluation of Transductive Few-Shot Learning

    Authors: Olivier Veilleux, Malik Boudiaf, Pablo Piantanida, Ismail Ben Ayed

    Abstract: Transductive inference is widely used in few-shot learning, as it leverages the statistics of the unlabeled query set of a few-shot task, typically yielding substantially better performances than its inductive counterpart. The current few-shot benchmarks use perfectly class-balanced tasks at inference. We argue that such an artificial regularity is unrealistic, as it assumes that the marginal labe…

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2021. Code at https://github.com/oveilleux/Realistic_Transductive_Few_Shot

  13. arXiv:2202.06618

    cs.LG stat.ML

    A Differential Entropy Estimator for Training Neural Networks

    Authors: Georg Pichler, Pierre Colombo, Malik Boudiaf, Günther Koliander, Pablo Piantanida

    Abstract: Mutual Information (MI) has been widely used as a loss regularizer for training neural networks. This has been particularly effective when learning disentangled or compressed representations of high dimensional data. However, differential entropy (DE), another fundamental measure of information, has not found widespread use in neural network training. Although DE offers a potentially wider range of a…

    Submitted 19 June, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: to be presented at ICML 2022 in Baltimore, MD

  14. arXiv:2201.05718

    cs.CV

    Parameter-free Online Test-time Adaptation

    Authors: Malik Boudiaf, Romain Mueller, Ismail Ben Ayed, Luca Bertinetto

    Abstract: Training state-of-the-art vision models has become prohibitively expensive for researchers and practitioners. For the sake of accessibility and resource reuse, it is important to focus on adapting these models to a variety of downstream scenarios. An interesting and practical paradigm is online test-time adaptation, according to which training data is inaccessible, no labelled data from the test d…

    Submitted 4 April, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: CVPR 2022 (oral). Code available at https://github.com/fiveai/LAME

  15. arXiv:2106.12252

    cs.CV

    Mutual-Information Based Few-Shot Classification

    Authors: Malik Boudiaf, Ziko Imtiaz Masud, Jérôme Rony, Jose Dolz, Ismail Ben Ayed, Pablo Piantanida

    Abstract: We introduce Transductive Information Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. We motivate our transductive loss by deriving a formal relation between the classification accuracy and mutual-information maximiz…

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: Journal extension of arXiv:2008.11297. PyTorch implementation of META-DATASET available at https://github.com/mboudiaf/pytorch-meta-dataset

  16. arXiv:2106.09516

    cs.LG cs.CV

    Transductive Few-Shot Learning: Clustering is All You Need?

    Authors: Imtiaz Masud Ziko, Malik Boudiaf, Jose Dolz, Eric Granger, Ismail Ben Ayed

    Abstract: We investigate a general formulation for clustering and transductive few-shot learning, which integrates prototype-based objectives, Laplacian regularization and supervision constraints from a few labeled data points. We propose a concave-convex relaxation of the problem, and derive a computationally efficient block-coordinate bound optimizer, with convergence guarantee. At each iteration, our opti…

    Submitted 16 June, 2021; originally announced June 2021.

  17. Adversarial Robustness via Fisher-Rao Regularization

    Authors: Marine Picot, Francisco Messina, Malik Boudiaf, Fabrice Labeau, Ismail Ben Ayed, Pablo Piantanida

    Abstract: Adversarial robustness has become a topic of growing interest in machine learning since it was observed that neural networks tend to be brittle. We propose an information-geometric formulation of adversarial defense and introduce FIRE, a new Fisher-Rao regularization for the categorical cross-entropy loss, which is based on the geodesic distance between the softmax outputs corresponding to natural…

    Submitted 13 June, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (Early Access)

  18. arXiv:2012.06166

    cs.CV

    Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need?

    Authors: Malik Boudiaf, Hoel Kervadec, Ziko Imtiaz Masud, Pablo Piantanida, Ismail Ben Ayed, Jose Dolz

    Abstract: We show that the way inference is performed in few-shot segmentation tasks has a substantial effect on performances -- an aspect often overlooked in the literature in favor of the meta-learning paradigm. We introduce a transductive inference for a given query image, leveraging the statistics of its unlabeled pixels, by optimizing a new loss containing three complementary terms: i) the cross-entrop…

    Submitted 29 March, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: CVPR 2021. Code available at https://github.com/mboudiaf/RePRI-for-Few-Shot-Segmentation

  19. arXiv:2008.11297

    cs.LG cs.CV stat.ML

    Transductive Information Maximization For Few-Shot Learning

    Authors: Malik Boudiaf, Ziko Imtiaz Masud, Jérôme Rony, José Dolz, Pablo Piantanida, Ismail Ben Ayed

    Abstract: We introduce Transductive Information Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. Furthermore, we propose a new alternating-direction solver for our mutual-information loss, which substantially speeds up transduc…

    Submitted 23 October, 2020; v1 submitted 25 August, 2020; originally announced August 2020.

    Comments: NeurIPS 2020. Code available at https://github.com/mboudiaf/TIM

  20. arXiv:2003.08983

    cs.LG cs.CV stat.ML

    A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses

    Authors: Malik Boudiaf, Jérôme Rony, Imtiaz Masud Ziko, Eric Granger, Marco Pedersoli, Pablo Piantanida, Ismail Ben Ayed

    Abstract: Recently, substantial research efforts in Deep Metric Learning (DML) focused on designing complex pairwise-distance losses, which require convoluted schemes to ease optimization, such as sample mining or pair weighting. The standard cross-entropy loss for classification has been largely overlooked in DML. On the surface, the cross-entropy may seem unrelated and irrelevant to metric learning as it…

    Submitted 26 November, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: ECCV 2020 (Spotlight) - Code available at: https://github.com/jeromerony/dml_cross_entropy