Skip to main content

Showing 1–4 of 4 results for author: Douze, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2002.00937  [pdf, other

    stat.ML cs.CR cs.CV cs.LG

    Radioactive data: tracing through training

    Authors: Alexandre Sablayrolles, Matthijs Douze, Cordelia Schmid, Hervé Jégou

    Abstract: We want to detect whether a particular image dataset has been used to train a model. We propose a new technique, \emph{radioactive data}, that makes imperceptible changes to this dataset such that any model trained on it will bear an identifiable mark. The mark is robust to strong variations such as different architectures or optimization methods. Given a trained model, our technique detects the u… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

  2. arXiv:1908.11229  [pdf, other

    stat.ML cs.CR cs.LG

    White-box vs Black-box: Bayes Optimal Strategies for Membership Inference

    Authors: Alexandre Sablayrolles, Matthijs Douze, Yann Ollivier, Cordelia Schmid, Hervé Jégou

    Abstract: Membership inference determines, given a sample and trained parameters of a machine learning model, whether the sample was part of the training set. In this paper, we derive the optimal strategy for membership inference with a few assumptions on the distribution of the parameters. We show that optimal attacks only depend on the loss function, and thus black-box attacks are as good as white-box att… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

  3. arXiv:1806.03198  [pdf, other

    stat.ML cs.LG

    Spreading vectors for similarity search

    Authors: Alexandre Sablayrolles, Matthijs Douze, Cordelia Schmid, Hervé Jégou

    Abstract: Discretizing multi-dimensional data distributions is a fundamental step of modern indexing methods. State-of-the-art techniques learn parameters of quantizers on training data for optimal performance, thus adapting quantizers to the data. In this work, we propose to reverse this paradigm and adapt the data to the quantizer: we train a neural net which last layer forms a fixed parameter-free quanti… ▽ More

    Submitted 30 August, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: Published at ICLR 2019

  4. arXiv:1706.02332  [pdf, other

    cs.CV cs.LG stat.ML

    Low-shot learning with large-scale diffusion

    Authors: Matthijs Douze, Arthur Szlam, Bharath Hariharan, Hervé Jégou

    Abstract: This paper considers the problem of inferring image labels from images when only a few annotated examples are available at training time. This setup is often referred to as low-shot learning, where a standard approach is to re-train the last few layers of a convolutional neural network learned on separate classes for which training examples are abundant. We consider a semi-supervised setting based… ▽ More

    Submitted 15 June, 2018; v1 submitted 7 June, 2017; originally announced June 2017.